Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems by Sebastien Bubeck, Cesa-Bianchi Nicolo | The StoryGraph

Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems by Sebastien Bubeck, Cesa-Bianchi Nicolo

Browse Similar Books

View Question Bank

Read With Others

Start a Readalong

Start a Buddy Read

Book Information

Report Missing/Incorrect Information

Foundations and Trends(r) in Machine Learning

Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems

Sebastien Bubeck, Cesa-Bianchi Nicolo

138 pages • paperback • 1 edition

nonfiction computer science mathematics science informative medium-paced

Powered by AI (Beta)

Description

A multi-armed bandit problem - or, simply, a bandit problem - is a sequential allocation problem defined by a set of actions. At each time step, a unit resource is allocated to an action and some observable payoff is obtained. The goal is to maxim...

Community Reviews

Content Warnings

add to favorites

mark as owned

More Options

add to favorites

Browse Similar Books

View Question Bank

Read With Others

Start a Readalong

Start a Buddy Read

Book Information

Report Missing/Incorrect Information

Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems by Sebastien Bubeck, Cesa-Bianchi Nicolo

Foundations and Trends(r) in Machine Learning

Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems

Sebastien Bubeck, Cesa-Bianchi Nicolo

138 pages • paperback

nonfiction computer science mathematics science informative medium-paced

add to favorites

mark as owned

More Options

add to favorites

Browse Similar Books

View Question Bank

Read With Others

Start a Readalong

Start a Buddy Read

Book Information

Report Missing/Incorrect Information

Powered by AI (Beta)

Description

A multi-armed bandit problem - or, simply, a bandit problem - is a sequential allocation problem defined by a set of actions. At each time step, a unit resource is allocated to an action and some observable payoff is obtained. The goal is to maxim...

Community Reviews

Content Warnings