A multi-armed bandit problem - or, simply, a bandit problem - is a sequential allocation problem defined by a set of actions. At each time step, a unit resource is allocated to an action and some observable payoff is obtained. The goal is to maximize the total payoff obtained in a sequence of allocations. The name bandit refers to the colloquial term for a slot machine (a "one-armed bandit" in American slang). In a casino, a sequential allocation problem is obtained when the player is facing many slot machines at once (a "multi-armed bandit") and must repeatedly choose where to insert the next coin.

Multi-armed bandit problems are the most basic examples of sequential decision problems with an exploration-exploitation trade-off: the balance between staying with the option that gave the highest payoffs in the past and exploring new options that might give higher payoffs in the future. Although the study of bandit problems dates back to the 1930s, exploration-exploitation trade-offs arise in several modern applications, such as ad placement, website optimization, and packet routing. Mathematically, a multi-armed bandit is defined by the payoff process associated with each option.

This book focuses on two extreme cases in which the analysis of regret is particularly simple and elegant: independent and identically distributed payoffs and adversarial payoffs. Besides the basic setting of finitely many actions, it also analyzes some of the most important variants and extensions, such as the contextual bandit model. This monograph is an ideal reference for students and researchers with an interest in bandit problems.
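Here "regret" is the gap between the cumulative payoff of the best fixed action in hindsight and the payoff the player actually collects. As a minimal sketch of the exploration-exploitation trade-off in the i.i.d. setting, the Python snippet below plays an epsilon-greedy strategy on Bernoulli arms. Epsilon-greedy is a standard baseline chosen here only for illustration, not the book's central algorithm, and the arm means, horizon, and exploration rate are invented for this example.

import random

def epsilon_greedy_bandit(true_means, horizon=10_000, epsilon=0.1, seed=0):
    """Play a stochastic (i.i.d.) bandit with an epsilon-greedy strategy.

    true_means -- Bernoulli success probability of each arm (one entry per arm).
    Returns the total payoff collected and the regret against the best arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # empirical mean payoff per arm
    total_payoff = 0.0

    for _ in range(horizon):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)                           # explore: random arm
        else:
            arm = max(range(n_arms), key=lambda a: estimates[a])  # exploit: best estimate
        payoff = 1.0 if rng.random() < true_means[arm] else 0.0   # draw the i.i.d. payoff
        counts[arm] += 1
        estimates[arm] += (payoff - estimates[arm]) / counts[arm] # running-mean update
        total_payoff += payoff

    # (pseudo-)regret: expected payoff of always playing the best arm, minus what we got
    regret = horizon * max(true_means) - total_payoff
    return total_payoff, regret

payoff, regret = epsilon_greedy_bandit([0.2, 0.5, 0.55])
print(f"total payoff: {payoff:.0f}, regret vs. best arm: {regret:.0f}")

Shrinking epsilon reduces wasted pulls on known-bad arms but risks locking onto a suboptimal arm early; growing it does the reverse. Quantifying this trade-off is exactly what the regret analyses in the book make precise.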
Contents: 1: Introduction. 2: Stochastic bandits: fundamental results. 3: Adversarial bandits: fundamental results. 4: Contextual bandits. 5: Linear bandits. 6: Nonlinear bandits. 7: Variants. Acknowledgements. References.