Browse
Publications
Preprints
About
About UCL Open: Env.
Aims and Scope
Editorial Board
Indexing
APCs
How to cite
Publishing policies
Editorial policy
Peer review policy
Equality, Diversity & Inclusion
About UCL Press
Contact us
For authors
Information for authors
How it works
Benefits of publishing with us
Submit
How to submit
Preparing your manuscript
Article types
Open Data
ORCID
APCs
Contributor agreement
For reviewers
Information for reviewers
Review process
How to peer review
Peer review policy
My ScienceOpen
Sign in
Register
Dashboard
Search
Browse
Publications
Preprints
About
About UCL Open: Env.
Aims and Scope
Editorial Board
Indexing
APCs
How to cite
Publishing policies
Editorial policy
Peer review policy
Equality, Diversity & Inclusion
About UCL Press
Contact us
For authors
Information for authors
How it works
Benefits of publishing with us
Submit
How to submit
Preparing your manuscript
Article types
Open Data
ORCID
APCs
Contributor agreement
For reviewers
Information for reviewers
Review process
How to peer review
Peer review policy
My ScienceOpen
Sign in
Register
Dashboard
Search
59
views
0
references
Top references
cited by
224
Cite as...
0 reviews
Review
0
comments
Comment
0
recommends
+1
Recommend
0
collections
Add to
0
shares
Share
Twitter
Sina Weibo
Facebook
Email
3,885
similar
All similar
Record
: found
Abstract
: not found
Book Chapter
: not found
Machine Learning: ECML 2006
Bandit Based Monte-Carlo Planning
other
Author(s):
Levente Kocsis
,
Csaba Szepesvári
Publication date
(Print):
2006
Publisher:
Springer Berlin Heidelberg
Read this book at
Publisher
Further versions
open (via free pdf)
Powered by
Buy book
Review
Review book
Invite someone to review
Bookmark
Cite as...
There is no author summary for this book yet. Authors can add summaries to their books on ScienceOpen to make them more accessible to a non-specialist audience.
Related collections
Value-based Healthcare
Author and book information
Book Chapter
Publication date (Print):
2006
Pages
: 282-293
DOI:
10.1007/11871842_29
SO-VID:
8aa32846-812d-4bde-9438-78ca92cd684b
History
Data availability:
Comments
Comment on this book
Sign in to comment
Book chapters
pp. 270
Fast Variational Inference for Gaussian Process Models Through KL-Correction
pp. 679
B-Matching for Spectral Clustering
pp. 801
Dynamic Integration with Random Forests
pp. 282
Bandit Based Monte-Carlo Planning
pp. 318
Efficient Convolution Kernels for Dependency and Constituent Syntactic Trees
pp. 533
To Select or To Weigh: A Comparative Study of Model Selection and Model Weighing for SPODE Ensembles
pp. 646
Reinforcement Learning for MDPs with Constraints
pp. 654
Efficient Non-linear Control Through Neuroevolution
Similar content
3,885
Distributed User Association in Energy Harvesting Dense Small Cell Networks: A Mean-Field Multi-Armed Bandit Approach
Authors:
Setareh Maghsudi
,
Ekram Hossain
Multi-point Feedback of Bandit Convex Optimization with Hard Constraints
Authors:
Yasunari Hikima
Continuous Multi-Armed Bandits and Multiparameter Processes
Authors:
Avi Mandelbaum
See all similar
Cited by
218
A Survey of Monte Carlo Tree Search Methods
Authors:
Cameron B. Browne
,
Edward Powley
,
Daniel Whitehouse
…
Mastering Atari, Go, chess and shogi by planning with a learned model
Authors:
Julian Schrittwieser
,
Ioannis Antonoglou
,
Thomas Hubert
…
PROGRESSIVE STRATEGIES FOR MONTE-CARLO TREE SEARCH
Authors:
JOS UITERWIJK
,
Bruno Bouzy
,
H. Jaap van den Herik
…
See all cited by