A new algorithm is introduced, Stochastic MuZero, that learns a stochastic model incorporating afterstates, and uses this model to perform a stochastic tree search and maintain the superhuman performance of standard MuZero in the game of Go.
Authors
Julian Schrittwieser
7 papers
David Silver
21 papers
Ioannis Antonoglou
8 papers
Sherjil Ozair
4 papers
Thomas K. Hubert
1 papers
Views
Field of Study
Computer Science
Venue Information
Name
International Conference on Learning Representations