CSC 384 MCTS for Othello
In this slide, we are going to walkthrough how MCTS works in Othello from a high level perspective. To simply demonstrate the algorithm, we have reduced the number of option (possible legal move). For instance, there should be 4 children instead of 2 in Iter1.
This slide is based on examples from this link (https://www.analyticsvidhya.com/blog/2019/01/monte-carlo-tree-search-introduction-algorithm-deepmind-alphago/). However, the original one does contain some errors in both diagram and description.
Iter1
Iter1 Selection
UCB_Sn = reward_Sn / total_Sn + bias*sqrt(ln(total_Sn_parent) / total_Sn)
Iter1 Expansion & Simulation
Iter1 Backpropagation
Iter2 Selection
Iter2 Expansion & Simulation
Iter2 Backpropagation
Iter3 Selection
Iter3 Expansion
*: We reduced it to 2 for simplicity here.
Iter3 Expansion (Selection)
Iter3 Simulation
Iter3 Backpropagation
Iter4 Selection