Special Topics Track
Reinforcement Learning in Games
Brought to you by: Ming and Henry
Today’s Plan
What is Reinforcement Learning?
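(ChatGPT is not made with reinforcement learning alone: it is a pretrained language model that was later fine-tuned with reinforcement learning from human feedback)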
Simple game setup:
[Figure: game grid with three shapes worth +5, +1, and -1 points]
The agent repeats the following steps: Observation, Prediction, Action, Reward.
After the game ends, the agent evaluates its decisions.
Observation: The agent notes the positions of the three shapes.
Prediction: The agent notes its own possible actions.
Action: The agent moves down.
[Figure: game grid after the move; the +1 shape is gone and the +5 and -1 shapes remain]
Reward: The agent earns 1 point!
If the game ends here, the agent now remembers that moving towards the circle earned it a point!
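The observe, predict, act, reward cycle above can be sketched in a few lines. This is only an illustration under stated assumptions: the board size, the start square, and the positions of the +5 and -1 shapes are made up, and the "memory" is a plain running average rather than any particular algorithm from the talk.

```python
import random

# Hypothetical board: 4 rows by 3 columns, positions are (row, col).
# Only "the circle is worth +1 and sits below the agent" comes from the slides;
# every other placement here is an assumption for illustration.
REWARDS = {(2, 1): 1, (0, 2): 5, (3, 0): -1}
START = (1, 1)
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}

def step(position, action):
    """Apply a move, staying on the board, and return (new_position, reward)."""
    row = min(max(position[0] + MOVES[action][0], 0), 3)
    col = min(max(position[1] + MOVES[action][1], 0), 2)
    new_position = (row, col)
    return new_position, REWARDS.get(new_position, 0)

def play_one_move_games(episodes=200, seed=0):
    """Play many one-move games and remember the average reward per action."""
    rng = random.Random(seed)
    totals = {action: 0.0 for action in MOVES}
    counts = {action: 0 for action in MOVES}
    for _ in range(episodes):
        observation = START                    # Observation: where things are
        candidates = list(MOVES)               # Prediction: possible actions
        action = rng.choice(candidates)        # Action: here, chosen at random
        _, reward = step(observation, action)  # Reward: points earned
        totals[action] += reward               # After the game: evaluate the decision
        counts[action] += 1
    return {a: totals[a] / counts[a] if counts[a] else 0.0 for a in MOVES}

memory = play_one_move_games()
best_action = max(memory, key=memory.get)
print(memory, best_action)
```

With this layout, the agent ends up remembering that moving down (towards the circle) is the only action that earned a point.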
More complex game setup:
[Figure: game grid for the more complex setup, with shapes worth +5, +1, and -1 points]
Prediction: Assume the agent knows the point values:
How should the agent choose?
Hyperparameters
Hyperparameters - Gamma
[Figure: game grid with shapes worth +5, +1, and -1 points]
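A rough sketch of what gamma does, assuming it is the usual discount factor (a number between 0 and 1 that shrinks rewards the further away they are). The step counts below are hypothetical: the +1 shape is taken to be one move away and the +5 shape four moves away.

```python
def discounted_value(reward, steps_away, gamma):
    """Value right now of a reward collected after `steps_away` moves.

    The reward from the very next move is not discounted; each extra
    move multiplies the value by gamma once more.
    """
    return reward * gamma ** (steps_away - 1)

def preferred_target(gamma, near=(1, 1), far=(5, 4)):
    """Compare a nearby small reward with a distant large one.

    `near` and `far` are (reward, steps_away) pairs; the defaults are
    assumptions for illustration, not values from the slides.
    """
    near_value = discounted_value(near[0], near[1], gamma)
    far_value = discounted_value(far[0], far[1], gamma)
    return "far (+5)" if far_value > near_value else "near (+1)"

for gamma in (0.5, 0.9):
    print(gamma, preferred_target(gamma))
```

Under these assumptions a low gamma makes the agent short-sighted (it takes the nearby +1), while a high gamma makes it patient enough to walk to the +5.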
Hyperparameters - Beta
[Figure: game grid with shapes worth +5, +1, and -1 points]
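A rough sketch of what beta does, assuming it means the strength of entropy regularization (this is how the `beta` setting is described for PPO in Unity ML-Agents; the slides themselves do not define it). The action values and the two policies below are hypothetical.

```python
import math

def entropy(probs):
    """Shannon entropy of an action distribution: higher means more random."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def regularized_objective(probs, action_values, beta):
    """Expected reward plus an entropy bonus weighted by beta."""
    expected_reward = sum(p * v for p, v in zip(probs, action_values))
    return expected_reward + beta * entropy(probs)

# Hypothetical setup: four moves, only the first one earns a point.
action_values = [1.0, 0.0, 0.0, 0.0]
confident = [0.97, 0.01, 0.01, 0.01]    # almost always picks the first move
exploratory = [0.25, 0.25, 0.25, 0.25]  # picks every move equally often

for beta in (0.0, 1.0):
    print(
        beta,
        regularized_objective(confident, action_values, beta),
        regularized_objective(exploratory, action_values, beta),
    )
```

With beta at 0 the confident policy scores higher; with a large beta the bonus for randomness dominates and the exploratory policy scores higher. In practice beta is usually small, so it nudges the agent to keep trying other moves without overriding the reward.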
How does ML-Agents come into play?
How can we make it better?
[Figure: plot of reward for the two sets of runs]
2 sets of runs, exact same settings, different random seeds!
Source: https://arxiv.org/pdf/1709.06560.pdf
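One practical response to the seed problem is to repeat every experiment with several seeds and report the average and the spread instead of a single run. A minimal sketch, where `train_once` is a hypothetical stand-in for a real training run (it just accumulates seeded noise):

```python
import random
import statistics

def train_once(seed, episodes=50):
    """Hypothetical stand-in for one training run; returns a final reward.

    A real run would train an agent. Here the seed only drives random noise,
    which is enough to show that identical settings can give different results.
    """
    rng = random.Random(seed)
    reward = 0.0
    for _ in range(episodes):
        reward += rng.uniform(-1.0, 1.5)
    return reward

def summarize(seeds):
    """Run the same settings once per seed and report mean and spread."""
    finals = [train_once(seed) for seed in seeds]
    return statistics.mean(finals), statistics.stdev(finals), finals

mean_reward, spread, finals = summarize(range(5))
print(f"mean={mean_reward:.2f} stdev={spread:.2f} runs={len(finals)}")
```

Comparing two settings then means comparing their means relative to the spread, not comparing one lucky run against one unlucky run.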
More Resources
Thanks for coming!