Row | Title | Filenames
---|---|---
2 | Fantasy Ice Hockey Lineup Decisions with Q-Learning and SARSA | 001.pdf | ||||||||||||||||||||||||
3 | Optimal Policy for Uber/Lyft drivers in the Bay Area | 002.mp4 | ||||||||||||||||||||||||
4 | Twenty-Twenty Learning: A 2048 solver powered by Deep Q-Networks | 003.mp4 | ||||||||||||||||||||||||
5 | Drag-based Reconfiguration of Spacecraft Formations Under Uncertainty | 004.mp4 | ||||||||||||||||||||||||
6 | Value Iteration applied to Orbital Debris Removal with a Satellite | 005.mp4 | ||||||||||||||||||||||||
7 | Mortal (Q)ombat: Training an AI agent with Reinforcement Learning to Play a Brawler Game | 006.pdf | ||||||||||||||||||||||||
8 | Flipping the Bit: Explorations in Learning the Ising Model | 007.mp4 | ||||||||||||||||||||||||
9 | Optimal Strategy Generation for Pandemic Game Using Monte Carlo Tree Search | 008.pdf | ||||||||||||||||||||||||
10 | Tropical Reforestation | 009.mp4 | ||||||||||||||||||||||||
11 | Queue RL | 012.pdf | ||||||||||||||||||||||||
12 | ClutchRL: Using Q-Learning To Find The Optimal Policy To Win A Close Basketball Game | 013.mp4 | ||||||||||||||||||||||||
13 | Decision Making in Quadcopter Navigation with The Effects of Wind | 015.pdf | ||||||||||||||||||||||||
14 | Reinforcement Learning Approaches for Sepsis Treatment | 019.mp4 | ||||||||||||||||||||||||
15 | Localization and path finding of a known environment | 020.pdf | ||||||||||||||||||||||||
16 | Particle Filter For Autonomous Vehicle Handover | 021.pdf | ||||||||||||||||||||||||
17 | Q-QWOP: Using Q-Learning for QWOP Policy Development | 024.pdf | ||||||||||||||||||||||||
18 | Wordlebot: Improving Wordle Performance Using Forward Search | 027.pdf | ||||||||||||||||||||||||
19 | Optimal Load Scheduling for Smart Homes | 028.pdf | ||||||||||||||||||||||||
20 | Trajectory Optimization for Autonomous Parking Lot Cleaning using Q-Learning | 029.pdf | ||||||||||||||||||||||||
21 | Comparison of Online Planning and Reinforcement Learning for Oil Spill Containment | 030.mp4 | ||||||||||||||||||||||||
22 | Estimating an Optimal Solution to Backgammon | 033.pdf | ||||||||||||||||||||||||
23 | I ♥ RL: Using Q-Learning to Play Hearts | 034.pdf | ||||||||||||||||||||||||
24 | AITA Judge Twitch Streamer | 035.mp4 | ||||||||||||||||||||||||
25 | Traffic Light Optimization Under Uncertainty | 036.pdf | ||||||||||||||||||||||||
26 | Fuel Efficient Autonomous Driving | 037.pdf | ||||||||||||||||||||||||
27 | Theseus and the Cyclops: Adventures in Navigation Using Monocular Depth Estimation | 038.pdf | ||||||||||||||||||||||||
28 | Maximum Entropy Reinforcement Learning for Prompt Tuning | 039.pdf | ||||||||||||||||||||||||
29 | Applying Reinforcement Learning and Online Planning to the Game "Regenwormen" | 040.pdf | ||||||||||||||||||||||||
30 | Blackjack with Card Counting | 041.pdf | ||||||||||||||||||||||||
31 | Search and ResQ | 042.mp4 | ||||||||||||||||||||||||
32 | Optimal Operation of Renewable Energy Plus Storage Systems Using Reinforcement Learning | 043.pdf | ||||||||||||||||||||||||
33 | Landing a Plane on Final Approach with Q-Learning | 044.mp4 | ||||||||||||||||||||||||
34 | Blackjack: Are the "Basic Strategies" Sufficient? | 045.pdf | ||||||||||||||||||||||||
35 | Driving with Large Language Models | 046.mp4 | ||||||||||||||||||||||||
36 | Belief Updating for Improved Navigation in Wind | 047.pdf | ||||||||||||||||||||||||
37 | Taxi Route Recommendations Using Reinforcement Learning | 048.pdf | ||||||||||||||||||||||||
38 | Using Reinforcement Learning for Life and Death problem in Go | 050.mp4 | ||||||||||||||||||||||||
39 | Travelling with (Un)certainty | 051.pdf | ||||||||||||||||||||||||
40 | Optimal Drone Navigation with Stochastic Dynamics | 052.pdf | ||||||||||||||||||||||||
41 | Reinforcement Learning for Autonomous Navigation of Subterranean Environments | 054.pdf | ||||||||||||||||||||||||
42 | Measuring potential effects of an intelligent tutoring system | 055.pdf | ||||||||||||||||||||||||
43 | X's and O's: Optimization of Tic-Tac-Toe | 056.pdf | ||||||||||||||||||||||||
44 | Autonomous Driving in a Roundabout with Rule-Breaking Agents | 057.pdf | ||||||||||||||||||||||||
45 | Playing blackjack using reinforcement learning | 058.pdf | ||||||||||||||||||||||||
46 | Playing Mancala via Reinforcement Learning | 059.pdf | ||||||||||||||||||||||||
47 | Fantasy Football Team Draft Sequential Decision Process | 060.pdf | ||||||||||||||||||||||||
48 | Optimizing Store Inventory Management with Q-learning | 061.mp4 | ||||||||||||||||||||||||
49 | Deep Double Q Network for Lunar Lander | 062.mp4 | ||||||||||||||||||||||||
50 | Optimal Online Ad Allocation Under Uncertainty | 063.pdf | ||||||||||||||||||||||||
51 | Reinforcement Learning for Chess Variants | 065.mp4 | ||||||||||||||||||||||||
52 | A Chess Engine on Ice: Developing an Automated Curling Skip Capable of Decision Making under Uncertainty | 067.pdf | ||||||||||||||||||||||||
53 | Online Planning with Terrain Friction Estimation for Safe Rover Navigation | 068.pdf | ||||||||||||||||||||||||
54 | Simple Reinforcement Learning for Space Mining Bots | 069.pdf | ||||||||||||||||||||||||
55 | Preventing the Spread of Wind-Driven Wildfires with Partially Observable Markov Decision Processes | 070.pdf | ||||||||||||||||||||||||
56 | Making a (Good) Pokemon 1v1 Battle Agent | 072.mp4 | ||||||||||||||||||||||||
57 | Learning to Play Backgammon Using Reinforcement Learning | 073.pdf | ||||||||||||||||||||||||
58 | Self-Driving School Bus for Stanford Campus | 074.pdf | ||||||||||||||||||||||||
59 | Deep Q-learning Network on Atari Game | 075.pdf | ||||||||||||||||||||||||
60 | FishBot: Decision Making in Canadian Fish | 076.pdf | ||||||||||||||||||||||||
61 | Modeling Deceit in Coup | 079.pdf | ||||||||||||||||||||||||
62 | Buying a House without Going Broke | 081.mp4 | ||||||||||||||||||||||||
63 | Taxi Route Optimization using Reinforcement Learning | 082.pdf | ||||||||||||||||||||||||
64 | Minesweeper: A Reinforcement Learning Approach | 084.pdf | ||||||||||||||||||||||||
65 | Reinforcement Learning for Robotic Arm Reach Problem | 085.pdf | ||||||||||||||||||||||||
66 | Optimal Policy for Ground-to-Satellite Communication using Reinforcement Learning | 087.pdf | ||||||||||||||||||||||||
67 | Feel-Good Othello-Bot: Othello-Player Agent to Bring Satisfying Win for Human Player | 088.pdf | ||||||||||||||||||||||||
68 | Value Iteration and Contraceptive Method Choice: Applying Algorithmic Decision Making to Women’s Health Practices | 089.pdf | ||||||||||||||||||||||||
69 | Settlers of Catan Settlement Placement Optimization with Deep Q-Learning | 090.pdf | ||||||||||||||||||||||||
70 | A Markov-Based Approach to Evaluate the Optimal Position of Zeposia in the Treatment of Ulcerative Colitis | 091.pdf | ||||||||||||||||||||||||
71 | Multi-Rover Exploration using Dec-POMDPs | 092.pdf | ||||||||||||||||||||||||
72 | Optimizing Green Infrastructure Siting for Urban Flood Mitigation | 098.pdf | ||||||||||||||||||||||||
73 | Optimal Elevator Algorithm | 099.pdf | ||||||||||||||||||||||||
74 | Transmission Expansion for Offshore Wind Farms Under Uncertainty | 101.pdf | ||||||||||||||||||||||||
75 | Card-Counting Blackjack Agent | 102.mp4 | ||||||||||||||||||||||||
76 | Tic-Tac-Toe: An Unbeatable Foe | 105.pdf | ||||||||||||||||||||||||
77 | How do we walk? Controlling joint motor torque to achieve balanced movement | 106.pdf | ||||||||||||||||||||||||
78 | How to not lose your starship: Strategy Optimization for Corellian Spike | 108.pdf | ||||||||||||||||||||||||
79 | Geometry Dash AI | 109.mp4 | ||||||||||||||||||||||||
80 | Planar Manipulation of an Object with Unknown and Changing Center of Mass | 110.pdf | ||||||||||||||||||||||||
81 | Ping Pong Prodigy | 113.pdf | ||||||||||||||||||||||||
82 | Solving Half Cheetah Problem Using Offline RL | 114.mp4 | ||||||||||||||||||||||||
83 | Solving Wordle using Information Theory, Minimax Algorithms, and Q-Learning | 116.pdf | ||||||||||||||||||||||||
84 | Applying Q-Learning to Uno | 117.pdf | ||||||||||||||||||||||||
85 | Rocket Self-Landing using Proximal Policy Optimization | 118.mp4 | ||||||||||||||||||||||||
86 | Determining an optimal screening policy for colorectal cancer | 119.mp4 | ||||||||||||||||||||||||
87 | Code-Lint: Code Generation with Reinforcement Learning incorporating Linter Feedback | 120.pdf | ||||||||||||||||||||||||
88 | Safe Navigation: Training Autonomous Vehicles using Deep Reinforcement Learning in CARLA | 124.pdf | ||||||||||||||||||||||||
89 | Implementing Q-learning to Obtain Control Policy for Residential Battery System | 125.pdf | ||||||||||||||||||||||||
90 | Cubo: Your Friendly Neighborhood Rolling Cube | 126.pdf | ||||||||||||||||||||||||
91 | Deep Reinforcement Learning for In-Flight Calibration of a Lunar Lander | 127.pdf | ||||||||||||||||||||||||
92 | Learning and Improving the Intelligent Driver Model with Reinforcement Learning | 129.mp4 | ||||||||||||||||||||||||
93 | Safe Lane Merging for Autonomous Cars on the Highway through Markov Decision Process | 130.pdf | ||||||||||||||||||||||||
94 | Solving Wordle Using Monte Carlo Tree Search | 132.pdf | ||||||||||||||||||||||||
95 | Prepare to AI: Model-Free Reinforcement Learning for Dark Souls 1 | 134.pdf | ||||||||||||||||||||||||
96 | Dynamic Efficient Sampling Policy Learning - An Investigation of DRL-based Atari Game Playing | 137.pdf | ||||||||||||||||||||||||
97 | Cleaning Up with Q-Learning: Optimizing Roomba Navigation in Unknown Environments | 139.mp4 | ||||||||||||||||||||||||
98 | A Novel Reward Shaping Function for Single-Player Mahjong | 140.pdf | ||||||||||||||||||||||||
99 | Evaluating the Disposition Effect using Reinforcement Learning | 141.pdf | ||||||||||||||||||||||||
100 | Playing Risk with Reinforcement Learning | 142.mp4 |