Projects
Pong AI — Deep Q Reinforcement Learning
In this project, I replicated an influential reinforcement learning paper published in 2015: Human-level control through deep reinforcement learning. I self-studied CSC421 Deep Learning which had a publically available syllabus and whose lectures were based on accessible research papers, and after completing all the homework, I wanted to do a “capstone.”
The final results are shown below (the neural network is the green player). The most interesting thing, in my opinion, is how the neural net learned to deflect the ball in such a manner that the deterministic bot could not hit it back. This is in contrast to the random AI which, even when it hits the ball back, the deterministic bot is able to respond.
Randomly Sampled Actions | Trained Deep Q Neural Network |
---|---|