Chris Pollett > Students > Sodhi
[Bio] [Blog] [CS 298 Project Presentation - PDF] |
CS297 ProposalAI for Classic Video Games Using Reinforcement LearningShivika Sodhi (shivika.sodhi@sjsu.edu) Advisor: Dr. Chris Pollett Description:
Reinforcement learning is essential for training an agent to make smart decisions under uncertainty and to take small actions in order to achieve a higher overarching goal. In this project, we combined reinforcement learning and deep learning techniques to train an agent to play the game, Archon. The challenge is that the agent only sees the pixels and the rewards, similar to a human player. Using just this information, it is able to successfully play the game at a human or sometimes super-human level.. The input (training set) will be screen shots of the game (every one-tenth of a second) while it's being played by the computer. These screen shots shall be discarded after the processing is done. Based on this model, we are planning to make the computer make smarter critical decisions in order to play the game well. Schedule:
Deliverables: The full project will be done when CS298 is completed. The following will be done by the end of CS297: 1. Design a program that can take a screen capture of the region of the screen periodically every (5) seconds 2. Figure out a way to control the running game from a computer program (fakes as if there's a joystick being used by a person) using neural networks. Existing python api (simulate keystrokes and mouse movements). 3. Design simple multi layer AI using THEANO. 4. Design a program that labels in real time, what the current game context/position is. For example, splash screen, player selection, player turn and combat mode. 5. CS 297 Report. References: Playing Atari with Deep Reinforcement Learning: https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf |