Chris Pollett > Students >
Luo

    ( Print View)

    [Bio]

    [Blog]

    [CS 297 Proposal]

    [Deliverable 1]

    [Deliverable 2]

    [Deliverable 3]

    [Deliverable 4]

    [AI Methods for Dou Di Zhu-PDF]

    [Q-learning-PDF]

    [CS297 Report-PDF]

    [CS 298 Proposal]

    [CS298_Report-PDF]

    [Defense Slides-PDF]

Dou Di Zhu with Q-learning

I use the python3 to create the Dou Di Zhu with Q-Learning

In this part, I prepare three AI agents with separate Q-learning matrices

These are my training and testing results:

1000 training times, 200 testing time result:
  • Player1 Winning Percent: 65.0%
  • Player1 Peasant Winning Percent: 68.32%
  • Player1 Landlord Winning Percent: 51.28%
  • Player2 Winning Percent: 52.0%
  • Player2 Peasant Winning Percent: 63.64%
  • Player2 Landlord Winning Percent: 34.18%
  • Player3 Winning Percent: 47.5%
  • Player3 Peasant Winning Percent: 60.17%
  • Player3 Landlord Winning Percent: 29.27%


10000 training times, 2000 testing time result:
  • Player1 Winning Percent: 64.35%
  • Player1 Peasant Winning Percent: 65.97%
  • Player1 Landlord Winning Percent: 59.11%
  • Player2 Winning Percent: 53.65%
  • Player2 Peasant Winning Percent: 60.15%
  • Player2 Landlord Winning Percent: 40.15%
  • Player3 Winning Percent: 42.05%
  • Player3 Peasant Winning Percent: 51.87%
  • Player3 Landlord Winning Percent: 29.5%

This simple Q-learning algorithm works not very well on Dou Di Zhu game, it only can teach these agents playing Dou Di Zhu like newbie players

It's based on Python 3. Just run the 'main.py' to run this project.

Deliverable 3.zip (You need to install the random library in your Python 3)