Skip to content

A 2048 api for training supervised learning (imitation learning) or reinforcement learning agents

License

Notifications You must be signed in to change notification settings

mingsjtu/2048-api

Repository files navigation

2048-api

A 2048 game api for training supervised learning (imitation learning) follow ExpectiMax agent and create your own CNN model

Code structure

  • game2048/: the main package.
  • explore.ipynb: introduce how to use the Agent, Display and Game.
  • static/: frontend assets (based on Vue.js) for web app.
  • webapp.py: run the web app (backend) demo.
  • evaluate.py: evaluate your self-defined agent. test time = 50
  • online_train.py: get your own agent's weight.The structure of the model can be found
  • CNN_new_141.zip: get your own agent's weight.The structure of the model can be found

Requirements

  • code only tested on linux system (ubuntu 16.04)
  • Python 3 (Anaconda 3.6.3 specifically) with numpy and flask

for train

python online_train.py
  • you will get your own model taught by ExpectiMax agent

To define your own agents

from game2048.agents import Agent

class YourOwnAgent(Agent):

    def step(self):
        '''To define the agent's 1-step behavior given the `game`.
        You can find more instance in [`agents.py`](game2048/agents.py).
        
        :return direction: 0: left, 1: down, 2: right, 3: up
        '''
        direction = some_function(self.game)
        return direction

To compile the pre-defined ExpectiMax agent

cd game2048/expectimax
bash configure
make

To run the web app

python webapp.py

demo

LICENSE

The code is under Apache-2.0 License.

For EE369 students from SJTU only

Please read here.

About

A 2048 api for training supervised learning (imitation learning) or reinforcement learning agents

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 89.6%
  • JavaScript 8.4%
  • Python 0.6%
  • C++ 0.5%
  • Shell 0.5%
  • C 0.2%
  • Other 0.2%