Cart Pole experiment

This experiment uses TensorFlow.js to solve the cart-pole problem with policy gradient reinforcement learning.

  • The source code is available on GitHub
  • Try the experiment in your browser

Algorithms

You can choose between several algorithms:

  • REINFORCE
  • REINFORCE with baseline
  • Advantage Actor-Critic (A2C)
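
The three algorithms differ mainly in how each action's log-probability is weighted in the policy loss. The following is a minimal sketch of those weights in plain TypeScript, not the experiment's actual code: the function names are hypothetical, `values` stands for the value model's estimate of each visited state, and the end of the reward array is assumed to be a terminal state with value 0.

```typescript
// Discounted return G_t = r_t + gamma * G_{t+1}, computed backwards over one episode.
function discountedReturns(rewards: number[], gamma: number): number[] {
  const returns = new Array<number>(rewards.length).fill(0);
  let running = 0;
  for (let t = rewards.length - 1; t >= 0; t--) {
    running = rewards[t] + gamma * running;
    returns[t] = running;
  }
  return returns;
}

// REINFORCE: each log-probability is weighted by the raw return G_t.
function reinforceWeights(rewards: number[], gamma: number): number[] {
  return discountedReturns(rewards, gamma);
}

// REINFORCE with baseline: subtract the value estimate V(s_t) to reduce variance.
function baselineWeights(rewards: number[], values: number[], gamma: number): number[] {
  return discountedReturns(rewards, gamma).map((g, t) => g - values[t]);
}

// Actor-critic, one-step advantage: r_t + gamma * V(s_{t+1}) - V(s_t).
// Assumption: the state after the last reward is terminal, so its value is 0.
function tdAdvantages(rewards: number[], values: number[], gamma: number): number[] {
  return rewards.map((r, t) => {
    const nextValue = t + 1 < values.length ? values[t + 1] : 0;
    return r + gamma * nextValue - values[t];
  });
}
```

In each case the policy loss for a step would be the weight multiplied by the negative log-probability of the action taken. For example, `discountedReturns([1, 1, 1], 0.5)` gives `[1.75, 1.5, 1]`.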

Technologies

  • TensorFlow.js is used for the neural network
  • The experiment is rendered using the Phaser 3 game engine
  • tfjs-vis is used to visualize the learning graphs

The Settings page allows you to:

  • Configure the hyperparameters of the policy and value models
  • Store the trained models in the Browser
  • Retrieve and use stored models
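
As a rough illustration of how storing a model in the browser could work: TensorFlow.js models expose a `save(url)` method that accepts `localstorage://` and `indexeddb://` URL schemes. The sketch below only builds such a URL and forwards it to the model. Which backend the experiment actually uses is not stated here, so the helper names, the `SavableModel` interface and the model names are all assumptions made for illustration.

```typescript
// Browser storage backends that TensorFlow.js accepts as URL schemes when saving.
type BrowserStore = "localstorage" | "indexeddb";

// Minimal structural type: anything with a tf.LayersModel-style save(url) method.
// (Hypothetical interface, declared here so the sketch has no dependencies.)
interface SavableModel {
  save(url: string): Promise<unknown>;
}

// Build the storage URL for a named model, e.g. "indexeddb://cartpole-policy".
function modelStorageUrl(store: BrowserStore, name: string): string {
  if (name.length === 0) {
    throw new Error("model name must not be empty");
  }
  return `${store}://${name}`;
}

// Save a model under the given name in the chosen browser store.
function saveModel(model: SavableModel, store: BrowserStore, name: string): Promise<unknown> {
  return model.save(modelStorageUrl(store, name));
}
```

With TensorFlow.js loaded, a policy model could then be stored with `saveModel(policyModel, "indexeddb", "cartpole-policy")` and later retrieved with `tf.loadLayersModel("indexeddb://cartpole-policy")`.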