Cart pole – Noxis Studio

Cart Pole experiment

This experiment uses Tensorflow.js in order to solve the cartpole problem using Policy Gradient Reinforcement learning.

The Source code is available on Github
Try the experiment in your browser

Algorithms

You can choose between several algorithms :

REINFORCE
REINFORCE with baseline
Actor Critic (A2C)

Technologies

Tensorflow.js is used for the neural network
The Experiment is rendered using Phaser 3 game engine
Tfjs-vis is used for learning graph visualization

The Settings page allows to:

Configure the Hypermarameters of the Policy and Value models
Store the trained models in the Browser
Retrieve and use stored models