PPO CartPole Artifacts #

This file visualizes the training curve produced by NN/Examples/Models/RL/PPOCartPole.lean (lake exe -K cuda=true torchlean ppo_cartpole --cuda ...).

The executable trainer stays in Examples/Models; this file is only the editor-side artifact viewer. That split keeps PPO training code in one place and avoids a duplicate "RL examples" model zoo.

Workflow:

Run:

python3 -m pip install --user 'gymnasium>=1.0'
lake exe -K cuda=true torchlean ppo_cartpole --cuda --updates 1 --eval-every 1 --eval-episodes 1 --eval-max-steps 8

Put the cursor on the command below in an editor. The infoview will render the saved log.

Notes:

The executable writes data/rl/ppo_cartpole_trainlog.json by default (override with --log).
This viewer is pure: if the file is missing, it shows an error panel instead of failing to build.

References:

Schulman et al., "Proximal Policy Optimization Algorithms" (2017): https://arxiv.org/abs/1707.06347
Schulman et al., "High-Dimensional Continuous Control Using Generalized Advantage Estimation" (2015): https://arxiv.org/abs/1506.02438

TorchLean API

NN.Examples.RL.PPOCartPoleView

PPO CartPole Artifacts #