PPO CartPole Artifacts #
This file visualizes the training curve produced by
NN/Examples/Models/RL/PPOCartPole.lean (lake exe torchlean ppo_cartpole).
The executable trainer stays in Examples/Models; this file is only the editor-side artifact
viewer. That split keeps PPO training code in one place and avoids a duplicate "RL examples" model
zoo.
Workflow:
- Run:
python3 -m pip install --user 'gymnasium>=1.0'
lake exe torchlean ppo_cartpole
lake build -R -K cuda=true && lake exe torchlean ppo_cartpole --cuda
- Put the cursor on the command below in an editor. The infoview will render the saved log.
Notes:
- The executable writes
data/rl/ppo_cartpole_trainlog.jsonby default (override with--log). - This viewer is pure: if the file is missing, it shows an error panel instead of failing to build.
References:
- Schulman et al., "Proximal Policy Optimization Algorithms" (2017): https://arxiv.org/abs/1707.06347
- Schulman et al., "High-Dimensional Continuous Control Using Generalized Advantage Estimation" (2015): https://arxiv.org/abs/1506.02438