TorchLean API

NN.Runtime.RL

Reinforcement Learning Runtime Entrypoint #

This is the runtime umbrella for typed reinforcement-learning utilities. The modules here are executable infrastructure: they define small MDP/bandit interfaces, value-learning and policy-gradient helpers, replay buffers, checked rollout boundaries, Gymnasium subprocess sessions, and PPO rollout infrastructure.

The split is intentional: