TorchLean API

NN.API.RL.Core

Public RL API

This module exposes the mathematical and algorithmic RL surface under NN.API.rl.*.

Design intent:

References (background and terminology):

Differentiable policy-gradient losses over TorchLean backend references.

The pure exports above are algebra over concrete spec tensors. These helpers are the training-time counterpart: they build scalar losses from backend refs, so the same formulas can run through eager or compiled autograd.
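As a rough illustration of the kind of scalar loss these helpers build, here is a minimal sketch of a REINFORCE-style surrogate loss written over plain Lean 4 `Float` lists. The name `policyGradientLoss` and the list-based representation are assumptions made for this example. They are not TorchLean exports, and the actual helpers operate on backend refs rather than lists.

```lean
-- Hypothetical, self-contained sketch: plain Lean 4 `Float`s,
-- not TorchLean spec tensors or backend refs.

/-- REINFORCE-style surrogate loss: the negated mean of
    `logProb * advantage` over paired samples.
    Extra elements in the longer list are ignored by `zipWith`.
    Returns `0.0` for empty input. -/
def policyGradientLoss (logProbs advantages : List Float) : Float :=
  let terms := List.zipWith (fun lp adv => lp * adv) logProbs advantages
  if terms.isEmpty then 0.0
  else -(terms.foldl (· + ·) 0.0) / terms.length.toFloat

-- Example call with two samples (log-probabilities and advantages).
#eval policyGradientLoss [-0.5, -1.2] [1.0, -0.3]
```

Minimizing this scalar with respect to the policy parameters that produce the log-probabilities corresponds to ascending the policy-gradient objective, which is why a training-time version needs to be differentiable through the backend.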

Training Logs (Widgets and Examples)

TorchLean does not aim to be a full “trainer framework”, but many executable examples want to:

This namespace re-exports the small, stable log types and JSON IO helpers.
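To give a sense of what a small log type with JSON IO can look like, the sketch below uses only Lean's built-in `Lean.Json` support. The `StepLog` structure, its fields, and `parseStepLog` are hypothetical names chosen for this example. They are not the log types or helpers this namespace re-exports.

```lean
import Lean
open Lean

/-- Hypothetical log record for illustration; not a TorchLean type. -/
structure StepLog where
  step : Nat
  loss : Float
  deriving ToJson, FromJson

-- Serialize one record to a compact JSON string.
#eval (toJson ({ step := 1, loss := 0.25 } : StepLog)).compress

/-- Parse a JSON string back into a `StepLog`,
    reporting malformed input as an error message. -/
def parseStepLog (s : String) : Except String StepLog := do
  let j ← Json.parse s
  fromJson? j
```

Deriving `ToJson` and `FromJson` keeps the on-disk format tied to the structure definition, which suits log types that are meant to stay small and stable.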