TorchLean API

NN.API.Models.PPO

PPO Model Helpers (API) #

Reusable actor/critic MLP constructors for PPO examples.

These helpers intentionally cover only the neural-network shape. Environment collection, trust boundary checks, advantage computation, and optimizer loops stay in the examples/runtime modules.

Configuration for a simple PPO actor/critic pair over vector observations.

Instances For
    @[reducible, inline]
    Instances For
      @[reducible, inline]
      Instances For
        @[reducible, inline]
        Instances For

          Actor MLP mapping observations to action logits.

          Instances For

            Critic MLP mapping observations to a scalar value estimate.

            Instances For