TorchLean API

NN.Runtime.RL.Artifacts.GridWorld.Policy

GridWorld Policy-Difference Artifacts #

A PolicyDiff stores before/after greedy action maps for a fixed GridWorld. These files are small run artifacts for visualization and regression checks, not a general RL dataset format.

Policy snapshots #

Before/after greedy policy snapshots for a fixed width × height GridWorld.

before and after are flattened row-major arrays of action indices (0..3).

Instances For

    Validate a PolicyDiff record (lengths and action ranges).

    This is used defensively by IO readers/writers and widgets; it is scoped to IO specification layer for policies.

    Instances For

      Write a PolicyDiff JSON file to disk (creating parent directories if needed).

      Instances For