Context α: scalar interface for models + proofs #
TorchLean is designed to be scalar-polymorphic: the same model/layer definitions can be instantiated over many numeric backends:
- `Float` (fast execution; trusted runtime semantics),
- `TorchLean.Floats.IEEE754.IEEE32Exec` (executable bit-level IEEE-754 binary32),
- interval enclosures for verification (see `NN/Floats/Interval/*`),
- ℝ (proof-level mathematics).
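As a minimal sketch of the pattern (everything here is a hypothetical, trimmed-down stand-in: `TinyScalar`, `reluSpec`, and the `Float` dictionary are illustrative names, not TorchLean definitions):

```lean
-- Trimmed-down sketch: one scalar interface, one generic layer, several backends.
class TinyScalar (α : Type) where
  zero : α
  max  : α → α → α

/-- ReLU written once, against the scalar interface only. -/
def reluSpec {α : Type} [TinyScalar α] (x : α) : α :=
  TinyScalar.max TinyScalar.zero x

-- Runtime backend: Lean's builtin `Float`.
instance : TinyScalar Float :=
  ⟨0.0, fun a b => if a ≤ b then b else a⟩

#eval reluSpec (-3.0 : Float)  -- 0.000000
```

The same `reluSpec` could then be read at an interval or ℝ backend by supplying the corresponding dictionary; only the instance changes, never the layer definition.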
Why we designed it this way:
- We did not want separate "Float model code", "proof model code", and "verification model code" that slowly diverge and become inconsistent.
- In practice, we iterate across phases: execute a small model, state the proof-level contract, then run verification bounds. Rewriting each model for each phase is error-prone.
- A single scalar-polymorphic spec gives one source of truth for layers/models, while letting us swap numeric meaning by changing the scalar instance.
- This keeps cross-checking honest: if behavior changes between backends, the difference is visible at the scalar semantics layer, not hidden inside duplicated model definitions.
- The tradeoff is a slightly larger scalar interface (`Context α`), but we accept that complexity to keep architecture-level duplication low and proof reuse high.
Some relevant literature we drew on for this design:
- Bezanson et al., "Julia: A Fresh Approach to Numerical Computing" (generic numeric code across many scalar types; performance via specialization): https://arxiv.org/abs/1411.1607
- Spitters and van der Weegen, "Type classes for mathematics in type theory" (typeclass-based algebraic interfaces for reusable formalization and instances): https://doi.org/10.1017/S0960129511000119
- Elliott, "The Simple Essence of Automatic Differentiation" (one abstract formulation specialized to multiple concrete semantics/representations): https://arxiv.org/abs/1804.00746
- Mirman et al., "The Fundamental Limits of Interval Arithmetic for Neural Networks" (why interval backends are useful and where they become conservative): https://arxiv.org/abs/2112.05235
Our Context α is the same engineering pattern in a Lean setting: one model/layer definition, many
scalar interpretations, and explicit tradeoffs about semantics.
To make this practical, we collect the numeric operations required by neural networks into a single typeclass: `Context α`.
This is intentionally broader than a standard algebraic structure: it bundles arithmetic, ordering, and common transcendental functions (exp/tanh/log/sqrt) used by activations and losses.
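For example, bundling `exp` with the arithmetic lets an activation be written once against the interface. The class and names below are a hypothetical trimmed-down sketch, not the library's `Context`:

```lean
-- Trimmed-down sketch: arithmetic plus `exp` in one interface is enough to state the
-- logistic sigmoid generically over the scalar.
class TinyActScalar (α : Type) where
  one : α
  add : α → α → α
  div : α → α → α
  neg : α → α
  exp : α → α

def sigmoidSpec {α : Type} [TinyActScalar α] (x : α) : α :=
  let e := TinyActScalar.exp (TinyActScalar.neg x)
  TinyActScalar.div TinyActScalar.one (TinyActScalar.add TinyActScalar.one e)

-- Runtime backend assembled from Lean's builtin Float primitives.
instance : TinyActScalar Float :=
  ⟨1.0, Float.add, Float.div, Float.neg, Float.exp⟩

#eval sigmoidSpec (0.0 : Float)  -- 0.500000
```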
Notes #
- Many spec definitions assume `[Context α]` so they can be re-used at multiple dtypes.
- For "paper theorems", the spec layer fixes `Spec.SpecScalar := ℝ` (see `NN/Spec/Core/Scalar.lean`).
- `Context.decidable_gt` is included so executable code can decide comparisons (e.g. ReLU / argmax).
- For executable examples, `Context.gtBool` converts `x > y` into a printable `Bool` (see the sketch after this list).
- For interval arithmetic, we override some order/comparison behavior (see `namespace Interval` below).
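A standalone sketch of the decidable-comparison point, specialised to `Float` (the names `gtBoolF` and `stepF` are ours; the library's generic version is `Context.gtBool` built from `decidable_gt`):

```lean
-- Turn a decidable `>` into a printable Bool, specialised to Float.
def gtBoolF (x y : Float) : Bool :=
  decide (x > y)

#eval gtBoolF 2.0 3.0   -- false
#eval gtBoolF 3.0 2.0   -- true

-- A decidable `>` is what lets executable code branch, e.g. a hard threshold:
def stepF (x : Float) : Float :=
  if gtBoolF x 0.0 then 1.0 else 0.0

#eval stepF (-0.5)  -- 0.000000
```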
Scalar transcendental functions used in activations and losses.
- exp : α → α
- tanh : α → α
- cosh : α → α
- sqrt : α → α
- abs : α → α
- log : α → α
- pi : α
- cos : α → α
- sin : α → α
- sinh : α → α
Instances
Friendly aliases: the original names (`MathFunctions`, `Numbers`) are kept, while these provide more descriptive names for scalar-facing code.
The full scalar interface required by spec‑level tensors and models.
- default : α
- one : α
- zero : α
- add : α → α → α
- sub : α → α → α
- mul : α → α → α
- div : α → α → α
- neg : α → α
- pow : α → α → α
- max : α → α → α
- min : α → α → α
- exp : α → α
- tanh : α → α
- cosh : α → α
- sqrt : α → α
- abs : α → α
- log : α → α
- pi : α
- cos : α → α
- sin : α → α
- sinh : α → α
- neg_point_five : α
- neg_one : α
- pointone : α
- pointfive : α
- two : α
- three : α
- four : α
- five : α
- ten : α
- log10 : α
- log10000 : α
- epsilon : α
- neg_thousand : α
- decidable_gt : DecidableRel fun (x1 x2 : α) => x1 > x2
Instances
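As an illustration of why named constants such as `epsilon` are part of the interface, here is a hypothetical trimmed-down stand-in (not the library's `Context`) with a guarded log of the kind used in losses; the epsilon value is ours, purely for illustration:

```lean
-- Trimmed-down sketch: named constants let spec code state numeric guards generically.
class TinyLossScalar (α : Type) where
  add     : α → α → α
  log     : α → α
  epsilon : α

/-- A guarded log: evaluate `log (x + epsilon)` through the interface only. -/
def safeLog {α : Type} [TinyLossScalar α] (x : α) : α :=
  TinyLossScalar.log (TinyLossScalar.add x TinyLossScalar.epsilon)

-- Illustrative Float dictionary; the epsilon value is ours, not the library's.
instance : TinyLossScalar Float := ⟨Float.add, Float.log, 1e-7⟩

#eval safeLog (0.0 : Float)  -- ≈ -16.118, finite rather than -inf
```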
Decide x > y as a Bool using the Context's decidable_gt.
Instances For
A Context includes a decidable > relation; expose it as a standard typeclass.
MathFunctions instance for Lean's Float (runtime-oriented backend).
MathFunctions instance for ℝ (proof backend, noncomputable).
Numbers literals for ℝ (proof backend, noncomputable).
Coerce naturals into Float using Float.ofNat.
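A hedged sketch of how such dictionaries are assembled, using a trimmed-down stand-in class (`TinyMath` is not the library's `MathFunctions`, but the Float/ℝ function choices mirror the descriptions above):

```lean
import Mathlib

-- Trimmed-down stand-in for a transcendental-function dictionary.
class TinyMath (α : Type) where
  exp  : α → α
  sqrt : α → α
  log  : α → α

-- Runtime-oriented backend: Lean's builtin Float primitives.
instance : TinyMath Float := ⟨Float.exp, Float.sqrt, Float.log⟩

-- Proof backend: Mathlib's real functions; noncomputable, intended for theorems only.
noncomputable instance : TinyMath ℝ := ⟨Real.exp, Real.sqrt, Real.log⟩
```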
A lightweight ℚ backend #
Context includes transcendental functions and real-valued exponentiation (Pow α α) because many
models (softmax, tanh, etc.) need them when instantiated over Float / ℝ / interval scalars.
For ℚ, most transcendental functions do not map rationals to rationals, so there is no canonical
exact interpretation. TorchLean therefore does not install the rational Context globally.
Purely algebraic tests can opt in explicitly with:
`open scoped NN.Spec.RationalAlgebraic`
Current policy (captured by the declarations below):
Pow ℚ ℚ instance used for the lightweight rational backend.
Policy: support `x ^ y` only when `y` is an integer (i.e. `y.den = 1`); otherwise return 0.
The instance is scoped so it is unavailable unless the caller explicitly opens
NN.Spec.RationalAlgebraic.
Instances For
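A hedged standalone mirror of the stated policy (assuming Mathlib's ℚ; `ratPow` is an illustrative name, not the library's scoped instance):

```lean
import Mathlib

-- Integer exponents only; any non-integer exponent falls back to 0, as described above.
def ratPow (x y : ℚ) : ℚ :=
  if y.den = 1 then x ^ y.num else 0

#eval ratPow (3/2 : ℚ) 2        -- 9/4
#eval ratPow (3/2 : ℚ) (1/2)    -- 0 (non-integer exponent is unsupported)
```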
MathFunctions ℚ dictionary for the lightweight rational backend.
Only `abs` is meaningful; the other transcendental functions are defined as 0 in this scoped backend and should not be used for semantic claims. Because the dictionary is scoped, models that rely on transcendental functions over ℚ fail at elaboration unless a file deliberately opts into the algebraic-test backend.
Instances For
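A hedged standalone mirror of this convention (assuming Mathlib's ℚ; the names are illustrative, not the library's dictionary):

```lean
import Mathlib

-- Over ℚ only `abs` keeps an exact meaning; the other slots are stubbed to 0.
def ratAbs (q : ℚ) : ℚ := |q|
def ratExpStub (_ : ℚ) : ℚ := 0  -- placeholder only; carries no semantic claim

#eval ratAbs (-3/4 : ℚ)  -- 3/4
```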
Full opt-in Context dictionary for exact rational algebraic fragments.
Instances For
Full Context instance for ℝ (proof backend, noncomputable).