2.1. Tensors, Shapes, and DTypes
TorchLean is built for verification, but the user experience still starts with the same two objects you expect from PyTorch:
- tensors (data), and
- functions over tensors (models and losses).
We start with the practical tensor model: what the type means, how shapes show up in the type checker, and which constructors to use in examples.
The most important part of TorchLean's tensor design has no direct analogue in plain PyTorch: the
scalar type parameter α is not an implementation detail. It is the switch that lets the same
tensor code run as:
- proof-oriented math (ℝ/ℚ),
- convenient runtime demo code (Float), or
- executable IEEE-754 float32 code (IEEE32Exec).
The declarations behind this model are in the tensor and scalar reference pages, but the important idea is small enough to state here:
\mathrm{Tensor}(\alpha,s)
means values of scalar type α arranged according to shape s.
The proof layer then relates this shaped view to flat vectors, pointwise algebra, dot products, and runtime payloads.
2.1.1. The Canonical Tensor Type
The spec layer defines the canonical tensor type:
- Spec.Tensor α s is “a tensor with scalar type α and shape s”.
The shape grammar is inductive:
\mathrm{Shape} ::= \mathrm{scalar}\;|\;\mathrm{dim}(n,\mathrm{Shape})
So a matrix is not a tensor plus a loose runtime rank field. It is a value whose type remembers that it has two dimensions.
Most user code does not write Spec.Tensor explicitly. Instead, examples usually import
NN.Tensor and use the constructors and shape aliases from that layer.
This is still a tensor in the spec layer, with a precise meaning. The NN.Tensor layer gives you
good constructors and printing without changing the math.
2.1.1.1. PyTorch Similarity
The closest PyTorch mental model is:
import torch

x = torch.tensor([[1.0, 2.0], [3.0, 4.0]])
y = torch.tensor([[0.2, -0.1], [0.0, 0.3]])
z = x + y
In TorchLean, the same idea is written with shape information in the type:
open Spec
open NN.Tensor

def x : Spec.Tensor Float (shape![2, 2]) :=
  tensor! [[1.0, 2.0], [3.0, 4.0]]

def y : Spec.Tensor Float (shape![2, 2]) :=
  tensor! [[0.2, -0.1], [0.0, 0.3]]

def z := Spec.add_spec x y
That is the basic pattern throughout TorchLean: the syntax feels familiar, but the shape is checked by Lean instead of being discovered only at runtime.
2.1.2. Shapes: Values (and Often Types)
A tensor shape is a Spec.Shape. In examples, the most convenient way to write a shape is the
shape![...] macro:
open NN.Tensor

def sVec : Spec.Shape := shape![4]
def sMat : Spec.Shape := shape![3, 2]
For common ML shapes, TorchLean also provides readable aliases:
- Shape.Vec n for length-n vectors,
- Shape.Mat rows cols for matrices,
- Shape.Images n c h w for NCHW image batches,
- plus Shape.CHW, Shape.NCHW, Shape.NHWC, and others.
The full list is generated in the tensor reference, but the convention is simple: names such as
Vec, Mat, NCHW, and Images are readable aliases for repeated Shape.dim constructors.
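If the aliases really are plain abbreviations for repeated Shape.dim constructors, the correspondence can be checked with definitional equality. This is a hedged sketch: the constructor argument orders are assumed from the grammar above, not verified against the generated reference.

```lean
open Spec

-- Sketch: each alias should unfold to the same chain of `Shape.dim`
-- constructors as the explicit spelling. If the aliases are plain
-- abbreviations, `rfl` closes each goal.
example : Shape.Vec 4 = Shape.dim 4 Shape.scalar := rfl
example : Shape.Mat 3 2 = Shape.dim 3 (Shape.dim 2 Shape.scalar) := rfl
example : Shape.Images 8 3 32 32 =
    Shape.dim 8 (Shape.dim 3 (Shape.dim 32 (Shape.dim 32 Shape.scalar))) := rfl
```

If an alias is instead a reducible definition rather than an abbreviation, the same statements should still close with a single unfolding step.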
2.1.3. DTypes: Just Lean Types
TorchLean uses the scalar element type as the “dtype”. For example:
- Tensor Float s is a runnable floating-point tensor (convenient for demos),
- Tensor ℚ s is a rational tensor (useful for exact arithmetic),
- Tensor TorchLean.Floats.IEEE32Exec s is an executable IEEE-754 binary32 tensor.
This is not a slogan: it means you can reuse normal Lean abstractions (typeclasses, polymorphism) when you write code that should run on multiple scalar backends.
One architecture can therefore be read as a family of functions:
f_\alpha : \mathrm{Tensor}(\alpha, s_{\mathrm{in}}) \longrightarrow \mathrm{Tensor}(\alpha, s_{\mathrm{out}})
The subscript changes when we choose Float, IEEE32Exec, ℝ, intervals, or another scalar
domain. The architecture and shapes remain the same.
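That family-of-functions reading can be sketched directly in Lean. This is a hedged example: it assumes Spec.add_spec needs only an Add-style instance on α, which matches the polymorphism described in this section but is not checked against the actual signature.

```lean
open Spec

-- One architecture, many scalars: a toy "residual add" written once,
-- polymorphic in α. The `[Add α]` constraint is an assumption about
-- what `Spec.add_spec` requires.
def residual {α : Type} [Add α] {s : Spec.Shape}
    (x y : Spec.Tensor α s) : Spec.Tensor α s :=
  Spec.add_spec x y
```

Instantiating `residual` at Float gives runnable demo code; instantiating it at ℝ gives the object the proof layer reasons about. The definition itself never changes.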
In the codebase, the scalar layer splits into a few pieces:
- Spec.SpecScalar is fixed to ℝ for the mathematical spec layer.
- Runtime.Scalar / Semantics.Scalar-driven code can run over different backends.
- tensorF! and tensorF321d let you author examples once in Float and then cast to a runtime scalar or an executable FP32 backend.
For readers in a theorem file, α often means “the scalar the theorem is polymorphic over”. In a
training tutorial, α usually means “the runtime backend scalar we got from train.run”. That
distinction matters because the same model can be instantiated over Float, IEEE32Exec, or ℝ
depending on whether you are executing, validating, or proving.
2.1.4. Constructing Tensors (The Ergonomic Layer)
For small examples, the most common constructors are:
- tensor1d / tensor2d for lists of numbers,
- tensorND for runtime dims plus a flat row-major list,
- tensorND! for constants when the length proof should be solved automatically,
- tensor! for nested bracket literals (often the nicest for handwritten tensors).
Example (nested brackets, like nested Python lists):
open Spec
open NN.Tensor

-- A 2×2 tensor (shape inferred from the nesting)
def x : Spec.Tensor Float (shape![2, 2]) :=
  tensor! [[1.0, 2.0], [3.0, 4.0]]
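The flat constructors from the list above can be sketched the same way. This is hedged: the argument orders for tensor1d and tensorND! are assumed from their one-line descriptions, not checked against the API.

```lean
open Spec
open NN.Tensor

-- A length-4 vector from a flat list (assumed `tensor1d` argument order).
def v : Spec.Tensor Float (shape![4]) :=
  tensor1d [0.1, 0.2, 0.3, 0.4]

-- Runtime dims plus a flat row-major list; per the description above,
-- `tensorND!` discharges the length proof (2 * 3 = 6) automatically.
def m : Spec.Tensor Float (shape![2, 3]) :=
  tensorND! [2, 3] [1, 2, 3, 4, 5, 6]
```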
When you are writing code that should work with several backends, tensorF! is a useful trick: write Float
literals once, then cast elementwise into a runtime scalar α.
-- `cast : Float → α` comes from the runtime runner (see the training tutorials).
def w {α : Type} (cast : Float → α) : Spec.Tensor α (shape![3, 2]) :=
tensorF! cast [3, 2] [0.2, -0.1, 0.0, 0.3, -0.4, 0.1]
For runtime examples, the same idea shows up as a cast from Float constants into the active
backend scalar in the model code. The runtime module API
contains castTensor and castTList, which turn Float initializers into parameter tensors for
the selected backend.
2.1.5. Four Everyday Shapes
Most model code uses the same few shapes over and over. It helps to learn them as reading patterns, not as abstract syntax.
2.1.5.1. A feature vector
def x : Spec.Tensor Float (Shape.Vec 3) := tensor! [0.2, -0.1, 0.7]
Read this as one sample with three features.
2.1.5.2. A matrix batch
def xs : Spec.Tensor Float (Shape.Mat 4 3) :=
tensor! [
[0.0, 0.1, 0.2],
[0.3, 0.4, 0.5],
[0.6, 0.7, 0.8],
[0.9, 1.0, 1.1]
]
Read this as four samples, each with three features. A linear layer with
pfx := Shape.Vec 4 will act on the last dimension and preserve the batch axis.
2.1.5.3. A CHW image
def img : Spec.Tensor Float (Shape.CHW 1 4 4) :=
tensorND! [1, 4, 4] [
0, 0, 1, 1,
0, 1, 1, 0,
1, 1, 0, 0,
1, 0, 0, 1
]
Read this as one channel, height four, width four. The CNN tutorials use this convention because it
matches the usual PyTorch NCHW layout once a batch axis is added.
2.1.5.4. An image batch
Shape.Images batch channels height width
This is the shape expected by the public nn.conv builder. The batch dimension is part of the model
type, so a model trained with batch size 8 and a model trained with batch size 16 have different
types unless the file abstracts over batch.
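Abstracting over the batch dimension is ordinary Lean parameterization, since the shape is just a value feeding the type index. A minimal sketch (the shape alias is the one listed earlier; no builder call is shown because the nn.conv signature is not reproduced here):

```lean
open Spec

-- The batch size is part of the type: abstracting over it keeps one
-- definition usable at batch 8 and batch 16 alike.
def imgBatch (batch : Nat) : Spec.Shape :=
  Shape.Images batch 3 32 32

-- A model typed over `imgBatch batch` for an arbitrary `batch : Nat`
-- works at any batch size; a model typed at `imgBatch 8` only accepts
-- batches of eight.
```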
2.1.6. Reading Tensor Types In Model Code
When you see a model type such as:
nn.Sequential (Shape.Mat batch 2) (Shape.Mat batch 1)
read it from left to right as a contract:
- the input is a batch of two-feature samples;
- the output is a batch of one-value predictions;
- every layer in between must preserve or transform shapes so the sequence composes.
When you see:
nn.Sequential (Shape.Images batch 3 32 32) (shape![batch, 10])
read it as an image classifier: a batch of RGB images goes to ten logits per image.
This habit is more useful than memorizing every constructor. The shape tells you what the model claims to do before you read the body.
2.1.7. Shape Design Decisions
TorchLean's tensor layer makes a few choices that are worth understanding early.
- Shapes are part of the specification. A tensor is not just a pointer plus runtime metadata; in the core APIs, its shape appears in the Lean type.
- Dynamic inputs are allowed at the boundary. File loaders, JSON artifacts, and CLI data often arrive with runtime dimensions. TorchLean checks those dimensions as data and then packages them into typed or dynamic tensors.
- Broadcasting is explicit at the operator layer. The project avoids implicit promotion rules when those rules would make proof obligations harder to read.
- Flattened and shaped views are connected by lemmas. Runtime and verifier code often wants flat buffers; proof code often wants shaped tensors. The tensor library keeps those views related.
This is why a tutorial tensor can feel close to PyTorch while the theorem declarations still have enough structure to prove algebraic facts about the same operations.
For example, pointwise addition is not "add two buffers and hope their metadata agrees." Its informal type is:
\mathrm{add} : \mathrm{Tensor}(\alpha, s) \times \mathrm{Tensor}(\alpha, s) \longrightarrow \mathrm{Tensor}(\alpha, s)
That one shared s is doing real work. It is why the verifier and proof layer can later talk about
the same operation without rediscovering the layout from a runtime trace.
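What the shared s buys is visible at elaboration time. A hedged sketch, reusing Spec.add_spec from the earlier example:

```lean
open Spec
open NN.Tensor

def a : Spec.Tensor Float (shape![2, 2]) := tensor! [[1, 2], [3, 4]]
def b : Spec.Tensor Float (shape![2, 2]) := tensor! [[5, 6], [7, 8]]
def c : Spec.Tensor Float (shape![2, 3]) := tensor! [[1, 2, 3], [4, 5, 6]]

-- Same `s` on both arguments: elaborates.
def ok := Spec.add_spec a b

-- def bad := Spec.add_spec a c
-- ^ rejected before anything runs: `shape![2, 2]` and `shape![2, 3]`
--   are different types, so there is no common `s` to instantiate.
```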
2.1.8. A Tiny Shape Bug Walkthrough
Suppose a model expects a batch of two length-three inputs and a weight matrix from three features to
four outputs. In ordinary Python code it is easy to accidentally transpose the weight and discover
the mistake only when matmul runs.
In TorchLean, the intended shapes are visible at the definition site:
open Spec
open NN.Tensor

def xs : Spec.Tensor Float (shape![2, 3]) :=
  tensor! [[1, 2, 3], [4, 5, 6]]

def wGood : Spec.Tensor Float (shape![3, 4]) :=
  tensor! [[1, 0, 0, 1], [0, 1, 1, 0], [1, 1, 0, 0]]
A transposed weight has shape shape![4, 3], which is a different type. That does not prove the
model is correct, but it prevents a large class of accidental wiring mistakes from reaching the
runtime or verifier.
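Writing the transposed weight out makes the point concrete. This is a sketch of what Lean would reject, not a claim about any specific matmul API:

```lean
open Spec
open NN.Tensor

-- The transposed weight lives at a different type:
def wBad : Spec.Tensor Float (shape![4, 3]) :=
  tensor! [[1, 0, 1], [0, 1, 1], [0, 1, 0], [1, 0, 0]]

-- Any definition expecting `Spec.Tensor Float (shape![3, 4])` will
-- reject `wBad` at elaboration time, before the runtime or verifier
-- ever sees the model.
```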
2.1.9. Dynamic Shapes
Sometimes you truly do not know the shape statically (CLI inputs, JSON, file formats). For those cases TorchLean provides:
- tensorND (dims : List Nat) : List α → Except String (Tensor α (shapeOfDims dims)), with a runtime length check;
- DynTensor α, which stores the Shape as data alongside the tensor.
This lets tooling remain shape-aware even when the type cannot carry the full shape index.
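A hedged sketch of the runtime-checked path, assuming tensorND has the Except-returning signature listed above:

```lean
open Spec
open NN.Tensor

-- Dims arrive at runtime (e.g. parsed from JSON); the length check
-- happens as data, and failure is an `Except.error`, not a panic.
def fromRuntime (dims : List Nat) (xs : List Float) :
    Except String (Spec.Tensor Float (shapeOfDims dims)) :=
  tensorND dims xs

-- Per the runtime length check described above, something like
-- `fromRuntime [2, 2] [1, 2, 3]` should produce an error, since the
-- flat list has 3 elements but the dims demand 2 * 2 = 4.
```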
If you want the most permissive constructors, the API also includes:
- tensor2d? and tensor2d for nested lists,
- tensor2dPadTo and tensor2dPadRight for ragged inputs that you intend to pad,
- tensorDynND when you want a dynamic wrapper rather than a static shape index.
The relevant design note is that TorchLean does not force every input format to be perfectly typed at the boundary. Instead, it lets you choose between:
- shapes with explicit proofs (tensorNDOfLenEq),
- shapes checked at runtime (tensorND, tensorDynND), and
- permissive padding behavior for sequence data (tensor2dPadTo, tensor2dPadRight).
That is the shape of a mixed proof/runtime ML stack.
2.1.9.1. How TorchLean Differs from Plain PyTorch
The PyTorch version checks shapes when you run the code. TorchLean checks many of them when the file elaborates, so a shape mismatch becomes a compile error instead of a late failure.
For example, a common bug in numerical code is accidentally mixing a vector and a matrix with the wrong axis order or layout. In PyTorch you often discover that after a runtime error; in TorchLean the type forces the mismatch to be visible up front.
-- PyTorch-style idea:
--   y = x @ w

-- TorchLean-shaped idea:
def xMat : Spec.Tensor Float (shape![2, 3]) :=
  tensor! [[1, 2, 3], [4, 5, 6]]

def wMat : Spec.Tensor Float (shape![3, 4]) :=
  tensor! [[1, 0, 0, 1], [0, 1, 1, 0], [1, 1, 0, 0]]

-- The result shape is checked by the matmul API / layer shape rules,
-- not by a hidden runtime guard.
2.1.10. A Runnable Starting Point
If you want a small file that you can run immediately and that only exercises the tensor layer, start here:
It demonstrates:
- dtype-as-type (Float, ℚ, Int),
- tensor! nesting + row-major flattening,
- tensorF321d for executable float32,
- tensorND / tensorDynND for runtime-shaped inputs,
- and the fact that printing is intentionally disabled for proof-only scalar backends like ℝ.
2.1.11. What the Tensor Layer Gives You
TorchLean’s tensor layer is not just a convenience wrapper. It is strong because the shape index is part of the tensor’s type, and the proof layer already knows how to move between typed views, flattened views, and algebraic laws.
Representative theorems and definitions:
- Spec.flattenR / Spec.unflattenR round-trip the tensor view used in proofs.
- Spec.toVec_add_spec and Spec.toVec_scale_spec connect tensor operations to pointwise algebra.
- Spec.mul_spec_assoc and Spec.mul_spec_comm give the usual algebraic laws for pointwise tensor multiplication.
- Spec.add_spec_comm and Spec.mul_spec_fill_zero show the expected behavior of tensor arithmetic on special cases.
- Spec.dot packages the familiar “elementwise multiply, then sum” pattern.
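A hedged sketch of how a lemma like this gets used downstream. The statement shape and the typeclass constraint are assumptions inferred from the name, not the actual declaration:

```lean
open Spec

-- If `Spec.add_spec_comm` states commutativity of pointwise addition
-- (over a scalar whose addition commutes — the `AddCommMonoid α`
-- constraint here is an assumption), a later proof can apply it directly:
example {α : Type} [AddCommMonoid α] {s : Spec.Shape}
    (x y : Spec.Tensor α s) :
    Spec.add_spec x y = Spec.add_spec y x :=
  Spec.add_spec_comm x y
```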
Those facts matter because they let later chapters talk about models, losses, and verification using ordinary algebra instead of ad hoc shape bookkeeping. The same tensor library can be used for:
- exact reasoning in ℚ or ℝ,
- executable float32 examples,
- and shape-safe data / model pipelines.
If you want the proof companion, start from the tensor proof API.