Evaluation helpers #
These utilities aggregate per-sample or per-batch StepReports into a single
mean report. Metrics are matched by name and position.
Metric aggregation #
Report sums (for weighted aggregation) #
An accumulator for averaging StepReports.
Instead of keeping a list of all reports and reducing at the end, we maintain:
- `count`: how many samples contributed,
- `lossSum`: the sum of losses (optionally weighted by batch size),
- `metricsSum`: a pointwise sum of named metrics.
This is the same idea as computing streaming averages in a typical PyTorch evaluation loop; a sketch follows the field list below.
- count : ℕ
Number of samples represented by this accumulator.
- lossSum : a
Sum of losses, already weighted by sample count for batch reports.
- metricsSum
Pointwise sum of metrics; names must stay aligned across additions.
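A minimal sketch of the structure, assuming a `StepReport` that carries a loss and a list of named metrics. The library is polymorphic in the scalar type (`lossSum : a` above); `Float` is used here for simplicity, and the `add` combinator is a hypothetical illustration of the pointwise summation.

```lean
structure StepReport where
  loss    : Float
  metrics : List (String × Float)

structure ReportSums where
  count      : Nat
  lossSum    : Float
  metricsSum : List (String × Float)

/-- Combine two accumulators. Metric lists must stay aligned by name and
position; `zipWith` silently truncates if they are not. -/
def ReportSums.add (x y : ReportSums) : ReportSums :=
  { count      := x.count + y.count,
    lossSum    := x.lossSum + y.lossSum,
    metricsSum := x.metricsSum.zipWith (fun (n, v) (_, w) => (n, v + w)) y.metricsSum }
```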
Start an accumulator from a single-sample report.
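A sketch of this constructor under the `Float` assumptions above; the name `ofSample` is hypothetical.

```lean
/-- One sample contributes with weight 1: its loss and metrics are already the sums. -/
def ReportSums.ofSample (r : StepReport) : ReportSums :=
  { count := 1, lossSum := r.loss, metricsSum := r.metrics }
```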
Start an accumulator from a batch report, weighted by the number of samples in the batch.
This is the appropriate constructor when evalBatch returns means over the batch, but we want
the final mean to weight by the number of items in each batch.
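A sketch continuing the `Float` setup; the name `ofBatch` is hypothetical. The key step is scaling each component of the per-batch means back up by the batch size, so that later division by the total count yields a correctly weighted mean.

```lean
/-- `r` is assumed to hold means over `n` samples, so each component is
scaled by `n` to recover sums before accumulation. -/
def ReportSums.ofBatch (n : Nat) (r : StepReport) : ReportSums :=
  let w := n.toFloat
  { count      := n,
    lossSum    := w * r.loss,
    metricsSum := r.metrics.map (fun (name, v) => (name, w * v)) }
```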
Dataset evaluation #
Evaluate a list of samples and average their reports.
This is the “for sample in dataset: compute report; take mean” pattern.
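A sketch of the loop under the assumptions above. Returning `Option` for an empty list, and the `mean` helper that divides the sums by the count, are illustrative choices rather than the library's actual signatures.

```lean
/-- Turn accumulated sums back into a mean report. -/
def ReportSums.mean (s : ReportSums) : StepReport :=
  let c := s.count.toFloat
  { loss    := s.lossSum / c,
    metrics := s.metricsSum.map (fun (name, v) => (name, v / c)) }

/-- Fold per-sample reports into one accumulator, then take the mean. -/
def evalList {Sample : Type} (evalSample : Sample → StepReport) :
    List Sample → Option StepReport
  | [] => none
  | s :: ss =>
    let sums := ss.foldl
      (fun (acc : ReportSums) s' => acc.add (.ofSample (evalSample s')))
      (.ofSample (evalSample s))
    some sums.mean
```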
Evaluate a Dataset by converting to a list and calling evalList.
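A sketch of the delegation; this `Dataset` is a stand-in, since only the conversion to a list matters here.

```lean
/-- Stand-in for the library's `Dataset`; only `toList` is needed here. -/
structure Dataset (Sample : Type) where
  toList : List Sample

def evalDataset {Sample : Type} (evalSample : Sample → StepReport)
    (ds : Dataset Sample) : Option StepReport :=
  evalList evalSample ds.toList
```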
Evaluate a list of non-empty batches and compute a weighted mean report.
Each batch contributes proportionally to its length (so a small final batch does not distort the average).
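A sketch under the same assumptions, with batches represented as plain lists that are assumed non-empty. `evalBatch` is assumed to return per-batch means, which `ofBatch` re-weights by the batch length.

```lean
def evalBatches {Sample : Type} (evalBatch : List Sample → StepReport) :
    List (List Sample) → Option StepReport
  | [] => none
  | b :: bs =>
    let sums := bs.foldl
      (fun (acc : ReportSums) b' => acc.add (.ofBatch b'.length (evalBatch b')))
      (.ofBatch b.length (evalBatch b))
    some sums.mean
```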
Batch a dataset and then call evalBatches.
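A sketch of the composition; the `chunks` helper and the name `evalBatched` are hypothetical, and `batchSize` is assumed positive so every chunk is non-empty.

```lean
/-- Hypothetical helper: split a list into chunks of at most `n` elements. -/
partial def chunks {α : Type} (n : Nat) (xs : List α) : List (List α) :=
  if xs.isEmpty || n == 0 then []
  else xs.take n :: chunks n (xs.drop n)

def evalBatched {Sample : Type} (batchSize : Nat)
    (evalBatch : List Sample → StepReport)
    (ds : Dataset Sample) : Option StepReport :=
  evalBatches evalBatch (chunks batchSize ds.toList)
```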