Real Dataset Paths #
Small tutorial fixtures live in NN.Examples.Data.ToyPaths.
This module names the default paths used by examples that train on externally
downloaded datasets prepared by scripts/datasets/download_example_data.py.
The files are intentionally not committed: CIFAR-10 and text corpora are real datasets with their own licenses/citation expectations, and CI should not download hundreds of megabytes implicitly.
Default root for user-downloaded real datasets.
Instances For
Parse an optional --data-dir PATH flag for real-data examples.
Instances For
Directory containing prepared CIFAR-10 .npy arrays.
Instances For
Prepared CIFAR-10 training images, shape (N, 3, 32, 32), float32 in [0, 1].
Instances For
Prepared CIFAR-10 training labels, shape (N,), float32 integer labels 0..9.
Instances For
Prepared CIFAR-10 test images, shape (N, 3, 32, 32), float32 in [0, 1].
Instances For
Prepared CIFAR-10 test labels, shape (N,), float32 integer labels 0..9.
Instances For
Directory containing a user-prepared ImageNet-style 64x64 subset.
TorchLean does not redistribute ImageNet. Users point scripts/datasets/torchlean_data_convert.py image-folder at their own ImageNet/ILSVRC, ImageNet-compatible, or Tiny-ImageNet-style directory
tree and write the converted arrays here.
Instances For
Prepared ImageNet-style training images, shape (N, 3, 64, 64), float32 in [0, 1].
Instances For
Prepared ImageNet-style training labels, shape (N,), float32 integer class ids.
The converter assigns ids by sorted subdirectory name when --labels-from-dirs is used.
Instances For
Directory containing prepared UCI household-power forecasting windows.
Instances For
Prepared UCI household-power inputs, shape (N, 24, 1), float32 normalized to [0, 1].
Each row is a 24-hour window of hourly mean Global_active_power.
Instances For
Prepared UCI household-power targets, shape (N, 24, 1), float32 normalized to [0, 1].
Each target row is the corresponding input window shifted by one hour.
Instances For
Directory containing the prepared UCI Auto MPG tabular regression CSV.
Instances For
Prepared UCI Auto MPG CSV with normalized columns x1..x7,y.
Instances For
Directory containing downloaded text corpora.
Instances For
Karpathy tiny-shakespeare corpus.
Instances For
TinyStories validation split, useful for small local language-model smoke training.