CUDA eager-engine backend #
This umbrella collects the CUDA side of TorchLean's eager autograd engine.
The modules here are deliberately split by trust boundary:
TrustedandBufferexpose the opaque FFI buffer type and allocation/copy primitives.Kernels,ConvPool, andDGemmdeclare native CUDA/CPU-stub kernel entrypoints.TapeandOpsbuild the CUDA reverse-mode tape over those buffers.Float32ContractandKernelSpecstate the proof-facing reference contracts for native bits.NativeSourcesdocuments which C/CUDA files implement the external symbols.
Lean does not prove the compiled CUDA binary correct. It proves the pure specs and graph-level connections around this boundary, while runtime tests validate the native implementation path.