The Autograd Engine

The autograd module is the machinery that makes training possible. Every call to autograd::relu, autograd::matmul, or autograd::cross_entropy_loss does two things simultaneously: it computes the output value (the forward pass), and it records a backward function that knows how to compute the gradient of that output with respect to its inputs. When you call autograd::backward(arena, loss), GradCore-Tensor replays those recorded functions in reverse to propagate gradients all the way back to the leaf parameters.

Header: include/autograd/autograd.hpp
Namespace: gradientcore::autograd

The relationship to the tensor module

The autograd engine sits on top of the tensor module. Every autograd op calls the corresponding tensor_* function for the forward computation and tensor_*_grad function inside its backward closure. The autograd layer adds no mathematical logic of its own — it adds graph construction and gradient routing. See Tensors in the Autograd Engine for a deep dive into how the two layers connect.

What Lives in Autograd

autograd/
├── autograd.hpp         ← Single header — include this
│
├── engine.cpp           ← Variable, create_leaf, backward, topological sort
│
└── ops/
    ├── arithmetic/      ← add, sub, mul, matmul, scale, sum
    ├── activations/     ← relu, sigmoid, tanh, gelu, softmax, …
    └── loss/            ← mse_loss, cross_entropy_loss, bce_loss, …

The Core Abstractions

`Variable` — a tensor that remembers

struct Variable {
    Tensor  *data;            // The actual values
    Tensor  *grad;            // Gradient accumulator (same shape)
    bool     requires_grad;   // Compute gradients for this?
    bool     is_leaf;         // Parameter (true) or intermediate (false)?

    Edge    *parents;         // Inputs that produced this Variable
    uint32_t num_parents;

    Tensor **saved_tensors;   // Data saved for the backward pass
    uint32_t num_saved;

    uint32_t     reduction;       // For loss ops (REDUCTION_NONE/MEAN/SUM)
    float        metadata_float;  // alpha, delta, scale factor, dim, etc.

    void (*backward_fn)(Variable *self, Arena *arena);
};

Variable structs live on the graph arena. They are freed automatically when graph_arena->pop_to(pos) is called at the end of each batch — no per-object bookkeeping, no reference counting, no garbage collection.

`Edge` — a link in the graph

struct Edge {
    Variable *node;
};

Each non-leaf Variable holds a parents array of Edge structs pointing to the Variable nodes that were its inputs. These links form the directed acyclic computation graph that backward() traverses.

Lifecycle of a Forward Pass

Inputs and parameters wrapped as leaf Variables via create_leaf()
autograd::op() called → tensor_op() computes value, backward_fn recorded
Result is a new Variable on the graph arena with parents wired up
Repeat for each operation in the network
Loss Variable produced at the end of the chain

backward(arena, loss) called
Topological sort of the graph (DFS from loss node)
Each node's backward_fn called in reverse order
Gradients accumulated into leaf Variable::grad tensors
optimizer.step() reads those grad tensors and updates data tensors
graph_arena->pop_to(pos) frees the entire graph in O(1)

Quick Reference

What you want	Where to look
Wrapping tensors as Variables	Variables & the Graph
Running backpropagation	Variables & the Graph
Arithmetic ops (`add`, `matmul`, …)	Arithmetic Operations
Activation ops (`relu`, `gelu`, …)	Activation Operations
Loss ops (`cross_entropy`, `mse`, …)	Loss Operations
How tensor ops plug in	Tensors in the Autograd Engine

Namespaces

All autograd code is in gradientcore::autograd. The Reduction enum (REDUCTION_NONE, REDUCTION_MEAN, REDUCTION_SUM) lives in gradientcore (defined in tensor/tensor.hpp) and is used directly by loss functions.

#include "gradient.hpp"       // includes autograd.hpp automatically
using namespace gradientcore;

autograd::Variable *x = autograd::create_leaf(arena, tensor, true);
autograd::backward(arena, loss);

What Lives in Autograd​

The Core Abstractions​

Variable — a tensor that remembers​

Edge — a link in the graph​

Lifecycle of a Forward Pass​

Quick Reference​

Namespaces​