A beginner course for learning machine learning as a translation problem:
plain English <-> algebra <-> Rust
The goal is not to memorize symbols. The goal is to learn how to read formulas as programs, and how to read Rust code as precise mathematical structure.
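As a minimal illustration of that translation (not code from the course crates; the function name `neuron_output` is invented for this README), the same idea can be written three ways:

```rust
// English: "multiply each input by its matching weight, add the products,
//           then add the bias."
// Algebra: y = w · x + b
// Rust:
fn neuron_output(weights: &[f64], inputs: &[f64], bias: f64) -> f64 {
    weights.iter().zip(inputs).map(|(w, x)| w * x).sum::<f64>() + bias
}

fn main() {
    // w = [0.5, -1.0], x = [2.0, 1.0], b = 0.25  ->  1.0 - 1.0 + 0.25
    println!("{}", neuron_output(&[0.5, -1.0], &[2.0, 1.0], 0.25));
}
```

Reading the Rust back into algebra, and the algebra back into English, is the whole exercise.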
For the canonical curriculum layout, see `lessons/COURSE-STRUCTURE.md`.
- Beginners with little or no machine learning background
- Rust learners who want a concrete reason to use vectors, structs, loops, and functions
- Self-paced learners who want short lessons and small practice steps
- Read 01 Foundations.
- Continue with 02 Vectors.
- Continue with 03 Neuron.
- Continue with 04 Learning.
- Use the lessons index to see the full course map and the roadmap modules.
If you specifically want the current advanced preview after the core path, jump to 07 Transformer.
The repo uses sequential folder numbers even though the curriculum starts at Module 0:
- Course Module 0 -> Repo folder `lessons/01-foundations`
- Course Module 1 -> Repo folder `lessons/02-vectors`
- Course Module 2 -> Repo folder `lessons/03-neuron`
- Course Module 3 -> Repo folder `lessons/04-learning`
- Course Module 4 -> Repo folder `lessons/05-mlp`
- Course Module 5 -> Repo folder `lessons/06-attention`
- Course Module 6 -> Repo folder `lessons/07-transformer`
- Rust Essentials for a Tiny Neuron
- A Neuron as a Chain of Functions
- Neuron exercises
- Neuron solutions
- 04 Learning
- One training step, end to end
- Backpropagation as local gradient bookkeeping
- Datasets, epochs, and token targets
- Learning exercises
- Learning solutions
- 07 Transformer
- What Problem the Transformer Solves
- Typed Rust Transformer with Expressive Errors
- Transformer Encoder in Small Chunks
- Transformer exercises
- Transformer solutions
rust-ml/
├── lessons/ # canonical course content
├── references/ # transcripts and papers used as source material
├── code/ # runnable companion crates
├── book/ # future mdBook/site wrapper
└── README.md
- `lessons/` is the source of truth for written teaching content.
- `code/` follows the lesson progression and now includes tested `neuron` and `transformer` crates.
- `book/` is intentionally thin in this pass so the course content does not drift into two competing copies.
- `lessons/COURSE-STRUCTURE.md` is the canonical structure guide for module and lesson contracts.
The course keeps the same translation goal everywhere:
plain English <-> algebra <-> Rust
The current repo intentionally has two different learning depths:
- a coherent beginner path through Modules 0, 1, 2, and 3
- an advanced Transformer preview in Module 6
| Phase | Modules | Status | Learner checkpoint |
|---|---|---|---|
| Orientation | 0 Foundations, 1 Vectors | Active | Read ML notation and vector code without losing the plain-English meaning. |
| First trainable system | 2 Neuron, 3 Learning | Active | Explain one model, one backward pass, one optimizer step, and one token-target bridge. |
| Bridge to architecture | 4 MLP, 5 Attention | Planned | Connect training to layers, hidden activations, token interactions, and attention scores. |
| Architecture preview | 6 Transformer | Active preview | Read the encoder path and understand why full Transformer training needs more machinery. |
Module 6 applies the translation rule in two complementary ways:
- narrative lessons that explain the architecture and the implementation choices
- a chunked encoder lesson where every concept is written as
English -> Algebra -> Rust
That repetition is intentional. Repetition is how the translation dictionary becomes automatic.
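To show what one pass of that English -> Algebra -> Rust pattern looks like, here is an illustrative sketch of scaled dot-product attention scores (the function name is invented here; the course crates use richer semantic types):

```rust
// English: "how strongly a query attends to each key is the scaled dot
//           product of the two vectors, normalized across keys with softmax."
// Algebra: score_i = softmax(q · k_i / sqrt(d))
fn scaled_scores(query: &[f64], keys: &[Vec<f64>]) -> Vec<f64> {
    let d = query.len() as f64;
    // raw scaled dot products, one per key
    let raw: Vec<f64> = keys
        .iter()
        .map(|k| query.iter().zip(k).map(|(q, ki)| q * ki).sum::<f64>() / d.sqrt())
        .collect();
    // softmax: subtract the max for numerical stability, exponentiate, normalize
    let max = raw.iter().cloned().fold(f64::NEG_INFINITY, f64::max);
    let exps: Vec<f64> = raw.iter().map(|r| (r - max).exp()).collect();
    let total: f64 = exps.iter().sum();
    exps.iter().map(|e| e / total).collect()
}

fn main() {
    let scores = scaled_scores(&[1.0, 0.0], &[vec![1.0, 0.0], vec![0.0, 1.0]]);
    println!("{scores:?}"); // scores are positive and sum to 1
}
```

Each line of the Rust corresponds to one clause of the English sentence, which is the reading habit the chunked lesson is built to train.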
- Read the module README.
- Work through the lesson files in order.
- Do the module exercises without copying from the solutions first.
- Use the solution files to check reasoning, naming, and Rust syntax.
- Move to the next module only after you can explain each formula out loud in English.
Current recommended sequence:
- 01 Foundations
- 02 Vectors
- 03 Neuron
- 04 Learning
- 07 Transformer only if you want the advanced preview before the MLP and Attention bridge modules are authored
The current runnable code artifacts are the neuron and Transformer teaching crates:
```
cargo test --manifest-path code/neuron/Cargo.toml
cargo test --manifest-path code/transformer/Cargo.toml
```

The neuron crate covers:
- a single typed neuron
- squared-error loss and manual gradients
- SGD parameter updates
- tiny boolean datasets and epoch loops
- token-target utilities and cross-entropy bridge code
- a tiny bigram next-token model that turns `token -> embedding -> lm_head` into a real training loop
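As a rough sketch of what "manual gradients plus SGD" means in the simplest case, consider a one-input neuron (this is illustrative only, not the crate's actual API):

```rust
// Model:  y_hat = w * x + b.   Loss:  L = (y_hat - y)^2.
// Manual gradients:  dL/dw = 2 * (y_hat - y) * x,   dL/db = 2 * (y_hat - y).
fn sgd_step(w: &mut f64, b: &mut f64, x: f64, y: f64, lr: f64) -> f64 {
    let err = *w * x + *b - y;
    *w -= lr * 2.0 * err * x; // step against the gradient
    *b -= lr * 2.0 * err;
    err * err // loss before this update
}

fn main() {
    let (mut w, mut b) = (0.0, 0.0);
    let first = sgd_step(&mut w, &mut b, 1.0, 1.0, 0.1);
    let mut last = first;
    for _ in 0..50 {
        last = sgd_step(&mut w, &mut b, 1.0, 1.0, 0.1);
    }
    println!("loss shrank from {first} to {last}");
}
```

The crate's epoch loops repeat exactly this shape over a whole dataset instead of a single point.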
The Transformer crate covers:
- dense vectors and matrices
- semantic model newtypes such as `TokenEmbedding`, `Query`, `Key`, and `Value`
- expressive `thiserror` diagnostics for shape mistakes
- standard self-attention and multi-head attention
- a simplified linear-attention comparison point
- positional encodings, layer norm, feed-forward layers, and an encoder block
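To make one of those pieces concrete, here is a hedged layer-norm sketch (the crate's real version works over its own matrix types and includes learned scale and shift parameters, which this omits):

```rust
// English: "center the vector at zero, then rescale it to unit variance."
// Algebra: y_i = (x_i - mean(x)) / sqrt(var(x) + eps)
fn layer_norm(x: &[f64], eps: f64) -> Vec<f64> {
    let n = x.len() as f64;
    let mean = x.iter().sum::<f64>() / n;
    let var = x.iter().map(|v| (v - mean) * (v - mean)).sum::<f64>() / n;
    x.iter().map(|v| (v - mean) / (var + eps).sqrt()).collect()
}

fn main() {
    // the output has mean ~0 and variance ~1 regardless of the input scale
    println!("{:?}", layer_norm(&[1.0, 2.0, 3.0, 4.0], 1e-5));
}
```

The `eps` term guards against dividing by zero when an input vector is constant.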
The repo now includes two GitHub Actions workflows for quality control:
- `CI` runs deterministic checks for lesson structure, local Markdown links, and authored-section contracts. It also compile-checks Rust snippets embedded in lessons and runs `cargo fmt`, `cargo clippy`, and `cargo test` for both the neuron and Transformer teaching crates.
- `Gemini Writing Review` reviews Markdown content on pull requests for English clarity, technical-teaching quality, structural discipline, and beginner friendliness.
The Gemini review is advisory, not a replacement for human judgment. It is designed to catch weak phrasing, excess cognitive load, mismatches between English and code, and places where the teaching flow violates common technical-writing or technical-instruction best practices.
To enable Gemini review in GitHub Actions, configure:
- repository secret `GEMINI_API_KEY`
- optional repository variable `GEMINI_MODEL` if you want a model other than the default `gemini-2.0-flash`
The workflow writes a review artifact named `gemini-writing-review` so the writing assessment can be read directly from the workflow run.
The repo keeps supporting source material in references/, including:
- a Transformer explainer transcript
- Bahdanau et al. (2014)
- Luong et al. (2015)
- Vaswani et al. (2017)
- Sebastian Raschka's LLMs From Scratch repository as an external inspiration source for attention, GPT, and educational sequencing