Skip to content

Saatvik-Aggarwal/picochat

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

My attempt at rebuilding https://github.com/karpathy/nanochat/tree/master
and combining it with https://github.com/jingyaogong/minimind

PyTorch has a lot of syntax that makes even knowing what underlying math is being done. To combat this, this code contains far more comments than nanochat to battle the jargon. The dimensions of nearly every tensor are commented.

Creation Order

  1. Directory structure, README.md, .gitignore, initial files
  2. Tokenizer implementation in Rust (rustbpe/) + python bindings + tokenizer test/eval scripts (scripts/tok_eval.py)
  3. gpt.py: model implementation (attention, MLP, transformer block, GPT model) and engine.py (kv cache)

About

A transformer trained on my laptop, soon to power MonkeyLLM

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors