Contribution

🐦 Follow me on Twitter • ➡️ Jump to LLMs! 📧 Feedback

I struggled to grind for ML/AI interviews so I went back to the basics and created a list after careful research.

TorchLeet is broken into two sets of questions:

Question Set: A collection of PyTorch practice problems, ranging from basic to hard, designed to enhance your skills in deep learning and PyTorch.
LLM Set: A new set of questions focused on understanding and implementing Large Language Models (LLMs) from scratch, including attention mechanisms, embeddings, and more.

Note

Avoid using GPT. Try to solve these problems on your own. The goal is to learn and understand PyTorch concepts deeply.

Note

Yes. I used GPT to help write the code and I ended up testing it out myself as practise. I found the strategy to be super useful

Question Set

🔵Basic

Mostly for beginners to get started with PyTorch.

🟢Easy

Recommended for those who have a basic understanding of PyTorch and want to practice their skills.

🟡Medium

These problems are designed to challenge your understanding of PyTorch and deep learning concepts. They require you to implement things from scratch or apply advanced techniques.

Implement parameter initialization for a CNN (Solution)
Implement a CNN from Scratch (Solution)
Implement an LSTM from Scratch (Solution)
Implement AlexNet from scratch
Build a Dense Retrieval System using PyTorch
Implement KNN from scratch in PyTorch
Train a 3D CNN network for segmenting CT images Solution

🔴Hard

These problems are for advanced users who want to push their PyTorch skills to the limit. They involve complex architectures, custom layers, and advanced techniques.

Write a custom Autograd function for activation (SILU) (Solution)
Write a Neural Style Transfer
Build a Graph Neural Network (GNN) from scratch
Build a Graph Convolutional Network (GCN) from scratch
Write a Transformer (Solution)
Write a GAN (Solution)
Write Sequence-to-Sequence with Attention (Solution)
[Enable distributed training in pytorch (DistributedDataParallel)]
[Work with Sparse Tensors]
Add GradCam/SHAP to explain the model. (Solution)
Linear Probe on CLIP Features
Add Cross Modal Embedding Visualization to CLIP (t-SNE/UMAP)
Implement a Vision Transformer
Implement a Variational Autoencoder

LLM Set

An all new set of questions to help you understand and implement Large Language Models from scratch.

Each question is designed to take you one step closer to building your own LLM.

Implement KL Divergence Loss
Implement RMS Norm
Implement Byte Pair Encoding from Scratch (Solution)
Create a RAG Search of Embeddings from a set of Reviews
Implement Predictive Prefill with Speculative Decoding
Implement Attention from Scratch (Solution)
Implement Multi-Head Attention from Scratch (Solution)
Implement Grouped Query Attention from Scratch (Solution)
Implement KV Cache in Multi-Head Attention from Scratch
Implement Sinusoidal Embeddings (Solution)
Implement ROPE Embeddings (Solution)
Implement SmolLM from Scratch (Solution)
Implement Quantization of Models
1. GPTQ
Implement Beam Search atop LLM for decoding
Implement Top K Sampling atop LLM for decoding
Implement Top p Sampling atop LLM for decoding
Implement Temperature Sampling atop LLM for decoding
Implement LoRA on a layer of an LLM
1. QLoRA
Mix two models to create a mixture of Experts
Apply SFT on SmolLM
Apply RLHF on SmolLM
Implement DPO based RLHF
Add continuous batching to your LLM
Chunk Textual Data for Dense Passage Retrieval
Implement Large scale Training => 5D Parallelism

What's cool? 🚀

Diverse Questions: Covers beginner to advanced PyTorch concepts (e.g., tensors, autograd, CNNs, GANs, and more).
Guided Learning: Includes incomplete code blocks (... and #TODO) for hands-on practice along with Answers

Advanced ML Systems Set (v3)

A research-backed set of 30 questions covering what top AI companies actually ask in 2024-2025 interviews. Each question is tagged with the companies known to test that topic.

Note

These questions were compiled from real interview reports across Glassdoor, Blind, Reddit, and first-person accounts from candidates who interviewed at 15+ frontier AI labs. Questions are organized by interview role rather than just difficulty.

Classical ML from Scratch

Still asked at traditional FAANG companies (Google, Meta, Amazon, Uber, LinkedIn).

Implement Softmax from Scratch (numerically stable) (Solution) 🟢 Easy — Apple Meta Google Amazon
Implement K-Means Clustering in PyTorch (Solution) 🟢 Easy — Uber LinkedIn Google Amazon
Implement KNN in PyTorch (Solution) 🟢 Easy — Uber LinkedIn Meta
Implement Logistic Regression with Gradient Descent (Solution) 🟢 Easy — Google Meta Amazon

LLM Decoding

Core questions at frontier AI labs — Anthropic, OpenAI, DeepMind, Cohere, Perplexity.

Implement Contrastive Loss (InfoNCE) + CLIP Training Loop (Solution) 🟡 Medium — OpenAI Anthropic DeepMind Midjourney Apple
Implement 2D Positional Embeddings (Solution) 🟡 Medium — Anthropic DeepMind Midjourney Runway
Implement Top-p (Nucleus) Sampling (Solution) 🟡 Medium — Anthropic OpenAI DeepMind Perplexity Cohere
Implement Top-k Sampling (Solution) 🟡 Medium — Anthropic OpenAI DeepMind Cohere
Implement Beam Search for LLM Decoding (Solution) 🟡 Medium — Google DeepMind Meta Apple
Implement Temperature Sampling (Solution) 🟢 Easy — OpenAI Anthropic Cohere Perplexity

LLM Inference & Systems

Hot topic for LLM infrastructure roles at Perplexity, Together AI, Anyscale, Meta.

Implement LoRA on a Linear Layer (Solution) 🟡 Medium — Meta Google Anthropic OpenAI Databricks
Implement KV Cache for Autoregressive Generation (Solution) 🟡 Medium — Anthropic OpenAI Meta Perplexity Together AI
Implement Sliding Window Attention (Solution) 🟡 Medium — Mistral Anthropic Google DeepMind
Implement DPO Loss from Scratch (Solution) 🔴 Hard — Anthropic OpenAI DeepMind Meta
Implement PPO for RLHF (Solution) 🔴 Hard — Anthropic OpenAI DeepMind Meta
Implement Gradient Checkpointing (Solution) 🔴 Hard — Meta Google NVIDIA Tesla
Implement Mixture of Experts Layer (Solution) 🔴 Hard — Google DeepMind Mistral Databricks xAI
Implement Speculative Decoding (Solution) 🔴 Hard — Google DeepMind Anthropic Apple
Implement Continuous Batching for LLM Inference (Solution) 🔴 Hard — Perplexity Together AI Anyscale Meta

Modern Architectures

Cutting-edge topics at image-gen companies, research labs, and autonomous driving.

Implement DDPM (Denoising Diffusion) from Scratch (Solution) 🔴 Hard — Midjourney Runway Stability AI Adobe Google
Implement DDIM Sampling + Classifier-Free Guidance (Solution) 🔴 Hard — Midjourney Runway Stability AI Adobe
Implement Selective State Space Model (Mamba Block) (Solution) 🔴 Hard — DeepMind Google Anthropic
Implement Vision Transformer + MAE Pretraining (Solution) 🔴 Hard — Meta Google Apple Tesla Waymo
Implement Knowledge Distillation (Solution) 🟡 Medium — Google Apple Meta Qualcomm Tesla

GPU Systems & Kernels

For ML infrastructure and systems roles at NVIDIA, Meta, xAI, and frontier labs.

Write a Fused Softmax Kernel in Triton (Solution) 🟣 Expert — NVIDIA Meta Google xAI Tesla
Implement FlashAttention-2 in Triton (Solution) 🟣 Expert — NVIDIA Meta Together AI xAI
Implement FSDP (Fully Sharded Data Parallel) from Scratch (Solution) 🟣 Expert — Meta Google NVIDIA Anthropic xAI
Implement GRPO (DeepSeek-R1 Algorithm) (Solution) 🟣 Expert — DeepMind Anthropic OpenAI
Build a Complete LLM Inference Engine (Solution) 🟣 Expert — Perplexity Together AI Anyscale Fireworks AI
Implement Ring Attention for Long Contexts (Solution) 🟣 Expert — Anthropic Google Meta xAI

Company Quick-Reference

"If I'm interviewing at X, which v3 questions should I prioritize?"

Company	Priority Questions
Anthropic	5, 6, 7, 8, 10, 12, 13, 14, 15, 18, 22, 26, 27, 30
OpenAI	5, 7, 8, 10, 11, 12, 14, 15, 27
DeepMind	5, 6, 7, 8, 9, 13, 14, 15, 17, 18, 22, 27
Meta	1, 2, 3, 4, 9, 11, 12, 14, 15, 16, 19, 23, 24, 25, 26, 30
Google	1, 2, 4, 9, 11, 13, 16, 17, 18, 20, 22, 23, 24, 26, 29, 30
Apple	1, 5, 9, 18, 23, 29
NVIDIA	16, 24, 25, 26
Midjourney / Runway / Stability AI	5, 6, 20, 21
Perplexity / Together AI / Anyscale	7, 10, 12, 19, 25, 28
Tesla / Waymo	16, 23, 24, 29
xAI	17, 24, 25, 26, 30
Mistral / Cohere	7, 8, 10, 13, 17

Getting Started

1. Install Dependencies

Install pytorch: Install pytorch locally
Some problems need other packages. Install as needed.

2. Structure

<E/M/H><ID>/: Easy/Medium/Hard along with the question ID.
<E/M/H><ID>/qname.ipynb: The question file with incomplete code blocks.
<E/M/H><ID>/qname_SOLN.ipynb: The corresponding solution file.

3. How to Use

Navigate to questions/ and pick a problem
Fill in the missing code blocks (...) and address the #TODO comments.
Test your solution and compare it with the corresponding file in solutions/.

Happy Learning! 🚀

Contribution

Feel free to contribute by adding new questions or improving existing ones. Ensure that new problems are well-documented and follow the project structure. Submit a PR and tag the authors.

Authors

Chandrahas Aroori
💻 AI/ML Dev

Caslow Chien
💻 Developer

Name		Name	Last commit message	Last commit date
Latest commit History 120 Commits
llm		llm
torch		torch
v3		v3
website		website
.gitignore		.gitignore
CheatSheet.md		CheatSheet.md
README.md		README.md
Tricks.md		Tricks.md
torch.png		torch.png
torchleet-llm.png		torchleet-llm.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Table of Contents