AI Vault

My curated vault of resources for learning, building, shipping, and using AI in software development. The focus is practical AI engineering: LLM applications, agentic coding, RAG, local models, evaluation, observability, ML foundations, and durable learning material.

This list is intentionally selective. It favors resources that help developers understand AI systems or build with them, and leaves out entries that are mainly unrelated productivity, marketing, sales, entertainment, or generic software material.

Start Here
AI Coding Assistants
Software Engineering Agents
Agent Frameworks
LLM App Frameworks
RAG, Search, and Knowledge
Local LLMs and Inference
Model Providers and Families
Evals, Observability, and LLMOps
Prompting and Playgrounds
Machine Learning and Deep Learning
Multimodal AI
Datasets and Benchmarks
Courses and Books
Papers and Research Context
Communities, News, and Related Lists

Start Here

OpenAI Cookbook - Practical recipes for building with OpenAI APIs, from prompting to retrieval and tool use.
Hugging Face LLM Course - A hands-on path through transformers, tokenizers, fine-tuning, and modern LLM workflows.
Full Stack Deep Learning - Production-oriented course material for training, deploying, and maintaining ML systems.
Fast.ai Practical Deep Learning - Code-first deep learning course aimed at builders rather than pure theorists.
Google Machine Learning Crash Course - A compact introduction to supervised ML concepts and applied modeling.
Learn Prompting - Open educational material for prompting, structured outputs, and AI safety basics.
Prompt Engineering Guide - Community-maintained notes and examples for prompt design.
Transformer Explainer - Interactive browser visualization of how transformer language models process text.
AnimatedLLM - Visual explanations of core LLM mechanics.
R2D3: A Visual Introduction to Machine Learning - Friendly visual explanations for important ML ideas.

AI Coding Assistants

GitHub Copilot - AI pair programmer integrated into popular IDEs and GitHub workflows.
Cursor - AI-native code editor for codebase chat, multi-file edits, and refactoring.
Windsurf - AI-first development environment with code editing and agentic assistance.
Google Antigravity - Agent-first IDE from Google where autonomous agents plan, execute, and verify software tasks with artifacts and browser verification.
Continue - Open-source IDE assistant that can connect different models and custom context sources.
Amazon Q Developer - AWS-focused coding assistant for IDEs, terminals, and cloud development tasks.
Tabnine - Code completion assistant with support for team and enterprise workflows.
Replit Ghostwriter - Replit's coding assistant for generation, explanation, and iteration inside the browser IDE.
Jupyter AI - AI assistant for JupyterLab and notebooks with support for many local and hosted models.
Qodo - AI coding and review workflows focused on tests, pull requests, and code quality.
CodeRabbit - AI pull request review assistant for summarizing changes and surfacing issues.
PR-Agent - Open-source automation for PR review, descriptions, suggestions, and feedback.
Mintlify - AI-assisted documentation generation for codebases and developer products.
Stenography - Tool for generating code explanations and documentation.
AI2sql - Natural-language interface for generating SQL queries.
Vanna - Open-source RAG approach for text-to-SQL and database question answering.
Wren AI - Open-source generative BI and text-to-SQL agent with a semantic layer.
Gitingest - Converts repositories into LLM-friendly text digests.
Repomix - Packs source code into structured context files for AI tools.
screenshot-to-code - Converts UI screenshots into frontend code.
v0 - Prompt-based UI generation for React and Next.js projects.
Lovable - Conversational app generation for quickly turning product ideas into deployable code.

Software Engineering Agents

Codex CLI - Local terminal coding agent for repository-aware development tasks.
Claude Code - Terminal-based coding agent for navigating and editing larger codebases.
Gemini CLI - Open-source command-line agent built around Google's Gemini models.
Aider - Git-aware pair programmer that edits files and keeps changes easy to review.
OpenCode - Open-source terminal coding agent with provider flexibility.
OpenHands - Autonomous software engineering agent with shell, browser, and editor workflows.
Cline - VS Code agent extension with tool use, file editing, and multi-provider support.
RooCode - Autonomous VS Code coding agent for planning and implementation tasks.
Pi - Customizable terminal coding agent with an extension-friendly workflow.
Plandex - Terminal-based AI coding workflow for larger implementation plans.
GPT Engineer - Generates software projects from a high-level specification and clarifying questions.
GPT Pilot - App-building agent designed to keep a developer in the loop.
Devin - Commercial autonomous software engineering agent.
Open Interpreter - Local terminal interface for executing code and automating computer tasks with LLMs.
TurboPilot - Self-hosted Copilot-like experiment built around local inference.

Agent Frameworks

LangGraph - Stateful graph framework for multi-step and multi-agent LLM workflows.
AutoGen - Microsoft's framework for building multi-agent conversations and collaboration loops.
CrewAI - Agent orchestration framework with roles, tasks, and human review patterns.
Pydantic AI - Typed Python framework for structured LLM applications and reliable outputs.
Google ADK - Google's Agent Development Kit for local development, tools, and agent protocols.
PocketFlow - Minimal agent framework useful for understanding the core mechanics without much abstraction.
MetaGPT - Multi-agent framework that turns a requirement into product, design, and engineering tasks.
Mastra - TypeScript framework for agents, workflows, memory, and tool integrations.
PraisonAI - Multi-agent framework with workflows, memory, and tool support.
Hermes Agent - Personal agent platform with memory, messaging integrations, and sandboxed tool execution.
VoltAgent - TypeScript framework for agents with tools, memory, and observability hooks.
Portia AI - Agent framework designed around visible plans, progress updates, and human interruption.
Agent Skills - Open format and reference SDK for reusable agent capabilities.
Model Context Protocol - Open protocol for connecting AI systems to tools, data, and external capabilities.
ToolHive - Tool for finding and deploying MCP servers.
Steel Browser - Browser automation infrastructure for AI agents, including sessions, screenshots, and proxies.
Notte - Framework for browser-using AI agents.

LLM App Frameworks

LangChain - Broad framework for LLM apps, chains, agents, retrieval, and integrations.
LlamaIndex - Data framework for connecting LLMs to private, structured, and unstructured knowledge.
Haystack - Modular framework for search, question answering, agents, and RAG pipelines.
Docling - Document conversion and ingestion toolkit for AI pipelines.
LLM App - Framework for real-time LLM-enabled data pipelines.
LMQL - Query language for constraining and composing LLM calls.
SymbolicAI - Neuro-symbolic framework for building LLM-centered applications.
Ludwig - Low-code system for training and deploying ML and deep learning models.
TensorZero - Framework that combines gateway, observability, evaluation, optimization, and experimentation for LLM apps.
Bifrost - Open-source LLM gateway for routing, load balancing, guardrails, and observability.
Manifest - LLM router for choosing cost-effective models and enforcing usage limits.
Agentset - Platform for building and evaluating RAG and agentic systems.
Scale Spellbook - Platform for comparing, building, and deploying generative AI applications.

RAG, Search, and Knowledge

Perplexity - AI search interface that combines retrieval and generated answers.
Exa - Search API designed for language-model workflows.
Phind - Developer-focused AI search and answer engine.
You.com - AI search engine with personalized and privacy-oriented features.
privateGPT - Local document Q&A for private files.
Quivr - Personal knowledge base that lets users chat with stored files and notes.
LibreChat - Open-source multi-provider chat UI for assistant-style workflows.
Chatbot UI - Open-source ChatGPT-style interface.
NotebookLM - Gemini-powered tool for working with documents and notes.
Open Notebook - Open-source NotebookLM-like system with more local control.
STORM - Research assistant that gathers sources and produces citation-backed reports.
Local Deep Research - Research workflow for web, academic, and private-document sources using local or cloud models.
Elicit - AI research assistant for literature review and evidence extraction.
Consensus - Search engine for answers grounded in scientific papers.
SciSpace - Assistant for reading, explaining, and exploring academic literature.

Local LLMs and Inference

Ollama - Simple local runner for open-weight language models.
LM Studio - Desktop app for discovering, downloading, and running local models.
Open WebUI - Self-hosted web interface for local and remote LLMs.
Jan - Local-first desktop AI app that can run offline or connect to APIs.
Msty - Desktop interface for working with local and hosted AI models.
LLM - Simon Willison's CLI and Python library for using local and remote LLMs.
gpt4all - Local model ecosystem with desktop app and bindings.
llama.cpp - Efficient C/C++ inference for LLaMA-style models and many derivatives.
bitnet.cpp - Microsoft inference framework for 1-bit LLMs.
whisper.cpp - C/C++ implementation of Whisper for local speech recognition.
Harbor - Containerized stack for running local model backends, UIs, and supporting services.
RunThisLLM - Hardware-oriented guide for choosing models that can run locally.
OpenRouter - Unified API for accessing many hosted models through one interface.
Together AI - Hosted training, fine-tuning, and inference for open models.
Groq - Fast cloud inference for supported open models using LPU hardware.

Model Providers and Families

OpenAI API - APIs for language, multimodal generation, speech, vision, and agentic workflows.
ChatGPT - OpenAI's conversational assistant for general reasoning, coding, and multimodal work.
Claude - Anthropic's model family for writing, analysis, coding, and long-context tasks.
Gemini - Google's multimodal AI family and chat interface.
Llama - Meta's open-weight model family for self-hosting, fine-tuning, and research.
Mistral - Open-weight and hosted models from Mistral AI.
DeepSeek - Open-source model family known for strong reasoning and coding variants.
Qwen - Alibaba's multilingual model family with open-source releases.
Kimi - Moonshot AI assistant and model family with long-context and agentic use cases.
GLM - Open-source language model family from Z.ai.
Cohere - Enterprise NLP and LLM platform with retrieval-oriented APIs.
MiniMax - Multimodal foundation models spanning text, speech, video, and music.

Evals, Observability, and LLMOps

OpenAI Evals - Framework for writing and running model evaluation suites.
Langfuse - Open-source tracing, prompt management, metrics, and evaluation platform.
Phoenix - Open-source observability for ML and LLM applications.
OpenLIT - OpenTelemetry-native observability for generative AI apps.
Helicone - Logging, monitoring, caching, and debugging layer for LLM applications.
Opik - Open-source tracing, evaluation, and monitoring platform for LLM systems.
MLflow - Experiment tracking, model deployment, and evaluation platform with LLM support.
Agenta - Open-source platform for prompt management, evaluation, and production monitoring.
Portkey - LLMOps gateway for monitoring, routing, caching, and governance.
Maxim AI - Evaluation and observability platform for shipping AI products with quality checks.
Cleanlab TLM - API for detecting unreliable or hallucinated LLM outputs.
Prediction Guard - Controlled LLM access with privacy, safety, and compliance features.
rehydra - Local PII anonymization and rehydration SDK for LLM prompts.
Agentic Radar - Security scanner for agentic workflows.
OpenAI Downtime Monitor - Public status and latency tracker for major LLM APIs.
Artificial Analysis - Independent model comparisons across quality, price, speed, and hosting.
LMArena - Human-preference leaderboard for model comparison.
OpenRouter Rankings - Usage-based model rankings from OpenRouter traffic.
SEAL LLM Leaderboard - Expert-driven model benchmark leaderboard.
LLM Stats - Model comparison site covering context windows, price, speed, and benchmarks.
SWE-bench - Benchmark for software engineering tasks based on real GitHub issues.
Terminal-Bench - Benchmark for terminal-based agent performance.

Prompting and Playgrounds

OpenAI Playground - Browser workspace for testing prompts, models, and API behavior.
Google AI Studio - Prototyping environment for Gemini models and prompts.
GitHub Models - Model exploration and prototyping directly inside GitHub.
OpenAI Prompt Engineering Guide - Official tactics for improving prompt reliability.
DeepLearning.AI: ChatGPT Prompt Engineering for Developers - Short course on prompts for developer workflows.
Anthropic Courses - Educational notebooks and material for working with Anthropic models.
PromptPerfect - Tooling for prompt iteration and optimization.
GPT for Sheets and Docs - Spreadsheet and document extension for prompt-driven workflows.
ChatGPT for Jupyter - Jupyter extension for notebook-based prompting and assistance.

Machine Learning and Deep Learning

PyTorch - Popular deep learning framework with dynamic computation graphs and strong research adoption.
TensorFlow - End-to-end ML framework for training, deployment, and production pipelines.
Keras - High-level neural network API for fast experimentation.
MXNet - Deep learning framework with distributed and multi-language support.
PaddlePaddle - Baidu's deep learning platform for research and production.
DeepLearning4J - JVM-based deep learning framework.
mlpack - C++ machine learning library focused on speed and scalability.
cuDNN - NVIDIA GPU-accelerated primitives for deep neural networks.
Gymnasium - Toolkit for reinforcement learning environments and algorithm comparison.
TensorBoard - Visualization toolkit for model training and experiments.
Netron - Viewer for neural network, ONNX, and ML model files.
Jupyter Notebook - Interactive notebook environment widely used for ML experiments and analysis.
Scikit-Learn - Core Python toolkit for classical machine learning.
Albumentations - Fast image augmentation library for computer vision pipelines.
Activeloop - Dataset management and streaming platform for computer vision and AI workloads.
Unsloth - Library for faster and more memory-efficient LLM fine-tuning.
Kiln - App for synthetic data, fine-tuning, and model-building workflows.

Multimodal AI

Image

Stable Diffusion - Open text-to-image diffusion model ecosystem.
Flux - High-quality text-to-image models from Black Forest Labs.
Midjourney - Widely used image generation service for stylized and photorealistic outputs.
Ideogram - Image generation platform with strong text rendering.
Adobe Firefly - Creative Cloud integrated image generation and editing tools.
ComfyUI - Node-based interface for Stable Diffusion and image-generation workflows.
Civitai - Community hub for sharing diffusion models, LoRAs, and workflows.
Lexica - Search engine for Stable Diffusion images and prompts.
PromptHero - Prompt search and inspiration across major image models.
Hugging Face Diffusion Models Course - Course material for learning diffusion models in Python.

Video

Runway - AI video generation and editing platform for creative and production workflows.
Pika - Text-to-video and image-to-video generation platform.
Luma Dream Machine - Video generation model for realistic motion from text or images.
Kling AI - Image and video generation tools.
Google Flow - Google AI filmmaking workspace powered by Veo.
HyperFrames - Framework for programmatically rendering video with HTML, CSS, JavaScript, and agents.

Audio

Whisper - Speech recognition model released by OpenAI.
ElevenLabs - High-quality text-to-speech and voice generation platform.
Bark - Open-source transformer-based text-to-audio model.
TorToiSe - Open-source text-to-speech model with an emphasis on voice quality.
AudioCraft - Meta's generative audio toolkit for music and sound generation.
Suno - Text-to-music generation platform.
Udio - Music generation platform for creating and sharing songs.

Datasets and Benchmarks

MNIST - Classic handwritten digit dataset for introductory vision models.
CIFAR-10 and CIFAR-100 - Small image classification datasets for model experiments.
ImageNet - Large-scale visual recognition dataset that shaped modern computer vision.
Microsoft COCO - Detection, segmentation, captioning, and keypoint dataset for vision systems.
Visual Question Answering - Benchmark for answering natural-language questions about images.
UC Irvine Machine Learning Repository - Broad collection of datasets for classical ML tasks.
YouTube-8M - Large-scale labeled video dataset.
Open Images - Large annotated image dataset for classification, detection, and segmentation.
Pascal VOC 2012 - Object detection and segmentation benchmark.
Fashion-MNIST - Drop-in MNIST alternative using fashion product images.
DeepMind QA Corpus - Reading comprehension dataset built from CNN and Daily Mail articles.
DiffusionDB - Dataset and resource collection around Stable Diffusion prompts and generations.
LMArena Leaderboard - Crowdsourced preference benchmark for AI models.
Artificial Analysis - Benchmark hub for model quality, latency, throughput, and cost.

Courses and Books

Practical AI Engineering

AI Engineering - End-to-end guide to designing and shipping AI products.
Designing Machine Learning Systems - Production ML systems, data loops, deployment, and maintenance.
Hands-On Large Language Models - Visual and implementation-oriented guide to LLM applications.
LLM Engineer's Handbook - Production LLM workflows, fine-tuning, quantization, and serving.
Build a Large Language Model from Scratch - Layer-by-layer implementation of transformer language models.
Build an AI Agent from Scratch - Agent foundations covering tools, memory, planning, and multi-agent patterns.
Build a Reasoning Model from Scratch - Ground-up explanation of reasoning model construction.
Generative Deep Learning - Practical coverage of GANs, VAEs, diffusion, and generative modeling.

Foundations

Artificial Intelligence: A Modern Approach - Canonical textbook for the broader AI field.
Deep Learning - Foundational neural network text by Goodfellow, Bengio, and Courville.
Understanding Deep Learning - Modern deep learning book with math, intuition, and notebooks.
Deep Learning: Foundations and Concepts - Probability-grounded deep learning reference.
Speech and Language Processing - Standard reference for NLP and language technology.
Reinforcement Learning: An Introduction - Classic reinforcement learning text by Sutton and Barto.
The Hundred-Page Machine Learning Book - Compact machine learning overview.
Machine Learning Yearning - Andrew Ng's practical guide to structuring ML projects.
Understanding Machine Learning: From Theory to Algorithms - Theory-oriented ML textbook.

Courses

DeepLearning.AI Short Courses - Focused, practical short courses on LLMs, agents, evaluation, and prompting.
Stanford CS324: Large Language Models - University course on LLM capabilities, training, and societal impact.
MIT 6.S191: Introduction to Deep Learning - Fast-paced MIT course on modern deep learning.
Stanford CS231n - Convolutional neural networks and computer vision.
Stanford CS224n - Natural language processing with deep learning.
Berkeley Deep Reinforcement Learning - Course material on deep RL methods and applications.
Google Generative AI Learning Path - Introductory path for generative AI concepts and Google tooling.
Google DeepMind Introduction to Reinforcement Learning - Video course on RL fundamentals.
Karpathy: Neural Networks Zero to Hero - Bottom-up neural network and language-model implementation series.
AI for Everyone - Non-technical overview of AI strategy and capabilities.

Papers and Research Context

Attention Is All You Need - Introduced the transformer architecture behind modern LLMs.
Scaling Laws for Neural Language Models - Shows how language model performance scales with compute, data, and parameters.
Language Models are Few-Shot Learners - GPT-3 paper that popularized few-shot prompting at scale.
Constitutional AI - Alignment approach using model-written principles and critiques.
ImageNet Classification with Deep Convolutional Neural Networks - AlexNet paper that accelerated deep learning adoption in vision.
Batch Normalization - Training technique that stabilizes and accelerates deep neural networks.
Residual Learning - ResNet paper that enabled much deeper vision networks.
Sequence to Sequence Learning with Neural Networks - Early neural sequence transduction work for translation and related tasks.
Neural Turing Machines - Research on neural networks augmented with differentiable memory.
Mastering the Game of Go with Deep Neural Networks and Tree Search - AlphaGo paper combining deep learning and tree search.
Artificial General Intelligence: Concept, State of the Art and Future Prospects - Ben Goertzel's overview of AGI as a research program.
Mapping the Landscape of Human-Level Artificial General Intelligence - Survey-style map of AGI concepts and approaches.
Universal Intelligence: A Definition of Machine Intelligence - Formal discussion of machine intelligence definitions.
The AGI Containment Problem - Research framing around containment and control of advanced AI systems.

Communities, News, and Related Lists

AI Engineer Newsletter - Newsletter focused on AI engineering and practical LLM development.
The Rundown AI - General AI news and product updates.
AlphaSignal - AI research and engineering updates.
Superhuman AI - AI tools and workflow-oriented newsletter.
Lex Fridman AI Podcast - Long-form conversations on AI, science, engineering, and philosophy.
Journal of Artificial General Intelligence - Research journal dedicated to AGI.
MIT 6.S099: Artificial General Intelligence - MIT course and lecture material on AGI.
OpenAI - AI research and product organization.
Google DeepMind - AI research lab working across models, science, and general intelligence.
Machine Intelligence Research Institute - Research organization focused on advanced AI safety.
OpenCog - Open-source project exploring AGI architectures.
Numenta - Research organization studying intelligence and brain-inspired computation.
Awesome RAG Production - Curated resources for production retrieval-augmented generation.
Open LLMs - Curated list of commercially usable open LLMs.
Awesome ChatGPT - Resources, demos, and tools around ChatGPT-style applications.
Awesome ChatGPT Prompts - Prompt examples for ChatGPT workflows.
Awesome Music AI - AI music generation and analysis resources.
Papers for Molecular Design Using Deep Learning - Domain-specific generative AI and deep learning papers for molecular design.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
assets		assets
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Vault

Contents

Start Here

AI Coding Assistants

Software Engineering Agents

Agent Frameworks

LLM App Frameworks

RAG, Search, and Knowledge

Local LLMs and Inference

Model Providers and Families

Evals, Observability, and LLMOps

Prompting and Playgrounds

Machine Learning and Deep Learning

Multimodal AI

Image

Video

Audio

Datasets and Benchmarks

Courses and Books

Practical AI Engineering

Foundations

Courses

Papers and Research Context

Communities, News, and Related Lists

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

AI Vault

Contents

Start Here

AI Coding Assistants

Software Engineering Agents

Agent Frameworks

LLM App Frameworks

RAG, Search, and Knowledge

Local LLMs and Inference

Model Providers and Families

Evals, Observability, and LLMOps

Prompting and Playgrounds

Machine Learning and Deep Learning

Multimodal AI

Image

Video

Audio

Datasets and Benchmarks

Courses and Books

Practical AI Engineering

Foundations

Courses

Papers and Research Context

Communities, News, and Related Lists

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Packages