Skip to content

w512/AI-Vault

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

AI Vault

Awesome

My curated vault of resources for learning, building, shipping, and using AI in software development. The focus is practical AI engineering: LLM applications, agentic coding, RAG, local models, evaluation, observability, ML foundations, and durable learning material.

This list is intentionally selective. It favors resources that help developers understand AI systems or build with them, and leaves out entries that are mainly unrelated productivity, marketing, sales, entertainment, or generic software material.

AI Vault meme illustration

Contents

Start Here

AI Coding Assistants

  • GitHub Copilot - AI pair programmer integrated into popular IDEs and GitHub workflows.
  • Cursor - AI-native code editor for codebase chat, multi-file edits, and refactoring.
  • Windsurf - AI-first development environment with code editing and agentic assistance.
  • Google Antigravity - Agent-first IDE from Google where autonomous agents plan, execute, and verify software tasks with artifacts and browser verification.
  • Continue - Open-source IDE assistant that can connect different models and custom context sources.
  • Amazon Q Developer - AWS-focused coding assistant for IDEs, terminals, and cloud development tasks.
  • Tabnine - Code completion assistant with support for team and enterprise workflows.
  • Replit Ghostwriter - Replit's coding assistant for generation, explanation, and iteration inside the browser IDE.
  • Jupyter AI - AI assistant for JupyterLab and notebooks with support for many local and hosted models.
  • Qodo - AI coding and review workflows focused on tests, pull requests, and code quality.
  • CodeRabbit - AI pull request review assistant for summarizing changes and surfacing issues.
  • PR-Agent - Open-source automation for PR review, descriptions, suggestions, and feedback.
  • Mintlify - AI-assisted documentation generation for codebases and developer products.
  • Stenography - Tool for generating code explanations and documentation.
  • AI2sql - Natural-language interface for generating SQL queries.
  • Vanna - Open-source RAG approach for text-to-SQL and database question answering.
  • Wren AI - Open-source generative BI and text-to-SQL agent with a semantic layer.
  • Gitingest - Converts repositories into LLM-friendly text digests.
  • Repomix - Packs source code into structured context files for AI tools.
  • screenshot-to-code - Converts UI screenshots into frontend code.
  • v0 - Prompt-based UI generation for React and Next.js projects.
  • Lovable - Conversational app generation for quickly turning product ideas into deployable code.

Software Engineering Agents

  • Codex CLI - Local terminal coding agent for repository-aware development tasks.
  • Claude Code - Terminal-based coding agent for navigating and editing larger codebases.
  • Gemini CLI - Open-source command-line agent built around Google's Gemini models.
  • Aider - Git-aware pair programmer that edits files and keeps changes easy to review.
  • OpenCode - Open-source terminal coding agent with provider flexibility.
  • OpenHands - Autonomous software engineering agent with shell, browser, and editor workflows.
  • Cline - VS Code agent extension with tool use, file editing, and multi-provider support.
  • RooCode - Autonomous VS Code coding agent for planning and implementation tasks.
  • Pi - Customizable terminal coding agent with an extension-friendly workflow.
  • Plandex - Terminal-based AI coding workflow for larger implementation plans.
  • GPT Engineer - Generates software projects from a high-level specification and clarifying questions.
  • GPT Pilot - App-building agent designed to keep a developer in the loop.
  • Devin - Commercial autonomous software engineering agent.
  • Open Interpreter - Local terminal interface for executing code and automating computer tasks with LLMs.
  • TurboPilot - Self-hosted Copilot-like experiment built around local inference.

Agent Frameworks

  • LangGraph - Stateful graph framework for multi-step and multi-agent LLM workflows.
  • AutoGen - Microsoft's framework for building multi-agent conversations and collaboration loops.
  • CrewAI - Agent orchestration framework with roles, tasks, and human review patterns.
  • Pydantic AI - Typed Python framework for structured LLM applications and reliable outputs.
  • Google ADK - Google's Agent Development Kit for local development, tools, and agent protocols.
  • PocketFlow - Minimal agent framework useful for understanding the core mechanics without much abstraction.
  • MetaGPT - Multi-agent framework that turns a requirement into product, design, and engineering tasks.
  • Mastra - TypeScript framework for agents, workflows, memory, and tool integrations.
  • PraisonAI - Multi-agent framework with workflows, memory, and tool support.
  • Hermes Agent - Personal agent platform with memory, messaging integrations, and sandboxed tool execution.
  • VoltAgent - TypeScript framework for agents with tools, memory, and observability hooks.
  • Portia AI - Agent framework designed around visible plans, progress updates, and human interruption.
  • Agent Skills - Open format and reference SDK for reusable agent capabilities.
  • Model Context Protocol - Open protocol for connecting AI systems to tools, data, and external capabilities.
  • ToolHive - Tool for finding and deploying MCP servers.
  • Steel Browser - Browser automation infrastructure for AI agents, including sessions, screenshots, and proxies.
  • Notte - Framework for browser-using AI agents.

LLM App Frameworks

  • LangChain - Broad framework for LLM apps, chains, agents, retrieval, and integrations.
  • LlamaIndex - Data framework for connecting LLMs to private, structured, and unstructured knowledge.
  • Haystack - Modular framework for search, question answering, agents, and RAG pipelines.
  • Docling - Document conversion and ingestion toolkit for AI pipelines.
  • LLM App - Framework for real-time LLM-enabled data pipelines.
  • LMQL - Query language for constraining and composing LLM calls.
  • SymbolicAI - Neuro-symbolic framework for building LLM-centered applications.
  • Ludwig - Low-code system for training and deploying ML and deep learning models.
  • TensorZero - Framework that combines gateway, observability, evaluation, optimization, and experimentation for LLM apps.
  • Bifrost - Open-source LLM gateway for routing, load balancing, guardrails, and observability.
  • Manifest - LLM router for choosing cost-effective models and enforcing usage limits.
  • Agentset - Platform for building and evaluating RAG and agentic systems.
  • Scale Spellbook - Platform for comparing, building, and deploying generative AI applications.

RAG, Search, and Knowledge

  • Perplexity - AI search interface that combines retrieval and generated answers.
  • Exa - Search API designed for language-model workflows.
  • Phind - Developer-focused AI search and answer engine.
  • You.com - AI search engine with personalized and privacy-oriented features.
  • privateGPT - Local document Q&A for private files.
  • Quivr - Personal knowledge base that lets users chat with stored files and notes.
  • LibreChat - Open-source multi-provider chat UI for assistant-style workflows.
  • Chatbot UI - Open-source ChatGPT-style interface.
  • NotebookLM - Gemini-powered tool for working with documents and notes.
  • Open Notebook - Open-source NotebookLM-like system with more local control.
  • STORM - Research assistant that gathers sources and produces citation-backed reports.
  • Local Deep Research - Research workflow for web, academic, and private-document sources using local or cloud models.
  • Elicit - AI research assistant for literature review and evidence extraction.
  • Consensus - Search engine for answers grounded in scientific papers.
  • SciSpace - Assistant for reading, explaining, and exploring academic literature.

Local LLMs and Inference

  • Ollama - Simple local runner for open-weight language models.
  • LM Studio - Desktop app for discovering, downloading, and running local models.
  • Open WebUI - Self-hosted web interface for local and remote LLMs.
  • Jan - Local-first desktop AI app that can run offline or connect to APIs.
  • Msty - Desktop interface for working with local and hosted AI models.
  • LLM - Simon Willison's CLI and Python library for using local and remote LLMs.
  • gpt4all - Local model ecosystem with desktop app and bindings.
  • llama.cpp - Efficient C/C++ inference for LLaMA-style models and many derivatives.
  • bitnet.cpp - Microsoft inference framework for 1-bit LLMs.
  • whisper.cpp - C/C++ implementation of Whisper for local speech recognition.
  • Harbor - Containerized stack for running local model backends, UIs, and supporting services.
  • RunThisLLM - Hardware-oriented guide for choosing models that can run locally.
  • OpenRouter - Unified API for accessing many hosted models through one interface.
  • Together AI - Hosted training, fine-tuning, and inference for open models.
  • Groq - Fast cloud inference for supported open models using LPU hardware.

Model Providers and Families

  • OpenAI API - APIs for language, multimodal generation, speech, vision, and agentic workflows.
  • ChatGPT - OpenAI's conversational assistant for general reasoning, coding, and multimodal work.
  • Claude - Anthropic's model family for writing, analysis, coding, and long-context tasks.
  • Gemini - Google's multimodal AI family and chat interface.
  • Llama - Meta's open-weight model family for self-hosting, fine-tuning, and research.
  • Mistral - Open-weight and hosted models from Mistral AI.
  • DeepSeek - Open-source model family known for strong reasoning and coding variants.
  • Qwen - Alibaba's multilingual model family with open-source releases.
  • Kimi - Moonshot AI assistant and model family with long-context and agentic use cases.
  • GLM - Open-source language model family from Z.ai.
  • Cohere - Enterprise NLP and LLM platform with retrieval-oriented APIs.
  • MiniMax - Multimodal foundation models spanning text, speech, video, and music.

Evals, Observability, and LLMOps

  • OpenAI Evals - Framework for writing and running model evaluation suites.
  • Langfuse - Open-source tracing, prompt management, metrics, and evaluation platform.
  • Phoenix - Open-source observability for ML and LLM applications.
  • OpenLIT - OpenTelemetry-native observability for generative AI apps.
  • Helicone - Logging, monitoring, caching, and debugging layer for LLM applications.
  • Opik - Open-source tracing, evaluation, and monitoring platform for LLM systems.
  • MLflow - Experiment tracking, model deployment, and evaluation platform with LLM support.
  • Agenta - Open-source platform for prompt management, evaluation, and production monitoring.
  • Portkey - LLMOps gateway for monitoring, routing, caching, and governance.
  • Maxim AI - Evaluation and observability platform for shipping AI products with quality checks.
  • Cleanlab TLM - API for detecting unreliable or hallucinated LLM outputs.
  • Prediction Guard - Controlled LLM access with privacy, safety, and compliance features.
  • rehydra - Local PII anonymization and rehydration SDK for LLM prompts.
  • Agentic Radar - Security scanner for agentic workflows.
  • OpenAI Downtime Monitor - Public status and latency tracker for major LLM APIs.
  • Artificial Analysis - Independent model comparisons across quality, price, speed, and hosting.
  • LMArena - Human-preference leaderboard for model comparison.
  • OpenRouter Rankings - Usage-based model rankings from OpenRouter traffic.
  • SEAL LLM Leaderboard - Expert-driven model benchmark leaderboard.
  • LLM Stats - Model comparison site covering context windows, price, speed, and benchmarks.
  • SWE-bench - Benchmark for software engineering tasks based on real GitHub issues.
  • Terminal-Bench - Benchmark for terminal-based agent performance.

Prompting and Playgrounds

Machine Learning and Deep Learning

  • PyTorch - Popular deep learning framework with dynamic computation graphs and strong research adoption.
  • TensorFlow - End-to-end ML framework for training, deployment, and production pipelines.
  • Keras - High-level neural network API for fast experimentation.
  • MXNet - Deep learning framework with distributed and multi-language support.
  • PaddlePaddle - Baidu's deep learning platform for research and production.
  • DeepLearning4J - JVM-based deep learning framework.
  • mlpack - C++ machine learning library focused on speed and scalability.
  • cuDNN - NVIDIA GPU-accelerated primitives for deep neural networks.
  • Gymnasium - Toolkit for reinforcement learning environments and algorithm comparison.
  • TensorBoard - Visualization toolkit for model training and experiments.
  • Netron - Viewer for neural network, ONNX, and ML model files.
  • Jupyter Notebook - Interactive notebook environment widely used for ML experiments and analysis.
  • Scikit-Learn - Core Python toolkit for classical machine learning.
  • Albumentations - Fast image augmentation library for computer vision pipelines.
  • Activeloop - Dataset management and streaming platform for computer vision and AI workloads.
  • Unsloth - Library for faster and more memory-efficient LLM fine-tuning.
  • Kiln - App for synthetic data, fine-tuning, and model-building workflows.

Multimodal AI

Image

  • Stable Diffusion - Open text-to-image diffusion model ecosystem.
  • Flux - High-quality text-to-image models from Black Forest Labs.
  • Midjourney - Widely used image generation service for stylized and photorealistic outputs.
  • Ideogram - Image generation platform with strong text rendering.
  • Adobe Firefly - Creative Cloud integrated image generation and editing tools.
  • ComfyUI - Node-based interface for Stable Diffusion and image-generation workflows.
  • Civitai - Community hub for sharing diffusion models, LoRAs, and workflows.
  • Lexica - Search engine for Stable Diffusion images and prompts.
  • PromptHero - Prompt search and inspiration across major image models.
  • Hugging Face Diffusion Models Course - Course material for learning diffusion models in Python.

Video

  • Runway - AI video generation and editing platform for creative and production workflows.
  • Pika - Text-to-video and image-to-video generation platform.
  • Luma Dream Machine - Video generation model for realistic motion from text or images.
  • Kling AI - Image and video generation tools.
  • Google Flow - Google AI filmmaking workspace powered by Veo.
  • HyperFrames - Framework for programmatically rendering video with HTML, CSS, JavaScript, and agents.

Audio

  • Whisper - Speech recognition model released by OpenAI.
  • ElevenLabs - High-quality text-to-speech and voice generation platform.
  • Bark - Open-source transformer-based text-to-audio model.
  • TorToiSe - Open-source text-to-speech model with an emphasis on voice quality.
  • AudioCraft - Meta's generative audio toolkit for music and sound generation.
  • Suno - Text-to-music generation platform.
  • Udio - Music generation platform for creating and sharing songs.

Datasets and Benchmarks

  • MNIST - Classic handwritten digit dataset for introductory vision models.
  • CIFAR-10 and CIFAR-100 - Small image classification datasets for model experiments.
  • ImageNet - Large-scale visual recognition dataset that shaped modern computer vision.
  • Microsoft COCO - Detection, segmentation, captioning, and keypoint dataset for vision systems.
  • Visual Question Answering - Benchmark for answering natural-language questions about images.
  • UC Irvine Machine Learning Repository - Broad collection of datasets for classical ML tasks.
  • YouTube-8M - Large-scale labeled video dataset.
  • Open Images - Large annotated image dataset for classification, detection, and segmentation.
  • Pascal VOC 2012 - Object detection and segmentation benchmark.
  • Fashion-MNIST - Drop-in MNIST alternative using fashion product images.
  • DeepMind QA Corpus - Reading comprehension dataset built from CNN and Daily Mail articles.
  • DiffusionDB - Dataset and resource collection around Stable Diffusion prompts and generations.
  • LMArena Leaderboard - Crowdsourced preference benchmark for AI models.
  • Artificial Analysis - Benchmark hub for model quality, latency, throughput, and cost.

Courses and Books

Practical AI Engineering

Foundations

Courses

Papers and Research Context

Communities, News, and Related Lists

About

My curated vault of resources for learning, building, shipping, and using AI in software development. The focus is practical AI engineering: LLM applications, agentic coding, RAG, local models, evaluation, observability, ML foundations, and durable learning material.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors