Production LLM call layer for AI agents and tools: keep OpenAI/Anthropic/AI SDK/LiteLLM, hot-swap models with MDA presets, and add cache, retries, circuit breakers, key rotation, singleflight, and Python/TypeScript/Rust parity.
A KV store built with Aeron, SBE, and Agrona. Raft-clustered or single-node, fast by default. HTTP, WS, and SSE APIs with JSON payloads. UI, CLI, and embeddable polyglot libraries. K8s-deployable.
Adaptive semantic cache for LLMs with streaming support, ML-based thresholds, and real-time cost tracking. Built in Rust for sub-millisecond performance.
Semantic caching layer for LLM calls. Exact-match and embedding-similarity caching with model-version-aware invalidation, use-case segmentation, and cost-saved tracking. Adapters for Redis and DynamoDB.