🌊 LLM-API-Pool

A high-performance, concurrent API pool proxy service for LLMs

LLM-API-Pool is a lightning-fast, highly concurrent API pool proxy service written in Go. It allows you to seamlessly aggregate multiple unreliable or rate-limited text generation API endpoints and present them as a single, bulletproof API to any client application.

✨ Key Features

🏎️ Concurrent Routing & Load Balancing
Lightning-fast routing written in Go. Balance requests across your available endpoints using Round-Robin, Random, or Weighted strategies.
🛡️ Auto Health Checks & Failover
Say goodbye to downtime. The proxy automatically detects unreachable or error-prone endpoints, bypassing them in real time and intelligently retrying requests on healthy providers.
🏊 Model Pools (Aliases)
Group different models from various providers into a single logical "Pool". Configure your client to target a generic name like My_Pool, and let the proxy dynamically route your prompt to any backend model within that pool.
🎛️ Web Admin Interface
An elegant, built-in dashboard to monitor live endpoint statuses, manage API keys, toggle endpoints, and organize your model pools with ease.
📊 Live Request Logging
Real-time observability. See exactly what requests are coming in, which endpoint is serving them, and monitor any errors directly from the UI.
🧠 Intelligent Error Parsing
Intercepts and decodes raw error messages from upstream providers. It feeds detailed, human-readable context back into your request logs to simplify troubleshooting rate limits or authentication outages.
🔄 Live Configuration Reloads
Tweak configurations, update aliases, or change pool settings securely from the web interface on the fly—zero downtime or restarts required.

🚀 Getting Started

1. Prerequisites

Ensure you have Go 1.20+ installed to build the binary.

2. Configuration

Copy the provided config.yaml template and populate it with your specific endpoints and API keys.

3. Build & Run

Get the proxy up and running in seconds:

go mod tidy
go build -o proxy
./proxy

4. Access the Dashboard

Navigate to your web browser and open: http://localhost:5066/admin

🔌 Client Integration

Integrating into API clients (like SillyTavern, UI clients, or your own code) is effortless.

Simply set the custom API endpoint URL in your client to:
http://localhost:5066/v1

For the model name, either use an exact model name available directly on your backends, or use a custom Pool Name defined in your Pool Aliases.

Supercharge your LLM capabilities with limitless availability. Built with ❤️ in Go.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
static		static
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
admin.go		admin.go
config.go		config.go
config.yaml		config.yaml
go.mod		go.mod
go.sum		go.sum
main.go		main.go
proxy.go		proxy.go
state.go		state.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🌊 LLM-API-Pool

✨ Key Features

🚀 Getting Started

1. Prerequisites

2. Configuration

3. Build & Run

4. Access the Dashboard

🔌 Client Integration

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🌊 LLM-API-Pool

✨ Key Features

🚀 Getting Started

1. Prerequisites

2. Configuration

3. Build & Run

4. Access the Dashboard

🔌 Client Integration

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages