AI Services

A collection of useful AI services for AI sovereignty.

video.mp4

Overview

This repository contains a set of containerized AI services that can be run locally to provide various AI capabilities without relying on external cloud providers. Each service is designed to be easy to deploy and use.

Models

Local LLM inference with multiple backends (vLLM, llama.cpp, SGLang, MLX) and hardware targets (RTX PRO 6000, DGX Spark, AMD Vulkan, Apple Silicon).

Model	Description	Location
Qwen 3.5	Flagship family, 0.8B to 122B variants	models/qwen3.5
Qwen3-Coder-Next	80B MoE coding specialist	models/qwen3-coder-next
Qwopus	Opus-reasoning distilled 27B	models/qwopus
GLM-4.7-Flash	30B MoE, ~3.6B active params	models/glm-4.7-flash

Shared test and benchmark scripts live in models/shared.

Speech Services

Service	Description	Location	Port
Whisper	Speech-to-text using OpenAI Whisper	speech/whisper	8000
Faster Whisper	Optimized Whisper variant	speech/faster-whisper	—
Orpheus TTS	High-quality voice synthesis	speech/orpheus	5005

Other Services

Ollama

A server that runs large language models (LLMs) locally with GPU acceleration support.

Features: Supports various open-source models, API access
Location: ollama
Port: 11434

Demo App (Voice Chat Assistant)

A real-time voice assistant integrating WebRTC, Whisper, Gemma 3, and Orpheus for end-to-end voice chat.

Location: demoapp
Port: 7860

Getting Started

Each service has its own README.md with specific setup instructions and usage examples. Generally, you can start each service using:

cd service_directory
docker compose up -d

Kudos and Credits

This project would not have been possible without the great works of many people who steadily contribute to the open source community!

System Requirements

Docker and Docker Compose
NVIDIA GPU with CUDA support (recommended for optimal performance)
Sufficient disk space for model storage

License

See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 119 Commits
.claude/skills/benchmark		.claude/skills/benchmark
demoapp		demoapp
gpu-dashboard		gpu-dashboard
librechat		librechat
models		models
ollama		ollama
speech		speech
.gitignore		.gitignore
20260503-PROBLEM.md		20260503-PROBLEM.md
LICENSE		LICENSE
README.md		README.md
background.png		background.png
pyproject.toml		pyproject.toml
uv.lock		uv.lock
video.mp4		video.mp4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Services

Overview

Models

Speech Services

Other Services

Ollama

Demo App (Voice Chat Assistant)

Getting Started

Kudos and Credits

System Requirements

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Services

Overview

Models

Speech Services

Other Services

Ollama

Demo App (Voice Chat Assistant)

Getting Started

Kudos and Credits

System Requirements

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages