Skip to content

AI-Guru/ai_services

Repository files navigation

AI Services

Background

A collection of useful AI services for AI sovereignty.

video.mp4

Overview

This repository contains a set of containerized AI services that can be run locally to provide various AI capabilities without relying on external cloud providers. Each service is designed to be easy to deploy and use.

Models

Local LLM inference with multiple backends (vLLM, llama.cpp, SGLang, MLX) and hardware targets (RTX PRO 6000, DGX Spark, AMD Vulkan, Apple Silicon).

Model Description Location
Qwen 3.5 Flagship family, 0.8B to 122B variants models/qwen3.5
Qwen3-Coder-Next 80B MoE coding specialist models/qwen3-coder-next
Qwopus Opus-reasoning distilled 27B models/qwopus
GLM-4.7-Flash 30B MoE, ~3.6B active params models/glm-4.7-flash

Shared test and benchmark scripts live in models/shared.

Speech Services

Service Description Location Port
Whisper Speech-to-text using OpenAI Whisper speech/whisper 8000
Faster Whisper Optimized Whisper variant speech/faster-whisper
Orpheus TTS High-quality voice synthesis speech/orpheus 5005

Other Services

Ollama

A server that runs large language models (LLMs) locally with GPU acceleration support.

  • Features: Supports various open-source models, API access
  • Location: ollama
  • Port: 11434

Demo App (Voice Chat Assistant)

A real-time voice assistant integrating WebRTC, Whisper, Gemma 3, and Orpheus for end-to-end voice chat.

Getting Started

Each service has its own README.md with specific setup instructions and usage examples. Generally, you can start each service using:

cd service_directory
docker compose up -d

Kudos and Credits

This project would not have been possible without the great works of many people who steadily contribute to the open source community!

System Requirements

  • Docker and Docker Compose
  • NVIDIA GPU with CUDA support (recommended for optimal performance)
  • Sufficient disk space for model storage

License

See the LICENSE file for details.

About

A collection of useful AI services for AI sovereignty.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors