Skip to content

PramudithaN/11labsM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

38 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

11LabsM

React FastAPI TypeScript Python Redis Docker ElevenLabs Vite AntDesign

A full-stack Neural Voice Synthesizer & Translator that converts text into multiple languages simultaneously and generates high-quality speech using ElevenLabs.


πŸ“Έ Preview

Home Page Preview


πŸ“– About This Project

11LabsM is a powerful tool designed for content creators and developers who need to localize speech content quickly. By leveraging the ElevenLabs API and advanced translation providers (DeepL, Google, MyMemory), it can take a single input text and produce high-quality audio in dozens of languages at once.

The application features a modern distributed architecture with a React frontend, a FastAPI backend, and Celery workers for asynchronous processing. It employs a content-addressed caching system in Redis to minimize API costs and latency for repeat requests.


✨ Features

  • 🌍 Multilingual Translation - Translate text into 17+ languages simultaneously including Spanish, French, German, and more.
  • πŸŽ™οΈ Neural TTS Synthesis - Generate ultra-realistic speech using ElevenLabs' industry-leading neural voices.
  • ⚑ Asynchronous Processing - Long-running tasks are handled in the background with Celery and Redis.
  • πŸ“Š Real-time Status Tracking - Monitor the progress of each language's translation and synthesis in real-time.
  • πŸ’Ύ Smart Caching - SHA-256 content-addressed caching stores generated audio to reduce costs and latency.
  • 🎧 Audio Player & Downloads - Stream generated audio directly in the browser or download a ZIP of all files.
  • πŸ› οΈ Customizable Formats - Support for multiple audio formats (MP3, PCM) and bitrates.

πŸ› οΈ Tech Stack

Layer Technology
Frontend Framework React 18
Build Tool Vite 5
UI Components Ant Design 6
API Backend FastAPI 0.111
Task Queue Celery 5.3
Relational Database PostgreSQL 16
Cache / Broker Redis 7.0
Object Storage MinIO / AWS S3
Language TypeScript / Python 3.10+
Deployment Vercel / Render

πŸ“‹ Prerequisites

  • Node.js v18 or higher
  • Python 3.10 or higher
  • pnpm (preferred) or npm
  • Docker & Docker Compose (for local development)
  • ElevenLabs API Key
  • DeepL or Google Translate API Key

βš™οΈ Getting Started

1. Clone the repository

git clone <your-repo-url>
cd 11labsM

2. Set up Backend

cd tts_backend
cp .env.example .env
# Edit .env and fill in your API keys

3. Start Infrastructure with Docker

The easiest way to get started is using Docker Compose, which spins up the API, Worker, Redis, PostgreSQL, and MinIO.

# From the root directory
docker-compose -f tts_backend/docker-compose.yml up --build
  • Frontend: http://localhost:5173
  • Backend API: http://localhost:8000
  • API Docs: http://localhost:8000/docs
  • MinIO Console: http://localhost:9001 (minioadmin / minioadmin)

πŸ“¦ Available Scripts

Frontend (frontend/)

Command Description
pnpm dev Starts the Vite development server
pnpm build Builds the application for production
pnpm preview Previews the production build locally

Backend (tts_backend/)

Command Description
uvicorn app.main:app --reload Starts the FastAPI server
celery -A app.workers.celery_app worker --loglevel=info Starts the Celery worker

πŸ“ Project Structure

11labsM/
β”œβ”€β”€ frontend/                  # React + Vite application
β”‚   β”œβ”€β”€ src/
β”‚   β”‚   β”œβ”€β”€ api/               # API client and typed fetch wrappers
β”‚   β”‚   β”œβ”€β”€ components/        # UI components (JobForm, JobStatus)
β”‚   β”‚   └── types/             # TypeScript interfaces
β”‚   └── public/                # Static assets & Images
β”œβ”€β”€ tts_backend/               # FastAPI + Celery backend
β”‚   β”œβ”€β”€ app/
β”‚   β”‚   β”œβ”€β”€ api/               # API routers (jobs, voices, models)
β”‚   β”‚   β”œβ”€β”€ models/            # SQLAlchemy & Pydantic models
β”‚   β”‚   β”œβ”€β”€ services/          # TTS and Translation logic
β”‚   β”‚   β”œβ”€β”€ storage/           # S3 / MinIO storage logic
β”‚   β”‚   β”œβ”€β”€ utils/             # Redis caching and DB utils
β”‚   β”‚   └── workers/           # Celery task definitions
β”‚   β”œβ”€β”€ alembic/               # Database migrations
β”‚   └── Dockerfile             # Backend container definition
β”œβ”€β”€ render.yaml                # Render deployment configuration
β”œβ”€β”€ docker-compose.yml         # Local development orchestration (links to backend)
└── PROJECT_ANALYSIS.md        # Technical architecture documentation

πŸ™‹β€β™‚οΈ Connect with Me


Developed with ❀️ by Pramuditha Nadun.

About

An automated localization and voice generation pipeline that translates English text into 17 languages and sequentially synthesizes high-quality audio using the ElevenLabs API.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors