🎙️ EchoEmotion — Speech Emotion Recognition

Detect human emotions (calm, happy, fearful, disgust) from audio in real-time using a multi-model ML pipeline, a FastAPI backend, and a React + Tailwind frontend.

📖 Table of Contents

Overview
Architecture
Tech Stack
Quick Start (Docker)
Local Development
Dataset Setup
Training the Model
API Reference
Database Schema
Deployment
Testing
Contributing

Overview

EchoEmotion is an end-to-end production-ready Speech Emotion Recognition system built on the RAVDESS dataset. It compares five classifiers (MLP, Random Forest, SVM, XGBoost, LightGBM) and automatically selects the best by weighted F1 score.

Accuracy: ~72 – 78% depending on model selection and dataset size.

Features

🎤 Browser microphone recording
📁 Drag-and-drop audio upload (WAV, MP3, OGG, FLAC, M4A)
📊 Probability chart + radar profile per prediction
🏆 Model comparison table with cross-validation
📈 Full dashboard with emotion distribution analytics
🔐 JWT authentication (register / login)
🐘 PostgreSQL persistence for all predictions
🐳 Docker Compose for one-command startup

Architecture

┌─────────────────────────────────────────────────────┐
│                   React + Vite                      │
│  LandingPage / PredictPage / DashboardPage          │
│  Axios + TanStack Query + Framer Motion             │
└────────────────────┬────────────────────────────────┘
                     │ HTTP (REST)
┌────────────────────▼────────────────────────────────┐
│              FastAPI  (Python 3.11)                 │
│  /api/v1/predict   /train   /dashboard   /auth      │
│  JWT Auth · Rate Limiting · Swagger UI              │
│                                                     │
│  ┌───────────────────┐   ┌─────────────────────┐   │
│  │   ML Pipeline     │   │   PostgreSQL (ORM)  │   │
│  │  FeatureExtractor │   │  User / Prediction  │   │
│  │  ModelTrainer     │   │  EmotionStat        │   │
│  │  EmotionPredictor │   │  ModelRegistry      │   │
│  └───────────────────┘   └─────────────────────┘   │
└─────────────────────────────────────────────────────┘

Tech Stack

Layer	Technology
Frontend	React 18 · Vite · TailwindCSS · Framer Motion
Charts	Recharts
State	TanStack Query · Zustand
Backend	FastAPI · Uvicorn · Pydantic v2
Auth	JWT (python-jose) · bcrypt (passlib)
ML	scikit-learn · librosa · XGBoost · LightGBM
Database	PostgreSQL 16 · SQLAlchemy 2 (async)
DevOps	Docker · Docker Compose · GitHub Actions

Quick Start (Docker)

git clone https://github.com/your-username/echoemotion.git
cd echoemotion

# 1. Add RAVDESS dataset (see Dataset Setup below)
# 2. Copy environment files
cp backend/.env.example backend/.env

# 3. Start everything
docker compose up --build

# API docs:     http://localhost:8000/docs
# Frontend:     http://localhost:3000
# PostgreSQL:   localhost:5432

Local Development

Backend

cd backend
python -m venv venv && source venv/bin/activate  # Windows: venv\Scripts\activate
pip install -r requirements.txt

# Create .env from example
cp .env.example .env
# Edit .env — at minimum set DATABASE_URL to your local Postgres

# Create tables
python scripts/init_db.py

# Run dev server
uvicorn app.main:app --reload --port 8000

Frontend

cd frontend
npm install
# Create .env.local
echo "VITE_API_URL=http://localhost:8000/api/v1" > .env.local
npm run dev   # → http://localhost:5173

Dataset Setup

Download the RAVDESS dataset (speech-only files).
Place the extracted Actor_* folders inside backend/dataset/:

backend/
└── dataset/
    ├── Actor_01/
    │   ├── 03-01-01-01-01-01-01.wav
    │   └── ...
    ├── Actor_02/
    └── ...

Training the Model

Via API (recommended)

curl -X POST http://localhost:8000/api/v1/train \
  -H "Content-Type: application/json" \
  -d '{"observed_emotions": ["calm","happy","fearful","disgust"], "compare_models": true}'

Via Python script

cd backend
python -c "
from app.ml.trainer import ModelTrainer
from app.core.config import EMOTIONS_MAP
t = ModelTrainer('dataset', 'models', EMOTIONS_MAP)
result = t.train(['calm','happy','fearful','disgust'])
print(result['best_model'], result['best_accuracy'])
"

The pipeline will train MLP · Random Forest · SVM · XGBoost · LightGBM, run 5-fold cross-validation on each, and save the best model automatically.

API Reference

Method	Endpoint	Description
GET	`/api/v1/health`	Health check + model status
GET	`/api/v1/emotions`	List supported emotions
POST	`/api/v1/predict`	Upload audio → emotion + confidence
POST	`/api/v1/train`	Train / retrain model
GET	`/api/v1/model-info`	Active model metadata
GET	`/api/v1/metrics`	Full training metrics + comparison
GET	`/api/v1/dashboard`	Prediction statistics
POST	`/api/v1/auth/register`	Create account
POST	`/api/v1/auth/login`	Get JWT token
GET	`/api/v1/auth/me`	Current user info

Full interactive docs: http://localhost:8000/docs

Database Schema

users (id UUID PK, email, username, hashed_password, is_active, created_at)
predictions (id UUID PK, user_id FK, filename, predicted_emotion, confidence,
             all_probabilities JSON, audio_duration_s, created_at)
emotion_stats (id, emotion UNIQUE, total_count, avg_confidence, last_updated)
model_registry (id, version, algorithm, accuracy, metrics JSON, is_active, created_at)

Deployment

Backend → Render

Build command: pip install -r requirements.txt
Start command: uvicorn app.main:app --host 0.0.0.0 --port $PORT
Environment: set all vars from .env.example

Frontend → Vercel

cd frontend
npm run build
# Deploy /dist to Vercel
# Set VITE_API_URL to your backend URL

Database → Neon

Free-tier PostgreSQL — just update DATABASE_URL in your env.

Testing

# Backend
cd backend
pytest tests/ -v

# Frontend
cd frontend
npm test

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.github/workflows		.github/workflows
backend		backend
frontend		frontend
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎙️ EchoEmotion — Speech Emotion Recognition

📖 Table of Contents

Overview

Features

Architecture

Tech Stack

Quick Start (Docker)

Local Development

Backend

Frontend

Dataset Setup

Training the Model

Via API (recommended)

Via Python script

API Reference

Database Schema

Deployment

Backend → Render

Frontend → Vercel

Database → Neon

Testing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎙️ EchoEmotion — Speech Emotion Recognition

📖 Table of Contents

Overview

Features

Architecture

Tech Stack

Quick Start (Docker)

Local Development

Backend

Frontend

Dataset Setup

Training the Model

Via API (recommended)

Via Python script

API Reference

Database Schema

Deployment

Backend → Render

Frontend → Vercel

Database → Neon

Testing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages