LaunchMintAI — Brutal Startup Intelligence Engine

Stop building shit nobody wants.

LaunchMintAI is a production-grade research engine combining dual-layer search grounding, parallel agentic analysis, a calibrated two-step LLM pipeline, and an applied ML intelligence layer to validate startup ideas before a single line of product code is written.

What It Does

Most startup validators give you vibes. LaunchMintAI gives you data.

Pulls real TAM/CAGR numbers from McKinsey, Gartner, Statista via Serper + Tavily search grounding
Runs your idea through 20+ specialized analysis modules in parallel
Runs an XGBoost survival classifier trained on 2,000 synthetic startups to predict 5-year survival probability
Runs 10,000 Monte Carlo simulations to generate Bear/Base/Bull financial scenarios
Scores competitor customer pain using VADER NLP on a curated 14-competitor knowledge base
Roasts your idea with a two-step calibrated LLM pipeline — neutral classifier locks the score, creative writer delivers the verdict, Python overwrites unconditionally
Generates investor-ready pitch copy grounded in live market data from web search

4 Core Tabs

Tab	What It Does
Validator	TAM/SAM/SOM extraction, CAGR grounding, adversarial audit, DS Intelligence Layer + full forensic competitor analysis (kill strategies, SWOT, funding intel)
VC Roast	Ruthless fatal flaw analysis with calibrated survival scoring across 6 idea tiers — two-step LLM pipeline prevents score collapse, validated 21/21 across diverse idea types
Pitch Forge	High-conversion taglines, elevator pitches, cold email hooks, value propositions — seeded with real market numbers from the Validator cache or live web search
Battle Room	Compare Arena — pit two validated ideas head-to-head across 5 dimensions, AI declares a winner

DS Intelligence Layer

The applied ML layer that separates LaunchMintAI from a GPT wrapper.

User Idea
    │
    ▼
┌───────────────────────────────────────────┐
│              DS Pipeline                  │
│           (parallel threads)              │
├─────────────┬─────────────┬───────────────┤
│ XGBoost     │ Monte Carlo │  VADER NLP    │
│ Classifier  │ Simulation  │  Sentiment    │
│             │             │               │
│ survival %  │ Bear/Base/  │ pain_score    │
│ risk_tier   │ Bull runway │ kill_strategy │
│ conf_band   │ breakeven   │ top_complaints│
└─────────────┴─────────────┴───────────────┘
    │
    ▼
/ds_insights endpoint (FastAPI)
    │
    ▼
DSInsights UI (3 real-time cards)

Model Performance

Metric	Value
Algorithm	XGBoost Binary Classifier
Training Data	2,000 synthetic startups · 10 features
AUC-ROC	0.8170
F1 Score	0.7183
Accuracy	73%
Monte Carlo Runs	10,000 per idea
VADER Competitor KB	14 curated competitors

VC Roast — Two-Step Calibrated LLM Pipeline

The hardest engineering problem in this project: LLM calibration.

A single-prompt model collapsed all scores to 12–15% regardless of idea quality. The root cause: creative personas override numeric rules — LLMs are reasoners, not rule-followers. Adding more rules to the prompt didn't fix it.

The fix: a two-step pipeline with three enforcement layers.

User Idea
    │
    ├──► [Parallel]
    │       ├── Serper Web Search (live competitor data)
    │       └── Flash-Lite Classifier (neutral, no persona)
    │               └── Tier 1–6 · survival % · verdict locked
    │
    ▼
Flash Roaster (creative writer)
    └── Receives pre-locked numbers via prompt injection
    └── Writes fatal flaw analysis, kill shot, investment verdict
    │
    ▼
Python Safety Net
    └── data["survival_chance"] = survival_chance  ← unconditional overwrite

Three enforcement layers:

Classifier prompt — neutral tone, no persona, structured JSON output with Tier 1–6 classification
Roaster prompt — receives {tier}, {survival_chance}, {verdict} pre-injected; cannot override them
Python code — unconditionally overwrites the score after the LLM response, regardless of what the model wrote

Result: 21/21 test ideas score in the correct calibrated range across all 6 tiers. Ideas are deliberately different from prompt examples — proving generalisation, not memorisation.

Tier	Example	Survival Range
T1 — Consumer clone	Dating app for gamers	5–15%
T2 — Thin B2B feature	Social media scheduler	12–25%
T3 — Vertical SaaS	PT clinic management	21–40%
T4 — Enterprise AI	Mortgage doc automation	41–60%
T5 — Category challenger	AI vs QuickBooks	55–72%
T6 — Platform play	Full OS replacement	65–85%

Pitch Forge — Market-Grounded Copy Generation

Pitch Forge generates five investor-ready outputs from a single idea:

Field	What It Is
`tagline`	≤10-word hook (≤10 words enforced in test suite)
`elevator_pitch`	2–3 sentence pitch for founders
`value_proposition`	Customer-facing benefit statement
`tweet_thread_hook`	≤280-char viral opener (character counter in UI)
`cold_email_subject`	High open-rate subject line

Market grounding fallback chain:

Pulls market_size, growth_rate, top_competitor from Validator cache (if idea was already validated)
Falls back to live Serper web search for independent market context
Graceful degradation if both unavailable — no silent failures

Test suite: 30 ideas across 5 tiers (T1 consumer → T5 platform replacement), validated for static fallback detection, jargon-free copy, tweet length, and field completeness.

Eval Layer

A proof layer — not just a demo.

backend/app/ds/eval/
├── dataset.jsonl       50 labeled ideas · 11 domains · ground-truth sourced
├── golden.test.py      Correctness  →  50/50  100%
├── benchmark.py        Performance  →  386ms avg · P95 596ms
├── generate_charts.py  4 evaluation charts (PNG)
├── EVAL_REPORT.md      Full report with error analysis
├── results/            JSON + TXT outputs
└── charts/             Accuracy · Survival · Rule breakdown · Grid

Domains: SaaS · AI/ML · FinTech · HealthTech · EdTech · E-Commerce · Consumer · MarketPlace · DeepTech · GreenTech · Web3

Technical Stack

Layer	Technology
Frontend	React 19 · TypeScript · Vite 6 · Tailwind CSS · Framer Motion
Backend	FastAPI 0.128 · Python 3.10+ · Pydantic
LLM (Primary)	Google Gemini 2.5 Flash — creative generation, full reports
LLM (Classifier)	Google Gemini 2.5 Flash-Lite — neutral tier classification (cheaper, faster)
Search	Serper (Google grounding) · Tavily AI (McKinsey → BCG → Gartner waterfall)
ML	XGBoost 2.0 · scikit-learn · VADER NLP
Simulation	NumPy Monte Carlo (10K runs)
Vector DB	ChromaDB (long-term intelligence persistence)
Key Management	6-key rotation pool per provider with automatic failover

Getting Started

Prerequisites

Python 3.10+
Node.js 18+
Gemini API key — aistudio.google.com (free tier works)
Serper API key — serper.dev (free: 2,500 searches/month)
Tavily API key — tavily.com (free: 1,000 searches/month, optional)

Backend

cd backend
python -m venv venv
venv\Scripts\activate          # Windows
# source venv/bin/activate     # Linux/Mac
pip install -r requirements.txt
cp .env.example .env           # then fill in your API keys
python -m app.main

Frontend

cd frontend
npm install
cp .env.example .env           # set VITE_API_BASE_URL
npm run dev

Run the Test Suites

# DS correctness test (50/50)
cd backend/app/ds/eval && python golden.test.py

# DS performance benchmark
python benchmark.py

# Full model evaluation (AUC, F1, confusion matrix)
cd backend/app/ds && python evaluate.py

# DS stress test (50 cases, 5 tiers)
python test_ds_stress.py

# VC Roast calibration test (21 ideas, all tiers)
python test_vc_roast.py

# Pitch Forge output quality test (30 ideas, 5 tiers)
python test_pitch_forge.py

CI/CD

GitHub Actions runs on every push to master:

DS Golden Test — validates all 50 eval cases pass
DS Stress Test — runs 50-case stress suite (only if golden passes)
Frontend Build — verifies Vite build succeeds

See .github/workflows/ds-eval.yml

Project Structure

See PROJECT_STRUCTURE.md for the full annotated file tree.

License

MIT — see LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
.github/workflows		.github/workflows
LaunchMintAI-master		LaunchMintAI-master
backend		backend
frontend		frontend
.gitignore		.gitignore
ANTIGRAVITY_INSTRUCTIONS.md		ANTIGRAVITY_INSTRUCTIONS.md
Antigravity_Last_Conversation.md		Antigravity_Last_Conversation.md
Do_You_Get_All_10.html		Do_You_Get_All_10.html
LICENSE		LICENSE
LaunchMintAI.bat		LaunchMintAI.bat
LaunchMintAI_Validation_Report.html		LaunchMintAI_Validation_Report.html
Launch_LaunchMintAI.bat		Launch_LaunchMintAI.bat
Launchmint_AI.md		Launchmint_AI.md
PROCESS.html		PROCESS.html
PROJECT_STRUCTURE.md		PROJECT_STRUCTURE.md
README.md		README.md
Report_Structure.html		Report_Structure.html
RestartLaunchMintAI.bat		RestartLaunchMintAI.bat
Validation_Report_Structure.html		Validation_Report_Structure.html
implementation_plan_validator_tab.md		implementation_plan_validator_tab.md
list_models.py		list_models.py
stress_results.txt		stress_results.txt
stress_test.py		stress_test.py
test_pitch_forge.py		test_pitch_forge.py
test_search_diagnostics.py		test_search_diagnostics.py
test_suite.py		test_suite.py
test_vc_roast.py		test_vc_roast.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LaunchMintAI — Brutal Startup Intelligence Engine

What It Does

4 Core Tabs

DS Intelligence Layer

Model Performance

VC Roast — Two-Step Calibrated LLM Pipeline

Pitch Forge — Market-Grounded Copy Generation

Eval Layer

Technical Stack

Getting Started

Prerequisites

Backend

Frontend

Run the Test Suites

CI/CD

Project Structure

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LaunchMintAI — Brutal Startup Intelligence Engine

What It Does

4 Core Tabs

DS Intelligence Layer

Model Performance

VC Roast — Two-Step Calibrated LLM Pipeline

Pitch Forge — Market-Grounded Copy Generation

Eval Layer

Technical Stack

Getting Started

Prerequisites

Backend

Frontend

Run the Test Suites

CI/CD

Project Structure

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages