⚡ ThunderCrawler

A full-stack, production-grade web crawler built with Next.js 16, TypeScript, Prisma, and Redis. ThunderCrawler lets you submit URLs, track crawl jobs in real time via WebSockets, and persist results to a serverless PostgreSQL database — all from a clean, modern UI.

✨ Features

🕷️ Async web crawling — submit URLs and crawl them in the background via dedicated workers
🔴 Real-time progress updates powered by WebSockets (ws)
🔐 Authentication with NextAuth v5 — credential-based login with bcrypt password hashing
🗄️ Database persistence via Prisma ORM + Neon (serverless PostgreSQL)
⚡ Redis-backed job queue — workers pull crawl jobs from a Redis queue (Docker Compose included)
🛡️ Input validation with Zod schemas
🎨 Modern UI built with Tailwind CSS v4, Radix UI, and shadcn/ui components
🧪 Tests directory included for unit/integration coverage

🛠️ Tech Stack

Category	Technology
Framework	Next.js 16 (App Router)
Language	TypeScript 5
Auth	NextAuth v5 (Prisma Adapter) + bcryptjs
Database	PostgreSQL via Neon (serverless) + Prisma ORM v7
Job Queue	Redis 7 (Docker)
Real-time	WebSockets (`ws`)
UI	Tailwind CSS v4, Radix UI, shadcn/ui, Lucide React
Validation	Zod
Package Manager	Bun / npm

📁 Project Structure

ThunderCrawler/
├── app/                  # Next.js App Router — pages & API routes
├── actions/              # Next.js Server Actions
├── components/           # Reusable React UI components (shadcn/ui)
├── config/               # App-wide configuration
├── generated/prisma/     # Auto-generated Prisma client
├── hooks/                # Custom React hooks
├── lib/                  # Shared utilities (Prisma client, helpers)
├── prisma/               # Prisma schema & migrations
├── public/               # Static assets
├── tests/                # Unit & integration tests
├── types/                # Global TypeScript type definitions
├── workers/              # Background crawl worker processes
├── docker-compose.yml    # Redis service for local development
├── index.ts              # Standalone Prisma entry point / seed script
├── middleware.ts          # Next.js middleware (auth guards, routing)
├── next.config.ts        # Next.js configuration
└── prisma.config.ts      # Prisma configuration

⚙️ Getting Started

Prerequisites

Node.js v18+ or Bun
Docker (for Redis)
A Neon account (or any PostgreSQL database)

1. Clone the Repository

git clone https://github.com/THUNDERBLD/ThunderCrawler.git
cd ThunderCrawler

2. Install Dependencies

npm install
# or
bun install

3. Configure Environment Variables

Create a .env file in the project root:

# Database (Neon / PostgreSQL)
DATABASE_URL="postgresql://user:password@host/dbname?sslmode=require"

# NextAuth
NEXTAUTH_SECRET="your_nextauth_secret"
NEXTAUTH_URL="http://localhost:3000"

# Redis
REDIS_URL="redis://localhost:6379"

4. Start Redis (via Docker)

docker-compose up -d

This starts a Redis 7 instance on port 6379 with persistence enabled.

5. Set Up the Database

# Push the Prisma schema to your database
npx prisma db push

# (Optional) Open Prisma Studio to inspect data
npx prisma studio

6. Run the Development Server

npm run dev
# or
bun dev

Open http://localhost:3000 in your browser.

7. Run Background Workers

In a separate terminal, start the crawl worker:

# Adjust the path to your worker entry point
npx ts-node workers/index.ts
# or
bun workers/index.ts

🏗️ Architecture Overview

Browser ──(HTTP/WS)──► Next.js App
                           │
                    Server Actions
                           │
               ┌───────────┴────────────┐
               ▼                        ▼
         Prisma ORM               Redis Queue
               │                        │
        Neon PostgreSQL           Crawl Workers
         (persist results)        (fetch & crawl URLs)

A user submits a URL through the UI.
A Server Action validates the input (Zod) and enqueues a job in Redis.
A background worker dequeues the job, performs the HTTP crawl, and saves results to PostgreSQL via Prisma.
The client receives live progress updates over a WebSocket connection.

📜 Available Scripts

Script	Description
`npm run dev`	Start the Next.js development server
`npm run build`	Build for production
`npm run start`	Start the production server
`npm run lint`	Run ESLint

🚀 Deployment

Deploy on Vercel

The easiest way to deploy ThunderCrawler is with Vercel:

Push your repository to GitHub.
Import it on vercel.com/new.
Set all environment variables from your .env file in the Vercel dashboard.
Deploy!

Note: The Redis-backed worker process needs to run separately (e.g., on a VPS, Railway, or Render) since Vercel is serverless and cannot run persistent background processes.

🤝 Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch: git checkout -b feature/your-feature
Commit your changes: git commit -m "feat: add your feature"
Push and open a Pull Request

📄 License

This project is open source and available under the MIT License.

👤 Author

THUNDERBLD — @THUNDERBLD

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

⚡ ThunderCrawler

✨ Features

🛠️ Tech Stack

📁 Project Structure

⚙️ Getting Started

Prerequisites

1. Clone the Repository

2. Install Dependencies

3. Configure Environment Variables

4. Start Redis (via Docker)

5. Set Up the Database

6. Run the Development Server

7. Run Background Workers

🏗️ Architecture Overview

📜 Available Scripts

🚀 Deployment

Deploy on Vercel

🤝 Contributing

📄 License

👤 Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.vscode		.vscode
actions		actions
app		app
components		components
config		config
generated/prisma		generated/prisma
hooks		hooks
lib		lib
prisma		prisma
public		public
tests		tests
types		types
workers		workers
.gitignore		.gitignore
README.md		README.md
bun.lock		bun.lock
components.json		components.json
debug-yc.html		debug-yc.html
docker-compose.yml		docker-compose.yml
eslint.config.mjs		eslint.config.mjs
index.ts		index.ts
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
prisma.config.ts		prisma.config.ts
tsconfig.json		tsconfig.json

Folders and files

Latest commit

History

Repository files navigation

⚡ ThunderCrawler

✨ Features

🛠️ Tech Stack

📁 Project Structure

⚙️ Getting Started

Prerequisites

1. Clone the Repository

2. Install Dependencies

3. Configure Environment Variables

4. Start Redis (via Docker)

5. Set Up the Database

6. Run the Development Server

7. Run Background Workers

🏗️ Architecture Overview

📜 Available Scripts

🚀 Deployment

Deploy on Vercel

🤝 Contributing

📄 License

👤 Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages