Distributed Job Scheduler

A highly available, fault-tolerant distributed job scheduling system built with Java 21 and Spring Boot, demonstrating advanced distributed systems concepts.

🎯 Project Goals

This project showcases:

Leader Election: Redis-based leader election with automatic failover
Distributed Locking: Redlock algorithm to prevent duplicate job execution
Fault Tolerance: Automatic recovery from node failures
Observability: Comprehensive metrics, logging, and health checks
Production-Ready: Docker, Kubernetes, CI/CD ready

🏗️ Architecture

See ARCHITECTURE.md for comprehensive architecture documentation including:

High-level system design
Component interactions
Leader election process
Job execution flow
Database schema
Deployment architecture

🚀 Quick Start

Prerequisites

Java 21 (OpenJDK or Oracle JDK)
Maven 3.9+
Docker & Docker Compose

Local Development

Start infrastructure services

cd deployment/docker
docker-compose up -d mysql redis

Build the project
```
mvn clean install
```
Run the application
```
mvn spring-boot:run
```
Access the application
- API: http://localhost:8080/api
- Health: http://localhost:8080/actuator/health
- Metrics: http://localhost:8080/actuator/prometheus

Multi-Node Cluster

Run a 3-node cluster with Docker Compose:

cd deployment/docker
docker-compose up -d

This starts:

3 scheduler nodes (ports 8080, 8081, 8082)
MySQL database
Redis cluster

Note: Prometheus and Grafana are disabled by default. They will be enabled in Phase 4 (Observability). See docs/OBSERVABILITY_STRATEGY.md

📚 Documentation

DEVELOPMENT.md - Development progress tracker and phase index
ARCHITECTURE.md - Comprehensive architecture documentation
DIAGRAMS_ASCII.md - ASCII architecture diagrams
docs/features/ - Feature-specific documentation

🧪 Testing

# Run all tests
mvn test

# Run only unit tests
mvn test -Dtest=*Test

# Run only integration tests
mvn test -Dtest=*IntegrationTest

# Run with coverage
mvn test jacoco:report

🛠️ Technology Stack

Backend:

Java 21 (Virtual Threads, Records, Pattern Matching)
Spring Boot 3.2.3
Spring Data JPA (Hibernate 6.4)
Redisson 3.27.0 (Redis client)
Flyway 10.8.1 (Database migrations)

Infrastructure:

Redis 7.2+ (Coordination)
MySQL 8.0+ (Persistence)
Prometheus + Grafana (Observability)
Docker & Kubernetes (Deployment)

📊 Key Features

Phase 1: Core Infrastructure ✅

Project structure and Maven setup
Database schema with Flyway (V1-V4 migrations)
Core domain entities (Job, JobExecution, SchedulerNode)
JPA repositories
Coordination layer (leader election, distributed locking)
REST API controllers
Job executor with virtual threads

Phase 2: Leader Election & Failover (Coming Soon)

Redis-based leader election
Heartbeat mechanism
Automatic failover
Fencing tokens

Phase 3: Distributed Locking (Coming Soon)

Redlock implementation
Idempotency service
Retry logic with exponential backoff
Job state machine

Phase 4: Observability (Deferred)

Prometheus metrics
Custom health indicators
Structured logging
Distributed tracing

Note: Observability features deferred to focus on core distributed systems functionality first.

🤝 Contributing

This is a portfolio project . Contributions are welcome!

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'feat: add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.

👥 Authors

Scheduler Team - Initial work

🙏 Acknowledgments

Inspired by production job schedulers like Quartz, Airflow, and Temporal
Built to demonstrate distributed systems expertise
Designed with production-ready patterns and best practices

Status: 🚧 Phase 1 in progress
Last Updated: 2026-03-07

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
deployment		deployment
docs		docs
scheduler-ui		scheduler-ui
src		src
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
BUILD_FIX_SUMMARY.md		BUILD_FIX_SUMMARY.md
DEVELOPMENT.md		DEVELOPMENT.md
DIAGRAMS_ASCII.md		DIAGRAMS_ASCII.md
README.md		README.md
pom.xml		pom.xml
verify-build.bat		verify-build.bat
verify-build.sh		verify-build.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Distributed Job Scheduler

🎯 Project Goals

🏗️ Architecture

🚀 Quick Start

Prerequisites

Local Development

Multi-Node Cluster

📚 Documentation

🧪 Testing

🛠️ Technology Stack

📊 Key Features

Phase 1: Core Infrastructure ✅

Phase 2: Leader Election & Failover (Coming Soon)

Phase 3: Distributed Locking (Coming Soon)

Phase 4: Observability (Deferred)

🤝 Contributing

📝 License

👥 Authors

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Distributed Job Scheduler

🎯 Project Goals

🏗️ Architecture

🚀 Quick Start

Prerequisites

Local Development

Multi-Node Cluster

📚 Documentation

🧪 Testing

🛠️ Technology Stack

📊 Key Features

Phase 1: Core Infrastructure ✅

Phase 2: Leader Election & Failover (Coming Soon)

Phase 3: Distributed Locking (Coming Soon)

Phase 4: Observability (Deferred)

🤝 Contributing

📝 License

👥 Authors

🙏 Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages