This repository hosts the reprodb.github.io website, which provides statistics and analysis of artifact evaluations across computer science research conferences.
The website presents:
- Area Dashboards: Dedicated pages for Security and Systems conferences
- Per-Conference Pages: Detailed statistics, author rankings, and repository data for each conference
- Prolific Authors: Per-area rankings of researchers with most artifact contributions, with reproducibility and functional rates
- Repository Statistics: Stars, forks, and language breakdowns for artifact repos
- Visualizations: SVG charts showing artifact evaluation trends over time
Data is automatically collected from:
- sysartifacts.github.io — Systems conferences (EuroSys, SOSP, SC, …)
- secartifacts.github.io — Security conferences (ACSAC, CHES, NDSS, PETS, …)
- USENIX — Direct scraping for conferences like FAST
- DBLP — Computer science bibliography for author statistics
- Jekyll — Static site generator
- GitHub Pages — Hosting with automatic deployment
- Minimal Mistakes — Jekyll theme (v4.27)
- Python — Data processing (see reprodb-pipeline)
docker run --rm -v "$PWD:/srv/jekyll" -p 4000:4000 jekyll/jekyll:4.2.2 \
sh -c "bundle install --quiet && jekyll serve"bundle install
bundle exec jekyll serve
# View at http://localhost:4000.
├── _config.yml # Jekyll configuration
├── _includes/
│ ├── conference_page.html # Shared template for per-conference pages
│ └── masthead.html # Custom navigation header with dropdowns
├── _data/
│ ├── summary.yml # Overall statistics
│ ├── artifacts_by_conference.yml
│ ├── artifacts_by_year.yml
│ ├── authors.yml # Top prolific authors (all areas)
│ ├── author_summary.yml
│ ├── systems_authors.yml # Systems area author rankings
│ ├── security_authors.yml # Security area author rankings
│ ├── repo_stats.yml # GitHub repo metadata
│ └── navigation.yml # Dropdown navigation structure
├── assets/
│ ├── charts/ # Generated SVG visualizations
│ ├── data/ # JSON data exports
│ ├── css/main.scss # Custom styles
│ └── images/logo.svg # Site logo
├── systems/ # Per-conference pages (EuroSys, FAST, SC, SOSP)
├── security/ # Per-conference pages (ACSAC, CHES, NDSS, …)
├── index.md # Homepage
├── statistics.md # Statistics dashboard
├── systems.md # Systems area overview
├── security.md # Security area overview
├── systems_authors.md # Systems prolific authors
├── security_authors.md # Security prolific authors
├── repo_stats.md # Repository statistics overview
├── methodology.md # How data is collected
└── about.md # About the project
Data is updated monthly via GitHub Actions in the reprodb-pipeline repository.
The workflow:
- Scrapes artifact evaluation results from sysartifacts, secartifacts, and USENIX
- Collects repository statistics (stars, forks, languages)
- Generates SVG visualizations
- Matches papers with DBLP for author identification and metrics
- Splits author data by area (systems/security)
- Commits updated data to this repository
Contributions welcome! Please:
- Fork the repository
- Create a feature branch
- Make your changes
- Submit a pull request
Every pull request runs three automated checks (see .github/workflows/pr-checks.yml):
| Check | What it does | When it fails |
|---|---|---|
| Jekyll Build | Builds the full site with jekyll-build-pages |
Liquid syntax errors, missing includes, bad YAML front matter |
| JSON Schema Validation | Validates src/assets/data/*.json against schemas in ReproDB/data-schemas |
Data files don't match expected schema (missing fields, wrong types) |
| Data Immutability | Blocks PRs that modify src/assets/data/ or src/_data/ |
Data changes should come from the reprodb-pipeline, not manual edits |
- Pages: Keep pages as
.mdwhere possible. Alpine.js/Tabulator controls live in_includes/*.htmlpartials. - Data: Do not manually edit JSON files in
src/assets/data/. If data needs updating, run the pipeline. - Schemas: If the pipeline produces a new field, update the schema in data-schemas first.
For data issues, please report them in the reprodb-pipeline repository.
Apache License 2.0 — see LICENSE for details.
- reprodb-pipeline — Data collection and analysis pipeline
- sysartifacts.github.io — Systems artifact evaluation results
- secartifacts.github.io — Security artifact evaluation results
Inspired by Systems Circus and maintained by the research artifacts community.
Data sources:
- DBLP — Computer science bibliography
- CSRankings — Faculty affiliation data