CLEAR — Clean, Leakage-free Evaluation And Reporting

An open toolkit for honest, leakage-free evaluation of highway vehicle-trajectory prediction. It anchors every model against the zero/low-parameter baselines — constant velocity, constant acceleration, and the Intelligent Driver Model — under a recording-/vehicle-disjoint split, and emits a standardized Eval Card that forces disclosure of the things that otherwise inflate reported numbers: the baseline set, the split, the scoring convention, stratified error, and a predicted-collision admissibility metric.

Reference implementation for the paper "Constant Acceleration Is a Hard Baseline: A Leakage-Free Re-Evaluation of Highway Vehicle-Trajectory-Prediction Benchmarks."

Why

Leakage inflates reported accuracy. Sliding windows cut from the same vehicle or recording can land in both the train and test folds; CLEAR forbids this by default and prints a PASS-CLEAN / WARN-LEAKY verdict.
CA is a hard baseline. A zero-parameter constant-acceleration model is unbeaten by trained models — including official published architectures — on clean highway data (highD, exiD).
Averaging hides the tail. ~77% of windows are trivial cruising; a single RMSE buries the safety-critical cases, so CLEAR reports difficulty-stratified error.
Accuracy ≠ safety. The lowest-error model can forecast the most rear-end collisions; CLEAR measures predicted-collision admissibility alongside displacement error.
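
To make the leakage point concrete, here is a minimal sketch of a recording-disjoint split and a check that yields the two verdict strings. It is an illustration only, not CLEAR's implementation: the function names `recording_disjoint_split` and `leakage_verdict`, the `test_frac` parameter, and the per-window `recording_ids` array are assumptions made for this example.

```python
import numpy as np


def recording_disjoint_split(recording_ids, test_frac=0.2, seed=0):
    """Hold out whole recordings. Returns boolean (train_mask, test_mask) per window."""
    recording_ids = np.asarray(recording_ids)
    unique_ids = np.unique(recording_ids)
    rng = np.random.default_rng(seed)
    shuffled = rng.permutation(unique_ids)
    # Always hold out at least one recording.
    n_test = max(1, int(round(test_frac * len(shuffled))))
    test_mask = np.isin(recording_ids, shuffled[:n_test])
    return ~test_mask, test_mask


def leakage_verdict(recording_ids, train_mask, test_mask):
    """WARN-LEAKY if any recording contributes windows to both folds."""
    recording_ids = np.asarray(recording_ids)
    shared = np.intersect1d(recording_ids[train_mask], recording_ids[test_mask])
    return "PASS-CLEAN" if shared.size == 0 else "WARN-LEAKY"


# Toy data: 10 recordings with 50 windows each (synthetic, for illustration).
ids = np.repeat(np.arange(10), 50)

train, test = recording_disjoint_split(ids)
print(leakage_verdict(ids, train, test))      # PASS-CLEAN

# A window-level split that takes every fifth window mixes recordings across folds.
leaky_test = np.arange(len(ids)) % 5 == 0
print(leakage_verdict(ids, ~leaky_test, leaky_test))   # WARN-LEAKY
```

The same idea extends to a vehicle-disjoint split by passing per-window vehicle identifiers instead of recording identifiers.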

Install

pip install -e .            # core (NumPy only)
pip install -e ".[figures]" # + matplotlib for card figures
pip install -e ".[deep]"    # + torch for the official CS-LSTM/STDAN adapters

No data ships with CLEAR. Obtain highD/exiD (https://levelxdata.com) and NGSIM under their own licenses; point the loaders at your local copy.

Quickstart

clear run --data /path/to/highD/data --split recording --out card/
# or: python -m clear.cli run --data /path/to/exiD/data --split random   # emits WARN-LEAKY

from clear import load_levelx, evaluate, Predictor

ws   = load_levelx("highD/data")              # highD / exiD / inD / rounD / uniD (levelX); load_ngsim for NGSIM
card = evaluate(ws, split="recording", out_dir="card/")
print(card.beats_CA)                          # did your models beat CA @5s?
card.to_markdown("card/evalcard.md")
card.to_figure("card/evalcard.pdf")

Plug in any model by subclassing Predictor:

from clear import Predictor, evaluate

class MyModel(Predictor):
    name = "MyModel"

    def predict(self, ws):           # -> (N, HOR, 2)  [lat, long] in the per-window canonical frame
        ...

card = evaluate(ws, models=[MyModel()], split="recording")   # ws as loaded in the Quickstart above
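
For orientation, the sketch below shows the kind of (N, HOR, 2) array a `predict` method returns, using constant-velocity and constant-acceleration extrapolation. It is not CLEAR's baseline code: the function names, the `hist` array of shape (N, T, 2), and the sampling interval `dt` are assumptions for illustration, and how the history is read from `ws` depends on CLEAR's own accessors.

```python
import numpy as np


def constant_velocity_forecast(hist, horizon, dt):
    """Extrapolate the last finite-difference velocity. hist: (N, T>=2, 2) -> (N, horizon, 2)."""
    hist = np.asarray(hist, dtype=float)
    vel = (hist[:, -1] - hist[:, -2]) / dt                   # (N, 2)
    t = np.arange(1, horizon + 1)[None, :, None] * dt        # (1, horizon, 1)
    return hist[:, -1:, :] + vel[:, None, :] * t


def constant_acceleration_forecast(hist, horizon, dt):
    """Extrapolate with a fixed acceleration. hist: (N, T>=3, 2) -> (N, horizon, 2)."""
    hist = np.asarray(hist, dtype=float)
    v_last = (hist[:, -1] - hist[:, -2]) / dt                # velocity at the last half-step
    v_prev = (hist[:, -2] - hist[:, -3]) / dt
    acc = (v_last - v_prev) / dt                             # (N, 2)
    # Backward differences lag by half a step; shift the velocity to the last sample.
    v_end = v_last + 0.5 * acc * dt
    t = np.arange(1, horizon + 1)[None, :, None] * dt
    return hist[:, -1:, :] + v_end[:, None, :] * t + 0.5 * acc[:, None, :] * t**2
```

A `predict` body could return either function's output directly once the observed history has been extracted from `ws`. Both estimators use only the last two or three samples, so they are sensitive to measurement noise; smoothing the history first is a reasonable variation.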

Run the official deep models inside CLEAR

CS-LSTM / STDAN consume a convolutional social grid that CLEAR builds with social_grid=True:

from clear import load_levelx, evaluate
from clear.deep import CSLSTM
ws    = load_levelx("highD/data", social_grid=True)
model = CSLSTM(repo="external/conv-social-pooling", weights="cslstm.tar")  # your clone + weights
card  = evaluate(ws, models=[model], split="recording")
print(card.beats_CA)   # {'CS-LSTM': False} on clean highway data — CA wins

CLEAR does not redistribute third-party model code; clone the official repo and point the adapter at it.

The Eval Card

Protocol disclosure — windows, horizon, split + leakage verdict.
Baseline ladder — FDE (mean/median) & RMSE @1–5 s with bootstrap CIs, skill-over-CV, and the paired CA win-rate.
Stratified error — per kinematic regime (cruise/mild/hard) and maneuver, with window shares.
Admissibility — predicted rear-end-collision rate (overall and hard-braking tail).
Scoring transparency — best-of-k sweep for multimodal models.
Reporting checklist — five ticks operationalizing the paper's recommendations.
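
The baseline-ladder quantities can be sketched in a few lines of NumPy. This is an illustration under stated assumptions, not the code behind the card: the function names are hypothetical, skill-over-CV is taken here to be one minus the ratio of mean errors, and the paired win-rate is taken to be the share of windows on which a model's error is strictly lower than CA's on the same window.

```python
import numpy as np


def fde(pred, truth):
    """Final displacement error per window. pred, truth: (N, HOR, 2) -> (N,)."""
    diff = np.asarray(pred, dtype=float)[:, -1] - np.asarray(truth, dtype=float)[:, -1]
    return np.linalg.norm(diff, axis=-1)


def bootstrap_ci(values, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the mean of per-window errors."""
    values = np.asarray(values, dtype=float)
    rng = np.random.default_rng(seed)
    # Resample windows with replacement; one row of indices per bootstrap replicate.
    idx = rng.integers(0, len(values), size=(n_boot, len(values)))
    means = values[idx].mean(axis=1)
    lo, hi = np.quantile(means, [alpha / 2, 1 - alpha / 2])
    return float(lo), float(hi)


def skill_over_cv(err_model, err_cv):
    """Assumed definition: 1 - mean(model error) / mean(CV error). Positive means better than CV."""
    return 1.0 - float(np.mean(err_model)) / float(np.mean(err_cv))


def paired_win_rate(err_model, err_ca):
    """Assumed definition: fraction of windows where the model beats CA on the same window."""
    return float(np.mean(np.asarray(err_model) < np.asarray(err_ca)))


if __name__ == "__main__":
    # Synthetic errors only, to show the call pattern; these are not results.
    rng = np.random.default_rng(1)
    err_cv = rng.gamma(2.0, 1.0, size=200)
    err_ca = rng.gamma(2.0, 0.8, size=200)
    err_model = rng.gamma(2.0, 0.9, size=200)
    print("mean FDE CI:", bootstrap_ci(err_model))
    print("skill over CV:", skill_over_cv(err_model, err_cv))
    print("win rate vs CA:", paired_win_rate(err_model, err_ca))
```

Resampling whole windows treats them as independent. When windows overlap within a recording, resampling by recording gives a more conservative interval.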

python -m clear.repro regenerates a card per dataset (highD/exiD/NGSIM).

Reproducibility & tests

pip install -e ".[dev]" && pytest -q     # data-free unit tests run in CI

Citation

If you use CLEAR, please cite the software (archived on Zenodo) and the paper. See CITATION.cff.

Aredah, A. CLEAR: Clean, Leakage-free Evaluation And Reporting for highway trajectory prediction (v0.3.0). Zenodo. https://doi.org/10.5281/zenodo.20800658

Roadmap

v0.3 (current): levelX + NGSIM adapters, leakage-safe splits, CV/CA/IDM, full metrics + admissibility, the Eval Card (JSON/MD/figure), CLI, and the CS-LSTM/STDAN deep adapters.
v1.0: docs site, broader dataset coverage, PyPI release, archived Zenodo DOI.

License

BSD-3-Clause — see LICENSE.
