From 3e7e972cb263029e298a4893bce1b1d4f157ddf0 Mon Sep 17 00:00:00 2001
From: Bertrand Travacca <bertrand.travacca@gmail.com>
Date: Mon, 22 Jun 2026 15:47:00 -0700
Subject: [PATCH 1/4] ci: add Open Journals draft-pdf workflow to compile JOSS
 paper.md

Adds a workflow_dispatch + paper-change-triggered job that runs the
Open Journals Pandoc pipeline (journal=joss) over the repo-root paper.md
and uploads paper.pdf as a build artifact, so the JOSS proof PDF can be
verified before submission.
---
 .github/workflows/draft-pdf.yml | 32 ++++++++++++++++++++++++++++++++
 1 file changed, 32 insertions(+)
 create mode 100644 .github/workflows/draft-pdf.yml

diff --git a/.github/workflows/draft-pdf.yml b/.github/workflows/draft-pdf.yml
new file mode 100644
index 0000000..0e837ad
--- /dev/null
+++ b/.github/workflows/draft-pdf.yml
@@ -0,0 +1,32 @@
+name: Draft JOSS paper PDF
+
+# Compiles paper.md into the JOSS-styled paper.pdf using the Open Journals
+# Pandoc pipeline, so the proof PDF can be checked before/at submission.
+# Runs manually (Actions tab -> Run workflow) and whenever the paper changes.
+
+on:
+  workflow_dispatch:
+  push:
+    paths:
+      - paper.md
+      - paper.bib
+      - .github/workflows/draft-pdf.yml
+
+jobs:
+  paper:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v4
+
+      - name: Build draft PDF
+        uses: openjournals/openjournals-draft-action@master
+        with:
+          journal: joss
+          paper-path: paper.md
+
+      - name: Upload paper.pdf
+        uses: actions/upload-artifact@v4
+        with:
+          name: paper
+          path: paper.pdf

From 41f138a6257b924724e3486e8ee2517a4f5df872 Mon Sep 17 00:00:00 2001
From: Bertrand Travacca <bertrand.travacca@gmail.com>
Date: Mon, 22 Jun 2026 15:52:52 -0700
Subject: [PATCH 2/4] docs(paper): bring JOSS paper.md up to updated JOSS scope

Adds the now-required Software design, Research impact statement, and AI
usage disclosure sections; strengthens State of the field with a
build-vs-contribute justification; adds a financial-support statement;
expands the body to 1064 words (updated JOSS range is 750-1750).
---
 paper.md | 136 +++++++++++++++++++++++++++++++++++--------------------
 1 file changed, 88 insertions(+), 48 deletions(-)

diff --git a/paper.md b/paper.md
index 96f6262..6b14f1a 100644
--- a/paper.md
+++ b/paper.md
@@ -32,12 +32,12 @@ measured charging telemetry is scarce, proprietary, and privacy-restricted.
 `ev-flow` produces such populations entirely from public inputs: it grounds
 every synthetic vehicle in 2017 National Household Travel Survey (NHTS)
 microdata [@nhts2017] and a per-region vehicle sales-mix model, then carries it
-through a deterministic nine-stage pipeline (M1–M9) to a time-stamped charging
-profile. It ships residential and workplace profile types, descriptive
-charging-equipment (EVSE brand and connector) enrichment, and both 15-minute and
-hourly time grids. Every output is stored in UTC, is timezone-aware, and is
-bit-for-bit reproducible from a single master random seed. The library exposes a
-high-level `generate_profiles` / `Fleet` / `Profile` API for downstream studies,
+through a deterministic nine-stage pipeline to a time-stamped charging profile.
+It ships residential and workplace profile types, descriptive charging-equipment
+(EVSE brand and connector) enrichment, and both 15-minute and hourly time grids.
+Every output is stored in UTC, is timezone-aware, and is bit-for-bit reproducible
+from a single master random seed. The library exposes a high-level
+`generate_profiles` / `Fleet` / `Profile` interface for downstream studies,
 together with `ev-flow bootstrap` and `ev-flow doctor` command-line tools that
 build and diagnose the underlying data on a fresh machine.
 
@@ -47,63 +47,103 @@ Quantifying the electricity demand and temporal flexibility of EV fleets is
 central to power-systems planning, transportation electrification, and energy
 economics. Because real per-vehicle charging records are rarely shareable,
 researchers turn to *bottom-up simulation*: deriving plausible charging load
-from travel behavior plus techno-economic assumptions. Existing open generators,
-however, are calibrated to non-U.S. mobility surveys or collapse the regional,
-seasonal, and equipment heterogeneity that drives aggregate demand. `ev-flow`
-targets that gap with a U.S.-focused, NHTS-grounded generator whose every
-modeling choice is traceable to a public source, and whose outputs are
-deterministic and auditable so that downstream results can be reproduced exactly.
+from travel behavior plus techno-economic assumptions. The problem `ev-flow`
+solves is the absence of an open, U.S.-grounded generator that preserves the
+regional, seasonal, and equipment heterogeneity driving aggregate demand while
+remaining fully deterministic and auditable. Its target users are energy-systems
+modelers, distribution-planning engineers, rate-design analysts, and researchers
+who need reproducible charging populations they can regenerate exactly rather
+than a fixed, opaque dataset.
 
-## State of the field
+# State of the field
 
 Several open tools generate or simulate EV charging, but they occupy adjacent
 niches. `emobpy` [@gaetemorales2021emobpy], `venco.py` [@miorelli2025vencopy],
 and RAMP-mobility [@mangipinto2022ramp] build EV demand and flexibility profiles
 from *European* national travel surveys; their behavioral calibration and
-charging-availability assumptions do not transfer cleanly to U.S. fleets.
+charging-availability assumptions do not transfer cleanly to U.S. fleets, whose
+vehicle mix, trip structure, and home/work charging access differ materially.
 `datafev` [@gumrukcu2023datafev] and ACN-Sim [@lee2021acnsim] are
 charging-infrastructure and control simulators that *consume* charging sessions
 to test management algorithms, rather than *generating* survey-grounded
-behavioral populations. `ev-flow` is complementary: it produces U.S.-regional,
-NHTS-grounded behavioral profiles that can serve directly as boundary inputs to
-those optimization, market, and control models.
+behavioral populations. Rather than re-skinning a European generator or
+hard-coding U.S. assumptions onto an infrastructure simulator, `ev-flow`
+contributes a distinct artifact: it ties every behavioral choice to a U.S.
+public source (NHTS travel diaries; the SPEECh charging model
+[@powell2022speech]), differentiates eight regions through an explicit sales-mix
+model, and emits profiles usable directly as boundary inputs to the optimization,
+market, and control models the tools above target. No existing open tool fills
+that U.S.-focused, NHTS-grounded, fully-reproducible niche, which is why building
+`ev-flow` was warranted rather than contributing to an existing project.
 
-# Implementation
+# Software design
 
-The nine pipeline stages stitch NHTS person-days into donor-matched 365-day
-travel calendars with a temperature-dependent winter energy uplift
-[@yuksel2015regional]; sample behavioral plug-in start times from the published
-SPEECh K=16 Gaussian-mixture parameterization [@powell2022speech;
-@powell2022speechdata]; evaluate a three-layer Bernoulli plug-in decision model
-[@munkhammar2015probability]; propagate a continuous-time state-of-charge ledger
-with an explicit PHEV gasoline range-extension term; and rasterize plug status to
-the output grids. Regional differentiation comes from the sales-mix model, so
-battery-capacity and powertrain distributions differ across the bay_area,
-la_basin, new_york_metro, boston, chicago, dallas_fort_worth, seattle, and
-us_national regions. The package is pure Python (requires Python ≥ 3.10), depends
-only on the scientific-Python stack, and ships a typed (`py.typed`) public API.
+`ev-flow` is organized as a nine-stage pipeline (internally M1–M9) behind a small
+public API, a structure chosen to make every transformation independently
+testable and cacheable. The central design trade-off is **determinism over raw
+throughput**: the entire pipeline is seeded from a single master seed so that a
+given `(region, profile_type, seed)` reproduces every profile byte-for-byte, and
+a continuous-integration job re-runs the pipeline to confirm identical cached
+output. This makes downstream scientific results exactly reproducible at the cost
+of some vectorization opportunities. Key modeling choices each reflect a
+grounding decision rather than convenience: NHTS single-day person records are
+stitched into donor-matched 365-day travel calendars with a temperature-dependent
+winter energy uplift [@yuksel2015regional]; behavioral plug-in start times are
+sampled from the published SPEECh K=16 Gaussian-mixture parameterization
+[@powell2022speech; @powell2022speechdata]; a three-layer Bernoulli plug-in model
+[@munkhammar2015probability] decides whether each opportunity is taken; and a
+continuous-time state-of-charge ledger with an explicit PHEV gasoline
+range-extension term propagates energy before plug status is rasterized to the
+15-minute and hourly grids. Regional heterogeneity is isolated in the sales-mix
+model, so battery-capacity and powertrain distributions vary across regions
+without touching the pipeline. A deliberate design principle is **auditable
+validation**: rather than hide divergences, a validation runner compares
+generated distributions against literature bounds and classifies each as PASS, an
+*explained* failure (a documented, sourced limitation), or an explained skip. The
+runner defines 32 distributional checks, 21 of which apply to a residential run;
+for the reference `bay_area` residential profile it reports 11 PASS, 0
+*unexplained* failures, 6 explained failures, and 4 explained skips — every
+non-PASS carrying an explicit rationale. The package is pure Python (requires
+Python ≥ 3.10), depends only on the scientific-Python stack, and ships a typed
+(`py.typed`) public API covered by an extensive `pytest` suite (500+ tests) on a
+multi-OS, multi-version CI matrix.
 
-# Reproducibility and validation
+# Research impact statement
 
-Determinism is a first-class contract: a single seed reproduces every profile
-byte-for-byte, and a continuous-integration job re-runs the pipeline to confirm
-identical cached output. A built-in validation runner compares the generated
-distributions against bounds drawn from the literature and classifies each
-divergence as PASS, an *explained* failure (a documented, sourced modeling
-limitation), or an explained skip. The runner defines 32 distributional checks,
-of which 21 apply to a residential run. For the reference `bay_area` residential
-profile it reports 11 PASS, 0 *unexplained* failures, 6 explained failures, and
-4 explained skips — every non-PASS carrying an explicit literature rationale
-rather than being hidden. The
-package is covered by an extensive `pytest` suite (500+ tests) run on a
-multi-OS, multi-version CI matrix with strict type-checking on the public API,
-and each release is archived to Zenodo for a citable DOI.
+`ev-flow` was extracted from an aggregated-EV grid-study research codebase and
+continues to supply that study's synthetic charging populations, so its outputs
+are already in active research use rather than being a prospective tool. A
+companion methodology preprint describing the same software has been posted to
+arXiv (2026), and each tagged release is archived to Zenodo with a citable DOI,
+giving downstream work a stable, versioned reference. The package is built for
+adoption by other groups: `ev-flow bootstrap` reconstructs the full data tree on
+a clean machine from public sources, `ev-flow doctor` diagnoses a broken
+install, three worked tutorials and a browser-based quickstart notebook lower the
+entry cost, and the deterministic, bit-reproducible output plus the published
+validation report are concrete community-readiness signals a reviewer or user can
+verify directly. Because its profiles are designed as boundary inputs, `ev-flow`
+is positioned to feed energy-system optimization, electricity-market, and
+charging-control models — the same downstream consumers served by the related
+tools above — with U.S.-regional behavior they currently lack.
+
+# AI usage disclosure
+
+Generative-AI assistants — specifically Anthropic's Claude (used via the Claude
+Code command-line tool) and the Cursor AI code editor — assisted with portions of
+the software development, documentation, and the drafting of this paper. All
+AI-assisted output was
+reviewed and verified by the author and is not accepted on trust: the code is
+covered by an automated `pytest` suite and continuous integration, every
+quantitative claim in this paper was checked against the cited sources and
+re-derived from the package's own validation runner, and the manuscript was put
+through two independent adversarial review passes before submission.
 
 # Acknowledgements
 
-This work builds directly on public data and methods, in particular the NHTS
-program of the U.S. Federal Highway Administration and the SPEECh charging model
-of Powell, Cezar, and Rajagopal. We thank the maintainers of the open
-scientific-Python ecosystem on which `ev-flow` depends.
+This work received no specific grant or external financial support. It builds
+directly on public data and methods, in particular the NHTS program of the U.S.
+Federal Highway Administration and the SPEECh charging model of Powell, Cezar,
+and Rajagopal. We thank the maintainers of the open scientific-Python ecosystem
+on which `ev-flow` depends.
 
 # References

From 8ffdffafad52eded6bc2e0da8cff82579ca1ed82 Mon Sep 17 00:00:00 2001
From: Bertrand Travacca <bertrand.travacca@gmail.com>
Date: Mon, 22 Jun 2026 15:59:35 -0700
Subject: [PATCH 3/4] ci: pin mypy < 2.1 to fix typecheck against numpy 2.5
 stubs

mypy 2.1.0 errors on numpy>=2.5 .pyi files ('Type statement is only
supported in Python 3.12 and greater') under the configured py3.10
type target, breaking the typecheck gate. Pin below 2.1 until the
stub/target interaction is resolved upstream.
---
 pyproject.toml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/pyproject.toml b/pyproject.toml
index 4847043..4c30ce2 100644
--- a/pyproject.toml
+++ b/pyproject.toml
@@ -51,7 +51,7 @@ dev = [
     "pytest>=8",
     "pytest-cov>=5",
     "ruff>=0.5",
-    "mypy>=1.10",
+    "mypy>=1.10,<2.1",  # 2.1.0 rejects numpy>=2.5 stubs under the py3.10 type target
     "build>=1.2",
     "twine>=5",
     "bandit>=1.7",

From 3abab0d956d0aa7aaeaa5b2dfbdd8f4382906789 Mon Sep 17 00:00:00 2001
From: Bertrand Travacca <bertrand.travacca@gmail.com>
Date: Mon, 22 Jun 2026 16:02:14 -0700
Subject: [PATCH 4/4] ci: cap numpy<2.5 in dev extra to fix typecheck (not mypy
 version)

The typecheck break is numpy 2.5.0's stubs using PEP 695 'type'
statements, which mypy rejects under the py3.10 target regardless of
mypy version (2.0 and 2.1 both fail). Revert the ineffective mypy<2.1
cap and instead bound numpy<2.5 in the dev extra only; runtime deps
stay unconstrained for end users.
---
 pyproject.toml | 7 ++++++-
 1 file changed, 6 insertions(+), 1 deletion(-)

diff --git a/pyproject.toml b/pyproject.toml
index 4c30ce2..52e5ee5 100644
--- a/pyproject.toml
+++ b/pyproject.toml
@@ -51,7 +51,12 @@ dev = [
     "pytest>=8",
     "pytest-cov>=5",
     "ruff>=0.5",
-    "mypy>=1.10,<2.1",  # 2.1.0 rejects numpy>=2.5 stubs under the py3.10 type target
+    "mypy>=1.10",
+    # numpy 2.5 ships PEP 695 `type` statements in its stubs, which mypy rejects
+    # under our py3.10 type target ("Type statement is only supported in Python
+    # 3.12 and greater"). Cap numpy for the dev/type-check env only; runtime
+    # `dependencies` above stay unconstrained for end users.
+    "numpy<2.5",
     "build>=1.2",
     "twine>=5",
     "bandit>=1.7",