modelwright is an early-stage project for turning spreadsheet workbooks into transparent, version-controlled, standalone Python models.
The intended direction is a generic workflow that can inspect workbook structure, extract formulas and dependencies, generate maintainable Python source, and validate the generated model against the original workbook outputs.
This repository is currently an early implementation skeleton. It defines minimal Python package and test scaffolding plus initial validation, extraction, graph, generation, oracle records, and thin JSON command-line wrappers, but does not yet provide a release stability guarantee, catalog schema, or full workbook conversion.
- Build the first package-backed validation/report, workbook extraction, generation, and CLI cores.
- Keep extraction, code generation, validation, diagnostics, and reporting responsibilities separate.
- Avoid committing private notes, source workbooks, generated clones, or large artifacts while the project shape is still being established.
The durable API is organized by module responsibility:
modelwright.extraction: workbook extraction records andextract_workbook.modelwright.graph: dependency graph records andbuild_dependency_graph.modelwright.formulas: formula expression records, translation helpers, and reference-index helpers.modelwright.generation: generated-module records andgenerate_python_module.modelwright.validation: validation scenarios, scalar comparisons, and report records.modelwright.oracles,modelwright.formulas_oracle, andmodelwright.oracle_validation: oracle request/result records, optionalformulasoracle execution, and oracle-backed report assembly.
The package root modelwright exposes a curated convenience facade for those records and functions. Module-level imports remain preferred for implementation work because this project is still pre-release.
Bootstrap the repo-local virtual environment before using the console script:
scripts/bootstrap_dev_env.shThe current CLI prints JSON to stdout and stays close to the Python APIs:
modelwright workbook extract path/to/workbook.xlsx > tmp/extraction.json
modelwright workbook graph path/to/workbook.xlsx > tmp/dependency-graph.json
modelwright conversion plan path/to/workbook.xlsx > tmp/conversion-plan.json
modelwright model generate --contract tmp/contract.json --expressions tmp/expressions.json --constants tmp/constants.json --out tmp/generated_model.py > tmp/generation-result.json
modelwright validation report --scenario tests/fixtures/synthetic_model/baseline_scenario.json --generated-values tmp/generated-values.json --oracle-values tmp/oracle-values.json > tmp/validation-report.jsonThese commands do not provide a one-step workbook converter. conversion plan reports extraction, graphing, formula-translation, and residual-blocker status; model generate expects explicit generated-module and formula-expression JSON inputs; and validation report compares already-observed generated/oracle values. See planning/cli-json-workflows.md for JSON examples and workflow boundaries.
Bootstrap a repo-local virtual environment:
scripts/bootstrap_dev_env.shThis installs Modelwright with the dev extra:
.venv/bin/python -m pip install -e '.[dev]'Run lint checks:
.venv/bin/python -m ruff check .Run tests:
.venv/bin/python -m pytestBuild docs locally:
.venv/bin/sphinx-build -b html docs _build/html -W
.venv/bin/python scripts/verify_docs_theme.py _build/htmlRestore the public external FABLE benchmark workbooks into ignored local paths:
scripts/bootstrap_dev_env.sh --benchmarksmodelwright is pre-release. The current alpha line is 0.1.0a6; alpha releases must not be described as full-workbook conversion guarantees.
Check release artifacts locally:
scripts/check_release_artifacts.shRelease checks write build outputs under ignored tmp/release-checks/.
See docs/guides/release-deployment.rst for the release and deployment runbook.
AGENTS.mdis the working contract for AI coding agents.CONTRIBUTING.mdis the contributor onboarding and development workflow guide.ROADMAP.mdis the current plan and next-step tracker.CHANGE_LOG.mdis the append-only project narrative.planning/contains focused design notes and research records that are too detailed for the roadmap.benchmarks/contains tracked metadata for official external benchmarks; large workbook binaries remain untracked and are restored locally undertmp/.src/modelwright/contains the importable Python package.tests/contains package-backed tests and tracked synthetic fixture helpers.tmp/is ignored local working space for private notes, source workbooks, experiments, and generated scratch outputs.