MedChronology AI 🏥

Medical Records Chronology Builder — upload multiple medical PDFs and extract a sorted timeline of events with source document and page citations.

Built for legal, insurance, and review workflows where teams spend significant time turning scattered medical records into usable chronologies.

What it does

Upload — drag in any number of medical PDFs (GP letters, hospital discharge summaries, specialist reports, test results)
Extract — AI reviews every page and identifies dated medical events
Chronology — events are sorted oldest → newest with source doc + page number citations
Export — download as CSV, JSON, or plain text (Word-ready)

Example output

Date	Event	Source	Page	Confidence
2022-11-03	Emergency PCI following ST-elevation MI	hospital_letter.pdf	2	high
2022-11-22	Echocardiogram — EF 45%	discharge_summary.pdf	1	high
2022-12-10	Commenced Bisoprolol 2.5mg and Ramipril 5mg	gp_letter.pdf	3	high
2023-01-02	Ramipril increased to 10mg following BP review	gp_letter.pdf	3	medium
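
As a rough illustration of how rows like these could be held, sorted and exported, here is a minimal sketch using only the standard library. The Event class and the helper names are hypothetical and are not taken from app.py or extractor.py. It assumes dates have already been normalised to ISO format (YYYY-MM-DD), so plain string order matches date order.

```python
import csv
import io
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class Event:
    """One chronology row (hypothetical shape mirroring the columns above)."""
    date: str        # ISO 8601 (YYYY-MM-DD), so string order equals date order
    event: str
    source: str
    page: int
    confidence: str  # "high", "medium" or "low"


FIELDS = ["date", "event", "source", "page", "confidence"]


def sort_events(events):
    """Return events oldest to newest; ties keep their input order."""
    return sorted(events, key=lambda e: e.date)


def to_csv(events):
    """Serialise events to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    for e in events:
        writer.writerow(asdict(e))
    return buffer.getvalue()


def to_json(events):
    """Serialise events to a JSON array of objects."""
    return json.dumps([asdict(e) for e in events], indent=2)


events = [
    Event("2022-12-10", "Commenced Bisoprolol 2.5mg and Ramipril 5mg", "gp_letter.pdf", 3, "high"),
    Event("2022-11-03", "Emergency PCI following ST-elevation MI", "hospital_letter.pdf", 2, "high"),
]
timeline = sort_events(events)
print(to_csv(timeline))
```

Keeping the source and page on every row means each exported line still carries its citation, whichever format is chosen.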

Architecture

Uploaded PDFs
    │
    ▼
PyMuPDF (page-by-page text extraction)
    │
    ▼
OpenAI GPT-4o-mini (event extraction per page)
    │  Extracts: date, event, detail, confidence
    │
    ▼
dateparser (normalises messy date formats)
    │
    ▼
Deduplication (removes near-duplicates across docs)
    │
    ▼
Sorted chronological timeline
    │
    ▼
Streamlit UI (timeline cards / table / export)
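
The flow above could be sketched roughly as follows. This is an assumption-laden outline, not the code in extractor.py: build_chronology, iter_pdf_pages and stub_extract are hypothetical names, the LLM call is replaced by a stub so the example runs offline, and the date normalisation and deduplication stages are left out. Only fitz.open, iteration over a document's pages, page.number and page.get_text() are real PyMuPDF calls.

```python
import re
from typing import Callable, Dict, Iterable, Iterator, List, Tuple

Page = Tuple[str, int, str]  # (source file, 1-based page number, page text)


def iter_pdf_pages(path: str) -> Iterator[Page]:
    """Yield one (source, page number, text) tuple per page of a PDF."""
    import fitz  # PyMuPDF; imported lazily so the rest runs without it

    doc = fitz.open(path)
    try:
        for page in doc:
            # page.number is 0-based; citations read better 1-based
            yield path, page.number + 1, page.get_text()
    finally:
        doc.close()


def build_chronology(
    pages: Iterable[Page],
    extract_events: Callable[[str], List[Dict[str, str]]],
) -> List[Dict[str, object]]:
    """Extract events page by page, attach citations, sort oldest to newest."""
    timeline = []
    for source, page_number, text in pages:
        for event in extract_events(text):
            timeline.append({**event, "source": source, "page": page_number})
    # Assumes each event["date"] is already ISO formatted (YYYY-MM-DD)
    return sorted(timeline, key=lambda e: e["date"])


def stub_extract(text: str) -> List[Dict[str, str]]:
    """Stand-in for the LLM call: treats a leading ISO date as the event date."""
    match = re.match(r"(\d{4}-\d{2}-\d{2})\s+(.*)", text)
    return [{"date": match.group(1), "event": match.group(2)}] if match else []


pages = [
    ("gp_letter.pdf", 3, "2022-12-10 Commenced Bisoprolol 2.5mg and Ramipril 5mg"),
    ("hospital_letter.pdf", 2, "2022-11-03 Emergency PCI following ST-elevation MI"),
]
timeline = build_chronology(pages, stub_extract)
for row in timeline:
    print(row["date"], row["source"], "p." + str(row["page"]))
```

Because the extractor only ever sees one page at a time, the page number attached to each event is known exactly rather than inferred afterwards.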

Key design decisions:

Page-by-page extraction → precise page citations
GPT-4o-mini chosen for cost efficiency
dateparser handles messy formats — "14th Jan 23", "approx Nov 2020"
Confidence scoring: high / medium / low based on date specificity
Deduplication handles the same event appearing across multiple docs
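
To make the last two decisions concrete, here is one possible way to score date specificity and drop near-duplicates. Everything in it is an assumption for illustration: the rules (hedged or year-only dates score low, month plus year scores medium, a full date scores high), the 0.85 similarity threshold, and the function names score_confidence and dedupe are not taken from the project.

```python
import re
from difflib import SequenceMatcher

MONTH = r"(jan|feb|mar|apr|may|jun|jul|aug|sep|oct|nov|dec)[a-z]*"
# Words that signal an approximate date (assumed list)
HEDGE = re.compile(r"\b(approx|approximately|circa|around|about|early|mid|late)\b|~", re.I)
FULL_DATE = re.compile(
    r"\b\d{4}-\d{2}-\d{2}\b"                                  # 2022-11-03
    r"|\b\d{1,2}[/.-]\d{1,2}[/.-]\d{2,4}\b"                   # 03/11/2022
    rf"|\b\d{{1,2}}(st|nd|rd|th)?\s+{MONTH}\s+\d{{2,4}}\b",   # 14th Jan 23
    re.I,
)
MONTH_YEAR = re.compile(rf"\b{MONTH}\s+\d{{2,4}}\b", re.I)    # Nov 2020


def score_confidence(raw_date):
    """Map the specificity of a raw date string to high / medium / low."""
    if HEDGE.search(raw_date):
        return "low"
    if FULL_DATE.search(raw_date):
        return "high"
    if MONTH_YEAR.search(raw_date):
        return "medium"
    return "low"


def dedupe(events, threshold=0.85):
    """Keep the first of any events sharing a date and near-identical text."""
    kept = []
    for event in events:
        is_duplicate = any(
            event["date"] == seen["date"]
            and SequenceMatcher(
                None, event["event"].lower(), seen["event"].lower()
            ).ratio() >= threshold
            for seen in kept
        )
        if not is_duplicate:
            kept.append(event)
    return kept


events = [
    {"date": "2022-11-22", "event": "Echocardiogram - EF 45%", "source": "discharge_summary.pdf"},
    {"date": "2022-11-22", "event": "Echocardiogram, EF 45%", "source": "gp_letter.pdf"},
    {"date": "2022-12-10", "event": "Commenced Bisoprolol 2.5mg", "source": "gp_letter.pdf"},
]
unique = dedupe(events)
print(len(unique), score_confidence("14th Jan 23"), score_confidence("approx Nov 2020"))
```

Requiring an exact date match before comparing text keeps unrelated events on different days from being merged just because their wording is similar.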

Stack

Layer	Tool
Language	Python 3.12
UI	Streamlit
PDF extraction	PyMuPDF (fitz)
Date normalisation	dateparser
LLM	OpenAI GPT-4o-mini
Testing	pytest
Linting	Ruff
CI	GitHub Actions
Hosting	Streamlit Cloud

Setup

1. Clone and install

git clone https://github.com/Keer0205/medchrono
cd medchrono
pip install -r requirements.txt

2. Add your OpenAI API key

mkdir -p .streamlit
echo 'OPENAI_API_KEY = "sk-..."' > .streamlit/secrets.toml

Never commit secrets.toml — it is in .gitignore.

3. Run

streamlit run app.py

4. Run tests

pytest tests/ -v

Deploy to Streamlit Cloud

Push to GitHub (secrets.toml is gitignored — do NOT push it)
Go to share.streamlit.io and select your repo
In Secrets settings, paste: OPENAI_API_KEY = "sk-..."
Deploy

Live demo: https://medchronology-ai.streamlit.app

Cost estimate

All figures are approximate and will vary based on document length and content density.

Bundle size	Approx cost
1 GP letter (2 pages)	~£0.001
5 documents (20 pages)	~£0.01
50 documents (200 pages)	~£0.10

GPT-4o-mini is used for extraction — very low cost per page.
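
For anyone who wants to redo the arithmetic for their own documents, a back-of-envelope estimator might look like the sketch below. Every default is an assumption rather than a figure from this project: the tokens per page, the per-million-token prices (check OpenAI's current pricing page) and the exchange rate are placeholders, so the results will not necessarily match the table above.

```python
def estimate_cost_gbp(
    pages,
    input_tokens_per_page=1000,   # assumed: prompt plus page text
    output_tokens_per_page=200,   # assumed: extracted events as JSON
    usd_per_million_input=0.15,   # assumed list price, verify before relying on it
    usd_per_million_output=0.60,  # assumed list price, verify before relying on it
    gbp_per_usd=0.80,             # assumed exchange rate
):
    """Estimate extraction cost in GBP for a given number of pages."""
    input_usd = pages * input_tokens_per_page / 1_000_000 * usd_per_million_input
    output_usd = pages * output_tokens_per_page / 1_000_000 * usd_per_million_output
    return (input_usd + output_usd) * gbp_per_usd


for page_count in (2, 20, 200):
    print(page_count, "pages:", round(estimate_cost_gbp(page_count), 4), "GBP")
```

Cost scales linearly with page count under this model, since each page is sent as its own request.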

Limitations

Text-based PDFs only — scanned images need OCR pre-processing
Handwritten notes not supported in current version
Dates in unusual formats may have lower confidence
Always verify outputs against the original source documents
Not a substitute for professional legal or medical review
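
On the first limitation: one simple way to spot pages that probably need OCR is to check whether the extracted text layer is empty or nearly so. The sketch below is a heuristic under stated assumptions; likely_needs_ocr and the 30-character cutoff are hypothetical and not part of the current app.

```python
def likely_needs_ocr(page_text, min_chars=30):
    """Flag a page whose extracted text layer is empty or near-empty."""
    visible = "".join(page_text.split())  # drop all whitespace before counting
    return len(visible) < min_chars


print(likely_needs_ocr(""))
print(likely_needs_ocr("Patient attended clinic on 3 November 2022 with chest pain."))
```

A flag like this could be shown next to the affected page so the reviewer knows that events from it may be missing from the chronology.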

Next steps (roadmap)

OCR support for scanned PDFs (Tesseract or Azure OCR)
Handwritten note support
Export to Word document with formatted chronology table
Highlight conflicting dates across documents
Login for multi-user firm access

Related projects

ClinicOps Copilot — same RAG architecture, applied to aesthetic clinic documents

Disclaimer

For informational use only. This tool extracts and organises information from uploaded documents. It does not provide medical or legal advice. Always verify outputs against the original source documents.

Built by Keerthana | Legal AI portfolio project
