Document OCR API

A Flask API for extracting structured data from identity documents and financial records. The project currently supports passport MRZ extraction, Nigerian NIN card/slip parsing, voter ID and driver's license parsing for Nigeria and Ghana, bank statement parsing, and an optional generic webhook forwarder.

The codebase is organized so new document types and country-specific rules can be added without reshaping the whole service.

Features

Passport MRZ extraction with TD3 validation and image-quality checks.
Nigerian NIN card and slip parsing with normalized response fields.
Voter ID and driver's license parsing for Nigeria and Ghana.
Bank statement summary extraction from PDFs and images.
Optional webhook forwarding to up to three configured targets.
Swagger UI at /api-docs.
HMAC request-signing utilities for production authentication.
OCR backend abstraction with RapidOCR first and optional EasyOCR fallback.

Project Structure

document-ocr-api/
  app.py
  requirements.txt
  src/
    api/
      routes.py
    countries/
      profile.py
      registry.py
      ghana/
      nigeria/
    core/
      auth.py
      flash_glance.py
      ocr_engine.py
    document_ocr/
      bank_statement/
      drivers_license/
      nin/
      passport/
      voter_id/
    webhook_forwarder/
      broadcast.py
      routes.py
      signing.py
  tests/

Requirements

Python 3.11 or 3.12
pip
RapidOCR dependencies from requirements.txt

RapidOCR is the preferred OCR backend. EasyOCR can be enabled as a fallback with ENABLE_EASYOCR_FALLBACK=1, but it is slower.

Setup

git clone https://github.com/YOUR_USERNAME/document-ocr-api.git
cd document-ocr-api
python -m venv venv
venv\Scripts\activate
pip install -r requirements.txt
copy .env.example .env

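The activate and copy commands above use Windows syntax. On Linux or macOS, activate the environment and copy the example configuration with:

source venv/bin/activate
cp .env.example .env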

Configuration

Create a .env file from .env.example.

OCR_SECRET_KEY=change-this-in-production

FORWARDER_SECRET=change-this-if-webhook-forwarding-is-enabled
FORWARDER_TARGET_1_URL=https://endpoint1.example.com/webhook
FORWARDER_TARGET_2_URL=https://endpoint2.example.com/webhook
FORWARDER_TARGET_3_URL=https://endpoint3.example.com/webhook

ENABLE_EASYOCR_FALLBACK=0

FORWARDER_* settings are only required if /api/webhooks/forward is used.
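
As an illustration of how these variables might be consumed at startup, here is a minimal sketch that reads them from the environment. The helper names load_forwarder_targets and easyocr_fallback_enabled are hypothetical and are not taken from the codebase; only the variable names come from the configuration above.

```python
import os


def load_forwarder_targets(env=None):
    """Collect FORWARDER_TARGET_<n>_URL for n = 1..3, skipping unset or blank values."""
    env = os.environ if env is None else env
    targets = []
    for index in range(1, 4):
        url = env.get(f"FORWARDER_TARGET_{index}_URL", "").strip()
        if url:
            targets.append(url)
    return targets


def easyocr_fallback_enabled(env=None):
    """Treat ENABLE_EASYOCR_FALLBACK=1 as enabled; any other value as disabled."""
    env = os.environ if env is None else env
    return env.get("ENABLE_EASYOCR_FALLBACK", "0").strip() == "1"
```

Passing a plain dict as env makes the helpers easy to exercise in tests without touching the real environment.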

Running

python app.py

The API runs on http://localhost:5005.

Swagger UI is available at:

http://localhost:5005/api-docs

For production-style serving:

gunicorn --bind 0.0.0.0:5005 --workers 4 --timeout 120 app:app

Endpoints

Health Check

GET /

Returns:

{
  "status": "healthy",
  "message": "Document OCR API is live"
}

Passport Extraction

POST /api/passport
POST /api/scan-passport

Form data:

file: passport image
country (optional): ISO-3166 alpha-3 country hint, for example NGA

Example (Windows cmd syntax; on Linux or macOS, replace the trailing ^ with \):

curl -X POST http://localhost:5005/api/passport ^
  -F "file=@passport.jpg"
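
The TD3 validation listed under Features typically relies on the check digits defined in ICAO Doc 9303. The sketch below shows that standard check-digit calculation (weights 7, 3, 1 repeating). It is not taken from this codebase, and the function names are hypothetical.

```python
WEIGHTS = (7, 3, 1)


def mrz_char_value(char):
    """Map an MRZ character to its numeric value per ICAO 9303."""
    if "0" <= char <= "9":
        return ord(char) - ord("0")
    if "A" <= char <= "Z":
        return ord(char) - ord("A") + 10
    if char == "<":
        return 0  # the filler character counts as zero
    raise ValueError(f"Invalid MRZ character: {char!r}")


def mrz_check_digit(field):
    """Weighted sum of character values, modulo 10."""
    total = sum(mrz_char_value(c) * WEIGHTS[i % 3] for i, c in enumerate(field))
    return total % 10
```

For the ICAO specimen document number L898902C3, mrz_check_digit returns 6, which matches the digit printed after that field in the specimen MRZ.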

NIN Extraction

POST /api/nin

Form data:

file: NIN card or slip image
country (optional): ISO-3166 alpha-3 country code. Defaults to NGA.

Bank Statement Extraction

POST /api/bank-statement

Form data:

file: PDF or image bank statement

Voter ID / Voter Card Extraction

POST /api/voter-id

Form data:

file: voter document image, or PDF with embedded text
country (optional): ISO-3166 alpha-3 country code. Defaults to NGA.

voter_id is the canonical processor name. Country metadata keeps local naming clear: Nigeria exposes VOTER_CARD, while Ghana exposes VOTER_ID.

Driver's License Extraction

POST /api/drivers-license

Form data:

file: driver's license image, or PDF with embedded text
country (optional): ISO-3166 alpha-3 country code. Defaults to NGA.

For the newer identity processors (voter ID and driver's license), the runtime flow is:

Flask route -> shared document processor -> text_extraction.py -> country parser

For example, /api/voter-id calls src/document_ocr/voter_id/processor.py. That processor calls src/document_ocr/text_extraction.py to convert the upload into text, then dispatches to src/countries/nigeria/voter_id.py or src/countries/ghana/voter_id.py.
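
The dispatch step at the end of that flow could look roughly like the sketch below. The parser functions are placeholders standing in for the country modules, and the dispatch table and function names are assumptions for illustration; the actual processor may be organized differently.

```python
from typing import Callable, Dict


def parse_nigeria_voter_card(text: str) -> dict:
    """Placeholder for src/countries/nigeria/voter_id.py (real parsing omitted)."""
    return {"local_document_code": "VOTER_CARD", "raw_text": text}


def parse_ghana_voter_id(text: str) -> dict:
    """Placeholder for src/countries/ghana/voter_id.py (real parsing omitted)."""
    return {"local_document_code": "VOTER_ID", "raw_text": text}


# Hypothetical dispatch table keyed by ISO-3166 alpha-3 country code.
VOTER_ID_PARSERS: Dict[str, Callable[[str], dict]] = {
    "NGA": parse_nigeria_voter_card,
    "GHA": parse_ghana_voter_id,
}


def dispatch_voter_id(text: str, country: str = "NGA") -> dict:
    """Send already-extracted text to the parser for the requested country."""
    code = country.strip().upper()
    try:
        parser = VOTER_ID_PARSERS[code]
    except KeyError:
        raise ValueError(f"No voter ID parser registered for {code}") from None
    return parser(text)
```

Keeping the table in one place means adding a country only requires a new parser module and one new entry.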

Country Metadata

GET /api/countries
GET /api/countries/{country_code}

Returns registered countries and their local identity document metadata. This is metadata only; a listed ID does not automatically mean an OCR parser exists for that exact ID yet.

Example:

curl http://localhost:5005/api/countries/NGA

Example response fragment:

{
  "success": true,
  "country": {
    "country_code": "NGA",
    "country_name": "Nigeria",
    "supported_identity_documents": [
      {"code": "NIN_CARD", "name": "National Identification Number card"},
      {"code": "VOTER_CARD", "name": "Permanent voter card"},
      {"code": "DRIVERS_LICENSE", "name": "Driver's license"}
    ]
  }
}

Webhook Forwarding

POST /api/webhooks/forward

Receives a raw request body, signs it with FORWARDER_SECRET, and forwards it to configured targets.

Forwarded requests include:

X-Timestamp
X-Signature
X-Source: webhook-forwarder
Content-Type, when provided by the original request
X-Request-Id or X-Correlation-Id, when provided

The forwarder keeps a short in-memory dedupe cache for repeated payloads.
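
A dedupe cache of this kind can be sketched as a small time-bounded map of payload digests. The class name, the 60-second default window, and the choice of SHA-256 as the cache key are all assumptions for illustration; the forwarder's actual cache may differ.

```python
import hashlib
import time


class DedupeCache:
    """Remember payload digests for a short window to detect repeats."""

    def __init__(self, ttl_seconds=60.0, clock=time.monotonic):
        self._ttl = ttl_seconds
        self._clock = clock
        self._seen = {}  # payload digest -> time it was first seen

    def is_duplicate(self, payload: bytes) -> bool:
        now = self._clock()
        # Drop expired entries so the cache stays small.
        self._seen = {k: t for k, t in self._seen.items() if now - t < self._ttl}
        digest = hashlib.sha256(payload).hexdigest()
        if digest in self._seen:
            return True
        self._seen[digest] = now
        return False
```

Because the cache lives in process memory, each worker process would keep its own copy, so duplicates that land on different workers would not be detected by a sketch like this.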

HMAC Authentication

The request auth decorator is present in src/core/auth.py. It expects:

X-Timestamp: current Unix timestamp
X-Signature: HMAC_SHA256(OCR_SECRET_KEY, "{timestamp}.{path}")

Authentication is currently bypassed in code while OCR behavior is being developed. Re-enable it before exposing the API publicly.
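
A client could produce the two headers described above along the following lines. This is a sketch under assumptions: the hex encoding of the digest, the 300-second clock-skew window, and the function names are not confirmed by the source, so check src/core/auth.py for the exact format before relying on it.

```python
import hashlib
import hmac
import time


def sign_request(secret: str, path: str, timestamp=None) -> dict:
    """Build X-Timestamp and X-Signature for the message '{timestamp}.{path}'."""
    ts = str(int(time.time()) if timestamp is None else timestamp)
    message = f"{ts}.{path}".encode("utf-8")
    # Hex digest is an assumption; the server may expect another encoding.
    signature = hmac.new(secret.encode("utf-8"), message, hashlib.sha256).hexdigest()
    return {"X-Timestamp": ts, "X-Signature": signature}


def verify_request(secret, path, headers, max_skew_seconds=300, now=None) -> bool:
    """Recompute the signature and compare it in constant time."""
    ts = headers.get("X-Timestamp", "")
    if not (ts.isascii() and ts.isdigit()):
        return False
    current = int(time.time()) if now is None else now
    if abs(current - int(ts)) > max_skew_seconds:
        return False  # reject stale or future-dated requests
    expected = sign_request(secret, path, int(ts))["X-Signature"]
    provided = headers.get("X-Signature", "")
    return hmac.compare_digest(expected.encode("utf-8"), provided.encode("utf-8"))
```

Because the path is part of the signed message, a signature issued for /api/nin does not verify against /api/passport.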

Extending The API

For a new document type:

Add a processor under src/document_ocr/<document_type>/processor.py.
Keep the processor response shape consistent: success, message, document_type, data, and optional diagnostics.
Add the route in src/api/routes.py.
Add focused tests for missing files, invalid inputs, and a known-good sample.
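
One way to keep the response shape consistent across processors is a small shared builder. The helper below is a hypothetical sketch, not an existing function in the codebase; only the field names come from the list above.

```python
from typing import Any, Dict, Optional


def build_response(
    success: bool,
    message: str,
    document_type: str,
    data: Optional[Dict[str, Any]] = None,
    diagnostics: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Return the common envelope; diagnostics is included only when provided."""
    response: Dict[str, Any] = {
        "success": success,
        "message": message,
        "document_type": document_type,
        "data": data,
    }
    if diagnostics is not None:
        response["diagnostics"] = diagnostics
    return response
```

A missing-file error could then be returned as build_response(False, "No file uploaded", "voter_id"), which keeps error and success payloads structurally identical.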

For country-specific logic:

Create a country package under src/countries/<country>/, for example src/countries/nigeria/.
Put country-specific aliases, supported document types, and validation helpers in that package.
Register the country's CountryProfile in src/countries/registry.py.
Keep shared OCR/parsing in the document processor.
Return country codes and validation details explicitly in the response.

Current country-specific support:

NGA / Nigeria
- Passport MRZ country-code alias correction, such as N6A or NG4 to NGA.
- Nigerian NIN card/slip metadata and parser support.
- Voter card parser support.
- Driver's license parser support.
- Additional local ID metadata: BVN and Tax Identification Number.
- Basic NIN format validation for exactly 11 digits.

GHA / Ghana
- Passport MRZ country-code alias correction, such as 6HA to GHA.
- Voter ID parser support.
- Driver's license parser support.
- Starter local ID metadata: Ghana Card, Tax Identification Number, and SSNIT number.
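
Two of the checks listed above, MRZ country-code alias correction and the 11-digit NIN format check, can be sketched as follows. The alias table holds only the examples named in the list, and representing it as a dict of misread code to corrected code is an assumption; the function names are hypothetical.

```python
import re

# Common OCR misreads mapped to the intended ISO-3166 alpha-3 code.
# Only the examples from the list above; a real table would likely hold more.
MRZ_COUNTRY_ALIASES = {"N6A": "NGA", "NG4": "NGA", "6HA": "GHA"}

# ASCII digits only, exactly 11 of them.
_NIN_PATTERN = re.compile(r"[0-9]{11}")


def normalize_mrz_country(code: str) -> str:
    """Correct a known misread; return other codes unchanged (uppercased)."""
    cleaned = code.strip().upper()
    return MRZ_COUNTRY_ALIASES.get(cleaned, cleaned)


def is_valid_nin(value: str) -> bool:
    """Basic format check: the whole value must be exactly 11 digits."""
    return _NIN_PATTERN.fullmatch(value.strip()) is not None
```

The NIN check is a format check only; it says nothing about whether the number was actually issued.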

Processor naming rule:

Use one canonical folder for the shared document family, such as document_ocr/voter_id.
Put local country names in src/countries/<country>/rules.py.
Put country-specific parsing differences in src/countries/<country>/<document>.py.

Example:

src/
  document_ocr/
    voter_id/
      processor.py
  countries/
    nigeria/
      voter_id.py      # Parses Nigeria Voter Card
      rules.py         # Exposes local code VOTER_CARD
    ghana/
      voter_id.py      # Parses Ghana Voter ID
      rules.py         # Exposes local code VOTER_ID

Example response fragment for country-aware endpoints:

{
  "country": {
    "country_code": "NGA",
    "country_name": "Nigeria",
    "supported": true,
    "checks": {
      "document_type_supported": true,
      "nin_format_valid": true
    }
  }
}

When adding another country, keep the shape similar to src/countries/ghana/rules.py:

from src.countries.profile import CountryProfile

COUNTRY_PROFILE = CountryProfile(
    code="ABC",
    name="Example Country",
    mrz_code_aliases={"ABC"},
    supported_identity_documents={
        "NATIONAL_ID": "National identity card",
        "VOTER_ID": "Voter identity card",
    },
)
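
For orientation, a registry that accepts such a profile might look like the sketch below. The dataclass mirrors only the four fields used in the example above, and the register and get_profile helpers are hypothetical; the real definitions in src/countries/profile.py and src/countries/registry.py may differ.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class CountryProfile:
    """Stand-in with the fields used in the example above."""
    code: str
    name: str
    mrz_code_aliases: frozenset = frozenset()
    supported_identity_documents: dict = field(default_factory=dict)


_REGISTRY = {}


def register(profile: CountryProfile) -> None:
    """Store the profile under its uppercased country code."""
    _REGISTRY[profile.code.upper()] = profile


def get_profile(code: str):
    """Return the registered profile, or None for an unknown code."""
    return _REGISTRY.get(code.strip().upper())
```

Lookups are case-insensitive, so a request with country=abc would resolve to the same profile as country=ABC.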

Tests

Install the test dependencies, then run the suite:

pip install -r requirements-dev.txt
python -m pytest tests -v

The included tests are smoke tests for route availability and basic error behavior. Full OCR accuracy tests should use controlled sample documents.

Security Notes

The API processes uploads in memory and does not persist documents by default.
Use a strong OCR_SECRET_KEY before production deployment.
Re-enable HMAC verification before public exposure.
Put rate limiting and upload-size limits at the reverse proxy or gateway layer.
Webhook logs redact common sensitive headers.

License

MIT
