Compliance and Quality Check of Technical and Product Software Documentation

AI-assisted compliance and quality assurance platform for software documentation.

Current implementation baseline:

Frontend: Next.js + React + TypeScript browser app in frontend/
Backend: FastAPI API in src/doc_quality/api/main.py
Persistence: PostgreSQL primary path for sessions, reviews, audit events, documents, and findings
Authentication: backend-issued HTTP-only session cookies with role-based authorization
Orchestration: optional standalone CrewAI orchestrator in services/orchestrator/

Business context summary

Software teams in regulated or audit-heavy environments (healthcare, fintech, enterprise SaaS or critical infrastructure) often lose significant time during releases because documentation quality and compliance checks are manual, inconsistent and late in the delivery cycle.

The Doc Quality Compliance Checker addresses this by introducing a structured, workflow-oriented system that:

checks technical documents against governance and quality standards,
improves consistency across SOP, architecture, and risk artifacts,
shortens review cycles for QA, compliance, and audit teams,
reduces release risk by surfacing gaps earlier,
provides better traceability for approvals and governance decisions.

Data privacy

As an additional feature, data privacy topics of this MVP are handled by a team of three people (Wiebke Meyer, Carol Willing, and me). Requirements and status are documented in the GitHub repository compliance-checker by Carol Willing. Insights from that documentation are used to improve this product. Due to the short timeframe, only selected aspects can be covered in this iteration.

Data privacy is a complex topic in its own right, and we are addressing it with inspiration and support from expert Katharine Jarmul.

Backend implementation tracking for AIUC-1 and OWASP Chapter A privacy controls is maintained in:

Data Privacy Violation Mitigation Checklist

Primary business value

Faster readiness for audits and reviews through standardized evidence quality.
Lower operational risk by detecting non-compliance before release gates.
Higher team productivity by reducing repetitive manual document checks.
Stronger governance visibility for product leads, QA, and compliance stakeholders.

Observability and AI quality telemetry (Admin — technical service)

The Observability admin module provides production-grade AI workflow tracing for QM leads, architects, and technical service teams performing quality inspections or preparing for external audits. It visualizes:

Quality summary KPIs — total observations, average quality score, P95 latency, and hallucination report count within a selectable 24 h / 7 d / 30 d window.
Quality aspect breakdown — pass / warn / fail counts and average scores per aspect (performance, accuracy, evaluation, hallucination, error).
Workflow component breakdown — per-component (research_agent, document_analyzer, compliance_checker) observation counts, outcome distribution, average latency, and latest-event timestamp, making it possible to isolate which pipeline stage introduced latency regressions or failure spikes.
Recent GenAI prompt/output pairs — full prompt, output, provider, model, and a rich trace payload (tokens used, temperature, latency, hallucination flag, and any additional metadata) for every LLM-backed flow.
Prometheus snapshot — HTTP request totals, hallucination report totals, and AI evaluation totals directly from the /metrics endpoint.

The page supports two explicit global modes via NEXT_PUBLIC_APP_MODE:

demo: representative mock telemetry for demonstrations
real: live database-backed telemetry without demo fallback

Stakeholder governance and rights management (Admin — QM)

The Stakeholders & Rights admin module supports QM leads and administrators in governing who holds which role and what rights they carry throughout regulated workflows. It provides:

Role-template matrix — permission toggles per stakeholder role (qm_lead, architect, riskmanager, auditor, developer) aligned with the RBAC matrix in frontend/lib/rbac.ts, ensuring UI-visible role constraints stay consistent with backend authorization enforcement.
Persistent employee assignment per role — name-to-role assignments are stored in PostgreSQL (stakeholder_employee_assignments table, Alembic migration 008). Both a single-add form and a bulk-add mode (one name per line, automatic deduplication, parallel async POST) are provided for faster team onboarding.
Audit-ready record keeping — each assignment carries created_at, created_by, and profile_id provenance fields, making the governance record queryable for inspection reports and audit evidence packages.

Both the Observability and Stakeholders & Rights modules are accessible from the Admin section of the left navigation bar and are protected by backend-enforced session authentication. They are designed to be the operational control surface for technical leads and QM personnel overseeing AI-assisted documentation workflows.

Product snapshot

Attached browser-page view of the Document Hub:

Frontend architecture note: why `frontend/lib/` is necessary

The browser app uses frontend/lib/ as a shared service and domain layer between UI pages/components and backend or mock data sources. This keeps feature behavior deterministic and avoids duplicating rules across many screens.

What `frontend/lib/` provides

API/service clients for backend communication (for example audit trail, dashboard, observability, exports, artifact operations).
View-model logic for filtering, KPI calculations, formatting, selection fallback rules, and domain mapping.
Cross-cutting infrastructure such as auth context/RBAC helpers and shared mock store state used in demo and fallback modes.
Shared UI behavior helpers such as selection style utilities and reusable filtering hooks.
URL query-state synchronization via shared query utilities to keep deep links and selected list/detail state stable and cleanup stale params.

Why this separation is important

Consistency: one implementation of business rules is reused across pages instead of copy/paste variants.
Maintainability: API contract changes are isolated to service files rather than scattered in many components.
Testability: logic can be unit-tested directly from lib modules without rendering full pages.
Deterministic routing behavior: shared query sync logic enforces consistent selection/deep-link handling across routed surfaces.
Incremental delivery: stubs in lib modules allow frontend flows to compile and run while backend integrations are completed.

Without frontend/lib/, page and component files would absorb API logic, domain transforms, routing normalization, and reusable UX rules, resulting in higher coupling, duplicated logic, and faster behavioral drift.

Getting Started

Testing

Quick testing entry points from the project root with an active .venv:

Run all backend tests: python -m pytest
Run one module quickly: python -m pytest tests/test_auth_session_api.py -v

For full testing details (scope, status, workflows, role ownership, and lifecycle mapping), see:

Database Setup (Phase 0 MVP)

Phase 0 requires PostgreSQL 16 for session authentication, HITL reviews, and compliance audit trails.

Quick Start (4 steps):

Start PostgreSQL (Docker: docker compose up -d | Local: install PostgreSQL 16 and start the service)
Initialize database: .\.venv\Scripts\python.exe init_postgres.py
Verify with login test (use AUTH_MVP_EMAIL / AUTH_MVP_PASSWORD from your .env)
Run tests: pytest tests/test_auth_session_api.py -v

📖 Database Setup Guide — Complete walkthrough with Docker/local/cloud options, troubleshooting, and schema details.

Also See:

Quick Command Reference — Copy/paste terminal commands
Full Setup Guide — Detailed configuration and verification steps
Infrastructure Overview — Schema, requirements alignment, deployment path
Application User Handbook — Operational guidance for stakeholders, including top menu controls and compliance relevance
Authentication and Authorization Guide — Implemented login, session, RBAC, throttling, recovery, and security-test concepts
Observability and Logging Guide — Structured logging, OpenTelemetry tracing, Prometheus metrics, quality evaluation telemetry, and compliance monitoring
Admin Dashboard Guide — Role governance, model policy management, admin UI components, and configuration tasks
Project Structure Guide — Complete tree-style overview of codebase layout with inline component descriptions

Start the application (regular workflow: database + backend + frontend)

Before using the app (login, dashboard, admin modules, or document workflows), complete this startup block first. Use three separate terminals so PostgreSQL, the API, and the UI run at the same time. Ensure Docker Engine is running before Step 1.

Step 1: Start the database

In a terminal opened at doc_quality_compliance_check/, run:

docker compose up -d
.\.venv\Scripts\python.exe init_postgres.py

Expected outcome:

PostgreSQL listens on localhost:5432
Schema initialization completes without errors
.env contains DATABASE_URL=postgresql+psycopg2://postgres:postgres@localhost:5432/doc_quality

Step 2: Start the backend API

In a second terminal opened at doc_quality_compliance_check/, run the recommended launcher:

.\scripts\start_backend.ps1 -Reload

Expected backend behavior:

Starts Uvicorn on 127.0.0.1:8000 if the port is free
Returns success when a healthy backend is already running on 8000
Fails fast if the port is occupied but /health is not responding

Expected output includes either:

Uvicorn running on http://127.0.0.1:8000
Backend already running and healthy on http://127.0.0.1:8000

Step 3: Start the frontend UI

In a third terminal, from the project root, run the recommended launcher:

.\scripts\start_frontend.ps1

Expected frontend behavior:

Starts npm run dev on localhost:3000 if the port is free
Returns success when a healthy frontend is already running on 3000
Fails fast if the port is occupied but the app is not responding
Always changes into frontend/ regardless of where the script is invoked from, preventing wrong-working-directory startup errors

Then open:

http://localhost:3000/login

Configure login credentials in .env for both regular and admin sessions:

AUTH_MVP_EMAIL=mvp-user@example.invalid
AUTH_MVP_PASSWORD=CHANGE_ME_BEFORE_USE
AUTH_MVP_ROLES=qm_lead

AUTH_ADMIN_EMAIL=admin@example.invalid
AUTH_ADMIN_PASSWORD=CHANGE_ME_ADMIN_BEFORE_USE
AUTH_ADMIN_ROLES=app_admin,qm_lead

Role notes:

AUTH_ADMIN_* credentials are bootstrapped by backend auth and can sign in through the same /login page
app_admin role grants access to admin policy and stakeholder governance write operations
qm_lead role additionally covers observability and broader governance surfaces

Step 4: Verify the full stack

Database is running on localhost:5432
Backend health (via proxy): http://localhost:3000/health
Backend health (direct): http://127.0.0.1:8000/health
frontend/.env.local has NEXT_PUBLIC_API_ORIGIN= (empty value, required for local proxy mode)
Admin auth bootstrap self-check (requires admin or qm_lead session): GET /api/v1/auth/bootstrap-self-check

Application start limitations and Q&A (after regular workflow)

This section captures startup limitations, fallback paths, and troubleshooting without changing the regular startup steps above.

Q: What is the backend fallback if the launcher script is not available or fails?
Use direct Uvicorn startup from doc_quality_compliance_check/:

.\.venv\Scripts\python.exe -m uvicorn src.doc_quality.api.main:app --host 127.0.0.1 --port 8000 --reload

Q: What is the frontend fallback if I do not use the launcher script?
Run from doc_quality_compliance_check/frontend/:

npm run dev

Q: Why can local login appear to succeed but immediately return to /login?
NEXT_PUBLIC_API_ORIGIN must stay empty in frontend/.env.local for local development. The frontend proxies /api/* and /health through Next.js to 127.0.0.1:8000, which keeps the session cookie same-origin (localhost:3000). Setting a direct cross-origin URL breaks SameSite=lax cookie delivery and can cause a silent redirect loop back to /login.

Q: How should I configure the Auth API status badge for local demos?
Keep NEXT_PUBLIC_ENABLE_AUTH_HEALTH_CHECK=true and set NEXT_PUBLIC_HEALTH_ORIGIN=http://127.0.0.1:8000 so the badge checks backend health directly instead of depending on the Next.js proxy path.

Q: The frontend shows blank pages or MODULE_NOT_FOUND after dependency/type changes. What should I do?
Clear the Next.js build cache and restart:

Remove-Item -Recurse -Force frontend\.next
.\scripts\start_frontend.ps1

Q: Port 3000 is occupied by a stuck dev server. How do I recover?
Stop the stale process and restart frontend:

Stop-Process -Id 2424 -Force
.\scripts\start_frontend.ps1

Q: Bridge run is blocked by runtime topology proof errors on local machines. What should I configure?
If you are running locally without dedicated deployed bridge-agent containers, set:

BRIDGE_RUNTIME_TOPOLOGY_SOURCE=docker_inspect
BRIDGE_RUNTIME_TOPOLOGY_ALLOW_METADATA_FALLBACK=true

This keeps docker inspect as the primary source but allows metadata fallback when probe data is unavailable. For production-grade strict attestation, set BRIDGE_RUNTIME_TOPOLOGY_ALLOW_METADATA_FALLBACK=false and ensure all four bridge-agent containers are deployed and healthy.

Full bridge runtime env reference checklist:

Bridge runtime .env checklist

Optional: start the orchestrator service

If you want to exercise the CrewAI orchestration runtime as well, use a fourth terminal.

Orchestrator terminal (from doc_quality_compliance_check/services/orchestrator/):
```
uv run python -m doc_quality_orchestrator
```
Verify orchestrator health:
- http://localhost:8010/health

The orchestrator is optional for the core login/dashboard/document flows. The backend remains the system of record and exposes the Skills API used by orchestrator workflows.

Password Recovery Flow

The login page now includes a production-style recovery path:

Open forgot-access route via /forgot-access
Request recovery token (generic anti-enumeration response)
Open reset-access route via /reset-access?token=...
Set new password, then sign in again at /login

Backend endpoints are implemented in auth route module:

POST /api/v1/auth/recovery/request
POST /api/v1/auth/recovery/verify
POST /api/v1/auth/recovery/reset

Security behavior includes hashed recovery tokens, TTL + single-use validation, per-IP/per-email throttling, session revocation on reset, and audit logging.

Security environment defaults (Phase 0 hardening)

Variable	Default	Production behavior
`SECRET_KEY`	`change-me-in-production`	Startup fails if unchanged
`SESSION_COOKIE_SECURE`	`false`	Forced to `true` when `ENVIRONMENT != development`
`AUTH_RECOVERY_DEBUG_EXPOSE_TOKEN`	`false`	Token/reset URL stays hidden unless explicitly enabled in development
`GLOBAL_RATE_LIMIT_ENABLED`	`true`	Global `/api/v1/*` request limiting enforced
`AUTH_LOGIN_RATE_LIMIT_COUNT`	`8`	Login throttling + lockout policy enabled
`BRIDGE_RUNTIME_TOPOLOGY_ALLOW_METADATA_FALLBACK`	`true` (local), `false` (recommended production)	Controls whether bridge runtime can fall back to metadata proof if docker inspect topology probes fail
`AUTH_ADMIN_EMAIL`	`admin@example.invalid`	Bootstrap admin identity for controlled admin login
`AUTH_ADMIN_ROLES`	`app_admin,qm_lead`	Grants admin module access and governance privileges

Authorization matrix (browser users vs service clients)

Endpoint group	Browser session roles	Service API key (`service`)
`/api/v1/skills/*`	`qm_lead`, `architect`, `riskmanager`, `auditor`	Allowed (explicit machine endpoints)
`/api/v1/observability/*`	`qm_lead`, `architect`, `riskmanager`, `auditor`	Allowed (quality telemetry + evaluation ingestion)
`/api/v1/reports/*`	`qm_lead`, `riskmanager`, `auditor`	Denied
`/api/v1/compliance/*`	`qm_lead`, `architect`, `riskmanager`, `auditor`	Denied
`/api/v1/research/*`	`qm_lead`, `architect`, `riskmanager`, `auditor`	Denied
`/api/v1/auth/me`	Any authenticated browser session	Denied (session-only endpoint)

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
.cursor		.cursor
.github		.github
.vscode		.vscode
AAMAD		AAMAD
docs		docs
frontend		frontend
migrations		migrations
project-context		project-context
scripts		scripts
services/orchestrator		services/orchestrator
src		src
templates		templates
tests		tests
.env.example		.env.example
.env.postgresql.example		.env.postgresql.example
.gitignore		.gitignore
ADMIN_README.md		ADMIN_README.md
AGENTS.md		AGENTS.md
ALIGNMENT_REVIEW.md		ALIGNMENT_REVIEW.md
APP_USER_HANDBOOK.md		APP_USER_HANDBOOK.md
AUTHENTICATION_AUTHORIZATION_README.md		AUTHENTICATION_AUTHORIZATION_README.md
CHECKLIST.md		CHECKLIST.md
CREWAI_BEST_PRACTICES_ASSESSMENT.md		CREWAI_BEST_PRACTICES_ASSESSMENT.md
DATABASE_README.md		DATABASE_README.md
DATA_PRIVACY_VIOLATION_MITIGATION_README.md		DATA_PRIVACY_VIOLATION_MITIGATION_README.md
DOCUMENT_UPLOAD_PERSISTENCE.md		DOCUMENT_UPLOAD_PERSISTENCE.md
HITL_PERSISTENCE_CHANGE_SUMMARY.md		HITL_PERSISTENCE_CHANGE_SUMMARY.md
HITL_PERSISTENCE_FIX.md		HITL_PERSISTENCE_FIX.md
HITL_PERSISTENCE_VERIFICATION.md		HITL_PERSISTENCE_VERIFICATION.md
HITL_QUICK_REFERENCE.md		HITL_QUICK_REFERENCE.md
IMPLEMENTATION_PLAN.md		IMPLEMENTATION_PLAN.md
LICENSE		LICENSE
OBSERVABILITY_LOGGING_README.md		OBSERVABILITY_LOGGING_README.md
PERSISTENCE_FIX.md		PERSISTENCE_FIX.md
POSTGRES_INFRASTRUCTURE_SETUP.md		POSTGRES_INFRASTRUCTURE_SETUP.md
POSTGRES_SETUP.md		POSTGRES_SETUP.md
POSTGRES_SETUP_QUICKSTART.md		POSTGRES_SETUP_QUICKSTART.md
PROJECT_STRUCTURE.md		PROJECT_STRUCTURE.md
README.md		README.md
SEARCH_CONCEPT_README.md		SEARCH_CONCEPT_README.md
TESTING_README.md		TESTING_README.md
conftest.py		conftest.py
doc_quality.db		doc_quality.db
docker-compose.yml		docker-compose.yml
init_postgres.py		init_postgres.py
package-lock.json		package-lock.json
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Compliance and Quality Check of Technical and Product Software Documentation

Business context summary

Data privacy

Primary business value

Observability and AI quality telemetry (Admin — technical service)

Stakeholder governance and rights management (Admin — QM)

Product snapshot

Frontend architecture note: why `frontend/lib/` is necessary

What `frontend/lib/` provides

Why this separation is important

Getting Started

Testing

Database Setup (Phase 0 MVP)

Start the application (regular workflow: database + backend + frontend)

Step 1: Start the database

Step 2: Start the backend API

Step 3: Start the frontend UI

Step 4: Verify the full stack

Application start limitations and Q&A (after regular workflow)

Optional: start the orchestrator service

Password Recovery Flow

Security environment defaults (Phase 0 hardening)

Authorization matrix (browser users vs service clients)

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Compliance and Quality Check of Technical and Product Software Documentation

Business context summary

Data privacy

Primary business value

Observability and AI quality telemetry (Admin — technical service)

Stakeholder governance and rights management (Admin — QM)

Product snapshot

Frontend architecture note: why frontend/lib/ is necessary

What frontend/lib/ provides

Why this separation is important

Getting Started

Testing

Database Setup (Phase 0 MVP)

Start the application (regular workflow: database + backend + frontend)

Step 1: Start the database

Step 2: Start the backend API

Step 3: Start the frontend UI

Step 4: Verify the full stack

Application start limitations and Q&A (after regular workflow)

Optional: start the orchestrator service

Password Recovery Flow

Security environment defaults (Phase 0 hardening)

Authorization matrix (browser users vs service clients)

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Frontend architecture note: why `frontend/lib/` is necessary

What `frontend/lib/` provides

Packages