Safe, config-driven Python web ingestion pipeline with extraction, evidence-gated AI generation, provenance ledger, RAG chunks, data cards, and multi-provider exports.
python pypi provenance gemini web-scraping openai developer-tools python-package audit-trail html-extraction structured-output rag research-software ai-pipeline llm trafilatura deepseek web-ingestion data-cards ssrf-protection
-
Updated
May 23, 2026 - Python