GraphLens

Starter template for a Python-based data project with:

RAG pipelines
Vector database integrations
Streamlit or Gradio frontends
Local scripts, notebooks, and test scaffolding

Project Structure

GraphLens/
├── app/                    # Frontend entry points
├── server_backend/         # FastAPI backend (API endpoints for transcript/ingest/query)
├── config/                 # YAML / TOML runtime configs
├── data/                   # Local project data (gitignored except placeholders)
├── docs/                   # Notes, architecture docs, ADRs
├── notebooks/              # Exploration notebooks
├── scripts/                # CLI helpers for ingest, index, eval
├── src/graphlens/          # Main Python package (core logic)
├── tests/                  # Test suite
├── .env.example            # Environment variable template
├── .gitignore
└── pyproject.toml

Quick Start

Create a virtual environment.
Install the project in editable mode: pip install -e .
Copy .env.example to .env and fill in provider keys.
Start a UI:
- Streamlit: streamlit run app/streamlit_app.py
- Gradio: python app/gradio_app.py

server backend

FastAPI Backend (API)

The FastAPI backend exposes a simple RAG pipeline as HTTP endpoints so the frontend can call it directly.

Main endpoints (v1):

POST /api/v1/transcript → YouTube URL → timestamped transcript (normalized)

POST /api/v1/ingest → transcript/URL → chunks + embeddings → stored in vector DB

POST /api/v1/query → question → retrieves chunks from vector DB → returns answer

Run the API locally:

uvicorn server_backend.main:app --reload

Swagger docs:

http://127.0.0.1:8000/docs

Code layout:

server_backend/ → API layer (routes + schemas)

src/graphlens/ → core logic (transcripts/chunking/embeddings/vector store)

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
app		app
config		config
data		data
evaluation		evaluation
scripts		scripts
server_backend		server_backend
src		src
tests		tests
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Tiktoken_working_explanation.txt		Tiktoken_working_explanation.txt
pyproject.toml		pyproject.toml
reliability_model_slide.pdf		reliability_model_slide.pdf
reliability_model_slide.png		reliability_model_slide.png
requirements.txt		requirements.txt
spacy_understanding.txt		spacy_understanding.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GraphLens

Project Structure

Quick Start

server backend

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GraphLens

Project Structure

Quick Start

server backend

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages