Dream RAG Research Project

A generalized model of dreaming in a retrieval-augmented generation (RAG) system. The project builds on the kwaai-rag ingestion and knowledge-graph stack, with the goal of adding a "dream loop" that discovers cross-document links, completes entity schemas, and refines the graph during idle time.

Implementation language: Python. The `rust implementations/` directory holds the Rust reference sources from kwaai-rag and is not part of the active development path.

Authors

Christopher J. Mayfield
Reza Rassool
Jourdane Hamilton
Annika Vriens
Aman Avinash
Maira Khwaja

Publication

Paper in progress: How'd you sleep, bro? A Dreaming Retrieval-Augmented Generation Architecture Through the Lens of the Free Energy Principle (Overleaf)

Progress

Done

The core kwaai-rag pipeline has been ported to Python. Each module lives in a top-level .py file.

| Module | Description |
| --- | --- |
| `document.py` | Extract plain text from `.txt`, `.md`, `.pdf`, `.docx`, `.doc`, and other common formats |
| `chunker.py` | Text chunking: character-level sliding window and paragraph-semantic strategies |
| `doc_schema.py` | Document schema definitions, section matching, and auto-detection (YAML-driven) |
| `embedder.py` | Async HTTP client for the Ollama embedding API (`nomic-embed-text`, 768-dim) |
| `meta_store.py` | Per-tenant chunk metadata and file-sync tracking (SQLite) |
| `ner.py` | Lightweight proper-noun pre-screening and pronoun resolution (no external NLP deps) |
| `gliner.py` | Thin async client for a GLiNER NER server; injects high-confidence person spans into extraction prompts |
| `graph.py` | Knowledge graph with entity nodes, directed relations, LLM-based extraction, and SQLite persistence |
| `ingestion.py` | End-to-end pipeline: chunk → embed → upload, with optional knowledge-graph extraction |
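To make the two chunking strategies named for `chunker.py` concrete, here is a minimal sketch. The function names, parameters, and defaults are hypothetical and are not taken from `chunker.py`; the paragraph packer is a simplified stand-in for the "paragraph-semantic" strategy, whose exact rules are not described here.

```python
from typing import List


def sliding_window_chunks(text: str, size: int = 200, overlap: int = 50) -> List[str]:
    """Split text into fixed-size character windows that overlap by `overlap` chars."""
    if size <= 0:
        raise ValueError("size must be positive")
    if not 0 <= overlap < size:
        raise ValueError("overlap must satisfy 0 <= overlap < size")
    step = size - overlap
    chunks: List[str] = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + size])
        if start + size >= len(text):
            break  # this window already reaches the end of the text
    return chunks


def paragraph_chunks(text: str, max_chars: int = 200) -> List[str]:
    """Greedily pack blank-line-separated paragraphs into chunks of at most max_chars.

    A single paragraph longer than max_chars is kept whole (assumption of this sketch).
    """
    chunks: List[str] = []
    current = ""
    for para in (p.strip() for p in text.split("\n\n")):
        if not para:
            continue
        candidate = f"{current}\n\n{para}" if current else para
        if current and len(candidate) > max_chars:
            chunks.append(current)  # close the current chunk, start a new one
            current = para
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

The overlap trades storage for context: a larger overlap makes it less likely that a sentence relevant to a query is split across two chunks with neither half retrievable on its own.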

Ingestion pipeline (ingestion.py):

1. Extract text from a document
2. Split into chunks (configurable strategy, overlap, and surrounding context)
3. Embed chunks via Ollama
4. Store chunk metadata in meta_store
5. Optionally extract entities and relations into the knowledge graph (LLM + GLiNER + NER hints)
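The chunk, embed, and store steps above can be sketched as follows. This is an illustration under stated assumptions, not the code in `ingestion.py`: `chunk_text`, `toy_embed`, `ingest_text`, the `chunk_meta` table, and the in-memory `vectors` dict are all hypothetical names, and the embedder is a stub standing in for the Ollama call.

```python
import sqlite3
from typing import Callable, Dict, List, Sequence, Tuple


def chunk_text(text: str, size: int, overlap: int) -> List[str]:
    """Character-level sliding window; assumes 0 <= overlap < size."""
    if not text:
        return []
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]


def toy_embed(text: str) -> List[float]:
    """Stub embedder (assumption): a real run would call the embedding service."""
    return [float(len(text)), float(text.count(" "))]


def ingest_text(
    conn: sqlite3.Connection,
    vectors: Dict[Tuple[str, int], List[float]],
    doc_id: str,
    text: str,
    embed: Callable[[str], Sequence[float]],
    size: int = 200,
    overlap: int = 50,
) -> int:
    """Chunk the text, embed each chunk, and record metadata. Returns the chunk count."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS chunk_meta ("
        "doc_id TEXT, chunk_index INTEGER, start_char INTEGER, length INTEGER, "
        "PRIMARY KEY (doc_id, chunk_index))"
    )
    step = size - overlap
    chunks = chunk_text(text, size, overlap)
    for index, chunk in enumerate(chunks):
        # "Upload" step: here the vector store is just a dict keyed by (doc, chunk).
        vectors[(doc_id, index)] = list(embed(chunk))
        conn.execute(
            "INSERT OR REPLACE INTO chunk_meta VALUES (?, ?, ?, ?)",
            (doc_id, index, index * step, len(chunk)),
        )
    conn.commit()
    return len(chunks)
```

Passing the embedder in as a callable keeps the pipeline testable without a running embedding service; the optional graph-extraction step is omitted from this sketch.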

graph.py currently covers basic ingestion and extraction. The kwaai-rag reference (rust implementations/graph.rs) includes additional graph maintenance and dream-loop hooks that still need to be ported.

Quick start

1. Install dependencies

cd dreamrag
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

2. Start Ollama and pull the embedding model

ollama pull nomic-embed-text
python -m dreamrag check
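For orientation, an embedding request to a local Ollama server might look like the sketch below. It uses Ollama's `/api/embeddings` endpoint on the default port 11434; the helper names are hypothetical, and the sketch uses the standard library (`urllib` run in a worker thread) rather than the `aiohttp` client the project depends on, purely to stay dependency-free. It is not the code behind `python -m dreamrag check`.

```python
import asyncio
import json
import urllib.request
from typing import List

OLLAMA_URL = "http://localhost:11434"  # Ollama's default address (assumed unchanged)


def build_embed_request(
    text: str, model: str = "nomic-embed-text", base_url: str = OLLAMA_URL
) -> urllib.request.Request:
    """Build the POST request for a single embedding."""
    payload = json.dumps({"model": model, "prompt": text}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/api/embeddings",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed_sync(text: str, model: str = "nomic-embed-text", timeout: float = 30.0) -> List[float]:
    """Blocking call: send the request and return the embedding vector."""
    with urllib.request.urlopen(build_embed_request(text, model), timeout=timeout) as resp:
        return json.load(resp)["embedding"]


async def embed(text: str, model: str = "nomic-embed-text") -> List[float]:
    """Async wrapper that runs the blocking call in a worker thread."""
    return await asyncio.to_thread(embed_sync, text, model)


# Usage (requires a running Ollama server with the model pulled):
#   vector = asyncio.run(embed("hello dream"))
#   print(len(vector))
```

If the server is not running, `urlopen` raises `urllib.error.URLError`, which is a quick way to tell a connectivity problem from a missing model.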

3. Add documents

Place files in data/documents/ (.txt, .md, .pdf, .docx, etc.) or pass paths directly.

4. Ingest

# Chunk + embed + store metadata and vectors
python -m dreamrag ingest

# Also extract a knowledge graph (slower; uses your Ollama LLM)
python -m dreamrag ingest --graph --llm-model llama3.1:8b

# Ingest specific files
python -m dreamrag ingest path/to/file.pdf another.md

5. Check status

python -m dreamrag status

Ingested data is stored under data/store/ (chunk metadata, vectors, and optional knowledge graph).

TODO

1. Project wiring (run end-to-end)

Add requirements.txt with pinned dependencies
Add a minimal CLI (python -m dreamrag ingest)
Add scripts/gliner_server.py (referenced by gliner.py but not yet in this repo)
Wire up a concrete vector store for chunk embeddings (vector_store.py)
Organize modules into a proper Python package (e.g. move top-level modules into dreamrag/)

2. Dream loop (core research goal)

Implement dream.py — a background task runner that operates during idle time
Cross-link discovery — find entities shared across documents/chunks via GraphStore.all_chunk_entity_pairs()
Relation completion — infer missing relations from graph structure and evidence chunks
Entity schema completion — fill schema.org fields (birthDate, addressLocality, etc.) for low-confidence entities
Post-dream graph refinement — dedup merges, relation sanitization, confidence rescoring
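As a rough illustration of the cross-link discovery step, the sketch below groups entities by the documents they appear in and ranks document pairs by the number of shared entities. The row shape `(doc_id, chunk_id, entity_id)` is an assumption about what `GraphStore.all_chunk_entity_pairs()` could return once ported; the function names are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations
from typing import Dict, Iterable, List, Set, Tuple

Pair = Tuple[str, str, str]  # (doc_id, chunk_id, entity_id): assumed row shape


def cross_document_entities(pairs: Iterable[Pair], min_docs: int = 2) -> Dict[str, List[str]]:
    """Map each entity to the sorted documents it appears in, keeping multi-document ones."""
    docs_by_entity: Dict[str, Set[str]] = defaultdict(set)
    for doc_id, _chunk_id, entity_id in pairs:
        docs_by_entity[entity_id].add(doc_id)
    return {
        entity: sorted(docs)
        for entity, docs in docs_by_entity.items()
        if len(docs) >= min_docs
    }


def linked_document_pairs(pairs: Iterable[Pair]) -> List[Tuple[Tuple[str, str], int]]:
    """Rank document pairs by how many entities they share (most shared first)."""
    counts: Counter = Counter()
    for docs in cross_document_entities(pairs).values():
        for a, b in combinations(docs, 2):
            counts[(a, b)] += 1
    return counts.most_common()
```

Document pairs with many shared entities are natural candidates for the later relation-completion pass, since their evidence chunks can be compared directly.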

3. Retrieval and query layer

Chunk vector search over embedded chunks
Hybrid retrieval combining vector search with graph neighbors
Query interface (query.py?) that ties retrieval + graph context together for generation
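One possible shape for hybrid retrieval is sketched below: a brute-force cosine search over chunk vectors, followed by a one-hop expansion through the knowledge graph. All names and the dict-based inputs are hypothetical; the planned query layer may look quite different.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def vector_search(
    query_vec: Sequence[float], chunk_vecs: Dict[str, Sequence[float]], k: int = 3
) -> List[Tuple[str, float]]:
    """Return the k chunk ids most similar to the query, best first."""
    scored = sorted(((cosine(query_vec, v), cid) for cid, v in chunk_vecs.items()), reverse=True)
    return [(cid, score) for score, cid in scored[:k]]


def hybrid_retrieve(
    query_vec: Sequence[float],
    chunk_vecs: Dict[str, Sequence[float]],
    chunk_entities: Dict[str, List[str]],
    entity_neighbors: Dict[str, List[str]],
    entity_chunks: Dict[str, List[str]],
    k: int = 2,
) -> List[str]:
    """Vector hits first, then chunks that mention a graph neighbor of a hit's entities."""
    selected = [cid for cid, _score in vector_search(query_vec, chunk_vecs, k)]
    for cid in list(selected):  # iterate over a copy; only expand the vector hits
        for entity in chunk_entities.get(cid, ()):
            for neighbor in entity_neighbors.get(entity, ()):
                for extra in entity_chunks.get(neighbor, ()):
                    if extra not in selected:
                        selected.append(extra)
    return selected
```

The expansion can surface chunks with low vector similarity to the query that are nonetheless connected through an entity relation, which is the case pure vector search misses.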

4. Complete `graph.py`

graph.py covers basic ingestion but is missing most graph maintenance logic from the kwaai-rag reference. Port the following into Python:

search_entities — entity retrieval by embedding
bfs_neighbors, entity_chunks — graph traversal for RAG
find_dedup_candidates* — entity deduplication (exact, fuzzy, name-structure, etc.)
merge_entity_into, unmerge_alias — canonical entity merging
sanitize_relations — clean up bad or inferred relations
coref_candidates_for_chunk — coreference resolution
all_chunk_entity_pairs — cross-link discovery for the dream loop
set_schema_type, set_document_titles — dream completion helpers
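Two of the functions listed above lend themselves to a short illustration. The sketch below is not a port of the reference implementation: the signatures, the adjacency-dict input, and the similarity threshold are assumptions, and the fuzzy matcher uses `difflib.SequenceMatcher` as a stand-in for whatever the reference uses.

```python
from collections import deque
from difflib import SequenceMatcher
from itertools import combinations
from typing import Dict, List, Sequence, Tuple


def bfs_neighbors(adjacency: Dict[str, Sequence[str]], start: str, max_depth: int = 2) -> Dict[str, int]:
    """Breadth-first traversal; returns each reachable entity with its hop distance."""
    depth_of = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        depth = depth_of[node]
        if depth == max_depth:
            continue  # do not expand beyond the depth limit
        for neighbor in adjacency.get(node, ()):
            if neighbor not in depth_of:
                depth_of[neighbor] = depth + 1
                queue.append(neighbor)
    depth_of.pop(start)  # the start entity is not its own neighbor
    return depth_of


def fuzzy_dedup_candidates(names: Sequence[str], threshold: float = 0.85) -> List[Tuple[str, str, float]]:
    """Return name pairs whose case-insensitive similarity ratio meets the threshold."""
    candidates = []
    for a, b in combinations(names, 2):
        ratio = SequenceMatcher(None, a.lower(), b.lower()).ratio()
        if ratio >= threshold:
            candidates.append((a, b, ratio))
    return candidates
```

The pairwise comparison is quadratic in the number of entity names, so a real implementation would likely block candidates first (for example by shared tokens) before scoring them.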

5. Tests and examples

Unit and integration tests for core modules (pytest)
Example document in data/documents/sample.txt
Sample doc schemas (YAML) to exercise doc_schema
CI pipeline (GitHub Actions)

6. Dreaming

Assess what the dream loop should optimize for, such as graph completion and storage compression
Gather more background on the neuroscience of dreaming
Define evaluation metrics

Repository layout

document.py       # Text extraction
chunker.py        # Chunking strategies
doc_schema.py     # Section schemas
embedder.py       # Ollama embeddings
meta_store.py     # Chunk/sync metadata
vector_store.py   # Chunk embedding storage
ner.py            # Proper-noun & pronoun handling
gliner.py         # GLiNER NER client
graph.py          # Knowledge graph store & extraction
ingestion.py      # Full ingestion pipeline
dreamrag/         # CLI (`python -m dreamrag`)
data/documents/   # Drop files here for ingestion
data/store/       # Generated databases (gitignored)

rust implementations/   # kwaai-rag reference (not actively maintained)
  *.rs

Dependencies

Python packages — aiohttp (embedder, graph, gliner, ingestion), pdfminer.six (PDF extraction), pyyaml (doc schemas).

External services — Ollama (embeddings), an LLM inference endpoint (graph extraction), and optionally a GLiNER NER server.

Ideas

- Evaluate recall, not just the context that was pulled
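One common way to measure this is recall@k: the fraction of known-relevant chunks that appear in the top k retrieved results. The sketch below assumes a labeled set of relevant chunk ids per query, which this project does not yet have; the function name is hypothetical.

```python
from typing import Iterable, Sequence


def recall_at_k(retrieved: Sequence[str], relevant: Iterable[str], k: int) -> float:
    """Fraction of relevant ids found among the first k retrieved ids."""
    relevant_set = set(relevant)
    if not relevant_set:
        return 0.0  # convention of this sketch: no relevant items means recall 0
    top_k = set(retrieved[:k])
    return len(top_k & relevant_set) / len(relevant_set)
```

For example, with retrieved ids `["c3", "c1", "c9", "c4"]` and relevant ids `{"c1", "c4"}`, recall@2 is 0.5 and recall@4 is 1.0. Comparing recall before and after a dream pass would be one way to quantify whether the added links help retrieval.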
