feat: improve GraphRAG extraction quality and remove HuggingFace provider#9

Merged
aksOps merged 2 commits into main from claude/fix-codecontext-config-DR15O
Mar 20, 2026

@aksOps commented on Mar 20, 2026

No description provided.

claude added 2 commits March 20, 2026 04:35
…ider

- Drop HuggingFace TGI provider; keep only Azure OpenAI and Ollama
- Add gleaning (multi-pass entity extraction) inspired by Microsoft GraphRAG
  - Default 1 gleaning pass catches 30-50% more entities
  - Configurable via indexing.max_gleanings in config
- Improve entity extraction prompt with few-shot examples, expanded entity
  types (Document, Metric, Process), weight guidance, and implicit
  relationship extraction instructions
- Add entity name normalization (case-insensitive, whitespace-collapsed)
  to prevent duplicate entities like "Apple Inc" vs "apple inc"
- Add relationship deduplication by (source, target, predicate) to prevent
  duplicate edges in the knowledge graph
- Fix Louvain modularity gain formula to use the standard calculation:
  ΔQ = [k_i_in/(2m)] - [sigma_tot * k_i / (2m²)]
  The previous formula was missing the removal cost term
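
The gleaning bullet above can be illustrated with a minimal sketch. This is not the code from this PR: `ExtractFunc`, `extractWithGleaning`, and `fakeExtract` are hypothetical names, the LLM call is replaced by a stub, and the early-stop rule (stop when a pass adds nothing new) is an assumption. Only the idea of an initial pass plus up to `max_gleanings` follow-up passes comes from the commit message.

```go
package main

import "fmt"

// ExtractFunc is a hypothetical stand-in for one LLM extraction call.
// pass is 0 for the initial extraction and 1..n for gleaning passes;
// known lists the entities found so far.
type ExtractFunc func(text string, pass int, known []string) []string

// extractWithGleaning runs the initial pass plus up to maxGleanings
// follow-up passes and merges the results without duplicates.
// Assumption: a gleaning pass that adds nothing new ends the loop early.
func extractWithGleaning(text string, maxGleanings int, extract ExtractFunc) []string {
	seen := make(map[string]bool)
	var entities []string
	for pass := 0; pass <= maxGleanings; pass++ {
		added := 0
		for _, e := range extract(text, pass, entities) {
			if !seen[e] {
				seen[e] = true
				entities = append(entities, e)
				added++
			}
		}
		if pass > 0 && added == 0 {
			break
		}
	}
	return entities
}

// fakeExtract simulates a model that misses one entity on the first pass.
func fakeExtract(text string, pass int, known []string) []string {
	switch pass {
	case 0:
		return []string{"Azure OpenAI", "Ollama"}
	case 1:
		return []string{"Ollama", "langchaingo"}
	default:
		return nil
	}
}

func main() {
	fmt.Println(extractWithGleaning("doc", 0, fakeExtract))
	fmt.Println(extractWithGleaning("doc", 1, fakeExtract))
}
```

With `maxGleanings` set to 0 the stub yields two entities; one gleaning pass recovers the third.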

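The normalization and deduplication bullets above can be sketched as follows. The `Relationship` type and both function names are hypothetical, and normalizing the predicate as well as the endpoints is an assumption; the commit message only states that entity names are compared case-insensitively with whitespace collapsed, and that edges are deduplicated by (source, target, predicate).

```go
package main

import (
	"fmt"
	"strings"
)

// Relationship is a hypothetical edge type used only for this sketch.
type Relationship struct {
	Source, Target, Predicate string
}

// normalizeName lowercases a name and collapses runs of whitespace,
// so "Apple   Inc" and "apple inc" map to the same key.
func normalizeName(name string) string {
	return strings.ToLower(strings.Join(strings.Fields(name), " "))
}

// dedupRelationships keeps the first edge seen for each
// (source, target, predicate) triple, comparing normalized values.
func dedupRelationships(rels []Relationship) []Relationship {
	seen := make(map[[3]string]struct{}, len(rels))
	out := make([]Relationship, 0, len(rels))
	for _, r := range rels {
		key := [3]string{
			normalizeName(r.Source),
			normalizeName(r.Target),
			normalizeName(r.Predicate), // assumption: predicates are normalized too
		}
		if _, ok := seen[key]; ok {
			continue
		}
		seen[key] = struct{}{}
		out = append(out, r)
	}
	return out
}

func main() {
	rels := []Relationship{
		{"Apple Inc", "iPhone", "produces"},
		{"apple   inc", "iPhone", "produces"},
		{"Apple Inc", "iPhone", "sells"},
	}
	fmt.Println(normalizeName("  Apple   Inc "))
	fmt.Println(len(dedupRelationships(rels)))
}
```

Keeping the first occurrence preserves the original spelling of the earliest mention; merging weights or descriptions across duplicates would be a separate design choice.
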
https://claude.ai/code/session_011Ryet7uu9j6VyzNGmUuaaj
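
The modularity gain formula quoted in the first commit message can be written out as a small function. This is a sketch of the formula as stated there, not the repository's implementation; `modularityGain`, `candidate`, and `bestCommunity` are hypothetical names. Following the usual Louvain definitions, `kIIn` is the weight of edges from node i into the candidate community, `sigmaTot` is the total weight of edges incident to that community, `kI` is the weighted degree of node i, and `m` is the total edge weight of the graph.

```go
package main

import "fmt"

// modularityGain evaluates ΔQ = k_i_in/(2m) - sigma_tot*k_i/(2m²),
// the gain formula quoted in the commit message.
func modularityGain(kIIn, sigmaTot, kI, m float64) float64 {
	return kIIn/(2*m) - sigmaTot*kI/(2*m*m)
}

// candidate describes one neighboring community (hypothetical type).
type candidate struct {
	ID       int
	KIIn     float64
	SigmaTot float64
}

// bestCommunity returns the ID of the candidate with the largest
// strictly positive gain, or -1 if no move improves modularity.
func bestCommunity(cands []candidate, kI, m float64) int {
	best, bestGain := -1, 0.0
	for _, c := range cands {
		if g := modularityGain(c.KIIn, c.SigmaTot, kI, m); g > bestGain {
			best, bestGain = c.ID, g
		}
	}
	return best
}

func main() {
	cands := []candidate{
		{ID: 0, KIIn: 2, SigmaTot: 4},
		{ID: 1, KIIn: 1, SigmaTot: 10},
	}
	fmt.Println(bestCommunity(cands, 3, 10))
}
```

In this example the first candidate has gain 2/20 - 12/200 = 0.04 and the second has 1/20 - 30/200 = -0.10, so the node would move to community 0.
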
Comprehensive guide for completing the langchaingo migration in a new
sandbox with network access. Includes exact code for provider adapter,
chunker rewrite, API notes, and step-by-step instructions.

https://claude.ai/code/session_011Ryet7uu9j6VyzNGmUuaaj
@aksOps merged commit d7e4bac into main on Mar 20, 2026
8 checks passed
@aksOps deleted the claude/fix-codecontext-config-DR15O branch on April 23, 2026 09:50