RAG System with Milvus and LangChain

A Retrieval-Augmented Generation (RAG) system that combines the Milvus vector database with LangChain and OpenAI models to answer questions over your own documents.

System Overview

This system implements a RAG architecture that grounds its answers in your documents by:

Storing document embeddings in Milvus vector database
Retrieving relevant context when queried
Generating human-like responses using OpenAI's language models

Components

Vector Store (src/vector_store/milvus_store.py)

Uses Milvus for efficient vector similarity search
Stores document embeddings using OpenAI's embedding model
Handles document addition and retrieval operations
Configurable connection settings for Milvus database
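To make "vector similarity search" concrete, here is a minimal sketch of the idea using a brute-force in-memory store. It is an illustration only: `InMemoryVectorStore` and `cosine_similarity` are hypothetical names, the three-dimensional vectors are made up, and cosine similarity is assumed as the metric. The actual store in src/vector_store/milvus_store.py delegates indexing and search to Milvus.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector has no direction, so treat it as dissimilar
    return dot / (norm_a * norm_b)


class InMemoryVectorStore:
    """Toy stand-in for a vector store: brute-force search over a list."""

    def __init__(self):
        self._rows = []  # (embedding, text) pairs

    def add(self, embedding, text):
        self._rows.append((list(embedding), text))

    def search(self, query_embedding, k=2):
        """Return the k most similar texts as (score, text), best first."""
        scored = [
            (cosine_similarity(query_embedding, emb), text)
            for emb, text in self._rows
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return scored[:k]


# Made-up 3-dimensional "embeddings"; real embeddings have far more dimensions.
store = InMemoryVectorStore()
store.add([1.0, 0.0, 0.0], "Milvus stores vectors.")
store.add([0.0, 1.0, 0.0], "LangChain builds chains.")
store.add([0.9, 0.1, 0.0], "Milvus supports similarity search.")

results = store.search([1.0, 0.0, 0.0], k=2)
for score, text in results:
    print(f"{score:.3f}  {text}")
```

A dedicated vector database replaces the linear scan above with an index, which is what keeps retrieval fast as the number of stored embeddings grows.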

RAG Chain (src/rag/rag_chain.py)

Implements the core RAG logic
Integrates an OpenAI chat model for response generation
Manages the retrieval-generation pipeline
Configurable temperature and other LLM parameters
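The retrieval-generation pipeline can be sketched as two steps: fetch the most relevant chunks, then pass them to the model inside a prompt. The sketch below uses plain callables so it runs without any services. `build_prompt`, `answer`, `fake_retrieve`, and `fake_generate` are hypothetical names, and the prompt wording is an assumption, not the template used in src/rag/rag_chain.py.

```python
def build_prompt(question, context_chunks):
    """Combine retrieved chunks and the user question into one prompt string."""
    context = "\n\n".join(context_chunks)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\n"
        "Answer:"
    )


def answer(question, retrieve, generate, k=3):
    """Run retrieval, then generation.

    retrieve(question, k) -> list of text chunks
    generate(prompt)      -> model reply as a string
    """
    chunks = retrieve(question, k)
    prompt = build_prompt(question, chunks)
    return generate(prompt)


# Stand-ins for the vector store lookup and the LLM call (assumptions).
def fake_retrieve(question, k):
    corpus = [
        "Milvus listens on port 19530 by default.",
        "Documents are split into chunks before embedding.",
        "The default temperature is 0.2.",
    ]
    return corpus[:k]


def fake_generate(prompt):
    return f"(model reply to a {len(prompt)}-character prompt)"


print(answer("Which port does Milvus use?", fake_retrieve, fake_generate, k=2))
```

Passing the retriever and generator in as arguments keeps the pipeline logic testable without a running Milvus instance or an API key.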

Document Processing (main.py)

Handles document loading and chunking
Configurable chunk size and overlap
Supports text documents (expandable to other formats)
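Chunk size and overlap interact as follows: each chunk holds at most `chunk_size` characters, and each new chunk starts `chunk_size - chunk_overlap` characters after the previous one. The function below is a simplified character-based sketch of that behavior. `chunk_text` is a hypothetical name, and the splitter used in main.py may instead split on separators such as paragraphs.

```python
def chunk_text(text, chunk_size=1000, chunk_overlap=0):
    """Split text into fixed-size character chunks with optional overlap."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be >= 0 and smaller than chunk_size")

    step = chunk_size - chunk_overlap  # distance between chunk start positions
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # this chunk already reaches the end of the text
        start += step
    return chunks


# Small values make the overlap visible: neighbors share one character.
print(chunk_text("abcdefghij", chunk_size=4, chunk_overlap=1))

# With the documented defaults (1000, 0), a 2500-character text gives 3 chunks.
print([len(c) for c in chunk_text("x" * 2500)])
```

A non-zero overlap repeats the end of one chunk at the start of the next, which can help when a relevant sentence straddles a chunk boundary, at the cost of storing more embeddings.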

Prerequisites

Python 3.12+
Docker and Docker Compose
OpenAI API key

Installation

Clone the repository:

git clone [repository-url]
cd RAG-System

Create a virtual environment:

python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Set up environment variables:

# Create .env file with your OpenAI API key
echo "OPENAI_API_KEY=your_key_here" > .env
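If you want to confirm that the key is picked up before running the system, a small check like the following can help. It is a sketch using only the standard library: `load_env_file` and `get_openai_api_key` are hypothetical helpers, and the project itself may load `.env` differently (for example through a dotenv package).

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Parse simple KEY=VALUE lines from a file into a dict.

    Blank lines, comment lines, and lines without '=' are skipped.
    """
    values = {}
    env_path = Path(path)
    if not env_path.is_file():
        return values
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def get_openai_api_key(path=".env"):
    """Prefer the process environment, then fall back to the .env file."""
    key = os.environ.get("OPENAI_API_KEY") or load_env_file(path).get("OPENAI_API_KEY")
    if not key or key == "your_key_here":
        raise RuntimeError("OPENAI_API_KEY is not set; add it to .env or export it")
    return key


try:
    api_key = get_openai_api_key()
    # Print only the length so the secret never ends up in logs.
    print("OPENAI_API_KEY found, length:", len(api_key))
except RuntimeError as exc:
    print(exc)
```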

Start Milvus services:

docker-compose up -d
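The containers can take a little while to become ready. As a quick sanity check, the sketch below tests whether anything accepts TCP connections on the default Milvus port (19530, from the Configuration section). `is_port_open` is a hypothetical helper. An open port only shows that the service is listening, not that Milvus has finished initializing.

```python
import socket


def is_port_open(host="localhost", port=19530, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    if is_port_open():
        print("Milvus port 19530 is accepting connections")
    else:
        print("Nothing is listening on localhost:19530 yet")
```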

Usage

Add your documents to the data directory:

mkdir -p data
echo "Your document content here" > data/test_document.txt

Run the system:

python main.py

Local CI commands

Run tests with coverage:

pytest -q --maxfail=1 --disable-warnings --cov=src --cov-report=term-missing

Run linting:

flake8

Configuration

Milvus Settings

Host: localhost (default)
Port: 19530 (default)
Configurable in src/config/settings.py

Document Processing

Chunk size: 1000 (default)
Chunk overlap: 0 (default)
Configurable in src/config/settings.py

LLM Settings

Model: OpenAI chat model
Temperature: 0.2 (default)
Configurable in src/config/settings.py
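One way to keep these defaults in a single place is a small settings object that can also be overridden from environment variables. The sketch below is an assumption about how such a module could look, not the contents of src/config/settings.py. The class name, field names, and environment variable names (`MILVUS_HOST`, `MILVUS_PORT`, `CHUNK_SIZE`, `CHUNK_OVERLAP`, `LLM_TEMPERATURE`) are all hypothetical; only the default values come from the lists above.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class Settings:
    """Defaults as documented in the Configuration section."""

    milvus_host: str = "localhost"
    milvus_port: int = 19530
    chunk_size: int = 1000
    chunk_overlap: int = 0
    temperature: float = 0.2

    @classmethod
    def from_env(cls, env=None):
        """Build settings, letting environment variables override defaults.

        The variable names used here are illustrative assumptions.
        """
        env = os.environ if env is None else env
        defaults = cls()
        return cls(
            milvus_host=env.get("MILVUS_HOST", defaults.milvus_host),
            milvus_port=int(env.get("MILVUS_PORT", defaults.milvus_port)),
            chunk_size=int(env.get("CHUNK_SIZE", defaults.chunk_size)),
            chunk_overlap=int(env.get("CHUNK_OVERLAP", defaults.chunk_overlap)),
            temperature=float(env.get("LLM_TEMPERATURE", defaults.temperature)),
        )


# Passing a plain dict instead of os.environ makes the override easy to try.
settings = Settings.from_env({"MILVUS_PORT": "19531"})
print(settings)
```

Freezing the dataclass prevents accidental mutation of settings after start-up, so every component sees the same values.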

Use Cases

This RAG system is ideal for:

Document question-answering
Knowledge base augmentation
Contextual information retrieval
Research assistance
Technical documentation queries

Docker Services

The system uses several Docker containers:

Milvus standalone server
Etcd for metadata storage
MinIO for object storage

Access MinIO console:

URL: http://localhost:9015
Credentials: minioadmin/minioadmin

Extending the System

The system can be extended with:

Additional document types (PDF, HTML, etc.)
Custom embedding models
Alternative vector databases
Web interface
Batch processing
Caching mechanisms
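As an illustration of the caching idea, the sketch below wraps an embedding function so that identical texts are embedded only once. `CachedEmbedder` and `toy_embed` are hypothetical names, and the toy function stands in for a real embedding call. Nothing here reflects existing code in the repository.

```python
import hashlib


class CachedEmbedder:
    """Wrap an embedding function with an in-memory cache keyed by text hash."""

    def __init__(self, embed_fn):
        self._embed_fn = embed_fn
        self._cache = {}
        self.misses = 0  # number of times the wrapped function was called

    def embed(self, text):
        key = hashlib.sha256(text.encode("utf-8")).hexdigest()
        if key not in self._cache:
            self.misses += 1
            self._cache[key] = self._embed_fn(text)
        return self._cache[key]


def toy_embed(text):
    """Stand-in for a real embedding call (assumption): two crude features."""
    return [float(len(text)), float(text.count(" "))]


embedder = CachedEmbedder(toy_embed)
first = embedder.embed("hello world")
second = embedder.embed("hello world")  # served from the cache
print(embedder.misses)
```

With a paid embedding API, a cache like this avoids paying twice for unchanged documents when the ingestion step is re-run.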

