Pdf-Q-A

Overview

This project is an AI-powered document Q&A system that allows users to upload PDFs, extract text, store embeddings in a database, and retrieve relevant answers using vector search. It uses Langchain (embedding), PostgreSQL (PgVector), and Streamlit to create an interactive interface for document-based querying.

Features

PDF Upload & Text Extraction – Extracts text from uploaded PDFs using PyPDF. Chunking for Efficient Retrieval – Splits long text into smaller chunks for better search results. AI-Powered Embeddings – Converts text into numerical vectors using SentenceTransformers. Vector Database Storage – Stores embeddings in PostgreSQL with PgVector for fast similarity search. Question Answering System – Matches user queries with relevant document chunks based on vector similarity. Streamlit UI – Provides an easy-to-use web interface for uploading PDFs and asking questions.

Tools and Frameworks

Frontend: Streamlit (for UI & user interaction) Backend: Python, LangChain (for text processing & chunking) Machine Learning: SentenceTransformers (all-MiniLM-L6-v2) for text embeddings Database: PostgreSQL with PgVector for efficient vector search Libraries Used: PyPDF, SentenceTransformers, psycopg2, Streamlit, LangChain

Setup

Install python dependencies (requirements.txt)
Set up PostgreSQL and PgVector.
Edit configurations in database.py and run the file to create the database
Run the application streamlit run app.py

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
README.md		README.md
app.py		app.py
database.py		database.py
embedding.py		embedding.py
pdf_processing.py		pdf_processing.py
query.py		query.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pdf-Q-A

Overview

Features

Tools and Frameworks

Setup

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Pdf-Q-A

Overview

Features

Tools and Frameworks

Setup

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages