The PDF Converse tool can be used to upload PDFs and effectively answer all queries. With this tool, users can collaborate and engage in discussions within the context of the document itself.
- Upload PDF files: Users can easily upload their PDF files using the user-friendly interface.
- Extract text: The application extracts the text content from the uploaded PDF files using PyPDF2 library.
- Text processing and embeddings: The code utilizes langchain's CharacterTextSplitter to segment the text into manageable chunks.
- Building the knowledge base: The text chunks and embeddings are stored in Chroma, leveraging FAISS for efficient similarity search, enhancing retrieval accuracy.
- Clone the repository: git clone https://github.com/Bhavd33p/PDF-Converse.git
- pip install -r requirements.txt.
- After Installing all dependencies you need to install ollama from https://ollama.com/ for Command Line
- Then use these command to fetch the model :
~ ollama pull nomic-embed-text
~ ollama run mistral
Run the Application by
- streamlit run pdfconverse.py
if Axios error 403 occurs upon uploading pdf use this : streamlit run pdfconverse.py --server.enableXsrfProtection false - Upload your PDF file using upload pdf section in sidebar.
- Ask a query about the PDF content in the chat options.
- The application will display the answer based on the uploaded PDF file.
The extension version is also available , you can use it by uploading chrome_extension folder to developer mode .
There are two python files:
~ pdfconverse.py {This is best model for this PS and application will run on this}.
~ chat.py {This is CLI based model}.