RAG with Mistral 7B

This code sets up an interactive chat system:

It loads documents from a URL, creates a FAISS vector store for them, and initializes a language model.
RAG for PDF is also available using persist chroma DB
Users can input messages, and the system responds using the language model.
The conversation history is stored in a python list, and the loop continues until the user inputs 'exit'.
Uses mistral-7b-instruct-v0.1.Q5_K_M.gguf for LLM (you need to download it into the repo to use it)
llama-cpp-python and ctransformers either can be used for LLM inference
For PDF RAG system, streamlit for UI is also used. For website data RAG, cli is used as the interface.

To simply run the PDFs RAG project:

Download supported gguf model from HuggingFace. Place the model file in the folder.
Install packages in your activated environment

pip install -r requirement.txt

cd "pdf inference"
python ingest.py

streamlit run app.py

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
URL inference		URL inference
pdf inference		pdf inference
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback