The objective of this project is to build a chatbot that answers users' health questions. It is a Retrieval-Augmented Generation (RAG) implementation built on an open-source stack.

The LLM behind the chatbot is BioMistral, an open-source LLM fine-tuned for the medical domain. A quantized version of the model is used so that the application can run locally on a CPU.

PubMedBERT is the embedding model. It is fine-tuned using sentence-transformers, performs strongly on medical-domain tasks compared with other sentence-transformer models, and produces a 768-dimensional dense vector for each chunk of text.

Qdrant, a self-hosted open-source vector database, is used for storing the vectors. LangChain and llama.cpp serve as the orchestration framework.
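To make the "dense vector" idea concrete, retrieval in this kind of stack boils down to comparing embedding vectors by cosine similarity. The sketch below uses tiny 4-dimensional toy vectors as stand-ins for the 768-dimensional PubMedBERT embeddings; the vectors and function are illustrative only.

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity between two dense embedding vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy 4-dimensional stand-ins for 768-dimensional PubMedBERT vectors.
query = [0.1, 0.3, 0.5, 0.1]
doc_close = [0.1, 0.3, 0.5, 0.2]   # semantically similar document
doc_far = [0.9, -0.2, 0.0, 0.1]   # unrelated document

print(cosine_similarity(query, doc_close) > cosine_similarity(query, doc_far))  # True
```

The vector database performs exactly this kind of comparison (at scale, with indexing) to find the chunks most relevant to a query.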
The data folder contains the documents used for creating the vectors. I have used 2 PDF files, but you can use any number of files. The data in these files is converted to vectors and stored in Qdrant.
The initial step is to create vector embeddings of all the documents; this is done by the script ingest.py. When the user inputs a query, the vectors with the highest similarity are retrieved from the vector database and given to the LLM as context. This helps keep the context length of the input small. To add memory, we use ConversationalRetrievalChain: the previous query and response are passed to the LLM so that it can rewrite the new query as a standalone question. This gives the LLM more context and produces better responses.
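The query-rewriting step can be sketched in plain Python. The prompt wording and the function name below are illustrative, not the exact ones ConversationalRetrievalChain uses internally:

```python
def condense_question(chat_history, question):
    """Build a prompt asking the LLM to rewrite a follow-up question as a
    standalone question (the memory mechanism described above).
    chat_history is a list of (user, assistant) turns; all names are illustrative."""
    history_text = "\n".join(
        f"Human: {q}\nAssistant: {a}" for q, a in chat_history
    )
    return (
        "Given the following conversation and a follow-up question, "
        "rephrase the follow-up question to be a standalone question.\n\n"
        f"Chat history:\n{history_text}\n\n"
        f"Follow-up question: {question}\nStandalone question:"
    )

history = [("What is hypertension?",
            "Hypertension is persistently high blood pressure.")]
print(condense_question(history, "How is it treated?"))
```

The rewritten standalone question (e.g. "How is hypertension treated?") is then embedded and used for retrieval, so follow-up questions still find the right documents.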
Step 1: Set up a virtual environment
In order to run the application, you have to create a new Python virtual environment.
python3 -m venv venv
source venv/bin/activate
Step 2: Install all the requirements
pip install -r requirements.txt
Step 3: Setup Qdrant
I am using the Docker image of Qdrant. If you are following this method, you need Docker installed.
docker pull qdrant/qdrant
docker run -p 6333:6333 -p 6334:6334 \
-v $(pwd)/qdrant_storage:/qdrant/storage:z \
qdrant/qdrant
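If you prefer Docker Compose, the `docker run` command above translates to roughly the following (the service name and volume path are assumptions; adjust to taste):

```yaml
# docker-compose.yml — equivalent of the docker run command above
services:
  qdrant:
    image: qdrant/qdrant
    ports:
      - "6333:6333"   # REST API
      - "6334:6334"   # gRPC
    volumes:
      - ./qdrant_storage:/qdrant/storage:z
```

Then start it with `docker compose up -d`.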
Step 4: Download the quantized version of the LLM
Download the quantized version of BioMistral from here to the project directory.
Step 5: Ingest data into the Qdrant collection
python3 ingest.py
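Conceptually, ingest.py loads the PDFs, splits them into overlapping chunks, embeds each chunk with PubMedBERT, and upserts the vectors into Qdrant. The chunking step can be sketched as below; the function name and the chunk/overlap sizes are illustrative, not necessarily the values ingest.py uses:

```python
def split_text(text, chunk_size=700, overlap=70):
    # Split a document into overlapping character chunks before embedding.
    # Overlap keeps sentences that straddle a boundary retrievable from
    # either neighboring chunk.
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - overlap
    return chunks

document = "x" * 2000          # stand-in for extracted PDF text
pieces = split_text(document)
print(len(pieces))             # number of chunks to embed and upsert
```

Each chunk is then embedded into a 768-dimensional vector and stored in Qdrant alongside its source text.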
Step 6: Start the FastAPI server
uvicorn main:app