GRAG - Good RAG

GRAG is a simple python package that provides an easy end-to-end solution for implementing Retrieval Augmented Generation (RAG).

The package offers an easy way for running various LLMs locally, Thanks to LlamaCpp and also supports vector stores like Chroma and DeepLake. It also makes it easy to integrage support to any vector stores easy.

Diagram of a basic RAG pipeline

Project Overview

A ready to deploy RAG pipeline for document retrival.
Basic GUI (Under Development)
Evaluation Suite (Under Development)
RAG enhancement using Graphs (Under Development)

Getting Started

To run the projects, make sure the instructions below are followed.

Further customization can be made on the config file, src/config.ini.

git clone the repository
pip install . from the repository (note: add - then change directory to the cloned repo)
For Dev: pip install -e .

Requirements

Required packages to install includes (refer to pyproject.toml):

PyTorch
LangChain
Chroma
Unstructured.io
sentence-embedding
instructor-embedding

LLM Models

To quantize model, run: python -m grag.quantize.quantize

For more details, go to .\llm_quantize\readme.md Tested models:

Llama-2 7B, 13B
Mixtral 8x7B
Gemma 7B

Model Compatibility

Refer to llama.cpp Supported Models (under Description) for list of compatible models.

Supported Vector Databases

1. Chroma

Since Chroma is a server-client based vector database, make sure to run the server.

To run Chroma locally, move to src/scripts then run source run_chroma.sh. This by default runs on port 8000.
If Chroma is not run locally, change host and port under chroma in src/config.ini.

2. Deeplake

For more information refer to Documentation.

Name		Name	Last commit message	Last commit date
Latest commit History 683 Commits
.github/workflows		.github/workflows
ci		ci
cookbook		cookbook
demo		demo
documentation		documentation
full_report		full_report
llm_quantize		llm_quantize
presentation		presentation
projects/Basic-RAG		projects/Basic-RAG
proposal		proposal
research_paper		research_paper
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.ini		config.ini
default_config.ini		default_config.ini
pyproject.toml		pyproject.toml
requirements.yml		requirements.yml

License

arjbingly/grag

Folders and files

Latest commit

History

Repository files navigation

GRAG - Good RAG

Table of Content

Project Overview

Getting Started

Requirements

LLM Models

Supported Vector Databases

About

Topics

Resources

License

Stars

Watchers

Forks

Languages