AI-Playground

AI Playground for trying out LLM Models, Embeddings, Vector Stores, Semantic Search, RAG, Azure OpenAI, LLaMa, Mistral

Installation

pip install -U ai-playground

Local Installation

Pre-requisites:

Python 3.10+ and pip

# Start virtual environment
source ./activate

# Install requirements
pip install -r requirements.txt

Running the full playground

Copy .env.example to .env and fill in the values
Run the following command to start the server

python ai_playground.py

Models

Llama 2 - https://huggingface.co/TheBloke/Llama-2-7b-Chat-GGUF
Llama 3 Instruct - https://huggingface.co/lmstudio-community/Meta-Llama-3-8B-Instruct-GGUF/tree/main

wget -c https://huggingface.co/TheBloke/Mistral-7B-v0.1-GGUF/resolve/main/mistral-7b-v0.1.Q8_0.gguf wget -c https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GGUF/resolve/main/mistral-7b-instruct-v0.1.Q8_0.gguf wget -c https://huggingface.co/TheBloke/Mistral-7B-OpenOrca-GGUF/resolve/main/mistral-7b-openorca.Q8_0.gguf

wget -c https://huggingface.co/TheBloke/Wizard-Vicuna-7B-Uncensored-GGUF/resolve/main/Wizard-Vicuna-7B-Uncensored.Q8_0.gguf

wget -c https://huggingface.co/TheBloke/CodeLlama-7B-Instruct-GGUF/resolve/main/codellama-7b-instruct.Q8_0.gguf

Running Individual things

Use StarCoder

pip install transformers pip install torch torchvision pip install accelerate bitsandbytes pip install accelerate[torch]

Edit:

load_in_8bit=True

python starcoder.py

will download ~60 GB of model

Try with LLaMA.cpp

Extract LLaMA.cpp zip to bin/ directory

./bin/main.exe -m models/llama-2-7b-chat.Q8_0.gguf

Try with vLLM

pip install -U vllm

python -u -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --model mistralai/Mistral-7B-v0.1

Try with FastChat

pip install -U fastchat

python -m fastchat.serve.openai_api_server --host localhost --port 8000

Try with LeptonAI

pip install -U leptonai

Try with ollama

echo "FROM ./models/llama-2-13b-chat.Q5_K_M.gguf" > llama-2-13b-chat.Modelfile

ollama create llama2-13b-chat -f ./llama-2-13b-chat.Modelfile

ollama run llama2-13b-chat

ollama ps

Specs

RAM Required:

Model Size	RAM Required
3B	8 GB
7B	16 GB
13B	32 GB

Chat UIs

OpenWebUI
ChatBotUI
OpenUI
AnythingLLM
LobeChat

Agents

https://github.com/joaomdmoura/crewAI

Other Tools

https://github.com/outlines-dev/outlines
guidance

Development Notes

pip install pyautogen

pip install openplayground
openplayground run

ollama run mistral

pip install -U jina

Ray Serve
pip install "ray[serve]"
https://github.com/ray-project/ray-llm

txtai

MLC AI - https://mlc.ai/package/
pip install --pre --force-reinstall mlc-ai-nightly mlc-chat-nightly -f https://mlc.ai/wheels
python -m mlc_chat.rest 

OpenLLM


https://github.com/FlowiseAI/Flowise


wget https://gpt4all.io/models/ggml-gpt4all-j.bin -O models/ggml-gpt4all-j

https://github.com/go-skynet/LocalAI
docker pull quay.io/go-skynet/local-ai:latest

nlpcloud

curl "https://api.nlpcloud.io/v1/<model_name>/entities" \
  -H "Authorization: Token <token>" \
  -H "Content-Type: application/json" \
  -X POST \
  -d '{"text":"John Doe has been working for Microsoft in Seattle since 1999."}'


https://github.com/microsoft/semantic-kernel
https://github.com/microsoft/guidance


https://skypilot.readthedocs.io/

Later:
https://github.com/Arize-ai/phoenix
https://github.com/explodinggradients/ragas
https://github.com/trypromptly/LLMStack


Q5_K_M



poetry export -f requirements.txt --output requirements.txt


lazypredict

mito

pip install langchain-serve

LangServe

pip install -U "langserve[all]"
pip install -U langchain-cli


langflow run


flowise

promptflow
pip install promptflow promptflow-tools


# DSPy
pip install dspy-ai



https://github.com/ShreyaR/guardrails
https://github.com/guardrails-ai/guardrails



guidance
https://github.com/1rgs/jsonformer

LangChain
https://github.com/jina-ai/langchain-serve

LangFlow / Flowise / LangSmith
ChainLit

promptflow


LMQI
https://github.com/eth-sri/lmql

https://github.com/zilliztech/GPTCache

https://github.com/argilla-io/argilla

https://github.com/vllm-project/vllm

https://github.com/TransformerOptimus/SuperAGI

accelerate
  - accelerate config
  - accelerate env
bitsandbytes
wand
https://github.com/huggingface/text-generation-inference


ctransformers

spacy
spacy-llm
gorilla-cli
https://github.com/langgenius/dify
gptcache

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.vscode		.vscode
anything-llm @ 1135853		anything-llm @ 1135853
chatbot-ui @ 937739f		chatbot-ui @ 937739f
models		models
pages		pages
.editorconfig		.editorconfig
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
activate		activate
ai_playground.py		ai_playground.py
autogen_agents.py		autogen_agents.py
autotrain.py		autotrain.py
chroma.py		chroma.py
codegen.py		codegen.py
config.py		config.py
dolly.py		dolly.py
embeddings.py		embeddings.py
falcon.py		falcon.py
langchain.py		langchain.py
langflow.py		langflow.py
litellm.py		litellm.py
llama-2-13b-chat.Modelfile		llama-2-13b-chat.Modelfile
llama_ctransformers.py		llama_ctransformers.py
lobe-chat.docker-compose.yml		lobe-chat.docker-compose.yml
mistral.py		mistral.py
nlpcloud.py		nlpcloud.py
open_playground.py		open_playground.py
poetry.lock		poetry.lock
poetry.toml		poetry.toml
pyproject.toml		pyproject.toml
qdrant.py		qdrant.py
serge.docker-compose.yml		serge.docker-compose.yml
starcoder.py		starcoder.py
txtai.yml		txtai.yml
txtai_app.py		txtai_app.py
vicuna_ctransformers.py		vicuna_ctransformers.py
vllm_app.py		vllm_app.py

License

vs4vijay/AI-Playground

Folders and files

Latest commit

History

Repository files navigation