Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory
-
Updated
May 11, 2024 - Python
Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory
Unify Efficient Fine-Tuning of 100+ LLMs
End to End Generative AI Industry Projects on LLM Models with Deployment
This repo contains everything about transformers and NLP.
Firefly: 大模型训练工具,支持训练Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Tuning the Finetuning: An exploration of achieving success with QLoRA
🐳 Aurora is a [Chinese Version] MoE model. Aurora is a further work based on Mixtral-8x7B, which activates the chat capability of the model's Chinese open domain.
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
qwen-1.5-1.8B sentiment analysis with prompt optimization and qlora fine-tuning
🐋MindChat(漫谈)——心理大模型:漫谈人生路, 笑对风霜途
Kickstart with LLMs
META LLAMA3 GENAI Real World UseCases End To End Implementation Guide
Genie in the Box: Distill Whisper STT => Mistral-7B => Phind/Phind-CodeLlama-34B-v2 => GPT 3.5 => Coqui's TTS/OpenAI TTS
Add a description, image, and links to the qlora topic page so that developers can more easily learn about it.
To associate your repository with the qlora topic, visit your repo's landing page and select "manage topics."