LLaMA.MMEngine

😋Training LLaMA with MMEngine!


LLaMA.MMEngine is an experimental repository that leverages the MMEngine training engine, originally designed for computer vision tasks, to train and fine-tune language models. The primary goal of this project is to explore the compatibility of MMEngine with language models, learn about fine-tuning techniques, and engage with the open-source community for knowledge sharing and collaboration.

🤩 Features

  • Support for loading LLaMA models with parameter sizes ranging from 7B to 65B
  • Instruct tuning support
  • Low-rank adaptation (LoRA) fine-tuning support (a minimal sketch of the idea follows below)
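
For background, LoRA freezes the pretrained weight matrices and learns a small low-rank update on top of them, so only a tiny fraction of the parameters are trained. Below is a minimal sketch of the idea in PyTorch; it is illustrative only, not this repo's actual implementation:

import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    # Wrap a frozen linear layer with a trainable low-rank update:
    # y = W x + (alpha / r) * B A x, with A of shape (r, in) and B of shape (out, r).
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        self.base.weight.requires_grad_(False)  # pretrained weights stay frozen
        self.lora_a = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: no change at start
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.lora_a.T @ self.lora_b.T)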

🏃 Todo-List

  • Int8 quantization support
  • Improve the generation script
  • Show validation loss during training

👀 Getting Started

Installation

  1. Install PyTorch

    Follow the official installation guide: https://pytorch.org/get-started/locally/

  2. Setup this repo

    Clone the repo

    git clone https://github.com/RangiLyu/llama.mmengine
    cd llama.mmengine

    Install dependencies

    pip install -r requirements.txt

    Run setup.py

    python setup.py develop
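
To sanity-check the installation, the following snippet should run without errors (a minimal check, not part of the repo):

# quick sanity check that the core dependencies import cleanly
import torch
import mmengine

print('torch:', torch.__version__)
print('mmengine:', mmengine.__version__)
print('CUDA available:', torch.cuda.is_available())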

Get pre-trained LLaMA models

Please download the model weights from the official LLaMA repo.

The checkpoints folder should look like this:

checkpoints/llama
├── 7B
│   ├── checklist.chk
│   ├── consolidated.00.pth
│   └── params.json
├── 13B
│   ...
├── tokenizer_checklist.chk
└── tokenizer.model

Convert the weights (thanks to Lit-LLaMA for the conversion script):

python scripts/convert_checkpoint.py \
    --output_dir checkpoints/mm-llama \
    --ckpt_dir checkpoints/llama \
    --tokenizer_path checkpoints/llama/tokenizer.model \
    --model_size 7B
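
If you want to verify the conversion, you can inspect the output with plain PyTorch. The file name below is hypothetical; use whatever path convert_checkpoint.py actually writes under checkpoints/mm-llama:

import torch

# hypothetical output path; adjust to the file the conversion script produces
state_dict = torch.load('checkpoints/mm-llama/7B/state_dict.pth', map_location='cpu')
print(len(state_dict), 'tensors, e.g.', next(iter(state_dict)))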

LoRA fine-tuning

python tools/train.py configs/llama-7B_finetune_3e.py
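
MMEngine configs are plain Python files. The sketch below shows the general shape such a config tends to take; the field values and type names are placeholders, not the actual contents of configs/llama-7B_finetune_3e.py:

# Hypothetical sketch of an MMEngine-style fine-tuning config.
# Field names follow MMEngine conventions; the concrete types and
# values here are assumptions, not this repo's actual settings.
model = dict(type='LLaMA', checkpoint='checkpoints/mm-llama/7B', lora_rank=8)

train_dataloader = dict(
    batch_size=4,
    num_workers=2,
    dataset=dict(type='InstructionDataset', data_path='data/alpaca.json'),
)

optim_wrapper = dict(optimizer=dict(type='AdamW', lr=3e-4, weight_decay=0.01))

# the "3e" in the config name suggests 3 epochs of training
train_cfg = dict(by_epoch=True, max_epochs=3)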

Inference

python tools/generate.py configs/llama-7B_finetune_3e.py work_dirs/llama-7B_finetune_3e/epoch_3.pth
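
If you prefer to drive the model programmatically rather than through tools/generate.py, a rough sketch using MMEngine's config and checkpoint utilities might look like this; the model's generation interface is an assumption, so check the generate script for the real API:

from mmengine.config import Config
from mmengine.registry import MODELS
from mmengine.runner import load_checkpoint

# build the model from the same config used for training
cfg = Config.fromfile('configs/llama-7B_finetune_3e.py')
model = MODELS.build(cfg.model)

# load the fine-tuned weights produced by tools/train.py
load_checkpoint(model, 'work_dirs/llama-7B_finetune_3e/epoch_3.pth')
model.eval()

# from here, call the model's generation method; see tools/generate.py
# for the actual prompt handling and sampling logic.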

🤗 Contributing

I greatly appreciate your interest in contributing to LLaMA.MMEngine! Please note that this is a personal side project, so the time available for development and support is limited. With that in mind, I encourage the community to get involved and contribute by submitting pull requests!

Acknowledgements

This project builds on MMEngine and adapts the checkpoint conversion script from Lit-LLaMA.
