LoRA: Low-Rank Adaptation of Large Language Models

LoRA is a training method that accelerates the training of large language models while consuming less memory. It adds pairs of rank-decomposition weight matrices (called update matrices) to existing weights, and only trains those newly added weights.
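As a concrete sketch of what this looks like in practice, the snippet below wraps a BLOOM checkpoint with LoRA adapters using the Hugging Face peft library. This is only an illustration under assumptions: the base checkpoint (bigscience/bloom-560m) and the rank, alpha, and dropout values are placeholders, not necessarily the configuration used in this repository.

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, TaskType, get_peft_model

# Placeholder base model; any BLOOM checkpoint works the same way.
model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")

# Rank-decomposition (update) matrices are injected into the attention projections;
# "query_key_value" is the fused attention projection module in BLOOM.
lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                 # rank of the update matrices (assumed value)
    lora_alpha=16,       # scaling factor alpha (assumed value)
    lora_dropout=0.05,   # assumed value
    target_modules=["query_key_value"],
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the LoRA weights receive gradients
```

Calling print_trainable_parameters() shows that only a small fraction of the total weights (the LoRA matrices) are trainable, while the pretrained BLOOM weights stay frozen.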

LoRA has several advantages:

  • The previous pretrained weights are kept frozen, so the model is less prone to catastrophic forgetting.

  • Rank-decomposition matrices have significantly fewer parameters than the original model, which means that trained LoRA weights are easily portable.

    Notice the following image: suppose m = 100, n = 100, and k = 5 (k is the maximum rank of the update matrix). The original matrix then has 100 * 100 = 10,000 parameters.

    After the rank decomposition you instead have a 100 * 5 matrix (500 parameters) and a 5 * 100 matrix (500 parameters), for a total of 500 + 500 = 1,000 parameters, a huge reduction (see the parameter-count sketch after this list).

  • LoRA matrices are generally added to the attention layers of the original model.

  • The greater memory efficiency allows you to run fine-tuning on consumer GPUs such as the Tesla T4, RTX 3080, or even the RTX 2080 Ti. GPUs like the T4 are free and readily accessible in Kaggle or Google Colab notebooks.
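To make the parameter-count arithmetic from the list above concrete, here is a minimal, self-contained Python sketch. The function name lora_param_counts is purely illustrative; the numbers are the m = 100, n = 100, k = 5 example.

```python
def lora_param_counts(m: int, n: int, r: int) -> tuple[int, int]:
    """Compare a full m x n update matrix with its rank-r decomposition (m x r and r x n)."""
    full = m * n
    low_rank = m * r + r * n
    return full, low_rank

full, low_rank = lora_param_counts(100, 100, 5)
print(full)      # 10000 parameters for the full update matrix
print(low_rank)  # 1000 parameters for the two rank-decomposition matrices
```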

Method

The technique constrains the rank of the update matrix ΔW via a rank decomposition: ΔWₙₖ is represented as the product of two low-rank matrices Bₙᵣ and Aᵣₖ, where r << min(n, k). As a result, the forward pass of the layer, originally Wx, is modified to Wx + BAx (as shown in the figure below). A random Gaussian initialization is used for A, and B is initialized to zero, so BA = 0 at the start of training. The update BA is additionally scaled by a factor α/r.
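The following is a minimal from-scratch sketch of such a layer in PyTorch, not the implementation used in this repository: a frozen weight W, a Gaussian-initialized A, a zero-initialized B, and the α/r scaling, so the forward pass computes Wx + (α/r)BAx.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """A frozen linear layer W plus a trainable low-rank update scaled by alpha/r."""

    def __init__(self, in_features: int, out_features: int, r: int = 8, alpha: int = 16):
        super().__init__()
        # Pretrained weight W stays frozen (requires_grad=False), so it is not updated during training.
        self.weight = nn.Parameter(torch.randn(out_features, in_features), requires_grad=False)
        # A gets a random Gaussian initialization, B starts at zero, so BA = 0 before training.
        self.lora_A = nn.Parameter(torch.randn(r, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Original path Wx plus the scaled low-rank update (alpha/r) * B A x.
        frozen = x @ self.weight.T
        update = (x @ self.lora_A.T) @ self.lora_B.T
        return frozen + self.scaling * update

# Usage with assumed sizes: a 1024 -> 1024 layer with a rank-8 update.
layer = LoRALinear(1024, 1024, r=8, alpha=16)
out = layer(torch.randn(2, 1024))  # shape: (2, 1024)
```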
