Hi, thanks so much for acknowledging our BAdam optimizer (https://github.com/Ledzy/BAdam). Here's a brief overview of its features:
Memory Efficiency: BAdam is a memory-efficient full-parameter finetuning method. We finetune Llama 2-7B and Llama 3-8B on a single RTX 3090 with BAdam; see our GitHub page https://github.com/Ledzy/BAdam for detailed performance metrics.
Simplicity of Hyperparameters: BAdam introduces only one additional hyperparameter (the block-switching frequency), and it can be set adaptively; see the "Hyperparameter Suggestion" section on our GitHub page: https://github.com/Ledzy/BAdam#hyperparameter-suggestion.
Time Efficiency: Compared to LoRA and standard Adam, BAdam roughly halves the wall-clock backward time for the same number of epochs, because backpropagation only needs to reach the currently active block (a consequence of the chain rule).
Rapid Convergence: The algorithm converges quickly; we observe that a single epoch is often enough for instruction tuning (e.g., tuning Llama 3-8B on the Alpaca-GPT4 dataset).
LLaMA-Factory has already integrated our method. We believe BAdam offers a significant advance for the LLM community, and we are eager to support its integration into the Hugging Face Transformers library. Should you find these features compelling, we would be delighted to assist with the implementation.
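For readers unfamiliar with the method, here is a minimal PyTorch sketch of the block-coordinate idea described above. This is an illustration only, not the BAdam package's actual API: the per-top-level-module block partitioning, the switch frequency `K`, and the helper names are all assumptions made for the example.

```python
import torch
from torch.optim import Adam

# Illustrative block-coordinate update (not the BAdam package's API):
# parameters are grouped into blocks, only the active block is trainable,
# and a fresh Adam state is created each time the active block switches.
# Optimizer states and gradients exist only for the active block, and autograd
# stops propagating toward the input once it has passed that block.

def make_blocks(model):
    """Group parameters into blocks, e.g. one block per top-level module (illustrative)."""
    blocks = {}
    for name, param in model.named_parameters():
        prefix = name.split(".")[0]
        blocks.setdefault(prefix, []).append(param)
    return list(blocks.values())

def train_block_coordinate(model, data_loader, loss_fn, num_epochs=1, K=100, lr=1e-5):
    blocks = make_blocks(model)
    active = 0
    # Freeze everything, then enable only the active block.
    for p in model.parameters():
        p.requires_grad_(False)
    for p in blocks[active]:
        p.requires_grad_(True)
    optimizer = Adam(blocks[active], lr=lr)  # fresh Adam state for the active block

    step = 0
    for _ in range(num_epochs):
        for batch, target in data_loader:
            loss = loss_fn(model(batch), target)
            loss.backward()                    # only active-block params receive gradients
            optimizer.step()
            optimizer.zero_grad(set_to_none=True)
            step += 1
            if step % K == 0:                  # switch to the next block every K steps
                for p in blocks[active]:
                    p.requires_grad_(False)
                active = (active + 1) % len(blocks)
                for p in blocks[active]:
                    p.requires_grad_(True)
                optimizer = Adam(blocks[active], lr=lr)
```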
Feature request
https://arxiv.org/pdf/2404.02827.pdf
A memory-efficient optimizer like GaLore, but with fewer hyperparameters.
Motivation
hiyouga/LLaMA-Factory#3287
LLaMA-Factory already supports it, but having it in Transformers would be much more convenient.
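As a rough sketch of an interim setup (before any native Transformers support), one could wrap a standard optimizer with BAdam's `BlockOptimizer` and pass it to the existing `optimizers` argument of `Trainer`. The `BlockOptimizer` constructor arguments below follow the usage shown in the BAdam README and may differ from the current release; `train_dataset` is a placeholder for your tokenized dataset, and the model id is only an example.

```python
import torch
from transformers import AutoModelForCausalLM, Trainer, TrainingArguments
from badam import BlockOptimizer  # assumption: wrapper name as shown in the BAdam repo

model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")

# Wrap a standard AdamW optimizer with the block-wise wrapper.
base_optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)
optimizer = BlockOptimizer(
    base_optimizer=base_optimizer,
    named_parameters_list=list(model.named_parameters()),
    switch_block_every=100,  # K: number of Adam steps before switching blocks
    switch_mode="random",    # block update order
)

args = TrainingArguments(output_dir="out", num_train_epochs=1, per_device_train_batch_size=1)
trainer = Trainer(
    model=model,
    args=args,
    train_dataset=train_dataset,   # placeholder: your tokenized dataset
    optimizers=(optimizer, None),  # pass the wrapped optimizer; Trainer creates the scheduler
)
trainer.train()
```

Native support in Transformers would remove the need for the extra wrapping step and expose the block-switching frequency as a regular training argument.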
Your contribution