FEAT: Add Badam optimizer #30692
base: main
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Nice job :) Very straightforward.
Thanks for the work adding this!
At the moment, I think there's some work needed to move the handling logic into different methods to prevent leakiness, i.e. not all of these methods need to know about badam.
p.s. I can't read badam and not think about this Kylie banger
if use_badam:
    self.optim = "badam_" + self.optim
Having to remove and then add this back seems both convoluted and an indication that there's something funny about the implementation. Specifically, it seems like something which should be handled on the `OptimizerNames` side.
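To make this concrete, here is a minimal sketch, assuming a plain `Enum` stand-in for the library's `OptimizerNames`; the `BADAM` member and its value are hypothetical, not the merged implementation:

```python
# Sketch only: register BAdam as its own OptimizerNames entry instead of
# prefixing the existing optim string. The BADAM member is hypothetical.
from enum import Enum

class OptimizerNames(str, Enum):
    ADAMW_TORCH = "adamw_torch"
    SGD = "sgd"
    BADAM = "badam"

# TrainingArguments could then validate args.optim against the enum directly,
# so no other Trainer method needs to strip or re-add a "badam_" prefix.
optim = OptimizerNames("badam")
assert optim is OptimizerNames.BADAM
```

With a dedicated enum value, the rest of the Trainer only sees a normal optimizer name and the special-casing stays in one place.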
if badam_kwargs is not None:
    from badam import BlockOptimizer

    self.optimizer = BlockOptimizer(
        base_optimizer=self.optimizer,
        named_parameters_list=list(opt_model.named_parameters()),
        block_prefix_list=None,
        **badam_kwargs,
    )
This is another indication of a peculiar pattern - why set `self.optimizer` and then overwrite here? Instead, the correct optimizer class and kwargs should be handled in `Trainer.get_optimizer_cls_and_kwargs(self.args, opt_model)`.
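To illustrate (this is a sketch under assumed names, not the transformers API), resolving BAdam in a single "class + kwargs" step could look roughly like this, with the `BlockOptimizer` arguments taken from the diff above:

```python
# Sketch: pick the optimizer class and kwargs in one place, in the spirit of
# Trainer.get_optimizer_cls_and_kwargs. The helper name and defaults here are
# illustrative assumptions.
import torch

def resolve_optimizer_cls_and_kwargs(optim_name, model, lr=1e-5, badam_kwargs=None):
    if optim_name == "badam":
        from badam import BlockOptimizer  # third-party dependency used by this PR

        # BlockOptimizer wraps a base optimizer, so the base is built here and
        # the wrapper is the single optimizer the Trainer instantiates.
        base_optimizer = torch.optim.AdamW(model.parameters(), lr=lr)
        kwargs = {
            "base_optimizer": base_optimizer,
            "named_parameters_list": list(model.named_parameters()),
            "block_prefix_list": None,
            **(badam_kwargs or {}),
        }
        return BlockOptimizer, kwargs

    return torch.optim.AdamW, {"lr": lr}
```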
if args.optim_args:
    for mapping in args.optim_args.replace(" ", "").split(","):
        key, value = mapping.split("=")
        if "badam_" in key and use_badam:
Is there ever a case when `use_badam` is `False` and `badam_` is in the key?
            badam_optim_args[key.replace("badam_", "")] = value
        else:
            optim_args[key] = value
We shouldn't need to do this string manipulation and have these parallel `optim_args`. This doesn't scale well if we want to add other optimizers which accept some of the previous optim args.
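As a hedged alternative (helper names and keys below are placeholders, not part of this PR), the raw `optim_args` string could be parsed once and each optimizer could select only the keys it declares, with no prefix stripping or parallel dicts:

```python
# Hypothetical helpers: parse optim_args once, then hand each optimizer only
# the keys it accepts, instead of keeping a second badam_optim_args dict.
def parse_optim_args(optim_args_str):
    """Turn 'a=1, b=2' into {'a': '1', 'b': '2'}."""
    if not optim_args_str:
        return {}
    return dict(item.split("=", 1) for item in optim_args_str.replace(" ", "").split(","))

def select_kwargs(all_args, accepted_keys):
    """Keep only the keys a given optimizer declares that it accepts."""
    return {k: v for k, v in all_args.items() if k in accepted_keys}

all_args = parse_optim_args("lr_scale=0.5, switch_mode=random")  # placeholder keys
badam_kwargs = select_kwargs(all_args, {"switch_mode"})          # {'switch_mode': 'random'}
```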
if args.optim_args:
    for mapping in args.optim_args.replace(" ", "").split(","):
        key, value = mapping.split("=")
        if "badam_" in key and use_badam:
nit - check the bool flag first in the `and` check, it'll be faster:

Suggested change:
-        if "badam_" in key and use_badam:
+        if use_badam and "badam_" in key:
What does this PR do?
Fixes: #30308
This PR adds the Badam optimizer to the transformers Trainer API.
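For reference, usage on this PR's branch might look like the sketch below; the `use_badam` flag and the `badam_`-prefixed `optim_args` keys mirror the current diff and could change during review, and the key shown is a placeholder:

```python
from transformers import TrainingArguments

# Only valid on this PR's branch; flag and key names follow the diff above.
training_args = TrainingArguments(
    output_dir="out",
    optim="adamw_torch",                   # base optimizer that BAdam wraps
    optim_args="badam_example_key=value",  # placeholder badam_-prefixed kwarg
    use_badam=True,
)
```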
TODOs:
cc @amyeroberts @muellerzr