Adding adapters to SpeechBrain (Code from Samsung AI Center Cambridge) #2534

Open · wants to merge 37 commits into base: develop
Conversation

@TParcollet (Collaborator) commented Apr 30, 2024:

What does this PR do?

Based on #2526, this PR is a first attempt at adding Adapters to any SpeechBrain model. It only works if the Pretrainer is used and not the Checkpointer, because the checkpointer would try to reload the checkpoint after the state_dict has been modified. So the order must be: 1. instantiate the Brain; 2. call the Pretrainer; 3. add the adapters; 4. call fit. An example is:

```python
asr_brain = ASR(
    modules=hparams["modules"],
    opt_class=hparams["Adam"],
    hparams=hparams,
    run_opts=run_opts,
    checkpointer=hparams["checkpointer"],
)

# adding objects to trainer:
asr_brain.tokenizer = hparams["tokenizer"]

# Load the pretrained model.
run_on_main(hparams["pretrainer"].collect_files)
hparams["pretrainer"].load_collected()

from speechbrain.lobes.models.Adapters import (
    HoulsbyAdapterLinear,
    add_adapters_to_linear_in_model,
)

# Insert adapters into every linear layer of the pretrained Transformer.
add_adapters_to_linear_in_model(
    model=asr_brain.modules.Transformer,
    adapter_class=HoulsbyAdapterLinear,
    projection_size=32,
)

# Training
asr_brain.fit(
    asr_brain.hparams.epoch_counter, train_data, valid_data,
)
```
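For parameter-efficient training, one would typically also freeze the pretrained weights so that fit only updates the adapters. A minimal sketch (the "adapter" name filter is an assumption about how the inserted modules end up named, not something enforced by this PR):

```python
# Sketch: freeze everything except the newly inserted adapter parameters.
# The "adapter" substring is an assumption about the parameter names;
# adjust it to match the actual naming in the state_dict.
for name, param in asr_brain.modules.Transformer.named_parameters():
    param.requires_grad = "adapter" in name.lower()

n_trainable = sum(
    p.numel() for p in asr_brain.modules.Transformer.parameters() if p.requires_grad
)
print(f"Trainable parameters after freezing: {n_trainable}")
```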

@TParcollet added the enhancement (New feature or request) and work in progress (Not ready for merge) labels on Apr 30, 2024
@TParcollet changed the title from "Adding adapters to SpeechBrain" to "Adding adapters to SpeechBrain (Code from Samsung AI Center Cambridge)" on Apr 30, 2024
@TParcollet linked an issue on May 1, 2024 that may be closed by this pull request
@Adel-Moumen added this to the v1.0.2 milestone on May 2, 2024
@pplantinga (Collaborator) commented:
I don't think this is quite the right approach because I don't think it allows for stopping/restarting which is part of the point of checkpointing. Instead, the checkpointer should store the LoRA'd model, not the pretrained model. Ideally it would even only store the LoRA weights (and any updated weights) and not the whole model, making for very small checkpoints and faster saving. Example:

```yaml
add_adapters: !name:speechbrain.lobes.models.Adapters.add_adapters_to_linear_in_model
  adapter_class: !name:speechbrain.lobes.models.Adapters.HoulsbyAdapterLinear
  projection_size: 32

pretrainer: !new:speechbrain....Pretrainer
  loadables:
    transformer: !ref <Transformer>
```

```python
# Load the pretrained model, then add the adapters before building the Brain.
run_on_main(hparams["pretrainer"].collect_files)
hparams["pretrainer"].load_collected()
hparams["add_adapters"](hparams["Transformer"])

asr_brain = ASR(
    modules=hparams["modules"],
    opt_class=hparams["Adam"],
    hparams=hparams,
    run_opts=run_opts,
    checkpointer=hparams["checkpointer"],  # Checkpointer loads LoRA weights only and applies them
)

asr_brain.fit(
    asr_brain.hparams.epoch_counter, train_data, valid_data,
)
```
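One way to get the small-checkpoint behaviour described above would be to save and restore only the adapter parameters. A minimal PyTorch sketch (the "adapter" name filter and both helper functions are illustrative assumptions, not part of this PR):

```python
import torch

def save_adapter_weights(model, path):
    # Keep only the adapter/LoRA parameters; the "adapter" substring filter
    # is an assumption about how the inserted modules are named.
    adapter_state = {
        name: param
        for name, param in model.state_dict().items()
        if "adapter" in name
    }
    torch.save(adapter_state, path)

def load_adapter_weights(model, path):
    adapter_state = torch.load(path, map_location="cpu")
    # strict=False leaves the frozen pretrained weights untouched.
    model.load_state_dict(adapter_state, strict=False)
```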

@TParcollet (Collaborator, Author) commented May 2, 2024:

It does allow for stopping and restarting, because you are altering the object, i.e. the checkpointer keeps track of it! The only real drawback is indeed that you store the whole model. However, I don't think it's an issue because, in the end, you may simply not know where to put the pre-trained adapters in the model if they are not applied to every linear layer, for instance. I'd be happy to see a functional example of something else, though.

@pplantinga (Collaborator) commented:
Tbh I think PEFT handles this perfectly, perhaps we should lift their code wholesale.

@TParcollet (Collaborator, Author) commented:
You mean depend on another Huggingface library?

@pplantinga (Collaborator) commented:
My opinion is we should just add it as a dependency, but I understand the objections to it. So instead we could just copy the parts of the code that make sense into speechbrain.

@TParcollet (Collaborator, Author) commented:
If you could give me a neat example of an integration of PEFT, I could be convinced.
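For reference, a minimal sketch (not from this PR) of what wrapping the SpeechBrain Transformer with PEFT's LoRA could look like, assuming the optional peft dependency is installed; the target_modules names below are hypothetical and would have to match the actual linear layers of the model:

```python
from peft import LoraConfig, get_peft_model

lora_config = LoraConfig(
    r=8,                # low-rank dimension of the LoRA update
    lora_alpha=16,      # scaling applied to the LoRA update
    lora_dropout=0.05,
    target_modules=["linear1", "linear2"],  # hypothetical layer names
)

# Wraps the module, freezing the base weights and adding trainable LoRA adapters.
peft_transformer = get_peft_model(asr_brain.modules.Transformer, lora_config)
peft_transformer.print_trainable_parameters()
```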

@poonehmousavi (Collaborator) commented:
The problem with PEFT is when we want to load the model from the SpeechBrain checkpoint: it is a mess to make it work, and it could also cause problems when different versions of PEFT are used. But maybe we could find a cleaner way to do this.

Collaborator (review comment):
Shouldn't this go in nnet rather than lobes?

Collaborator Author (review comment):
Good question. It's unclear, because Adapters can be considered "entire models" coming from the literature, but I agree that they can also be seen as small components. I'd be happy if you could help with the get_model part, like for PEFT. From your previous PR, I liked the fact that we can rely on the larger Adapter base from PEFT -- I am wondering if there isn't a way to combine both...

Collaborator Author (review comment):
I personally like the fact that, with my function, you can specify which part of Brain.modules (or whatever model) you want to put Adapters on. But I'd be happy to see something else.
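For example, the adapters can be restricted to a sub-module; a short sketch using the function from this PR (the .decoder attribute is illustrative, any part of Brain.modules could be targeted):

```python
# Sketch: apply Houlsby adapters only to the decoder's linear layers.
# `.decoder` is a hypothetical attribute name used for illustration.
add_adapters_to_linear_in_model(
    model=asr_brain.modules.Transformer.decoder,
    adapter_class=HoulsbyAdapterLinear,
    projection_size=32,
)
```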

Labels: enhancement (New feature or request), work in progress (Not ready for merge)
Projects: None yet
Development: Successfully merging this pull request may close: "Adapters + LLama -- re-design."
5 participants