GPU-based vectorized Specaug Version 2 #9155

amorari-nvidia · 2024-05-09T18:48:38Z

What does this PR do ?

This PR proposes a faster version of the Spectrogram Augmentation module for time and frequency masking.

Signed-off-by: Piotr Żelasko <petezor@gmail.com>

…g-v2

for more information, see https://pre-commit.ci

pzelasko

I visually inspected that the masks look correct on random spectrograms. I also ran a training with this variant in Nsight profiler and noticed the specaug computation mostly disappeared from the profile. It is estimated to be about 4-5x faster than the variant proposed in #9041 which would make it 8-10x faster than the original version (still available as the "legacy" setting).

LGTM!

nemo/collections/asr/parts/submodules/spectr_augment.py

requested update to another version with some extra fixes

titu1994

Looks great ! Minor comments

nemo/collections/asr/parts/submodules/spectr_augment.py

for more information, see https://pre-commit.ci

amorari-nvidia · 2024-05-09T22:56:14Z

After this I think that we may also have to add the flag in the audio processing module:
https://github.com/amorari-nvidia/NeMo/blob/d26eab45d54e9e00e74c68ea4b0bbbf7b48a7a50/nemo/collections/asr/modules/audio_preprocessing.py#L494

And also we may have to add a test, adding the pytest run_only_on('GPU') flag the same it is done for the numba code:
#https://github.com/amorari-nvidia/NeMo/blob/d26eab45d54e9e00e74c68ea4b0bbbf7b48a7a50/tests/collections/asr/test_asr_modules.py#L71

nemo/collections/asr/parts/submodules/spectr_augment.py

Signed-off-by: Alessandro Morari <amorari@nvidia.com>

pzelasko · 2024-05-13T15:00:11Z

@amorari-nvidia As a last thing, could you remove the numba specaug implementation altogether in this PR? IIRC it will automatically kick in with numba 0.58 (or when numba+cuda versions are compatible for earlier versions), overriding the implementation here. CC @titu1994 pls double check if I'm right about this.

amorari-nvidia · 2024-05-13T18:16:35Z

Waiting for confirmation from @titu1994

titu1994 · 2024-05-13T19:31:45Z

You can remove it entirely, or set the default flag of numba cuda to false. I think that's sufficient for the time being.

If you can remove it cleanly, then that's fine too

amorari-nvidia · 2024-05-13T20:04:29Z

I will just set the Numba version default flag to false for now

Signed-off-by: pzelasko <pzelasko@users.noreply.github.com>

amorari-nvidia · 2024-05-13T20:51:32Z

I will just set the Numba version default flag to false for now

The Numba SpecAugment version is already set to false, while the vectorized version is set to true and is the default code.

pzelasko · 2024-05-14T15:44:55Z

great, re-triggered tests, LGTM as soon as it's green

pablo-garay · 2024-05-14T23:11:49Z

great, re-triggered tests, LGTM as soon as it's green

CI passed

pzelasko · 2024-05-15T20:43:43Z

Congrats @amorari-nvidia on the first contribution!

pzelasko and others added 5 commits April 25, 2024 15:15

GPU-based vectorized SpecAug

c9ad51c

Signed-off-by: Piotr Żelasko <petezor@gmail.com>

Wider dtypes for specaug mask bounds computation

9b5c427

Signed-off-by: Piotr Żelasko <petezor@gmail.com>

Merge remote-tracking branch 'upstream/fast-specaug' into fast-specau…

dbb7a32

…g-v2

Merge remote-tracking branch 'upstream/main' into fast-specaug-v2

07406a7

fast spec augmentation v2

5c46686

github-actions bot added the ASR label May 9, 2024

[pre-commit.ci] auto fixes from pre-commit.com hooks

de70cb8

for more information, see https://pre-commit.ci

pzelasko previously approved these changes May 9, 2024

View reviewed changes

pzelasko requested a review from titu1994 May 9, 2024 18:55

pzelasko mentioned this pull request May 9, 2024

GPU-based vectorized SpecAug #9041

Closed

8 tasks

pzelasko added the Run CICD label May 9, 2024

github-advanced-security bot found potential problems May 9, 2024

View reviewed changes

nemo/collections/asr/parts/submodules/spectr_augment.py Fixed Show fixed Hide fixed

titu1994 reviewed May 9, 2024

View reviewed changes

nemo/collections/asr/parts/submodules/spectr_augment.py Outdated Show resolved Hide resolved

nemo/collections/asr/parts/submodules/spectr_augment.py Show resolved Hide resolved

nemo/collections/asr/parts/submodules/spectr_augment.py Show resolved Hide resolved

amorari-nvidia and others added 4 commits May 9, 2024 22:24

Removed randint code, added comments

74dd041

merged, removed randint code, added comments

4bdb2c9

[pre-commit.ci] auto fixes from pre-commit.com hooks

1d86953

for more information, see https://pre-commit.ci

Merge branch 'main' into fast-specaug-v2

d26eab4

pzelasko reviewed May 10, 2024

View reviewed changes

nemo/collections/asr/parts/submodules/spectr_augment.py Outdated Show resolved Hide resolved

galv reviewed May 10, 2024

View reviewed changes

nemo/collections/asr/parts/submodules/spectr_augment.py Outdated Show resolved Hide resolved

Fixed padding coverage bug, fixed long casting bug, fixed comments

34e2666

Signed-off-by: Alessandro Morari <amorari@nvidia.com>

pzelasko added Run CICD and removed Run CICD labels May 10, 2024

amorari-nvidia added 2 commits May 11, 2024 15:58

fixed bug due to using freq_axis with length

62fd1d8

Signed-off-by: Alessandro Morari <amorari@nvidia.com>

Added tests for vectorized spectrogram augmentation

c8ab4e7

Signed-off-by: Alessandro Morari <amorari@nvidia.com>

pzelasko and others added 2 commits May 13, 2024 16:26

Merge branch 'main' into fast-specaug-v2

5f3d672

Apply isort and black reformatting

bc62b1f

Signed-off-by: pzelasko <pzelasko@users.noreply.github.com>

pzelasko added Run CICD and removed Run CICD labels May 14, 2024

pzelasko added Run CICD and removed Run CICD labels May 15, 2024

pzelasko self-requested a review May 15, 2024 16:04

pzelasko approved these changes May 15, 2024

View reviewed changes

pzelasko merged commit 061cc45 into NVIDIA:main May 15, 2024
248 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GPU-based vectorized Specaug Version 2 #9155

GPU-based vectorized Specaug Version 2 #9155

amorari-nvidia commented May 9, 2024

pzelasko left a comment

titu1994 left a comment

amorari-nvidia commented May 9, 2024

pzelasko commented May 13, 2024

amorari-nvidia commented May 13, 2024

titu1994 commented May 13, 2024

amorari-nvidia commented May 13, 2024

amorari-nvidia commented May 13, 2024 •

edited

pzelasko commented May 14, 2024

pablo-garay commented May 14, 2024

pzelasko commented May 15, 2024

GPU-based vectorized Specaug Version 2 #9155

GPU-based vectorized Specaug Version 2 #9155

Conversation

amorari-nvidia commented May 9, 2024

What does this PR do ?

pzelasko left a comment

Choose a reason for hiding this comment

titu1994 left a comment

Choose a reason for hiding this comment

amorari-nvidia commented May 9, 2024

pzelasko commented May 13, 2024

amorari-nvidia commented May 13, 2024

titu1994 commented May 13, 2024

amorari-nvidia commented May 13, 2024

amorari-nvidia commented May 13, 2024 • edited

pzelasko commented May 14, 2024

pablo-garay commented May 14, 2024

pzelasko commented May 15, 2024

amorari-nvidia commented May 13, 2024 •

edited