Loss Weight ablation experiment #75

Open
JinYu1998 opened this issue Jan 22, 2024 · 3 comments

Comments

@JinYu1998

Have you tried the effect of different loss weights on the distillation results?

@sanchit-gandhi
Collaborator

Hey @JinYu1998 - we did a coarse sweep over the KL weights when setting up preliminary experiments on just the LibriSpeech corpus. We found the setting from DistilBART to be best, and so committed to this for the rest of the project. We didn't do any further tuning of the loss weights on our full training set. You can find an ablation over the loss terms (not weights) on page 26 of the paper.
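For readers unfamiliar with the setup, here is a minimal sketch of what a coarse sweep over the KL weight could look like on a single toy token. It is not the project's training code: the function names, the toy logits, and the weight values are all illustrative assumptions, and the actual weights used in the project are not stated in this thread.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def cross_entropy(student_logits, target_id):
    """Cross-entropy of the student against one target token (label or pseudo-label)."""
    return -log_softmax(student_logits)[target_id]


def kl_divergence(teacher_logits, student_logits):
    """KL(teacher || student) between the two predicted token distributions."""
    t = log_softmax(teacher_logits)
    s = log_softmax(student_logits)
    return sum(math.exp(lt) * (lt - ls) for lt, ls in zip(t, s))


def distillation_loss(student_logits, teacher_logits, target_id, ce_weight, kl_weight):
    """Weighted sum of the two loss terms; both weights are hypothetical knobs."""
    ce = cross_entropy(student_logits, target_id)
    kl = kl_divergence(teacher_logits, student_logits)
    return ce_weight * ce + kl_weight * kl


# Toy logits over a 3-token vocabulary (made-up values, for illustration only).
student = [2.0, 0.5, -1.0]
teacher = [3.0, 0.0, -2.0]
target = 0

# Coarse sweep: hold the cross-entropy weight fixed and vary the KL weight.
for kl_weight in (0.0, 0.5, 1.0, 2.0):
    loss = distillation_loss(student, teacher, target, ce_weight=1.0, kl_weight=kl_weight)
    print(f"kl_weight={kl_weight:.1f} -> loss={loss:.4f}")
```

In a real sweep, each weight setting would mean a separate training run compared on a held-out metric such as word error rate; the loop above only shows how the weights enter the objective.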

@JinYu1998
Author

> Hey @JinYu1998 - we did a coarse sweep over the KL weights when setting up preliminary experiments on just the LibriSpeech corpus. We found the setting from DistilBART to be best, and so committed to this for the rest of the project. We didn't do any further tuning of the loss weights on our full training set. You can find an ablation over the loss terms (not weights) on page 26 of the paper.

Thank you for your reply. I previously worked on dynamic temperature distillation for classifiers and recently finished that work. I'm very interested in distillation for Whisper and look forward to combining my work with Distil-Whisper.

