Replies: 2 comments
-
Whether and how you scale your learning rate with the number of workers (i.e., the total batch size) depends on the optimizer you use. For plain SGD, linear scaling may be the right call, but less so for more involved schemes. It's better to be explicit about this in your code.
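For example, with Horovod's PyTorch bindings, explicit linear scaling might look like the sketch below (the model and `base_lr` are placeholders, not anything from this thread):

```python
import torch
import horovod.torch as hvd

hvd.init()

model = torch.nn.Linear(10, 1)  # placeholder model for illustration
base_lr = 0.01                  # assumed per-worker base learning rate

# Linear scaling: multiply the base LR by the number of workers.
# Reasonable for plain SGD; revisit for Adam, LAMB, and similar optimizers.
optimizer = torch.optim.SGD(model.parameters(), lr=base_lr * hvd.size())

# Wrap with Horovod's distributed optimizer so gradients are averaged across workers.
optimizer = hvd.DistributedOptimizer(
    optimizer, named_parameters=model.named_parameters()
)
```

Keeping the `base_lr * hvd.size()` expression in your own code makes the scaling decision visible rather than hidden behind the framework.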
-
Hey @ziqipang, as @maxhgerlach said, this needs to be done manually in most cases. Some higher-level frameworks like PyTorch Lightning and Ludwig will automatically scale the learning rate for you, but that is not done by Horovod; it's a feature of those frameworks. Does that answer your question?
-
Hi, I am new to Horovod and trying to use it for distributed training. I am wondering whether we have to manually increase the learning rate when we use more GPUs.
The documentation instructs us to scale the learning rate manually, and the example training code seems to do this as well. But after reading this issue, I am confused: does Horovod scale the learning rate by default?
Thank you for helping me!