Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] deepspeed.runtime.zero.utils.ZeRORuntimeException: You are using ZeRO-Offload with a client provided optimizer #693

Open
LUMO666 opened this issue Jan 18, 2024 · 2 comments
Labels
pending Something isn't working

Comments

@LUMO666
Copy link

LUMO666 commented Jan 18, 2024

Run /scripts/run_raft_align.sh in docker and get an error.

deepspeed.runtime.zero.utils.ZeRORuntimeException: You are using ZeRO-Offload with a client provided optimizer (<class 'transformers.optimization.AdamW'

) which in most cases will yield poor performance. Please either use deepspeed.ops.adam.DeepSpeedCPUAdam or set an optimizer in your ds-config (https://www.deepspeed.ai/docs/config-json/#optimizer-parameters). If you really want to use a custom optimizer w. ZeRO-Offload and understand the performance impacts you can also set <"zero_force_ds_cpu_optimizer": false> in your configuration file.

Is it related to mpi4py? I'm doubting whether I have mpi4py installed correctly. Thanks.

@LUMO666 LUMO666 added the pending Something isn't working label Jan 18, 2024
@research4pan
Copy link
Contributor

@WeiXiongUST @hendrydong I am wondering if you could take a look? Thanks 🙏

@hendrydong
Copy link
Contributor

Hi, it looks that the configuations of "ZeRO-Offload" is not correct, you may double check the yaml file.

BTW, this might be more related to the configuration of deepspeed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
pending Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants