Issues: horovod/horovod
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Milestones
Assignee
Sort
Issues list
NVIDIA CUDA TOOLKIT version to run Horovod in Conda Environment
#4043
opened May 10, 2024 by
ppandit95
Environment crashes because it seems to be overriding built in modules
bug
#4042
opened May 8, 2024 by
mtrattner
Replace tf.train.SessionRunHook by tf.compat.v1.train.SessionRunHook ?
bug
#4040
opened May 1, 2024 by
whatdhack
v0.28.1 Version Mismatch with TF 2.12.0. Works with v0.28.0
bug
#4039
opened Apr 16, 2024 by
liamaltarac
Tensorflow Saved model not portable with latest tf.keras.optimizers
bug
#4028
opened Mar 11, 2024 by
supercharleszhu
Unexpected Worker Failure when using Elastic Horovod + Process Sets
bug
#4021
opened Feb 7, 2024 by
Pranavug
Can I call horovod training process in proc = subprocess.Popen(command, shell=True, cwd=cwd) using command
bug
#4017
opened Jan 15, 2024 by
bit-pku-zdf
Error install horovod with python 3.11.5 on macOS 11.3.1
bug
#4013
opened Dec 22, 2023 by
DriverSong
AttributeError: module 'horovod.torch' has no attribute 'init'
bug
#4009
opened Dec 13, 2023 by
Cow-Kite
Getting error while running multi node machine learning training on H100 servers
enhancement
#3989
opened Oct 2, 2023 by
PurvagLapsiwala
Test test.integration.test_spark.SparkTests.test_dbfs_local_store broken for tensorflow>=2.13
bug
#3988
opened Sep 27, 2023 by
EnricoMi
How to write tensorflow custom training loop with using horovod.
#3987
opened Sep 21, 2023 by
PurvangL
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.