Pull requests: NVIDIA/Megatron-LM
Pull requests list
Fix incorrect src argument in broadcast_params function
#796
opened Apr 26, 2024 by
Yuxin-CV
fix loading distributed checkpoint when enable auto-detect-ckpt-format but disable use-dist-ckpt
#794
opened Apr 24, 2024 by
imh966
fix a mistake when check if num_layers dividable by vpp
#781
opened Apr 16, 2024 by
constroy
Support S3 checkpointing for the torch strategy in distributed checkpointing
#748
opened Mar 22, 2024 by
jrocmar
Update outdated method name passed to get linear_layer function to match intended method that was imported
#740
opened Mar 18, 2024 by
OckermanSethGVSU
Replace outdated import path of get_forward_backward_func in eval_utils.py
#734
opened Mar 14, 2024 by
OckermanSethGVSU
support more general inference case that query length > 1
#730
opened Mar 12, 2024 by
yidong72