Question: why does GPT2FusedLinearConv1D_Col in shardformer perform allreduce twice in the backward pass? #4961

lichenlu started this conversation in Community | General

Replies: 3 comments 3 replies

1 reply
@lichenlu
0 replies
2 replies
@lichenlu
@flybird11111
3 participants