Add RingFlashAttention for context parallel #8383
base: develop
Thanks for your contribution!
09fd62e to fbd16a1
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #8383      +/-   ##
===========================================
- Coverage    54.29%   54.17%   -0.13%
===========================================
  Files          617      619       +2
  Lines        96339    96618     +279
===========================================
+ Hits         52310    52339      +29
- Misses       44029    44279     +250
View full report in Codecov by Sentry.
# if step != cp_size - 1:
#     comm_buffer.wait()
paddle.device.synchronize()
TODO: With batch_isend_irecv on an asynchronous stream, wait() cannot be called; this needs to be fixed. It hurts performance.
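To illustrate what the TODO is asking for, here is a minimal pure-Python sketch of the intended overlap pattern: start the transfer of the next block, compute on the current one, then wait only on that transfer. It is not Paddle code. `fetch_block` and `ring_steps` are hypothetical names, and a thread pool future stands in for the handle that the commented-out `comm_buffer.wait()` would wait on.

```python
from concurrent.futures import ThreadPoolExecutor


def fetch_block(blocks, index):
    # Hypothetical stand-in for an asynchronous send/recv of one K/V block.
    return list(blocks[index])


def ring_steps(blocks, rank, compute):
    """Visit every block once from `rank`'s point of view, prefetching the next one."""
    cp_size = len(blocks)
    current = blocks[rank]
    visited = []
    with ThreadPoolExecutor(max_workers=1) as pool:
        for step in range(cp_size):
            pending = None
            if step != cp_size - 1:
                # Start the transfer of the block this rank needs next.
                nxt = (rank - step - 1) % cp_size
                pending = pool.submit(fetch_block, blocks, nxt)
            # Compute on the current block while the transfer is in flight.
            visited.append(compute(current))
            if pending is not None:
                # Wait only on this one transfer, not on all outstanding work.
                current = pending.result()
    return visited
```

With a per-transfer wait, the last step skips communication entirely, which is what the `step != cp_size - 1` guard in the diff expresses. A global `paddle.device.synchronize()` instead blocks until all queued device work has finished, so it cannot overlap communication with compute.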
f94a915 to 4e88520
PR types
New features
PR changes
Models
Description
Add ring flash attention support for fleet's context parallel.
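For readers unfamiliar with the technique, the following is a small single-process sketch of the general ring attention idea, not the code in this PR. Each simulated rank keeps its own query block, sees one K/V block per ring step, and merges the partial results through their log-sum-exp values. All function names are hypothetical, and the sketch assumes non-causal attention with no masking or dropout.

```python
import math


def attention_block(q, k_blk, v_blk):
    """Return (out, lse) for one query vector against one K/V block."""
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in k_blk]
    m = max(scores)
    weights = [math.exp(s - m) for s in scores]
    denom = sum(weights)
    dim = len(v_blk[0])
    out = [sum(w * v[d] for w, v in zip(weights, v_blk)) / denom for d in range(dim)]
    return out, m + math.log(denom)


def merge(out_a, lse_a, out_b, lse_b):
    """Combine two partial attention results using their log-sum-exp values."""
    m = max(lse_a, lse_b)
    wa, wb = math.exp(lse_a - m), math.exp(lse_b - m)
    total = wa + wb
    out = [(wa * a + wb * b) / total for a, b in zip(out_a, out_b)]
    return out, m + math.log(total)


def ring_attention(q_blocks, k_blocks, v_blocks):
    """Simulate cp_size ranks; rank r owns q_blocks[r] and starts with K/V block r."""
    cp_size = len(q_blocks)
    results = []
    for rank in range(cp_size):
        rank_out = []
        for q in q_blocks[rank]:
            out, lse = None, None
            for step in range(cp_size):
                # K/V block held by this rank after `step` shifts around the ring.
                src = (rank - step) % cp_size
                blk_out, blk_lse = attention_block(q, k_blocks[src], v_blocks[src])
                if out is None:
                    out, lse = blk_out, blk_lse
                else:
                    out, lse = merge(out, lse, blk_out, blk_lse)
            rank_out.append(out)
        results.append(rank_out)
    return results
```

Because the merge is exact, each rank's output matches attention over the full sequence even though no rank ever holds all keys and values at once. That property is what lets the sequence dimension be sharded across context-parallel ranks.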