Skip to content

Actions: flashinfer-ai/flashinfer

All workflows

Actions

Loading...

Showing runs from all workflows
377 workflow runs
377 workflow runs
Event

Filter by event

Status

Filter by status

Branch
Actor

Filter by actor

bugfix: suppress alignment warning of sampling kernels (#297)
Automatically bump version and release Python wheels #141: Commit 1250b68 pushed by yzh119
June 11, 2024 08:47 27s main
June 11, 2024 08:47 27s
bugfix: suppress alignment warning of sampling kernels (#297)
Build FlashInfer Docs #171: Commit 1250b68 pushed by yzh119
June 11, 2024 08:47 45s main
June 11, 2024 08:47 45s
bugfix: fix wrong padded_batch_size_ (#296)
Build FlashInfer Docs #170: Commit aff4cf0 pushed by yzh119
June 11, 2024 05:14 1m 11s main
June 11, 2024 05:14 1m 11s
bugfix: fix wrong padded_batch_size_ (#296)
Automatically bump version and release Python wheels #140: Commit aff4cf0 pushed by yzh119
June 11, 2024 05:14 26s main
June 11, 2024 05:14 26s
refactor: refactor decode handler (#294)
Automatically bump version and release Python wheels #139: Commit 60459e4 pushed by yzh119
June 10, 2024 10:33 24s main
June 10, 2024 10:33 24s
refactor: refactor decode handler (#294)
Build FlashInfer Docs #169: Commit 60459e4 pushed by yzh119
June 10, 2024 10:33 53s main
June 10, 2024 10:33 53s
misc: add some notes in cmake.config (#293)
Automatically bump version and release Python wheels #138: Commit 4c5e28b pushed by yzh119
June 10, 2024 07:20 24s main
June 10, 2024 07:20 24s
misc: add some notes in cmake.config (#293)
Build FlashInfer Docs #168: Commit 4c5e28b pushed by yzh119
June 10, 2024 07:20 45s main
June 10, 2024 07:20 45s
doc: fix the math display of group gemm operator (#292)
Build FlashInfer Docs #167: Commit 4198686 pushed by yzh119
June 10, 2024 07:14 57s main
June 10, 2024 07:14 57s
doc: fix the math display of group gemm operator (#292)
Automatically bump version and release Python wheels #137: Commit 4198686 pushed by yzh119
June 10, 2024 07:14 26s main
June 10, 2024 07:14 26s
bugfix: Fix the behavior of decode cuda graph wrapper (#291)
Automatically bump version and release Python wheels #136: Commit e252c94 pushed by yzh119
June 10, 2024 07:07 36s main
June 10, 2024 07:07 36s
bugfix: Fix the behavior of decode cuda graph wrapper (#291)
Build FlashInfer Docs #166: Commit e252c94 pushed by yzh119
June 10, 2024 07:07 49s main
June 10, 2024 07:07 49s
bugfix: fix the synchronization issue in distributed operators (#290)
Automatically bump version and release Python wheels #135: Commit f13ec08 pushed by yzh119
June 9, 2024 08:49 32s main
June 9, 2024 08:49 32s
bugfix: fix the synchronization issue in distributed operators (#290)
Build FlashInfer Docs #165: Commit f13ec08 pushed by yzh119
June 9, 2024 08:49 1m 0s main
June 9, 2024 08:49 1m 0s
feat: initial support of distributed operators (#289)
Automatically bump version and release Python wheels #134: Commit 03553da pushed by yzh119
June 8, 2024 08:24 28s main
June 8, 2024 08:24 28s
feat: initial support of distributed operators (#289)
Build FlashInfer Docs #164: Commit 03553da pushed by yzh119
June 8, 2024 08:24 44s main
June 8, 2024 08:24 44s
cmake: fix DECODE_F8_DTYPES and DECODE_FP8_DTYPES discrepancy (#287)
Automatically bump version and release Python wheels #133: Commit 809abaa pushed by yzh119
June 7, 2024 08:13 32s main
June 7, 2024 08:13 32s
cmake: fix DECODE_F8_DTYPES and DECODE_FP8_DTYPES discrepancy (#287)
Build FlashInfer Docs #163: Commit 809abaa pushed by yzh119
June 7, 2024 08:13 48s main
June 7, 2024 08:13 48s
bugfix: fix the data type of aligned_alloc in handlers (#283)
Automatically bump version and release Python wheels #132: Commit 5a38066 pushed by yzh119
June 5, 2024 06:58 25s main
June 5, 2024 06:58 25s
bugfix: fix the data type of aligned_alloc in handlers (#283)
Build FlashInfer Docs #162: Commit 5a38066 pushed by yzh119
June 5, 2024 06:58 1m 1s main
June 5, 2024 06:58 1m 1s
feat: add group gemm operators (#282)
Automatically bump version and release Python wheels #131: Commit e08ba42 pushed by yzh119
June 5, 2024 03:28 30s main
June 5, 2024 03:28 30s
feat: add group gemm operators (#282)
Build FlashInfer Docs #161: Commit e08ba42 pushed by yzh119
June 5, 2024 03:28 49s main
June 5, 2024 03:28 49s
Add dtype checks for q-kv tensors (#280)
Build FlashInfer Docs #160: Commit 7aadc0d pushed by yzh119
June 4, 2024 05:42 49s main
June 4, 2024 05:42 49s
Add dtype checks for q-kv tensors (#280)
Automatically bump version and release Python wheels #130: Commit 7aadc0d pushed by yzh119
June 4, 2024 05:42 23s main
June 4, 2024 05:42 23s
bugfix: fix cudagraph-compatible prefill/decode apis (#281)
Automatically bump version and release Python wheels #129: Commit 1092e7e pushed by yzh119
June 4, 2024 05:21 25s main
June 4, 2024 05:21 25s