[SYCL] Intel SYCL runtime support for AutoGPTQ #638

abhilash1910 · 2024-04-11T18:54:55Z

Hello @PanQiWei . @qwopqwop200 , @fxmarty .Greetings from Intel. Thanks for creating AutoGPTQ.

Considering adoption of more SYCL runtime across different vendors and specifically for Intel accelerators, this is an effort to upstream the cuda kernels for (gemm/non gemm quants) to SYCL runtime . This is an approach taken to further extend Intel accelerator support as well as provide an alternative compiler runtime for non cuda specific devices.

The proposal is to extend SYCL runtime through DPCT toolchain and to support a standard runtime for different vendors.
Since the idea is to provide better sycl compiler characterization, this PR would help enable the framework for a wide range of accelerators working on SYCL runtime.S
An initial kernel prototype is added, and further extension kernels will be ported to support SYCL backend in next couple of days. (Draft mode).
An extended idea: (if community is interested) SYCL maintenance will be extended by myself (team) and external contributors and depending upon support received we plan to integrate nv/amd and other vendor support for SYCL runtime for AutoGPTQ.

Qubitium · 2024-04-25T18:18:09Z

@abhilash1910 Is SYCL compatible with amd x86_64? also curious if this would be a good candidate replace all the current qigen code?

abhilash1910 · 2024-04-27T18:17:13Z

Hi @Qubitium , yes there is experimental support for amdgcn architecture , and we have to specify the build target from fsycl to point to that. I will be picking things up from next week as was out of office this week.

add sycl 256 kernel

400dc47

abhilash1910 marked this pull request as draft April 11, 2024 19:03

fxmarty self-requested a review April 12, 2024 08:57

abhilash1910 added 3 commits April 17, 2024 05:30

initial build

99631b9

add atomic operations from sycl

e4abdec

add 64 kernels

0fb5583

Qubitium mentioned this pull request Apr 27, 2024

[DEPRECATION] Discussion on Fused attention and QiGEN #655

Open

add exllama kernel wo reconstruction

4756b1c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCL] Intel SYCL runtime support for AutoGPTQ #638

[SYCL] Intel SYCL runtime support for AutoGPTQ #638

abhilash1910 commented Apr 11, 2024

Qubitium commented Apr 25, 2024

abhilash1910 commented Apr 27, 2024

[SYCL] Intel SYCL runtime support for AutoGPTQ #638

Are you sure you want to change the base?

[SYCL] Intel SYCL runtime support for AutoGPTQ #638

Conversation

abhilash1910 commented Apr 11, 2024

Qubitium commented Apr 25, 2024

abhilash1910 commented Apr 27, 2024