Having issue loading my HQQ quantized model #35

BeichenHuang · 2024-04-24T12:34:34Z

Hi! I am trying to load my HQQ-quantized model using the offloading strategy, but I have a problem with the model's safetensors files.
I notice that in your HQQ-quantized model's safetensors files, the weights (taking layer 0, expert 1 as an example) are saved like:

But when I use the code from the official HQQ website, the saved model is only one .pt file:
How can I split the weights into all those components?
