AttributeError: 'LlamaAttention' object has no attribute 'qkv_proj' #497
-
Hello, I'm trying to load a quantized model from the disk of an Ubuntu ec2 instance, however, I keep getting the attached error message, does anybody know the solution? For context, I installed, the auto_gptq using the following command: pip install "auto-gptq==0.4.2" Related Code
Error message |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
UPDATE |
Beta Was this translation helpful? Give feedback.
UPDATE
This was solved by installing auto_gptq from source using
pip install git+https://github.com/PanQiWei/AutoGPTQ.git@v0.4.2