New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[MTL][Qwen] model fail RuntimeError: "normal_kernel_cpu" not implemented for 'Byte' #10826
Comments
Sorry that for ipex-llm==2.5.0b20240421 && bigdl-core-xe-21 == 2.5.0b20240421, I can't reproduce this this issue on both arc and mtl(at least load_low_bit works fine)
|
With passing
|
Shall we update our save/load example to explicitly add this parameter? |
Yes and I think we can do this in our |
Upgrading to ipex-llm from bigdl-llm, meet below issue, besides that we also meet accuracy issue.
2024-04-21 21:35:27,923 - INFO - Converting the current model to sym_int4 format......
Traceback (most recent call last):
File "C:\Users\test\Documents\qwen_validate\ultra_test_code_and_data\benchmark_test2intel\gen_prediction.py", line 78, in
model = AutoModelForCausalLM.load_low_bit(model_path, trust_remote_code=True, optimize_model=True).eval()
File "C:\Users\test\Documents\qwen_validate\ultra_test_code_and_data\env\lib\site-packages\ipex_llm\transformers\model.py", line 657, in load_low_bit
) = model_class._load_pretrained_model(
File "C:\Users\test\Documents\qwen_validate\ultra_test_code_and_data\env\lib\site-packages\transformers\modeling_utils.py", line 3125, in _load_pretrained_model
model.apply(model._initialize_weights)
File "C:\Users\test\Documents\qwen_validate\ultra_test_code_and_data\env\lib\site-packages\torch\nn\modules\module.py", line 897, in apply
module.apply(fn)
File "C:\Users\test\Documents\qwen_validate\ultra_test_code_and_data\env\lib\site-packages\torch\nn\modules\module.py", line 897, in apply
module.apply(fn)
File "C:\Users\test\Documents\qwen_validate\ultra_test_code_and_data\env\lib\site-packages\torch\nn\modules\module.py", line 897, in apply
module.apply(fn)
[Previous line repeated 2 more times]
File "C:\Users\test\Documents\qwen_validate\ultra_test_code_and_data\env\lib\site-packages\torch\nn\modules\module.py", line 898, in apply
fn(self)
File "C:\Users\test\Documents\qwen_validate\ultra_test_code_and_data\env\lib\site-packages\transformers\modeling_utils.py", line 1261, in _initialize_weights
self._init_weights(module)
File "C:\Users\test/.cache\huggingface\modules\transformers_modules\us_qwen_0435_r2-sym_int4\modeling_qwen.py", line 697, in init_weights
module.weight.data.normal(mean=0.0, std=self.config.initializer_range)
RuntimeError: "normal_kernel_cpu" not implemented for 'Byte'
The text was updated successfully, but these errors were encountered: