
问题 (Question) #1

Open

wuguangshuo opened this issue May 29, 2023 · 2 comments

Comments

@wuguangshuo

Has anyone else run into this error: ValueError: paged_adamw_32bit is not a valid OptimizerNames?

@taishan1994 (Owner)

> Has anyone else run into this error: ValueError: paged_adamw_32bit is not a valid OptimizerNames?

This is a package version problem; upgrade these packages:

pip install -q -U bitsandbytes
pip install -q -U git+https://github.com/huggingface/transformers.git
pip install -q -U git+https://github.com/huggingface/peft.git
pip install -q -U git+https://github.com/huggingface/accelerate.git
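
As a quick sanity check (a minimal sketch, assuming a recent transformers; the output_dir is a placeholder), the error goes away once the installed transformers defines paged_adamw_32bit in its OptimizerNames enum:

from transformers import TrainingArguments
from transformers.training_args import OptimizerNames

# On old transformers this list lacks "paged_adamw_32bit", which is exactly
# why TrainingArguments(optim="paged_adamw_32bit") raises the ValueError.
print([o.value for o in OptimizerNames])

# After upgrading, this constructs without error:
training_args = TrainingArguments(
    output_dir="./output",       # placeholder path
    optim="paged_adamw_32bit",   # the optimizer name from the error message
)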

A review of the issues I previously raised on the qlora repo:
To summarize all the issues:

- LoRA weights are not saved correctly: comment out the following code
  # if args.bits < 16:
  #     old_state_dict = model.state_dict
  #     model.state_dict = (
  #         lambda self, *_, **__: get_peft_model_state_dict(self, old_state_dict())
  #     ).__get__(model, type(model))
- RuntimeError: self and mat2 must have the same dtype: the peft version must be 0.4.0.dev0
  pip install -U git+https://github.com/huggingface/peft.git
- ValueError: Cannot merge LORA layers when the model is loaded in 8-bit mode: just don't call
  model = model.merge_and_unload()
  (if you do need a merged checkpoint, see the sketch after this list)
- RuntimeError: mat1 and mat2 shapes cannot be multiplied (44x6656 and 1x22151168): occurs when you load as below and then move the quantized model manually
  from transformers import AutoModel
  from peft import PeftModel

  model = AutoModel.from_pretrained(args["model_dir"],
                                    trust_remote_code=True,
                                    load_in_4bit=True,
                                    device_map={"": 0})
  model = PeftModel.from_pretrained(model, args["save_dir"], trust_remote_code=True)
  # model.cuda().eval()  <- DO NOT ADD THIS LINE
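
For completeness, a hedged sketch continuing from the snippet above (the tokenizer path and the fp16 merge route are assumptions, not something stated in this thread): inference works directly on the 4-bit base plus adapter, while merging is only possible when the base model is loaded in half precision instead of 4/8-bit.

import torch
from transformers import AutoModel, AutoTokenizer
from peft import PeftModel

# Inference directly on the 4-bit model loaded above (no merge, no .cuda()):
tokenizer = AutoTokenizer.from_pretrained(args["model_dir"], trust_remote_code=True)
inputs = tokenizer("你好", return_tensors="pt").to(model.device)
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))

# If a merged checkpoint is needed, load the base in fp16 (not 4/8-bit) first;
# merge_and_unload() raises the ValueError above on a quantized model:
base = AutoModel.from_pretrained(args["model_dir"],
                                 trust_remote_code=True,
                                 torch_dtype=torch.float16,
                                 device_map={"": 0})
merged = PeftModel.from_pretrained(base, args["save_dir"]).merge_and_unload()
merged.save_pretrained("merged-model")  # hypothetical output directory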

@lunalulu

@wuguangshuo Did you manage to solve it?
