infer_lora_finetuning.py cannot load the model across multiple GPUs, and loading on a single 12 GB GPU runs out of memory. #7

Open
steamfeifei opened this issue Jun 29, 2023 · 3 comments

Comments

@steamfeifei

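When infer_lora_finetuning.py loads the model, it cannot spread it across multiple GPUs, and loading on a single 12 GB GPU fails with an out-of-memory error.
Is there any way to configure the model to load across multiple GPUs?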
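
For reference, a minimal sketch of one common way to shard a model over several GPUs with Hugging Face `transformers` and `accelerate`. Nothing in this thread confirms that infer_lora_finetuning.py supports it: the helper names `build_max_memory` and `load_sharded`, the 2 GiB reserve, and the choice of `AutoModelForCausalLM` are all assumptions made for illustration.

```python
def build_max_memory(num_gpus: int, per_gpu_gib: int, reserve_gib: int = 2) -> dict:
    """Build per-GPU memory caps in the {device_index: "NGiB"} form used by `max_memory`.

    `reserve_gib` is an assumed safety margin left free for activations.
    """
    usable = per_gpu_gib - reserve_gib
    if usable <= 0:
        raise ValueError("reserve_gib must be smaller than per_gpu_gib")
    return {gpu: f"{usable}GiB" for gpu in range(num_gpus)}


def load_sharded(model_path: str, num_gpus: int, per_gpu_gib: int = 12):
    """Hypothetical loader: let accelerate place layers across the visible GPUs."""
    # Imported lazily so the helper above works without torch installed.
    import torch
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(
        model_path,                      # assumption: a transformers-compatible checkpoint
        torch_dtype=torch.float16,
        device_map="auto",               # requires the `accelerate` package
        max_memory=build_max_memory(num_gpus, per_gpu_gib),
    )
```

With two 12 GB cards, `build_max_memory(2, 12)` caps each device at 10 GiB. Some checkpoints also need `trust_remote_code=True`, which is left out here because the model is not named in this issue.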

@ssbuild
Owner

ssbuild commented Jun 30, 2023

Quantize the weights, save them, and then run inference on the smaller GPU.
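
A rough back-of-the-envelope sketch of why quantization can help here. The model size is not stated in this thread, so the 7-billion-parameter figure below is purely hypothetical, and the estimate covers weights only (activations, KV cache, and framework overhead come on top).

```python
# Approximate storage cost per parameter for common precisions.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(num_params: float, dtype: str) -> float:
    """Return the memory needed to hold the weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


def fits_on_gpu(num_params: float, dtype: str, gpu_gib: float) -> bool:
    """True if the weights alone fit in `gpu_gib`; real usage needs extra headroom."""
    return weight_memory_gib(num_params, dtype) < gpu_gib


# Hypothetical 7e9-parameter model on a 12 GiB card:
# fp16 weights need roughly 13 GiB, so they do not fit;
# int8 and int4 weights come in well under 12 GiB.
```

Under that assumption, `fits_on_gpu(7e9, "fp16", 12)` is false while `fits_on_gpu(7e9, "int4", 12)` is true, which matches the suggestion to quantize before moving to the small card.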

@steamfeifei
Author

steamfeifei commented Jun 30, 2023

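I fine-tuned with LoRA on questions such as "Who are you?", but the answers are still similar to the original model's. Is there any way to get it to correct this?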
image

@ssbuild
Owner

ssbuild commented Jun 30, 2023

Feed it more data.
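
One way to read this advice is that a handful of identity examples rarely overrides what the base model already learned, so the training set needs many paraphrases of the question. A minimal sketch of generating such samples follows; the `instruction`/`output` field names, the placeholder names, and the JSONL layout are assumptions, not this repository's documented data format.

```python
import json


def build_identity_samples(assistant_name: str, developer: str, questions: list) -> list:
    """Pair every paraphrased identity question with the same target answer."""
    answer = f"I am {assistant_name}, an assistant fine-tuned by {developer}."
    return [{"instruction": q, "output": answer} for q in questions]


def to_jsonl(samples: list) -> str:
    """Serialize samples one JSON object per line, keeping non-ASCII text readable."""
    return "\n".join(json.dumps(s, ensure_ascii=False) for s in samples)


# Hypothetical paraphrases; a real set would contain many more variants.
questions = ["Who are you?", "你是谁?", "What is your name?", "Who made you?"]
samples = build_identity_samples("MyBot", "MyTeam", questions)
jsonl_text = to_jsonl(samples)
```

The field names would need to be adapted to whatever format the training script actually expects.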


2 participants