Skip to content

Issues: hiyouga/LLaMA-Factory

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Assignee
Filter by who’s assigned
Sort

Issues list

cannot use pure_bf16 with zero3 cpu offload pending This problem is yet to be addressed.
#3476 opened Apr 27, 2024 by mces89
1 task done
How to select Badam optimizer in the web interface? enhancement New feature or request pending This problem is yet to be addressed.
#3473 opened Apr 26, 2024 by leekum2018
1 task done
[Feature Request] 我们需要更灵活的保存策略? pending This problem is yet to be addressed.
#3472 opened Apr 26, 2024 by marko1616
fsdp-qlora yi-34B-chat throw error " ValueError: Cannot flatten integer dtype tensors" pending This problem is yet to be addressed.
#3470 opened Apr 26, 2024 by hellostronger
1 task done
Can't find 'adapter_config.json' solved This problem has been already solved.
#3466 opened Apr 26, 2024 by may012345
1 task done
NameError: name 'awq_ext' is not defined solved This problem has been already solved.
#3465 opened Apr 26, 2024 by Anorid
1 task done
New llama-factory code runs into batch["input_ids"] is None. The old version is ok. pending This problem is yet to be addressed.
#3463 opened Apr 26, 2024 by hnhoangdz
1 task done
deepspeed的bug pending This problem is yet to be addressed.
#3461 opened Apr 26, 2024 by bravelyi
1 task done
评估集上loss、学习率为0 solved This problem has been already solved.
#3457 opened Apr 26, 2024 by sly123197811
1 task done
Support for RLAIF methods pending This problem is yet to be addressed.
#3453 opened Apr 25, 2024 by dineshresearch
Could you please share some tips with your rich experience? pending This problem is yet to be addressed.
#3452 opened Apr 25, 2024 by xiaochengsky
1 task done
关于llama3base版本的评测 pending This problem is yet to be addressed.
#3447 opened Apr 25, 2024 by QingChengLineOne
1 task done
SFT zero2 zero3下loss不一致 pending This problem is yet to be addressed.
#3442 opened Apr 25, 2024 by wsdmanonymous
1 task done
streaming模式和非streaming模式下模型指标差异巨大 pending This problem is yet to be addressed.
#3436 opened Apr 25, 2024 by zhangbin1997
1 task
量化后的gptq模型,部署成openai后调用报错 pending This problem is yet to be addressed.
#3408 opened Apr 24, 2024 by ccp123456789
究竟怎么做dpo呀 pending This problem is yet to be addressed.
#3395 opened Apr 23, 2024 by XuanRen4470
1 task done
Issues of LLaMA3 SFT on multi-nodes pending This problem is yet to be addressed.
#3381 opened Apr 22, 2024 by Liusifei
1 task done
训练一段时间后,在保存文件时,会提示文件夹【拒绝访问】 pending This problem is yet to be addressed.
#3359 opened Apr 20, 2024 by kynow2
1 task done
loss=0 for lora sft Baichuan2-13B-Chat with bf16 pending This problem is yet to be addressed.
#3353 opened Apr 19, 2024 by conderls
1 task done
ProTip! Type g i on any issue or pull request to go back to the issue listing page.