Can a single 4090 GPU run continued pre-training on TinyLLaVA-1.5B? #51

Open
Lyzz123 opened this issue Apr 22, 2024 · 4 comments

Comments

@Lyzz123

Lyzz123 commented Apr 22, 2024

No description provided.

@baichuanzhou
Contributor

Yes, it can. Please see this example: https://github.com/DLCV-BUAA/TinyLLaVABench/blob/main/docs/Evaluation.md

@Lyzz123
Author

Lyzz123 commented Apr 23, 2024

Hello, isn't that document about evaluation?

@baichuanzhou
Contributor

@Lyzz123
Author

Lyzz123 commented Apr 25, 2024

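I get the following message when running the LoRA fine-tuning script:
Token indices sequence length is longer than the specified maximum sequence length for this model (3377 > 3072). Running this sequence through the model will result in indexing errors.
Does this have any effect? Do I need to change the --model_max_length parameter?

Also, what loss value should training reach before it is considered good enough?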
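
For context on the length message quoted above: it is the kind of warning a Hugging Face tokenizer prints when one encoded sample has more tokens than the tokenizer's `model_max_length` and no truncation was requested in that call. Whether it matters depends on whether the training code truncates the sequence afterwards, which is something to verify in the script rather than assume. The sketch below is only an illustration in plain Python: the helper names `count_overlong` and `truncate_ids` and the sample token counts are hypothetical, and only 3377 and 3072 come from the message itself.

```python
from typing import List, Sequence


def count_overlong(lengths: Sequence[int], model_max_length: int) -> int:
    """Return how many samples have more tokens than model_max_length."""
    return sum(1 for n in lengths if n > model_max_length)


def truncate_ids(input_ids: List[int], model_max_length: int) -> List[int]:
    """Keep only the first model_max_length token ids of one sample."""
    return input_ids[:model_max_length]


# Hypothetical per-sample token counts; 3377 is the length from the warning.
lengths = [512, 3377, 2048]
# 3072 is the limit reported in the warning.
model_max_length = 3072

overlong = count_overlong(lengths, model_max_length)
# Stand-in token ids for the 3377-token sample (real ids come from a tokenizer).
truncated = truncate_ids(list(range(3377)), model_max_length)

print(f"{overlong} of {len(lengths)} samples exceed {model_max_length} tokens")
print(f"length after truncation: {len(truncated)}")
```

In a real run the token counts would come from the tokenizer itself, for example `len(tokenizer(text).input_ids)` per sample. If only a few samples exceed the limit, truncating them loses the tail of those conversations. If many do, raising `--model_max_length` may be an option, provided the model and the GPU memory can support the longer sequences.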
