prompt_style #1416

Closed
fireyanci opened this issue May 13, 2024 · 4 comments

Comments

@fireyanci

I don't want to use any of the dataset styles listed in the prompt_styles dict. I want to use a dataset style that I define myself. How can I build my own dataset style for use with finetune/lora? My dataset style is:
{
  "conversation": [
    {
      "system": "This is like an instruction",
      "input": "",
      "output": ""
    }
  ]
}

@fireyanci
Author

Because I want to use multi-round conversation data.

@rasbt
Collaborator

rasbt commented May 20, 2024

I think the easiest way here would be to use one of the existing datasets as a template. I remember that Deita had multi-turn questions in the dataset, so I added this as an option. Maybe this is helpful as a template for building your own dataset:

include_multiturn_conversations: bool = False

But note that LitGPT otherwise doesn't do anything special for multi-turn data. It basically treats a multi-turn example as just another regular input example during training.
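
To illustrate what "another regular input example" could look like in practice, here is a minimal sketch that flattens a record in the "conversation" format from the question into one single-turn sample per turn. It is not LitGPT code. The function name flatten_conversation, the "User:"/"Assistant:" labels, and the choice to carry the system message and earlier turns in the "input" field are all assumptions made for illustration. The "instruction"/"input"/"output" keys follow the Alpaca-style layout, which I believe LitGPT's JSON data module reads, but please check that against the version you use.

```python
import json


def flatten_conversation(record):
    """Turn one {"conversation": [...]} record into one sample per turn.

    Each sample uses Alpaca-style keys (an assumption, see the note above):
    the current user message becomes "instruction", and the system message
    plus all earlier turns are passed along as context in "input".
    """
    samples = []
    system = ""
    history = []  # rendered text of the turns seen so far
    for turn in record["conversation"]:
        # Assumption: a "system" message stays in effect for later turns.
        system = turn.get("system") or system
        context = "\n".join([system, *history] if system else history)
        samples.append({
            "instruction": turn["input"],
            "input": context,
            "output": turn["output"],
        })
        # Hypothetical role labels; any consistent format would do.
        history.append(f"User: {turn['input']}\nAssistant: {turn['output']}")
    return samples


# Made-up two-turn record in the format from the question.
record = {
    "conversation": [
        {"system": "You are a helpful assistant.", "input": "Hi", "output": "Hello!"},
        {"input": "What is 2+2?", "output": "4"},
    ]
}

samples = flatten_conversation(record)
print(json.dumps(samples, indent=2))
```

Each turn becomes its own training example, so the model sees the earlier turns only as plain text inside the prompt. That matches the behavior described above: nothing multi-turn-specific happens during training.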

@fireyanci
Author

fireyanci commented May 21, 2024

Thank you very much for your reply. I've read your explanation of DoRA, and it's excellent. Thank you. I hope to be able to use it in the LitGPT project.

@rasbt
Collaborator

rasbt commented May 21, 2024

Glad to hear you found it useful! There are currently so many todos, but yeah, adding DoRA to LitGPT at some point would be great.

@rasbt rasbt closed this as completed May 21, 2024