Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Not a problem - but like people should know #26

Open
Atlas3DSS opened this issue Mar 26, 2023 · 2 comments
Open

Not a problem - but like people should know #26

Atlas3DSS opened this issue Mar 26, 2023 · 2 comments
Labels
documentation Improvements or additions to documentation

Comments

@Atlas3DSS
Copy link

https://arxiv.org/abs/2303.11366 Is a really cool paper about reflection in LLMs
image
That is after training on like 20 samples for 50 epochs on my 3090 on the 7B model.

User: [Topic or question]

Assistant Hypothetical Response: [Brief or simplified answer to the topic or question]

Agent Reflection: [Critique of the hypothetical response, highlighting the limitations, inaccuracies, or areas that need improvement or expansion, while providing guidance on how to address these issues in the revised response]

Bot Actual Response: [The natural and contextually appropriate answer to the topic or question, as generated by the advanced language model, which incorporates the suggestions and improvements from the agent reflection for a more comprehensive and accurate response]

This + training sets generated with this frame work seem to really improve the generations of these models with fairly limited training sets. Just thought i would share.

@lxe
Copy link
Owner

lxe commented Mar 28, 2023

Nice to see you can get this from such a small sample set!

@lxe lxe added the documentation Improvements or additions to documentation label Mar 28, 2023
@Atlas3DSS
Copy link
Author

I have been keeping track of my datasets if anyone else wants to play they are here
https://docs.google.com/spreadsheets/d/1QSwJFiyzUQ6H1CloDmJWcHJfYiT7SVxfwBDOOcbvFEo/edit?usp=sharing

Thank you again for making this lovely tool.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

No branches or pull requests

2 participants