An explanation of the model selection rule in Figure 1 #21

Open
EliverQ opened this issue Apr 18, 2023 · 0 comments
EliverQ commented Apr 18, 2023

As we mention in the survey, Figure 1 only includes LLMs (larger than 10B) with publicly reported evaluation results. Models with papers generally qualify, because formal evaluation results are usually included in the paper. The models without papers are Cohere, YaLM, Luminous, ChatGPT, Bard, and Vicuna. Among these models:

  • Cohere, YaLM, Luminous, and ChatGPT are evaluated by HELM.
  • Vicuna reports its results compared with other models here.
  • Bard is evaluated by paper 1, paper 2, and paper 3.
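
The rule above can be sketched as a simple predicate. This is only an illustrative sketch, not code from the survey: the `ModelEntry` fields, the `included_in_figure` helper, and the sample entries (`model-a` to `model-d`) are all hypothetical, and it assumes that having a paper counts as having publicly reported evaluation results.

```python
from dataclasses import dataclass


@dataclass
class ModelEntry:
    """Hypothetical record for a candidate model (not from the survey)."""
    name: str
    params_billion: float  # model size in billions of parameters
    has_paper: bool        # assumed to imply formal evaluation results
    external_eval: bool    # evaluated publicly elsewhere (benchmark, blog, other papers)


def included_in_figure(model: ModelEntry) -> bool:
    """Larger than 10B AND publicly reported evaluation results."""
    has_public_results = model.has_paper or model.external_eval
    return model.params_billion > 10 and has_public_results


# Made-up entries that only exercise each branch of the rule.
candidates = [
    ModelEntry("model-a", 52.0, has_paper=True, external_eval=False),
    ModelEntry("model-b", 13.0, has_paper=False, external_eval=True),
    ModelEntry("model-c", 7.0, has_paper=True, external_eval=True),
    ModelEntry("model-d", 100.0, has_paper=False, external_eval=False),
]

selected = [m.name for m in candidates if included_in_figure(m)]
print(selected)
```

In this sketch `model-c` is dropped for being under 10B and `model-d` for lacking any public evaluation, so only `model-a` and `model-b` are selected.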

Some models do not meet these criteria but have played an important role in the development of large language models. We add them to the list and provide the corresponding links for readers who need them.
We will continue to collect related models, but will not add them until May 2023. Please let us know if you come across any models that meet the inclusion criteria. Thank you to everyone who provided suggestions for our paper.

EliverQ closed this as completed Apr 18, 2023
EliverQ reopened this Apr 18, 2023
EliverQ pinned this issue Apr 18, 2023
EliverQ added the "good first issue" label Apr 18, 2023