Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ELO ranking score? #47

Open
Tokkiu opened this issue Apr 10, 2024 · 3 comments
Open

ELO ranking score? #47

Tokkiu opened this issue Apr 10, 2024 · 3 comments

Comments

@Tokkiu
Copy link

Tokkiu commented Apr 10, 2024

截屏2024-04-10 11 34 05

How to generate this ranking? If I added new model, how to reproduce this benchmark?

@Tokkiu
Copy link
Author

Tokkiu commented Apr 10, 2024

My new model is implemented in this pr. https://github.com/OpenGenerativeAI/llm-colosseum/pull/45/files
You can watch the video of my model vs mistral at here.
https://github.com/Tokkiu/llm-colosseum?tab=readme-ov-file#1-vs-1-mistral-vs-solar

@shawokou123
Copy link

我的新模型已经在这个 PR 中实现。https://github.com/OpenGenerativeAI/llm-colosseum/pull/45/files您可以在这里观看我的模型与 Mistral 的视频。 https://github.com/Tokkiu/llm-colosseum?tab=readme-ov-file#1-vs-1-mistral-vs-solar

你好璟琦,我对这个项目也非常感兴趣,可以交流吗?

@taozhiyuai
Copy link

I just launch 50 rounds for two models. the result shows who is a better models. at the moment, Gemma 7B is the best. v1.1 is worse.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants