ELO ranking score? #47

Tokkiu · 2024-04-10T03:34:50Z

How to generate this ranking? If I added new model, how to reproduce this benchmark?

Tokkiu · 2024-04-10T03:35:49Z

My new model is implemented in this pr. https://github.com/OpenGenerativeAI/llm-colosseum/pull/45/files
You can watch the video of my model vs mistral at here.
https://github.com/Tokkiu/llm-colosseum?tab=readme-ov-file#1-vs-1-mistral-vs-solar

shawokou123 · 2024-04-10T05:15:55Z

我的新模型已经在这个 PR 中实现。https://github.com/OpenGenerativeAI/llm-colosseum/pull/45/files您可以在这里观看我的模型与 Mistral 的视频。 https://github.com/Tokkiu/llm-colosseum?tab=readme-ov-file#1-vs-1-mistral-vs-solar

你好璟琦，我对这个项目也非常感兴趣，可以交流吗？

taozhiyuai · 2024-04-12T12:24:01Z

I just launch 50 rounds for two models. the result shows who is a better models. at the moment, Gemma 7B is the best. v1.1 is worse.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ELO ranking score? #47

ELO ranking score? #47

Tokkiu commented Apr 10, 2024

Tokkiu commented Apr 10, 2024

shawokou123 commented Apr 10, 2024

taozhiyuai commented Apr 12, 2024

ELO ranking score? #47

ELO ranking score? #47

Comments

Tokkiu commented Apr 10, 2024

Tokkiu commented Apr 10, 2024

shawokou123 commented Apr 10, 2024

taozhiyuai commented Apr 12, 2024