Constitutional AI models do not achieve MT-Bench scores as reported #145

JingtongSu · 2024-03-27T19:03:13Z

Hi, thanks for your great work!

I'm especially interested in the recently-introduced constitutional-ai tuning in this blog post. I've found the open-source SFT model and DPO model on huggingface. However, when I tried to launch the MT-Bench test with them, the returned results are significantly worse than those reported in the blog post, according to the figure below (which I've copied over here for reference):

The MT-Bench score I've collected are 5.33 / 6.39 for the SFT / DPO model respectively, where the reference figure shows (approximately) 6.5 / 7.2.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Constitutional AI models do not achieve MT-Bench scores as reported #145

Constitutional AI models do not achieve MT-Bench scores as reported #145

JingtongSu commented Mar 27, 2024

Constitutional AI models do not achieve MT-Bench scores as reported #145

Constitutional AI models do not achieve MT-Bench scores as reported #145

Comments

JingtongSu commented Mar 27, 2024