Issues: EleutherAI/lm-evaluation-harness

Issues list

can we add C4 and PTB tasks for PpL? (labels: feature request)
#1884 opened May 25, 2024 by 123wujiao
Add Regression Testing (labels: feature request, good first issue, help wanted)
#1883 opened May 24, 2024 by haileyschoelkopf
eval with Alpaca template
#1882 opened May 24, 2024 by oneonlee
Evaluation MC Questions
#1875 opened May 23, 2024 by kangqi-ni
chat model evaluation
#1870 opened May 22, 2024 by jordane95
Add more math evaluation tasks
#1869 opened May 22, 2024 by jordane95
--device cuda:3 not honored when using --model vllm (labels: bug, documentation)
#1846 opened May 15, 2024 by LGLG42
How to use Zeno
#1842 opened May 14, 2024 by DavidAdamczyk
sha256 for datasets or samples
#1836 opened May 13, 2024 by artemorloff
Using Language Models as Evaluators (labels: feature request)
#1831 opened May 13, 2024 by lintangsutawika
Errors when loading exact_match.py
#1830 opened May 13, 2024 by twxin
Add More Tests (labels: feature request)
#1827 opened May 12, 2024 by haileyschoelkopf