Thanks for this neat repo, very convenient for evaluating LLMs!

As a feature request, I would like to suggest adding an option to save the results of an evaluation for the implemented tasks, to allow for easier analytics. My understanding is that the current main.py only prints results.

It would also be useful to store scores per sub-task for tasks like MMLU or BBH.
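For reference, here is a minimal sketch of what such an option could look like: dumping a results dictionary to a JSON file, with per-sub-task scores nested under each task. The `save_results` helper, the output path, the results layout, and the score values are all hypothetical placeholders for illustration, not part of the repo's existing code.

```python
import json
from pathlib import Path


def save_results(results: dict, output_path: str = "results.json") -> None:
    """Write evaluation results (including per-sub-task scores) to a JSON file."""
    path = Path(output_path)
    path.parent.mkdir(parents=True, exist_ok=True)  # create output dir if needed
    with path.open("w") as f:
        json.dump(results, f, indent=2)
    print(f"Saved results to {path}")


# Hypothetical example: overall and per-sub-task scores for a task like MMLU.
# The numbers below are dummy placeholder values.
results = {
    "mmlu": {
        "overall": 0.62,
        "subtasks": {
            "abstract_algebra": 0.41,
            "world_religions": 0.78,
        },
    },
}
save_results(results, "outputs/mmlu_results.json")
```

Something like a `--output_path` flag on main.py that triggers this kind of dump would cover the use case.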