Skip to content

Commit

Permalink
.
Browse files Browse the repository at this point in the history
  • Loading branch information
aidando73 committed Nov 23, 2024
1 parent 54db9e4 commit 7ca3132
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion tools/benchmarks/llm_eval_harness/meta_eval/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -48,7 +48,7 @@ Given the extensive number of tasks available (12 for pretrained models and 30 f
- **Tasks for 3.1 pretrained models**: BBH and MMLU-Pro
- Chosen as they overlap with the Hugging Face [Open LLM Leaderboard v2](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
- **Tasks for 3.2 pretrained models**: MMLU
- Chosen because MMLU is a common eval, and is the first one shown on on the [llama website](https://llama.com)
- Chosen because MMLU is a common eval, and is the first one shown on on [llama.com](https://llama.com)
- **Tasks for 3.1 instruct models**: Math-Hard, IFeval, GPQA, and MMLU-Pro
- Chosen as they overlap with the Hugging Face [Open LLM Leaderboard v2](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)

Expand Down

0 comments on commit 7ca3132

Please sign in to comment.