manual vault backup: 2024-06-01 - 1 files

Affected files: Resources/BENCHMARKS.md
swyxio · Jun 1, 2024 · bb0e3ad · bb0e3ad
1 parent 17c7623
commit bb0e3ad
Showing 1 changed file with 1 addition and 0 deletions.
diff --git a/Resources/BENCHMARKS.md b/Resources/BENCHMARKS.md
@@ -3,6 +3,7 @@ Benchmarks exist between the Data and Models, and are the least obvious/glamorou
 
 easiest way i know to run the benchmarks yourself is https://github.com/EleutherAI/lm-evaluation-harness
 - which was forked from the MMLU test https://huggingface.co/blog/evaluating-mmlu-leaderboard and is also related to the stanford HELM impl
+- and drives the Open LLM Leaderboard https://github.com/huggingface/blog/blob/main/open-llm-leaderboard-mmlu.md
 openai evals is promising but doesnt have most of them implemented yet