Run MMLU on any LLM

Choose LLM
Benchmark Results
Public Model Results
Copied to clipboard!

Need a better benchmark? Try our Arena.