Run MMLU on any LLM
Choose LLM
API Endpoint
Borg Cloud
OpenAI
Google AI Studio
LLaMA.com
OpenRouter
Model Name
Access Token
Use MMLU-Light (faster evaluation)
Run Benchmark
Cancel
Benchmark Results
Public Model Results
Copied to clipboard!
Need a better benchmark? Try our
Arena
.