MMLU-Pro: a harder version of MMLU with 10 answer choices per question instead of 4.
pip install lm-eval && lm_eval --model hf --tasks mmlu_pro --batch_size auto
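
The one-liner above does not name a model, so the hf backend runs whatever its default checkpoint is. A minimal sketch of the same run with the model and an output directory spelled out follows. MODEL and OUT_DIR are hypothetical placeholders, not values from the original note, and the script only prints the command unless RUN=1 is set.

```shell
#!/bin/sh
# Sketch: the same mmlu_pro run, with the model named explicitly and results saved.
set -eu

# Hypothetical placeholders (assumptions); substitute your own values.
MODEL="${MODEL:-your-org/your-model}"    # Hugging Face model id or local path
OUT_DIR="${OUT_DIR:-results/mmlu_pro}"   # directory where lm_eval writes its results

# Build the command as positional parameters so each argument stays one word.
set -- lm_eval --model hf \
  --model_args "pretrained=${MODEL}" \
  --tasks mmlu_pro \
  --batch_size auto \
  --output_path "${OUT_DIR}"

# Keep a printable copy of the command line.
CMD_LINE="$*"

# Dry run by default: print the command. Set RUN=1 to actually execute it.
if [ "${RUN:-0}" = "1" ]; then
  "$@"
else
  echo "$CMD_LINE"
fi
```

Usage, assuming the script is saved as run_mmlu_pro.sh (a hypothetical name): `MODEL=<model-id> RUN=1 sh run_mmlu_pro.sh`. Printing first makes it easy to check the arguments before starting a long evaluation.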