Chrome Extension
WeChat Mini Program
Use on ChatGLM

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation.

Weihao Xuan, Rui Yang, Heli Qi,Qingcheng Zeng, Yunze Xiao, Aosong Feng, Dairui Liu,Yun Xing, Junjue Wang, Fan Gao, Jinghui Lu, Yuang Jiang, Huitao Li,Xin Li, Kunyu Yu, Ruihai Dong, Shangding Gu, Yuekang Li, Xiaofei Xie, Felix Juefei-Xu, Foutse Khomh, Osamu Yoshie,Qingyu Chen,Douglas Teodoro,Nan Liu, Randy Goebel, Lei Ma,Edison Marrese-Taylor,Shijian Lu,Yusuke Iwasawa,Yutaka Matsuo,Irene Li

CoRR(2025)

Cited 0|Views5
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined