Chrome Extension
WeChat Mini Program
Use on ChatGLM

Challenging the Boundaries of Reasoning: an Olympiad-Level Math Benchmark for Large Language Models

Haoxiang Sun, Yingqian Min,Zhipeng Chen,Wayne Xin Zhao, Lei Fang,Zheng Liu,Zhongyuan Wang,Ji-Rong Wen

arxiv(2025)

Cited 0|Views7
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined