How Should We Build A Benchmark? Revisiting 274 Code-Related Benchmarks for LLMs
Jialun Cao, Yuk-Kit Chan, Zixuan Ling, Wenxuan Wang, Shuqing Li, Mingwei Liu, Ruixi Qiao, Yuting Han, Chaozheng Wang, Boxi Yu, Pinjia He, Shuai Wang, Zibin Zheng, Michael R. Lyu, Shing-Chi Cheung
CoRR (2025)