MegaScale: Scaling Large Language Model Training to More Than 10, 000 GPUs.

Ziheng Jiang,Haibin Lin, Yinmin Zhong, Qi Huang,Yangrui Chen,Zhi Zhang,Yanghua Peng,Xiang Li,Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen,Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu,Haoran Wei, Zhang Zhang, Pengfei Nie, Leqi Zou, Sida Zhao, Liang Xiang, Zherui Liu,Zhe Li, Xiaoying Jia, Jianxi Ye,Xin Jin,Xin Liu

Symposium on Networked Systems Design and Implementation(2024)

引用 0|浏览7
暂无评分
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要