Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models
Dheevatsa Mudigere,Yuchen Hao,Jianyu Huang,Zhihao Jia,Andrew Tulloch,Srinivas Sridharan,Xing Liu,Mustafa Ozdal,Jade Nie,Jongsoo Park,Liang Luo, Jie (Amy) Yang,Leon Gao,Dmytro Ivchenko,Aarti Basant,Yuxi Hu,Jiyan Yang,Ehsan K. Ardestani,Xiaodong Wang,Rakesh Komuravelli,Ching-Hsiang Chu,Serhat Yilmaz,Huayu Li,Jiyuan Qian,Zhuobo Feng,Yinbin Ma, Junjie Yang,Ellie Wen,Hong Li,Lin Yang,Chonglin Sun,Whitney Zhao,Dimitry Melts,Krishna Dhulipala, K. R. Kishore, Tyler Graf,Assaf Eisenman,Kiran Kumar Matam,Adi Gangidi,Guoqiang Jerry Chen, Manoj Krishnan,Avinash Nayak,Krishnakumar Nair,Bharath Muthiah,Mahmoud Khorashadi,Pallab Bhattacharya,Petr Lapukhov,Maxim Naumov, Ajit Mathews,Lin Qiao,Mikhail Smelyanskiy,Bill Jia,Vijay Rao International Symposium on Computer Architecture(2021)
关键词
Deep Learning,Large-Scale Optimization
AI 理解论文
溯源树
样例
