ElasticFlow: an Elastic Serverless Training Platform for Distributed Deep Learning.
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, VOL 2, ASPLOS 2023(2023)
Key words
Distributed Deep Learning,GPU Cluster,Serverless Computing,Cluster Scheduling
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined