Accumulation Bit-Width Scaling for Ultra-Low Precision Training of Deep Networks.
ICLR 2019(2019)
Key words
reduced precision floating-point,partial sum accumulation bit-width,deep learning,training
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined