Exact Fused Dot Product Add Operators

Oregane Desrentes, Benoit Dupont de Dinechin,Florent de Dinechin

2023 IEEE 30TH SYMPOSIUM ON COMPUTER ARITHMETIC, ARITH 2023(2023)

引用 0|浏览2
暂无评分
摘要
This article explores architectures of exact (correctly rounded) fused dot product and add operators suitable for the FP32 and FP64 binary floating-point representations with sub-normal support, and other representations with a wide dynamic range such as bfloat16. The exact summation of terms before rounding requires a full-size accumulator, and this work discusses techniques to compress the identical bits of this accumulator. This requires the computation of the relative shift amounts of the terms, which is formulated as a parallel prefix algorithm, allowing for a low-latency implementation. Architectural options for the exact fused dot product and add operators with up to 16 products for FP32, FP64 and mixed-precision BF16 to FP32 are evaluated using the TSMC 16FFC technology node.
更多
查看译文
关键词
dot product,BF16,FP32,FP64,three-term sum
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要