Numerical Reproducibility and Accuracy at ExaScale

Computer Arithmetic (2013)

Abstract
Given current hardware trends, ExaScale computing (10^18 floating point operations per second) is projected to be available in less than a decade, achieved by using a huge number of processors, of order 10^9. Given the likely hardware heterogeneity in both platform and network, and the possibility of intermittent failures, dynamic scheduling will be needed to adapt to changing resources and loads. This will make it likely that repeated runs of a program will not execute operations like reductions in exactly the same order. This in turn will make reproducibility, i.e. getting bitwise identical results from run to run, difficult to achieve, because floating point operations like addition are not associative, so computing sums in different orders often leads to different results. Indeed, this is already a challenge on today's platforms.
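The non-associativity claim can be illustrated with a minimal sketch in Python, assuming IEEE 754 double precision with round-to-nearest-even (the default for Python floats on common platforms). The helper name `naive_sum` and the sample values are hypothetical and chosen only for illustration. `math.fsum` from the standard library is shown as one order-independent alternative; it is not the method proposed in the paper.

```python
import math

def naive_sum(values):
    """Left-to-right floating point summation, as a simple reduction would do."""
    total = 0.0
    for v in values:
        total += v
    return total

# Grouping changes the result even for three small numbers.
left_grouped = (0.1 + 0.2) + 0.3
right_grouped = 0.1 + (0.2 + 0.3)

# The same four values summed in two different orders (hypothetical data).
order_a = [1e16, 1.0, -1e16, 1.0]   # the first 1.0 is absorbed by 1e16
order_b = [1e16, -1e16, 1.0, 1.0]   # the large terms cancel first

sum_a = naive_sum(order_a)
sum_b = naive_sum(order_b)

# math.fsum returns the correctly rounded sum, so it does not depend on order.
exact_a = math.fsum(order_a)
exact_b = math.fsum(order_b)

print(left_grouped == right_grouped)  # False
print(sum_a, sum_b)                   # 1.0 2.0
print(exact_a, exact_b)               # 2.0 2.0
```

In a dynamically scheduled parallel reduction, the order of partial sums plays the role of `order_a` versus `order_b`, which is why bitwise identical results are not guaranteed from run to run.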
Keywords
numerical reproducibility, dynamic scheduling, current hardware trend, exascale computing, likely hardware heterogeneity, different order, bitwise identical result, floating point operation, huge number, different result, repeated run, accuracy, hardware, floating point arithmetic, shape, parallel processing, computational modeling, addition