Parallel random numbers: as easy as 1, 2, 3

SC(2011)

引用 337|浏览391
暂无评分
摘要
Most pseudorandom number generators (PRNGs) scale poorly to massively parallel high-performance computation because they are designed as sequentially dependent state transformations. We demonstrate that independent, keyed transformations of counters produce a large alternative class of PRNGs with excellent statistical properties (long period, no discernable structure or correlation). These counter-based PRNGs are ideally suited to modern multi-core CPUs, GPUs, clusters, and special-purpose hardware because they vectorize and parallelize well, and require little or no memory for state. We introduce several counter-based PRNGs: some based on cryptographic standards (AES, Threefish) and some completely new (Philox). All our PRNGs pass rigorous statistical tests (including TestU01's BigCrush) and produce at least 264 unique parallel streams of random numbers, each with period 2128 or more. In addition to essentially unlimited parallel scalability, our PRNGs offer excellent single-chip performance: Philox is faster than the CURAND library on a single NVIDIA GPU.
更多
查看译文
关键词
unlimited parallel scalability,excellent statistical property,parallel random number,curand library,unique parallel stream,cryptographic standard,sequentially dependent state transformation,excellent single-chip performance,counter-based prngs,rigorous statistical test,long period,statistical test,memory management,cryptography,gpgpu,random number generation,generators,hardware,statistical testing,chip,radiation detector,radiation detectors,pseudorandom number generator,testing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要