Competitiveness of a Non-Linear Block-Space GPU Thread Map for Simplex Domains.

IEEE Transactions on Parallel and Distributed Systems(2018)

引用 11|浏览39
暂无评分
摘要
This work presents and studies the efficiency problem of mapping GPU threads onto simplex domains. A non-linear map $\lambda (\omega)$ is formulated based on a block-space enumeration principle that reduces the number of thread-blocks by a factor of approximately $2\times$ and $6\times$ for 2-simplex and 3-simplex domains, respectively, when compared to the standard approach. Performance resul...
更多
查看译文
关键词
Graphics processing units,Instruction sets,Symmetric matrices,Computer architecture,Optimization,Programming
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要