Automatic Horizontal Fusion for GPU Kernels

2022 IEEE/ACM International Symposium on Code Generation and Optimization (CGO), 2022

Abstract
We present automatic horizontal fusion, a novel optimization technique that complements standard kernel fusion techniques for GPU programs. Unlike standard fusion, whose goal is to eliminate intermediate data round trips, our horizontal fusion technique aims to increase thread-level parallelism in order to hide instruction latencies. We also present HFUSE, a new source-to-source CUDA compiler t...
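To make the idea of horizontal fusion concrete, the following is a minimal sketch in plain C++ that emulates on the CPU how two independent kernels might share one thread block by partitioning the thread index space. It is an illustration under stated assumptions, not HFUSE's actual output: the names `scale_body`, `offset_body`, `fused_body`, and `launch_fused`, the two toy kernels, and the partitioning scheme (threads `[0, d1)` run the first kernel, threads `[d1, d1 + d2)` run the second) are all hypothetical. The sequential loop stands in for a kernel launch and shows only the index bookkeeping; on a real GPU the two thread groups would be resident together, which is what would give the scheduler more warps to interleave while some wait on long-latency instructions.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical kernel body 1: scale each element of `a` by `k`.
// `tid` plays the role of threadIdx.x in a CUDA kernel.
void scale_body(std::size_t tid, std::vector<float>& a, float k) {
    if (tid < a.size()) a[tid] *= k;
}

// Hypothetical kernel body 2: add `c` to each element of `b`.
void offset_body(std::size_t tid, std::vector<int>& b, int c) {
    if (tid < b.size()) b[tid] += c;
}

// Horizontally fused body (assumed scheme): the first d1 threads execute
// kernel 1, and the next d2 threads execute kernel 2 with a rebased index.
void fused_body(std::size_t tid, std::size_t d1, std::size_t d2,
                std::vector<float>& a, float k,
                std::vector<int>& b, int c) {
    if (tid < d1) {
        scale_body(tid, a, k);
    } else if (tid < d1 + d2) {
        offset_body(tid - d1, b, c);
    }
}

// CPU stand-in for launching a single block of d1 + d2 threads.
void launch_fused(std::size_t d1, std::size_t d2,
                  std::vector<float>& a, float k,
                  std::vector<int>& b, int c) {
    // Each partition needs at least one thread per element it processes.
    assert(d1 >= a.size() && d2 >= b.size());
    for (std::size_t tid = 0; tid < d1 + d2; ++tid) {
        fused_body(tid, d1, d2, a, k, b, c);
    }
}
```

Calling `launch_fused(4, 2, a, 2.0f, b, 5)` with a three-element `a` and a two-element `b` would apply both operations in one emulated launch. The two kernels touch disjoint data here, which is why no synchronization between the partitions is needed in this sketch.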
Keywords
Deep learning, Codes, Fuses, Graphics processing units, Parallel processing, Hardware, Cryptocurrency