GPU-UniCache: Automatic Code Generation of Spatial Blocking for Stencils on GPUs
Proceedings of the Computing Frontiers Conference, pp. 107-116, 2017.
Spatial blocking is a critical memory-access optimization to efficiently exploit the computing resources of parallel processors, such as many-core GPUs. By reusing cache-loaded data over multiple spatial iterations, spatial blocking can significantly lessen the pressure of accessing slow global memory. Stencil computations, for example, c...