Optimal 2D Data Partitioning for DMA Transfers on MPSoCs

Digital System Design(2012)

引用 5|浏览0
暂无评分
摘要
Reducing the effects of off-chip memory access latency is a key factor in exploiting efficiently embedded multicore platforms. We consider architectures that admit a multi-core computation fabric, having its own fast and small memory to which the data blocks to be processed are fetched from external memory using a DMA (direct memory access) engine, employing a double- or multiple-buffering scheme to avoid processor idling. In this paper we focus on application programs that process two dimensional data arrays and we determine automatically the size and shape of the portions of the data array which are subject to a single DMA call, based on hardware and applications parameters. When the computation on different array elements are completely independent, the asymmetry of memory structure leads always to prefer one-dimensional horizontal pieces of memory, while when the computation of a data element shares some data with its neighbors, there is a pressure for more "square" shapes to reduce the amount of redundant data transfers. We provide an analytic model for this optimization problem and validate our results by running a mean filter application on the CELL simulator.
更多
查看译文
关键词
optimal 2d data partitioning,multiple-buffering scheme,memory structure,dma transfers,off-chip memory access latency,double buffering,double-buffering scheme,data element share,mpsoc,data element,two-dimensional data arrays,processor idling,dma engine,small memory,data partitioning,data paralleization,cell simulator,redundant data transfer,direct memory access engine,system-on-chip,data block,multiprocessing systems,optimization problem,data blocks,array elements,mean filter application,external memory,file organisation,dma call,direct memory access,memory structure asymmetry,multicore computation fabric,data array,direct memory access (dma),dimensional data array,cell processor,one-dimensional horizontal memory pieces,embedded multicore platforms,system on chip
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要