Memory partitioning and scheduling co-optimization in behavioral synthesis

ICCAD(2012)

引用 37|浏览67
暂无评分
摘要
Achieving optimal throughput by extracting parallelism in behavioral synthesis often exaggerates memory bottleneck issues. Data partitioning is an important technique for increasing memory bandwidth by scheduling multiple simultaneous memory accesses to different memory banks. In this paper we present a vertical memory partitioning and scheduling algorithm that can generate a valid partition scheme for arbitrary affine memory inputs. It does this by arranging non-conflicting memory accesses across the border of loop iterations. A mixed memory partitioning and scheduling algorithm is also proposed to combine the advantages of the vertical and other state-of-art algorithms. A set of theorems is provided as criteria for selecting a valid partitioning scheme. This is followed by an optimal and scalable memory scheduling algorithm. By utilizing the property of constant strides between memory addresses in successive loop iterations, an address translation optimization technique for an arbitrary partition factor is proposed to improve performance, area and energy efficiency. Experimental results show that on a set of real-world medical image processing kernels, the proposed mixed algorithm with address translation optimization can gain speed-up, area reduction and power savings of 15.8%, 36% and 32.4% respectively, compared to the state-of-art memory partitioning algorithm.
更多
查看译文
关键词
memory s ch eduling,processor scheduling,parallel processing,behavioral synthesis,memory address translation optimization,address translation optimization technique,scalable memory scheduling algorithm,loop iterations,medical image processing kernels,optimal throughput,state-of-art memory,data partitioning,non-conflicting memory access,scheduling co-optimization,multiple simultaneous memory access,memory partitioning,storage allocation,performance improvement,speed-up,arbitrary partition factor,different memory bank,memory address,arbitrary affine memory input,power savings,mixed memory partitioning,area improvement,optimal-and-scalable memory scheduling algorithm,exaggerates memory bottleneck issue,memory bandwidth,memory banks,nonconflicting memory accesses,vertical memory partitioning-and-scheduling algorithm cooptimization,affine memory inputs,parallel memories,constant stride property,energy efficiency improvement,iterative methods,mixed memory partitioning-and-scheduling algorithm,area reduction,memory access scheduling,scheduling algorithms,schedules,kernel,memory management
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要