High-level synthesis of resource-shared microarchitectures from irregular complex C-code

2016 International Conference on Field-Programmable Technology (FPT)(2016)

引用 3|浏览40
暂无评分
摘要
Many high-level synthesis (HLS) tools aim at the hardware translation of input programs with relatively short loop bodies, often with a very regular control flow. However, codes from domains such as control engineering and numerical simulation often have a considerably different structure with large loop bodies holding (tens of) thousands of individual operations. Compilation of such codes not only requires the sharing of hardware operators, but also the efficient storage and forwarding of many intermediate results. Both academic as well as industrial synthesis tools have great difficulty coping with such input programs. We present Nymble-RS, a C-to-hardware compiler optimized to translate such complex programs. When evaluated on codes for the domain of convex solvers, the generated accelerators reach clock frequencies of over 200 MHz (exceeding those achieved by a state-of-the-art industrial tool by more than 3×), and offer speed-ups of up to 5× over software executing on the 800 MHz Cortex-A9 CPUs used in typical reconfigurable system-on-chips.
更多
查看译文
关键词
high-level synthesis,resource-shared microarchitectures,irregular complex C-code,HLS,hardware translation,control flow,Nymble-RS,C-to-hardware compiler,convex solvers,reconfigurable system-on-chips
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要