Thread Warping: Dynamic and Transparent Synthesis of Thread Accelerators

ACM Trans. Design Autom. Electr. Syst. (2011)

Abstract
We introduce thread warping, a dynamic optimization technique that customizes multicore architectures to a given application by dynamically synthesizing threads into custom accelerator circuits on FPGAs (Field-Programmable Gate Arrays). Thread warping builds upon previous dynamic synthesis techniques for single-threaded applications, enabling dynamic architectural adaptation to different amounts of thread-level parallelism, while also exploiting parallelism within each thread to further improve performance. Furthermore, thread warping maintains the important separation of function from architecture, enabling portability of applications to architectures with different quantities of microprocessors and FPGAs, an advantage not shared by static compilation/synthesis approaches. We introduce an approach consisting of CAD tools and operating system support that enables thread warping on potentially any microprocessor/FPGA architecture. We evaluate thread warping using a simulator for high-performance computing systems with different interconnections in addition to multicore embedded systems having between 4 and 64 ARM11 microprocessors. On average, thread warping achieved approximately 3x speedup compared to a high-performance quad-core Intel Xeon and 109x compared to an embedded system consisting of 4 ARM11 cores, with a size cost approximately equal to 36 ARM11 cores.
Keywords
dynamic architectural adaptation, thread accelerators, thread warping, ARM11 microprocessors, different amount, ARM11 core, dynamic optimization technique, different interconnection, different quantity, transparent synthesis, dynamically synthesizing thread, thread-level parallelism, embedded systems, field-programmable gate array, reconfigurable computing, FPGA, operating system