A VLIW vector media coprocessor with cascaded SIMD ALUs

IEEE Transactions on Very Large Scale Integration (VLSI) Systems(2009)

引用 10|浏览0
暂无评分
摘要
High-definition video applications, such as digital TV and digital video cameras, require high processing performance for high-quality visual images in addition to a complex video CODEC. Pre-/postprocessing to improve video quality is becoming much more important because requirements for pre-/postprocessing vary among applications and processing algorithms have not been stabilized. Therefore, a new processor architecture that has a highly parallel datapath is needed. In this paper, we introduce a VLIW vector media coprocessor, "vector coprocessor (VCP)," that includes three asymmetric execution pipelines with cascaded SIMD ALUs. To improve performance efficiency, we reduce the area ratio of the control circuit while increasing the ratio of the arithmetic circuit. The total gate count of VCP is 1268 kgates and its maximum operating frequency is 300 MHz at 90-nm CMOS process. Some of the processing kernels in an adaptive prefilter that is applied to preprocessing for video encoding are evaluated. In the case of the edgeness and the sum of absolute differences, the performance is 183 giga operations per second. VCP offers enough performance for HD video processing and good cost-performance while all processing pipeline units operate effectively.
更多
查看译文
关键词
digital video camera,video encoding,processing pipeline unit,processing algorithm,processing kernel,hd video processing,video quality,high processing performance,vliw vector media coprocessor,complex video,cascaded simd alus,high-definition video application,video processing,vliw,instruction sets,processor architecture,coprocessors,digital television,alu,arithmetic,pipelines,circuits,digital tv,very long instruction word
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要