OpenCV Optimization on Heterogeneous Multi-core Systems for Gesture Recognition Applications

2016 45th International Conference on Parallel Processing Workshops (ICPPW)(2016)

引用 3|浏览9
暂无评分
摘要
Increasingly there are a variety of important applications for image processing on mobile phones. The performance of image processing applications thus becomes one of the focused points of research activities. OpenCV provides APIs to let programmers develop image processing programs with ease. The new OpenCV 3.0 enables OpenCL flow to aim at making the applications developed with OpenCV run fast on heterogeneous multi-core systems. Although OpenCL programs are portable, the performance still needs to be tuned for different architecture models. In this paper, we demonstrate the optimization flow for Gesture Recognition Applications with OpenCV 3.0 on Mali GPUs. In this case study, several optimization techniques are devised for the flow. The techniques include vectorization, the increase of vector width via layout transformation, kernel fusion, etc. Preliminary experimental results show that our scheme is effective to optimize OpenCV 3.0 flow for Gesture Recognition Applications on embedded heterogeneous multi-core systems.
更多
查看译文
关键词
OpenCV,OpenCL,heterogeneous,multi-core,Mali GPUs,Gesture Recognition,vectorization,vector,kernel fusion
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要