A Stream Processing Framework for On-Line Optimization of Performance and Energy Efficiency on Heterogeneous Systems

Parallel & Distributed Processing Symposium Workshops(2014)

引用 4|浏览8
暂无评分
摘要
Modern processors have the potential of executing compute-intensive programs quickly and efficiently, but require applications to be adapted to their ever increasing parallelism. Here, heterogeneous systems add complexity by combining processing units with different characteristics. Scheduling should thus consider the performance of each processor as well as competing workloads and varying inputs. To assist programmers of stream processing applications in facing this challenge we present libHawaii, an open source library for cooperatively using all processors of heterogeneous systems easily and efficiently. It supports exploiting data flow, data element and task parallelism via pipelining, partitioning and demand-based allocation of consecutive work items. Scheduling is automatically adapted on-line to continuously optimize performance and energy efficiency. Our C++ library does not depend on specific hardware architectures or parallel computing frameworks. However, it facilitates maximizing the throughput of compatible GPUs by overlapping computations and memory transfers while maintaining low latencies. This paper describes the algorithms and implementation of libHawaii and demonstrates its usage on existing applications. We experimentally evaluate our library using two examples: General matrix multiplication (GEMM) is a simple yet important building block of many high-performance computing applications. Complementarily, the detection, extraction and matching of sparse image features exhibits greater complexity, including indeterministic memory access and synchronization.
更多
查看译文
关键词
parallel programming,power aware computing,processor scheduling,public domain software,software libraries,C++ library,GEMM,GPU,compute-intensive programs,data element,data flow,data partitioning,demand-based allocation,general matrix multiplication,heterogeneous systems,high-performance computing applications,indeterministic memory access,libHawaii,low-latency maintenance,memory transfers,online energy efficiency optimization,online performance optimization,open source library,overlapping computations,pipelining,processing units,processor performance,processor scheduling,processor workloads,sparse image feature detection,sparse image feature extraction,sparse image feature matching,stream processing framework,synchronization,task parallelism,varying processor inputs,energy efficiency,heterogeneous computing,load balancing,parallel programming,real-time,stream processing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要