Parallel Acceleration of SAM Algorithm and Performance Analysis

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (2013)

Abstract
Advances in sensor and computer technology are revolutionizing the way that remote sensing data with hundreds or even thousands of channels for the same area of the Earth's surface are collected, managed, and analyzed. In this paper, the classical Spectral Angle Mapper (SAM) algorithm, which is well suited to parallel and distributed computing, is implemented on graphics processing units (GPUs) and on a distributed cluster, respectively, to accelerate the computation. A quantitative performance comparison between the Compute Unified Device Architecture (CUDA) and MATLAB platforms is given by analyzing the results of implementing the same SAM algorithm on different parallel architectures. With regard to GPU-specific properties, this paper studies the balance between the resources required by each thread and the number of active blocks, as well as the impact of computational complexity on speedup. In addition, page-locked memory and streams are introduced so that the CPU and GPU work collaboratively. Moreover, we improve the SAM algorithm by using several training samples instead of a single one. Experimental results on hyperspectral data show that the recognition result of the improved SAM algorithm is better than that obtained with only a single spectrum. The GPU parallel implementation also achieves a higher speedup than its multithreaded CPU counterpart, and the asynchronous transfer function of CUDA effectively hides the data transmission latency, thus significantly improving the devices' resource occupancy.
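To make the core computation concrete, the following is a minimal sketch of SAM in plain Python. SAM scores a pixel by the angle between its spectrum and a reference spectrum, both treated as vectors; a smaller angle means a closer match. The abstract does not say how the several training samples are combined, so taking the minimum angle over a class's samples is an assumption made here for illustration, as are the names `spectral_angle` and `classify_pixel`, the `threshold` parameter, and the handling of all-zero spectra. This is not the paper's CUDA or MATLAB implementation.

```python
import math


def spectral_angle(pixel, reference):
    """Return the angle in radians between two spectra treated as vectors."""
    dot = sum(p * r for p, r in zip(pixel, reference))
    norm_p = math.sqrt(sum(p * p for p in pixel))
    norm_r = math.sqrt(sum(r * r for r in reference))
    if norm_p == 0.0 or norm_r == 0.0:
        # Assumption: an all-zero spectrum is treated as maximally dissimilar.
        return math.pi / 2
    # Clamp to [-1, 1] to guard against floating-point rounding before acos.
    cos_theta = max(-1.0, min(1.0, dot / (norm_p * norm_r)))
    return math.acos(cos_theta)


def classify_pixel(pixel, training, threshold=0.1):
    """Classify one pixel against several training spectra per class.

    training maps a class label to a list of sample spectra. The per-class
    score is the minimum angle over that class's samples (an assumption; the
    abstract does not specify the combination rule). Returns (label, angle),
    with label None when the best angle exceeds the hypothetical threshold.
    """
    best_label, best_angle = None, float("inf")
    for label, samples in training.items():
        angle = min(spectral_angle(pixel, s) for s in samples)
        if angle < best_angle:
            best_label, best_angle = label, angle
    if best_angle > threshold:
        return None, best_angle
    return best_label, best_angle
```

Each pixel is scored independently of every other pixel, which is why the computation maps naturally onto one GPU thread per pixel or onto a partition of the image across cluster nodes.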
Keywords
earth surface, geophysical techniques, remote sensing, geophysics computing, spectral angle mapper, gpu, sam, high-performance computing, distributed cluster, sensor technology, parallel architectures, graphics processing units, remote sensing data, cuda platform, page-locked memory, sam algorithm parallel acceleration, mathematics computing, computational complexity, hyperspectral data, asynchronous transfer mode, computer technology, compute unified device architecture, distributed computing, matlab platform, data transmission latency, parallel computing, performance analysis, graphic processing units, cuda asynchronous transfer function, high performance computing, computer architecture, instruction sets, acceleration