Accelerating Scientific Algorithms In Array Databases With Gpus

2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA)(2017)

引用 0|浏览13
暂无评分
摘要
In data science, interactive data analysis allows to very efficiently interpret what information is in the data. However, with increasing amounts of data available, the data crunching and data presentation becomes a more complex and resource-demanding task. Array databases allow to mitigate this difficulty by distributing the workload on a cluster. But there are two major challenges. Firstly, they rely on CPUs to process the data. Secondly, it is difficult to represent complex scientific algorithms in terms of native database operations. A way to integrate GPU accelerated algorithms is needed. In this paper, we study and test how to run scientific algorithms on GPUs entirely as native array database operators. We present the implementation of a specific scientific algorithm as exemplary use case. The computation of the Differential Emission Measure (DEM) calculates the distribution of plasma density in the solar corona at specific temperatures. DEM uses image series of the NASA spacecraft Solar Dynamic Observatory (SDO). We stack these images in a 3-dimensional array and process the data on GPUs, distributed in an array database cluster. For our algorithm, we measure a decrease of the overall runtime by a factor of 10700. We discuss different strategies used to minimize the overhead of GPUs and parameters used to scale the array database to a cluster.
更多
查看译文
关键词
AIA, Array Database, CUDA, DEM, GPU, User Defined Operator, SciDB
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要