Parallel clustering of big data of spatio-temporal trajectory

2015 11TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC)(2015)

Abstract
In general, the computational efficiency of many spatial data analysis algorithms declines sharply as data size increases. Introducing a distributed parallel computing model is therefore a meaningful way to extend spatial data analysis methods and improve their computational efficiency. Considering the characteristics of spatio-temporal trajectory data, which are massive, time-dependent, and dynamic, we propose a fast method for computing trajectory similarity based on coarse-grained Dynamic Time Warping. The algorithm greatly reduces computation time when the trajectory sequences are very long. We also propose a parallel trajectory clustering strategy for big data under the Hadoop MapReduce model. The trajectory data are sliced, and the trajectory similarity computation and the iterative computation of cluster centers are handled by multiple worker nodes simultaneously. Experiments with the parallel trajectory clustering, which is based on the open-source project Mahout and was run on vehicle trajectory data, show that the clustering results are valid. The computing performance of the parallel clustering improves markedly as the trajectory data size increases, and the new parallel clustering method outperforms traditional algorithms such as k-means.
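The abstract does not spell out how the coarse-grained Dynamic Time Warping is constructed, so the following is only a minimal sketch under stated assumptions: each trajectory is a list of (x, y) points, coarsening is done by averaging fixed-size blocks of consecutive points, and the point cost is Euclidean distance. The names `coarsen`, `dtw`, `coarse_dtw`, and the `window` parameter are hypothetical and are not taken from the paper. The sketch illustrates why coarsening helps: classic DTW costs time proportional to the product of the two sequence lengths, so shrinking each sequence by a factor of `window` cuts the work by roughly `window` squared.

```python
import math


def coarsen(traj, window):
    """Average consecutive points in blocks of `window` points.

    Assumption: block averaging is used as the coarsening rule; the paper
    may use a different one.
    """
    out = []
    for i in range(0, len(traj), window):
        block = traj[i:i + window]
        mean_x = sum(p[0] for p in block) / len(block)
        mean_y = sum(p[1] for p in block) / len(block)
        out.append((mean_x, mean_y))
    return out


def dtw(a, b):
    """Classic DTW distance with Euclidean point cost, O(len(a) * len(b))."""
    inf = math.inf
    # prev[j] holds the best cost of aligning the processed prefix of a with b[:j].
    prev = [inf] * (len(b) + 1)
    prev[0] = 0.0
    for i in range(1, len(a) + 1):
        curr = [inf] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            cost = math.dist(a[i - 1], b[j - 1])
            curr[j] = cost + min(prev[j], curr[j - 1], prev[j - 1])
        prev = curr
    return prev[len(b)]


def coarse_dtw(a, b, window=4):
    """Hypothetical coarse-grained variant: run DTW on block-averaged sequences."""
    return dtw(coarsen(a, window), coarsen(b, window))
```

In a MapReduce-style clustering job, a function like `coarse_dtw` would plausibly serve as the distance between a trajectory and each cluster center inside the map step, though the abstract does not confirm that arrangement.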
Keywords
MapReduce, parallel computing, trajectory clustering, trajectory similarity