Large vector spatial data storage and query processing using clickhouse

Shuaijun Chen, Zhibao Wang,Lu Bai, Kunyi Liu, Juntao Gao, Man Zhao,Maurice D. Mulvenna

39TH INTERNATIONAL SYMPOSIUM ON REMOTE SENSING OF ENVIRONMENT ISRSE-39 FROM HUMAN NEEDS TO SDGS, VOL. 48-M-1(2023)

引用 0|浏览0
暂无评分
摘要
The exponential growth of geospatial data resulting from the development of earth observation technology has created significant challenges for traditional relational databases. While NoSQL databases based on distributed file systems can handle massive data storage, they often struggle to cope with real-time query. Column-storage databases, on other hand, are highly effective at both storage and query processing for large-scale datasets. In this paper, we propose a spatial version of ClickHouse that leverages R-Tree indexing to enable efficient storage and real-time analysis of massive remote sensing data. ClickHouse is a column-oriented, open-source database management system designed for handling large-scale datasets. By integrating R-Tree indexing, we have created a highly efficient system for storing and querying geospatial data. To evaluate the performance of our system, we compare it with HBase, a popular distributed, NoSQL database system. Our experimental results show that ClickHouse outperforms HBase in handling spatial data queries, with a response time approximately three times faster than HBase. We attribute this performance gain to the highly efficient R-Tree indexing used in ClickHouse, which allows for fast spatial data query.
更多
查看译文
关键词
ClickHouse,vector spatial data,query processing,HBase,remote sensing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要