Cruncher: Distributed In-Memory Processing For Location-Based Services

2016 IEEE 32nd International Conference on Data Engineering (ICDE)(2016)

引用 14|浏览40
暂无评分
摘要
Advances in location-based services (LBS) demand high-throughput processing of both static and streaming data. Recently, many systems have been introduced to support distributed main-memory processing to maximize the query throughput. However, these systems are not optimized for spatial data processing. In this demonstration, we showcase Cruncher, a distributed main-memory spatial data warehouse and streaming system. Cruncher extends Spark with adaptive query processing techniques for spatial data. Cruncher uses dynamic batch processing to distribute the queries and the data streams over commodity hardware according to an adaptive partitioning scheme. The batching technique also groups and orders the overlapping spatial queries to enable inter-query optimization. Both the data streams and the offline data share the same partitioning strategy that allows for data co-locality optimization. Furthermore, Cruncher uses an adaptive caching strategy to maintain the frequently-used location data in main memory. Cruncher maintains operational statistics to optimize query processing, data partitioning, and caching at runtime. We demonstrate two LBS applications over Cruncher using real datasets from OpenStreetMap and two synthetic data streams. We demonstrate that Cruncher achieves order(s) of magnitude throughput improvement over Spark when processing spatial data.
更多
查看译文
关键词
Cruncher system,distributed in-memory processing,location-based services,LBS,static data,streaming data,query throughput,spatial data processing,distributed main-memory spatial data warehouse and streaming system,adaptive query processing techniques,dynamic batch processing,query distribution,adaptive caching strategy,frequently-used location data,operational statistics,OpenStreetMap
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要