Distributed and interactive cube exploration

Data Engineering(2014)

引用 175|浏览91
暂无评分
摘要
Interactive ad-hoc analytics over large datasets has become an increasingly popular use case. We detail the challenges encountered when building a distributed system that allows the interactive exploration of a data cube. We introduce DICE, a distributed system that uses a novel session-oriented model for data cube exploration, designed to provide the user with interactive sub-second latencies for specified accuracy levels. A novel framework is provided that combines three concepts: faceted exploration of data cubes, speculative execution of queries and query execution over subsets of data. We discuss design considerations, implementation details and optimizations of our system. Experiments demonstrate that DICE provides a sub-second interactive cube exploration experience at the billion-tuple scale that is at least 33% faster than current approaches.
更多
查看译文
关键词
data analysis,query processing,DICE system,billion-tuple scale,distributed data cube exploration,distributed system,faceted data cubes exploration,interactive ad-hoc analytics,interactive data cube exploration,session-oriented model,speculative query execution,sub-second interactive cube exploration
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要