Workload diversity and dynamics in big data analytics: implications to system designers

Proceedings of the 2nd Workshop on Architectures and Systems for Big Data(2012)

引用 21|浏览56
暂无评分
摘要
The emergence of big data analytics and the need for cost/energy efficient IT infrastructure motivate a new focus on data-centric designs. In this paper, we aim to better understand the design implications of data analytics systems by quantifying workload requirements and runtime dynamics. We examine four workloads representing big data analytics trends for fast decisions, total integration, deep analysis and fresh insights: an archive store, a columnar database enhanced with table compression, an analytics engine with distributed R, and a transaction/analytics hybrid system. These appliations demonstrate diverse resource requirements both within and across workloads as well as load imbalance due to data skew. Our observations suggest several directions to design balanced data analytics systems, including tight integration of heterogeneous, active data stores, support for efficient communication and data-centric load balancing.
更多
查看译文
关键词
analytics trend,analytics system,data skew,big data analytics,balanced data,workload diversity,data analytics system,analytics hybrid system,active data store,analytics engine,system designer,big data
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要