Open Big Data Infrastructures To Everyone

Konstantinos Tsakalozos, Cory Johns, Kevin Monroe, Pete Vandergiessen, Andrew Mcleod, Antonio Rosales

2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA)(2016)

Abstract
The evolution of big data has increased the complexity of the respective software. Big data infrastructures require progressively more time and effort to set up, configure, maintain and integrate with existing systems. In the absence of a big data "expert", users are often discouraged from using such solutions. The option of consuming big data infrastructures as a service seems to be a viable one, yet it is not without drawbacks. Such an option a) is costly, b) often locks users in to a vendor, and c) is limited to what the vendor decides to make available. In this paper we present Juju, an open source service modelling approach by Canonical that addresses the above shortcomings. With Juju, users can deploy and maintain their infrastructures on a rich variety of target environments that include almost any cloud, local machines (using containers and VMs), bare metal systems and any remote machine the user might have SSH access to. The Juju big data community makes sure that deploying big data infrastructures is as simple as running "juju deploy hadoop", while interfaces among infrastructures allow for easy system integration. In this work we also show how the operational knowledge of complex software such as Apache Spark can be encapsulated in a few hundred lines.
Keywords
Big data infrastructure,User,Option