Map-Side Merge Joins for Scalable SPARQL BGP Processing
CloudCom), 2013 IEEE 5th International Conference(2013)
摘要
In recent times, it has been widely recognized that, due to their inherent scalability, frameworks based on MapReduce are indispensable for so-called "Big Data" applications. However, for Semantic Web applications using SPARQL, there is still a demand for sophisticated MapReduce join techniques for processing basic graph patterns, which are at the core of SPARQL. Renowned for their stable and efficient performance, sort-merge joins have become widely used in DBMSs. In this paper, we demonstrate the adaptation of merge joins for SPARQL BGP processing with MapReduce. Our technique supports both n-way joins and sequences of join operations by applying merge joins within the map phase of MapReduce while the reduce phase is only used to fulfill the preconditions of a subsequent join iteration. Our experiments with the LUBM benchmark show an average performance benefit between 15% and 48% compared to other MapReduce based approaches while at the same time scaling linearly with the RDF dataset size.
更多查看译文
关键词
average performance benefit,lubm benchmark,sparql bgp processing,map-side merge joins,basic graph pattern,big data,map phase,efficient performance,sophisticated mapreduce,rdf dataset size,scalable sparql bgp processing,semantic web application,semantic web
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要