Scalable Adaptive Label Propagation In Grappa

2015 IEEE International Conference on Big Data (Big Data)(2015)

引用 3|浏览66
暂无评分
摘要
Nodes of a social graph often represent entities with specific labels, denoting properties such as age-group or gender. Design of algorithms to assign labels to unlabeled nodes by leveraging node-proximity and a-priori labels of seed nodes is of significant interest. A semi-supervised approach to solve this problem is termed "LPA-Label Propagation Algorithm" where labels of a subset of nodes are iteratively propagated through the network to infer yet unknown node labels. While LPA for node labelling is extremely fast and simple, it works well only with an assumption of node-homophily - connected nodes are connected because they must deserve a similar label - which can often be a misnomer. In this paper we propose a novel algorithm "Adaptive Label Propagation" that dynamically adapts to the underlying characteristics of homophily, heterophily, or otherwise, of the connections of the network, and applies suitable label propagation strategies accordingly. Moreover, our adaptive label propagation approach is scalable as demonstrated by its implementation in Grappa, a distributed shared-memory system. Our experiments on social graphs from Facebook, YouTube, Live Journal, Orkut and Netlog demonstrate that our approach not only improves the labelling accuracy but also computes results for millions of users within a few seconds.
更多
查看译文
关键词
scalable adaptive label propagation,Grappa,social graph,node-proximity,seed nodes,semisupervised approach,LPA-label propagation algorithm,node-homophily,adaptive label propagation,distributed shared-memory system,social graphs,Facebook,YouTube,Live Journal,Orkut,Netlog
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要