Microblogger tag auto-annotation based on collective knowledge

Information Science and Technology(2013)

引用 0|浏览14
暂无评分
摘要
In China, it is free for users to introduce themselves with tags on Sina weibo, the biggest Chinese microblogging platform. User tags provide much useful information for user retrieval and precision advertising. However, 64.2% of users in our study have not tagged themselves. This paper aims to providemicroblogger tag auto-annotation. First, we analyze the tagging behaviors of users to explore what information that is contained in the tagging. Second, based on the analyses and features of social networking, we propose two methods of user tag auto-annotation: one is Collective Filtering (CF) based method and the other is Probabilistic Latent Semantic Analysis (PLSA) based method. Specifically, the CF-based method annotates tags for one user according to the voting scheme by considering his/her neighboring users' tags. By contrast, the PLSA-based method annotates tags for one user according to the topic distribution of his/her neighboring users' tags. Experimental results demonstrate that: (1) the CF-based method outperforms the PLSA-based method at word-level; and (2) the PLSA-based method can obtain more accurate tag annotation results than the CF-based method at topic-level.
更多
查看译文
关键词
vectors,visualization,web pages,feature extraction,probabilistic logic
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要