Stability Evaluation of Event Detection Techniques for Twitter.

ADVANCES IN INTELLIGENT DATA ANALYSIS XV (2016)

Abstract
Twitter continues to gain popularity as a source of up-to-date news and information. As a result, numerous event detection techniques have been proposed to cope with the steadily increasing rate and volume of social media data streams. Although most of these works conduct some evaluation of the proposed technique, comparing their effectiveness is a challenging task. In this paper, we examine the challenges of reproducing evaluation results for event detection techniques. We apply several event detection techniques and vary four parameters, namely time window (15 vs. 30 vs. 60 mins), stopwords (include vs. exclude), retweets (include vs. exclude), and the number of terms that define an event (1 to 5 terms). Our experiments use real-world Twitter streaming data and show that varying these parameters alone significantly influences the outcomes of the event detection techniques, sometimes in unforeseen ways. We conclude that even minor variations in event detection techniques may lead to major difficulties in reproducing experiments.
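
To give a sense of the size of the parameter space described above, the following is a minimal sketch that enumerates the four parameters with the values listed in the abstract. The variable names, the dictionary keys, and the choice to take the full cross product are assumptions made for illustration; the abstract does not state whether every combination was evaluated.

```python
from itertools import product

# Parameter values as listed in the abstract; all names here are hypothetical.
TIME_WINDOWS_MIN = (15, 30, 60)       # time window length in minutes
INCLUDE_STOPWORDS = (True, False)     # include vs. exclude stopwords
INCLUDE_RETWEETS = (True, False)      # include vs. exclude retweets
TERMS_PER_EVENT = (1, 2, 3, 4, 5)     # number of terms that define an event


def parameter_grid():
    """Return every combination of the four parameters as a list of dicts.

    Assumption: a full cross product of the parameter values.
    """
    return [
        {
            "window_min": window,
            "include_stopwords": stopwords,
            "include_retweets": retweets,
            "terms_per_event": terms,
        }
        for window, stopwords, retweets, terms in product(
            TIME_WINDOWS_MIN, INCLUDE_STOPWORDS, INCLUDE_RETWEETS, TERMS_PER_EVENT
        )
    ]


grid = parameter_grid()
print(len(grid))  # 3 * 2 * 2 * 5 = 60 configurations per technique
```

Under this assumption, each event detection technique would be run on 60 configurations, which illustrates how quickly small parameter choices multiply when trying to reproduce an evaluation.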
Keywords
Event Detection Techniques, Social Media Data Streams, Retweets, Baseline Technique, Dashed Frame