It Was Easy, when Apples and Blackberries Were only Fruits

CLEF (Notebook Papers/LABs/Workshops)(2010)

引用 51|浏览23
暂无评分
摘要
Ambiguities in company names are omnipresent. This is not accidental, companies deliberately chose ambiguous brand names, as part of their marketing and branding strategy. This procedure leads to new challenges, when it comes to nding information about the company on the Web. This paper is concerned with the task of classifying Twitter messages, whether they are related to a given company: for example, we classify a set of twitter messages containing a keyword apple, whether a message is related to the company Apple Inc. Our technique is essen- tially an SVM classier, which uses a simple representation of relevant and irrelevant information in the form of keywords, grouped in specic \proles". We developed a simple technique to construct such classiers for previously unseen companies, where no training set is available, by training the meta-features of the classier with the help of a general test set. Our techniques show high accuracy gures over the WePS-3 dataset.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要