Online Discriminative Spam Filter Training

CEAS(2006)

引用 106|浏览33
暂无评分
摘要
We describe a very simple technique for discriminatively training a spam filter. Our results on the TREC Enron spam corpus would have been the best for the Ham at .1% measure, and second best by the 1-ROCA measure. For the Mr. X corpus, our 1-ROCA measure was a close second best, and third best by the Ham at .1% measure. We use a very simple feature extractor (all words in the subject and head- ers). Our learning algorithm is also very simple: gradient descent of a logistic regression model.
更多
查看译文
关键词
logistic regression model,gradient descent
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要