Multilingual XML-Based Named Entity Recognition for E-Retail Domains.

LREC (2002)

Abstract
We describe the multilingual Named Entity Recognition and Classification (NERC) subpart of an e-retail product comparison system which is currently under development as part of the EU-funded project CROSSMARC. The system must be rapidly extensible, both to new languages and new domains. To achieve this aim we use XML as our common exchange format, and the monolingual NERC components use a combination of rule-based and machine-learning techniques. It has been challenging to process web pages...
Keywords
machine learning, web pages, rule based