Subword Retrieval on Biomedical Documents

FLAIRS Conference(2003)

引用 23|浏览12
暂无评分
摘要
Document retrieval in languages with a rich and com- plex morphology - particularly in terms of derivation and (single-word) composition - suffers from serious perfor- mance degradation with the stemming-only query-term-to- text-word matching paradigm. We propose an alternative ap- proach in which morphologically complex word forms are segmented into relevant subwords (such as stems, prefixes, suffixes), and subwords constitute the basic unit for index- ing and retrieval. We evaluate our approach on a biomedical document collection.
更多
查看译文
关键词
document retrieval,indexation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要