Corpus-based learning of compound noun indexing

RANLPIR '00: Proceedings of the ACL-2000 workshop on Recent advances in natural language processing and information retrieval: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 11(2000)

引用 13|浏览0
暂无评分
摘要
In this paper, we present a corpus-based learning method that can index diverse types of compound nouns using rules automatically extracted from a large tagged corpus. We develop an efficient way of extracting the compound noun indexing rules automatically and perform extensive experiments to evaluate our indexing rules. The automatic learning method shows about the same performance compared with the manual linguistic approach but is more portable and requires no human efforts. We also evaluate the seven different filtering methods based on both the effectiveness and the efficiency, and present a new method to solve the problems of compound noun over-generation and data sparseness in statistical compound noun processing.
更多
查看译文
关键词
automatic learning method,indexing rule,data sparseness,corpus-based learning method,compound noun,compound noun over-generation,corpus-based learning,new method,compound noun indexing rule,extensive experiment,statistical compound noun processing,noun,indexation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要