Amharic Semantic Information Retrieval System

Knowledge Discovery, Knowledge Engineering and Knowledge Management(2022)

引用 0|浏览2
暂无评分
摘要
Amharic is the official language of Ethiopia, currently having a population of over 118 million. Developing effective information retrieval (IR) system for Amharic has been a challenging task due to limited resources coupled with complex morphology of the language. This paper presents the development of Amharic semantic IR system using query expansion based on deep neural learning model and WordNet. In order to optimize the retrieval result, we propose Amharic text representation using root forms of words applied for stopword identification, indexing, term matching and query expansion. Comparisons are made with the conventional stem-based text representation for information retrieval, and we show that using the root forms of words is better for both resource construction and system development. The effectiveness of the proposed Amharic semantic IR system is evaluated on Amharic Adhoc Information Retrieval Test Collection (2AIRTC).
更多
查看译文
关键词
Semantic information retrieval, Query expansion, Complex morphology, Amharic IR resources
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要