Towards A Passages Extraction Method For Arabic Question Answering Systems

ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2019): VOL 1 - ADVANCED INTELLIGENT SYSTEMS FOR EDUCATION AND INTELLIGENT LEARNING SYSTEM(2020)

Abstract
Question Answering Systems (QASs) aim to provide precise answers to questions written in natural language. Passage extraction is a challenging task that directly affects the performance of any QAS. In this paper, we propose a passage extraction method for Arabic Question Answering Systems. It consists of two steps: (1) formulating the query from the user's Arabic question and (2) extracting candidate passages that most probably contain the correct answer. First, we describe query formulation, which uses stemmed words and a POS-tagging process. Then, we identify relevant passages from Arabic Wikipedia using two levels of Information Retrieval (IR). In the first level, we extract relevant documents from Arabic Wikipedia based on both document titles and the Named Entities (NEs) contained in the formulated query. The second IR level extracts candidate passages from the pages retrieved in the first level, based on their similarity to the query. This reduces the number of extracted passages and keeps only the N top-ranked ones. The preliminary results are promising, as they show a high level of similarity between a given question and the candidate passages.
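
The two IR levels described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the similarity measure, so bag-of-words cosine similarity is assumed here, and the stemming and POS-tagging steps are replaced by a plain whitespace tokenizer. The function names (`select_documents`, `rank_passages`), the `top_n` parameter, and the toy English documents standing in for Arabic Wikipedia pages are all hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    # Placeholder tokenizer: lowercase whitespace split.
    # Assumption: the real method would stem and POS-tag Arabic text instead.
    return text.lower().split()


def cosine(a, b):
    # Cosine similarity between two bag-of-words Counters (assumed measure).
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_documents(query_entities, documents):
    # Level 1: keep documents whose title mentions a named entity of the query.
    wanted = {e.lower() for e in query_entities}
    return [d for d in documents if wanted & set(tokenize(d["title"]))]


def rank_passages(query, documents, top_n=2):
    # Level 2: score each passage of the selected documents against the query
    # and keep the top_n highest-scoring ones.
    q = Counter(tokenize(query))
    scored = [
        (cosine(q, Counter(tokenize(p))), p)
        for d in documents
        for p in d["passages"]
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_n]


# Toy data standing in for Wikipedia pages already split into passages.
documents = [
    {"title": "Cairo", "passages": ["cairo is the capital of egypt",
                                    "cairo lies on the nile river"]},
    {"title": "Paris", "passages": ["paris is the capital of france"]},
]
selected = select_documents(["Cairo"], documents)
top = rank_passages("what is the capital of egypt", selected, top_n=1)
```

Filtering by title first keeps the second, more expensive similarity pass restricted to a small set of pages, which is the motivation the abstract gives for using two levels.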
Keywords
Arabic question answering system, Passages extraction, Information retrieval, Natural language processing, Wikipedia, POS tagger, Named entity