A Named Entity Recognition Approach for Albanian Using Deep Learning

Complex Pattern MiningStudies in Computational Intelligence(2020)

引用 4|浏览1
暂无评分
摘要
Named Entity Recognition (NER) is an information extraction task that deals with the identification and tagging of generic named entities and/or domainspecific named entities. NER is a crucial task in semantic processing of text data, making it a key component in different Natural Language Processing applications such as Question Answering, Machine Translation, etc. In this paper we propose an approach for Named Entity Recognition based on Deep Learning models using an Albanian corpus. We focused on the generic named entities such as person's name, geographical location, name of organization/institution and other categories. Given that there is no publicly available Albanian annotated corpus, we have manually created one. Furthermore, we have built a deep neural network using LSTM cells as the hidden layers and a Conditional Random Field as the output, using both word and character tagging. Taking into consideration the complexity of theAlbanian language and the little research done in NLP for Albanian, the results achieved are promising. The results obtained from the experiments demonstrate that the NER performance can be further improved by using a larger annotated corpus to train the model.
更多
查看译文
关键词
named entity recognition approach,albanian,deep learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要