Mining The Whole Set Of Person Names From The Tibetan Web
2009 2nd IEEE International Conference on Computer Science and Information Technology, Vol. 3 (2009)
Abstract
With the rapid development of Tibetan language information and the Tibetan Web in recent years, personal information has become a main focus of researchers. However, because Web information is complex, extracting person names is difficult, especially on the Tibetan Web. This paper presents a rule-based approach, built on case-auxiliary words and a lexicon, to extract person names from the Tibetan Web. Using grammatical information and statistical rules, we have developed a person name extraction system for the Tibetan Web. We design a series of experiments to evaluate the performance of the system, and the evaluation results are satisfactory.
Keywords
person name extraction, case-auxiliary words, Tibetan language, Web