A Method Of Collecting Four Character Medicine Effect Phrases In Tcm Patents Based On Semi-Supervised Learning
COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS (CISIS 2019)(2020)
摘要
As a result of historical reasons and writing habits, the effects of medicine in Traditional Chinese Medicine (TCM) patents are often described using four character phrases. These four character phrases are not easily identified by the Chinese word segmentation system, thus greatly affects the results of patent analysis and mining. This paper proposes a semi-supervised learning method to collect four character effect phrases from the abstracts texts of TCM patents, which can help enrich the lexicon of Chinese word segmentation system, and also provide support for semantic patent retrieval and analysis. The experimental results show the validity of the method.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络