Collaborative Information Extraction

user-5e9d449e4c775e765d44d7c9(2020)

引用 0|浏览51
暂无评分
摘要
Embodiments relate to a system, program product, and method for information extraction and annotation of a data set. Neural models are utilized to automatically attach machine annotations to data elements within an unlabeled data set. The attached machine annotations are evaluated and a score is attached to the annotations. The score reflects a confidence of correctness of the annotations. A labeled data set is iteratively expanded with selectively evaluated annotations based on the attached score. The labeled data set is applied to an unexplored corpus to identify matching corpus data to populated instances of the labeled data set.
更多
查看译文
关键词
Information extraction,Annotation,Correctness,Natural language processing,Computer science,Artificial intelligence,Labeled data
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要