A visual analytics approach to author name disambiguation.

Proceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies(2016)

引用 1|浏览42
暂无评分
摘要
Academic publication archives often draw from numerous, heterogeneous sources, whose records can follow differing naming conventions. As such, ambiguity issues concerning authorship of scientific papers often arise, such as authors sharing similar names, the use of first names versus initials, or alternate name spellings for the same author. These ambiguities have plagued research on scientific collaboration and influence. Detecting and correcting these errors is important for maintaining the archive, as well as for ensuring correctness and reliability in any desired subsequent analysis. There are existing analytic methods designed to accomplish this with varying degrees of accuracy, but many of them require fine tuning or manual categorization. We have developed a visual analytics system to interactively control and apply several analytic name disambiguation algorithms in a finely controlled manner, and to present the results to the user for verification or correction. We demonstrate the efficacy of our system by using it to find and resolve ambiguities in authorship data collected from Cornell University Library's arXiv.org and the InfoVis 2004 contest dataset with improved accuracy and speed over existing approaches.
更多
查看译文
关键词
Visual analytics, bibliographies, coauthor graphs, name ambiguity
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要