Text Mining in Full Text Articles -- Methodical and Representation Issues

Nature Precedings (2009)

Abstract
In many cases, the information in abstracts of biomedical publications is not sufficient for annotating database entries. Text mining systems that support curators of biodatabases should therefore be able to process full text articles. Besides the technical problems arising from full text parsing, the representation of the annotated full text is an important issue. Journal articles are mostly available electronically in PDF or HTML format. Even where more easily manageable XML formats are available, readers would like to see annotations and semantic enrichment visualised directly in the PDF or HTML. We summarize the technical problems arising from parsing HTML and PDF journal full texts and show first results of visualisation in both formats.
Keywords
visualization, PDF, text mining, Full Text, text parsing, HTML, journals, parsing, ProMiner, publishing, biodatabase, biocurator