Search in Archival Facsimile Documents for Digital History

2023 IEEE 19th International Conference on e-Science (e-Science)(2023)

引用 0|浏览6
暂无评分
摘要
Recent advances in text digitization and processing have opened up many possibilities for historical archives to be processed and digitized in an efficient and automated manner. Processing steps, also involving language detection, optical character recognition (OCR), named entity recognition (NER), recognition error detection, and automated or manual correction can result in digitized archives providing both high-quality facsimile representations of original document scans and extracted text metadata close to the original text in a machine-friendly format. Exploration of digitally enhanced archives is an important step forward in the future workflow of archivists and historians alike. After analysing the requirements of these users, we propose a concept for dynamically generating retrieval-relevant facsimile image snippets. This work demonstrates a Human-in-the-Loop retrieval and research workflow based on these methods by providing a search user interface prototype geared towards intuitively exploring topics across a multilingual historical facsimile archive corpus.
更多
查看译文
关键词
digital history,archival document,information retrieval,user interface
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要