Search in Archival Facsimile Documents for Digital History

2023 IEEE 19th International Conference on e-Science (e-Science)(2023)

Cited 0|Views13
No score
Abstract
Recent advances in text digitization and processing have opened up many possibilities for historical archives to be processed and digitized in an efficient and automated manner. Processing steps, also involving language detection, optical character recognition (OCR), named entity recognition (NER), recognition error detection, and automated or manual correction can result in digitized archives providing both high-quality facsimile representations of original document scans and extracted text metadata close to the original text in a machine-friendly format. Exploration of digitally enhanced archives is an important step forward in the future workflow of archivists and historians alike. After analysing the requirements of these users, we propose a concept for dynamically generating retrieval-relevant facsimile image snippets. This work demonstrates a Human-in-the-Loop retrieval and research workflow based on these methods by providing a search user interface prototype geared towards intuitively exploring topics across a multilingual historical facsimile archive corpus.
More
Translated text
Key words
digital history,archival document,information retrieval,user interface
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined