A scalable pattern spotting system for historical documents.

Pattern Recognition(2016)

引用 26|浏览10
暂无评分
摘要
Information retrieval in historical documents has long consisted in spotting words. In this paper, we focus on graphical pattern spotting. Contrary to object detection and classification, where models of the object of interest may be trained, pattern spotting does not rely on any prior information regarding the query, nor predefined class of graphical objects. An offline sliding window approach may be suitable, provided that the challenge raised by high computational and storage costs is handled. We propose an unsupervised, segmentation-free approach that takes advantage of recent developments in computer vision to overcome these issues. We also investigate the use of new, compact descriptors for the data, namely the vectors of locally aggregated descriptors (VLAD) and Fisher Vectors, instead of the usual bag-of-visual-words approach. Results obtained on medieval manuscripts from the DocExplore project show that our approach achieves better retrieval results, with a better efficiency in terms of time/memory, compared to standard approaches. Experimentations show that VLAD and Fisher Vectors can be fruitfully used in the future for the description of historical documents. Additionally, we show that our system can be easily turned into a word spotting system with slight adaptation, and that it achieves results comparable to those recently published in ICDAR 2015 keyword spotting challenge. HighlightsA new segmentation-free method for spotting patterns in document images is proposed.Our method is scalable thanks to the offline indexing and compact representation.We provide a unified benchmarking comparing our method with state of the art models.Experiments are provided on a dataset of medieval manuscript document images.
更多
查看译文
关键词
Pattern spotting,Word spotting,Image retrieval,Bag of visual words,VLAD,Fisher Vector,Product quantization,Historical document image analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要