A Probabilistic Retrieval Model for Word Spotting Based on Direct Attribute Prediction

Eugen Rusakov,Leonard Rothacker,Hyunho Mo,Gernot A. Fink

2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR)（2018）

引用 9|浏览33

暂无评分

摘要

In recent years CNNs took over in various fields of computer vision. Adapted to document image analysis, they achieved state-of-the-art performance in word spotting by predicting word string embeddings. One prominent embedding splits a given string in temporal pyramidal regions of character occurrences, namely the Pyramidal Histogram of Characters (PHOC). This string embedding can be interpreted as a binary attribute representation. In this work we present a new approach for ranking retrieval lists originally proposed for zero-shot learning where attribute representations play an important role. Instead of a distance-based matching of the predicted string embedding, we compute the posterior probability of the attribute representation given a word image which can be interpreted as a posterior of the query. We can show that this probabilistic ranking improves word spotting performance, especially in the query-by-string scenario.

查看译文

关键词

deep learning,word spotting

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要