CNN-Based Hindi Numeral String Recognition for Indian Postal Automation

2019 International Conference on Document Analysis and Recognition Workshops (ICDARW)(2019)

引用 2|浏览27
暂无评分
摘要
Digits/numerals in the Indian pin-code of handwritten postal documents may touch each other and hence digit string recognition is a very challenging task. In this paper, we propose a digit string recognition system for Indian postal documents written in Hindi. Unlike normal text string, in a string of digits there is no contextual information among the digits as a digit may be followed by an arbitrary digit in a string of digits. Because of this, here we propose a new architecture which is based on CNN (Convolutional Neural Network) and CTC (Connectionist Temporal Classification), without using RNN for Hindi numeral string recognition. Also to connect CNN with CTC, we transform the outputs of CNN to a two-dimension vector to meet the feeding requirement of CTC. Furthermore, we utilize dense blocks to build CNN part to extract efficient image features. Comparative studies with the state-of-the-art methods show that the proposed method outperforms the other existing methods on Hindi numeral string recognition.
更多
查看译文
关键词
Hindi Numeral String, Convolutional Neural Network, Connectionist Temporal Classification, Postal Automation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要