1 OCR is a difficult task because of some specific features o"/>

Persian OCR with Cascaded Convolutional Neural Networks Supported by Language Model

2020 10th International Conference on Computer and Knowledge Engineering (ICCKE)(2020)

引用 1|浏览1
暂无评分
摘要
Persian 1 OCR is a difficult task because of some specific features of Persian writing style, like different styles of letters in different places of the word and similarity of letters to each other. Recognizing sub-words instead of individual letters can reduce these difficulties. In this manner sub-word segmentation is the critical task of pre-process step. In this paper, a cascaded Convolutional Neural Network is utilized to convert sub-word images into text. A large dictionary of Persian sub-word images with different font styles is used as training data and an Auto-Encoder enriches the features needed for constructing the cascade classifier structure. The initial classifier learns the overall structure of sub-word images that its training data is the result of applying k-means clustering on the huge sub-word image dataset. The later classifier finds the exact text equivalent of the sub-word image. A word segmentation method forms the words based on extracted sub-words. This method use contour distances as a measure for distinguishing words from sub-words. The initial OCR result is improved using Natural Language Processing techniques. Two fast search structures in word dictionaries with the help of a language model build the post-processing module and substitute the misspelled extracted words with the best alternative. Comparison results with Tesseract OCR engine shows the superiority of the algorithm.
更多
查看译文
关键词
Persian OCR,Cascade Classifier,Convolutional Neural Networks,Auto-Encoder,Language Model
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要