Extended framework for Sindhi numerals OCR using gradient orientation histograms

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS(2022)

引用 0|浏览6
暂无评分
摘要
The accuracy on MINST dataset for roman numerals is already 99.65%. However, same models showed low accuracy on Sindhi numerals. It is because Sindhi numerals have high correlation between the shapes of the numerals. In this paper, correlation based template matching is used to analyze the shape ambiguity by identifying the dominant false positives (FP) and false negatives (FN) for every numeral. Furthermore, the Gradients Histogram Orientation (GOH) features are used to improve the accuracy of existing classifiers by image-to-image matching. The classical OCR using simple binary features are not sufficient to address the problems of shape ambiguity in Sindhi numerals, i.e., the shape of digits 2, (SIC), and 3, (SIC), are very similar. The raw pixel values are used as features for the classification in the first stage. In second stage, the input image is matched with the dominant FP and FN of the predicted class, and the final decision is made by the image-to-image matching based on GOH features. Decision based on image to image matching with dominant FP and FN increase the accuracy of the classifier. Support vector machine (SVM), K-nearest neighbor, and template based matching classifiers are used. The proposed extension substantially improves the accuracy of all mentioned classifiers.
更多
查看译文
关键词
Gradient orientation histograms, SIFT, gradient based keypoint descriptors, keypoint descriptor quantization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要