On Multifont Character Classification in Telugu

Communications in Computer and Information Science (2011)

Abstract
A major requirement in the design of robust OCR systems is that the feature extraction scheme be invariant to the popular fonts used in print. Many statistical and structural features have been tried for character classification in the past. In this paper, motivated by recent successes in the object category recognition literature, we use a spatial extension of the histogram of oriented gradients (HOG) for character classification. Our experiments are conducted on 1,453,950 Telugu character samples in 359 classes and 15 fonts. On this data set, we obtain an accuracy of 96-98% with an SVM classifier.
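
To make the feature idea concrete, the following is a minimal sketch of one common way to add spatial layout to HOG: the character image is split into a grid of cells, a magnitude-weighted histogram of unsigned gradient orientations is computed per cell, and the per-cell histograms are concatenated and L2-normalized. The function name `spatial_hog`, the 2x2 grid, and the 8 orientation bins are illustrative assumptions; the abstract does not specify the paper's exact configuration, so this should not be read as the authors' implementation.

```python
import math


def spatial_hog(image, grid=(2, 2), bins=8):
    """Concatenated per-cell orientation histograms for a grayscale image.

    image: list of rows of numbers (hypothetical input format).
    grid:  (rows, cols) of spatial cells; an assumed default.
    bins:  number of unsigned orientation bins over [0, pi); an assumed default.
    """
    h, w = len(image), len(image[0])
    rows, cols = grid
    hist = [0.0] * (rows * cols * bins)
    for y in range(h):
        for x in range(w):
            # Central differences, clamped at the image border.
            gx = image[y][min(x + 1, w - 1)] - image[y][max(x - 1, 0)]
            gy = image[min(y + 1, h - 1)][x] - image[max(y - 1, 0)][x]
            mag = math.hypot(gx, gy)
            if mag == 0.0:
                continue
            # Unsigned orientation folded into [0, pi).
            angle = math.atan2(gy, gx) % math.pi
            b = min(int(angle / math.pi * bins), bins - 1)
            # Index of the spatial cell that contains this pixel.
            cell = (y * rows // h) * cols + (x * cols // w)
            hist[cell * bins + b] += mag
    # L2-normalize the whole descriptor.
    norm = math.sqrt(sum(v * v for v in hist))
    return [v / norm for v in hist] if norm > 0 else hist


# Usage: an 8x8 image with a single vertical edge down the middle.
edge = [[0.0] * 4 + [1.0] * 4 for _ in range(8)]
features = spatial_hog(edge)
```

Descriptors of this kind, computed for every character sample, could then be passed to an SVM classifier such as the one the abstract mentions.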
Keywords
Support Vector Machine, Support Vector Machine Classifier, Character Recognition, Character Classification, Linear Support Vector Machine