Performance analysis of hybrid deep learning framework using a vision transformer and convolutional neural network for handwritten digit recognition

METHODSX(2024)

引用 0|浏览3
暂无评分
摘要
Digitization created a demand for highly efficient handwritten document recognition systems. A handwritten document consists of digits, text, symbols, diagrams, etc. Digits are an essential element of handwritten documents. Accurate recognition of handwritten digits is vital for effective communication and data analysis. Various researchers have attempted to address this issue with modern convolutional neural network (CNN) techniques. Even after training, CNN filter weights remain unchanged despite the high identification accuracy. As a result, the process cannot flexibly adapt to input changes. Hence computer vision researchers have recently become interested in Vision Transformers (ViTs) and Multilayer Perceptrons (MLPs). The shortcomings of CNNs gave rise to a hybrid model revolution that combines the best elements of the two fields. This paper analyzes how the hybrid convolutional ViT model affects the ability to recognize handwritten digits. Also, the real-time data contains noise, distortions, and varying writing styles. Hence, cleaned and uncleaned handwritten digit images are used for evaluation in this paper. The accuracy of the proposed method is compared with the state-of-the-art techniques, and the result shows that the proposed model achieves the highest recognition accuracy. Also, the probable solutions for recognizing other aspects of handwritten documents are discussed in this paper. center dot Analyzed the effect of convolutional vision transformer on cleaned and real-time handwritten digit images. center dot The model's performance improved with the implication of cross -validation and hyperparameter tuning. center dot The results show that the proposed model is robust, feasible, and effective on cleaned and uncleaned handwritten digits.
更多
查看译文
关键词
Convolutional Neural Network,Vision Transformer,Handwritten Digit Recognition,Machine Learning,Computer Vision
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要