RRecT: Chinese Text Recognition with Radical-Enhanced Recognition Transformer

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI (2023)

Abstract
Text recognition has attracted sustained attention in recent years and has reached increasingly high accuracy on English datasets. Chinese Text Recognition (CTR), however, has received comparatively little attention, and accuracy on Chinese datasets remains barely satisfactory. Because Chinese characters have complex glyph structures and the character set is large, CTR is more challenging and demands strong feature extraction capacity. In this paper, we propose a novel network for CTR named Radical-enhanced Recognition Transformer (RRecT). It first introduces a customized Recognition Transformer (RecT) to extract multi-grained features, then exploits radical decomposition as an auxiliary supervision signal and enhances the character representation with radical information through a Radical Prediction Module (RPM) and a Radical-Character Fusion Module (RCFM). The final feature therefore contains both character-level and fused radical-level information. Experimental results show that RRecT outperforms state-of-the-art methods by a margin of 1.4% on the Scene dataset and 1.8% on the Document dataset, and reaches competitive performance on the Web and Handwritten datasets. Moreover, RRecT requires much less computation, making it a lightweight and effective model.
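To make "radical decomposition as an auxiliary supervision signal" concrete, the following is a minimal sketch of how per-character radical targets could be built alongside the usual character labels. It is not the RPM or RCFM from the paper: the decomposition table `RADICALS`, the helper names, and the loss weight of 0.5 are all hypothetical and chosen only for illustration.

```python
# Hypothetical, hand-written decomposition table. A real system would load a
# full ideographic description resource covering the whole character set.
# "⿰" is the Unicode ideographic description character for left-right layout.
RADICALS = {
    "好": ["⿰", "女", "子"],
    "明": ["⿰", "日", "月"],
    "林": ["⿰", "木", "木"],
}


def radical_targets(text):
    """Return one radical sequence per character.

    Characters missing from the table fall back to themselves, so the
    auxiliary target is always defined.
    """
    return [RADICALS.get(ch, [ch]) for ch in text]


def build_vocab(sequences):
    """Map every distinct radical or structure symbol to an integer id."""
    symbols = sorted({s for seq in sequences for s in seq})
    return {s: i for i, s in enumerate(symbols)}


def joint_loss(char_loss, radical_loss, weight=0.5):
    """Combine the main and auxiliary losses (the weight is an assumption)."""
    return char_loss + weight * radical_loss


targets = radical_targets("明好")
vocab = build_vocab(targets)
total = joint_loss(1.0, 2.0)
```

In this sketch the radical vocabulary is much smaller than a Chinese character set, because many characters share components; this sharing is a common motivation for radical-level supervision.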
Keywords
Chinese text recognition, OCR, radical prior