A New Mixed Binarization Method Used In A Real Time Application Of Automatic Business Document And Postal Mail Sorting

INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY(2013)

引用 24|浏览4
暂无评分
摘要
The binarization is applied in the first stage of segmentation process and has a very strong impact on the performances of the system of the automatic sorting of company documents and mail. We present in the beginning of this paper a complete study of the different existing binarization mechanisms that are developed to meet the needs of specific applications. These conventional approaches, present weaknesses that it is crucial to overcome and unfortunately they remain unsuitable for our real time application. The separation between the thresholding and the text zones location stages considerably increase the computation time and lead to an over-segmentation of the noise and of the paper texture on empty zones of the image. Indeed, none of the traditional methods (whether global or local) efficiently meets all the required conditions. We have managed to optimize this stage by applying a local threshold only near the text zones that can be located by the cumulated gradients method with the multi-resolution and mathematical morphology. We demonstrate the consistent performance of the proposed method on several types of business documents and mail with wide-ranging content and image quality.
更多
查看译文
关键词
Binarization, text zones location, real time processing, automatic sorting of company documents, mail
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要