New method for the selection of binarization parameters based on noise features of historical documents

Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data (2011)

Abstract
Historical documents generally contain different kinds of degradation. Because of these degradations, applying noise removal methods during a preprocessing stage seems necessary. Since the noise present in the original document cannot be eliminated by a simple noise removal algorithm, and since it influences preprocessing results such as binarization, a noise detection function also seems necessary. In this paper we present a method for selecting the input parameters of binarization methods according to the noise type detected in the image. The tests are carried out on the benchmarking datasets used at DIBCO 2009 and H-DIBCO 2010. The results returned by the binarization methods using the noise features are promising.
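
The idea of choosing binarization parameters from detected noise can be sketched roughly as follows. This is only a minimal illustration, not the method of the paper: the two noise features, the limits, the noise-type labels, the parameter values, and the choice of a Sauvola-style local threshold are all assumptions, since the abstract does not specify any of them.

```python
import numpy as np

def estimate_noise_features(gray):
    """Return two illustrative (hypothetical) noise features for a grayscale image."""
    gray = gray.astype(np.float64)
    # Fine-grained noise: mean absolute difference between horizontal neighbours.
    speckle = float(np.mean(np.abs(np.diff(gray, axis=1))))
    # Uneven background: spread of the mean intensity across coarse blocks.
    h, w = gray.shape
    bh, bw = max(h // 4, 1), max(w // 4, 1)
    block_means = [gray[r:r + bh, c:c + bw].mean()
                   for r in range(0, h, bh) for c in range(0, w, bw)]
    return {"speckle": speckle, "background_variation": float(np.std(block_means))}

def select_parameters(features, speckle_limit=10.0, background_limit=20.0):
    """Map noise features to assumed (window, k) settings; all values are placeholders."""
    if features["background_variation"] > background_limit:
        return {"noise_type": "uneven_background", "window": 15, "k": 0.2}
    if features["speckle"] > speckle_limit:
        return {"noise_type": "speckle", "window": 31, "k": 0.5}
    return {"noise_type": "clean", "window": 25, "k": 0.34}

def sauvola_binarize(gray, window, k, r=128.0):
    """Sauvola-style threshold T = m * (1 + k * (s / r - 1)); window must be odd."""
    gray = gray.astype(np.float64)
    padded = np.pad(gray, window // 2, mode="edge")

    def window_sum(a):
        # Integral image with a leading zero row and column, then box sums.
        ii = np.pad(a, ((1, 0), (1, 0))).cumsum(0).cumsum(1)
        return (ii[window:, window:] - ii[:-window, window:]
                - ii[window:, :-window] + ii[:-window, :-window])

    n = window * window
    mean = window_sum(padded) / n
    var = np.maximum(window_sum(padded ** 2) / n - mean ** 2, 0.0)
    threshold = mean * (1.0 + k * (np.sqrt(var) / r - 1.0))
    return gray > threshold  # True marks background, False marks ink

def binarize_with_noise_features(gray):
    """Detect the noise type, pick parameters for it, then binarize."""
    params = select_parameters(estimate_noise_features(gray))
    return sauvola_binarize(gray, params["window"], params["k"]), params
```

In practice the mapping from noise type to parameters would have to be tuned on data such as the DIBCO benchmarks; the values above only show the shape of such a lookup.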
Keywords
noise detection, simple noise removal algorithm, preprocessing result, historical document, binarization method, preprocessing stage, binarization parameter, noise feature, new method, different kind, noise removal, benchmarking datasets, noise type, preprocessing