DocBank: A Benchmark Dataset for Document Layout Analysis

COLING, pp. 949-960, 2020.

Cited by: 5|Views141
EI

Abstract:

Document layout analysis usually relies on computer vision models to understand documents while ignoring textual information that is vital to capture. Meanwhile, high quality labeled datasets with both visual and textual information are still insufficient. In this paper, we present \textbf{DocBank}, a benchmark dataset with fine-grained...More

Code:

Data:

Get fulltext within 24h
Bibtex
Your rating :
0

 

Tags
Comments