Constant delay algorithms for regular document spanners
SIGMOD/PODS '18: International Conference on Management of Data Houston TX USA June, 2018, pp. 165-177, 2018.
Regular expressions and automata models with capture variables are core tools in rule-based information extraction. These formalisms, also called regular document spanners, use regular languages in order to locate the data that a user wants to extract from a text document, and then store this data into variables. Since document spanners c...More
Full Text (Upload PDF)
PPT (Upload PPT)