Efficient Enumeration Algorithms for Regular Document Spanners

    ACM Transactions on Database Systems (TODS), pp. 1-42, 2020.

    Cited by: 1|Bibtex|Views1|Links
    EI
    Keywords:
    Information extractionautomatacapture variablesenumeration delayspanners

    Abstract:

    Regular expressions and automata models with capture variables are core tools in rule-based information extraction. These formalisms, also called regular document spanners, use regular languages to locate the data that a user wants to extract from a text document and then store this data into variables. Since document spanners can easily ...More

    Code:

    Data:

    Your rating :
    0

     

    Tags
    Comments