Tools for the Efficient Generation of Hand-Drawn Corpora Based on Context-Free Grammars

SBIM '09: Proceedings of the 6th Eurographics Symposium on Sketch-Based Interfaces and Modeling(2009)

引用 11|浏览0
暂无评分
摘要
In sketch recognition systems, ground-truth data sets serve to both train and test recognition algorithms. Unfortunately, generating data sets that are sufficiently large and varied is frequently a costly and time-consuming endeavour. In this paper, we present a novel technique for creating a large and varied ground-truthed corpus for hand drawn math recognition. Candidate math expressions for the corpus are generated via random walks through a context-free grammar, the expressions are transcribed by human writers, and an algorithm automatically generates ground-truth data for individual symbols and inter-symbol relationships within the math expressions. While the techniques we develop in this paper are illustrated through the creation of a ground-truthed corpus of mathematical expressions, they are applicable to any sketching domain that can be described by a formal grammar.
更多
查看译文
关键词
context-free grammar,sketch recognition system,efficient generation,ground-truth data set,hand-drawn corpus,test recognition algorithm,math recognition,candidate math expression,ground-truthed corpus,math expression,ground-truth data,sketch recognition,ground truth,context free grammar,formal grammar,random walk
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要