Corpus Creation for New Genres: A Crowdsourced Approach to PP Attachment.

CSLDAMT '10: Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk(2010)

引用 16|浏览54
暂无评分
摘要
This paper explores the task of building an accurate prepositional phrase attachment corpus for new genres while avoiding a large investment in terms of time and money by crowd-sourcing judgments. We develop and present a system to extract prepositional phrases and their potential attachments from ungrammatical and informal sentences and pose the subsequent disambiguation tasks as multiple choice questions to workers from Amazon's Mechanical Turk service. Our analysis shows that this two-step approach is capable of producing reliable annotations on informal and potentially noisy blog text, and this semi-automated strategy holds promise for similar annotation projects in new genres.
更多
查看译文
关键词
new genre,accurate prepositional phrase attachment,informal sentence,prepositional phrase,Mechanical Turk service,crowd-sourcing judgment,large investment,multiple choice question,noisy blog text,potential attachment,PP attachment,corpus creation,crowdsourced approach
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要