Universal Dependencies 2.0 – CoNLL 2017 Shared Task Development and Test Data

Joakim Nivre, Željko Agić, Lars Ahrenberg, Lene Antonsen, Maria Jesus Aranzabe, Masayuki Asahara, Luma Ateyah, Mohammed Attia, Aitziber Atutxa, Elena Badmaeva, Miguel Ballesteros, Esha Banerjee, Sebastian Bank, John Bauer, Kepa Bengoetxea, Riyaz Ahmad Bhat, Eckhard Bick, Cristina Bosco, Gosse Bouma, Sam Bowman, Aljoscha Burchardt, Marie Candito, Gauthier Caron, Gülşen Cebiroğlu Eryiğit, Giuseppe G. A. Celano, Savas Cetin, Fabricio Chalub, Jinho Choi, Yongseok Cho, Silvie Cinková, Çağrı Çöltekin, Miriam Connor, Marie-Catherine de Marneffe, Valeria de Paiva, Arantza Diaz de Ilarraza, Kaja Dobrovoljc, Timothy Dozat, Kira Droganova, Marhaba Eli, Ali Elkahky, Tomaž Erjavec, Richárd Farkas, Hector Fernandez Alcalde, Jennifer Foster, Cláudia Freitas, Katarína Gajdošová, Daniel Galbraith, Marcos Garcia, Filip Ginter, Iakes Goenaga, Koldo Gojenola, Memduh Gökırmak, Yoav Goldberg, Xavier Gómez Guinovart, Berta Gonzáles Saavedra, Matias Grioni, Normunds Grūzītis, Bruno Guillaume, Nizar Habash, Jan Hajič, Linh Hà Mỹ, Kim Harris, Dag Haug, Barbora Hladká, Jaroslava Hlaváčová, Petter Hohle, Radu Ion, Elena Irimia, Anders Johannsen, Fredrik Jørgensen, Hüner Kaşıkara, Hiroshi Kanayama, Jenna Kanerva, Tolga Kayadelen, Václava Kettnerová, Jesse Kirchner, Natalia Kotsyba, Simon Krek, Sookyoung Kwak, Veronika Laippala, Lorenzo Lambertino, Tatiana Lando, Phương Lê Hồng, Alessandro Lenci, Saran Lertpradit, Herman Leung, Cheuk Ying Li, Josie Li, Nikola Ljubešić, Olga Loginova, Olga Lyashevskaya, Teresa Lynn, Vivien Macketanz, Aibek Makazhanov, Michael Mandl, Christopher Manning, Ruli Manurung, Cătălina Mărănduc, David Mareček, Katrin Marheinecke, Héctor Martínez Alonso, André Martins, Jan Mašek, Yuji Matsumoto, Ryan McDonald, Gustavo Mendonça, Anna Missilä, Verginica Mititelu, Yusuke Miyao, Simonetta Montemagni, Amir More, Laura Moreno Romero, Shunsuke Mori, Bohdan Moskalevskyi, Kadri Muischnek, Nina Mustafina, Kaili Müürisep, Pinkey Nainwani, Anna Nedoluzhko, Lương Nguyễn Thị, Huyền Nguyễn Thị Minh, Vitaly Nikolaev, Rattima Nitisaroj, Hanna Nurmi, Stina Ojala, Petya Osenova, Lilja Øvrelid, Elena Pascual, Marco Passarotti, Cenel-Augusto Perez, Guy Perrier, Slav Petrov, Jussi Piitulainen, Emily Pitler, Barbara Plank, Martin Popel, Lauma Pretkalniņa, Prokopis Prokopidis, Tiina Puolakainen, Sampo Pyysalo, Alexandre Rademaker, Livy Real, Siva Reddy, Georg Rehm, Larissa Rinaldi, Laura Rituma, Rudolf Rosa, Davide Rovati, Shadi Saleh, Manuela Sanguinetti, Baiba Saulīte, Yanin Sawanakunanon, Sebastian Schuster, Djamé Seddah, Wolfgang Seeker, Mojgan Seraji, Lena Shakurova, Mo Shen, Atsuko Shimada, Muh Shohibussirri, Natalia Silveira, Maria Simi, Radu Simionescu, Katalin Simkó, Mária Šimková, Kiril Simov, Aaron Smith, Antonio Stella, Jana Strnadová, Alane Suhr, Umut Sulubacak, Zsolt Szántó, Dima Taji, Takaaki Tanaka, Trond Trosterud, Anna Trukhina, Reut Tsarfaty, Francis Tyers, Sumire Uematsu, Zdeňka Urešová, Larraitz Uria, Hans Uszkoreit, Gertjan van Noord, Viktor Varga, Veronika Vincze, Jonathan North Washington, Zhuoran Yu, Zdeněk Žabokrtský, Daniel Zeman, Hanzhi Zhu

(2017)

Abstract
Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008). This release contains the test data used in the CoNLL 2017 shared task on parsing Universal Dependencies. Because of the shared task, the test data was kept hidden and was not released together with the training and development data of UD 2.0. This release therefore complements the UD 2.0 release (http://hdl.handle.net/11234/1-1983), and the two together form a full release of the UD treebanks. In addition, the present release contains 18 new parallel test sets and 4 test sets in surprise languages. It also includes the development data already released with UD 2.0. Unlike regular UD releases, this one uses the folder-file structure that was visible to the systems participating in the shared task.
Keywords
Parsing, Treebank, Interlingua, Test data, Annotation, Natural language processing, Syntax, Programming language, Computer science, Artificial intelligence, Universal dependencies