Universal Dependencies v1: A Multilingual Treebank Collection
: Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajic, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Daniel Zeman
: Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
: International Conference on Language Resources and Evaluation
: Paris, France
: 2016
: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)
: 1659
: 1666
: 8
: 978-2-9517408-9-1
: http://www.lrec-conf.org/proceedings/lrec2016/index.html
Cross-linguistically consistent annotation is necessary for sound comparative evaluation and cross-lingual learning experiments. It is also useful for multilingual system development and comparative linguistic studies. Universal Dependencies is an open community effort to create cross-linguistically consistent treebank annotation for many languages within a dependency-based lexicalist framework. In this paper, we describe v1 of the universal guidelines, the underlying design principles, and the currently available treebanks for 33 languages.