Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection




Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Jan Hajič, Christopher D. Manning, Sampo Pyysalo, Sebastian Schuster, Francis Tyers, Daniel Zeman

Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis

Language Resources and Evaluation Conference

2020

Proceedings of the 12th Language Resources and Evaluation Conference

4034

4043

979-10-95546-34-4

https://www.aclweb.org/anthology/2020.lrec-1.497/

https://research.utu.fi/converis/portal/detail/Publication/51797699



Universal Dependencies is an open community effort to create
cross-linguistically consistent treebank annotation for many languages
within a dependency-based lexicalist framework. The annotation consists
in a linguistically motivated word segmentation; a morphological layer
comprising lemmas, universal part-of-speech tags, and standardized
morphological features; and a syntactic layer focusing on syntactic
relations between predicates, arguments and modifiers. In this paper, we
describe version 2 of the universal guidelines (UD v2), discuss the
major changes from UD v1 to UD v2, and give an overview of the currently
available treebanks for 90 languages.


Last updated on 2024-26-11 at 18:52