A4 Refereed article in a conference publication

Universal dependencies for Persian




AuthorsSeraji Mojgan, Ginter Filip, Nivre Joakim

EditorsNicoletta Calzolari (Conference Chair),Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis

Conference nameInternational Conference on Language Resources and Evaluation (LREC)

PublisherEuropean Language Resources Association (ELRA)

Publication year2016

Book title Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

Journal name in sourceProceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016

First page 2361

Last page2365

Number of pages5

ISBN978-2-9517408-9-1

Web address http://www.lrec-conf.org/proceedings/lrec2016/index.html

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/29509325


Abstract

The Persian Universal Dependency Treebank (Persian UD) is a recent effort of treebanking Persian with Universal Dependencies (UD), an ongoing project that designs unified and cross-linguistically valid grammatical representations including part-of-speech tags, morphological features, and dependency relations. The Persian UD is the converted version of the Uppsala Persian Dependency Treebank (UPDT) to the universal dependencies framework and consists of nearly 6,000 sentences and 152,871 word tokens with an average sentence length of 25 words. In addition to the universal dependencies syntactic annotation guidelines, the two treebanks differ in tokenization. All words containing unsegmented clitics (pronominal and copula clitics) annotated with complex labels in the UPDT have been separated from the clitics and appear with distinct labels in the Persian UD. The treebank has its original syntactic annotation scheme based on Stanford Typed Dependencies. In this paper, we present the approaches taken in the development of the Persian UD.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-26-11 at 14:45