A4 Peer-reviewed article in conference proceedings
Mind the Gap: Data Enrichment in Dependency Parsing of Elliptical Constructions
Authors: Kira Droganova, Filip Ginter, Jenna Kanerva, Daniel Zeman
Editors: Marie-Catherine de Marneffe, Teresa Lynn, Sebastian Schuster
Established conference name: Universal Dependencies Workshop
Publication year: 2018
Title of the collected volume: Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)
First page: 47
Last page: 54
ISBN: 978-1-948087-78-0
Web address: http://aclweb.org/anthology/W18-6006
Address of the self-archived copy: https://research.utu.fi/converis/portal/detail/Publication/37617276
In this paper, we focus on parsing rare and non-trivial constructions, in particular ellipsis. We report on several experiments in enriching training data for this specific construction, evaluated on five languages: Czech, English, Finnish, Russian and Slovak. These data enrichment methods draw upon self-training and tri-training, combined with a stratified sampling method that mimics the structural complexity of the original treebank. In addition, using these same methods, we demonstrate small improvements over the winning system of the CoNLL-17 parsing shared task for four of the five languages; these improvements are not restricted to elliptical constructions.
Downloadable publication: This is an electronic reprint of the original article.