A4 Peer-reviewed article in conference proceedings

Mind the Gap: Data Enrichment in Dependency Parsing of Elliptical Constructions




Authors: Kira Droganova, Filip Ginter, Jenna Kanerva, Daniel Zeman

Editors: Marie-Catherine de Marneffe, Teresa Lynn, Sebastian Schuster

Established conference name: Universal Dependencies Workshop

Publication year: 2018

Title of parent publication: Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)

First page: 47

Last page: 54

ISBN: 978-1-948087-78-0

Web address: http://aclweb.org/anthology/W18-6006

Self-archived copy's web address: https://research.utu.fi/converis/portal/detail/Publication/37617276


Abstract

In this paper, we focus on parsing rare and non-trivial constructions, in particular ellipsis. We report on several experiments in enrichment of training data for this specific construction, evaluated on five languages: Czech, English, Finnish, Russian and Slovak. These data enrichment methods draw upon self-training and tri-training, combined with a stratified sampling method mimicking the structural complexity of the original treebank. In addition, using these same methods, we also demonstrate small improvements over the CoNLL-17 parsing shared task winning system for four of the five languages, not only restricted to the elliptical constructions.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-11-26 at 22:56