A4 Article in conference proceedings
Mind the Gap: Data Enrichment in Dependency Parsing of Elliptical Constructions




List of Authors: Kira Droganova, Filip Ginter, Jenna Kanerva, Daniel Zeman
Publication year: 2018
Book title: Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)
ISBN: 978-1-948087-78-0

Abstract

In this paper, we focus on parsing rare and non-trivial constructions, in particular ellipsis. We report on several experiments in enrichment of training data for this specific construction, evaluated on five languages: Czech, English, Finnish, Russian and Slovak. These data enrichment methods draw upon self-training and tri-training, combined with a stratified sampling method mimicking the structural complexity of the original treebank. In addition, using these same methods, we also demonstrate small improvements over the CoNLL-17 parsing shared task winning system for four of the five languages, not only restricted to the elliptical constructions.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.




Last updated on 2019-01-29 at 11:50