A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä
Investigating the cross-lingual translatability of VerbNet-style classification
Tekijät: Majewska O., Vulić I., McCarthy D., Huang Y., Murakami A., Laippala V., Korhonen A.
Kustantaja: Springer Netherlands
Julkaisuvuosi: 2018
Journal: Language Resources and Evaluation
Tietokannassa oleva lehden nimi: Language Resources and Evaluation
Vuosikerta: 52
Numero: 3
Aloitussivu: 771
Lopetussivu: 799
Sivujen määrä: 29
ISSN: 1574-020X
eISSN: 1574-0218
DOI: https://doi.org/10.1007/s10579-017-9403-x
Verkko-osoite: https://link.springer.com/article/10.1007/s10579-017-9403-x
Rinnakkaistallenteen osoite: https://research.utu.fi/converis/portal/Publication/28544212
VerbNet—the most extensive online verb lexicon currently available for
English—has proved useful in supporting a variety of NLP tasks. However,
its exploitation in multilingual NLP has been limited by the fact that
such classifications are available for few languages only. Since manual
development of VerbNet is a major undertaking, researchers have recently
translated VerbNet classes from English to other languages. However, no
systematic investigation has been conducted into the applicability and
accuracy of such a translation approach across different, typologically
diverse languages. Our study is aimed at filling this gap. We develop a
systematic method for translation of VerbNet classes from English to
other languages which we first apply to Polish and subsequently to
Croatian, Mandarin, Japanese, Italian, and Finnish. Our results on
Polish demonstrate high translatability with all the classes (96% of
English member verbs successfully translated into Polish) and strong
inter-annotator agreement, revealing a promising degree of overlap in
the resultant classifications. The results on other languages are
equally promising. This demonstrates that VerbNet classes have strong
cross-lingual potential and the proposed method could be applied to
obtain gold standards for automatic verb classification in different
languages. We make our annotation guidelines and the six
language-specific verb classifications available with this paper. © 2017
The Author(s)
Ladattava julkaisu This is an electronic reprint of the original article. |