A1 Refereed original research article in a scientific journal

Investigating the cross-lingual translatability of VerbNet-style classification




AuthorsMajewska O., Vulić I., McCarthy D., Huang Y., Murakami A., Laippala V., Korhonen A.

PublisherSpringer Netherlands

Publication year2018

JournalLanguage Resources and Evaluation

Journal name in sourceLanguage Resources and Evaluation

Volume52

Issue3

First page 771

Last page799

Number of pages29

ISSN1574-020X

eISSN1574-0218

DOIhttps://doi.org/10.1007/s10579-017-9403-x

Web address https://link.springer.com/article/10.1007/s10579-017-9403-x

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/Publication/28544212


Abstract

VerbNet—the most extensive online verb lexicon currently available for
English—has proved useful in supporting a variety of NLP tasks. However,
its exploitation in multilingual NLP has been limited by the fact that
such classifications are available for few languages only. Since manual
development of VerbNet is a major undertaking, researchers have recently
translated VerbNet classes from English to other languages. However, no
systematic investigation has been conducted into the applicability and
accuracy of such a translation approach across different, typologically
diverse languages. Our study is aimed at filling this gap. We develop a
systematic method for translation of VerbNet classes from English to
other languages which we first apply to Polish and subsequently to
Croatian, Mandarin, Japanese, Italian, and Finnish. Our results on
Polish demonstrate high translatability with all the classes (96% of
English member verbs successfully translated into Polish) and strong
inter-annotator agreement, revealing a promising degree of overlap in
the resultant classifications. The results on other languages are
equally promising. This demonstrates that VerbNet classes have strong
cross-lingual potential and the proposed method could be applied to
obtain gold standards for automatic verb classification in different
languages. We make our annotation guidelines and the six
language-specific verb classifications available with this paper. © 2017
The Author(s)


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-26-11 at 11:41