Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction - UTU Research Portal

A4 Refereed article in a conference publication

Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction

Authors: Bassignana Elisa, Ginter Filip, Pyysalo Sampo, Rob van der Goot, Plank Barbara

Editors: Tanel Alumäe, Mark Fishel

Conference name: Nordic Conference on Computational Linguistics

Publication year: 2023

Journal: NEALT proceedings series

Book title : Proceedings of The 24th Nordic Conference on Computational Linguistics (NoDaLiDa)

Series title: NEALT proceedings series

Number in series: 52

First page : 80

Last page: 85

ISBN: 978-99-1621-999-7

ISSN: 1736-8197

eISSN: 1736-6305

Publication's open availability at the time of reporting: Open Access

Publication channel's open availability : Open Access publication channel

Web address : https://aclanthology.org/2023.nodalida-1.9

Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/380758650

Self-archived copy's licence: CC BY

Self-archived copy's version: Publisher`s PDF

Abstract

Most research in Relation Extraction (RE) involves the English language, mainly due to the lack of multi-lingual resources. We propose MULTI-CROSSRE, the broadest multi-lingual dataset for RE, including 26 languages in addition to English, and covering six text domains. MULTICROSSRE is a machine translated version of CrossRE (Bassignana and Plank, 2022a), with a sub-portion including more than 200 sentences in seven diverse languages checked by native speakers. We run a baseline model over the 26 new datasets and—as sanity check—over the 26 back-translations to English. Results on the back-translated data are consistent with the ones on the original English CrossRE, indicating high quality of the translation and the resulting dataset.

Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.

2023.nodalida-1.9.pdf