A4 Peer-reviewed article in conference proceedings

Template-free Data-to-Text Generation of Finnish Sports News




Authors: Jenna Kanerva, Samuel Rönnqvist, Riina Kekki, Tapio Salakoski, Filip Ginter

Editors: Mareike Hartmann, Barbara Plank

Established conference name: Nordic Conference on Computational Linguistics

Place of publication: Linköping

Publication year: 2019

Journal: Linköping Electronic Conference Proceedings

Title of the collected work: Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30–October 2, Turku, Finland

Series name: NEALT Proceedings Series

Number in series: 42

First page: 242

Last page: 252

ISBN: 978-91-7929-995-8

Web address: https://www.aclweb.org/anthology/W19-6125/

Self-archived copy address: https://research.utu.fi/converis/portal/detail/Publication/44407121


Abstract

News articles such as sports game reports are often thought to closely follow the underlying game statistics, but in practice they contain a notable amount of background knowledge, interpretation, insight into the game, and quotes that are not present in the official statistics. This poses a challenge for automated data-to-text news generation with real-world news corpora as training data. We report on the development of a corpus of Finnish ice hockey news, edited to be suitable for training of end-to-end news generation methods, as well as demonstrate generation of text, which was judged by journalists to be relatively close to a viable product. The new dataset and system source code are available for research purposes.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-11-26 at 21:19