A4 Refereed article in a conference publication

Template-free Data-to-Text Generation of Finnish Sports News




AuthorsJenna Kanerva, Samuel Rönnqvist, Riina Kekki, Tapio Salakoski, Filip Ginter

EditorsMareike Hartmann, Barbara Plank

Conference nameNordic Conference on Computational Linguistics

Publishing placeLinköping

Publication year2019

JournalLinköping Electronic Conference Proceedings

Book title Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30–October 2, Turku, Finland

Series titleNEALT Proceedings Series

Number in series42

First page 242

Last page252

ISBN978-91-7929-995-8

Web address https://www.aclweb.org/anthology/W19-6125/

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/44407121


Abstract

News articles such as sports game reports
are often thought to closely follow the underlying game statistics, but in practice
they contain a notable amount of background knowledge, interpretation, insight
into the game, and quotes that are not
present in the official statistics. This
poses a challenge for automated data-totext news generation with real-world news
corpora as training data. We report on
the development of a corpus of Finnish
ice hockey news, edited to be suitable
for training of end-to-end news generation
methods, as well as demonstrate generation of text, which was judged by journalists to be relatively close to a viable product. The new dataset and system source
code are available for research purposes.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-26-11 at 21:19