A4 Refereed article in a conference publication
Template-free Data-to-Text Generation of Finnish Sports News
Authors: Jenna Kanerva, Samuel Rönnqvist, Riina Kekki, Tapio Salakoski, Filip Ginter
Editors: Mareike Hartmann, Barbara Plank
Conference name: Nordic Conference on Computational Linguistics
Publishing place: Linköping
Publication year: 2019
Journal: Linköping Electronic Conference Proceedings
Book title : Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30–October 2, Turku, Finland
Series title: NEALT Proceedings Series
Number in series: 42
First page : 242
Last page: 252
ISBN: 978-91-7929-995-8
Web address : https://www.aclweb.org/anthology/W19-6125/
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/44407121
News articles such as sports game reports
are often thought to closely follow the underlying game statistics, but in practice
they contain a notable amount of background knowledge, interpretation, insight
into the game, and quotes that are not
present in the official statistics. This
poses a challenge for automated data-totext news generation with real-world news
corpora as training data. We report on
the development of a corpus of Finnish
ice hockey news, edited to be suitable
for training of end-to-end news generation
methods, as well as demonstrate generation of text, which was judged by journalists to be relatively close to a viable product. The new dataset and system source
code are available for research purposes.
Downloadable publication This is an electronic reprint of the original article. |