A4 Refereed article in a conference publication
Sentence Compression for Automatic Subtitling
Authors: Juhani Luotolahti, Filip Ginter
Editors: Beáta Megyesi
Conference name: Nordic Conference on Computational Linguistics
Publication year: 2015
Book title: Proceedings of NoDaLiDa 2015
First page: 134
Last page: 143
Number of pages: 10
ISBN: 978-91-7519-098-3
Web address: https://aclweb.org/anthology/W/W15/W15-1818.pdf
This paper investigates sentence compression for automatic subtitle generation using supervised machine learning. We present a method for sentence compression and discuss the generation of training data from compressed Finnish sentences, as well as different approaches to the problem. The method we present outperforms a state-of-the-art baseline in both automatic and human evaluation. On real data, 44.9% of the sentences produced by the compression algorithm were judged to be usable as-is or after minor edits.
Downloadable publication
This is an electronic reprint of the original article.