Sentence Compression for Automatic Subtitling




Juhani Luotolahti, Filip Ginter

Beáta Megyesi

Nordic Conference on Computational Linguistics

2015

Proceedings of NoDaLiDa 2015

134

143

10

978-91-7519-098-3

https://aclweb.org/anthology/W/W15/W15-1818.pdf



This paper investigates sentence compression for automatic subtitle generation using supervised machine learning. We present a method for sentence compression and discuss the generation of training data from compressed Finnish sentences, as well as different approaches to the problem. The method we present outperforms a state-of-the-art baseline in both automatic and human evaluation. On real data, 44.9% of the sentences produced by the compression algorithm were judged to be useable as-is or after minor edits.



Last updated on 2024-11-26 at 12:48