A4 Refereed article in a conference publication

Sentence Compression for Automatic Subtitling




AuthorsJuhani Luotolahti, Filip Ginter

EditorsBeäta Megyesi

Conference nameNordic Conference on Computational Linguistics

Publication year2015

Book title Proceedings of NoDaLiDa 2015

First page 134

Last page143

Number of pages10

ISBN978-91-7519-098-3

Web address https://aclweb.org/anthology/W/W15/W15-1818.pdf


Abstract

This paper investigates sentence compression for automatic subtitle generation using supervised machine learning. We present a method for sentence compression as well as discuss generation of training data from compressed Finnish sentences, and different approaches to the problem. The method we present outperforms state-of-the-art baseline in both automatic and human  valuation. On real data, 44.9% of the sentences produced by the compression algorithm have been judged to be useable as-is or after minor edits.



Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-26-11 at 12:48