A4 Refereed article in a conference publication

Applications of Natural Language Processing in Bilingual Language Teaching: An Indonesian-English Case Study

Authors: Zara Maxwell-Smith, Simón González Ochoa, Ben Foley, Hanna Suominen

Editors: Jill Burstein, Ekaterina Kochmar, Claudia Leacock, Nitin Madnani, Ildikó Pilán, Helen Yannakoudakis, Torsten Zesch

Conference name: Workshop on Innovative Use of NLP for Building Educational Applications

Publication year: 2020

Journal: Annual Meeting of the Association for Computational Linguistics

Book title: Proceedings of the Fifteenth Workshop on Innovative Use of NLP for Building Educational Applications (BEA)

First page: 124

Last page: 134

eISBN: 978-1-952148-18-7

ISSN: 0736-587X

DOI: https://doi.org/10.18653/v1/2020.bea-1.12

Self-archived copy’s web address: https://research.utu.fi/converis/portal/Publication/52185695

Abstract

Multilingual corpora are difficult to compile and a classroom setting adds pedagogy to the mix of factors which make this data so rich and problematic to classify. In this paper, we set out methodological considerations of using automated speech recognition to build a corpus of teacher speech in an Indonesian language classroom. Our preliminary results (64% word error rate) suggest these tools have the potential to speed data collection in this context. We provide practical examples of our data structure, details of our piloted computer-assisted processes, and fine-grained error analysis. Our study is informed and directed by genuine research questions and discussion in both the education and computational linguistics fields. We highlight some of the benefits and risks of using these emerging technologies to analyze the complex work of language teachers and in education more generally.

Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.

2020.bea-1.12.pdf