A3 Refereed book chapter or chapter in a compilation book

Syntactic properties of constrained English: A corpus-driven approach

Authors: Ivaska Ilmari, Ferraresi Adriano, Bernardini Silvia

Editors: Sylviane Granger, Marie-Aude Lefer

Publishing place: London

Publication year: 2022

Book title: Extending the Scope of Corpus-Based Translation Studies

First page: 133

Last page: 157

ISBN: 978-1-3501-4325-8

eISBN: 978-1-3501-4328-9

DOI: https://doi.org/10.5040/9781350143289.0013

Publication's open availability at the time of reporting: No Open Access

Publication channel's open availability: No Open Access publication channel

Web address: http://dx.doi.org/10.5040/9781350143289.0013

Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/69304540

Self-archived copy's version: Final draft

Abstract

This chapter explores the common ground shared by non-native language (L2) and translated language (TrL), both seen as instances of constrained language use. It has been suggested that these varieties diverge from native non-translated language (L1) in consistent ways. We explore this hypothesis in a corpus-driven manner, comparing written English in its L2 and TrL varieties against the benchmark of the L1 variety. To control for confounding variables, we include two first/source languages for the constrained varieties, as well as three registers (argumentative writing, political speeches and tourism-related communication), which also increases representativeness. Methodologically, we look at frequencies of part-of-speech dependency bigrams, adopting keyness analysis and multidimensional analysis to detect and interpret differences between the contrasted varieties. The strengths of the approach are that it relies on syntactically parsed data instead of shallow part-of-speech sequences, is fully data-driven and can be easily implemented in different languages. Results indicate a tendency for the constrained varieties to rely on post-nominal modification and common nouns with determiners to a greater extent than non-constrained varieties, and to display a peculiar use of syntactic structures including proper nouns. Register is found to affect the results strongly, and cross-register differences are less prominent in the constrained varieties, which might point to reduced sensitivity to register conventions when performing language tasks under the constraint of another language. Given the vast amount of variation in the data, the contribution ends on a note of caution against over-generalizing interpretations of constrained language data.
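To make the method described in the abstract more concrete, the following is a minimal sketch, not the authors' implementation. It assumes that a part-of-speech dependency bigram is the pair (head POS, dependent POS) taken from a dependency parse, and it uses the log-likelihood (G2) statistic as one common keyness measure; the abstract specifies neither choice. The function names and the toy sentence are hypothetical.

```python
import math
from collections import Counter


def pos_dependency_bigrams(sentence):
    """Return (head POS, dependent POS) pairs for one parsed sentence.

    `sentence` is a list of (token_id, upos, head_id) tuples, with
    head_id == 0 marking the root (an assumed, CoNLL-U-like layout).
    """
    pos_by_id = {token_id: upos for token_id, upos, _ in sentence}
    bigrams = []
    for _, upos, head_id in sentence:
        if head_id == 0:
            continue  # the root has no head, so it yields no bigram
        bigrams.append((pos_by_id[head_id], upos))
    return bigrams


def log_likelihood(a, b, total_a, total_b):
    """G2 for one item seen a times in corpus A and b times in corpus B."""
    expected_a = total_a * (a + b) / (total_a + total_b)
    expected_b = total_b * (a + b) / (total_a + total_b)
    g2 = 0.0
    if a > 0:
        g2 += a * math.log(a / expected_a)
    if b > 0:
        g2 += b * math.log(b / expected_b)
    return 2 * g2


def keyness(study_counts, reference_counts):
    """Signed G2 per bigram: positive means overused in the study corpus.

    Both counters are assumed to be non-empty.
    """
    total_s = sum(study_counts.values())
    total_r = sum(reference_counts.values())
    scores = {}
    for item in set(study_counts) | set(reference_counts):
        a = study_counts.get(item, 0)
        b = reference_counts.get(item, 0)
        sign = 1 if a / total_s >= b / total_r else -1
        scores[item] = sign * log_likelihood(a, b, total_s, total_r)
    return scores


# Toy parse of "the cat sleeps": DET -> NOUN -> VERB (root).
toy_sentence = [(1, "DET", 2), (2, "NOUN", 3), (3, "VERB", 0)]
toy_bigrams = pos_dependency_bigrams(toy_sentence)
```

In practice the bigram counts would be accumulated over each constrained corpus (L2 or TrL) in a `Counter` and passed to `keyness` together with the counts from the L1 benchmark corpus.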

Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.