Refereed article in compilation book (A3)

Syntactic properties of constrained English: A corpus-driven approach




List of AuthorsIvaska Ilmari, Ferraresi Adriano, Bernardini Silvia

EditorsSylviane Granger, Marie-Aude Lefer

PlaceLondon

Publication year2022

Book title *Extending the Scope of Corpus-Based Translation Studies

Start page133

End page157

ISBN978-1-3501-4325-8

eISBN978-1-3501-4328-9

DOIhttp://dx.doi.org/10.5040/9781350143289.0013

URLhttp://dx.doi.org/10.5040/9781350143289.0013

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/69304540


Abstract

This chapter explores the common ground shared by non-native (L2) and translated language (TrL), seen as instances of constrained language use. It has been suggested that these diverge from native non-translated language (L1) in consistent ways. We explore this hypothesis in a corpus-driven manner, comparing written English in its L2 and TrL varieties, setting them against the benchmark of the L1 variety. In an attempt to control for confounding variables, we include two first/source languages for the constrained varieties, as well as three registers (argumentative writing, political speeches and tourism-related communication), which also allows us to increase representativeness. Methodologically, we look at frequencies of part-of-speech dependency bigrams, adopting keyness analysis and multidimensional analysis to detect and interpret differences between the contrasted varieties. The strengths of the approach are that it relies on syntactically parsed data instead of shallow part-of-speech sequences, is fully data-driven and can be easily implemented in different languages. Results indicate a tendency for the constrained varieties to rely on post-nominal modification and common nouns with determiners to a greater extent than non-constrained varieties, and to display a peculiar use of syntactic structures including proper nouns. Registers are found to impact greatly on results, and cross-register differences to be less prominent in the constrained varieties, which might point to a less heightened sensitivity to register conventions when performing language tasks under constraint of another language. Given the vast amount of variation in the data, the contribution ends on a note of caution when generalizing over-interpretations of constrained language data.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.




Last updated on 2022-08-11 at 13:57