A1 Refereed original research article in a scientific journal
Constrained language use in Finnish: A corpus-driven approach
Authors: Ilmari Ivaska, Silvia Bernardini
Publisher: Cambridge University Press
Publication year: 2020
Journal: Nordic Journal of Linguistics
Journal acronym: NJL
Volume: 43
Issue: 1
First page: 33
Last page: 57
Number of pages: 25
ISSN: 0332-5865
eISSN: 1502-4717
DOI: https://doi.org/10.1017/S0332586520000013
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/46691400
It has been suggested that second languages and translated languages are constrained by an interplay of several linguistic systems. This paper reports on a data-driven quantitative study on constrained Finnish. We detect linguistic phenomena that distinguish constrained from non-constrained Finnish across constrained varieties, first/source languages, and registers. Implementing a two-phase method, we first detect key quantitative differences of syntactically defined POS bigrams between each variety-, language-pair- and register-specific constrained dataset and its non-constrained counterpart, using Boruta feature selection. We then use the results as variables in a Multi-dimensional Analysis. The results show that both nominal complexity and verbal/clausal complexity distinguish constrained from non-constrained Finnish. These differences interact with both type of constraint and register: the constrained varieties are less sensitive to register differences, and this tendency is more pronounced in learner Finnish than in translated Finnish. Leaving out any of these variables from the analysis would blur our view of this multi-faceted phenomenon.
Downloadable publication: This is an electronic reprint of the original article.