当前位置: X-MOL 学术Nordic Journal of Linguistics › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Constrained language use in Finnish: A corpus-driven approach
Nordic Journal of Linguistics ( IF 0.5 ) Pub Date : 2020-04-13 , DOI: 10.1017/s0332586520000013
Ilmari Ivaska , Silvia Bernardini

It has been suggested that second languages and translated languages are constrained by an interplay of several linguistic systems. This paper reports on a data-driven quantitative study on constrained Finnish. We detect linguistic phenomena that distinguish constrained from non-constrained Finnish across constrained varieties, first/source languages, and registers. Implementing a two-phase method, we first detect key quantitative differences of syntactically defined POS bigrams between each variety-, language-pair- and register-specific constrained dataset and its non-constrained counterpart, using Boruta feature selection. We then use the results as variables in a Multi-dimensional Analysis. The results show that both nominal complexity and verbal/clausal complexity distinguish constrained from non-constrained Finnish. These differences interact with both type of constraint and register: the constrained varieties are less sensitive to register differences, and this tendency is more pronounced in learner Finnish than in translated Finnish. Leaving out any of these variables from the analysis would blur our view of this multi-faceted phenomenon.

中文翻译:

芬兰语中的受限语言使用:语料库驱动的方法

有人提出,第二语言和翻译语言受到多种语言系统相互作用的限制。本文报告了对受限芬兰语的数据驱动定量研究。我们检测语言现象,这些现象在受约束的变体、第一/源语言和语域中区分受约束的芬兰语和不受约束的芬兰语。实施两阶段方法,我们首先使用 Boruta 特征选择检测每个品种、语言对和寄存器特定的约束数据集与其非约束数据集之间的句法定义的 POS 二元组的关键定量差异。然后,我们将结果用作多维分析中的变量。结果表明,名义复杂性和口头/从句复杂性都区分了受限芬兰语和非受限芬兰语。这些差异与约束类型和语域相互作用:受约束的变体对语域差异不太敏感,这种趋势在芬兰语学习者中比在芬兰语翻译中更为明显。从分析中忽略这些变量中的任何一个都会模糊我们对这种多方面现象的看法。
更新日期:2020-04-13
down
wechat
bug