当前位置: X-MOL 学术Language Testing › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Time to achieving a designated criterion score level: A survival analysis study of test taker performance on the TOEFL iBT® test
Language Testing ( IF 2.2 ) Pub Date : 2020-08-20 , DOI: 10.1177/0265532220940709
Lora F. Monfils 1 , Venessa F. Manna 1
Affiliation  

This study used survival analysis to examine the patterns and factors associated with time to achieving designated score criteria on a test of English as a foreign language. This was modeled using an extension of the Cox regression model, with two criterion score levels defined as achieving a TOEFL iBT® total test scale score at or above the Common European Framework of Reference (CEFR) Level B2 and at Level C1, respectively. Factors included in the model were test taker background characteristics including age, gender, native language type, exposure to English, and reason for testing. Additionally, to account for those who tested more than once within the study period, and thus had multiple records, an indicator for order of testing occasion was included in the model. Results indicate that approximately 82% of the test takers in our study sample tested one time in the study period (2014–2016), and the number of repeaters decreased rapidly across occasions. For those who did not achieve the designated criterion scores at first testing, the likelihood of achievement increases with repeated testing, with a somewhat greater effect for the less stringent B2 criterion. Results also indicate that the association of gender with performance differed across levels.

中文翻译:

达到指定标准分数水平的时间:对考生在 TOEFL iBT® 考试中表现的生存分析研究

本研究使用生存分析来检查与在英语作为外语测试中达到指定分数标准所需的时间相关的模式和因素。这是使用 Cox 回归模型的扩展进行建模的,其中两个标准分数级别定义为分别达到或高于欧洲共同参考框架 (CEFR) B2 级和 C1 级的 TOEFL iBT® 总考试量表分数。模型中包含的因素是应试者背景特征,包括年龄、性别、母语类型、接触英语和测试原因。此外,为了说明在研究期间测试多次并因此有多个记录的人,模型中包含了测试时间顺序的指标。结果表明,在我们的研究样本中,大约 82% 的考生在研究期间(2014-2016 年)测试了一次,并且重复的人数在不同情况下迅速减少。对于那些在第一次测试时没有达到指定标准分数的人,通过重复测试,达到的可能性会增加,对不太严格的 B2 标准的影响稍大。结果还表明,性别与绩效的关联因级别而异。
更新日期:2020-08-20
down
wechat
bug