当前位置: X-MOL 学术Lang. Resour. Eval. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Reproducibility in speech rate convergence experiments
Language Resources and Evaluation ( IF 2.7 ) Pub Date : 2021-01-29 , DOI: 10.1007/s10579-021-09528-6
Simone Fuscone , Benoit Favre , Laurent Prévot

The reproducibility of scientific studies grounded on language corpora requires approaching each step carefully, from data selection and pre-processing to significance testing. In this paper, we report on our reproduction of a recent study based on a well-known conversational corpus (Switchboard). The reproduced study Cohen Priva et al. (J Acoust Soc Am 141(5):2989–2996, 2017) focuses on speech rate convergence between speakers in conversation. While our reproduction confirms the main result of the original study, it also shows interesting variations in the details. In addition, we tested the original study for the robustness of its data selection and pre-processing, as well as the underlying model of speech rate, the variable observed. Our analysis shows that another approach is needed to take into account the complex aspects of speech rate in conversations. Another benefit of reproducing previous studies is to take analysis a step further, testing and strengthening the results of other research teams and increasing the validity and visibility of interesting studies and results. In this line, we also created a notebook of pre-processing and analysis scripts which is available online.



中文翻译:

语速收敛实验中的可重复性

基于语言语料库的科学研究的可再现性要求仔细地执行每个步骤,从数据选择和预处理到重要性测试。在本文中,我们报告了基于著名的会话语料库(总机)的最新研究成果。转载的研究Cohen Priva等。(J Acoust Soc Am 141(5):2989-2996,2017)专注于会话中说话者之间的语速收敛。我们的复制品证实了原始研究的主要结果,但同时也显示出有趣的细节变化。此外,我们测试了原始研究的数据选择和预处理的健壮性,以及语音速率的基础模型(观察到的变量)。我们的分析表明,需要另一种方法来考虑对话中语音速率的复杂方面。复制以前的研究的另一个好处是,可以进一步进行分析,测试和增强其他研究团队的结果,并提高有趣的研究和结果的有效性和可见性。在这一行中,我们还创建了一个预处理和分析脚本的笔记本,可以在线获取。

更新日期:2021-01-29
down
wechat
bug