当前位置: X-MOL 学术Lang. Resour. Eval. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
C2SI corpus: a database of speech disorder productions to assess intelligibility and quality of life in head and neck cancers
Language Resources and Evaluation ( IF 1.7 ) Pub Date : 2020-06-15 , DOI: 10.1007/s10579-020-09496-3
Virginie Woisard , Corine Astésano , Mathieu Balaguer , Jérôme Farinas , Corinne Fredouille , Pascal Gaillard , Alain Ghio , Laurence Giusti , Imed Laaridh , Muriel Lalain , Benoît Lepage , Julie Mauclair , Olivier Nocaudie , Julien Pinquier , Gilles Pouchoulin , Michèle Puech , Danièle Robert , Vincent Roger

Within the framework of the Carcinologic Speech Severity Index (C2SI) INCa Project, we collected a large database of French speech recordings aiming at validating Disorder Severity Indexes. Such a database will be useful for measuring the impact of oral and pharyngeal cavity cancer on speech production. It will permit to assess patients’ quality of life after treatment. The database is composed of audio recordings from 134 sessions and associated metadata. Several intelligibility and comprehensibility levels of speech functions have been evaluated. Acoustics and prosody have been assessed. Perceptual evaluation rates from both naive and expert juries are being produced. Automatic analyzes are being carried out. It is intended to provide speech therapists and physicians with objective tools, which take into account the intelligibility and comprehensibility of patients which received cancer treatment (surgery and/or radiotherapy and/or chemotherapy). The aim of this paper is to justify the necessity of such a corpus and to present its data collection. This C2SI corpus will be available to the scientific community through the Scientific Interest Group Parolothèque.



中文翻译:

C2SI语料库:语言障碍产生的数据库,用于评估头颈癌的清晰度和生活质量

在癌性语音严重度指数(C2SI)INCa项目的框架内,我们收集了一个大型的法国语音记录数据库,旨在验证疾病严重度指数。这样的数据库对于测量口腔和咽腔癌对语音产生的影响将是有用的。它将允许评估治疗后患者的生活质量。该数据库由来自134个会话的音频记录和相关的元数据组成。语音功能的几种清晰度和可理解性水平已得到评估。声学和韵律已得到评估。来自天真的和专业的陪审团的感知评价率正在产生。自动分析正在进行中。目的是为言语治疗师和医师提供客观的工具,考虑到接受癌症治疗(手术和/或放射疗法和/或化学疗法)的患者的清晰度和可理解性。本文的目的是证明这种语料库的必要性并介绍其数据收集。该C2SI语料库将通过科学兴趣小组Parolothèque提供给科学界。

更新日期:2020-07-24
down
wechat
bug