Abstract
Oral productions of speakers with Down syndrome exhibit special characteristics that have been the target of study for decades. In spite of this attention, the availability of rich resources for its analysis is still scarce. In this paper, we present the definition and compiling procedure of a corpus of semi-controlled oral productions of speakers with Down syndrome that aims to allow the analysis of how these speakers with these speakers produce functional and linguistic aspects of speech. The PRAUTOCAL corpus has been recorded while using a video game for training oral competences. Utterances are related to well defined communicative tasks recorded by both speakers with Down syndrome and typically developing speakers. We present the procedure for human experts to evaluate the recordings and the transcription criteria followed for enriching the utterances of the corpus. PRAUTOCAL permits the analysis of the clear contrast in voice and speech between individuals with Down syndrome and typically developing speakers, taking into account the high heterogeneity of the speech problems characteristic of the syndrome. This material allows the analysis of the speech problems in Down syndrome, with applications to the generation of knowledge that could be used in future works for therapists to prepare specific training or enriching diagnosis regarding possible speech and language disorders.
Similar content being viewed by others
Notes
Resolution PI 20-1639 of the Ethics Committee
References
Abbeduto, L. (2008). Pragmatic development. Down Syndrome Research and Practice. https://doi.org/10.3104/reviews.2078
Abbeduto, L., Warren, S. F., & Conners, F. A. (2007). Language development in down syndrome: From the prelinguistic period to the acquisition of literacy. Mental Retardation and Developmental Disabilities Research Reviews, 13(3), 247–261.
Adell, J., Escudero, D., & Bonafonte, A. (2012). Production of filled pauses in concatenative speech synthesis based on the underlying fluent sentence. Speech Communication, 54(3), 459–476.
Aguilar, L. (2019). Learning prosody in a video game-based learning approach. Multimodal Technologies and Interaction, 3(3), 51.
Aguilar, L., de-la Mota, C., & Prieto, P. (2021, May 24) Guía multimedia de la prosodia del español. Retrieved from http://www.prado.uab.es
Albertini, G., Bonassi, S., Dall’Armi, V., Giachetti, I., Giaquinto, S., & Mignano, M. (2010). Spectral analysis of the voice in Down syndrome. Research in Developmental Disabilities, 31(5), 995–1001.
Baur, C., Rayner, E., & Tsourakis, N. (2014). Using a serious game to collect a child learner speech corpus. In: Ninth international conference on language resources and evaluation (LREC)
Becker, J. T., Boiler, F., Lopez, O. L., Saxton, J., & McGonigle, K. L. (1994). The natural history of Alzheimer’s disease: description of study cohort and accuracy of diagnosis. Archives of Neurology, 51(6), 585–594.
Boersma, P. (2021, May 24) Praat: Doing phonetics by computer. Retrieved from http://www.praat.org/
Brooks, G. (2013). The prerequisites for successful teaching and learning of literacy. European Journal of Education, 48(4), 557–569.
Brown-Sweeney, S. G., & Smith, B. L. (1997). The development of speech production abilities in children with Down syndrome. Clinical Linguistics & Phonetics, 11(5), 345–362.
Bunton, K., & Leddy, M. (2011). An evaluation of articulatory working space area in vowel production of adults with Down syndrome. Clinical Linguistics & Phonetics, 25(4), 321–334.
Chapman, R., & Hesketh, L. (2001). Language, cognition, and short-term memory in individuals with Down syndrome. Down Syndrome Research and Practice, 7(1), 1–7.
Cleland, J., Wood, S., Hardcastle, W., Wishart, J., & Timmins, C. (2010). Relationship between speech, oromotor, language and cognitive abilities in children with Down’s syndrome. International Journal of Language & Communication Disorders, 45(1), 83–95.
Cole, J. (2015). Prosody in context: A review. Language, Cognition and Neuroscience, 30(1–2), 1–31.
Corral, S., Arribas, D., Santamaria, P., Sueiro, M., & Perelia, J. (2005). Escala de inteligencia de Wechsler para nifios-IV. TEA Ediciones.
Corrales-Astorgano, M., Escudero-Mancebo, D., & Gonzalez-Ferreras, C. (2016) Acoustic analysis of anomalous use of prosodic features in a corpus of people with intellectual disability. In: Advances in Speech and Language Technologies for Iberian Languages: Third International Conference IberSPEECH (pp. 151–161). Springer
Corrales-Astorgano, M., Escudero-Mancebo, D., & Gonzalez-Ferreras, C. (2018). Acoustic characterization and perceptual analysis of the relative importance of prosody in speech of people with down syndrome. Speech Communication, 99, 90–100.
Corrales-Astorgano, M., Martinez-Castilla, P., Escudero-Mancebo, D., Aguilar, L., Gonzalez-Ferreras, C., & Cardefioso-Payo, V. (2019). Automatic assessment of prosodic quality in down syndrome: Analysis of the impact of speaker heterogeneity. Applied Sciences, 9(7), 1440.
Dunn, L., Dunn, L., & Arribas, D. (2006). Test de vocabulario en imagenes peabody. TEA.
Eggers, K., & Van Eerdenbrugh, S. (2017). Speech disfluencies in children with Down Syndrome. Journal of Communication Disorders, 71, 72.
Estebas-Vilaplana, E., Gutierrez, Y. M., Vizcaino, F., Cabrera, M., & de Gran, C. P. (2015). Boundary tones in spanish declaratives: Modelling sustained pitch. ICPhS.
Eyben, F., Weninger, F., Gross, F., & Schuller, B. (2013). Recent developments in opensmile, the Munich open-source multimedia feature extractor. In: Proceedings of the 21st ACM international conference on Multimedia, ACM (pp. 835–838)
Eyben, F., Scherer, K. R., Schuller, B. W., Sundberg, J., Andre, E., Busso, C., Devillers, L. Y., Epps, J., Laukka, P., Narayanan, S. S., et al. (2016). The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing. IEEE Transactions on Affective Computing, 7(2), 190–202.
Fidler, D. J., & Nadel, L. (2007). Education and children with down syndrome: Neuroscience, development, and intervention. Mental Retardation and Developmental Disabilities Research Reviews, 13(3), 262–271.
Forbes, M. M., Fromm, D., & MacWhinney, B. (2012). Aphasiabank: A resource for clinicians. Seminars in Speech and Language, Thieme Medical Publishers, 33, 217–222.
Fougeron, C., Crevier-Buchman, L., Fredouille, C., Ghio, A., Meunier, C., ChevrieMuller, C., Bonastre, J. F., Simon, A.C., de Looze, C., Duez, D. et al. (2010). The DesPho-APaDy Project: Developing an Acoustic-phonetic Characterization of Dysarthric Speech in French. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)
Freitas, J., Calado, A., Braga, D., Silva, P., & Dias, M. (2010). Crowdsourcing platform for large-scale speech data collection. In: VI Jornadas en Tecnologia del Habla and II Iberian SLTech Workshop
Gemmeke, J., Ons, B., Tessema, N. M., Van de Loo, J., De Pauw, G., Daelemans, W., Huyghe, J., Derboven, J., Vuegen, L., Van Den Broeck, B., et al. (2013). Self-taught assistive vocal interfaces: An overview of the aladin project. Proceedings Interspeech, 2013, 2038–2043.
Gonzalez-Ferreras, C., Escudero-Mancebo, D., Corrales-Astorgano, M., Aguilar-Cuevas, L., & Flores-Lucas, V. (2017). Engaging adolescents with down syndrome in an educational video game. International Journal of Human Computer Interaction, 33(9), 693–712.
Gosztolya, G., Vincze, V., Toth, L., Pakaski, M., Kalman, J., & Hoffmann, I. (2019). Identifying mild cognitive impairment and mild Alzheimer’s disease based on spontaneous speech using ASR and linguistic features. Computer Speech & Language, 53, 181–197.
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., & Witten, I. H. (2009). The weka data mining software: an update. ACM SIGKDD Explorations Newsletter, 11, 10–18.
Halliday, M. A. (1970). Language structure and language function. New Horizons in Linguistics, 1, 140–165.
Hauptman, Y., Aloni-Lavi, R., Lapidot, I., Gurevich, T., Manor, Y., Naor, S., Diamant, N., & Opher, I. (2019). Identifying distinctive acoustic and spectral features in Parkinson’s disease. Proceedings of Interspeech, 2019, 2498–2502.
Kent, R. D., & Vorperian, H. K. (2013). Speech impairment in Down syndrome: A review. Journal of Speech Language and Hearing Research (online), 56(1), 178.
Khan, T., Lundgren, L. E., Anderson, D. G., Nowak, I., Dougherty, M., Verikas, A., Pavel, M., Jimison, H., Nowaczyk, S., & Aharonson, V. (2020). Assessing Parkinson’s disease severity using speech analysis in non-native speakers. Computer Speech & Language, 61, 101047.
Kim, H., Hasegawa-Johnson, M., Perlman, A., Gunderson, J., Huang, T. S., Watkin, K., & Frame, S. (2008). Dysarthric speech database for universal access research. Interspeech.
Kim, M. J., Wang, J., & Kim, H. (2016). Dysarthric speech recognition using kullbackleibler divergence-based hidden markov model (pp. 2671–2675). Interspeech.
Ladd, D. R. (2008). Intonational phonology. Cambridge University Press.
Lahiri, R., Kumar, M., Bishop, S., & Narayanan, S. (2020). Learning domain invariant representations for child-adult classification from speech. In: International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
Le, D., Licata, K., Persad, C., & Provost, E. M. (2016). Automatic assessment of speech intelligibility for individuals with aphasia. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(11), 2187–2199.
Lee, M. T., Thorpe, J., & Verhoeven, J. (2009). Intonation and phonation in young adults with Down syndrome. Journal of Voice, 23(1), 82–87.
Leech, G. N. (2016). Principles of pragmatics. Routledge.
Li, M., Tang, D., Zeng, J., Zhou, T., Zhu, H., Chen, B., & Zou, X. (2019). An automated assessment framework for atypical prosody and stereotyped idiosyncratic phrases related to autism spectrum disorder. Computer Speech & Language, 56, 80–94.
Lin, Y. S., Gau, S. S. F., & Lee, C. C. (2018). An interlocutor-modulated attentional lstm for differentiating between subgroups of autism spectrum disorder (pp. 2329–2333). Interspeech.
Loveall, S. J., Hawthorne, K., & Gaines, M. (2021). A meta-analysis of prosody in autism, williams syndrome, and down syndrome. Journal of Communication Disorders, 89, 106055.
Lyakso, E., Frolova, O, Kaliyev, A., Gorodnyi, V., Grigorev, A., & Matveev, Y. (2019). AD-Child. Ru: Speech Corpus for Russian Children with Atypical Development. In: International Conference on Speech and Computer (pp. 299–308). Springer
MacWhinney, B., Fromm, D., Forbes, M., & Holland, A. (2011). Aphasiabank: Methods for studying discourse. Aphasiology, 25(11), 1286–1307.
Martin, G. E., Klusek, J., Estigarribia, B., & Roberts, J. E. (2009). Language characteristics of individuals with Down syndrome. Topics in Language Disorders, 29(2), 112.
Martinez, M. H., Duran, X. P., & Navarro, J. N. (2011). Attention deficit disorder with or without hyperactivity or impulsivity in children with Down’s syndrome. International Medical Review on down Syndrome, 15(2), 18–22.
Martinez-Castilla, P., & Peppe, S. (2008). Developing a test of prosodic ability for speakers of iberian spanish. Speech Communication, 50(11–12), 900–915.
McGraw, I., Gruenstein, A., & Sutherland, A. (2009) A self-labeling speech corpus: Collecting spoken words with an online educational game. In: Tenth Annual Conference of the International Speech Communication Association
Menendez-Pidal, X., Polikoff, J.B., Peters, S.M., Leonzio, J.E., & Bunnell, H.T. (1996). The nemours database of dysarthric speech. In: Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP'96, IEEE (vol. 3, pp. 1962–1965)
Meunier, C., Fougeron, C., Fredouille, C., Bigi, B., Crevier-Buchman, L., Delais Roussarie, E., Georgeton, L., Ghio, A., Laaridh, I., Legou, T., et al. (2016). The typaloc corpus: A collection of various dysarthric speech recordings in read and spontaneous styles. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) (pp. 4658–4665)
Moura, C. P., Cunha, L. M., Vilarinho, H., Cunha, M. J., Freitas, D., Palha, M., Pueschel, S. M., & Pais-Clemente, M. (2008). Voice parameters in children with Down syndrome. Journal of Voice, 22(1), 34–42.
Nicolao, M., Christensen, H., Cunningham, S., Green, P., & Hain, T. (2016). A framework for collecting realistic recordings of dysarthric speech-the homeservice corpus. In: Proceedings of LREC 2016, European Language Resources Association
O’Leary, D., Lee, A., O’Toole, C., & Gibbon, F. (2020). Perceptual and acoustic evaluation of speech production in Down syndrome: A case series. Clinical Linguistics & Phonetics, 34(1–2), 72–91.
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, E. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830.
Raven, J. C., Court, J. H., & Raven, J. (1995). Matrices progresivas : escalas CPM (color), SPM (general) y APM (superior). TEA Ediciones, S.A.U.
Roch, M., Florit, E., & Levorato, C. (2015). Follow-up study on reading comprehension in down’s syndrome: the role of reading skills and listening comprehension. International Journal of Language & Communication Disorders, 46, 231.
Roch, M., Florit, E., & Levorato, M. C. (2012). The advantage of reading over listening text comprehension in down syndrome: What is the role of verbal memory? Research in Developmental Disabilities, 33(3), 890–899.
Rochet-Capellan, A., & Dohen, M. (2015). Acoustic characterisation of vowel production by young adults with Down syndrome. In: 18th International Congress of Phonetic Sciences (ICPhS 2015)
Rodger, R. (2009). Voice quality of children and young people with Down's Syndrome and its impact on listener judgement. PhD thesis, Queen Margaret University
Rudzicz, F., Namasivayam, A. K., & Wolff, T. (2012). The torgo database of acoustic and articulatory speech from speakers with dysarthria. Language Resources and Evaluation, 46(4), 523–541.
Saarni, C., Campos, J. J., Camras, L. A., & Witherington, D. (2007). Emotional development: Action, communication, and understanding. In W. Damon & R. M. Lerner (Eds.), Handbook of child psychology (p. 3). Wiley.
Sakar, B. E., Isenkul, M. E., Sakar, C. O., Sertbas, A., Gurgen, F., Delil, S., Apaydin, H., & Kursun, O. (2013). Collection and analysis of a parkinson speech dataset with multiple types of sound recordings. IEEE Journal of Biomedical and Health Informatics, 17(4), 828–834.
Satt, A., Hoory, R., Konig, A., Aalten, P., & Robert, P.H. (2014). Speech-based automatic and robust detection of very early dementia. In: Fifteenth Annual Conference of the International Speech Communication Association
Saz, O., Lleida, E., Vaquero, C., & Rodriguez, W. R. (2010). The alborada-i3a corpus of disordered speech. LREC.
Seifpanahi, S., Bakhtiar, M., & Salmalian, T. (2011). Objective vocal parameters in Farsi-speaking adults with Down syndrome. Folia Phoniatrica Et Logopaedica, 63(2), 72–76.
Shriberg, E. E. (1994) Preliminaries to a theory of speech disfluencies. PhD thesis, Citeseer
Stojanovik, V. (2011). Prosodic deficits in children with Down syndrome. Journal of Neurolinguistics, 24(2), 145–155.
Weiner, J., Frankenberg, C., Telaar, D., Wendelstein, B., Schroder, J., & Schultz, T. (2016) Towards automatic transcription of ilse—An interdisciplinary longitudinal study of adult development and aging. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), (pp. 718–725)
Wild, A., Vorperian, H. K., Kent, R. D., Bolt, D. M., & Austin, D. (2018). Single-word speech intelligibility in children and adults with Down syndrome. American Journal of Speech-Language Pathology, 27(1), 222–236.
Zampini, L., Fasolo, M., Spinelli, M., Zanchi, P., Suttora, C., & Salerni, N. (2016). Prosodic skills in children with Down syndrome and in typically developing children. International Journal of Language & Communication Disorders, 51(1), 74–83.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The activities of Down syndrome speech analysis continue (1/2018-12/2020) in the project funded by the Ministerio de Ciencia, Innovación y Universidades and the European Regional Development Fund FEDER (TIN2017-88858-C2-1-R) and in the project funded by the Junta de Castilla y León (VA050G18). Part of this work was funded by the BBVA Foundation (2015-2017) in the framework of the project PRADIA: Pragmatics and prosody: the graphic adventure game.
Rights and permissions
About this article
Cite this article
Escudero-Mancebo, D., Corrales-Astorgano, M., Cardeñoso-Payo, V. et al. PRAUTOCAL corpus: a corpus for the study of Down syndrome prosodic aspects. Lang Resources & Evaluation 56, 191–224 (2022). https://doi.org/10.1007/s10579-021-09542-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10579-021-09542-8