Skip to main content
Log in

Convolutional and Deep Neural Networks based techniques for extracting the age-relevant features of the speaker

  • Original Research
  • Published:
Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Abstract

With the advent of conversational voice recognition systems such as Alexa, SIRI, OK Google, etc., natural language conversational scheme including Chatbot and voice recognition systems are in new high and determining the age of a speaker is critical for setting the pertinent context. Age can be inferred from the speech signal by inferring various factors such as physical attributes of voice, linguistic attributes, frequency, speech rate, etc., This paper discusses on extracting the spectral features of speech such as Cepstral Coefficients, Spectral Decrease, Centroid, Flatness, Spectral Entropy,Jitter and Shimmer as inputs which would also helps in classifying speaker age through deep learning techniques.A novel approach is addressed along with the model for implementation using Deep Neural Network and Convolutional Neural Network for classifying the features using three different classifiers.The results obtained from the proposed system would outline the performance in speaker age recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Source: robustness-related issues in speaker recognition)

Fig. 2
Fig. 3
Fig. 4

Source: deep learning-based distant-talking speech processing in real-world sound environments)

Fig. 5
Fig.6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Data availability

TIMID, Switch Board and CMU KIDS corpus.

References

Download references

Acknowledgment

I am grateful to all kinds of support provided by Prof. Dr. E. Chandra Eswaran for guiding me for my research work.

Funding

The research work is supported by RUSA 2.0- BEICH.

Author information

Authors and Affiliations

Authors

Contributions

Both the authors conceived of the presented idea, developed the theory and performed the computations and Dr. E. Chandra encouraged K. Karthika to investigate the research and supervised the findings of this work. All authors discussed the results and contributed to the final manuscript. This work has been submitted for Indian Intellectual property with Patent Application Number 201841032399.

Corresponding author

Correspondence to Karthika Kuppusamy.

Ethics declarations

Conflict of Interest

The authors declare that they have no competing interests.

Replication of results

No replicated results are presented.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kuppusamy, K., Eswaran, C. Convolutional and Deep Neural Networks based techniques for extracting the age-relevant features of the speaker. J Ambient Intell Human Comput 13, 5655–5667 (2022). https://doi.org/10.1007/s12652-021-03238-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12652-021-03238-1

Keywords

Navigation