Vocal Features: From Voice Identification to Speech Recognition by Machine

Xiaochang Li; Mara Mills

doi:10.1353/tech.2019.0066

Technology and Culture

Vocal Features: From Voice Identification to Speech Recognition by Machine
Xiaochang Li , Mara Mills
Technology and Culture
Johns Hopkins University Press
Volume 60, Number 2 Supplement, April 2019
pp. S129-S160
10.1353/tech.2019.0066
Article
- View Citation
- Related Content
Additional Information

Purchase/rental options available:
- Buy Issue for $25 at JHUP

Abstract

ABSTRACT:

This article considers machine methods used in the collection, processing, and application of vocal recordings for speaker identification and speech recognition between 1908 and 1970. The first phonographic archives featured collections of "vocal portraits" that prompted international investigations into the essential features of human voices for individual identification. Visual records of speech later found the same applications, but as "voiceprint identification" via sound spectrography began to achieve legal and commercial success in the 1960s, the procedure attracted more widespread scientific attention, which ultimately discredited both its accuracy and its rationale. At the same time, spectrogram collections spurred a new application—speech recognition by machine. The changing status of the speech spectrogram, from a record of unique features of individual voices to a model of fundamental invariants in speech sounds, was rooted in the demands of automated processing and a corresponding shift from the sound archive to the acoustic database.

collapse

You are not currently authenticated.

If you would like to authenticate using a different subscribed institution or have your own login and password to Project MUSE

Authenticate

Purchase/rental options available:
- Buy Issue for $25 at JHUP

Technology and Culture

Vocal Features: From Voice Identification to Speech Recognition by Machine

Share

Additional Information

Project MUSE Mission