当前位置: X-MOL 学术Technol. Cult. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Vocal Features: From Voice Identification to Speech Recognition by Machine
Technology and Culture ( IF 0.8 ) Pub Date : 2019-01-01 , DOI: 10.1353/tech.2019.0066
Xiaochang Li , Mara Mills

ABSTRACT:This article considers machine methods used in the collection, processing, and application of vocal recordings for speaker identification and speech recognition between 1908 and 1970. The first phonographic archives featured collections of "vocal portraits" that prompted international investigations into the essential features of human voices for individual identification. Visual records of speech later found the same applications, but as "voiceprint identification" via sound spectrography began to achieve legal and commercial success in the 1960s, the procedure attracted more widespread scientific attention, which ultimately discredited both its accuracy and its rationale. At the same time, spectrogram collections spurred a new application—speech recognition by machine. The changing status of the speech spectrogram, from a record of unique features of individual voices to a model of fundamental invariants in speech sounds, was rooted in the demands of automated processing and a corresponding shift from the sound archive to the acoustic database.

中文翻译:

人声特征:从语音识别到机器语音识别

摘要:本文考虑了 1908 年至 1970 年间用于语音记录的收集、处理和应用以进行说话人识别和语音识别的机器方法。用于个人识别的人声。语音的视觉记录后来发现了相同的应用,但随着声谱学的“声纹识别”在 1960 年代开始在法律和商业上取得成功,该程序引起了更广泛的科学关注,最终使其准确性和基本原理都受到了质疑。与此同时,频谱图集合催生了一种新的应用——机器语音识别。语音频谱图的变化状态,
更新日期:2019-01-01
down
wechat
bug