Spoken Content Retrieval: A Survey of Techniques and Technologies,Foundations and Trends in Information Retrieval

当前位置： X-MOL 学术 › Found. Trends Inf. Ret. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Spoken Content Retrieval: A Survey of Techniques and Technologies
Foundations and Trends in Information Retrieval ( IF 8.3 ) Pub Date : 2012-7-22 , DOI: 10.1561/1500000020
Martha Larson

Speech media, that is, digital audio and video containing spoken content, has blossomed in recent years. Large collections are accruing on the Internet as well as in private and enterprise settings. This growth has motivated extensive research on techniques and technologies that facilitate reliable indexing and retrieval. Spoken content retrieval (SCR) requires the combination of audio and speech processing technologies with methods from information retrieval (IR). SCR research initially investigated planned speech structured in document-like units, but has subsequently shifted focus to more informal spoken content produced spontaneously, outside of the studio and in conversational settings. This survey provides an overview of the field of SCR encompassing component technologies, the relationship of SCR to text IR and automatic speech recognition and user interaction issues. It is aimed at researchers with backgrounds in speech technology or IR who are seeking deeper insight on how these fields are integrated to support research and development, thus addressing the core challenges of SCR.

中文翻译：

口语内容检索：技术调查

近年来，语音媒体（即包含语音内容的数字音频和视频）蓬勃发展。互联网以及私人和企业环境中都在收集大量收藏品。这种增长激发了对促进可靠索引和检索的技术的广泛研究。语音内容检索（SCR）要求将音频和语音处理技术与信息检索（IR）中的方法结合起来。SCR研究最初调查以文件状单位组织的计划语音，但随后将重点转移到在工作室外和对话环境中自发制作的非正式语音内容。这项调查概述了SCR涵盖组件技术的领域，SCR与文本IR的关系以及自动语音识别和用户交互问题。它面向具有语音技术或IR背景的研究人员，他们正在寻求关于如何集成这些领域以支持研发的更深刻见解，从而解决SCR的核心挑战。

更新日期：2012-07-22

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南11