当前位置: X-MOL 学术Speech Commun. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
POLEMAD–A database for the multimodal analysis of Polish pronunciation
Speech Communication ( IF 2.4 ) Pub Date : 2020-12-11 , DOI: 10.1016/j.specom.2020.12.005
Robert Wielgat , Rafał Jędryka , Anita Lorenc , Łukasz Mik , Daniel Król

The structure and functionality of the POLEMAD database constructed on the basis of a study using Electromagnetic Articulograph AG 500, an acoustic camera, and 3 video cameras are described in the paper. The article describes also data types stored in the database including speaker data, EMA data, video and sound recordings, phonetic information, and dynamic Bayesian network (DBN) models. The database allows for selective extraction of various types of samples for further analysis, which is performed by SQL queries generated in MATLAB® using Database Toolbox™. The possibilities of potential future application of the database in statistical analysis and automation of experiments on speech inversion using DBN are described in the paper as well.



中文翻译:

POLEMAD –用于波兰语发音多模态分析的数据库

本文描述了基于使用电磁关节AG 500,声学相机和3台摄像机的研究而构建的POLEMAD数据库的结构和功能。本文还介绍了存储在数据库中的数据类型,包括扬声器数据,EMA数据,视频和声音记录,语音信息以及动态贝叶斯网络(DBN)模型。该数据库允许有选择地提取各种类型的样本以进行进一步分析,这是通过使用Database Toolbox™在MATLAB®中生成的SQL查询执行的。本文还描述了数据库在统计分析和使用DBN进行语音倒置实验的自动化方面潜在的未来应用可能性。

更新日期:2021-01-06
down
wechat
bug