当前位置: X-MOL 学术Speech Commun. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Voks: Digital instruments for chironomic control of voice samples
Speech Communication ( IF 2.4 ) Pub Date : 2020-11-02 , DOI: 10.1016/j.specom.2020.10.002
Grégoire Locqueville , Christophe d’Alessandro , Samuel Delalez , Boris Doval , Xiao Xiao

This paper presents Voks, a new family of digital instruments that allow for real-time control and modification of pre-recorded voice signal samples. An instrument based on Voks is made of Voks itself, the synthesis software and a given set of chironomic (hand-driven) interfaces. Rhythm can be accurately controlled thanks to a new methodology, based on syllabic control points. Timing can also be controlled with other methods, including scrubbing and playback speed variation. Pitch, vocal effort, voice tension, apparent vocal tract size, voicing ratio, aperiodicity ratio of the voice samples can be modified thanks to a real-time high-quality vocoder. Different forms of chironomic control of the vocal parameters are proposed. Pitch is controlled by continuous hand motions using a stylus on a surface (C-Voks) or a theremin (T-Voks). Other interfaces can be used as well. Syllabic rhythm is controlled using a biphasic button. Scrubbing, playback speed and timbre related parameters can be controlled using the theremin, control surfaces or continuous controllers like faders. In addition to realistic imitation of speaking or singing voices, other playing modes yield new interesting sounds. Voks participated in comparative perceptual evaluation of singing synthesis systems. It has been demonstrated in a live musical settings, using different control interfaces. In addition to musical or poetic performances, applications of performative vocal synthesis to language learning and speech reeducation are foreseen.



中文翻译:

Voks:用于语音样本的色觉控制的数字仪器

本文介绍了Voks,这是一种新的数字仪器系列,可以实时控制和修改预先记录的语音信号样本。基于Voks的仪器由Voks本身,合成软件和一组给定的手性(手动)接口组成。基于音节控制点的新方法可以精确控制节奏。也可以使用其他方法来控制时间,包括擦洗和回放速度变化。借助实时高质量的声码器,可以修改语音样本的音高,发声量,语音张力,表观声道大小,发声率,非周期性比率。提出了声音参数的手性控制的不同形式。通过使用表面上的手写笔(C-Voks)或Theremin(T-Voks)连续的手势来控制音高。也可以使用其他接口。使用双相按钮控制音节节律。可以使用Theremin,控制表面或连续控制器(例如推子)来控制搓洗,播放速度和音色相关的参数。除了逼真的模仿说话或唱歌的声音外,其他演奏模式还可以产生新的有趣声音。Voks参与了歌唱合成系统的比较感知评估。它已在现场音乐环境中使用不同的控制界面进行了演示。除了音乐或诗歌表演之外,还预见到表演性声音合成在语言学习和语音再教育中的应用。控制表面或连续控制器(例如推子)。除了逼真的模仿说话或唱歌的声音外,其他演奏模式还可以产生新的有趣声音。Voks参与了歌唱合成系统的比较感知评估。它已在现场音乐环境中使用不同的控制界面进行了演示。除了音乐或诗歌表演之外,还预见到表演性声音合成在语言学习和语音再教育中的应用。控制表面或连续控制器(例如推子)。除了逼真的模仿说话或唱歌的声音外,其他演奏模式还可以产生新的有趣声音。Voks参与了歌唱合成系统的比较感知评估。它已在现场音乐环境中使用不同的控制界面进行了演示。除了音乐或诗歌表演之外,还预见到表演性声音合成在语言学习和语音再教育中的应用。

更新日期:2020-11-09
down
wechat
bug