Abstract—
The article considers the problem of automatic segmentation of a speech signal into phonetic units in conditions of their a priori uncertain spectral composition and correlation properties. A guaranteed significance level criterion is developed based on the information–theoretic approach. An example of practical application of this criterion is considered; a full-scale experiment is set up and conducted. It is shown that the proposed criterion can guarantee a stable significance level when processing speech frames of short duration.
Similar content being viewed by others
Notes
A standard speech frame corresponds to the standard pitch period value (10 ms).
REFERENCES
P. Makhach and R. Skarnitzl, Principles of Phonetic Segmentation (Epocha Publ. House, Praha, 2013). https://www.researchgate.net/publication/234052076
E. Pakoci, B. Popovic, N. Jakovljevic, et al., Lect. Notes Comp. Sci. 9811, 67 (2016).
M. B. Popov, Uchen. Zap. Kazan. Univ., Ser.: Gum. Nauki 159, 1144 (2017).
L. R. Rabiner and R. W. Shafer, Theory and Applications of Digital Speech Processing (Pearson, Boston, 2010).
V. V. Savchenko, J. Commun. Technol. Electron. 64, 590 (2019).
V. V. Savchenko, J. Commun. Technol. Electron. 63, 53 (2018).
V. S. Vykhovanets and D. Tszyan’min, Rechevye Tekhnol., No. 1, 45 (2016).
V. V. Savchenko and A. V. Savchenko, “Software complex of voice hidden control of personal computer for home and office,” Certificate No. 2013615628 (Rospatent, Moscow, 2013).
V. V. Savchenko, Meas. Tech. 62, 282 (2019).
V. V. Savchenko, Radiophys. Quantum Electron. 58, 373 (2015).
V. V. Savchenko, Elektrosvyaz’, No. 12, 22 (2017).
A. F. Shishkina, Teoriya, Praktika, Innovatsii, No. 4, 18 (2016).
N. Benati and H. Bahi, in Proc. 7th Int. Conf. Sci. of Electronics, Technologies of Information and Telecommun., Hammamet, 18–20 Dec., 2017 (IEEE, New York, 2017), p. 267.
V. V. Savchenko, Inform. Sist. & Tekhnol., No. 2, 12 (2014).
A. E. Sakran, S. M. Abdou, S. E. Hamid, and M. Rashwan, https://www.researchgate.net/publication/317339722
H. Kamper, A. Jansen, and S. Goldwater, Comput. Speech Lang. 46, 154 (2017).
A. Yu. Yakimuk and A. A. Konev, Inf. & Sist. Upr., No. 2, 108 (2018).
V. V. Savchenko, Radioelectron. & Commun. 61, 419 (2018).
D. Yu. Akat’ev and V. V. Savchenko, Avtometriya 41 (2), 68 (2005).
A. V. Savchenko, Lecture Notes in Artificial Intell. 10314, 264 (2017).
V. V. Savchenko, Radiophys. Quantum Electron. 60, 89 (2017).
A. A. Borovkov, Mathematical Statistics (Lan’, St.-Petersburg, 2010).
S. L. Marple, Jr. Digital Spectral Analysis: with Applications (Prentice-Hall, Englewood Cliffs, N. J., 1987; Mir, Moscow, 1990).
S. Kullback, Information Theory and Statistics (Dover, New York, 1997).
R. M. Gray, A. Buzo, A. H. Gray, and Y. Matsuyama, IEEE Trans. Acoust., Speech, Signal Process. 28, 367 (1980).
V. V. Savchenko, J. Commun. Technol. Electron. 42, 393 (1997).
V. V. Savchenko, Nauch. Vedom. Belgorod. Gos. Univ., Ser.: Istor., Politolog., Ekonom., Inf. 33/1, 74 (2015).
A, L. Ronzhin and K. V. Evgrafova, Izv. Vyssh. Uchebn. Zaved., Gum. Nauki 2, 227 (2011).
Funding
This study was carried out within the Basic Research Program of the National Research University Higher School of Economics (HSE).
Author information
Authors and Affiliations
Corresponding authors
Additional information
Translated by A. Ovchinnikova
Rights and permissions
About this article
Cite this article
Savchenko, V.V., Savchenko, A.V. Guaranteed Significance Level Criterion in Automatic Speech Signal Segmentation. J. Commun. Technol. Electron. 65, 1311–1317 (2020). https://doi.org/10.1134/S1064226920110157
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S1064226920110157