Guaranteed Significance Level Criterion in Automatic Speech Signal Segmentation

Savchenko, V. V.; Savchenko, A. V.

doi:10.1134/S1064226920110157

Guaranteed Significance Level Criterion in Automatic Speech Signal Segmentation

THEORY AND METHODS OF SIGNAL PROCESSING
Published: 26 November 2020

Volume 65, pages 1311–1317, (2020)
Cite this article

Journal of Communications Technology and Electronics Aims and scope Submit manuscript

V. V. Savchenko¹ &
A. V. Savchenko²

61 Accesses
8 Citations
Explore all metrics

Abstract—

The article considers the problem of automatic segmentation of a speech signal into phonetic units in conditions of their a priori uncertain spectral composition and correlation properties. A guaranteed significance level criterion is developed based on the information–theoretic approach. An example of practical application of this criterion is considered; a full-scale experiment is set up and conducted. It is shown that the proposed criterion can guarantee a stable significance level when processing speech frames of short duration.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Segmentation Algorithm Using Temporal Features and Group Delay for Speech Signals

Speech Signal Segmentation into Vocalized and Unvocalized Segments on the Basis of Simultaneous Masking

Article 01 July 2018

Criterion for Minimum of Mean Information Deviation for Distinguishing Random Signals with Similar Characteristics

Article 01 September 2018

Notes

A standard speech frame corresponds to the standard pitch period value (10 ms).
See https://sites.google.com/site/frompldcreators/produkty-1/phonemetraining.
See http://www.internationalphoneticalphabet.org/

REFERENCES

P. Makhach and R. Skarnitzl, Principles of Phonetic Segmentation (Epocha Publ. House, Praha, 2013). https://www.researchgate.net/publication/234052076
E. Pakoci, B. Popovic, N. Jakovljevic, et al., Lect. Notes Comp. Sci. 9811, 67 (2016).
Article Google Scholar
M. B. Popov, Uchen. Zap. Kazan. Univ., Ser.: Gum. Nauki 159, 1144 (2017).
Google Scholar
L. R. Rabiner and R. W. Shafer, Theory and Applications of Digital Speech Processing (Pearson, Boston, 2010).
Google Scholar
V. V. Savchenko, J. Commun. Technol. Electron. 64, 590 (2019).
Article Google Scholar
V. V. Savchenko, J. Commun. Technol. Electron. 63, 53 (2018).
Article Google Scholar
V. S. Vykhovanets and D. Tszyan’min, Rechevye Tekhnol., No. 1, 45 (2016).
V. V. Savchenko and A. V. Savchenko, “Software complex of voice hidden control of personal computer for home and office,” Certificate No. 2013615628 (Rospatent, Moscow, 2013).
Google Scholar
V. V. Savchenko, Meas. Tech. 62, 282 (2019).
Article Google Scholar
V. V. Savchenko, Radiophys. Quantum Electron. 58, 373 (2015).
Article Google Scholar
V. V. Savchenko, Elektrosvyaz’, No. 12, 22 (2017).
A. F. Shishkina, Teoriya, Praktika, Innovatsii, No. 4, 18 (2016).
Google Scholar
N. Benati and H. Bahi, in Proc. 7th Int. Conf. Sci. of Electronics, Technologies of Information and Telecommun., Hammamet, 18–20 Dec., 2017 (IEEE, New York, 2017), p. 267.
V. V. Savchenko, Inform. Sist. & Tekhnol., No. 2, 12 (2014).
A. E. Sakran, S. M. Abdou, S. E. Hamid, and M. Rashwan, https://www.researchgate.net/publication/317339722
H. Kamper, A. Jansen, and S. Goldwater, Comput. Speech Lang. 46, 154 (2017).
Article Google Scholar
A. Yu. Yakimuk and A. A. Konev, Inf. & Sist. Upr., No. 2, 108 (2018).
V. V. Savchenko, Radioelectron. & Commun. 61, 419 (2018).
D. Yu. Akat’ev and V. V. Savchenko, Avtometriya 41 (2), 68 (2005).
A. V. Savchenko, Lecture Notes in Artificial Intell. 10314, 264 (2017).
V. V. Savchenko, Radiophys. Quantum Electron. 60, 89 (2017).
A. A. Borovkov, Mathematical Statistics (Lan’, St.-Petersburg, 2010).
S. L. Marple, Jr. Digital Spectral Analysis: with Applications (Prentice-Hall, Englewood Cliffs, N. J., 1987; Mir, Moscow, 1990).
S. Kullback, Information Theory and Statistics (Dover, New York, 1997).
R. M. Gray, A. Buzo, A. H. Gray, and Y. Matsuyama, IEEE Trans. Acoust., Speech, Signal Process. 28, 367 (1980).
V. V. Savchenko, J. Commun. Technol. Electron. 42, 393 (1997).
V. V. Savchenko, Nauch. Vedom. Belgorod. Gos. Univ., Ser.: Istor., Politolog., Ekonom., Inf. 33/1, 74 (2015).
A, L. Ronzhin and K. V. Evgrafova, Izv. Vyssh. Uchebn. Zaved., Gum. Nauki 2, 227 (2011).

Download references

Funding

This study was carried out within the Basic Research Program of the National Research University Higher School of Economics (HSE).

Author information

Authors and Affiliations

Editorial Board of the Journal Radiotekhnika i Elektronika, 125009, Moscow, Russia
V. V. Savchenko
HSE University, Laboratory of Algorithms and Technologies for Networks Analysis, 603155, Nizhny Novgorod, Russia
A. V. Savchenko

Authors

V. V. Savchenko
View author publications
You can also search for this author in PubMed Google Scholar
A. V. Savchenko
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to V. V. Savchenko or A. V. Savchenko.

Additional information

Translated by A. Ovchinnikova

Rights and permissions

Reprints and permissions

About this article

Cite this article

Savchenko, V.V., Savchenko, A.V. Guaranteed Significance Level Criterion in Automatic Speech Signal Segmentation. J. Commun. Technol. Electron. 65, 1311–1317 (2020). https://doi.org/10.1134/S1064226920110157

Download citation

Received: 14 February 2019
Revised: 07 February 2020
Accepted: 20 April 2020
Published: 26 November 2020
Issue Date: November 2020
DOI: https://doi.org/10.1134/S1064226920110157

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions