Skip to main content
Log in

Guaranteed Significance Level Criterion in Automatic Speech Signal Segmentation

  • THEORY AND METHODS OF SIGNAL PROCESSING
  • Published:
Journal of Communications Technology and Electronics Aims and scope Submit manuscript

Abstract

The article considers the problem of automatic segmentation of a speech signal into phonetic units in conditions of their a priori uncertain spectral composition and correlation properties. A guaranteed significance level criterion is developed based on the information–theoretic approach. An example of practical application of this criterion is considered; a full-scale experiment is set up and conducted. It is shown that the proposed criterion can guarantee a stable significance level when processing speech frames of short duration.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4.

Similar content being viewed by others

Notes

  1. A standard speech frame corresponds to the standard pitch period value (10 ms).

  2. See https://sites.google.com/site/frompldcreators/produkty-1/phonemetraining.

  3. See http://www.internationalphoneticalphabet.org/

REFERENCES

  1. P. Makhach and R. Skarnitzl, Principles of Phonetic Segmentation (Epocha Publ. House, Praha, 2013). https://www.researchgate.net/publication/234052076

  2. E. Pakoci, B. Popovic, N. Jakovljevic, et al., Lect. Notes Comp. Sci. 9811, 67 (2016).

    Article  Google Scholar 

  3. M. B. Popov, Uchen. Zap. Kazan. Univ., Ser.: Gum. Nauki 159, 1144 (2017).

    Google Scholar 

  4. L. R. Rabiner and R. W. Shafer, Theory and Applications of Digital Speech Processing (Pearson, Boston, 2010).

    Google Scholar 

  5. V. V. Savchenko, J. Commun. Technol. Electron. 64, 590 (2019).

    Article  Google Scholar 

  6. V. V. Savchenko, J. Commun. Technol. Electron. 63, 53 (2018).

    Article  Google Scholar 

  7. V. S. Vykhovanets and D. Tszyan’min, Rechevye Tekhnol., No. 1, 45 (2016).

  8. V. V. Savchenko and A. V. Savchenko, “Software complex of voice hidden control of personal computer for home and office,” Certificate No. 2013615628 (Rospatent, Moscow, 2013).

    Google Scholar 

  9. V. V. Savchenko, Meas. Tech. 62, 282 (2019).

    Article  Google Scholar 

  10. V. V. Savchenko, Radiophys. Quantum Electron. 58, 373 (2015).

    Article  Google Scholar 

  11. V. V. Savchenko, Elektrosvyaz’, No. 12, 22 (2017).

  12. A. F. Shishkina, Teoriya, Praktika, Innovatsii, No. 4, 18 (2016).

    Google Scholar 

  13. N. Benati and H. Bahi, in Proc. 7th Int. Conf. Sci. of Electronics, Technologies of Information and Telecommun., Hammamet, 18–20 Dec., 2017 (IEEE, New York, 2017), p. 267.

  14. V. V. Savchenko, Inform. Sist. & Tekhnol., No. 2, 12 (2014).

  15. A. E. Sakran, S. M. Abdou, S. E. Hamid, and M. Rashwan, https://www.researchgate.net/publication/317339722

  16. H. Kamper, A. Jansen, and S. Goldwater, Comput. Speech Lang. 46, 154 (2017).

    Article  Google Scholar 

  17. A. Yu. Yakimuk and A. A. Konev, Inf. & Sist. Upr., No. 2, 108 (2018).

  18. V. V. Savchenko, Radioelectron. & Commun. 61, 419 (2018).

  19. D. Yu. Akat’ev and V. V. Savchenko, Avtometriya 41 (2), 68 (2005).

  20. A. V. Savchenko, Lecture Notes in Artificial Intell. 10314, 264 (2017).

  21. V. V. Savchenko, Radiophys. Quantum Electron. 60, 89 (2017).

  22. A. A. Borovkov, Mathematical Statistics (Lan’, St.-Petersburg, 2010).

  23. S. L. Marple, Jr. Digital Spectral Analysis: with Applications (Prentice-Hall, Englewood Cliffs, N. J., 1987; Mir, Moscow, 1990).

  24. S. Kullback, Information Theory and Statistics (Dover, New York, 1997).

  25. R. M. Gray, A. Buzo, A. H. Gray, and Y. Matsuyama, IEEE Trans. Acoust., Speech, Signal Process. 28, 367 (1980).

  26. V. V. Savchenko, J. Commun. Technol. Electron. 42, 393 (1997).

  27. V. V. Savchenko, Nauch. Vedom. Belgorod. Gos. Univ., Ser.: Istor., Politolog., Ekonom., Inf. 33/1, 74 (2015).

  28. A, L. Ronzhin and K. V. Evgrafova, Izv. Vyssh. Uchebn. Zaved., Gum. Nauki 2, 227 (2011).

Download references

Funding

This study was carried out within the Basic Research Program of the National Research University Higher School of Economics (HSE).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to V. V. Savchenko or A. V. Savchenko.

Additional information

Translated by A. Ovchinnikova

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Savchenko, V.V., Savchenko, A.V. Guaranteed Significance Level Criterion in Automatic Speech Signal Segmentation. J. Commun. Technol. Electron. 65, 1311–1317 (2020). https://doi.org/10.1134/S1064226920110157

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1134/S1064226920110157

Navigation