The DEFINITION OF INTERPHONEME TRANSITIONS OF UZBEK WORDS BY DISCRETE WAVELET TRANSFORMS
Keywords:
Discrete wavelet transform, signal segmentation, multi-scale analysis, speech recognitionAbstract
The article refers to the field of information processing theory on the example of evaluating the use of discrete wavelet transform for processing and recognition of speech signals. The difficulties of recognizing speech signals are described and the advantages of discrete wavelet transform over other traditional methods of signal processing, such as Fourier transform, are given. An algorithm based on a discrete wavelet transform is defined and described in detail. The speech signal is processed on the basis of a discrete wavelet transform. A detailed algorithm for finding the most informative level (scale), which is one of the basic parameters of the discrete wavelet transform, is presented. For the analysis, a speech signal with a sampling frequency of 22050 Hz was used, which was decomposed in scales from 3 to 6 and then using an algorithm to identify the informative level, the level (scale) that most closely describes the original signal was selected and determined. When determining the informative level, the type of the mother wavelet is important. In this article, we used the mother wavelet from the Daubechies family, namely the 4th order Daubechies. The second main parameter of the discrete wavelet transform - shift is determined based on practice (16-20ms) and is equal to 256 samples. Also described is a step-by-step algorithm in combination with a discrete wavelet transform, which allows segmenting a speech signal into phonemes, as a result of which it becomes possible to switch from an analog signal to text. Based on the proposed algorithms, the results of the process of segmentation of a speech signal into phonemic units are presented. Based on the results of the work done, a number of conclusions were identified that had not previously been identified due to the traditional Fourier transform used
References
Vishnyakova O.A., Lavrov D.N. Algorithm of phonemic segmentation based on the analysis of the rate of change in the energy of discrete wavelet transform // Bulletin of Omsk University. 2011. No. 4. P. 146–152.
Ermolenko T., Shevchuk V. Segmentation algorithms using fast wavelet transform // Articles accepted for publication on the website of the International conference Dialog'2003. URL: www.dialog-21.ru
Ramishvili G.S. Automatic speaker recognition by voice. M.:radio and communication, 1981. P. 224.
Makarov KS Construction and research of articulatory code books for solving speech inverse problems: dis. Cand. tech. nauk, IITP RAS, 2005
Leonov A. S., Sorokin V. N. Inverse problem for the vocal tract: identification of control forces from articulatory movements // Pattern Recognition and Image Analysis. 2000. Vol. 10. № 1. P. 110– 126.
Ganapathiraju A., Hamaker J., Picone J., Doddington G. R., Ordowski M. Syllable-Based Large Vocabulary Continuous Speech Recognition // IEEE Transactions on Speech and Audio Processing, 2001. Vol. 9. № 4. P. 358–366.
Sorokin V.N., Tsyplikhin A.I. Vowel segmentation and recognition // Information processes. 2004. Vol. 4. № 2. C. 202–220
Abdullaeva M.I., Segmentation of the speech signal into phonemic units, II International Scientific and Practical Conference Global And Regional Aspects Of Sustainable Development, February 8, 2021, Copenhagen , Denmark
Smirnov V.M., Filatov V.N., Application of wavelet transform in speech signal segmentation, Wave electronics and infocommunication systems, Collection of articles of the XXII International scientific conference: 2 parts. 2019, pp 174-179.
B. Grayden and M. S. Scordilis. Phonemic segmentation of fluent speech. Proc. of ICASSP, pages 73–76, 1994.
Makarov K.S. Construction and research of articulatory code books for solving speech inverse problems: Dissertation. for a job. step. Ph.D., IITP RAS, 2005.
Leonov A.S., Sorokin V. N. Inverse problem for the vocal tract: identification of control forces from articulatory movements // Pattern Recognition and Image Analysis. 2000. V.10, No. 1. P.110-126.
Sorokin V.K. Synthesis of speech. M.: Nauka, 1992.392 p.
Dremin I. M., Ivanov O. V., Nechitaylo V. A. Wavelets and their use //. Usage of wavelets and their use (in Russian) // Uspekhi physicheskikh nauki. 2001. T. 171. № 5. С. 465-500.
Zheltov P.V., Semenov V.I. Wavelet-transformation of acoustic signal. Kazan: KSTU named after A.I. Tupolev, 2008. 102 с