NUTQ SIGNALIDAN SHOVQINLARNI KETMA-KET BARTARAF ETISH ALGORITMI

Niyozmatova, Nilufar A’loxanovna; Yuldashev, Yusuf Sheralievich; Jalelov, Quanish Moyatdinovich; Samijonov, Abdurashid Narzullo; Abdullaeva, Barno Mo‘ydinjon

TECHNICAL SCIENCES

SEQUENTIAL NOISE SUPPRESSION ALGORITHM FROM THE SPEECH SIGNAL

Niyozmatova Nilufar Alokhanovna Niyozmatova N.A. Yuldashev Yusuf Sheralievich Yuldashev Y.S. Jalelov Kuanish Moyatdinovich Jalelov K.M. Samijonov Abdurashid Narzullo ugli Samijonov A.N. Abdullaeva Barno Muydinjon kizi Abdullaeva B.M.

Abstract

The problem of denoising speech signals is a relevant task for speech recognition systems, mobile communication devices, and voice interfaces. This study proposes a new approach that combines Self - Organizing Map (SOM) and Spectral Subtraction methods. This approach effectively denoises speech by using the SOM neural network to detect noisy clusters and combining energy and frequency features. During the study, the Minimum Statistics Noise Estimation method was used for noise estimation, an adaptive feature selection strategy was applied, and experiments were conducted at various noise levels (from 1% to 25%). The results, evaluated using the PESQ metric, showed that the proposed approach outperforms traditional methods such as Wavelet and Spectral Subtraction. The advantage of this approach lies in the improved algorithm for detecting noisy clusters and the optimized post-processing stage aimed at preserving speech naturalness. The research results indicate that the combination of the SOM and Spectral Subtraction methods is an effective solution for denoising speech signals in noisy environments.

Keywords

Self - Organizing Map Spectral Subtraction Mel - Frequency Cepstral Coefficients Log - Mel spectrogram Perceptual Evaluation of Speech Quality.

Authors

Niyozmatova Nilufar Alokhanovna

Institute of Fundamental and Applied Research under the National Research University “Tashkent Institute of Irrigation and Agricultural Mechanization Engineers”

Yuldashev Yusuf Sheralievich

Tashkent University of Information Technologies named after Muhammad al-Khwarizmi

ORCID: 0000-0001-9708-2687

Jalelov Kuanish Moyatdinovich

National Research University “Tashkent Institute of Irrigation and Agricultural Mechanization Engineers”

Samijonov Abdurashid Narzullo ugli

Namangan State University

Abdullaeva Barno Muydinjon kizi

Namangan State University

How To Cite

Journal Style

Niyozmatova, N. A.; Yuldashev, Y. S.; Jalelov, K. M.; Samijonov, A. N. u.; Abdullaeva, B. M. k. SEQUENTIAL NOISE SUPPRESSION ALGORITHM FROM THE SPEECH SIGNAL. Innovatsion texnologiyalar, 2025, 58(2), 118-129. https://www.innotex-journal.uz/article.php?id=115&lang=uz

References

Mamatov, N., Niyozmatova, N., & Samijonov, A. (2021). Software for preprocessing voice signals. International Journal of Applied Science and Engineering, 18(1), 1 - 8. INNOVA TSI ON TEXNOLOGIYALAR INNOVATIVE TECHNOLOGIES ИННОВАЦИОННЫЕ ТЕХНОЛОГИИ 2025 - yil 1(58) - son 2025 volume 58, number 2 Том 58 No 2, 2025 ISSN 2181 - 4732 ISSN 2181 - 4732 129
Niyozmatova, N. N., Jalelov, N. K., Samijonov, N. B., & Madrahimova, N. M. (2024). Eliminating noise from a speech signal based on a pair of filters. International Journal of Science and Research Archive, 13(2), 401 – 410. https://doi.org/10.30574/ijsra.2024.13.2.2058
Boll, S.F. (1979). Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics, Speech, and Signal Processing, 27(2), 113 - 120.
Lim, J.S., & Oppenheim, A.V. (1979). Enhancement and bandwidth compression of noisy speech. Proceedings of the IEEE, 67(12), 1586 - 1604.
Ephraim, Y., & Malah, D. (1984). Speech enhancement using a minimum - mean square error short - time spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing, 32(6), 1109 - 1121.
Ramirez, J., Górriz, J.M., & Segura, J.C. (2007). Voice activity detection: fundamentals and speech recognition system robustness. In M. Grimm & K. Kroschel (Eds.), Robust Speech Recognition and Understanding (pp. 1 - 22). I - Tech Education and Publishing.
Martin, R. (2001). Noise power spectral density estimation based on optimal smoothing and minimum statistics. IEEE Transactions on Speech and Audio Processing, 9(5), 504 - 512.
Kohonen, T. (1982). Self - organized formation of topologically correct feature maps. Biological Cybernetics, 43(1), 59 - 69.
Zhang, X.L., & Wu, J. (2013). Deep belief networks based voice activity detection. IEEE Transactions on Audio, Speech, and Language Processing, 21(4), 697 - 710.
Ramírez, J., Segura, J.C., Benítez, C., De La Torre, A., & Rubio, A. (2004). Efficient voice activity detection algorithms using long - term speech information. Speech Communication, 42(3 - 4), 271 - 287.