Bulletin of L.N. Gumilyov Eurasian National University. Mathematics, computer science, mechanics series https://bulmathmc.enu.kz/index.php/main <p><strong>Bulletin of L.N. Gumilyov Eurasian National University.</strong> <strong>Mathematics, computer science, mechanics series</strong></p> <p><strong>Editor-in-Chief:</strong> Temirgaliyev Nurlan, Doctor of Physical and Mathematical Sciences, Professor, Director of the Institute of Theoretical Mathematics and Scientific Computations of L.N. Gumilyov Eurasian National University, Astana, Kazakhstan</p> <p><strong>Certificate of registration of mass media:</strong> № KZ65VPY00031936 dated 02.02.2021</p> <p><strong>ISSN</strong> <a href="https://portal.issn.org/api/search?search[]=MUST=allissnbis=%223007-0155%22&amp;search_id=37191800" target="_blank" rel="noopener">3007-0155</a> <strong>eISSN</strong> <a href="https://portal.issn.org/api/search?search[]=MUST=allissnbis=%223007-0155%22&amp;search_id=37191800" target="_blank" rel="noopener">3007-0163</a></p> <p><strong>DOI of the journal:</strong> <a href="https://bulmathmc.enu.kz/index.php/main/index" target="_blank" rel="noopener">10.32523/2616-7182</a></p> <p><strong>Frequency</strong> – 4 times a year.</p> <p><strong>Languages:</strong> Kazakh, English, Russian</p> <p><strong>Review:</strong> Double Blindness</p> <p><strong>Percentage of rejected articles:</strong> 65%</p> <p><strong>Founder and publisher:</strong> <a href="https://enu.kz/en">NJSC "L.N. Gumilyov Eurasian National University"</a>, Astana, Republic of Kazakhstan</p> en-US vest_math@enu.kz (Жубанышева А.Ж.) vest_math@enu.kz (Жубанышева А.Ж.) Sun, 30 Mar 2025 00:00:00 +0000 OJS 3.3.0.9 http://blogs.law.harvard.edu/tech/rss 60 Identification of the spoken language using the Wav2Vec2 model for the Kazakh language https://bulmathmc.enu.kz/index.php/main/article/view/280 <p>This study presents the development and fine-tuning of an oral language identification model using the XLSR (Cross-Lingual Speech Recognition) Wav2Vec2 variant. Trained on a rich and diverse dataset spanning six languages, with a particular focus on low-resource languages such as Kazakh, the model demonstrates remarkable capabilities in multilingual speech recognition. Thanks to extensive evaluation, the finely tuned model not only surpasses existing benchmarks, but also surpasses other modern models, including Whisper variants. Having achieved an impressive F1 score of 92.9% and an accuracy of 93%, the model demonstrates its performance in real multilingual and low-resource scenarios. This work makes a significant contribution to the development of speech recognition technologies by providing a reliable solution for language identification in various language environments, especially in underrepresented language settings. Its success highlights the potential of Wav2Vec2-based models in improving speech processing systems in low-resource multilingual contexts. The results of this analysis can contribute to the development of reliable and effective automatic speech recognition systems optimized for the Kazakh language. Such technologies will find applications in various fields, including speech-to-text conversion, voice assistants and voice communication tools.</p> Zh. Kozhirbayev , S. Umbet Copyright (c) 2025 Bulletin of L.N. Gumilyov Eurasian National University. Mathematics, computer science, mechanics series https://bulmathmc.enu.kz/index.php/main/article/view/280 Sun, 30 Mar 2025 00:00:00 +0000