AI-BASED PRONUNCIATION TRAINING TOOLS FOR ENGLISH LANGUAGE LEARNERS: A SYSTEMATIC REVIEW
DOI:
https://doi.org/10.26418/jeep.v7i1.99864Keywords:
Systematic Review, Text-to-Speech (TTS), AI-based pronunciation Training, ELSA Speak, Automatic Speech Recognition (ASR), English Language Learners (ELLs).Abstract
Proficiency in English pronunciation is essential for effective communication, especially among English language learners (ELLs). This systematic review synthesizes findings from 22 studies published between 2016 and 2024 on the use of AI-based pronunciation training tools in English language education. The analysis shows growing interest in this field, with a notable rise in publications in 2022. AI tools, particularly ELSA Speak, have been shown to enhance pronunciation accuracy, learner confidence, and engagement through personalized feedback and interactive practice. Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) technologies also contribute to improved motivation and pronunciation development. However, gaps remain regarding the role of AI in teacher training and its impact on educators’ pronunciation skills. Most studies focus on students, highlighting the need for research on teachers’ experiences and effective classroom integration. Overall, this review provides evidence-based insights into the pedagogical use of AI for pronunciation learning. The results indicate that AI-based pronunciation tools improve accuracy, confidence, and engagement, but challenges such as technological barriers, feedback limitations, and insufficient educator-focused studies persist. Addressing these gaps is essential to optimizing AI’s role in language learning.References
Ahamed, S. K., & Lakshmi, K. D. (2023). From accents to accuracy: A literature review on utilizing digital tools for perfecting pronunciation. Journal of Research Administration, 5(2), 5506–5517. https://journlra.org/index.php/jra/article/view/685/494
Al-Jarf, R. (2022). Text-to-speech software for promoting EFL freshman students’ decoding skills and pronunciation accuracy. Journal of Computer Science and Technology Studies, 4(2), 19–30. https://doi.org/10.32996/jcsts
Al-Shallakh, M. A. I. (2023). Artificial intelligence-based mobile learning in English Language Teaching (ELT) for EFL learners: Enhancing pronunciation with ELSA SPEAK in Oman. Arab Humanities Journal, 4(3), 208–221. https://journals.mejsp.com/assets/uploads/journals-researches/1699854571_4213206970.pdf
Al-Shallakh, M. A. I. (2024). Embedding artificial intelligent applications in higher educational institutions to improve students’ pronunciation performance. Theory and Practice in Language Studies, 14(6), 1897–1906. https://doi.org/10.17507/tpls.1406.31
Amante-Nochefranca, G., Orbase-Sandal, O., Alieto, E. O., Laput, I. U., Albani, S. E., Lucas, R. I., & Tanpoco, M. (2024). AI-assisted English Language Learning and Teaching in a developing country: An investigation of ESl student’s beliefs and challenges. ICAISE 2023. Lecture Notes in Networks and Systems, 837(3), 281–289. https://doi.org/10.1007/978-3-031-48465-0_37
Amin, E. A. R. (2024). EFL students’ perception of using AI text-to-speech apps in learning pronunciation. Migration Letters, 21(3), 887–903. https://migrationletters.com/index.php/ml/article/view/6905
Amrate, M., & Tsai, P. H. (2024). Computer-assisted pronunciation training: A systematic review. ReCALL, 37, 22–42. https://doi.org/10.1017/S0958344024000181
Anggraini. (2022). Improving students’ pronunciation skill using Elsa Speak application. Journey: Journal of English Language and Pedagogy, 5(1), 135–141. https://doi.org/10.33503/journey.v5i1.1840
Assalamah, T. M., Hilza, I. F., & Wafiroh, L. H. (2024). Students’ attitudes toward the use of Edtech as a powerful pronunciation learning tool in EFL. International Journal of Education, Language, Literature, Arts, Culture, and Social Humanities, 2(3), 161–177. https://doi.org/10.59024/ijellacush.v2i3.973
Bashori, M., van Hout, R., Strik, H., & Cucchiarini, C. (2022). ‘Look, I can speak correctly’: Learning vocabulary and pronunciation through websites equipped with automatic speech recognition technology. Computer Assisted Language Learning, 37(5–6), 1335–1363. https://doi.org/10.1080/09588221.2022.2080230
Bashori, M., van Hout, R., Strik, H., & Cucchiarini, C. (2024). I Can Speak: Improving English pronunciation through automatic speech recognition-based language learning systems. Innovation in Language Learning and Teaching, 2, 1–19. https://doi.org/10.1080/17501229.2024.2315101
Chung, B., & Bong, H. K. M. (2022). A study on the intelligibility of Korean-accented English: Possibilities of implementing AI applications in English education. Journal of Asia TEFL, 19(1), 197–215. https://doi.org/10.18823/asiatefl.2022.19.1.12.197
Dennis, N. K. (2024). Using AI-powered speech recognition technology to improve English pronunciation and speaking skills. IAFOR Journal of Education, 12(2), 107–126. https://doi.org/10.22492/ije.12.2.05
Derwing, T. M., & Munro, M. J. (2015). Pronunciation fundamentals: Evidence-based perspectives for L2 teaching and research (N. Spada & N. Van Deusen-Scholl (eds.)). John Benjamins Publishing Company. https://doi.org/10.1075/lllt.42
Dida, H. A., Chakravarthy, D., & Rabbi, F. (2023). ChatGPT and big data: Enhancing text-to-speech conversion. Mesopotamian Journal of Big Data, 2023, 31–35. https://doi.org/10.58496/mjbd/2023/005
Eksi, G. Y., & Yesilcinar, S. (2016). An investigation of the effectiveness of online text-to-speech tools in improving EFL teacher trainees’ pronunciation. English Language Teaching, 9(2), 205. https://doi.org/10.5539/elt.v9n2p205
Elnagar, B. M. A. (2020). An investigation of instructors’ approaches in teaching pronunciation: A case study. English Language Teaching, 13(8), 185. https://doi.org/10.5539/elt.v13n8p185
Fitria, T. N. (2022). Utilizing text-to-speech technology: Natural Reader in teaching pronunciation. JETLEE : Journal of English Language Teaching, Linguistics, and Literature, 2(2), 70–78. https://doi.org/10.47766/jetlee.v2i2.312
Gesa, F., Frigole, N., & Suárez, M.-D.-M. (2024). Pronunciation development in the EFL classroom: The case of flowchase. Universitat de Barcelona, 5, 746–764. https://www.researchgate.net/publication/380827447
Kantt, B., & Asher, A. (2024). LangTech in the AI era (Issue 3). Independently published. https://r.search.yahoo.com/_ylt=Awr1QGw_5AlnUQIA3K_LQwx.;_ylu=Y29sbwNzZzMEcG9zAzMEdnRpZAMEc2VjA3Ny/RV=2/RE=1729911103/RO=10/RU=https%3A%2F%2F7690351.fs1.hubspotusercontent-na1.net%2Fhubfs%2F7690351%2FLead%2520Magnets%2FWhitepaper%2520-%2520LangTech%2520in
Kholis, A. (2021). Elsa Speak app: Automatic speech recognition (ASR) for supplementing English pronunciation skills. Pedagogy : Journal of English Language Teaching, 9(1), 1–14. https://doi.org/10.32332/joelt.v9i1.2723
Lee, Y. J., & Davis, R. O. (2024). A case study of implementing generative AI in university’s general English courses. Contemporary Educational Technology, 16(4), 1–8. https://doi.org/10.30935/cedtech/15218%0AARTICLE
Levis, J. M. (2018). Intelligibility, oral communication, and the teaching of pronunciation. In Cambridge University Press. Cambridge University Press. https://doi.org/10.1017/9781108241564
Liu, X., Wu, D., Ye, Y., Xu, M., Jiao, J., & Lin, W. (2020). Improving accuracy in imitating and reading aloud via speech visualization technology. International Journal of Emerging Technologies in Learning, 15(8), 144–160. https://doi.org/10.3991/IJET.V15I08.11475
Meisarah, F. (2020). Mobile-assisted pronunciation training: The Google Play pronunciation and phonetics application. Script Journal: Journal of Linguistics and English Teaching, 5(2), 70–88. https://doi.org/10.24903/sj.v5i2.487
Ngoc, N. K., & Thanh, N. T. M. (2024). Tertiary students’ perceptions on Elsa Speak application for pronunciation learning. European Journal of Applied Linguistics Studies, 7(1), 1–13. https://doi.org/10.46827/ejals.v7i1.483
Sariani, Miladiyenti, F., Rozi, F., Haslina, W., & Marzuki, D. (2022). Incorporating mobile-based artificial intelligence to English pronunciation learning in tertiary-level students: Developing autonomous learning. International Journal of Advanced Science Computing and Engineering, 4(3), 220–232. https://doi.org/10.62527/ijasce.4.3.92
Shehata, M. G. M. (2024). A program based on artificial intelligence to enhance prospective teachers’ English pronunciation. CDELT Occasional Papers in the Development of English Education, 86(4), 145–179. https://doi.org/10.21608/opde.2024.362818
Sholekhah, M. F., & Fakhrurriana, R. (2023). The use of ELSA Speak as a mobile-assisted language learning (MALL) towards EFL students pronunciation. JELITA: Journal of Education, Language Innovation, and Applied Linguistics, 2(2), 93–100. https://doi.org/10.37058/jelita.v2i2.7596
Sun, W. (2023). The impact of automatic speech recognition technology on second language pronunciation and speaking skills of EFL learners: A mixed methods investigation. Frontiers in Psychology, 14(8), 1–14. https://doi.org/10.3389/fpsyg.2023.1210187
Thi-Nhu Ngo, T., Hao-Jan Chen, H., & Kuo-Wei Lai, K. (2024). The effectiveness of automatic speech recognition in ESL/EFL pronunciation: A meta-analysis. ReCALL, 36(1), 4–21. https://doi.org/10.1017/s0958344023000113
Vančová, H. (2023). AI and AI-powered tools for pronunciation training. Journal of Language and Cultural Education, 11(3), 12–24. https://doi.org/10.2478/jolace-2023-0022
Woo, J. H., & Choi, H. (2021). Systematic review for AI-based language learning tools. Journal of Digital Contents Society, 22(11), 1783–1792. https://doi.org/10.9728/dcs.2021.22.11.1783
Xue, Y. (2023). A study on the influence of mother tongue transfer on English pronunciation of primary students. Creative Education, 14(11), 2123–2130. https://doi.org/10.4236/ce.2023.1411135
Zakiyyah, F., Setyaji, A., & Ardini, S. N. (2022). The analysis of pronunciation application based on the concept of artificial intelligence. UNCLLE (Undergraduate Conference on Language, Literature, and Culture), 2(1), 559–569. https://publikasi.dinus.ac.id/index.php/unclle/article/view/6025
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Journal of English Education Program

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright aims to protect the specific way the article has been written to describe an experiment and the results. Journal of English Education Program (JEEP) is committed to its authors to protect and defend their work and their reputation and takes allegations of infringement, plagiarism, ethical disputes, and fraud very seriously. Under the terms of the Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), author(s) retain copyright and grant the journal the right of first publication (online and print) with the work simultaneously. We use the ShareAlike (SA) component in addition to the Attribution (BY) and NonCommercial (NC) components, as follows:
BY (attribution): Users are allowed to share, distribute, and redistribute the published article in any medium or format, with identification of the authors and its initial publication in this journal. Authors are encouraged to post and distribute their articles immediately after publication (e.g., institutional or public repositories, personal websites). Authors are allowed to enter into additional contractual arrangements for the non-exclusive distribution of the published article and an acknowledgment of its initial publication in this journal.
NC (non-commercial): Users are not allowed to use the article commercially without the permission of the authors. Authors agree explicitly that the published article is indexed worldwide in databases, repositories, and indexation services, even if these services operate on a commercial basis. Authors grant Journal of English Education Program (JEEP) explicit the right to include the published articles in databases, repositories, and indexation services.
SA (ShareAlike): If user(s) remix, transform, or build upon the material, they must distribute their contributions under the same license as the original (CC BY-NC-SA 4.0).