Speech emotion recognition (SER), the automated detection of human emotion from speech signals, is a relatively new area of artificial intelligence that aims to determine the emotions people express through their speech. Traditionally, SER relied on handcrafted features combined with classical machine learning models such as support vector machines (SVMs) and hidden Markov models (HMMs). The richness of emotional expression, however, posed a challenge for these methods. The evolution of deep learning, in particular CNNs, RNNs, and Transformer-based architectures, has greatly improved the accuracy and robustness of SER systems. This work reviews SER in depth, covering the most relevant recognition methods and feature extraction techniques and introducing the benchmark databases. It also covers augmentation methods, evaluation measures, and the difficulties of real-time processing. Despite these advances, SER continues to face challenges, including dataset scarcity, class imbalance, domain adaptation, and high computational requirements. The review highlights open research questions and analyzes future directions, including multimodal fusion, self-supervised learning, and explainable AI.
Alkahla, L. Thanoon, Hussein, M. Khalaf, Alqassab, A., and Aliyu, D. (2025). A Comprehensive Review of Speech Emotion Recognition: Advances, Challenges, and Future Directions. Al-Noor Journal for Information Technology and Cybersecurity, 2(1), 31-36. doi: 10.69513/jncs.v1.i1.a5