official Journal of AlNoor University
Volume & Issue: Volume 2, Issue 1, June 2025 
Information Technology and Cybersecurity

Integrating Deep Learning and Swarm Intelligence for Speech Recognition: A Review

Pages 1-9

noor mohammed yousif, ola zaid Abd Al-majid

Abstract Abstract
With an emphasis on deep learning and bio-inspired optimization techniques, this paper provides an extensive overview of current developments in voice and emotion detection systems. Advanced recurrent networks like GRU and SVNN, attention-based encoder-decoder frameworks, and hybrid CNN-LSTM architectures are just a few of the models examined in the examined papers. In order to increase robustness, feature extraction methods like MFCC, PLPC, LPCC, and log Mel-filter banks are frequently used in conjunction with data augmentation techniques including speed perturbation, noise injection, and pitch shifting. To enhance feature selection and classifier performance, a number of optimization methods are used, including Particle Swarm Optimization (PSO), Cat Swarm Optimization (CSO), Glowworm Swarm Optimization (GSO), and innovative hybrids like MUPW and GREO. The examined works show state-of-the-art accuracy in a variety of tasks, such as multimodal (audio-visual) recognition systems, Arabic dialect recognition, and emotional speech classification. According to experimental results, there are significant improvements in performance compared to standard models; in certain systems, accuracy rates can approach 99.76%. The increasing efficacy of combining deep learning with intelligent optimization is highlighted in this paper, which also makes recommendations for future developments including transducer-based architectures, real-time adaptation, and domain-specific data augmentation.

Information Technology and Cybersecurity

Advances in High-Performance Models for Natural Language Processing: A Review

Pages 11-17

Mohammed Nabeel Islam, Taher Ali Mahmood

Abstract This in-depth review looks at the most recent developments in high-performance models for Natural Language Processing (NLP), with a focus on transformer-based architectures and large language models (LLMs), which have changed the field. The rapid growth of model capabilities has changed the way machines understand, generate, and use human language, opening up new possibilities and problems in many areas. The review talks about important research trends, such as new ways to build transformer models, rules for scaling up performance, ways to make systems more efficient, how to make them work in more than one language, how to test them, how to think about ethics, how to protect them from attacks, how to explain them, and how to distill knowledge. Even though there has been a lot of progress, there are still big problems, such as needing a lot of computing power, ethical issues with bias and safety, not being able to understand things easily, and having trouble evaluating things. The review provides publications a structured overview of the current state of affairs, pointing out promising research directions and practical issues to think about when using high-performance NLP models. The results show how these technologies could change the world, but they also stress the need for responsible development that takes into account technical limitations and social effects.

Information Technology and Cybersecurity

Self-Supervised Learning for Speech Recognition: A Comprehensive Review

Pages 19-22

Ibrahim Adnan Alzinalabdin, fatin thair

Abstract Self-supervised learning (SSL) has emerged as a transformative approach in speech recognition, enabling models to leverage vast amounts of unlabelled data and reduce reliance on annotated datasets. This review systematically examines key SSL methodologies—contrastive learning, masked prediction, clustering techniques, and mutual information-based approaches—and evaluates their effectiveness in speech recognition tasks. Contrastive learning, exemplified by frameworks like SimCLR and MoCo, enhances feature robustness through data augmentation and negative sampling. Masked prediction, as demonstrated by Wav2Vec 2.0, excels at learning contextual relationships by reconstructing masked audio segments. Clustering methods improve generalization by grouping similar audio features, while mutual information-based techniques optimize representation quality. Despite their strengths, SSL methods face challenges such as implementation complexity, data quality dependence, and high computational demands. Future research directions include hybrid models combining SSL with supervised learning, multi-modal integration, and applications in low-resource languages and real-time systems. By addressing these challenges, SSL promises to advance speech recognition technologies, offering scalable and efficient solutions for diverse real-world applications.

Trends Challenges and Future Directions for Intelligent IoT and Deep Learning: A Review

Pages 23-29

omar h fathi

Abstract The Internet of Things (IoT) has become a transformative paradigm, enabling the use of connected, smart devices across diverse sectors like healthcare, transportation, and industry. With the rapid increase in IoT devices, an enormous volume of data is generated, presenting challenges in real-time processing, analysis, and decision-making. Traditional data handling methods often struggle to manage this complexity. Deep learning offers powerful capabilities for extracting insights from large, diverse datasets, making it a suitable solution for IoT data challenges. This review explores recent scientific progress in combining deep learning techniques with IoT frameworks. It highlights key applications, ranging from smart homes to industrial automation. The study also examines technical challenges such as limited computational resources, security issues, and deployment complexities. Various deep learning architectures adapted for IoT are analysed. The need for edge computing and lightweight models is emphasized. Future research opportunities in scalable and secure intelligent IoT systems are identified. Overall, this work provides a comprehensive overview of trends and innovations in intelligent IoT powered by deep learning.

Information Technology and Cybersecurity

A Comprehensive Review of Speech Emotion Recognition: Advances, Challenges, and Future Directions

Pages 31-36

Lubna Thanoon Alkahla, Maher Khalaf Hussein, Asmaa Alqassab, Dahiru Aliyu

Abstract Automated detection of human emotion from speech signals is a relatively new area in artificial intelligence aimed at determining the emotions people express through their speech. Traditionally, SER did feature extraction recognition with handcrafted ones and classical machine learning ones such as SVM (support vector machines) and HMM (hidden Markov models). The richness of emotions made these methodologies however challenging. The evolution of deep learning, in particular CNNs, RNNs, and other Transformer-based structures, has greatly improved the accuracy and robustness of SER systems. In this work, the SER is studied in depth taking into account the most relevant methods and feature extraction methods as well as an introduction of benchmark databases. It also includes augmentation methods, evaluation measures and the difficulties of real-time processing. Regardless of the advancements, SER continues to encounter challenges, including scarcity of datasets, imbalance between classes, domain adaptation, and high computational requirements. The review highlights unanswered questions regarding research and analyses. future directions, including multimodal fusion, self-supervised learning, and Explainable AI.