DefinePK

DefinePK hosts the largest index of Pakistani journals, research articles, news headlines, and videos. It also offers chapter-level book search.

EFFECTIVE SPEECH EMOTION RECOGNITION USING R-CNN & BLSTM


Article Information

Title: EFFECTIVE SPEECH EMOTION RECOGNITION USING R-CNN & BLSTM

Authors: Muhammad Hassan Askari, Adeel Shahzad, Ahmed Faraz, Muhammad Fuzail, Naeem Aslam, Mohsin Ali Tariq

Journal: Kashf Journal of Multidisciplinary Research (KJMR)

HEC Recognition History
Category From To
Y 2024-10-01 2025-12-31

Publisher: Kashf Institute of Development & Studies

Country: Pakistan

Year: 2025

Volume: 2

Issue: 6

Language: en

DOI: 10.71146/kjmr514

Categories

Abstract

Speech Emotion Recognition (SER) is gaining significant attention in the field of human-computer interaction (HCI) over past decade. Specially in the fields like health, security, communication, and entertainment. But due to the lack of research on how to boost the speech processing efficiency, the current emotion recognition systems need improvement and more accuracy. To enhance the accuracy, we proposed an Effective Speech Emotion Recognition System (ESERS) which is a hybrid approach that uses Autoencoders (AEs) for denoising and robust feature extraction with a Self-Attentional Convolutional Neural Network–Bidirectional Long Short-Term Memory (CNN-BLSTM) architecture for effective temporal and contextual modeling. Using CREMA Dataset, we achieved Weighted Accuracy (WA) improved from 73.9% to 81.6% and Unweighted Accuracy (UA) increased from 68.5% to 82.8%. which shows absolute improvement of 7.7% and 14.3%, and relative improvements of 10.4% and 20.9% respectively. Hence, to enhance system efficiency, the hybrid approach outperforms traditional approaches currently in use.


Paper summary is not available for this article yet.

Loading PDF...

Loading Statistics...