Adaptive Boosted Support Vector Machine-random Forest for Environmental Sound Classification

Article Information

Title: Adaptive Boosted Support Vector Machine-random Forest for Environmental Sound Classification

Authors: Faiz Ul Hasnain, Visha Iqbal, Tayyaba Javed, Muhammad Yasir

Journal: Journal of Computing & Biomedical Informatics

HEC Recognition History

Category	From	To
Y	2023-07-01	2024-09-30
Y	2022-07-01	2023-06-30

Publisher: Research Center of Computing & Biomedical Informatics

Country: Pakistan

Year: 2025

Volume: 9

Issue: 02

Language: en

Keywords: AdaBoostFeature FusionESCEnvironmental SoundUrbanSound8KESC10ESC50

Abstract

Environmental sound classification (ESC) is a method to differentiate the audio related to the various environmental sounds. Environmental sounds have a more complex time-frequency structure compared to structured sounds like music and speech. To extract the frequency and time-based features from audio more accurately and effectively, a novel fusion of several features including MFCCs, Mel-spectrogram, spectral skewness, spectral kurtosis and normalized pitch frequency will be evaluated in this study to provide a comprehensive representation of environmental sounds. The fusion will capture various aspects of the input audio data, such as spectral characteristics, statistical properties, and frequency-related information. By using multimodal information fusion, the algorithm will enhance the discriminative power of the model to distinguish between different sounds more effectively. Moreover, the integration of a variety of machine learning models will enhance the robustness and generalization ability of the model. The combination of several machine learning models will reduce the training time and enhance the classification rate of environmental audio under limited computational resources. Furthermore, this thesis will employ three data augmentation methods, namely, time stretch, pitch tuning, and white noise to minimize the probability of overfitting problems due to the limited audios in each class of dataset. This research will evaluate the ensemble model classification accuracy against baseline SVM, RF classifiers, and other state-of-the-art approaches. In UrbanSound8K, ESC-50, and ESC-10 datasets, the highest achieved accuracies using AdaBoost SVM-RF classifiers were ( 94%), (85%), and ( 95%) respectively. The experimental findings demonstrate that the suggested approach achieves superior performance for ESC tasks.

Paper summary is not available for this article yet.

Loading PDF...

Loading Statistics...

DefinePK

Adaptive Boosted Support Vector Machine-random Forest for Environmental Sound Classification

Article Information

HEC Recognition History

Categories

Abstract

DefinePK

Select Collection

Adaptive Boosted Support Vector Machine-random Forest for Environmental Sound Classification

Article Information

HEC Recognition History

Categories

Abstract