DefinePK

DefinePK hosts the largest index of Pakistani journals, research articles, news headlines, and videos. It also offers chapter-level book search.

A Resample-Smote Balance with Random Forest for improving seminal quality prediction in healthcare informatics


Article Information

Title: A Resample-Smote Balance with Random Forest for improving seminal quality prediction in healthcare informatics

Authors: Raihani Mohamed, Abdul Rafiez Abdul Raziff, Sabri Mohd. Nasir

Journal: ARPN Journal of Engineering and Applied Sciences

HEC Recognition History
Category From To
Y 2023-07-01 2024-09-30
Y 2022-07-01 2023-06-30
Y 2021-07-01 2022-06-30
X 2020-07-01 2021-06-30

Publisher: Khyber Medical College, Peshawar

Country: Pakistan

Year: 2021

Volume: 16

Issue: 21

Language: English

Categories

Abstract

Previous research shown that men seminal fertility rates significantly decreased in the last twenty years due to health status, life habit and environment. Thus, seminal quality prediction in men fertility has become very demanding decision support systems in the biomedical engineering field. Existing solutions focused on producing the accuracy of the prediction model. They also acknowledged that there were problems of imbalance class, ambiguity and noise of the sample dataset. However, the real issues of the domain were still vague. A framework consists of Resample-Smote Balance with the state-of-art Random Forest (RF) method is proposed to overcome the real-world problem and tested on the Fertility dataset. This method is introduced to alleviate the ambiguity and noise of the sample set. Subsequently, the Smote method is applied to balance the size of the dataset before the classification phase using RF is taken place. The performance of the proposed model is compared with other state-of-art classifiers such as MLP, SVM and DT. Consequently, this work manages to produce the best accuracy model with 98.2% that can improve the ambiguity and noise from the sample set at the same time competent to handle the imbalance class of the dataset. The RF also boost the accuracy model compare with other classifiers due to its capability to produce the most probable class from its majority-voting task as output.


Paper summary is not available for this article yet.

Loading PDF...

Loading Statistics...