Kmeans-SMOTE Integration forHandlingImbalance DatainClassifying Financial Distress Companiesusing SVM and Naïve Bayes
Dublin Core
Title
Kmeans-SMOTE Integration forHandlingImbalance DatainClassifying Financial Distress Companiesusing SVM and Naïve Bayes
Subject
K-means-SMOTE;Data Imbalance;Financial Distress
Description
mbalanced data presents significant challenges in machine learning, leading to biased classification outcomes favouringthe majority class. This issue is especially pronounced in financial distress classification, where data imbalance is common due to the scarcity of such instances in real-world datasets. This study aims to mitigate data imbalance in financial distress companies using the Kmeans-SMOTE method approach by combining K-meansclustering and the Synthetic Minority Oversampling Technique (SMOTE). Various classification approaches, including Naïve Bayes and Support Vector Machine (SVM),are implemented on a financial distress dataset from Kaggle to evaluate the effectiveness of Kmeans-SMOTE. Experimental results show that SVM outperforms Naïve Bayes with impressive accuracy (99.1%), f1-score (99.1%),Area Under Precision-Recall (AUPRC) (99.1%), and Geometric-mean (Gmean) (98.1%). Based on these results, Kmeans-SMOTE canbalance the data effectively, leading to a quite significant improvement in performance
Creator
Didit Johar Maulana1, Siti Saadah2, Prasti Eko Yunanto
Source
https://jurnal.iaii.or.id/index.php/RESTI/article/view/5140/892
Publisher
Departmentof Informatics, Informatics, Telkom University, Bandung, Indonesia
Date
07-02-2024
Contributor
FAJAR BAGUS W
Format
PDF
Language
ENGLISH
Type
TEXT
Files
Collection
Citation
Didit Johar Maulana1, Siti Saadah2, Prasti Eko Yunanto, “Kmeans-SMOTE Integration forHandlingImbalance DatainClassifying Financial Distress Companiesusing SVM and Naïve Bayes,” Repository Horizon University Indonesia, accessed February 3, 2026, https://repository.horizon.ac.id/items/show/10204.