A stacking ensemble model with SMOTE for improved imbalanced classification on credit data

Dublin Core

Title

A stacking ensemble model with SMOTE for improved imbalanced classification on credit data

Subject

Banking data
Classification
Credit risk
Stacking ensemble
Synthetic minority over-sampling technique

Description

This research addresses a significant problem in credit risk analysis in the banking sector: class imbalance, which leaves the model unable to accurately identify risk in the “Charged Off” class. As a solution, we propose a stacked ensemble approach that uses the synthetic minority over-sampling technique (SMOTE) to balance the class distribution. In the experiments, SMOTE was applied to the training data before the credit model was trained with gradient boosting (XGBoost) and random forest (RF) algorithms combined in a single ensemble. The results show significant improvements in precision, recall, and F1-score on the imbalanced classes after SMOTE was applied. The updated model achieved an accuracy of 0.97 on the resampled training data. The study identifies class imbalance as a major challenge in credit risk analysis and finds that applying SMOTE within a stacked ensemble is effective in improving model performance, contributing to more reliable credit models for better risk management and revenue generation in financial institutions.
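The oversampling step named in the description can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `smote` and `balance_training_set`, the toy feature values, and the label "Other" are assumptions made for illustration, and only the Python standard library is used. The sketch shows the core SMOTE idea, which is to create each synthetic minority sample by interpolating between a minority point and one of its k nearest minority neighbors, applied to the training split only.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Return n_new synthetic points interpolated between minority neighbors."""
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbors than other points
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # k nearest minority neighbors of the base point, excluding the point itself
        others = [p for j, p in enumerate(minority) if j != i]
        neighbors = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbor = rng.choice(neighbors)
        gap = rng.random()  # position along the segment base -> neighbor
        synthetic.append(tuple(b + gap * (n - b) for b, n in zip(base, neighbor)))
    return synthetic


def balance_training_set(X_train, y_train, minority_label, k=5, seed=0):
    """Oversample the minority class until it matches the rest of the training set."""
    minority = [x for x, y in zip(X_train, y_train) if y == minority_label]
    n_new = (len(y_train) - len(minority)) - len(minority)
    new_points = smote(minority, max(n_new, 0), k=k, seed=seed)
    return list(X_train) + new_points, list(y_train) + [minority_label] * len(new_points)


# Hypothetical toy training data: 8 majority rows, 2 "Charged Off" rows.
X_train = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (1.0, 1.0), (1.1, 0.9),
           (0.9, 1.2), (1.2, 1.1), (1.0, 0.8), (5.0, 5.0), (5.2, 5.1)]
y_train = ["Other"] * 8 + ["Charged Off"] * 2

X_bal, y_bal = balance_training_set(X_train, y_train, "Charged Off")
print(len(X_bal), y_bal.count("Charged Off"), y_bal.count("Other"))
```

In practice this step would more likely be done with a library such as imbalanced-learn (its `SMOTE` class), with the resampled training data then passed to a stacked model, for example scikit-learn's `StackingClassifier` wrapping XGBoost and random forest base learners. The record does not state which tooling the authors used, so that pairing is an assumption.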

Creator

Nur Alamsyah, Budiman, Titan Parama Yoga, R. Yadi Rakhman Alamsyah

Source

Journal homepage: http://telkomnika.uad.ac.id

Date

Feb 27, 2024

Contributor

PERI IRAWAN

Format

PDF

Language

ENGLISH

Type

TEXT

Citation

Nur Alamsyah, Budiman, Titan Parama Yoga, R. Yadi Rakhman Alamsyah, “A stacking ensemble model with SMOTE for improved imbalanced classification on credit data,” Repository Horizon University Indonesia, accessed January 12, 2026, https://repository.horizon.ac.id/items/show/10141.