Customer Segmentation Using an Enhanced RFM–K-Means Framework on
The Online Retail Dataset
Dublin Core
Title
Customer Segmentation Using an Enhanced RFM–K-Means Framework on
The Online Retail Dataset
The Online Retail Dataset
Subject
Customer Segmentation, Online Retail Transactions, RFM Analysis, K-Means Clustering, Cluster Optimization, Targeted Marketing, Retail Analytics
Description
Effective customer segmentation is crucial for online retailers to enhance marketing strategies and boost profitability. However, analyzing
transactional data often reveals challenges, such as noisy records and incomplete temporal patterns, which hinder accurate customer profiling.
This paper proposes a robust methodology combining RFM (Recency, Frequency, Monetary) analysis with enhanced K-means clustering to
segment customers of a UK-based online retailer, using data from December 2010 to December 2011. We preprocess the data to handle anomalies,
engineer RFM features, and optimize cluster selection using the Elbow Method and Davies-Bouldin score, identifying four distinct segments:
Best Customers, Loyal Customers, Almost Lost, and Lost Cheap Customers. Results show a 5% improvement in segmentation accuracy compared
to baseline methods, with actionable insights for targeted marketing. This approach not only advances customer segmentation techniques but also
offers practical value for retail businesses aiming to improve customer retention and sales.
transactional data often reveals challenges, such as noisy records and incomplete temporal patterns, which hinder accurate customer profiling.
This paper proposes a robust methodology combining RFM (Recency, Frequency, Monetary) analysis with enhanced K-means clustering to
segment customers of a UK-based online retailer, using data from December 2010 to December 2011. We preprocess the data to handle anomalies,
engineer RFM features, and optimize cluster selection using the Elbow Method and Davies-Bouldin score, identifying four distinct segments:
Best Customers, Loyal Customers, Almost Lost, and Lost Cheap Customers. Results show a 5% improvement in segmentation accuracy compared
to baseline methods, with actionable insights for targeted marketing. This approach not only advances customer segmentation techniques but also
offers practical value for retail businesses aiming to improve customer retention and sales.
Creator
Isnandar Agus1,*, MS Hasibuan2
Source
https://ijiis.org/index.php/IJIIS/article/view/289/172
Publisher
Institute Informatics and Business Darmajaya, Indonesia, Bandar Lampung, Indonesia
Date
desember 2025
Contributor
Fajar bagus W
Format
PDF
Language
English
Type
Text
Files
Collection
Citation
Isnandar Agus1,*, MS Hasibuan2, “Customer Segmentation Using an Enhanced RFM–K-Means Framework on
The Online Retail Dataset,” Repository Horizon University Indonesia, accessed January 1, 2026, https://repository.horizon.ac.id/items/show/9741.
The Online Retail Dataset,” Repository Horizon University Indonesia, accessed January 1, 2026, https://repository.horizon.ac.id/items/show/9741.