Online Retail Customer Segmentation

RFM-based clustering with KMeans baseline and GMM comparison

Raw rows loaded
525,461
Rows after cleaning
407,664
Customers segmented
4,312
Chosen clusters (K)
3

Model Selection

Model Silhouette Score Interpretation
KMeans (k=3) 0.4117 Best baseline
GMM (k=3) Run notebook cell for exact value Secondary comparison

Cluster Profiles (Relative RFM)

Cluster Segment Name Recency Frequency Monetary Size
0 Champions 34.06 (low) 2.18 (high) 8.00 (high) 1,361
1 At-Risk 249.76 (very high) 0.85 (low) 5.59 (low) 955
2 Potential Loyalists 54.24 (mid) 1.07 (mid-low) 6.16 (mid) 1,996

Key Plots

Customers per Segment

Bar chart showing customer count per segment

Average Revenue per Customer by Segment

Bar chart showing average revenue per segment

Customer Segments (PCA projection)

Scatter plot of customer segments projected onto two PCA components

Recommended Actions

Project Outputs

Note: Frequency and Monetary values in clustering were log-transformed for modeling stability. Use the exported file for business-facing dashboards and campaign execution.