Online Retail II Customer Segmentation

Unsupervised clustering (RFM) for marketing actionability

Raw Transactions
525,461
After Cleaning
407,664
Customers
4,312
Final Clusters
3

RFMKMeansGMM ComparisonPCA

Approach

Model Selection

KMeans with k=3 achieved the strongest baseline separation.

Why k=3: balanced cluster quality and easy stakeholder interpretation.

Segment Definitions

Champions

Low recency, highest frequency, highest spend.

Potential Loyalists

Moderate recency and value; good conversion upside.

At-Risk

Long inactive window with low activity and lower spend.

Actions

Retention for Champions, upsell for Potential Loyalists, win-back for At-Risk.

Results Visualization

Customers per Segment

Customers per Segment

Avg Revenue by Segment

Average Revenue by Segment

PCA Projection

Customer Segments PCA projection

Two-dimensional projection used for communication, not for model training.

Deliverables and Deployment

Project status: baseline complete and presentation-ready.

1 / 7