Skip to content

Project uses K-Means clustering to analyze mall customers based on spending behavior and annual income

License

Notifications You must be signed in to change notification settings

gargi510/R-Mall_Customer_Segmentation

Repository files navigation

🛍️ Retail Insights: Mall Customer Segmentation using R

📌 Project Overview

Customer segmentation is a crucial aspect of business strategy, allowing companies to understand their customers better and optimize marketing efforts. This project applies K-Means Clustering to segment mall customers based on their Annual Income and Spending Score, helping businesses tailor personalized experiences and improve customer retention.


📂 Dataset

  • Description: The dataset consists of demographic and spending behavior data of mall customers.
  • Features:
    • CustomerID: Unique identifier for each customer
    • Gender: Customer's gender
    • Age: Customer's age
    • Annual Income (k$): Customer's yearly income in thousands of dollars
    • Spending Score (1-100): A score assigned based on customer spending behavior

🎯 Objective

✔ Perform Exploratory Data Analysis (EDA) to identify patterns and insights.
✔ Implement K-Means Clustering to categorize customers into distinct groups.
✔ Visualize the customer segments to understand their characteristics.
✔ Provide actionable insights for businesses to enhance marketing strategies.


🛠️ Tech Stack

  • Language: R
  • Libraries Used: ggplot2, dplyr, tidyverse, cluster, factoextra
  • Clustering Algorithm: K-Means

🔎 Key Insights

  • Identified distinct customer groups based on income and spending patterns.
  • Visualized clusters using scatter plots and elbow method to determine the optimal number of clusters.
  • Provided strategic recommendations for targeting high-value customers and improving retention.

🚀 How to Run the Project

1️⃣ Install the required R packages:

install.packages(c("ggplot2", "dplyr", "tidyverse", "cluster", "factoextra"))

2️⃣ Load the dataset into R.
3️⃣ Perform data preprocessing and exploratory analysis.
4️⃣ Apply the K-Means Clustering Algorithm.
5️⃣ Visualize and interpret the customer segments.


📊 Visualizations

Key visualizations include:
📌 Scatter plots showing customer segments
📌 Distribution of spending scores and income
📌 Cluster separation using principal component analysis (PCA)


👩‍💻 Author

Developed by Gargi Mishra
📌 LinkedIn

About

Project uses K-Means clustering to analyze mall customers based on spending behavior and annual income

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages