To improve product recommendations for new customers, AeroFit's market research team aims to determine the unique characteristics of the target audience for each treadmill model. The team will investigate potential differences in customer demographics, preferences, and behaviours across various treadmill types to tailor recommendations more effectively. Following is their product portfolio
To overcome challenges, Aerofit relies on skilled data analysts. Their data-driven approach involves analyzing historical data to gain actionable insights for decision-making. Through this analysis, they uncover patterns, trends, and viewer preferences, enabling Aerofit to determine suitable product recommendations for customers. Furthermore, data analysis enables the company to optimize its sales strategy based on customer type, fostering global business growth. Utilizing Python, Pandas, Seaborn, and Matplotlib for data analysis and visualization, I conducted an in-depth analysis of the Aerofit data provided by Kaggle. The primary objective of this analysis was to gain insights into the following aspects:
We will use the count of users, probabilities, and conditional probabilities to evaluate the users.
The dataset consists of sales data for Aerofit products
Attribute | Details |
---|---|
Product Purchased | KP281, KP481, or KP781 |
Age | In years |
Gender | Male/Female |
Education | In years |
Marital Status | Single or partnered |
Usage (per week) | The average number of times the customer plans to use the treadmill each week |
Income ($) | Annual income (in $) |
Fitness (1-5) | Self-rated fitness on a 1-to-5 scale, where 1 is poor shape and 5 is excellent shape |
Miles (per week) | The average number of miles the customer expects to walk/run each week |
I have conducted Exploratory Data Analysis (EDA) on each column in the dataset. This involved generating histograms, countplots, box plots, and line plots. Additionally, I used pair plots and correlation plots to examine relationships between columns. This EDA process gave me a deeper understanding of each column and the relationships between pairs of columns.
Here are the links to the Jupyter Notebook containing detailed code, insights, and recommendations:
Following is the embedded version of above notebook hosted on Kaggle