Extended Project - Wholesales Customer
Extended Project - Wholesales Customer
2. Univariate Analysis……………………………..………………..………………………………………………………………………5
2.1. Explore all the variables and provide observations on the distributions of all the relevant
2.2. Explore all the categorical variables and provide observations on their
frequency……………………………………………………........................................................................................9
3 Multivariate Analysis……………………………………………..……………………………………………………………………13
3.3. Find the total spending across regions via different channels……………………………………………………16
3.4. Find the total spending on each of the categories across different region and channels……………16
3.5 Do the item varieties show similar behaviour across region and channel?...................................22
3.6 Is there any correlation between the different item varieties in terms of spending?....................24
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 440 entries, 0 to 439
Data columns (total 9 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Buyer/Spender 440 non-null int64
1 Channel 440 non-null object
2 Region 440 non-null object
3 Fresh 440 non-null int64
4 Milk 440 non-null int64
5 Grocery 440 non-null int64
6 Frozen 440 non-null int64
7 Detergents_Paper 440 non-null int64
8 Delicatessen 440 non-null int64
dtypes: int64(7), object(2)
Buyer/sender
Fresh
Milk
Grocery
Frozen
Detergents_paper
Delicatessen
Categorical data types are identified as follows
Channel
Region
2. Univariate Analysis
2.2 Explore all the categorical variables and provide observations on their frequency
Observations
1. Region:
The "Lisbon" region has the highest number of customers.
"Oporto" and "Other" regions have a similar number of customers.
2. Channel:
The "Hotel/Restaurant/Cafe" channel has the highest number of customers.
"Retailer" and "Direct" channels have a similar number of customers.
3. Multivariate Analysis
3.4 Find the total spending on each of the categories across different region and
channels
Observation
For item fresh, Hotel has the highest total spending followed by retail
In retail, highest total spending in other, followed by Oporto
In hotel, Highest total spending in other, followed by Lisbon
Other region is performing well in all channels
Observation
For item Milk, Retail has the highest total spending followed by Hotel
In retail, highest total spending in other, followed by lisbon
In hotel, Highest total spending in lisbon, followed by other
Observation
For item grocery, retail has the highest total spending followed by hotel
In retail, highest total spending in Lisbon, followed by other
In hotel, Highest total spending in Oporto, followed by Lisbon
Observation
For item frozen, Hotel has the highest total spending followed by retail
In retail, highest total spending in Lisbon,
In hotel, Highest total spending in Oporto, followed by other
Observation
For item detergents paper, Retail has the highest total spending followed by hotel
In retail, highest total spending in oporto, followed by Lisbon
In hotel, Highest total spending in Lisbon
Observation
For item delicatessen, retail has the highest total spending followed by hotel
In retail, highest total spending in Lisbon, followed by other
In hotel, Highest total spending in other, followed by lisbon
3.5 Do the item varieties show similar behavior across region and channel?
By region
By channel
Observation
3.6 Is there any correlation between the different item varieties in terms of
spending?
Observation
4. Univariate Analysis
By implementing these recommendations, the business can optimize its product offerings,
marketing strategies, and sales channels to increase revenue and profitability.