Heirarchical Clustering.ipynb - Colab
import pandas as pd
# Load the mall customers dataset and preview the first five rows
dataset = pd.read_csv('/content/Mall_Customers.csv')
dataset.head()
   CustomerID  Gender  Age  Annual Income (k$)  Spending Score (1-100)
0           1    Male   19                  15                      39
1           2    Male   21                  15                      81
2           3  Female   20                  16                       6
3           4  Female   23                  16                      77
4           5  Female   31                  17                      40
dataset.shape
(200, 5)
dataset.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 200 entries, 0 to 199
Data columns (total 5 columns):
 #   Column                  Non-Null Count  Dtype
---  ------                  --------------  -----
 0   CustomerID              200 non-null    int64
 1   Gender                  200 non-null    object
 2   Age                     200 non-null    int64
 3   Annual Income (k$)      200 non-null    int64
 4   Spending Score (1-100)  200 non-null    int64
dtypes: int64(4), object(1)
memory usage: 7.9+ KB
dataset.describe()
X = dataset.iloc[:, 3:]
X.head()
https://colab.research.google.com/drive/1IQVN2sKiNbELNVBA1WahQe7XEHKCMSRG#scrollTo=je3s-tguwhom&printMode=true 1/4
10/22/24, 11:16 AM Heirarchical_clustering.ipynb - Colab
   Annual Income (k$)  Spending Score (1-100)
0                  15                      39
1                  15                      81
2                  16                       6
3                  16                      77
4                  17                      40
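The positional slice `iloc[:, 3:]` keeps every column from index 3 onward, which here means the two numeric features, annual income and spending score. The sketch below rebuilds only the five rows printed by `dataset.head()` (the names `sample`, `X_by_position` and `X_by_name` are hypothetical, introduced for illustration) and suggests that selecting the same columns by name yields an identical frame while being less sensitive to column order.

```python
import pandas as pd

# The five rows printed by dataset.head(); `sample` is an illustrative
# stand-in for the full 200-row dataset, not the real file.
sample = pd.DataFrame({
    "CustomerID": [1, 2, 3, 4, 5],
    "Gender": ["Male", "Male", "Female", "Female", "Female"],
    "Age": [19, 21, 20, 23, 31],
    "Annual Income (k$)": [15, 15, 16, 16, 17],
    "Spending Score (1-100)": [39, 81, 6, 77, 40],
})

# Positional selection, as in the notebook: columns at index 3 and 4.
X_by_position = sample.iloc[:, 3:]

# Label-based selection of the same two columns.
X_by_name = sample[["Annual Income (k$)", "Spending Score (1-100)"]]

# Both selections should describe the same data.
print(X_by_position.equals(X_by_name))
```

Label-based selection may be the safer choice if the CSV's column order could change, since `iloc` would then silently pick different features.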
import scipy.cluster.hierarchy as hc
import matplotlib.pyplot as plt
# Set the default figure size (width, height in inches) for subsequent plots
plt.rcParams['figure.figsize'] = (15, 10)
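A minimal sketch of how `scipy.cluster.hierarchy` can build and cut a cluster hierarchy. It assumes Ward linkage and a three-cluster cut, neither of which is stated in the notebook, and it uses synthetic points (`points` and `centers` are hypothetical names) rather than `X`, so it runs without the CSV file.

```python
import numpy as np
import scipy.cluster.hierarchy as hc

# Synthetic stand-in for X: three well-separated 2-D blobs of 20 points each.
rng = np.random.default_rng(0)
centers = np.array([[20.0, 20.0], [60.0, 50.0], [100.0, 80.0]])
points = np.vstack([c + rng.normal(scale=2.0, size=(20, 2)) for c in centers])

# Agglomerative clustering; Ward linkage is an assumed choice, not the notebook's.
Z = hc.linkage(points, method="ward")

# Compute the dendrogram layout without drawing it.
# Dropping no_plot=True would render the tree with matplotlib instead.
tree = hc.dendrogram(Z, no_plot=True)

# Cut the hierarchy into at most three flat clusters (assumed cluster count).
labels = hc.fcluster(Z, t=3, criterion="maxclust")

print(Z.shape)            # one row per merge: (n_points - 1, 4)
print(np.unique(labels))  # flat cluster ids assigned by the cut
```

On the real data, passing `X` in place of `points` and inspecting the plotted dendrogram for the largest vertical gap is one common way to pick the number of clusters before calling `fcluster`.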