0% found this document useful (0 votes)
75 views

Radhika PCA - Problem Statement

Data science

Uploaded by

Radhika Giraddi
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
75 views

Radhika PCA - Problem Statement

Data science

Uploaded by

Radhika Giraddi
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

Topic: Dimension Reduction With PCA

Instructions:
Please share your answers filled in-line in the word document. Submit code separately
wherever applicable.
Please ensure you update all the details:
Name:vaanishree
Batch ID: 30/11/2021
Topic: Principal Component Analysis

Problem Statement: -
Perform hierarchical and K-means clustering on the dataset. After that, perform PCA on
the dataset and extract the first 3 principal components and make a new dataset with
these 3 principal components as the columns. Now, on this new dataset, perform
hierarchical and K-means clustering. Compare the results of clustering on the original
dataset and clustering on the principal components dataset (use the scree plot
technique to obtain the optimum number of clusters in K-means clustering and check if
you’re getting similar results with and without PCA).

ANS -I attached all programs code for that problem statement.Please


see in python code because here and there I maintain code with
explanation.

© 2013 - 2021 360DigiTMG. All Rights Reserved.


© 2013 - 2021 360DigiTMG. All Rights Reserved.
Problem Statement: -

A pharmaceuticals manufacturing company is conducting a study on a new medicine to treat


heart diseases. The company has gathered data from its secondary sources and would like you
to provide high level analytical insights on the data. Its aim is to segregate patients depending
on their age group and other factors given in the data. Perform PCA and clustering algorithms on
the dataset and check if the clusters formed before and after PCA are the same and provide a
brief report on your model. You can also explore more ways to improve your model.

Note: This is just a snapshot of the data. The datasets can be downloaded from AiSpry LMS in
the Hands-On Material section.

ANS - I attached all programs code for that problem statement.Please see in python code
because here and there I maintain code with explanation.

© 2013 - 2021 360DigiTMG. All Rights Reserved.

You might also like