
EE769 Introduction to Machine Learning (July 2024 edition)

Electrical Engineering, Indian Institute of Technology Bombay

Programming Assignment 4: Unsupervised Learning

Instructions:

a) Submit only IPython notebooks. The notebook should be complete code plus a report, with copious
comments, references and URLs, outputs, critical observations, and your reasoning for choosing the next steps.
b) Use good coding practices, such as avoiding hard-coding, using self-explanatory variable names, and using
functions where applicable. This will also be graded.
c) Cite your sources if you use code from the internet, and clarify what you have modified. Ensure that the
code has a permissive license or that academic use can reasonably be considered 'fair use'.
d) Submit a link to a viewable 10-minute video walkthrough of your code and insights.

Problem statements:

Data: https://www.kaggle.com/datasets/alirezachahardoli/customer-data-clustring

Objective: Derive customer insights from their credit-card usage features

1. Data preprocessing: [2]
   a. Visualize and pre-process the data as appropriate. You might have to use a power, an exponential, or a log transformation.
   b. You may find and drop some of the highly correlated or inappropriate variables, or encode discrete variables as appropriate (see the preprocessing sketch after the problem statements).
2. Clustering: Try to find meaningful customer segments using clustering. [4]
   a. Train k-means, and find an appropriate number of clusters k.
   b. Train DBSCAN, and see whether, by varying MinPts and ε, you can get the same number of clusters as k-means.
   c. Using the cluster assignment as the label, visualize the t-SNE embedding.
   d. Try to give each cluster a name, such as "reckless spenders" (see the clustering sketch below).
3. PCA: Try to find whether only a few components/directions explain most of the variance in the data. [3]
   a. First, normalize each variable independently. Then train PCA on appropriate variables.
   b. Plot the variance explained versus the number of PCA dimensions.
   c. Reconstruct the data with various numbers of PCA dimensions, and compute the MSE (see the PCA sketch below).
