
NPTEL Online Certification Courses

Indian Institute of Technology Kharagpur

Course Name: Introduction to Machine Learning


Assignment – Week 8 (Clustering)
TYPE OF QUESTION: MCQ/MSQ

Number of Questions: 7 Total Marks: 7 x 2 = 14


1. For two runs of K-Means clustering, is it expected to get the same clustering results?

A) Yes
B) No

Answer: (B)
The K-Means algorithm converges to a local minimum, which may also correspond to the global minimum in some cases, but not always. Therefore, it is advised to run the K-Means algorithm multiple times before drawing inferences about the clusters.
Note, however, that it is possible to get the same clustering results from K-Means by setting the same seed value for each run, since that makes the algorithm choose the same set of random numbers each time.
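
As a quick illustration, here is a minimal sketch using scikit-learn (the data and parameter values are purely illustrative):

import numpy as np
from sklearn.cluster import KMeans

X = np.random.RandomState(0).rand(100, 2)  # toy data

# Different seeds: the two runs may converge to different local minima.
labels_a = KMeans(n_clusters=3, n_init=1, random_state=1).fit_predict(X)
labels_b = KMeans(n_clusters=3, n_init=1, random_state=2).fit_predict(X)

# Same seed: identical initial centroids, hence identical clusterings.
labels_c = KMeans(n_clusters=3, n_init=1, random_state=42).fit_predict(X)
labels_d = KMeans(n_clusters=3, n_init=1, random_state=42).fit_predict(X)
assert (labels_c == labels_d).all()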

2. Which of the following can act as possible termination conditions in K-Means?

I. A fixed number of iterations has been completed.
II. Assignment of observations to clusters does not change between iterations (except in cases with a bad local minimum).
III. Centroids do not change between successive iterations.
IV. The RSS (residual sum of squares) falls below a threshold.

A) I, III and IV
B) I, II and III
C) I, II and IV
D) All of the above

Answer: D

3. After performing K-Means clustering analysis on a dataset, you observed the following
dendrogram. Which of the following conclusions can be drawn from the dendrogram?

A) There were 28 data points in clustering analysis.


B) The best number of clusters for the analysed data points is 4.
C) The proximity function used is Average-link clustering.
D) The above dendrogram interpretation is not possible for K-Means clustering
analysis.

Answer: (D)
A dendrogram is not possible for K-Means clustering analysis. However, one can create a clustergram based on K-Means clustering analysis.

4. What should be the best choice for the number of clusters based on the following results?

A) 1
B) 2
C) 3
D) 4

Answer: C
The silhouette coefficient is a measure of how similar an object is to its own cluster
compared to other clusters. The number of clusters for which the silhouette coefficient is
highest represents the best choice for the number of clusters.
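
As an aside, a minimal sketch of how one might compare silhouette coefficients across candidate values of K with scikit-learn (synthetic data stands in for the results plot referenced in the question):

from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

X, _ = make_blobs(n_samples=300, centers=3, random_state=0)

for k in range(2, 6):  # the silhouette score requires at least 2 clusters
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X)
    print(k, silhouette_score(X, labels))  # choose the K with the highest score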

5. Given six points with the following attributes:



Which of the following clustering representations and dendrograms depicts the use of the
MIN or single-link proximity function in hierarchical clustering?

A)

B)

C)

D)

Answer: (A)
For the single-link or MIN version of hierarchical clustering, the proximity of two
clusters is defined as the minimum distance between any two points in the different
clusters. For instance, from the table, the distance between points 3 and 6 is 0.11, and
that is the height at which they are joined into one cluster in the dendrogram. As another
example, the distance between clusters {3, 6} and {2, 5} is given by
dist({3, 6}, {2, 5}) = min(dist(3, 2), dist(6, 2), dist(3, 5), dist(6, 5)) = min(0.1483,
0.2540, 0.2843, 0.3921) = 0.1483.
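
A minimal sketch of single-link clustering with SciPy; random points stand in for the question's distance table, which is not reproduced here:

import numpy as np
from scipy.cluster.hierarchy import linkage
from scipy.spatial.distance import pdist

pts = np.random.RandomState(0).rand(6, 2)
Z = linkage(pdist(pts), method='single')  # MIN proximity between clusters

# Each row of Z is [cluster_i, cluster_j, merge_distance, new_cluster_size];
# the merge distance is the height at which the pair joins in the dendrogram.
print(Z)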

6. Which of the following algorithms is most sensitive to outliers?

A) K-means clustering
B) K-medians clustering
C) K-modes clustering
D) K-medoids clustering

Answer: A
K-means is the most sensitive because it uses the mean of the cluster data points to find the
cluster center.

7. What could be the possible reason(s) for producing two different dendrograms using
agglomerative clustering on the same dataset?

A) Proximity function
B) No. of data points
C) Variables used
D) All of these

Answer: D
A change in any of the proximity function, the variables used, or the data points will change
the dendrograms.
Introduction to Machine Learning - IITKGP
Assignment - 8
TYPE OF QUESTION: MCQ/MSQ
Number of Questions: 15 Total Marks: 15 x 2 = 30

1. What is true about K-Means clustering?


1. K-means is extremely sensitive to cluster center initializations
2. Bad initialization can lead to poor convergence speed
3. Bad initialization can lead to bad overall clustering
a. 1 and 2
b. 1 and 3
c. All of the above
d. 2 and 3

Correct Answer: c
Detailed Solution: All three of the given statements are true. K-means is extremely sensitive
to cluster center initialization, and bad initialization can lead to poor convergence speed as
well as bad overall clustering.
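
A small sketch of the standard mitigation in scikit-learn, which combines k-means++ seeding with multiple restarts and keeps the run with the lowest within-cluster sum of squares (synthetic data for illustration):

from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=200, centers=4, random_state=0)

# n_init restarts with k-means++ initialization; the best run (lowest
# inertia, i.e. within-cluster sum of squares) is retained.
km = KMeans(n_clusters=4, init='k-means++', n_init=10, random_state=0).fit(X)
print(km.inertia_)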

__________________________________________________________________________________

2. In which of the following cases will K-Means clustering fail to give good results? (Mark all
that apply)
a. Data points with outliers
b. Data points with round shapes
c. Data points with non-convex shapes
d. Data points with different densities

Correct Answer: a, c, d
Detailed Solution: The K-Means clustering algorithm fails to give good results when the data
contains outliers, when the density spread of data points across the data space is uneven, or
when the data points follow non-convex shapes.

__________________________________________________________________________________

3. Which of the following clustering algorithms suffers from the problem of convergence at
local optima? (Mark all that apply)
a. K- Means clustering algorithm
b. Agglomerative clustering algorithm
c. Expectation-Maximization clustering algorithm
d. Diverse clustering algorithm
Correct Answer: a, c

Detailed Solution: Out of the options given, only the K-Means clustering algorithm and the EM
clustering algorithm have the drawback of converging to local optima.

4. In the figure below, if you draw a horizontal line on the y-axis at y = 2, what will be the
number of clusters formed?

a. 1
b. 2
c. 3
d. 4

Correct Answer: b

Detailed solution: Since the number of vertical lines intersecting the horizontal line at
y = 2 in the dendrogram is 2, two clusters will be formed.
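
The same cut can be performed programmatically with SciPy's fcluster; synthetic data stands in for the dendrogram shown in the question:

import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

X = 3 * np.random.RandomState(0).rand(10, 2)  # toy points
Z = linkage(X, method='complete')

labels = fcluster(Z, t=2.0, criterion='distance')  # cut the dendrogram at height 2
print(len(set(labels)))  # number of clusters below the cut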

5. Assume, you want to cluster 7 observations into 3 clusters using K-Means clustering
algorithm. After first iteration the clusters: C1, C2, C3 has the following observations:
C1: {(1,1), (4,4), (7,7)}
C2: {(0,4), (4,0)}
C3: {(5,5), (9,9)}
What will be the cluster centroids if you want to proceed for second iteration?

a. C1: (4,4), C2: (2,2), C3: (7,7)


b. C1: (2,2), C2: (0,0), C3: (5,5)
c. C1: (6,6), C2: (4,4), C3: (9,9)
d. None of these
Correct Answer: a
Detailed Solution:
Finding the centroid for the data points in cluster C1 = ((1+4+7)/3, (1+4+7)/3) = (4, 4)
Finding the centroid for the data points in cluster C2 = ((0+4)/2, (4+0)/2) = (2, 2)
Finding the centroid for the data points in cluster C3 = ((5+9)/2, (5+9)/2) = (7, 7)
Hence, C1: (4,4), C2: (2,2), C3: (7,7)

__________________________________________________________________________________

6. Following Question 5, what will be the Manhattan distance for observation (9, 9) from
cluster centroid C1 in the second iteration?
a. 10
b. 5
c. 6
d. 7

Correct Answer: a
Detailed Solution: Manhattan distance between centroid C1, i.e. (4, 4), and (9, 9) = |9-4| +
|9-4| = 10.
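
A worked check of Questions 5 and 6 with NumPy:

import numpy as np

C1 = np.array([(1, 1), (4, 4), (7, 7)])
C2 = np.array([(0, 4), (4, 0)])
C3 = np.array([(5, 5), (9, 9)])

# Second-iteration centroids: C1 -> [4, 4], C2 -> [2, 2], C3 -> [7, 7]
for name, cluster in [('C1', C1), ('C2', C2), ('C3', C3)]:
    print(name, cluster.mean(axis=0))

# Manhattan distance from (9, 9) to centroid C1 = |9-4| + |9-4| = 10
print(np.abs(np.array([9, 9]) - C1.mean(axis=0)).sum())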

__________________________________________________________________________________

7. Which of the following is not a clustering approach?


a. Hierarchical
b. Partitioning
c. Bagging
d. Density-Based

Correct Answer: c
Detailed Solution: Follow lecture slides.

__________________________________________________________________________________

8. Which one of the following is correct?


a. Complete linkage clustering is computationally cheaper compared to single linkage.
b. Single linkage clustering is computationally cheaper compared to K-means clustering.
c. K-Means clustering is computationally cheaper compared to single linkage clustering.
d. None of the above.

Correct Answer: c
Detailed Solution: K-Means clustering is generally computationally more efficient than single
linkage hierarchical clustering. In K-Means, the main computational cost involves iterating over
data points and updating cluster assignments and centroids until convergence, which typically
converges in a reasonable number of iterations. In contrast, single linkage hierarchical
clustering involves pairwise distance computations for all data points at each step of the
hierarchy, making it computationally more expensive, especially for large datasets.

__________________________________________________________________________________
9. Considering single-link and complete-link hierarchical clustering, is it possible for a point to
be closer to points in other clusters than to points in its own cluster? If so, in which approach
will this tend to be observed?
a. No
b. Yes, single-link clustering
c. Yes, complete-link clustering
d. Yes, both single-link and complete-link clustering.

Correct Answer: d
Detailed Solution: In single-link hierarchical clustering, it is possible for a point to be closer to
points in other clusters than to points in its own cluster. This can lead to the phenomenon
known as "chaining," where clusters are stretched out because the similarity between two
clusters is determined by the closest pair of data points, which can sometimes result in points
from different clusters being closer to each other than to points within their own clusters.

In complete-link hierarchical clustering, it is also possible for a point to be closer to points in
other clusters than to points in its own cluster. This can lead to clusters being tightly packed
together and is sometimes referred to as the "crowding" problem.

__________________________________________________________________________________

10. After performing K-Means Clustering analysis on a dataset, you observed the following
dendrogram. Which of the following conclusions can be drawn from the dendrogram?
a. There were 28 data points in the clustering analysis
b. The best number of clusters for the analyzed data points is 4
c. The proximity function used is Average-link clustering
d. The above dendrogram interpretation is not possible for K-Means clustering
analysis

Correct Answer: d
Detailed Solution:
A dendrogram is not possible for K-Means clustering analysis. However, one can create a
clustergram based on K-Means clustering analysis.

11. Feature scaling is an important step before applying the K-Means algorithm. What is the
reason behind this?
a. In distance calculation it will give the same weights for all features
b. You always get the same clusters if you use or don’t use feature scaling
c. In Manhattan distance it is an important step but in Euclidean it is not
d. None of these

Correct Answer: a
Detailed Solution:
Feature scaling ensures that all the features get the same weight in the clustering analysis.
Consider a scenario of clustering people based on their weight (in kg, with a range of 55-110)
and height (in feet, with a range of 5.6-6.4). In this case, the clusters produced without scaling
can be very misleading, as the range of weight is much higher than that of height. Therefore,
it is necessary to bring them to the same scale so that they have equal weight in the clustering
result.
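
A minimal sketch of this preprocessing step with scikit-learn (the sample values below are illustrative):

import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

X = np.array([[60, 5.8], [95, 6.1], [72, 5.9], [105, 6.3]])  # [weight, height]

# Standardize each feature to zero mean and unit variance before clustering,
# so weight and height contribute equally to the distance computation.
X_scaled = StandardScaler().fit_transform(X)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X_scaled)
print(labels)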

12. Which of the following options is a measure of internal evaluation of a clustering algorithm?
a. Rand Index
b. Jaccard Index
c. Davies-Bouldin Index
d. F-score

Correct Answer: c
Detailed Solution: Follow lecture slides.
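
For reference, scikit-learn exposes the Davies-Bouldin index directly (it is an internal measure; lower values indicate better-separated clusters). A small sketch on synthetic data:

from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import davies_bouldin_score

X, _ = make_blobs(n_samples=300, centers=3, random_state=0)
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)
print(davies_bouldin_score(X, labels))  # no ground-truth labels needed
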
13. Given A = {0, 1, 2, 5, 6} and B = {0, 2, 3, 4, 5, 7, 9}, calculate the Jaccard Index of these two sets.
a. 0.50
b. 0.25
c. 0.33
d. 0.41

Correct Answer: c

Detailed Solution:

To calculate the Jaccard Index for two sets A and B, find the intersection and union of the sets,
then divide the size of the intersection by the size of the union.

Set A: {0, 1, 2, 5, 6}
Set B: {0, 2, 3, 4, 5, 7, 9}

Intersection (elements common to both sets): {0, 2, 5}
Union (all unique elements from both sets): {0, 1, 2, 3, 4, 5, 6, 7, 9}

Jaccard Index = (Size of Intersection) / (Size of Union) = 3/9 ≈ 0.33

So, the Jaccard Index for these two sets is approximately 0.33, and the correct answer is c.
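
The same computation as a short check with Python sets:

A = {0, 1, 2, 5, 6}
B = {0, 2, 3, 4, 5, 7, 9}

jaccard = len(A & B) / len(A | B)  # |intersection| / |union|
print(jaccard)  # 3 / 9 ≈ 0.33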

__________________________________________________________________________________

14. Suppose you run K-means clustering algorithm on a given dataset. What are the factors on
which the final clusters depend?
I. The value of K
II. The initial cluster seeds chosen
III. The distance function used.

a. I only
b. II only
c. I and II only
d. I, II and III
Correct Answer: d

Detailed Solution:

The final clusters in the K-means clustering algorithm depend on the following factors:

I. The value of K: The number of clusters (K) is a crucial parameter in K-means, and it significantly affects
the final clustering. Different values of K can lead to different clusterings.

II. The initial cluster seeds chosen: The initial placement of cluster centroids or seeds can impact the
convergence of the algorithm and the resulting clusters. Different initializations may lead to different
final cluster assignments.

III. The distance function used: The choice of distance metric (e.g., Euclidean distance, Manhattan
distance, etc.) influences how the algorithm measures the similarity or dissimilarity between data
points. Different distance functions can lead to different cluster shapes and assignments.

So, all three factors (I, II, and III) play a role in determining the final clusters in K-means.

The correct answer is d. I, II, and III.

15. Consider a training dataset with two numerical features namely, height of a person and age
of the person. The height varies from 4-8 and age varies from 1-100. We wish to perform
K-Means clustering on the dataset. Which of the following options is correct?

a. We should use feature scaling for the K-Means algorithm.
b. Feature scaling cannot be used for the K-Means algorithm.
c. You always get the same clusters whether or not you use feature scaling.
d. None of these

Correct Answer: a

Detailed Solution: In K-Means clustering, the scale of features can affect the clustering results. When
features have different scales, K-Means tends to give more weight to features with larger scales. In the
given scenario, the "height" feature has a range of 4-8, while the "age" feature has a range of 1-100.
Because of this significant difference in scales, it's advisable to use feature scaling to bring both
features to a similar scale. Standardization (subtracting the mean and dividing by the standard
deviation) or min-max scaling (scaling features to a specific range, like [0, 1]) are common methods for
feature scaling in K-Means.

Option b is incorrect because feature scaling can be used for K-Means, and it's often recommended
when dealing with features of different scales.

Option c is also incorrect. You do not always get the same clusters if you use or don't use feature
scaling. The initial centroids and the clustering results can be influenced by the scale of the features,
so feature scaling can lead to different clusters compared to not using it.
So, the correct answer is a: we should use feature scaling for the K-Means algorithm.

__________________________________________________________________________________

************END************