Module 1 Quiz - Coursera
Module 1 Quiz
Due May 7, 11:59 PM +04
Graded Quiz • 20 min • 11 total points
Module 1: Fundamentals of Machine Learning - Intro to SciKit Learn
1. Select the option that correctly completes the sentence: 1 point
Unsupervised Learning
Classification
Supervised Learning
Regression
Training a model using categorically labelled data to predict labels for new data is known as __________.
Regression
Feature Extraction
Classification
Clustering
Training a model using labelled data where the labels are continuous quantities to predict labels for new data is
known as __________.
Clustering
Regression
Classification
Feature Extraction
5. Using the data for classes 0, 1, and 2 plotted below, what class would a KNeighborsClassifier classify the new point as for k = 1 and k = 3? 1 point
k=1: Class 1
k=3: Class 0
k=1: Class 0
k=3: Class 1
k=1: Class 2
k=3: Class 1
k=1: Class 0
k=3: Class 2
k=1: Class 1
k=3: Class 2
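The following is a minimal pure-Python sketch of how a k-nearest-neighbors majority vote can give different answers for k = 1 and k = 3. The coordinates, the `knn_predict` helper, and the new point are all hypothetical. They do not reproduce the quiz's plot, and the helper is a stand-in for scikit-learn's KNeighborsClassifier, not its implementation.

```python
from collections import Counter
import math

# Hypothetical labelled points (x, y, class); NOT the data from the quiz's plot.
training_points = [
    (1.0, 1.0, 0),
    (1.5, 1.6, 1),
    (1.6, 1.5, 1),
    (4.0, 4.0, 2),
    (4.2, 3.9, 2),
]

def knn_predict(points, query, k):
    """Return the majority class among the k training points closest to query."""
    by_distance = sorted(points, key=lambda p: math.dist((p[0], p[1]), query))
    votes = Counter(label for _, _, label in by_distance[:k])
    return votes.most_common(1)[0][0]

new_point = (1.1, 1.1)  # hypothetical query point
prediction_k1 = knn_predict(training_points, new_point, k=1)
prediction_k3 = knn_predict(training_points, new_point, k=3)
print(prediction_k1, prediction_k3)  # 0 1
```

With k = 1 only the single closest point (class 0) votes. With k = 3 the two class 1 points join the vote and outnumber it.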
6. Which of the following would *not* be something you specify for a nearest neighbor classifier algorithm? 1 point
A method for pooling the classes of neighbor points to make a final classification decision
7. Why is it important to examine your dataset as a first step in applying machine learning? (Select all that apply): 1 point
It is not important
8. The key purpose of splitting the dataset into training and test sets is: 1 point
To reduce the amount of labelled data needed for evaluating classifier accuracy
To estimate how well the learned model will generalize to new data
To reduce the number of features we need to consider as input to the learning algorithm
9. The purpose of setting the random_state parameter in train_test_split is: (Select all that apply) 1 point
To split the data into similar subsets so that bias is not introduced into the final results
To make experiments easily reproducible by always using the same partitioning of the data
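As a rough illustration of what a fixed seed does, the sketch below shuffles row indices with a seeded generator and shows that the same seed yields the same partition. The `split_indices` helper is hypothetical and uses only the standard library. It is not scikit-learn's implementation, where the `random_state` argument of `train_test_split` plays the corresponding role.

```python
import random

def split_indices(n_samples, test_fraction, seed):
    """Shuffle row indices with a seeded generator, then cut off a test portion."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # the seed fixes the shuffle order
    n_test = round(n_samples * test_fraction)
    return indices[n_test:], indices[:n_test]  # (train, test)

# Two runs with the same seed produce the same partition.
train_a, test_a = split_indices(20, 0.25, seed=0)
train_b, test_b = split_indices(20, 0.25, seed=0)
same_partition = (train_a == train_b) and (test_a == test_b)
print(same_partition)  # True
```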
10. Given a dataset with 10,000 observations and 50 features plus one label, what would be the dimensions of X_train, y_train, X_test, and y_test? Assume a train/test split of 75%/25%. 1 point
X_train: (2500, )
y_train: (2500, )
X_test: (7500, 50)
y_test: (7500, )
y_train: (10000, )
X_test: (10000, 50)
y_test: (10000, )
y_train: (7500, )
X_test: (2500, 50)
y_test: (2500, )
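The shape arithmetic behind a train/test split can be checked without any library. The sketch below uses a hypothetical `split_shapes` helper. Rounding the test size up is an assumption made here, and it has no effect for these numbers because 25% of 10,000 is a whole number.

```python
import math

def split_shapes(n_samples, n_features, test_fraction):
    """Compute array shapes after splitting n_samples rows into train and test."""
    n_test = math.ceil(n_samples * test_fraction)  # assumed rounding rule
    n_train = n_samples - n_test
    return {
        "X_train": (n_train, n_features),  # features keep all columns
        "y_train": (n_train,),             # labels are one value per row
        "X_test": (n_test, n_features),
        "y_test": (n_test,),
    }

shapes = split_shapes(10_000, 50, 0.25)
for name, shape in shapes.items():
    print(name, shape)

# With scikit-learn and arrays X (10000, 50) and y (10000,), the corresponding
# call would be: train_test_split(X, y, test_size=0.25, random_state=0)
```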
11. Which of the following statements about the effect of the number of neighbors parameter k is true? 1 point
For small values of k (e.g. k = 1), the classifier will tend to be much more sensitive to noise, mislabeled data,
and other sources of variation for individual data points.
For larger values of k (e.g. k = 10), the regions in feature space assigned to different classes generally
become less fragmented and have smoother decision boundaries between the regions, compared to results
using smaller values of k.
For small values of k (e.g. k = 1), the classifier will tend to be much less sensitive to noise, mislabeled data,
and other sources of variation for individual data points.
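To see how k interacts with a single mislabelled point, the sketch below runs a majority vote on hypothetical one-dimensional data in which one value inside a class 0 cluster carries the wrong label. The data, the query value, and the `knn_predict_1d` helper are assumptions for illustration only.

```python
from collections import Counter

# Hypothetical 1-D data as (value, label); 1.6 sits in the class 0 cluster
# but is deliberately mislabelled as class 1.
labelled_values = [
    (1.0, 0), (1.2, 0), (1.4, 0), (1.6, 1), (1.8, 0), (2.0, 0),
    (5.0, 1), (5.2, 1), (5.4, 1),
]

def knn_predict_1d(data, query, k):
    """Majority label among the k values closest to query."""
    nearest = sorted(data, key=lambda item: abs(item[0] - query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

query = 1.65  # hypothetical query right next to the mislabelled point
predictions = {k: knn_predict_1d(labelled_values, query, k) for k in (1, 3, 5)}
print(predictions)  # {1: 1, 3: 0, 5: 0}
```

For k = 1 the prediction follows the lone mislabelled neighbor, while for k = 3 and k = 5 the surrounding class 0 points outvote it.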