Module 1 Quiz - Coursera15

The document is a quiz for a machine learning module that covers the following topics:
- Supervised learning involves training a model using labeled data and using the model to predict labels for new data.
- Unsupervised learning involves modeling the features of an unlabeled dataset to find hidden structure.
- Classification involves training a model using categorically labeled data to predict labels for new data.
- Regression involves training a model using labeled data where the labels are continuous quantities to predict labels for new data.
- A k-Nearest Neighbors model would classify a new point as class 1 for k=1 and class 0 for k=3 based on the plotted data for classes 0, 1, and 2.

Uploaded by

Luai Doumet

5/21/23, 9:44 PM Module 1 Quiz | Coursera

Module 1 Quiz
Due May 7, 11:59 PM +04
Graded Quiz • 20 min • 11 total points

Module 1: Fundamentals of Machine Learning - Intro to SciKit Learn

1. Select the option that correctly completes the sentence: 1 point

Training a model using labeled data and using this model to predict the labels for new data is known as ____________.

Density Estimation

Supervised Learning

Clustering

Unsupervised Learning

2. Select the option that correctly completes the sentence: 1 point

Modeling the features of an unlabeled dataset to find hidden structure is known as ____________.

Unsupervised Learning

Classification

Supervised Learning

Regression

3. Select the option that correctly completes the sentence: 1 point

Training a model using categorically labelled data to predict labels for new data is known as __________.

Regression

Feature Extraction

Classification

Clustering

4. Select the option that correctly completes the sentence: 1 point

Training a model using labelled data where the labels are continuous quantities to predict labels for new data is
known as __________.

Clustering

Regression

Classification

Feature Extraction

5. Using the data for classes 0, 1, and 2 plotted below, what class would a KNeighborsClassifier classify the new point as for k = 1 and k = 3? 1 point

k=1: Class 1
k=3: Class 0

k=1: Class 0
k=3: Class 1

k=1: Class 2
k=3: Class 1

k=1: Class 0
k=3: Class 2

k=1: Class 1
k=3: Class 2
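
The plot for this question is not reproduced here, so the following is only a minimal sketch of how a nearest-neighbor vote can change between k = 1 and k = 3. The points, labels, and the `knn_predict` helper are hypothetical stand-ins, not the quiz data and not scikit-learn's KNeighborsClassifier; the sketch assumes Euclidean distance and an unweighted majority vote.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k):
    """Return the majority label among the k training points closest to query."""
    # Rank training indices by Euclidean distance to the query point.
    order = sorted(range(len(train_points)),
                   key=lambda i: math.dist(train_points[i], query))
    votes = Counter(train_labels[i] for i in order[:k])
    return votes.most_common(1)[0][0]


# Hypothetical points, NOT the data from the quiz plot.
points = [(1.0, 1.0), (2.0, 1.0), (2.0, 2.0), (1.5, 1.6), (4.0, 4.0), (4.5, 3.5)]
labels = [0, 0, 0, 1, 2, 2]
query = (1.6, 1.5)

pred_k1 = knn_predict(points, labels, query, k=1)
pred_k3 = knn_predict(points, labels, query, k=3)
print("k=1:", pred_k1)
print("k=3:", pred_k3)
```

In this made-up layout the single closest point belongs to class 1, but two of the three closest points belong to class 0, so the two values of k disagree.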

6. Which of the following would *not* be something you specify for a nearest neighbor classifier algorithm? 1 point

The number of initial clusters to create from the training set

How many nearest neighbors to examine

A distance metric that finds neighbor points in the training set

An optional weighting function on the neighbor points

A method for pooling the classes of neighbor points to make a final classification decision

7. Why is it important to examine your dataset as a first step in applying machine learning? (Select all that apply): 1 point

See what type of cleaning or preprocessing still needs to be done

You might notice missing data

Gain insight on what machine learning model might be appropriate, if any

Get a sense for how difficult the problem might be

It is not important

8. The key purpose of splitting the dataset into training and test sets is: 1 point

To speed up the training process

To reduce the amount of labelled data needed for evaluating classifier accuracy

To estimate how well the learned model will generalize to new data

To reduce the number of features we need to consider as input to the learning algorithm

9. The purpose of setting the random_state parameter in train_test_split is: (Select all that apply) 1 point

To avoid predictable splitting of the data

To avoid bias in data splitting

To split the data into similar subsets so that bias is not introduced into the final results

To make experiments easily reproducible by always using the same partitioning of the data
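
As a rough illustration of what fixing a seed does to a random partition, the sketch below uses Python's standard `random` module as a stand-in. The `split_indices` helper and its parameters are hypothetical and are not scikit-learn's implementation; in train_test_split the `random_state` parameter plays the role of `seed` here.

```python
import random


def split_indices(n_samples, test_fraction, seed):
    """Shuffle row indices with a seeded generator, then cut off a test portion."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # same seed -> same shuffle order
    n_test = round(n_samples * test_fraction)
    return indices[n_test:], indices[:n_test]  # (train, test)


train_a, test_a = split_indices(20, 0.25, seed=0)
train_b, test_b = split_indices(20, 0.25, seed=0)

# Repeating the call with the same seed reproduces the same partition.
same_seed_matches = (train_a, test_a) == (train_b, test_b)
print(same_seed_matches)
```

The split is still random with respect to the data order; the seed only makes that randomness repeatable from run to run.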

10. Given a dataset with 10,000 observations and 50 features plus one label, what would be the dimensions of X_train, y_train, X_test, and y_test? Assume a train/test split of 75%/25%. 1 point

X_train: (2500, )
y_train: (2500, 50)
X_test: (7500, )
y_test: (7500, 50)

X_train: (2500, 50)
y_train: (2500, )
X_test: (7500, 50)
y_test: (7500, )

X_train: (10000, 50)
y_train: (10000, )
X_test: (10000, 50)
y_test: (10000, )

X_train: (10000, 28)
y_train: (10000, )
X_test: (10000, 12)
y_test: (10000, )

X_train: (7500, 50)
y_train: (7500, )
X_test: (2500, 50)
y_test: (2500, )
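
The arithmetic behind this question can be checked with a few lines of plain Python. This is only a sketch under two assumptions: the feature matrix has one row per observation and one column per feature, and the single label column is stored as a 1-D array. The variable names are illustrative.

```python
import math

n_samples, n_features = 10_000, 50
test_fraction = 0.25

# Rows are divided between the two sets; columns (features) are untouched.
n_test = math.ceil(n_samples * test_fraction)
n_train = n_samples - n_test

shapes = {
    "X_train": (n_train, n_features),
    "y_train": (n_train,),
    "X_test": (n_test, n_features),
    "y_test": (n_test,),
}
for name, shape in shapes.items():
    print(name, shape)
```

Splitting only changes the number of rows in each piece, so every X array keeps all 50 feature columns.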

11. Which of the following statements about the effect of the number of neighbors parameter k is true? 1 point

For small values of k (e.g. k = 1), the classifier will tend to be much more sensitive to noise, mislabeled data,
and other sources of variation for individual data points.

Nearest neighbor classifier accuracy is never significantly affected by the choice of k.

For larger values of k (e.g. k = 10), the regions in feature space assigned to different classes generally
become less fragmented and have smoother decision boundaries between the regions, compared to results
using smaller values of k.

For small values of k (e.g. k = 1), the classifier will tend to be much less sensitive to noise, mislabeled data,
and other sources of variation for individual data points.
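
One way to explore how k interacts with a noisy label is to count how often the predicted class changes along a line of query points. The sketch below is a hedged illustration on hypothetical 1-D data with one deliberately mislabeled point; `knn_predict_1d` and `count_boundaries` are made-up helpers, not part of scikit-learn.

```python
from collections import Counter

# Hypothetical 1-D data: class 0 on the left, class 1 on the right,
# plus one deliberately mislabeled point at x = 2.0.
xs = [0.0, 1.0, 2.0, 3.0, 4.0, 6.0, 7.0, 8.0, 9.0, 10.0]
labels = [0, 0, 1, 0, 0, 1, 1, 1, 1, 1]


def knn_predict_1d(query, k):
    """Majority label among the k training values closest to query."""
    order = sorted(range(len(xs)), key=lambda i: abs(xs[i] - query))
    return Counter(labels[i] for i in order[:k]).most_common(1)[0][0]


def count_boundaries(k):
    """Count class changes between neighboring query points on a grid over [0, 10]."""
    grid = [i * 0.1 for i in range(101)]
    preds = [knn_predict_1d(x, k) for x in grid]
    return sum(a != b for a, b in zip(preds, preds[1:]))


boundaries_k1 = count_boundaries(1)
boundaries_k5 = count_boundaries(5)
print("k=1 boundaries:", boundaries_k1)
print("k=5 boundaries:", boundaries_k5)
```

On this toy data, k = 1 carves out a separate region around the mislabeled point, while k = 5 outvotes it and leaves a single boundary between the two classes.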


https://www.coursera.org/learn/python-machine-learning/exam/W3RV7/module-1-quiz/attempt
