ml_interview
ml_interview
com
MOCK PAPER
#KeepLearning #KeepGrowing
#TheSmartWay
#foxmula
QUESTION 1
What’s the trade-off between bias and variance?
foxmula.com
QUESTION 2
What is the difference between supervised and unsupervised
machine learning?
Recall is also known as the true positive rate: the amount of positives
your model claims compared to the actual number of positives there
are throughout the data. Precision is also known as the positive
predictive value, and it is a measure of the amount of accurate
positives your model claims compared to the number of positives it
actually claims. It can be easier to think of recall and precision in the
context of a case where you’ve predicted that there were 10 apples
and 5 oranges in a case of 10 apples. You’d have perfect recall (there
are actually 10 apples, and you predicted there would be 10) but
66.7% precision because out of the 15 events you predicted, only 10
(the apples) are correct.
QUESTION 4
What is Bayes’ Theorem? How is it useful in a machine learning
context?
Bayes’ Theorem says no. It says that you have a (.6 * 0.05) (True
Positive Rate of a Condition Sample) / (.6*0.05)(True Positive Rate of a
Condition Sample) + (.5*0.95) (False Positive Rate of a Population) =
0.0594 or 5.94% chance of getting a flu.
Bayes’ Theorem is the basis behind a branch of machine learning that
most notably includes the Naive Bayes classifier. That’s something
important to consider when you’re faced with machine learning
interview questions.
QUESTION 5
What’s your favorite algorithm, and can you explain it to me in less
than a minute?
QUESTION 6
What’s the F1 score? How would you use it?
QUESTION 7
What’s the difference between Type I and Type II error?
QUESTION 9
How do you handle missing or corrupted data in a dataset?
In Pandas, there are two very useful methods: isnull() and dropna() that
will help you find columns of data with missing or corrupted data and
drop those values. If you want to fill the invalid values with a
placeholder value (for example, 0), you could use the fillna() method.
foxmula.com
QUESTION 10
Pick an algorithm. Write the psuedo-code for a parallel
implementation.