Machine Learning For Interviews
2. Advantages
3. Disadvantages
4. Feature scaling required? No
5. Missing Values
Naive Bayes can handle missing data. The algorithm treats each attribute
independently at both model construction time and prediction time. As such, if
a data instance has a missing value for an attribute, that attribute can be
skipped while building the model and left out when a probability is calculated
for a class value.
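The idea above can be sketched as a small categorical Naive Bayes that skips missing attributes both when counting and when scoring. This is a minimal illustration, not a library implementation: the class name `TinyNB`, the use of `None` to mark a missing value, the Laplace smoothing, and the toy data are all assumptions made for the example.

```python
import math
from collections import defaultdict


class TinyNB:
    """Categorical Naive Bayes that ignores missing (None) attribute values."""

    def fit(self, rows, labels):
        self.class_counts = defaultdict(int)
        # counts[(attribute_index, class)][value] = number of occurrences
        self.counts = defaultdict(lambda: defaultdict(int))
        self.values = defaultdict(set)  # distinct values seen per attribute
        for row, label in zip(rows, labels):
            self.class_counts[label] += 1
            for i, v in enumerate(row):
                if v is None:  # missing: skip this attribute for this row
                    continue
                self.counts[(i, label)][v] += 1
                self.values[i].add(v)
        return self

    def log_scores(self, row):
        total = sum(self.class_counts.values())
        scores = {}
        for c, n_c in self.class_counts.items():
            score = math.log(n_c / total)  # log prior
            for i, v in enumerate(row):
                if v is None:  # missing: contributes nothing to the score
                    continue
                seen = self.counts[(i, c)]
                n_obs = sum(seen.values())
                k = len(self.values[i] | {v})
                # Laplace smoothing (an assumption, not from the notes)
                score += math.log((seen.get(v, 0) + 1) / (n_obs + k))
            scores[c] = score
        return scores

    def predict(self, row):
        scores = self.log_scores(row)
        return max(scores, key=scores.get)


# Hypothetical toy data: (outlook, temperature) -> play?
rows = [("sunny", "hot"), ("sunny", None), ("rainy", "cool"),
        ("rainy", "cool"), (None, "hot")]
labels = ["no", "no", "yes", "yes", "no"]

model = TinyNB().fit(rows, labels)
print(model.predict(("sunny", None)))  # no
print(model.predict((None, "cool")))   # yes
```

Because each attribute contributes its own independent factor, dropping a factor for a missing value leaves the remaining factors valid, which is why no imputation step is needed in this sketch.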
6. Impact of outliers?
1. Sentiment Analysis
2. Spam classification
3. Twitter sentiment analysis
4. Document categorization
Linear Regression
2. Advantages
3. Disadvantages
4. Feature scaling required? Yes
6. Impact of outliers?
Linear regression assumes that the relationship between the independent and
dependent variables is linear. It is also important to check for outliers,
since least-squares estimates are sensitive to them.
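The sensitivity to outliers can be seen with a small ordinary least squares fit written out by hand. The helper `fit_line` and the toy data are hypothetical, chosen so the arithmetic is easy to check; this is a sketch of simple (one-feature) linear regression only.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


xs = [1, 2, 3, 4, 5]
ys = [2, 4, 6, 8, 10]            # exactly y = 2x
clean_slope, clean_intercept = fit_line(xs, ys)

ys_outlier = [2, 4, 6, 8, 40]    # last point replaced by one outlier
out_slope, out_intercept = fit_line(xs, ys_outlier)

print(clean_slope, clean_intercept)  # 2.0 0.0
print(out_slope, out_intercept)      # 8.0 -12.0
```

A single bad point out of five moves the slope from 2 to 8, because squared error penalizes large residuals heavily and the fit bends to reduce them.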
Types of Problems it can solve (Supervised)
1. Regression
Homework?
SVM
2. Advantages
4. Feature scaling required? Yes
6. Impact of outliers?
1. Classification
2. Regression
Classification
1. Confusion Matrix
2. Precision, Recall, F1 score
Regression
1. R2, Adjusted R2
2. MSE, RMSE, MAE
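The classification metrics listed above all derive from the four confusion-matrix counts. The following is a minimal sketch for the binary case; the function names and the toy labels are hypothetical, and returning 0.0 when a denominator is zero is a convention assumed here.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary problem."""
    tp = fp = fn = tn = 0
    for t, p in zip(y_true, y_pred):
        if p == positive and t == positive:
            tp += 1
        elif p == positive:
            fp += 1
        elif t == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def precision_recall_f1(y_true, y_pred, positive=1):
    tp, fp, fn, _ = confusion_counts(y_true, y_pred, positive)
    # Precision: of everything predicted positive, how much was right?
    precision = tp / (tp + fp) if tp + fp else 0.0
    # Recall: of everything actually positive, how much was found?
    recall = tp / (tp + fn) if tp + fn else 0.0
    # F1: harmonic mean of precision and recall
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]

print(confusion_counts(y_true, y_pred))     # (3, 1, 1, 3)
print(precision_recall_f1(y_true, y_pred))  # (0.75, 0.75, 0.75)
```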
Interview Questions:
1. Decision Tree
2. Entropy, Information Gain, Gini Impurity
3. Decision Tree Working For Categorical and Numerical Features
4. What are the scenarios where Decision Tree works well
5. Decision Tree: Low Bias and High Variance (Overfitting)
6. Hyperparameter Tuning Techniques
7. Library used for constructing decision tree
8. Impact of Outliers Of Decision Tree
9. Impact of missing values on Decision Tree
10. Does Decision Tree Require Feature Scaling?
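For the entropy, information gain, and Gini impurity question, the standard definitions can be written in a few lines. This is a sketch under the assumption that every node passed in is non-empty; the function names and the toy split are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (base 2) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def gini(labels):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def information_gain(parent, children):
    """Parent entropy minus the size-weighted entropy of the child nodes."""
    n = len(parent)
    weighted = sum(len(ch) / n * entropy(ch) for ch in children)
    return entropy(parent) - weighted


# Toy split: a 5/5 parent node divided into two mostly-pure children
parent = ["yes"] * 5 + ["no"] * 5
left = ["yes"] * 4 + ["no"] * 1
right = ["yes"] * 1 + ["no"] * 4

print(entropy(parent))                          # 1.0
print(gini(parent))                             # 0.5
print(round(information_gain(parent, [left, right]), 4))  # 0.2781
```

A tree learner evaluates candidate splits this way and keeps the one with the highest information gain (or the lowest weighted Gini impurity).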
2. Advantages
3. Disadvantages
3. Unstable: adding a new data point can change the chosen splits, so the
whole tree may have to be regenerated and its nodes recalculated.
4. Not suitable for large datasets: with a large amount of data, a single tree
may grow complex and overfit. In that case, use Random Forest instead of a
single Decision Tree.
4. Feature scaling required? No
6. Impact of outliers?
1. Classification
2. Regression
Performance Metrics
Classification
1. Confusion Matrix
2. Precision, Recall, F1 score
Regression
1. R2, Adjusted R2
2. MSE, RMSE, MAE
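The regression metrics listed above can be computed directly from the residuals. This is a minimal sketch; the function name `regression_metrics`, the `n_features` parameter (the number of predictors, needed for adjusted R2), and the toy values are assumptions for illustration.

```python
import math


def regression_metrics(y_true, y_pred, n_features):
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]

    mse = sum(e * e for e in errors) / n        # mean squared error
    rmse = math.sqrt(mse)                       # same units as the target
    mae = sum(abs(e) for e in errors) / n       # mean absolute error

    mean_y = sum(y_true) / n
    ss_res = sum(e * e for e in errors)         # residual sum of squares
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    r2 = 1 - ss_res / ss_tot
    # Adjusted R2 penalizes additional predictors
    adj_r2 = 1 - (1 - r2) * (n - 1) / (n - n_features - 1)

    return {"mse": mse, "rmse": rmse, "mae": mae, "r2": r2, "adj_r2": adj_r2}


y_true = [3.0, 5.0, 7.0, 9.0, 11.0]
y_pred = [2.0, 5.0, 8.0, 9.0, 11.0]

metrics = regression_metrics(y_true, y_pred, n_features=1)
for name, value in metrics.items():
    print(name, round(value, 4))
```

With these values MSE and MAE are both 0.4, R2 is 0.95, and adjusted R2 is slightly lower at about 0.9333 because of the penalty for the one predictor.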
Logistic Regression
2. Advantages
3. Disadvantages
4. Feature scaling required? Yes
5. Missing Values
6. Impact of outliers?
Like linear regression, logistic regression estimates are sensitive to unusual
observations: outliers, high-leverage points, and influential observations.
Outlier diagnostic methods for logistic regression have been demonstrated on
data sets from the medical domain.
1. Classification
Performance Metrics
Classification
1. Confusion Matrix
2. Precision, Recall, F1 score
1. Less prone to overfitting (though not immune)
3. Disadvantages
4. Feature scaling required? No
6. Impact of outliers?
Robust to Outliers
1. Classification
2. Regression
Performance Metrics
Classification
1. Confusion Matrix
2. Precision, Recall, F1 score
Regression
1. R2, Adjusted R2
2. MSE, RMSE, MAE
Missing Values
2. Advantages
Advantages of AdaBoost
1. Less prone to overfitting in practice, though noisy data can still cause it
3. Disadvantages
4. Feature scaling required? No
6. Impact of outliers?
1. Classification
2. Regression
Performance Metrics
Classification
1. Confusion Matrix
2. Precision, Recall, F1 score
Regression
1. R2, Adjusted R2
2. MSE, RMSE, MAE