Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 7
INTRODUCTION TO MACHINE LEARNING
1. What is Machine Learning?
- **Answer:** Machine Learning is a branch of Artificial Intelligence that focuses
on developing algorithms that allow computers to learn from data.
2. What is Supervised Learning?
- **Answer:** Supervised Learning is a type of Machine Learning where the model
is trained on labeled data, and it learns to make predictions or decisions based on that data.
3. What is Unsupervised Learning?
- **Answer:** Unsupervised Learning is a type of Machine Learning where the
model is trained on unlabeled data, and it learns to find patterns or structures in the data without explicit supervision.
4. What is Reinforcement Learning?
- **Answer:** Reinforcement Learning is a type of Machine Learning where an
agent learns to make decisions by interacting with an environment and receiving feedback in the form of rewards or penalties.
5. What is the goal of Machine Learning?
- **Answer:** The goal of Machine Learning is to develop models or algorithms
that can learn from data and make predictions or decisions without being explicitly programmed to do so.
6. What is feature engineering in Machine Learning?
- **Answer:** Feature engineering is the process of selecting, transforming, or
creating new features from raw data to improve the performance of machine learning models.
7. What is overfitting in Machine Learning?
- **Answer:** Overfitting occurs when a machine learning model learns the training data too well, capturing noise or random fluctuations in the data, which leads to poor performance on unseen data.
8. What is underfitting in Machine Learning?
- **Answer:** Underfitting occurs when a machine learning model is too simple to
capture the underlying structure of the data, leading to poor performance both on the training and unseen data.
9. What is the purpose of a validation set in Machine Learning?
- **Answer:** The purpose of a validation set is to evaluate the performance of a
machine learning model during training and tuning hyperparameters, without contaminating the test set with information from the training process.
10. What is a hyperparameter in Machine Learning?
- **Answer:** A hyperparameter is a configuration that is external to the model
and is not learned from data, but it controls the learning process, such as the learning rate or the number of hidden layers in a neural network.
11. What is the purpose of cross-validation in Machine Learning?
- **Answer:** Cross-validation is used to assess the performance and
generalization ability of a machine learning model by splitting the dataset into multiple subsets, training the model on different combinations of these subsets, and then evaluating its performance on the remaining data.
12. What is ensemble learning in Machine Learning?
models to improve performance, robustness, or generalization by aggregating their predictions.
13. What is bias in Machine Learning?
- **Answer:** Bias in machine learning refers to the error introduced by overly
simplistic assumptions made by the model during the learning process, leading to systematic inaccuracies in predictions. 14. What is variance in Machine Learning?
- **Answer:** Variance in machine learning refers to the sensitivity of the model's
predictions to changes in the training data, indicating how much the predictions would vary if the model were trained on different datasets.
15. What is the bias-variance tradeoff in Machine Learning?
- **Answer:** The bias-variance tradeoff is a fundamental concept in machine
learning that describes the tradeoff between the bias of the model (its tendency to underfit the data) and its variance (its sensitivity to fluctuations in the training data).
16. What is regularization in Machine Learning?
- **Answer:** Regularization is a technique used to prevent overfitting in machine
learning models by adding a penalty term to the loss function, which discourages overly complex models.
17. What is feature scaling in Machine Learning?
- **Answer:** Feature scaling is the process of standardizing or normalizing the
range of features in the dataset to ensure that they are on a similar scale, which can improve the performance and convergence of machine learning algorithms.
18. What is dimensionality reduction in Machine Learning?
- **Answer:** Dimensionality reduction is the process of reducing the number of
features in the dataset while preserving its essential information, which can help improve the efficiency, interpretability, and performance of machine learning models.
19. What is a confusion matrix in classification problems?
- **Answer:** A confusion matrix is a table that visualizes the performance of a
classification model by presenting the counts of true positive, true negative, false positive, and false negative predictions. 20. What is the ROC curve used for in binary classification?
- **Answer:** The ROC curve (Receiver Operating Characteristic curve) is used to
evaluate the performance of a binary classification model by plotting the true positive rate against the false positive rate at various threshold settings.
21. What is bias in Machine Learning?
- **Answer:** Bias in machine learning refers to the error introduced by overly
simplistic assumptions made by the model during the learning process, leading to systematic inaccuracies in predictions.
22. What is variance in Machine Learning?
- **Answer:** Variance in machine learning refers to the sensitivity of the model's
predictions to changes in the training data, indicating how much the predictions would vary if the model were trained on different datasets.
23. What is the bias-variance tradeoff in Machine Learning?
- **Answer:** The bias-variance tradeoff is a fundamental concept in machine
learning that describes the tradeoff between the bias of the model (its tendency to underfit the data) and its variance (its sensitivity to fluctuations in the training data).
24. What is regularization in Machine Learning?
- **Answer:** Regularization is a technique used to prevent overfitting in machine
learning models by adding a penalty term to the loss function, which discourages overly complex models.
25. What is feature scaling in Machine Learning?
- **Answer:** Feature scaling is the process of standardizing or normalizing the
range of features in the dataset to ensure that they are on a similar scale, which can improve the performance and convergence of machine learning algorithms.
26. What is dimensionality reduction in Machine Learning?
- **Answer:** Dimensionality reduction is the process of reducing the number of features in the dataset while preserving its essential information, which can help improve the efficiency, interpretability, and performance of machine learning models.
27. What is a confusion matrix in classification problems?
- **Answer:** A confusion matrix is a table that visualizes the performance of a
classification model by presenting the counts of true positive, true negative, false positive, and false negative predictions.
28. What is the ROC curve used for in binary classification?
- **Answer:** The ROC curve (Receiver Operating Characteristic curve) is used to
evaluate the performance of a binary classification model by plotting the true positive rate against the false positive rate at various threshold settings.
29. What is the difference between precision and recall in classification evaluation metrics?
- **Answer:** Precision measures the proportion of true positive predictions out of
all positive predictions made by the model, while recall measures the proportion of true positive predictions out of all actual positive instances in the dataset.
30. What is transfer learning in Machine Learning?
- **Answer:** Transfer learning involves leveraging knowledge gained from
solving one problem and applying it to a different but related problem, often by fine- tuning pre-trained models on new data.
31. What is the purpose of a validation set in Machine Learning?
- **Answer:** The purpose of a validation set is to evaluate the performance of a
machine learning model during training and tuning hyperparameters, without contaminating the test set with information from the training process.
32. What is a hyperparameter in Machine Learning?
- **Answer:** A hyperparameter is a configuration setting used to control the learning process of a machine learning algorithm, such as the learning rate in gradient descent or the depth of a decision tree.
33. What is cross-validation in Machine Learning?
- **Answer:** Cross-validation is a resampling technique used to assess the
generalization performance of a machine learning model by splitting the dataset into multiple subsets, training the model on some subsets, and evaluating it on the remaining subsets.
34. What is ensemble learning in Machine Learning?
models to improve performance, robustness, or generalization by aggregating their predictions.
35. What is bias in Machine Learning?
- **Answer:** Bias in machine learning refers to the error introduced by overly
simplistic assumptions made by the model during the learning process, leading to systematic inaccuracies in predictions.
36. What is variance in Machine Learning?
- **Answer:** Variance in machine learning refers to the sensitivity of the model's
predictions to changes in the training data, indicating how much the predictions would vary if the model were trained on different datasets.
37. What is the bias-variance tradeoff in Machine Learning?
- **Answer:** The bias-variance tradeoff is a fundamental concept in machine
learning that describes the tradeoff between the bias of the model (its tendency to underfit the data) and its variance (its sensitivity to fluctuations in the training data).
38. What is regularization in Machine Learning?
- **Answer:** Regularization is a technique used to prevent overfitting in machine learning models by adding a penalty term to the loss function, which discourages overly complex models.
39. What is feature scaling in Machine Learning?
- **Answer:** Feature scaling is the process of standardizing or normalizing the
range of features in the dataset to ensure that they are on a similar scale, which can improve the performance and convergence of machine learning algorithms.
40. What is dimensionality reduction in Machine Learning?
- **Answer:** Dimensionality reduction is the process of reducing the number of
features in the dataset while preserving its essential information, which can help improve the efficiency, interpretability, and performance of machine learning models.