ML
ML
Machine learning is a branch of artificial intelligence that enables algorithms to uncover hidden patterns
within datasets, allowing them to make predictions on new, similar data without explicit programming for
each task.
Machine learning’s impact extends to autonomous vehicles, drones, and robots, enhancing their
adaptability in dynamic environments. This approach marks a breakthrough where machines learn from
data examples to generate accurate outcomes, closely intertwined with data mining and data science.
1. **Image Recognition**: Machine learning is used in self-driving cars to recognize and classify objects,
such as pedestrians, traffic lights, and other vehicles.
3. **Speech Recognition**: Virtual assistants like Siri and Alexa use machine learning to recognize and
interpret voice commands.
4. **Natural Language Processing**: Machine learning is used to analyze and understand natural
language, enabling applications such as chatbots and language translation software.
5. **Predictive Maintenance**: Machine learning algorithms can be used to predict when equipment or
machines are likely to fail, allowing for proactive maintenance.
6. **Customer Churn Prediction**: Banks and telecommunications companies use machine learning to
predict which customers are likely to switch to a competitor.
7. **Fraud Detection**: Machine learning is used in finance to detect fraudulent transactions and prevent
identity theft.
8. **Sentiment Analysis**: Machine learning algorithms can analyze customer feedback and sentiment to
help businesses improve their products and services.
9. **Medical Diagnosis**: Machine learning is used in healthcare to diagnose diseases such as cancer,
diabetes, and Alzheimer's.
10. **Recommendation Systems**: Music streaming services like Spotify use machine learning to
recommend music to users based on their listening history.
11. **Robotics**: Machine learning is used in robotics to enable robots to learn from their environment
and adapt to new situations.
12. **Supply Chain Optimization**: Machine learning algorithms can be used to optimize supply chain
logistics and reduce costs.
13. **Autonomous Vehicles**: Machine learning is used in self-driving cars to enable them to navigate
complex road networks and make decisions in real-time.
16. **Quality Control**: Machine learning is used in manufacturing to detect defects and improve quality
control processes.
17. **Content Filtering**: Machine learning algorithms can be used to filter out inappropriate content
online, such as hate speech or pornography.
18. **Recommendation Systems for E-commerce**: Online retailers use machine learning to recommend
products to customers based on their browsing and purchasing history.
19. **Healthcare Analytics**: Machine learning is used in healthcare analytics to analyze patient data and
identify trends and patterns.
20. **Predictive Analytics**: Machine learning algorithms can be used to predict future outcomes, such
as stock prices, weather patterns, or election results.
o Regression
o a) Classification
o Classification algorithms are used to solve the classification problems in which the output
variable is categorical, such as "Yes" or No, Male or Female, Red or Blue, etc. The
classification algorithms predict the categories present in the dataset. Some real-world
examples of classification algorithms are Spam Detection, Email filtering, etc.
b) Regression
Regression algorithms are used to solve regression problems in which there is a linear relationship
between input and output variables. These are used to predict continuous output variables, such as
market trends, weather prediction, etc.
o Association
Clustering
The clustering technique is used when we want to find the inherent groups from the data. It is a way
to group the objects into a cluster such that the objects with the most similarities remain in one group
and have fewer or no similarities with the objects of other groups.
Association
Association rule learning is an unsupervised learning technique, which finds interesting relations among
variables within a large dataset. The main aim of this learning algorithm is to find the dependency of one
data item on another data item and map those variables accordingly so that it can generate maximum
profit. This algorithm is mainly applied in Market Basket analysis, Web usage mining, continuous
production, etc.
Q. Confusion matrix
A confusion matrix is a table that is used to evaluate the performance of a machine learning model on a
test dataset. It provides a summary of the predictions made by the model and the actual outcomes. The
matrix is called "confusion" because it confuses the predicted outcomes with the actual outcomes.
Q. TP (True Positive): A positive result that is correctly predicted by the model. This means that the
model correctly identified a true instance of the target class or concept.
Q.TN (True Negative): A negative result that is correctly predicted by the model. This means that the
model correctly identified a non-instance of the target class or concept.
Q.FP (False Positive): A negative result that is incorrectly predicted as positive by the model. This means
that the model incorrectly identified a non-instance of the target class or concept as an instance.
Q.FN (False Negative): A positive result that is incorrectly predicted as negative by the model. This
means that the model incorrectly identified a true instance of the target class or concept as a non-instance.
Q.Accuracy: The proportion of correct predictions out of all predictions made by the model. It is the sum
of True Positives (TP) and True Negatives (TN) divided by the total number of instances.
Q.Precision: The proportion of True Positives (TP) among all positive predictions made by the model. It
is the ratio of TP to the sum of TP and False Positives (FP).
Q.Recall (also known as Sensitivity or True Positive Rate): The proportion of True Positives (TP)
among all actual positive instances. It is the ratio of TP to the sum of TP and False Negatives (FN).
Q.F1 Score: Balanced measure of Precision and Recall for imbalanced datasets.
Q.Threshold: Value used to classify instances as positive or negative.
Q. False Positive Rate (FPR): Proportion of incorrect positives.
Q. AUC – ROC: Measure of a model's performance on binary classification tasks.
Q. R-squared (R2): Goodness of fit for linear models, measuring how well the model explains the data.