0% found this document useful (0 votes)

16 views72 pages

1 - AML - Manish

The document provides an overview of artificial intelligence (AI) and machine learning (ML), highlighting their definitions, goals, and various applications. It explains the differences between supervised, unsupervised, semi-supervised, and reinforcement learning, along with examples of algorithms used in each category. Additionally, it discusses structured and unstructured data, emphasizing the importance of ML in handling complex problems and adapting to new information.

Uploaded by

hetvibhora192

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views72 pages

1 - AML - Manish

Uploaded by

hetvibhora192

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 72

Machine Learning

What is artificial intelligence?

• Artificial intelligence is a broad field, which refers to the use of
technologies to build machines and computers that have the ability to
mimic cognitive functions associated with human intelligence, such as
being able to see, understand, and respond to spoken or written
language, analyze data, make recommendations, and more.
• The main goal of Artificial Intelligence is to develop self-
reliant machines that can think and act like humans.
• These machines can mimic human behavior and
perform tasks by learning and problem-solving.
• Most of the AI systems simulate natural intelligence to
solve complex problems.
Let’s have a look at an example of an AI-driven product - Amazon Echo.

Other examples of AI
Machine Translation such as Google Translate
Self Driving Vehicles such as Google’s Waymo
AI Robots such as Sophia and Aibo
Speech Recognition applications like Apple’s Siri or OK Google
Definition
What is Machine Learning?

• Machine learning is a subset of artificial intelligence that

automatically enables a machine or system to learn and improve from
experience. Instead of explicit programming, machine learning uses
algorithms to analyze large amounts of data, learn from the insights,
and then make informed decisions.

• Machine learning algorithms improve performance over time as they

are trained—exposed to more data. Machine learning models are the
output, or what the program learns from running an algorithm on
training data. The more data used, the better the model will get
Why use Machine Learning?
When building non-learners, we usually follow these steps:

1.We make rules

2.We write an algorithm
3.If the algorithm performs well, we deploy. If not, we go back to step 1

However, if the problem is complex, we'll likely end up with a long list of rules that
are hard to maintain and scale to other similar problems.

 An ML system would be much shorter, easier to maintain, and in many cases, more
accurate.
Traditional approach vs ML
Steps for Spam Detection Using Pattern Recognition
1.Analyze Spam Characteristics:
1. Identify common words/phrases (e.g., "4U," "credit
card," "free," "amazing").
2. Observe patterns in sender names, email bodies, etc.
2.Develop Detection Algorithms:
1. Create algorithms to detect identified patterns.
2. Flag emails as spam based on pattern matches.
3.Iterate and Refine:
1. Test the detection algorithm.
2. Continuously refine the algorithm for better accuracy.
spam filter using traditional programming techniques

Since the problem is not trivial, your program will likely become a long list of com ‐ plex rules—pretty hard to
maintain.
Traditional approach• Machine
vs ML Learning-Based Spam Filtering
• Automatic Learning:
• Detects frequent patterns in spam versus ham examples.
• Learns which words/phrases are strong spam indicators.
• Results in shorter, easier-to-maintain, and more accurate
programs.
• Adaptability:
• Machine learning models can adapt to new spam tactics
(e.g., "4U" to "For U").
• Reduces the need for manual updates to detection rules.
spam filter using ML • Ability to remain sensitive (Resilience Against
Evolving Tactics):
• Continually learns from new data, making it harder for
spammers to bypass filters.

• Higher Accuracy:
• More likely to identify spam correctly, reducing false
positives and negatives

Automatically adapting to change

Machine learning algorithms will generate logic.
We need not to generate code for each scenario, that will be handled by ML.
How Does Machine Learning Work?
• Machine learning accesses vast amounts of data (both structured and
unstructured) and learns from it to predict the future. It learns from
the data by using multiple algorithms and techniques. Below is a
diagram that shows how a machine learns from data.
Scenarios in which ML is useful
• In some cases where you cannot do programming. Example email spam
classifier in which you have to write program to classify a email as spam
or not spam. Then we will use if else ladder for discount ,huge, sale etc
• A very difficult scenario where number of cases will be uncountable that
you cannot write them. Example we need to classify whether in picture
dog is there or not. How humans learn it by looking at dogs from
childhood ,feed it in brain. Basically learning from data
• One very important usecase is Data Mining. Data mining is the use of
machine learning and statistical analysis to uncover patterns and other
valuable information from large data sets.
Machine Learning is great for
• Problems for which existing solutions require a lot of hand-tuning or long
lists of rules: one Machine Learning algorithm can often simplify code and
perform better.
• Complex problems for which there is no good solution at all using a
traditional approach: the best Machine Learning techniques can find a
solution.
• Fluctuating environments: a Machine Learning system can adapt to new
data.
• Getting insights about complex problems and large amounts of data.
Artificial intelligence (AI) vs.
machine learning (ML)
Structured and unstructured
data ?
Structured Data
The data which is to the point, factual, and highly organized is referred to as
structured data. It is quantitative in nature, i.e., it is related to quantities that means
it contains measurable numerical values like numbers, dates, and times .

It is easy to search and analyze structured data. Structured data exists in a predefined
format. Relational database consisting of tables with rows and columns is one of the best
examples of structured data. Structured data generally exist in tables like excel files and
Google Docs spreadsheets. The programming language SQL (structured query language)
is used for managing the structured data. SQL is developed by IBM in the 1970s and
majorly used to handle relational databases and warehouses.

Structured data is highly organized and understandable for machine language. Common
applications of relational databases with structured data include sales transactions, Airline
reservation systems, inventory control, and others.
Unstructured Data
• All the unstructured files, log files, audio files, and image files are
included in the unstructured data. Some organizations have much
data available, but they did not know how to derive data value since
the data is raw.
Unstructured data is the data that lacks any predefined model or format. It
requires a lot of storage space, and it is hard to maintain security in it. It
cannot be presented in a data model or schema. That's why managing,
analyzing, or searching for unstructured data is hard. It resides in various
different formats like text, images, audio and video files, etc. It is qualitative
in nature and sometimes stored in a non-relational database or NO-SQL.

It is not stored in relational databases, so it is hard for computers and

humans to interpret it. The limitations of unstructured data include the
requirement of data science experts and specialized tools to manipulate the
data.
Types of Machine Learning Systems
•By Supervision:
• Supervised Learning: Trained with labeled data.
• Unsupervised Learning: No labeled data; finds patterns on its own.
• Semi-supervised Learning: Partially labeled data.
• Reinforcement Learning: Learns through rewards and punishments.

•By Learning Process:

• Online Learning: Learns incrementally as new data arrives.
• Batch Learning: Learns from the entire dataset at once.

•By Approach:
• Instance-based Learning: Compares new data points to known ones.
• Model-based Learning: Detects patterns and builds a predictive model.
Types of Machine Learning

a. Based on the amount of supervision needed for ML algo to get train

Type of ML

1. Supervised Learning
In supervised learning, the data is already labeled, which means you know the target variable. Using this method of
learning, systems can predict future outcomes based on past data. It requires that at least an input and output
variable be given to the model for it to be trained.

Below is an example of a supervised learning method. The algorithm is trained using labeled data of dogs and cats.
The trained model predicts whether the new image is that of a cat or a dog.

Some examples of supervised learning include linear regression, logistic regression, support vector machines, Naive Bayes,
and decision tree.
Labelled and unlabeled data
https://www.geeksforgeeks.org/supervised-unsupervised-learning/
Supervised Learning
•Definition:
• Training data includes desired solutions, known as labels.
• The algorithm learns from labeled examples provided by a
"supervisor."
•Process:
• Training: Machine is trained using well-labeled data with correct
answers.
• Prediction: New examples are provided, and the algorithm uses the
learned patterns to predict the correct outcomes.
•Goal:
• To enable the algorithm to generalize from the labeled data and make
accurate predictions on new, unseen data.
Type of prediction

Image Source:https://www.mathworks.com/help/stats/machine-learning-in-matlab.html
Classification
•Purpose:
• Predict outcomes based on data features.
• Example: A bank predicting if a customer will default on a loan.

•Key Components:
• Features: Characteristics of the data (e.g., credit history, number of loans).
• Target: The outcome to be predicted (e.g., loan repayment status).

•Types:
• Binary Classification: Two possible outcomes (e.g., Yes/No, 1/0).
• Multiclass Classification: More than two possible outcomes.

•Common Algorithms:
• Logistic Regression
• Decision Tree Classifier
• K Nearest Neighbor Classifier
• Random Forest Classifier
• Neural Networks https://medium.com/@betulsamancii/supervised-vs-unsupervised-learning-630e024093bd
Regression

•Purpose:
•Predict continuous values rather than categories.
•Example: Predicting the price of a house based on its features.

•Key Components:
•Features: Attributes of the data (e.g., lot size, number of bedrooms, neighborhood).
•Target: The continuous value to be predicted (e.g., house price).

•Process:
•Train an algorithm to understand how features relate to the target value.
•Use the trained model to predict values for new data.

•Common Algorithms:

•Linear Regression
•Decision Tree Regressor
•K Nearest Neighbor Regressor
•Random Forest Regressor
•Neural Networks
Classification and Regression
• Some algorithms can be used for
both regression and classification
tasks

• For example, Logistic Regression is

commonly used for classification,

• as it can output a value that

corresponds to the probability of
belonging to a given class (e.g.,
20% chance of being spam).

https://static.javatpoint.com/tutorial/machine-learning/images/regression-vs-classification-in-machine-learning.png
Classification and Regression

Guess the type

Unsupervised learning

• In unsupervised learning, as you might guess, the training data is unlabeled.

• The system tries to learn without a teacher.
• Unsupervised learning algorithms employ unlabeled data to discover patterns
from the data on their own.
• The systems are able to identify hidden features from the input data provided.
• Once the data is more readable, the patterns and similarities become more
evident.
• Goal: Discover patterns, relationships, and structures within the data.
• Below is an example of an unsupervised learning method that trains a
model using unlabeled data. In this case, the data consists of different
vehicles. The purpose of the model is to classify each kind of vehicle.
Unsupervised learning algorithms
• Clustering
— K-Means
— DBSCAN
— Hierarchical Cluster Analysis (HCA)

Anomaly Detection and Novelty Detection

— One-class SVM
— Isolation Forest

Visualization and Dimensionality Reduction

— Principal Component Analysis (PCA)
— Kernel PCA
— Locally-Linear Embedding (LLE)

Association Rule Learning

— Apriori
— Eclat
Primary Categories:
1. Clustering:
• Groups similar data points into clusters.
• Maximizes intra-cluster similarity and minimizes inter-cluster similarity.

• Algorithms:
• K-Means
• DBSCAN
• Hierarchical Cluster Analysis (HCA)
• Gaussian Mixture Models (GMMs)
• Principal Component Analysis (PCA)
• Density-Based Spatial Clustering (DBSCAN)

• Applications:
• Customer segmentation, pattern recognition, anomaly detection
.
2. Association Rule Learning:
• Identifies relationships or associations between variables in datasets.
• Useful for uncovering patterns like "if X is purchased, Y is also likely.“

• Algorithms:
• Apriori
• Eclat
• FP-Growth Algorithm

• Applications:
• Market basket analysis, recommendation systems, product placement.
Others category
1. Anomaly Detection:

•Purpose: Identify unusual or outlier instances in data.

•Applications:
•Detecting fraudulent credit card transactions.
•Catching manufacturing defects.
•Removing outliers from datasets.

•Training: Mostly normal instances are shown; the system learns to recognize normal behavior and flag
anomalies.

•Tolerance: Can often handle a small percentage of outliers in the training set.

2. Novelty Detection:
•Purpose: Identify new or previously unseen data that deviates from known normal patterns.

•Training: System is trained only on normal data.

•Applications: Often used for detecting new types of anomalies or novel instances that were not present
during training.
Others
• Dimensionality reduction
• Types: PCA, Kernel PCA etc…
3. Semi-Supervised Learning:-
•Definition:
• Combines elements of supervised and unsupervised learning.
• Utilizes both a limited amount of labeled data and a large
amount of unlabeled data.

•Training Process:
• Labeled Data:
• Provides supervision and guidance for the model.
• Helps the algorithm learn the relationships between inputs
and outputs.
• Unlabeled Data:
• Allows the model to identify additional patterns and
structures in the data.
• Enhances the learning process by providing more context
beyond the labeled data.
•Goal:
• Improve model accuracy by leveraging both labeled and unlabeled data.
• Develop a better representation of the data, leading to more accurate predictions.
• Applications:
• Natural Language Processing (NLP): Handling large text corpora where labeling is costly.
• Image Classification: Utilizing vast amounts of unlabeled images to improve classification
performance.
• Medical Diagnosis: Combining limited labeled cases with abundant unlabeled data to enhance
diagnostic models.

• Advantages:
• Reduces the need for extensive labeled datasets, which are often expensive or time-consuming to
obtain.
• Can lead to better performance compared to using only labeled data due to the richer information
from unlabeled data.

• Examples of Techniques:
• Self-Training: Using the model’s own predictions on unlabeled data to iteratively improve its accuracy.
• Co-Training: Using multiple models trained on different features or views of the data to label the
unlabeled data and improve each other's performance.
• Graph-Based Methods: Constructing a graph where nodes represent data points and edges represent
similarities, using this structure to propagate labels from labeled to unlabeled data.
Reinforcement Learning
•Definition:

•A type of machine learning where an algorithm learns through interaction with an environment.

•Receives feedback in the form of rewards or penalties to learn a policy that maximizes cumulative
rewards over time.

•Process:

•Action: The algorithm takes actions in an environment.

•Reward Signal: The environment provides feedback based on the quality of actions (rewards for good
actions, penalties for bad actions).

•Policy: The algorithm learns a mapping from states to actions (policy) that maximizes the total reward.

•Goal:

•To develop an optimal policy that leads to the highest possible cumulative reward.
•Applications:

•Game Playing: Algorithms

learn to play games (e.g., chess,
Go) by playing repeatedly and
learning from rewards and
penalties.

•Robotics: Robots learn to

perform tasks (e.g., walking,
grasping objects) by interacting
with their environment and
improving their actions based on
feedback.

•Recommendation Systems: Algorithms personalize recommendations by learning from user interactions and
feedback.
•Key Concepts: LBH
• Trial-and-Error Learning: The algorithm learns from its experiences, improving its performance over time
through experimentation.
• Exploration vs. Exploitation: Balancing between trying new actions (exploration) and using known actions
that give high rewards (exploitation).
• Reward Function: The design of the reward function influences the behavior and learning efficiency of the
algorithm.
• Value Function: Estimates the expected reward for a given state or action to guide decision-making.

•Challenges:
• Designing the Reward Function: Crafting an effective reward function that aligns with the desired outcome
can be complex.
• Scalability: Reinforcement learning can be computationally intensive and may require extensive training time.
• Exploration: Ensuring adequate exploration of the action space to discover optimal strategies can be
challenging.
Type of ML
Another criterion used to classify Machine
Learning systems is whether or not the
system can learn incrementally from a
stream of incoming data.

• Online vs Batch ML
• Batch Learning
• Definition:
• Machine learning where the model is trained offline on the entire dataset in one
go.
• Model updates occur in batch mode after processing all training data at once.
• Process:
• Training: Model is trained once on a fixed dataset.
• Prediction: After training, the model is used to make predictions without further
updates.
• Suitability:
• Best for static datasets where data does not change over time.
• Useful when training on large datasets is too computationally expensive to process
in real-time.
•Advantages: OT,CR
•Efficient for large datasets when offline training is feasible.
•Training can be done in parallel, utilizing significant computational resources.

•Disadvantages: A,RT
•Adaptability: Model does not adapt to changes in the data distribution over time.
•Re-training: New models must be trained from scratch if data distribution changes,
which can be time-consuming and computationally expensive.

•Applications:
•Natural Language Processing (NLP)
•Computer Vision
•Recommendation Systems

•Quality:
•Depends on the quality and quantity of training data and choice of algorithms.
https://www.linkedin.com/pulse/types-machine-learning-techniques-training-method-based-sharma
• Online Learning
• Definition:
• Machine learning where the model is updated incrementally as new data arrives in real-time.
• Training occurs on small subsets of data, with continuous updates as new data is received.
• Process:
• Training: Model is trained on a small subset of data, updated continuously with new data.
• Adaptation: Model adapts to changes in data distribution over time.
• Suitability:
• Ideal for streaming data and scenarios where data distribution changes frequently.
• Useful for applications with evolving data, such as financial market analysis, real-time
recommendations, and customer behavior analysis.
•Advantages: A,E

•Adaptability: Model can learn and adapt to changes (dynamic ) in data distribution over time.
•Efficiency: Computationally efficient as only a subset of data is processed at any time.

•Disadvantages: AC,DM
•Algorithm Choice: Performance depends on the choice of online learning algorithms and
their update rules.
•Data Management: Requires efficient handling of data streams and updates.

•Applications:
•Financial Market Analysis
•Customer Behavior Analysis
•Real-time Recommendation Systems

•Quality:
•Depends on the size of the data subset used for training, the rate of new data arrival, and the
algorithm's effectiveness.
Instance-Based Versus Model-Based Learning
• One more way to categorize Machine Learning systems is by how they
generalize.
• Most Machine Learning tasks are about making predictions.
• This means that given a number of training examples, the system
needs to be able to generalize to examples it has never seen before.
• Having a good performance measure on the training data is good, but
insufficient; the true goal is to perform well on new instances.
Instance-Based
•Definition:
•A machine learning approach where the model learns from
examples by storing them and then making predictions
based on the similarity of new examples to these stored
instances.

•Predictions are made by comparing new instances to the

stored instances using a similarity measure.

•Process:

•Storage: The system memorizes training examples (e.g.,

known spam emails).

•Similarity Measure: When a new example (e.g., a new

email) needs to be classified, it is compared to stored
instances using a similarity metric (e.g., word overlap).

•Classification: The new example is classified based on

the class of the most similar stored instances (e.g., if most
similar instances are spam, the new email is flagged as
spam).
•Spam Filter Example:

•Basic Approach: The filter flags emails that are identical to known spam emails.

•Improved Approach: The filter flags emails that are similar to known spam emails using a
similarity measure.

•Similarity Measure: One simple measure is to count the number of common words between the new email and
known spam emails.
•Classification Rule: If the new email shares many words with known spam, it is flagged as spam.

•Characteristics:
•Lazy Learning: The model does not learn a general rule but instead stores and compares
individual examples.

•Computational Cost: The cost is incurred at prediction time, as similarity comparisons are made
on-the-fly.

•No Explicit Model: The system relies on stored instances and does not create a general model
during training.
•Advantages: (SF)
• Simplicity: Easy to implement and understand.
• Flexibility: Can handle noisy or irrelevant features as the prediction relies on similar instances
rather than a fixed model.

•Disadvantages: (S,G)
• Scalability: May become inefficient with large datasets due to the need to compare each new
instance to many stored instances.
• Prediction for Outliers: May struggle with instances significantly different from stored
examples.
• No Generalization: The system only generalizes based on the similarity of stored instances
and does not create a generalized model.

•Example Visualization: (refer fig)

•Figure Example: A new instance is classified based on the majority class of the most similar
instances (e.g., a new shape classified as a triangle because most similar shapes are triangles).
• Applications:
• Classification: Assigning class labels based on the majority class of similar
instances (e.g., k-Nearest Neighbors).
• Regression: Predicting values based on the average or weighted average of
similar instances.
• Anomaly Detection: Identifying outliers by comparing new instances to
stored normal instances.

• Examples:
• k-Nearest Neighbors (k-NN): Classifies new instances based on the majority
vote of the k closest stored instances.
• Case-Based Reasoning: Uses past cases or experiences to solve new
problems.
Model Based
•Definition:

•Machine learning approach where a mathematical model is explicitly learned from training data to map inputs to
outputs.
•The learned model is then used to make predictions on new data.

•Process:

•Training: A model is trained on a dataset to learn a mapping from inputs to outputs.

•Prediction: The trained model is used to predict outcomes for new, unseen data based on the learned mapping.

•Characteristics:

•Generalization: The model generalizes from the training data, allowing it to make predictions for inputs that
differ from those in the training set.
•Model Definition: Involves defining and learning a model structure (e.g., linear regression, decision trees,
neural networks)
•Advantages: A,E,I,(ou)

•Adaptability: Can make predictions for new inputs that are not present in the training set.

•Interpretability: Some models (e.g., linear regression, decision trees) offer clear insights into the relationship
between inputs and outputs.

•Efficiency: Once trained, predictions can be made quickly without needing to reference the entire training
dataset.

•Disadvantages: COT

•Complexity: Models may not always capture complex, non-linear relationships accurately.

•Overfitting: Risk of overfitting to the training data, especially with complex models and small datasets.
•Training Time: Some models (e.g., neural networks) can be computationally expensive to train.

Applications:
•Regression: Predicting continuous values (e.g., house prices, temperature).
•Classification: Assigning categorical labels (e.g., spam detection, image classification).
•Time Series Forecasting: Predicting future values based on past data (e.g., stock prices, weather forecasting).
• Types of Models:

• Linear Models: (e.g., Linear Regression, Logistic Regression) Model the

relationship as a linear function of the inputs.

• Decision Trees: Partition the input space into regions based on feature
values, making predictions based on majority class or average value in each
region.

• Neural Networks: Learn complex, non-linear mappings using multiple

layers of interconnected nodes (neurons).

• Support Vector Machines: Classify data by finding the hyperplane that best
separates different classes.
Model-based learning
Model based
• Example-
• Suppose you want to know if money makes people happy, so you download the Better Life Index data from the OECD’s
website and stats about gross domestic product (GDP) per capita from the IMF’s website. Then you join the tables and sort
by GDP per capita.
• Although the data is noisy (i.e., partly random), it
looks like life satisfaction goes up more or less
linearly as the country’s GDP per capita increases.
• So you decide to model life satisfaction as a linear
function of GDP per capita. A few possible linear models
• This step is called model selection:
• linear model of life satisfaction with just one
attribute, GDP per capita.
• life_satisfaction = θ0 + θ1 × GDP_per_capita
• This model has two model parameters, θ0 and θ1.
By tweaking these parameters, you can make your
model represent any linear function, as shown

Discuss code 01 Final model

Main Challenges of Machine Learning
• Data Collection
• Easily available for college/ in structured formate
• Data Gathering is a very intensive task
• Data can be gathered through API/Web scrapping/ Images

• Insufficient Quantity of Training Data/Labelled data

• Assume two algorithm working on same problem

• Model A: Having good results but only 100 rows of data
• Model B: Not so good but have 100k rows of data.

• Which model will work better??

• More data with not great algo still can solve the problem
• Termed as Unreasonable effectiveness of data
• Huge data doesn’t matter the algorithm
• However in real practice we never has a huge
• Doing a labelling a data is also a very time consuming
Challenges……
• Non-representative Training Data
• Initial model (i.e. wit
• Now you have more data: A new model
• Lets have data collection related to who will win the WC cricket
• Only India: not a good representation
• Sufficient data gathering from various countries
• Its referred as Sampling noise
• Sampling bias: Even though huge data but not properly sampled like: 20 countries but
asked question to all Indians

• Poor-Quality Data
• Data wrangling , quality of data, missing value, non structured data
• 60% is data cleaning
• Garbage in garbage out
Irrelevant Features
• No contribution of columns
• Increase in cost of computation
• Example : Whether person will come or not (Age, Wt, ht, location )
• Location no contribution
• Wt, ht : BMI (single )
• Very important overgeneralization in Humans and Machines: Just as humans
may overgeneralize based on limited experiences (e.g., assuming all taxi
drivers in a foreign country are thieves after one bad experience),
 Overfitting the Training Data machines can also overgeneralize if not carefully trained.

Overfitting in Machine Learning: Overfitting occurs when a

machine learning model performs exceptionally well on training data
but fails to generalize to new, unseen data. This is akin to making
assumptions based on limited examples.

Prevention is Key: To avoid overfitting, it's essential to implement

strategies like hyperparameter tuning, cross-validation, regularization,
and using more diverse training data to ensure the model captures
general patterns rather than specific details.
 Overfitting happens when the model is too complex relative to the amount and noisiness
of the training data. The possible solutions are:
• To simplify the model by selecting one with fewer parameters (e.g., a linear model rather
than a high-degree polynomial model), by reducing the number of attributes in the training
data or by constraining the model
• To gather more training data
• To reduce the noise in
 Constraining a model to make it simpler and reduce the risk of overfitting is called
regularization.

 The amount of regularization to apply during

learning can be controlled by a hyperparameter.
• Under fitting the Training Data
• underfitting is the opposite of overfitting:
• model is too simple to learn the underlying
structure of the data.
• E.g. a linear model of life satisfaction is prone to
underfit;
• reality is just more complex than the model,
• so its predictions are bound to be inaccurate,
even on the training examples.
The main options to fix this problem are:
• Selecting a more powerful model, with more
parameters
• Feeding better features to the learning algorithm
(feature engineering)
• Reducing the constraints on the model (e.g.,
reducing the regularization hyperparameter)
Testing and Validation

It is common to use 80% of the data for training and hold out 20% for testing. However, this depends on the size of the
dataset: if it contains 10 million instances, then holding out 1% means your test set will contain 100,000 instances: that’s
probably more than enough to get a good estimate of the generalization error.
Testing and Validation

• The only way to know how well a model will generalize to new cases is to actually try it out on new cases.
• One way to put your model in production and monitor how well it performs.
• This works well, but if your model is horribly bad, your users will complain—not the best idea.
• A better option is to split your data into two sets: the training set and the test set.
• As these names imply, you train your model using the training set, and you test it using the test set.
• The error rate on new cases is called the generalization error (or out-of sample error), a
• nd by evaluating your model on the test set, you get an estimate of this error.
• This value tells you how well your model will perform on instances it has never seen before.
• If the training error is low (i.e., your model makes few mistakes on the training set) but the generalization
error is high, it means that your model is overfitting the training data
Hyperparameters Tuning and Model Selection
Machine Learning Development Lifecycle
(MLDLC)
• https://medium.com/@pp1222001/decoding-the-machine-learning-d
evelopment-lifecycle-mldlc-4133e05ab8c7
How are AI and ML connected?
• While AI and ML are not quite the same thing, they are closely connected. The
simplest way to understand how AI and ML relate to each other is:

• AI is the broader concept of enabling a machine or system to sense, reason, act, or

adapt like a human
• ML is an application of AI that allows machines to extract knowledge from data
and learn from it autonomously
• One helpful way to remember the difference between machine learning and
artificial intelligence is to imagine them as umbrella categories. Artificial
intelligence is the overarching term that covers a wide variety of specific
approaches and algorithms. Machine learning sits under that umbrella, but so do
other major subfields, such as deep learning, robotics, expert systems, and natural
language processing
Differences between AI and ML

• While artificial intelligence encompasses the idea of a machine that can mimic
human intelligence, machine learning does not. Machine learning aims to teach a
machine how to perform a specific task and provide accurate results by
identifying patterns.

• Let’s say you ask your Google Nest device, “How long is my commute today?” In
this case, you ask a machine a question and receive an answer about the estimated
time it will take you to drive to your office. Here, the overall goal is for the device
to perform a task successfully—a task that you would generally have to do
yourself in a real-world environment (for example, research your commute time).

• In the context of this example, the goal of using ML in the overall system is not to
enable it to perform a task. For instance, you might train algorithms to analyze
live transit and traffic data to forecast the volume and density of traffic flow.
However, the scope is limited to identifying patterns, how accurate the prediction
was, and learning from the data to maximize performance for that specific task.
Benefits of using AI and ML together
• AI and ML bring powerful benefits to organizations of all shapes and
sizes, with new possibilities constantly emerging. In particular, as the
amount of data grows in size and complexity, automated and
intelligent systems are becoming vital to helping companies automate
tasks, unlock value, and generate actionable insights to achieve better
outcomes.
• Here are some of the business benefits of using artificial intelligence
and machine learning:
Applications of AI and ML

• Artificial intelligence and machine learning can be applied in

many ways, allowing organizations to automate repetitive or
manual processes that help drive informed decision-making.
• Companies across industries are using AI and ML in various
ways to transform how they work and do business.
Incorporating AI and ML capabilities into their strategies and
systems helps organizations rethink how they use their data
and available resources, drive productivity and efficiency,
enhance data-driven decision-making through predictive
analytics, and improve customer and employee experiences.
Here are some of the most common applications of AI and ML:

• Healthcare and life sciences

Patient health record analysis and insights, outcome forecasting and modeling, accelerated drug
development, augmented diagnostics, patient monitoring, and information extraction from clinical notes.

• Manufacturing
Production machine monitoring, predictive maintenance, IoT analytics, and operational efficiency.

• Ecommerce and retail

Inventory and supply chain optimization, demand forecasting, visual search, personalized offers and
experiences, and recommendation engines.

• Financial services
Risk assessment and analysis, fraud detection, automated trading, and service processing optimization.

• Telecommunications
Intelligent networks and network optimization, predictive maintenance, business process automation,
upgrade planning, and capacity forecasting.

21AI63 Module 1
No ratings yet
21AI63 Module 1
38 pages
BDO EGOV Registration Via Bancnet
No ratings yet
BDO EGOV Registration Via Bancnet
5 pages
Machine Learning Tutorial
100% (1)
Machine Learning Tutorial
775 pages
Intelineo 6000 2.0.0 Global Guide
No ratings yet
Intelineo 6000 2.0.0 Global Guide
1,541 pages
Desktop Publishing Lesson 1
No ratings yet
Desktop Publishing Lesson 1
11 pages
Answer: D E
100% (1)
Answer: D E
32 pages
Big-Data Unit-3
100% (1)
Big-Data Unit-3
54 pages
ML m1-m5 NOTES
No ratings yet
ML m1-m5 NOTES
160 pages
Lesson 2 - Fundamentals of Machine Learning and Deep Learning
No ratings yet
Lesson 2 - Fundamentals of Machine Learning and Deep Learning
100 pages
06 Spring Cloud Config
No ratings yet
06 Spring Cloud Config
30 pages
Unit 1: Shobana T S Assistant Professor Dept. of ISE, BMSCE
No ratings yet
Unit 1: Shobana T S Assistant Professor Dept. of ISE, BMSCE
114 pages
Overview of Machine Learning PDF
100% (1)
Overview of Machine Learning PDF
57 pages
SSH
No ratings yet
SSH
538 pages
Machine Learning Full PDF
No ratings yet
Machine Learning Full PDF
149 pages
Chapter 01 Notes
No ratings yet
Chapter 01 Notes
11 pages
Design An Indoor Positioning System Using ESP32 Ultra-Wide Band Module
No ratings yet
Design An Indoor Positioning System Using ESP32 Ultra-Wide Band Module
13 pages
Basics of Machine Learning
No ratings yet
Basics of Machine Learning
20 pages
Unit V
No ratings yet
Unit V
67 pages
Unit-3 Machine Learning
No ratings yet
Unit-3 Machine Learning
81 pages
BE02000041 Funda of AI Unit 3 Basics of ML
No ratings yet
BE02000041 Funda of AI Unit 3 Basics of ML
86 pages
Addressing Table: PT Activity 3.6.1: Packet Tracer Skills Integration Challenge
No ratings yet
Addressing Table: PT Activity 3.6.1: Packet Tracer Skills Integration Challenge
6 pages
ML-Unit 1 Merged
No ratings yet
ML-Unit 1 Merged
151 pages
2 - Machine Learning - 130824
No ratings yet
2 - Machine Learning - 130824
81 pages
Module 4 ISML
No ratings yet
Module 4 ISML
88 pages
Module1 ML
No ratings yet
Module1 ML
114 pages
ML Notes
No ratings yet
ML Notes
113 pages
User Manual Ecu Test Bench 2020
100% (1)
User Manual Ecu Test Bench 2020
24 pages
Machine Learning
No ratings yet
Machine Learning
97 pages
Zenith - Bank - Token - Application Form For Internet Banking Token
75% (4)
Zenith - Bank - Token - Application Form For Internet Banking Token
2 pages
Unit 1
No ratings yet
Unit 1
55 pages
Machine Learning - UNIT I
No ratings yet
Machine Learning - UNIT I
70 pages
Machine Hallucinations Architecture and Artificial Intelligence Architectural Design 1st Edition Neil Leach (Editor) All Chapters Instant Download
No ratings yet
Machine Hallucinations Architecture and Artificial Intelligence Architectural Design 1st Edition Neil Leach (Editor) All Chapters Instant Download
67 pages
Machine Learning Unit 1: A Machine Can Learn If It Can Gain More Data To Improve Its Performance
No ratings yet
Machine Learning Unit 1: A Machine Can Learn If It Can Gain More Data To Improve Its Performance
62 pages
5 - AML Lecture 5 - Linear Regression
No ratings yet
5 - AML Lecture 5 - Linear Regression
56 pages
ML ppt1
No ratings yet
ML ppt1
39 pages
Unit 1
No ratings yet
Unit 1
62 pages
Overview of Machine Learning
No ratings yet
Overview of Machine Learning
60 pages
Machine Learning: Presentation
100% (2)
Machine Learning: Presentation
23 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
78 pages
Unit 1 Notes
No ratings yet
Unit 1 Notes
68 pages
Machine Learning
No ratings yet
Machine Learning
73 pages
ML
No ratings yet
ML
19 pages
3 - AML - Lecture 3 - Feature Engg
No ratings yet
3 - AML - Lecture 3 - Feature Engg
39 pages
Module 1 Notes
No ratings yet
Module 1 Notes
56 pages
Cognate X Spidey
No ratings yet
Cognate X Spidey
46 pages
Introduction To ML
100% (1)
Introduction To ML
39 pages
Unit-1 New
No ratings yet
Unit-1 New
48 pages
Machine Learning - Lec1
No ratings yet
Machine Learning - Lec1
56 pages
Unit 3
No ratings yet
Unit 3
33 pages
ML-Unit 1
No ratings yet
ML-Unit 1
43 pages
Machine Learning
No ratings yet
Machine Learning
25 pages
MEDICINE Reports Idkidnf
No ratings yet
MEDICINE Reports Idkidnf
49 pages
Ai 1
No ratings yet
Ai 1
28 pages
Advanced Archive Recovery Software
No ratings yet
Advanced Archive Recovery Software
44 pages
Module 1 Notes
No ratings yet
Module 1 Notes
38 pages
4 - Outliers - +transformaations ML
No ratings yet
4 - Outliers - +transformaations ML
28 pages
F5 202 - Pre-Sales Fundamentals Study Guide - 1st
No ratings yet
F5 202 - Pre-Sales Fundamentals Study Guide - 1st
2 pages
CE469 - Introduction To Machine Learning: Lecturer Contact
No ratings yet
CE469 - Introduction To Machine Learning: Lecturer Contact
33 pages
Chapter 1
No ratings yet
Chapter 1
30 pages
Chapter 1
No ratings yet
Chapter 1
24 pages
Unit-1 Part-1 Material
No ratings yet
Unit-1 Part-1 Material
45 pages
Management Information System in Construction Project: Semester Genap 2017/2018
No ratings yet
Management Information System in Construction Project: Semester Genap 2017/2018
52 pages
Chapter 1
No ratings yet
Chapter 1
40 pages
ML - Module 1
No ratings yet
ML - Module 1
30 pages
Module 1
No ratings yet
Module 1
34 pages
Ml-Unit 1
No ratings yet
Ml-Unit 1
53 pages
ML Chapter 1
No ratings yet
ML Chapter 1
37 pages
Non Academic Competitions Guidelines and Mechanics
No ratings yet
Non Academic Competitions Guidelines and Mechanics
12 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
Ai Faheem
No ratings yet
Ai Faheem
16 pages
GettingStartedwithMachineLearningML DataScience365
No ratings yet
GettingStartedwithMachineLearningML DataScience365
12 pages
Department of Emerging Technology (SB) III B.Tech - I Semester
No ratings yet
Department of Emerging Technology (SB) III B.Tech - I Semester
12 pages
Machine Learning Lecture Notes
No ratings yet
Machine Learning Lecture Notes
17 pages
Aiba-Module 1 Machine Learning
No ratings yet
Aiba-Module 1 Machine Learning
23 pages
ML Basic
No ratings yet
ML Basic
12 pages
Read Me
No ratings yet
Read Me
7 pages
Celebrate Christmas Buy Verified Linkedin Accounts Online - Boost Your Career
No ratings yet
Celebrate Christmas Buy Verified Linkedin Accounts Online - Boost Your Career
15 pages
Paper 19-Deep Learning Based Neck Models For Object Detection
No ratings yet
Paper 19-Deep Learning Based Neck Models For Object Detection
7 pages
Machine Learning
No ratings yet
Machine Learning
8 pages
PROJECTOR CATALOUGE - I9 PRO-MAX ANDROID - 2024
No ratings yet
PROJECTOR CATALOUGE - I9 PRO-MAX ANDROID - 2024
3 pages
IMP Hierarchical Clustering
No ratings yet
IMP Hierarchical Clustering
3 pages
ML Notes
No ratings yet
ML Notes
15 pages
Unix Commands List
No ratings yet
Unix Commands List
6 pages
Group4 - EMP - Oct - 20 - MIS - Project - AWS Report
No ratings yet
Group4 - EMP - Oct - 20 - MIS - Project - AWS Report
15 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
6 pages
Ipc 4u-4416
No ratings yet
Ipc 4u-4416
3 pages
Exp:0-10 Yrs Difficulty Level: Easy To Moderate: Common and Random Questions
No ratings yet
Exp:0-10 Yrs Difficulty Level: Easy To Moderate: Common and Random Questions
3 pages
Unit I - Machine Learning at CSJMU - 6 Slides Handouts
No ratings yet
Unit I - Machine Learning at CSJMU - 6 Slides Handouts
4 pages
SR6004 Spec Sheet
No ratings yet
SR6004 Spec Sheet
2 pages
AutoCAD 3D Training
No ratings yet
AutoCAD 3D Training
5 pages
Machine Learning
No ratings yet
Machine Learning
3 pages
Database Management Systems - Theory
No ratings yet
Database Management Systems - Theory
3 pages
Canon Ir2016
No ratings yet
Canon Ir2016
5 pages
Scrum Term Worksheet
No ratings yet
Scrum Term Worksheet
1 page