ML UNIT I NEW
ML UNIT I NEW
UNIT I
Introduction:
Machine Learning is an AI technique
that teaches computers to learn from
experience. Machine learning
algorithms use computational methods
to “learn” information directly from
data . The algorithms adaptively
improve their performance as the
number of samples available for
learning increases. Deep learning is a
specialized form of machine learning.
Machine learning is a powerful tool that
can be used to solve a wide range of
problems. It allows computers to learn
from data, without being explicitly
programmed. This makes it possible to
build systems that can automatically
improve their performance over time by
learning from their experiences.
Machine learning is an application of
artificial intelligence that uses statistical
techniques to enable computers to learn
and make decisions without being
explicitly programmed. It is predicated
on the notion that computers can learn
from data, spot patterns, and make
judgments with little assistance from
humans.
It is a subset of Artificial Intelligence. It
is the study of making machines more
human-like in their behavior and
decisions by giving them the ability to
learn and develop their own programs.
This is done with minimum human
intervention, i.e., no explicit
programming. The learning process is
automated and improved based on the
experiences of the machines throughout
the process.
Good quality data is fed to the
machines, and different algorithms are
used to build ML models to train the
machines on this data. The choice of
algorithm depends on the type of data at
hand and the type of activity that needs
to be automated.
Now we may wonder, how is it
different from traditional
programming? Well, in traditional
programming, we would feed the input
data and a well-written and tested
program into a machine to generate
output. When it comes to machine
learning, input data, along with the
output, is fed into the machine during
the learning phase, and it works out a
program for itself. To understand this
better, refer to the illustration below,
Preparation:
The collected data can be in a raw
form which can’t be directly fed to
the machine. So, this is a process of
collecting datasets from different
sources, analyzing these datasets and
then constructing a new dataset for
further processing and exploration.
This preparation can be performed
either manually or from the automatic
approach. Data can also be prepared
in numeric forms also which would
fasten the model’s learning.
Example:
An image can be converted to a
matrix of N X N dimensions, the
value of each cell will indicate the
image pixel.
Input:
Now the prepared data can be in the
form that may not be machine-
readable, so to convert this data to the
readable form, some conversion
algorithms are needed. For this task to
be executed, high computation and
accuracy is needed.
Example:
Data can be collected through the
sources like MNIST Digit
data(images), Twitter comments,
audio files, video clips.
Processing:
This is the stage where algorithms
and ML techniques are required to
perform the instructions provided
over a large volume of data with
accuracy and optimal computation.
Output:
In this stage, results are procured by
the machine in a meaningful manner
which can be inferred easily by the
user. Output can be in the form of
reports, graphs, videos, etc
Storage:
This is the final step in which the
obtained output and the data model
data and all the useful information are
saved for future use.
Advantages of data processing in
Machine Learning:
1. Improved model performance: Data
processing helps improve the
performance of the ML model by
cleaning and transforming the data
into a format that is suitable for
modeling.
2. Better representation of the data:
Data processing allows the data to be
transformed into a format that better
represents the underlying
relationships and patterns in the data,
making it easier for the ML model to
learn from the data.
3. Increased accuracy: Data processing
helps ensure that the data is accurate,
consistent, and free of errors, which
can help improve the accuracy of the
ML model.
Disadvantages of data processing in
Machine Learning:
1. Time-consuming: Data processing
can be a time-consuming task,
especially for large and complex
datasets.
2. Error-prone: Data processing can be
error-prone, as it involves
transforming and cleaning the data,
which can result in the loss of
important information or the
introduction of new errors.
3. Limited understanding of the data:
Data processing can lead to a limited
understanding of the data, as the
transformed data may not be
representative of the underlying
relationships and patterns in the data.