0% found this document useful (0 votes)
2 views

AI

The document outlines the fundamentals of machine learning, detailing the process of building a model which includes data collection, algorithm selection, training, and evaluation. It describes different types of training data (labeled, unlabeled, structured, and unstructured) and categorizes machine learning algorithms into supervised, unsupervised, reinforcement, and semi-supervised learning. Additionally, it explains inferencing as the application of the trained model for making predictions, distinguishing between batch and real-time inferencing.

Uploaded by

amanda
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views

AI

The document outlines the fundamentals of machine learning, detailing the process of building a model which includes data collection, algorithm selection, training, and evaluation. It describes different types of training data (labeled, unlabeled, structured, and unstructured) and categorizes machine learning algorithms into supervised, unsupervised, reinforcement, and semi-supervised learning. Additionally, it explains inferencing as the application of the trained model for making predictions, distinguishing between batch and real-time inferencing.

Uploaded by

amanda
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

Machine Learning Fundamentals

Building a machine learning model involves data collection and


preparation, selecting an appropriate algorithm, training the model on
the prepared data, and evaluating its performance through testing and
iteration.

1. Training Data
The machine learning process starts with collecting and processing
training data.

Two types of data:

1. Labeled data => Labeled data is a dataset where each instance or example is
accompanied by a label or target variable that represents the desired output
or classification.
2. Unlabeled data => Unlabeled data is a dataset where the instances or examples
do not have any associated labels or target variables.
3. Structured data => Structured data refers to data that is organized and
formatted in a predefined manner, typically in the form of tables or
databases with rows and columns. Subtypes:

Tabular data: This includes data stored in spreadsheets, databases, or


CSV files, with rows representing instances and columns representing
features or attributes.

Time-series data: This type of data consists of sequences of values


measured at successive points in time, such as stock prices, sensor
readings, or weather data.

4. Unstructured data => Unstructured data is data that lacks a


predefined structure or format, such as text, images, audio, and video.
Subtypes :

Text data: This includes documents, articles, social media posts, and other
textual data.
Image data: This includes digital images, photographs, and video frames.

2. ML Algorithms
The compiled training data is fed into machine learning
algorithms. The ML learning process is traditionally divided into
three broad categories: supervised
learning, unsupervised learning, and reinforcement learning.
In supervised learning, the algorithms are trained on labeled
data. The goal is to learn a mapping function that can predict the
output for new, unseen input data.
Unsupervised learning refers to algorithms that learn from
unlabeled data. The goal is to discover inherent patterns,
structures, or relationships within the input data.
In reinforcement learning, the machine is given only a
performance score as guidance and semi-supervised learning,
where only a portion of training data is labeled. Feedback is
provided in the form of rewards or penalties for its actions, and
the machine learns from this feedback to improve its decision-
making over time

3. Inferencing
After the model has been trained, it is time to begin the process
of using the information that a model has learned to make
predictions or decisions. This is called inferencing. There are two
main types of inferencing in machine learning: batch inferencing
and real-time inferencing.

Batch inferencing is when the computer takes a large


amount of data, such as images or text, and analyzes it
all at once to provide a set of results. This type of
inferencing is often used for tasks like data analysis,
where the speed of the decision-making process is not
as crucial as the accuracy of the results
Real-time inferencing is when the computer has to make
decisions quickly, in response to new information as it
comes in. This is important for applications where
immediate decision-making is critical, such as in
chatbots or self-driving cars. The computer has to
process the incoming data and make a decision almost
instantaneously, without taking the time to analyze a
large dataset.

You might also like