AI
AI
1. Training Data
The machine learning process starts with collecting and processing
training data.
1. Labeled data => Labeled data is a dataset where each instance or example is
accompanied by a label or target variable that represents the desired output
or classification.
2. Unlabeled data => Unlabeled data is a dataset where the instances or examples
do not have any associated labels or target variables.
3. Structured data => Structured data refers to data that is organized and
formatted in a predefined manner, typically in the form of tables or
databases with rows and columns. Subtypes:
Text data: This includes documents, articles, social media posts, and other
textual data.
Image data: This includes digital images, photographs, and video frames.
2. ML Algorithms
The compiled training data is fed into machine learning
algorithms. The ML learning process is traditionally divided into
three broad categories: supervised
learning, unsupervised learning, and reinforcement learning.
In supervised learning, the algorithms are trained on labeled
data. The goal is to learn a mapping function that can predict the
output for new, unseen input data.
Unsupervised learning refers to algorithms that learn from
unlabeled data. The goal is to discover inherent patterns,
structures, or relationships within the input data.
In reinforcement learning, the machine is given only a
performance score as guidance and semi-supervised learning,
where only a portion of training data is labeled. Feedback is
provided in the form of rewards or penalties for its actions, and
the machine learns from this feedback to improve its decision-
making over time
3. Inferencing
After the model has been trained, it is time to begin the process
of using the information that a model has learned to make
predictions or decisions. This is called inferencing. There are two
main types of inferencing in machine learning: batch inferencing
and real-time inferencing.