Introduction To Data Science and Machine Learning
Introduction To Data Science and Machine Learning
1. Fraud detection
2. Sentiment analysis
1. Each attribute
becomes a
dimension
Sugar
2. Each record
becomes a point in
the space
e
Ag BP level
Heart healthy
Potential heart ailments
Machine learning happens in mathematical space / feature
space:
1. Position of a
point in space is
defined with
respect to the
origin
Sugar
2. The position is
e
decided by the
values of the Ag
BP level
attributes for a
Heart healthy
point Potential heart ailments
Machine learning happens in mathematical space / feature
space:
Sugar
5. But multiple planes can
do the job. Each
e
Ag
representing an alternate
hypothesis
BP level
Heart healthy
6. The learning algorithm Erroneous classification
Potential heart ailments
selects that hypothesis
Machine learning happens in mathematical space / feature
space:
Sugar
e
8. x , y, z represent the Ag
three dimensions BP level
i.e. BP, Age, Sugar
Heart healthy
Potential heart ailments
while d represents
Machine learning happens in mathematical space / feature
space:
Sugar
get value of d
(healthy or ailing)
e
Ag
11.The data point will
ax + byabove
be placed + cz or
= d, BP level
Sugar
or healthy hear will
be known only after
direct observation
e
Ag
ax + by + cz = d, BP level
Heart healthy
Potential heart ailments
Machine learning happens in mathematical space / feature
space:
ax + by + cz = d,
Sugar
e
Ag
BP level
15.If majority of new data Heart healthy
Potential heart ailments
points are correctly
classified, the model is
Machine Learning Categories
Machine learning categories:
1.Supervised Learning
2.Unsupervised Learning
3.Reinforcement Learning
Supervised Machine Learning:
1. Class of algorithms which work in two stages. The first stage is called
training and second one is usually called testing. Sometimes it may
involve validation stage followed by testing
2. At each stage it takes input data prepared for that stage. i.e. for training
data for training stage, test data for test stage, validation data for validation
stage
3. During training, the machine learning algorithm gets the training data
inform of independent and dependent variables
3. The algorithms are not used to find any relationship between dependent
and independent variables