Machine Learning Unit-1.2
Machine Learning Unit-1.2
• The robot learns by trying all the possible paths and then
choosing the path which gives him the reward with the
least hurdles.
• Each right step will give the robot a reward and each wrong
step will subtract the reward of the robot.
• Training: The training is based upon the input, The model will
return a state and the user will decide to reward or punish the
model based on its output.
• Email filtering
• Speech recognition,
• Face recognition
• Disease detection
• Fraud detection
• Computer vision,
• Self-driven cars
• Weather Forecasting
• NLP
• Amazon product recommendation, etc.
Issues in Machine Learning
• Healthcare,
• Education,
• Finance,
• Automobile,
• Marketing, Shipping,
• Infrastructure,
• Automation, etc.
• Big companies like Amazon
• Facebook,
• Google
• Adobe, etc.
4. Irrelevant features
6. Inadequate Infrastructure
BASIC TYPES OF DATA IN MACHINE LEARNING
• Each data set also has multiple attributes, each of which gives information on a
specific characteristic.
• Both the datasets, Student and Student Performance, are having four features or
dimensions; hence they are told to have four dimensional data space.
Types of Data in ML
• An ideal example of interval data is Celsius temperature. The difference between each value
remains the same in Celsius temperature.
• For example, the difference between 12°C and 18°C degrees is measurable and is 6°C as in the
case of difference between 15.5°C and 21.5°C. Other examples include date, time, etc.
• For interval data, mathematical operations such as addition and subtraction are possible. For
that reason, for interval data, the central tendency can be measured by mean, median, or
• mode. Standard deviation can also be calculated.
• However, interval data do not have something called a ‘true zero’ value.
• For example, there is nothing called ‘0 temperature’ or ‘no temperature’. Hence, only addition
and subtraction applies for interval data. The ratio cannot be applied. This means, we can say
a temperature of 40°C is equal to the temperature of 20°C + temperature of 20°C.
• However, we cannot say the temperature of 40°C means it is twice as hot as in temperature of
20°C.
Ratio data
• Ratio data represents numeric data for which exact
value can be measured.
• Absolute zero is available for ratio data.
• A special type of discrete attribute which can assume two values only is called
binary attribute. Examples of binary attribute include male/ female,
positive/negative, yes/no, etc.
• Continuous attributes can assume any possible value which is a real number.
Examples of continuous attribute include length, height, weight, price, etc.
• On the other hand, interval and ratio attributes are continuous, barring a few
exceptions, e.g. ‘count’ attribute.