Machine Learning CS-8 Dept. of CS, KFUEIT: Instructor: Muhammad Adeel Abid
• Unsupervised Learning
• Semi-Supervised Learning
Supervised Learning
• The correct classes of the training data are known, e.g. classification.
Credit: http://us.hudson.com/legal/blog/postid/513/predictive-analytics-artificial-intelligence-science-fiction-e-discovery-truth
Unsupervised Learning
• The correct classes of the training data are not known, e.g. clustering.
Semi-Supervised Learning
• A mix of supervised and unsupervised learning
Machine Learning Techniques
• Classification: predict a class from observations
• Clustering: group observations into “meaningful” groups
• Regression (prediction): predict a value from observations
Classification
• Classify a document into a predefined category.
• Documents can be text or images.
• A popular one is the Naive Bayes classifier.
• Steps:
– Step 1: Train the program (build a model) using a training set in which each document is labeled with a category, e.g. sports, cricket, news.
– For each word, the classifier computes the probability that it makes a document belong to each of the considered categories.
– Step 2: Test the model against a test data set.
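The two steps above can be sketched in plain Python. This is a minimal illustration, not a reference implementation: the function names `train` and `classify`, the tiny training set, and the use of add-one (Laplace) smoothing are all assumptions made for the example.

```python
import math
from collections import Counter, defaultdict

def train(labeled_docs):
    """Step 1: build the model from (text, category) pairs."""
    word_counts = defaultdict(Counter)  # category -> word -> count
    doc_counts = Counter()              # category -> number of documents
    vocab = set()
    for text, category in labeled_docs:
        words = text.lower().split()
        word_counts[category].update(words)
        doc_counts[category] += 1
        vocab.update(words)
    return word_counts, doc_counts, vocab

def classify(model, text):
    """Step 2: return the most probable category for an unseen document."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_category, best_score = None, -math.inf
    for category in doc_counts:
        # log P(category) + sum of log P(word | category)
        score = math.log(doc_counts[category] / total_docs)
        total_words = sum(word_counts[category].values())
        for word in text.lower().split():
            # Add-one smoothing (an assumption) avoids log(0) for unseen words.
            count = word_counts[category][word] + 1
            score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_category, best_score = category, score
    return best_category

# Hypothetical training set with two of the categories named above.
training_set = [
    ("the batsman scored a century", "cricket"),
    ("the bowler took five wickets", "cricket"),
    ("the government announced a new budget", "news"),
    ("the minister gave a speech on the budget", "news"),
]
model = train(training_set)
prediction = classify(model, "the bowler dismissed the batsman")
print(prediction)  # cricket
```

Working with log-probabilities instead of multiplying raw probabilities keeps the scores from underflowing on longer documents.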
Clustering
• Clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar to each other than to objects in other groups.
• The groups are not predefined.
• E.g. these keywords
– “men’s shoe”
– “women’s shoe”
– “women’s t-shirt”
– “men’s t-shirt”
– can be clustered into 2 categories: “shoe” and “t-shirt”, or “men” and “women”
• Popular ones are K-means clustering and hierarchical clustering.
K-means Clustering
• Partitions n observations into k clusters, in which each observation belongs to the cluster with the nearest mean, which serves as a prototype of the cluster.
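The definition above can be sketched as the usual alternating procedure (often called Lloyd's algorithm). The sample points, the fixed iteration count, and the choice of the first k points as initial means are assumptions made to keep the example small and deterministic; practical implementations usually pick the starting means more carefully.

```python
import math

def kmeans(points, k, iterations=10):
    """Alternate between assigning points and recomputing cluster means."""
    # Assumption: use the first k points as the initial means.
    centers = [points[i] for i in range(k)]
    clusters = [[] for _ in range(k)]
    for _ in range(iterations):
        # Assignment step: each observation joins the cluster with the nearest mean.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: math.dist(p, centers[i]))
            clusters[nearest].append(p)
        # Update step: each mean becomes the average of its cluster's members.
        for i, members in enumerate(clusters):
            if members:  # keep the old mean if a cluster ends up empty
                centers[i] = tuple(sum(c) / len(members) for c in zip(*members))
    return centers, clusters

# Hypothetical 2-D observations forming two visibly separate groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
centers, clusters = kmeans(points, k=2)
for center, members in zip(centers, clusters):
    print(center, members)
```

Each returned center is the mean of its cluster, which is the "prototype" role described above.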
Regression
• Regression is a measure of the relation between the mean value of one variable (e.g. output) and corresponding values of other variables (e.g. time and cost).
• Regression analysis is a statistical process for estimating the relationships among variables.
• Regression means to predict the output value using training data.
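• A popular one is logistic regression (binary regression), which models the probability of a binary outcome and is therefore commonly used for classification.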
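As a small illustration of predicting an output value from training data, the sketch below fits a straight line by ordinary least squares. Note that this is simple linear regression, not the logistic regression named above; the function name `fit_line` and the house size and price figures are made up for the example.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance(x, y) / variance(x); the common 1/n factor cancels.
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical training data: house size (square metres) and price (thousands).
sizes = [50, 80, 100, 120, 150]
prices = [155, 238, 302, 359, 451]

slope, intercept = fit_line(sizes, prices)

# Predict the price of an unseen 110 square metre house.
predicted_price = slope * 110 + intercept
print(round(predicted_price, 1))
```

The output is a continuous number rather than a class label, which is what separates this from the classification example.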
Classification vs Regression
Classification
• Classification means to group the output into a class.
• E.g. classification to predict the type of tumor, i.e. harmful or not harmful, using training data.
• If the output is a discrete/categorical variable, then it is a classification problem.
Regression
• Regression means to predict the output value using training data.
• E.g. regression to predict the house price from training data.
• If the output is a real number/continuous variable, then it is a regression problem.
Let’s see the usage in real life
Use-Cases
• Spam Email Detection
• Machine Translation (Language Translation)
• Image Search (Similarity)
• Clustering (K-means): Amazon Recommendations
• Classification: Google News
continued …
Use-Cases (contd.)
• Text Summarization - Google News
• Rating a Review/Comment: Yelp
• Fraud Detection: Credit Card Providers
• Decision Making: e.g. Bank/Insurance sector
• Sentiment Analysis
• Speech Understanding – iPhone with Siri
• Face Detection – Facebook’s Photo tagging