0% found this document useful (0 votes)
177 views

Machine Learning CS-8 Dept. of CS, KFUEIT: Instructor: Muhammad Adeel Abid

The document discusses machine learning, providing an introduction to key concepts like supervised and unsupervised learning, classification, clustering, and regression. It also defines common machine learning terminology and techniques, and gives examples of real-world applications of machine learning like spam detection, language translation, and product recommendations.

Uploaded by

Opportunities
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
177 views

Machine Learning CS-8 Dept. of CS, KFUEIT: Instructor: Muhammad Adeel Abid

The document discusses machine learning, providing an introduction to key concepts like supervised and unsupervised learning, classification, clustering, and regression. It also defines common machine learning terminology and techniques, and gives examples of real-world applications of machine learning like spam detection, language translation, and product recommendations.

Uploaded by

Opportunities
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 19

Machine Learning

CS-8
Dept. of CS, KFUEIT

Instructor: Muhammad Adeel Abid


Agend
a
• Introduction
• Basics
• Classification
• Clustering
• Regression
• Use-Cases
About
• subfield of Artificial Intelligence (AI)
• name is derived from the concept that it deals with
“construction and study of systems that can learn from
data”
• can be seen as building blocks to make computers learn
to behave more intelligently
• It is a theoretical concept. There are various techniques
with various implementations.
Terminology
• Features
– The number of features or distinct traits that can be used to
describe
each item in a quantitative manner.
• Samples
– A sample is an item to process (e.g. classify). It can be a document,
a picture, a sound, a video, a row in database or CSV file, or
whatever you can describe with a fixed set of quantitative traits.
• Feature vector
– is an n-dimensional vector of numerical features that represent
some
object.
• Feature extraction
– Preparation of feature vector
– transforms the data in the high-dimensional space to a space of
fewer dimensions.
• Training/Evolution set
– Set of data to discover potentially predictive relationships.
Workflow
Categorie
s
• Supervised Learning

• Unsupervised Learning

• Semi-Supervised
Learning
Supervised
• Learning
the correct classes of the training data
are known e.g. Classification

Credit: http://us.hudson.com/legal/blog/postid/513/predictive-analytics-artificial-intelligence-science-fiction-e-discovery-truth
Unsupervised
• Learning
the correct classes of the training data are
not known e.g. Clustering

Credit: http://us.hudson.com/legal/blog/postid/513/predictive-analytics-artificial-intelligence-science-fiction-e-discovery-truth
Semi-Supervised
• A MixLearning
of Supervised and Unsupervised
learning

Credit: http://us.hudson.com/legal/blog/postid/513/predictive-analytics-artificial-intelligence-science-fiction-e-discovery-truth
Machine Learning
Techniques
Techniques
• classification: predict class from
observations
• clustering: group observations
into “meaningful” groups
• regression (prediction): predict value
from observations
Classificatio
• n into a predefined category.
classify a document
• documents can be text, images
• Popular one is Naive Bayes Classifier.
• Steps:
– Step1 : Train the program (Building a Model) using a
training set with a category for e.g. sports, cricket,
news,
– Classifier will compute probability for each word, the
probability that it makes a document belong to each
of considered categories
– Step2 : Test with a test data set against this Model
Clusterin
• g of grouping a set of objects
clustering is the task
in such a way that objects in the same group
(called a cluster) are more similar to each other
• objects are not predefined
• For e.g. these keywords
– “man’s shoe”
– “women’s shoe”
– “women’s t-shirt”
– “man’s t-shirt”
– can be cluster into 2 categories “shoe” and “t-shirt”
or “man” and “women”
• Popular ones are K-means clustering and
Hierarchical clustering
K-means
• Clustering
partition n observations into k clusters in which each observation
belongs to the cluster with the nearest mean, serving as a prototype of
the cluster.
Regressio
• is a measure nof the relation
between the mean value of one
variable (e.g. output) and
corresponding values of other
variables (e.g. time and cost).
• regression analysis is a
statistical process for estimating
the relationships among
variables.
• Regression means to predict
the output value using training
data.
• Popular one is Logistic
regression (binary regression)
Classification vs
• Regression
Classification means • Regression means
to group the output to predict the
into a class. output value using
• classification to training data.
predict the type of • regression to
tumor i.e. harmful or predict the house
not harmful using price from training
training data data
• if it is • if it is a real
discrete/categorical number/continuous
variable, then it is , then it is
classification regression problem.
problem
Let’s see the usage in Real
life
Use-

Cases
Spam Email Detection
• Machine Translation (Language
Translation)
• Image Search (Similarity)
• Clustering (KMeans) :
Amazon Recommendations
• Classification : Google News

continued

Use-Cases

(contd.)
Text Summarization - Google News
• Rating a Review/Comment: Yelp
• Fraud detection : Credit card Providers
• Decision Making : e.g. Bank/Insurance
sector
• Sentiment Analysis
• Speech Understanding – iPhone with Siri
• Face Detection – Facebook’s Photo tagging

You might also like