0% found this document useful (0 votes)

25 views

ML Unit-1

Uploaded by

imjyoti1511

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views

ML Unit-1

Uploaded by

imjyoti1511

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 12

UNIT I:

Introduction to Machine Learning with Python: Introduction to Machine Learning, basic terminology,
Types of Machine Learning and Applications, Using Python for Machine Learning: Installing Python and
packages from the Python Package Index, Introduction to NumPy, SciPy, matplotlib and scikit-learn, Tiny
application of Machine Learning.

UNIT II:

Supervised Learning: Types of Supervised Learning, Supervised Machine Learning Algorithms: kNearest
Neighbors, Regression Models, Naive Bayes Classifiers, Decision Trees, Ensembles of Decision Trees,
Kernelized Support Vector Machines, Uncertainty Estimates from Classifiers.

UNIT III:

Building good training datasets: Dealing with missing data, Handling categorical data, partitioning a data
set into separate training and test datasets, bringing features onto the same scale, selecting meaningful
features, assessing feature importance with random forests. Compressing data via dimensionality
reduction: Unsupervised dimensionality reduction via PCA, Supervised data compression via linear
discriminant analysis (Text Book 2)

UNIT IV:

Learning best Practices for Model Evaluation and Hyperparameter tuning: streamlining workflows with
pipelines, using k-fold cross validation to assess model performance, debugging algorithms with learning
and validation curves, fine tuning machine learning models via grid search, looking at different
performance evaluation metrics. Combining different model sfor Ensemble learning: learning with
ensembles, combining classifiers via majority vote, bagging-building an ensemble of classifiers from
bootstrap samples, leveraging weak learners via adaptive boosting (Text Book 2)

UNIT V:

Working with Text Data (Data Visualization): Types of Data Represented as Strings, Example Application:
Sentiment Analysis of Movie Reviews, Representing Text Data as a Bag of Words, Stop Words, Rescaling
the Data with tf-idf, Investigating Model Coefficients, Approaching a Machine Learning Problem, Testing
Production Systems, Ranking, Recommender Systems and Other kinds of Learning.
UNIT-1

What is Machine Learning

In the real world, we are surrounded by humans who can learn everything from their
experiences with their learning capability, and we have computers or machines which work
on our instructions. But can a machine also learn from experiences or past data like a
human does? So here comes the role of Machine Learning.

Introduction to Machine Learning

A subset of artificial intelligence known as machine learning focuses primarily on the

creation of algorithms that enable a computer to independently learn from data and
previous experiences. Arthur Samuel first used the term "machine learning" in 1959. It could
be summarized as follows:

Without being explicitly programmed, machine learning enables a machine to automatically

learn from data, improve performance from experiences, and predict things.

Machine learning algorithms create a mathematical model that, without being explicitly
programmed, aids in making predictions or decisions with the assistance of sample historical
data, or training data. For the purpose of developing predictive models, machine learning
brings together statistics and computer science. Algorithms that learn from historical data
are either constructed or utilized in machine learning. The performance will rise in
proportion to the quantity of information we provide.

A machine can learn if it can gain more data to improve its performance.
How does Machine Learning work
A machine learning system builds prediction models, learns from previous data, and predicts
the output of new data whenever it receives it. The amount of data helps to build a better
model that accurately predicts the output, which in turn affects the accuracy of the
predicted output.

The Machine Learning algorithm's operation is depicted in the following block diagram:

Features of Machine Learning:

Machine learning uses data to detect various patterns in a given dataset.

o It can learn from past data and improve automatically.

o It is a data-driven technology.
o Machine learning is much similar to data mining as it also deals with the huge
amount of the data.

Need for Machine Learning

The demand for machine learning is steadily rising. Because it is able to perform tasks that
are too complex for a person to directly implement, machine learning is required. Humans
are constrained by our inability to manually access vast amounts of data; as a result, we
require computer systems, which is where machine learning comes in to simplify our lives.

By providing them with a large amount of data and allowing them to automatically explore
the data, build models, and predict the required output, we can train machine learning
algorithms. The cost function can be used to determine the amount of data and the machine
learning algorithm's performance. We can save both time and money by using machine
learning.

The significance of AI can be handily perceived by its utilization's cases, Presently, AI is

utilized in self-driving vehicles, digital misrepresentation identification, face
acknowledgment, and companion idea by Facebook, and so on. Different top organizations,
for example, Netflix and Amazon have constructed AI models that are utilizing an immense
measure of information to examine the client interest and suggest item likewise.

Following are some key points which show the importance of Machine Learning:
o Rapid increment in the production of data
o Solving complex problems, which are difficult for a human
o Decision making in various sector including finance
o Finding hidden patterns and extracting useful information from data.

Classification of Machine Learning

At a broad level, machine learning can be classified into three types:

1. Supervised learning
2. Unsupervised learning
3. Reinforcement learning

1) Supervised Learning

In supervised learning, sample labeled data are provided to the machine learning system for
training, and the system then predicts the output based on the training data.

The system uses labeled data to build a model that understands the datasets and learns
about each one. After the training and processing are done, we test the model with sample
data to see if it can accurately predict the output.

The mapping of the input data to the output data is the objective of supervised learning. The
managed learning depends on oversight, and it is equivalent to when an understudy learns
things in the management of the educator. Spam filtering is an example of supervised
learning.
Supervised learning can be grouped further in two categories of algorithms:

o Classification
o Regression

2) Unsupervised Learning

Unsupervised learning is a learning method in which a machine learns without any

supervision.

The training is provided to the machine with the set of data that has not been labeled,
classified, or categorized, and the algorithm needs to act on that data without any
supervision. The goal of unsupervised learning is to restructure the input data into new
features or a group of objects with similar patterns.

In unsupervised learning, we don't have a predetermined result. The machine tries to find
useful insights from the huge amount of data. It can be further classifieds into two
categories of algorithms:

o Clustering
o Association

3) Reinforcement Learning

Reinforcement learning is a feedback-based learning method, in which a learning agent gets

a reward for each right action and gets a penalty for each wrong action. The agent learns
automatically with these feedbacks and improves its performance. In reinforcement
learning, the agent interacts with the environment and explores it. The goal of an agent is to
get the most reward points, and hence, it improves its performance.

The robotic dog, which automatically learns the movement of his arms, is an example of
Reinforcement learning.

Applications of Machine learning

Machine learning is a buzzword for today's technology, and it is growing very rapidly day by
day. We are using machine learning in our daily life even without knowing it such as Google
Maps, Google assistant, Alexa, etc. Below are some most trending real-world applications of
Machine Learning:
1. Image Recognition:

Image recognition is one of the most common applications of machine learning. It is used to
identify objects, persons, places, digital images, etc. The popular use case of image
recognition and face detection is, Automatic friend tagging suggestion:

Facebook provides us a feature of auto friend tagging suggestion. Whenever we upload a

photo with our Facebook friends, then we automatically get a tagging suggestion with name,
and the technology behind this is machine learning's face detection and recognition
algorithm.

It is based on the Facebook project named "Deep Face," which is responsible for face
recognition and person identification in the picture.

2. Speech Recognition

While using Google, we get an option of "Search by voice," it comes under speech
recognition, and it's a popular application of machine learning.
Speech recognition is a process of converting voice instructions into text, and it is also
known as "Speech to text", or "Computer speech recognition." At present, machine
learning algorithms are widely used by various applications of speech recognition. Google
assistant, Siri, Cortana, and Alexa are using speech recognition technology to follow the
voice instructions.

3. Traffic prediction:

If we want to visit a new place, we take help of Google Maps, which shows us the correct
path with the shortest route and predicts the traffic conditions.

It predicts the traffic conditions such as whether traffic is cleared, slow-moving, or heavily
congested with the help of two ways:

o Real Time location of the vehicle form Google Map app and sensors
o Average time has taken on past days at the same time.

Everyone who is using Google Map is helping this app to make it better. It takes information
from the user and sends back to its database to improve the performance.

4. Product recommendations:

Machine learning is widely used by various e-commerce and entertainment companies such
as Amazon, Netflix, etc., for product recommendation to the user. Whenever we search for
some product on Amazon, then we started getting an advertisement for the same product
while internet surfing on the same browser and this is because of machine learning.

Google understands the user interest using various machine learning algorithms and
suggests the product as per customer interest.

As similar, when we use Netflix, we find some recommendations for entertainment series,
movies, etc., and this is also done with the help of machine learning.

5. Self-driving cars:

One of the most exciting applications of machine learning is self-driving cars. Machine
learning plays a significant role in self-driving cars. Tesla, the most popular car
manufacturing company is working on self-driving car. It is using unsupervised learning
method to train the car models to detect people and objects while driving.

6. Email Spam and Malware Filtering:

Whenever we receive a new email, it is filtered automatically as important, normal, and

spam. We always receive an important mail in our inbox with the important symbol and
spam emails in our spam box, and the technology behind this is Machine learning. Below are
some spam filters used by Gmail:

o Content Filter
o Header filter
o General blacklists filter
o Rules-based filters
o Permission filters

Some machine learning algorithms such as Multi-Layer Perceptron, Decision tree,

and Naïve Bayes classifier are used for email spam filtering and malware detection.

7. Virtual Personal Assistant:

We have various virtual personal assistants such as Google

assistant, Alexa, Cortana, Siri. As the name suggests, they help us in finding the
information using our voice instruction. These assistants can help us in various ways just by
our voice instructions such as Play music, call someone, Open an email, Scheduling an
appointment, etc.

These virtual assistants use machine learning algorithms as an important part.

These assistant record our voice instructions, send it over the server on a cloud, and decode
it using ML algorithms and act accordingly.

8. Online Fraud Detection:

Machine learning is making our online transaction safe and secure by detecting fraud
transaction. Whenever we perform some online transaction, there may be various ways that
a fraudulent transaction can take place such as fake accounts, fake ids, and steal
money in the middle of a transaction. So to detect this, Feed Forward Neural
network helps us by checking whether it is a genuine transaction or a fraud transaction.

For each genuine transaction, the output is converted into some hash values, and these
values become the input for the next round. For each genuine transaction, there is a specific
pattern which gets change for the fraud transaction hence, it detects it and makes our
online transactions more secure.

9. Stock Market trading:

Machine learning is widely used in stock market trading. In the stock market, there is always
a risk of up and downs in shares, so for this machine learning's long short term memory
neural network is used for the prediction of stock market trends.

10. Medical Diagnosis:

In medical science, machine learning is used for diseases diagnoses. With this, medical
technology is growing very fast and able to build 3D models that can predict the exact
position of lesions in the brain.

It helps in finding brain tumors and other brain-related diseases easily.

11. Automatic Language Translation:

Nowadays, if we visit a new place and we are not aware of the language then it is not a
problem at all, as for this also machine learning helps us by converting the text into our
known languages. Google's GNMT (Google Neural Machine Translation) provide this feature,
which is a Neural Machine Learning that translates the text into our familiar language, and it
called as automatic translation.

The technology behind the automatic translation is a sequence to sequence learning

algorithm, which is used with image recognition and translates the text from one language
to another language.

NumPy
NumPy basically provides n-dimensional array object. NumPy also provides
mathematical functions which can be used in many calculations.

Command to install: pip install numpy

import numpy as np
arr = np.array([[1,2,3],[4,5,6]])
print("Numpy array
{}".format(arr))

Output

Numpy array

[[1 2 3]

[4 5 6]]

SciPy
SciPy is collection of scientific computing functions. It provides advanced linear algebra
routines, mathematical function optimization, signal processing, special mathematical
functions, and statistical distributions.

Command to install: pip install scipy

from scipy import sparse

# Create a 2D NumPy array with a diagonal of ones, and zeros everywhere else
eye = np.eye(3)
print("NumPy array:
{}".format(eye))
sparse_matrix = sparse.csr_matrix(eye)
print("
SciPy sparse CSR matrix:
{}".format(sparse_matrix))

Output

NumPy array:

[[1. 0. 0.]

[0. 1. 0.]

[0. 0. 1.]]

SciPy sparse CSR matrix:

(0, 0) 1.0

(1, 1) 1.0

(2, 2) 1.0

matplotlib
matplotlib is scientific plotting library usually required to visualize data. Importantly
visualization is required to analyze the data. You can plot histograms, scatter graphs, lines
etc.

Command to install: pip install matplotlib

import matplotlib.pyplot as plt

x = [1,2,3]
y = [4,5,6]
plt.scatter(x,y)
plt.show()

Output

scikit-learn
scikit-learn is built on NumPy, SciPy and matplotlib provides tools for data analysis and
data mining. It provides classification and clustering algorithms built in and some datasets
for practice like iris dataset, Boston house prices dataset, diabetes dataset etc.

Command to install: pip install scikit-learn

from sklearn import datasets
iris_data = datasets.load_iris()
sample = iris_data['data'][:3]
print("iris dataset sample data:
{}".format(iris_data['feature_names']))
print("{}".format(sample))

Output

iris dataset sample data:

['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)']

[[5.1 3.5 1.4 0.2]

[4.9 3. 1.4 0.2]

[4.7 3.2 1.3 0.2]]

pandas
pandas is used for data analysis it can take multi-dimensional arrays as input and
produce charts/graphs. pandas may take a table with columns of different datatypes. It may
ingest data from various data files and database like SQL, Excel, CSV etc.

Command to install: pip install pandas

import pandas as pd
age = {'age': [4, 6, 8, 34, 5, 30, 41] }
dataframe = pd.DataFrame(age)
print("all age:
{}".format(dataframe))
filtered = dataframe[dataframe.age > 20]
print("age above 20:
{}".format(filtered))

Output

all age:

age

0 4

1 6

2 8

3 34

4 5
5 30

6 41

age above 20:

age

3 34

5 30

6 41

Machine Learning For Absolute Beginners A - Oliver Theobald
100% (2)
Machine Learning For Absolute Beginners A - Oliver Theobald
179 pages
Implementation of The LIBOR Market Model
100% (4)
Implementation of The LIBOR Market Model
22 pages
JNTUK R20 ML UNIT-I (Chapter-I)
No ratings yet
JNTUK R20 ML UNIT-I (Chapter-I)
9 pages
R20 ML NOTES
No ratings yet
R20 ML NOTES
118 pages
ML-Unit 1
No ratings yet
ML-Unit 1
43 pages
ML-Unit 1 Merged
No ratings yet
ML-Unit 1 Merged
151 pages
Module 1
No ratings yet
Module 1
38 pages
Unit 1 Notes
No ratings yet
Unit 1 Notes
68 pages
ML Unit 1
No ratings yet
ML Unit 1
20 pages
UNIT-IV Notes
No ratings yet
UNIT-IV Notes
42 pages
Introducion to ML
No ratings yet
Introducion to ML
29 pages
ML CH 1
No ratings yet
ML CH 1
53 pages
Machine Learning
No ratings yet
Machine Learning
73 pages
MACHINE LEARNING
No ratings yet
MACHINE LEARNING
97 pages
Unit1 ML
No ratings yet
Unit1 ML
23 pages
PUSHKAR
No ratings yet
PUSHKAR
15 pages
Machine Learning Tutorial For Beginners
No ratings yet
Machine Learning Tutorial For Beginners
15 pages
Unit I_Machine Learning @ CSJMU_6 Slides Handouts
No ratings yet
Unit I_Machine Learning @ CSJMU_6 Slides Handouts
4 pages
ML Notes
No ratings yet
ML Notes
202 pages
UNIT I-Machine Learning
No ratings yet
UNIT I-Machine Learning
68 pages
Karthik
No ratings yet
Karthik
10 pages
mehakreport
No ratings yet
mehakreport
23 pages
ML Links
No ratings yet
ML Links
176 pages
Unit 4
No ratings yet
Unit 4
39 pages
CBSYLLABUS BDA 1
No ratings yet
CBSYLLABUS BDA 1
4 pages
ML Unit1(HKB)
No ratings yet
ML Unit1(HKB)
7 pages
Unit-1
No ratings yet
Unit-1
88 pages
Module1 Introduction
No ratings yet
Module1 Introduction
35 pages
5_6095834670757318868
No ratings yet
5_6095834670757318868
62 pages
w1 - Introduction To ML
No ratings yet
w1 - Introduction To ML
41 pages
1 ML
No ratings yet
1 ML
24 pages
Unit 5 Machine Learning
No ratings yet
Unit 5 Machine Learning
14 pages
ML 1
No ratings yet
ML 1
79 pages
Data Science IV
No ratings yet
Data Science IV
126 pages
Machine Learning
100% (1)
Machine Learning
81 pages
Chapter 01 machine learning
No ratings yet
Chapter 01 machine learning
22 pages
Updated Unit 1
No ratings yet
Updated Unit 1
57 pages
Notes - Machine Learning
No ratings yet
Notes - Machine Learning
138 pages
ML Full Slides Final
No ratings yet
ML Full Slides Final
458 pages
ML - Full Slides Srikanth Allamshatty
No ratings yet
ML - Full Slides Srikanth Allamshatty
369 pages
ML 1
No ratings yet
ML 1
21 pages
1.2.1 ML Intro
No ratings yet
1.2.1 ML Intro
18 pages
Unit-1 Part-1 Material
No ratings yet
Unit-1 Part-1 Material
45 pages
Chapter 1 Overview
No ratings yet
Chapter 1 Overview
19 pages
Basics of Machine Learning
100% (4)
Basics of Machine Learning
22 pages
Machine Learning
No ratings yet
Machine Learning
25 pages
Presentation 33360 Content Document 20250319044717PM
No ratings yet
Presentation 33360 Content Document 20250319044717PM
126 pages
Machine Learning- UNIT I (1)
No ratings yet
Machine Learning- UNIT I (1)
70 pages
Unit-1
No ratings yet
Unit-1
55 pages
Machine Learning
No ratings yet
Machine Learning
3 pages
Machine Learning Lecture Notes
No ratings yet
Machine Learning Lecture Notes
17 pages
1 - Machine Learning (Start)
No ratings yet
1 - Machine Learning (Start)
32 pages
Machine Learning With Python
No ratings yet
Machine Learning With Python
89 pages
Unit-1 Introduction To Machine Learning
No ratings yet
Unit-1 Introduction To Machine Learning
24 pages
Machine Learning Tutorial
100% (1)
Machine Learning Tutorial
44 pages
Session One Machine Learning
No ratings yet
Session One Machine Learning
18 pages
ml
No ratings yet
ml
333 pages
Machinelearning Unit-1
No ratings yet
Machinelearning Unit-1
29 pages
Unit 1 Introduction of Machine Learning Notes
No ratings yet
Unit 1 Introduction of Machine Learning Notes
57 pages
Python Machine Learning Illustrated Guide For Beginners & Intermediates:The Future Is Here!
From Everand
Python Machine Learning Illustrated Guide For Beginners & Intermediates:The Future Is Here!
William Sullivan
4.5/5 (2)
Python Machine Learning: Introduction to Machine Learning with Python
From Everand
Python Machine Learning: Introduction to Machine Learning with Python
Frank Millstein
No ratings yet
Hotel Management
No ratings yet
Hotel Management
3 pages
BATCH 3 (3)
No ratings yet
BATCH 3 (3)
2 pages
UNIT-3_IOT
No ratings yet
UNIT-3_IOT
14 pages
SPM
No ratings yet
SPM
18 pages
UNIT-1 ch01
No ratings yet
UNIT-1 ch01
28 pages
File Handling in PHP
No ratings yet
File Handling in PHP
16 pages
Python Data Science Handbook Essential Tools for Working with Data 1st Edition Jake Vanderplas download pdf
100% (3)
Python Data Science Handbook Essential Tools for Working with Data 1st Edition Jake Vanderplas download pdf
55 pages
Voice Based System Assistant Using NLP and Deep Learning
No ratings yet
Voice Based System Assistant Using NLP and Deep Learning
63 pages
8 LO5 Lect 1
No ratings yet
8 LO5 Lect 1
16 pages
Scoring Key/marking Scheme
No ratings yet
Scoring Key/marking Scheme
9 pages
Numpy in Python
No ratings yet
Numpy in Python
7 pages
Dli Catalog
No ratings yet
Dli Catalog
26 pages
Open3D: A Modern Library For 3D Data Processing: Qian-Yi Zhou Jaesik Park Vladlen Koltun Intel Labs
No ratings yet
Open3D: A Modern Library For 3D Data Processing: Qian-Yi Zhou Jaesik Park Vladlen Koltun Intel Labs
6 pages
Face Detectionand Recognition Using Open CV
No ratings yet
Face Detectionand Recognition Using Open CV
13 pages
Super 40 - Pandas Series Worksheet 1 - Answers
No ratings yet
Super 40 - Pandas Series Worksheet 1 - Answers
10 pages
Numpy Reference
No ratings yet
Numpy Reference
1,413 pages
UNIT-2 Arrays in Python
No ratings yet
UNIT-2 Arrays in Python
64 pages
Python Interview
0% (1)
Python Interview
18 pages
ch-1 Numpy - 2nd - Day
No ratings yet
ch-1 Numpy - 2nd - Day
5 pages
Build a Robo Advisor with Python From Scratch Automate your financial and investment decisions MEAP Rob Reider pdf download
100% (1)
Build a Robo Advisor with Python From Scratch Automate your financial and investment decisions MEAP Rob Reider pdf download
54 pages
Getting Started With Python Cheat Sheet
No ratings yet
Getting Started With Python Cheat Sheet
1 page
Pandas DataFrame Notes
67% (3)
Pandas DataFrame Notes
13 pages
SRM PPS Unit 5 Consolidated
No ratings yet
SRM PPS Unit 5 Consolidated
139 pages
Final Mini Project
No ratings yet
Final Mini Project
50 pages
Python Arrays
No ratings yet
Python Arrays
4 pages
Pandas Workout (MEAP V06) Reuven Lerner - The complete ebook set is ready for download today
100% (1)
Pandas Workout (MEAP V06) Reuven Lerner - The complete ebook set is ready for download today
66 pages
Solution Python Programming Practical List
No ratings yet
Solution Python Programming Practical List
36 pages
Coding Python
100% (9)
Coding Python
252 pages
FDA Lab Manual Final
No ratings yet
FDA Lab Manual Final
42 pages
John M. Stewart - Python For Scientists (2017, Cambridge University Press)
No ratings yet
John M. Stewart - Python For Scientists (2017, Cambridge University Press)
268 pages
Teddy Mukadzambo R201998U Python Refresher
No ratings yet
Teddy Mukadzambo R201998U Python Refresher
8 pages
MODEL EXAM II Answer Key - For Merge
No ratings yet
MODEL EXAM II Answer Key - For Merge
20 pages
C2_W1_Assignment
No ratings yet
C2_W1_Assignment
24 pages
Prediction of Admission in Engineering College
No ratings yet
Prediction of Admission in Engineering College
59 pages
Matrices Parte 2 - Jupyter Notebook
No ratings yet
Matrices Parte 2 - Jupyter Notebook
20 pages