
towardsdatascience.com

11 Most Common Machine Learning Algorithms Explained in a Nutshell

Soner Yıldırım


A summary of common machine learning algorithms.

Photo by Santiago Lacarta on Unsplash

The prevalence of machine learning has been increasing tremendously in recent years due to high demand and advancements in technology. The potential of machine learning to create value out of data has made it appealing to businesses in many different industries. Most machine learning products are designed and implemented with off-the-shelf machine learning algorithms, with some tuning and minor changes.

There is a wide variety of machine learning algorithms, which can be grouped into three main categories:

• Supervised learning algorithms model the relationship between features (independent variables) and a label (target) given a set of observations. The model is then used to predict the label of new observations using their features. Depending on the characteristics of the target variable, it can be a classification (discrete target variable) or a regression (continuous target variable) task.

• Unsupervised learning algorithms try to find the structure in unlabeled data.

• Reinforcement learning works based on an action-reward principle. An agent learns to reach a goal by iteratively calculating the reward of its actions.

In this post, I will cover the most common algorithms in the first two
categories.

Note: Although deep learning is a sub-field of machine learning, I will not include any deep learning algorithms in this post. I think deep learning algorithms should be discussed separately due to their complexity and distinct dynamics. Besides, I do not want to make this post too long and bore the readers.

Let’s start.

1. Linear Regression

Linear regression is a supervised learning algorithm that tries to model the relationship between a continuous target variable and one or more independent variables by fitting a linear equation to the data.

For linear regression to be a good choice, there needs to be a linear relationship between the independent variable(s) and the target variable. There are many tools to explore the relationship among variables, such as scatter plots and correlation matrices. For example, the scatter plot below shows a positive correlation between an independent variable (x-axis) and a dependent variable (y-axis): as one increases, the other one also increases.

A linear regression model tries to fit a regression line to the data points that best represents the relations or correlations. The most common technique is ordinary least squares (OLS). With this method, the best regression line is found by minimizing the sum of squared distances between the data points and the regression line. For the data points above, the regression line obtained using OLS runs through the middle of the point cloud.
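As a rough illustration (my own sketch, not part of the original article), the following scikit-learn snippet fits an OLS regression line to synthetic, noisy linear data and reports the learned slope and intercept:

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic data with a roughly linear relation plus noise
rng = np.random.default_rng(42)
X = rng.uniform(0, 10, size=(100, 1))
y = 2.5 * X.ravel() + 1.0 + rng.normal(0, 1.5, size=100)

# Fit an OLS regression line
model = LinearRegression()
model.fit(X, y)

print("slope:", model.coef_[0], "intercept:", model.intercept_)
print("R^2 on training data:", model.score(X, y))
```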

2. Support Vector Machine

Support Vector Machine (SVM) is a supervised learning algorithm that is mostly used for classification tasks, but it is also suitable for regression tasks.

SVM distinguishes classes by drawing a decision boundary. How to draw or determine the decision boundary is the most critical part of SVM algorithms. Before the decision boundary is created, each observation (or data point) is plotted in n-dimensional space, where “n” is the number of features used. For instance, if we use “length” and “width” to classify different “cells”, observations are plotted in a 2-dimensional space and the decision boundary is a line. If we use 3 features, the decision boundary is a plane in 3-dimensional space. If we use more than 3 features, the decision boundary becomes a hyperplane, which is very hard to visualize.

Decision boundary in 2D space is a line

The decision boundary is drawn in a way that maximizes the distance to the support vectors. If the decision boundary is too close to a support vector, it will be highly sensitive to noise and will not generalize well: even very small changes in the independent variables may cause a misclassification.

The data points are not always linearly separable as in the figure above. In these cases, SVM uses the kernel trick, which measures the similarity (or closeness) of data points in a higher-dimensional space in order to make them linearly separable.

A kernel function is a kind of similarity measure: the inputs are the original features and the output is a similarity measure in the new feature space, where similarity means a degree of closeness. Actually transforming the data points to a high-dimensional feature space is a costly operation, so the algorithm does not do it. Instead, kernelized SVM computes the decision boundary in terms of similarity measures in the high-dimensional feature space without actually performing the transformation, which is why this is called the kernel trick.

SVM is especially effective in cases where the number of dimensions is greater than the number of samples. When finding the decision boundary, SVM uses a subset of the training points rather than all of them, which makes it memory efficient. On the other hand, training time increases for large datasets, which negatively affects performance.
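Here is a minimal sketch of a kernelized SVM in scikit-learn, assuming a synthetic two-moons dataset of my own choosing (not from the article); the RBF kernel applies the kernel trick to handle classes that are not linearly separable in the original space:

```python
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Two interleaving half-moons: not linearly separable in the original space
X, y = make_moons(n_samples=300, noise=0.2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# RBF kernel applies the kernel trick; C trades off margin width vs. misclassification
clf = SVC(kernel="rbf", C=1.0, gamma="scale")
clf.fit(X_train, y_train)

print("number of support vectors per class:", clf.n_support_)
print("test accuracy:", clf.score(X_test, y_test))
```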

3. Naive Bayes

Naive Bayes is a supervised learning algorithm used for classification tasks. Hence, it is also called the Naive Bayes classifier.

Naive Bayes assumes that features are independent of each other and that there is no correlation between features. However, this is not the case in real life. This naive assumption of uncorrelated features is the reason the algorithm is called “naive”.

The intuition behind the Naive Bayes algorithm is Bayes’ theorem:

p(A|B) = p(B|A) * p(A) / p(B)

p(A|B): Probability of event A given event B has already occurred

p(B|A): Probability of event B given event A has already occurred

p(A): Probability of event A

p(B): Probability of event B

The Naive Bayes classifier calculates the probability of a class given a set of feature values (i.e. p(yi | x1, x2, … , xn)). Plugging this into Bayes’ theorem:

p(yi | x1, x2, … , xn) = p(x1, x2, … , xn | yi) * p(yi) / p(x1, x2, … , xn)

p(x1, x2, … , xn | yi) is the probability of a specific combination of features (an observation / row in a dataset) given a class label. We would need extremely large datasets to estimate the probability distribution of all the different combinations of feature values. To overcome this issue, the Naive Bayes algorithm assumes that all features are independent of each other. Furthermore, the denominator (p(x1, x2, … , xn)) can be removed to simplify the equation because it only normalizes the value of the conditional probability of a class given an observation (p(yi | x1, x2, … , xn)).

The probability of a class (p(yi)) is very simple to calculate: it is just the fraction of observations in the training set that belong to class yi.

Under the assumption that the features are independent, p(x1, x2, … , xn | yi) can be written as:

p(x1, x2, … , xn | yi) = p(x1 | yi) * p(x2 | yi) * … * p(xn | yi)

The conditional probability for a single feature given the class label
(i.e. p(x1 | yi) ) can be more easily estimated from the data. The
algorithm needs to store probability distributions of features for
each class independently. For example, if there are 5 classes and
10 features, 50 different probability distributions need to be stored.

Adding all of this up, it becomes an easy task for the Naive Bayes algorithm to calculate the probability of observing a class given the feature values (p(yi | x1, x2, … , xn)).

The assumption that all features are independent makes the Naive Bayes algorithm very fast compared to more complicated algorithms. In some cases, speed is preferred over higher accuracy. On the other hand, the same assumption makes Naive Bayes less accurate than more complicated algorithms. Speed comes at a cost!
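For illustration only (the dataset and settings are my own, not the article's), here is a Gaussian Naive Bayes classifier in scikit-learn, which models each conditional probability p(xj | yi) as a normal distribution:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# GaussianNB estimates p(x_j | y_i) per feature and class as a normal distribution
nb = GaussianNB()
nb.fit(X_train, y_train)

# predict_proba returns p(y_i | x1, ..., xn) for each class
print("class probabilities for first test sample:", nb.predict_proba(X_test[:1]))
print("test accuracy:", nb.score(X_test, y_test))
```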

4. Logistic Regression

Logistic regression is a supervised learning algorithm that is mostly used for binary classification problems. Although “regression” seems to contradict “classification”, the focus here is on the word “logistic”, which refers to the logistic function that does the classification in this algorithm. Logistic regression is a simple yet very effective classification algorithm, so it is commonly used for many binary classification tasks. Customer churn, spam email, and website or ad click predictions are some examples of areas where logistic regression offers a powerful solution.

The basis of logistic regression is the logistic function, also called the sigmoid function, which takes any real-valued number and maps it to a value between 0 and 1:

σ(z) = 1 / (1 + e^(−z))

Consider a linear equation of the form z = b0 + b1x. The logistic regression model takes this linear equation as input and uses the logistic function and log odds to perform a binary classification task. Passing z through the sigmoid yields the famous S-shaped curve of logistic regression, and the output can be interpreted as a probability.

We can use the calculated probability “as is”. For example, the output can be “the probability that this email is spam is 95%” or “the probability that the customer will click on this ad is 70%”. However, in most cases, probabilities are used to classify data points: if the probability is greater than 50%, the prediction is the positive class (1); otherwise, the prediction is the negative class (0).

It is not always desirable to choose the positive class for every probability higher than 50%. Regarding the spam email case, we have to be almost sure in order to classify an email as spam. Since emails detected as spam go directly to the spam folder, we do not want the user to miss important emails, so emails are not classified as spam unless we are almost sure. On the other hand, classification in a health-related problem requires us to be much more sensitive: even if we are only slightly suspicious that a cell is malignant, we do not want to miss it. So the value that serves as the threshold between the positive and negative classes is problem-dependent. The good thing is that logistic regression allows us to adjust this threshold value.
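A small, hedged example of adjusting the classification threshold with scikit-learn; the synthetic data and the 0.9 cut-off are arbitrary choices of mine, not the article's:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=5, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = LogisticRegression()
clf.fit(X_train, y_train)

# predict_proba gives the sigmoid output; column 1 is the positive-class probability
proba = clf.predict_proba(X_test)[:, 1]

# Default threshold is 0.5; a stricter threshold (e.g. 0.9) suits a spam-filter setting
default_preds = (proba > 0.5).astype(int)
strict_preds = (proba > 0.9).astype(int)
print("positives at 0.5 threshold:", default_preds.sum())
print("positives at 0.9 threshold:", strict_preds.sum())
```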

5. K-Nearest Neighbors (kNN)

K-nearest neighbors (kNN) is a supervised learning algorithm that can be used to solve both classification and regression tasks. The main idea behind kNN is that the value or class of a data point is determined by the data points around it.

The kNN classifier determines the class of a data point by the majority voting principle. For instance, if k is set to 5, the classes of the 5 closest points are checked and the prediction is made according to the majority class. Similarly, kNN regression takes the mean value of the 5 closest points. Let’s go over an example. Consider the following data points, which belong to 4 different classes:

Let’s see how the predicted classes change according to the k value:

It is very important to determine an optimal k value. If k is too low, the model is too specific and does not generalize well. It also tends to be sensitive to noise. The model achieves high accuracy on the training set but will be a poor predictor on new, previously unseen data points; therefore, we are likely to end up with an overfit model. On the other hand, if k is too large, the model is too generalized and is not a good predictor on either the training or the test set. This situation is known as underfitting.

kNN is simple and easy to interpret. It does not make any assumptions about the data, so it can be applied to non-linear tasks. However, kNN becomes very slow as the number of data points increases because the model needs to store all of them, so it is also not memory efficient. Another downside of kNN is that it is sensitive to outliers.
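A quick sketch (my own, not from the article) comparing a few k values with scikit-learn's KNeighborsClassifier to show the overfitting/underfitting trade-off:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Compare a very small k (prone to overfitting) with larger values
for k in (1, 5, 25):
    knn = KNeighborsClassifier(n_neighbors=k)
    knn.fit(X_train, y_train)
    print(f"k={k}: train={knn.score(X_train, y_train):.2f}, "
          f"test={knn.score(X_test, y_test):.2f}")
```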

6. Decision Trees

A decision tree builds upon iteratively asking questions to partition the data. It is easier to conceptualize this partitioning with a visual representation of a decision tree:

This represents a decision tree for predicting customer churn. The first split is based on the monthly charges amount. Then the algorithm keeps asking questions to separate the class labels. The questions get more specific as the tree gets deeper.

The aim of the decision tree algorithm is to increase the predictiveness as much as possible at each partition so that the model keeps gaining information about the dataset. Randomly splitting the features does not usually give us valuable insight into the dataset. Splits that increase the purity of nodes are more informative. The purity of a node is inversely proportional to the distribution of different classes in that node. The questions to ask are chosen in a way that increases purity or decreases impurity.

How many questions do we ask? When do we stop? When is our tree sufficient to solve our classification problem? The answer to all of these questions leads us to one of the most important concepts in machine learning: overfitting. The model can keep asking questions until all the nodes are pure. However, this would be a very specific model that does not generalize well. It achieves high accuracy on the training set but performs poorly on new, previously unseen data points, which indicates overfitting. The depth of a tree is controlled by the max_depth parameter of the decision tree algorithm in scikit-learn.
The decision tree algorithm usually does not require normalizing or scaling the features. It is also suitable for working on a mixture of feature data types (continuous, categorical, binary). On the negative side, it is prone to overfitting and needs to be ensembled in order to generalize well.
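A minimal scikit-learn sketch of my own (the dataset choice is arbitrary) showing how the max_depth parameter mentioned above curbs overfitting:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# An unconstrained tree grows until nodes are pure and tends to overfit;
# limiting max_depth keeps the questions from getting too specific.
for depth in (None, 3):
    tree = DecisionTreeClassifier(max_depth=depth, random_state=0)
    tree.fit(X_train, y_train)
    print(f"max_depth={depth}: train={tree.score(X_train, y_train):.2f}, "
          f"test={tree.score(X_test, y_test):.2f}")
```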

6. Random Forest

A random forest is an ensemble of many decision trees. Random forests are built using a method called bagging, in which decision trees are used as parallel estimators. If used for a classification problem, the result is based on a majority vote of the results received from the individual decision trees. For regression, the prediction of a leaf node is the mean value of the target values in that leaf, and random forest regression takes the mean value of the results from the decision trees.

Random forests reduce the risk of overfitting, and their accuracy is much higher than that of a single decision tree. Furthermore, the decision trees in a random forest run in parallel so that training time does not become a bottleneck.

The success of a random forest highly depends on using uncorrelated decision trees. If we use the same or very similar trees, the overall result will not be much different from the result of a single decision tree. Random forests achieve uncorrelated decision trees through bootstrapping and feature randomness.

Bootstrapping is randomly selecting samples from the training data with replacement; the resulting samples are called bootstrap samples.

Bootstrap samples

Feature randomness is achieved by selecting features randomly for each decision tree in the random forest. The number of features used for each tree can be controlled with the max_features parameter.

Feature randomness
Random forest is a highly accurate model for many different problems and does not require normalization or scaling. However, compared to fast linear models (e.g. Naive Bayes), it is not a good choice for high-dimensional datasets (e.g. text classification).
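For illustration, a random forest in scikit-learn using the n_estimators and max_features parameters mentioned above; the dataset and values are my own assumptions, not the article's:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# n_estimators: number of bootstrapped trees; max_features: features considered per split
forest = RandomForestClassifier(n_estimators=200, max_features="sqrt", random_state=0)
forest.fit(X_train, y_train)

print("test accuracy:", forest.score(X_test, y_test))
```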

7. Gradient Boosted Decision Trees (GBDT)

GBDT is an ensemble algorithm that uses boosting to combine individual decision trees.

Boosting means combining learners in series so that a strong learner is obtained from many sequentially connected weak learners. In the case of GBDT, the weak learners are decision trees.

Each tree attempts to minimize the errors of the previous tree. The trees in boosting are weak learners, but adding many trees in series, each focusing on the errors of the previous one, makes boosting a highly efficient and accurate model. Unlike bagging, boosting does not involve bootstrap sampling: every time a new tree is added, it is fit on a modified version of the initial dataset.

Since trees are added sequentially, boosting algorithms learn slowly. In statistical learning, models that learn slowly tend to perform better.

A loss function is used to detect the residuals. For instance, mean squared error (MSE) can be used for regression tasks and logarithmic loss (log loss) for classification tasks. It is worth noting that the existing trees in the model do not change when a new tree is added; the added decision tree fits the residuals of the current model.

Learning rate and n_estimators are two critical hyperparameters for gradient boosted decision trees. The learning rate, denoted as α, simply controls how fast the model learns: each new tree modifies the overall model, and the magnitude of the modification is controlled by the learning rate. n_estimators is the number of trees used in the model. If the learning rate is low, we need more trees to train the model. However, we need to be very careful when selecting the number of trees, because using too many trees creates a high risk of overfitting.

GBDT is very efficient on both classification and regression tasks and provides more accurate predictions than random forests. It can handle mixed types of features and no pre-processing is needed. However, GBDT requires careful tuning of its hyperparameters in order to prevent the model from overfitting.

The GBDT algorithm is so powerful that many upgraded versions of it have been implemented, such as XGBoost, LightGBM, and CatBoost.
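A short sketch using scikit-learn's GradientBoostingClassifier (just one of several GBDT implementations); the learning_rate and n_estimators values below are arbitrary examples of mine, not recommendations:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# A low learning_rate usually needs more trees (n_estimators); too many trees can overfit
gbdt = GradientBoostingClassifier(n_estimators=200, learning_rate=0.05,
                                  max_depth=3, random_state=0)
gbdt.fit(X_train, y_train)

print("test accuracy:", gbdt.score(X_test, y_test))
```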

Note on overfitting

One key difference between random forests and gradient boosted decision trees is the number of trees used in the model. Increasing the number of trees in a random forest does not cause overfitting: after some point, the accuracy of the model does not increase by adding more trees, but it is also not negatively affected by adding excessive trees. You still do not want to add an unnecessary number of trees for computational reasons, but there is no risk of overfitting associated with the number of trees in a random forest.

However, the number of trees in gradient boosted decision trees is very critical with respect to overfitting. Adding too many trees will cause overfitting, so it is important to stop adding trees at some point.

8. K-Means Clustering

Clustering is a way to group a set of data points so that similar data points are grouped together. Therefore, clustering algorithms look for similarities or dissimilarities among data points. Clustering is an unsupervised learning method, so there is no label associated with the data points. Clustering algorithms try to find the underlying structure of the data.

Clustering is not classification.

Observations (or data points) in a classification task have labels. Each observation is classified according to some measurements. Classification algorithms try to model the relationship between the measurements (features) of observations and their assigned class. Then the model predicts the class of new observations.

K-means clustering aims to partition the data into k clusters in such a way that data points in the same cluster are similar and data points in different clusters are farther apart. Thus, it is a partition-based clustering technique. The similarity of two points is determined by the distance between them.

K-means clustering tries to minimize distances within a cluster and maximize the distance between different clusters. The k-means algorithm is not capable of determining the number of clusters; we need to define it when creating the KMeans object, which may be a challenging task.

Consider the following 2D visualization of a dataset:

It can be partitioned into 4 different clusters as below:

Real-life datasets are much more complex, and their clusters are not clearly separated, but the algorithm works in the same way. K-means is an iterative process built on the expectation-maximization algorithm. After the number of clusters is determined, it works by executing the following steps:

1. Randomly select centroids (center of cluster) for each cluster.

2. Calculate the distance of all data points to the centroids.

3. Assign data points to the closest cluster.

4. Find the new centroids of each cluster by taking the mean of all
data points in the cluster.

5. Repeat steps 2, 3 and 4 until the assignments converge and the cluster centers stop moving.

K-means clustering is relatively fast and easy to interpret. It is also able to choose the positions of the initial centroids in a smart way that speeds up convergence.

One challenge with k-means is that the number of clusters must be pre-determined; the algorithm is not able to guess how many clusters exist in the data. Also, if there is a non-linear structure separating the groups in the data, k-means will not be a good choice.
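A minimal sketch with scikit-learn's KMeans on synthetic blobs (my own example, not the article's); init="k-means++" is the smart centroid initialization referred to above:

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Four well-separated blobs, similar to the 2D example above
X, _ = make_blobs(n_samples=400, centers=4, cluster_std=0.8, random_state=0)

# n_clusters must be chosen by us; init="k-means++" picks smart initial centroids
kmeans = KMeans(n_clusters=4, init="k-means++", n_init=10, random_state=0)
labels = kmeans.fit_predict(X)

print("cluster centers:\n", kmeans.cluster_centers_)
print("inertia (within-cluster sum of squares):", kmeans.inertia_)
```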

9. Hierarchical Clustering

Hierarchical clustering means creating a tree of clusters by iteratively grouping or separating data points. There are two types of hierarchical clustering:

• Agglomerative clustering

• Divisive clustering

One of the advantages of hierarchical clustering is that we do not have to specify the number of clusters (but we can).

Agglomerative clustering is a bottom-up approach: each data point is assumed to be a separate cluster at first, and then the most similar clusters are iteratively combined.

The figure above is called a dendrogram, a diagram representing the tree-based approach. In hierarchical clustering, dendrograms are used to visualize the relationship among clusters.

One of the advantages of hierarchical clustering is that we do not have to specify the number of clusters beforehand. However, it is not wise to combine all data points into one cluster; we should stop combining at some point. Scikit-learn provides two options for this:

• Stop after a number of clusters is reached (n_clusters)

• Set a threshold value for linkage (distance_threshold). If the distance between two clusters is above the threshold, these clusters will not be merged.

Divisive clustering is not commonly used in real life, so I will mention it briefly. A simple yet clear explanation is that divisive clustering is the opposite of agglomerative clustering: we start with one giant cluster that includes all data points, and then data points are iteratively separated into different clusters. It is a top-down approach.

Hierarchical clustering always generates the same clusters, whereas k-means clustering may result in different clusters depending on how the centroids (cluster centers) are initialized. However, hierarchical clustering is slower than k-means and takes a long time to run, especially for large datasets.
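A small example of my own showing both stopping options in scikit-learn's AgglomerativeClustering; the dataset and the distance_threshold value are arbitrary assumptions:

```python
from sklearn.cluster import AgglomerativeClustering
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=200, centers=3, random_state=0)

# Option 1: stop merging when a fixed number of clusters is reached
agg_fixed = AgglomerativeClustering(n_clusters=3)
print("first 10 labels with n_clusters=3:", agg_fixed.fit_predict(X)[:10])

# Option 2: stop merging when the linkage distance exceeds a threshold
agg_threshold = AgglomerativeClustering(n_clusters=None, distance_threshold=10.0)
agg_threshold.fit(X)
print("clusters found with distance_threshold=10:", agg_threshold.n_clusters_)
```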

10. DBSCAN Clustering

Partition-based and hierarchical clustering techniques are highly efficient with normally shaped clusters. However, when it comes to arbitrary-shaped clusters or detecting outliers, density-based techniques are more efficient.

Arbitrary shaped clusters


DBSCAN stands for density-based spatial clustering of applications with noise. It is able to find arbitrary-shaped clusters as well as clusters with noise (i.e. outliers).

The main idea behind DBSCAN is that a point belongs to a cluster if it is close to many points from that cluster.

There are two key parameters of DBSCAN:

• eps: The distance that specifies the neighborhoods. Two points are considered neighbors if the distance between them is less than or equal to eps.

• minPts: Minimum number of data points to define a cluster.

Based on these two parameters, points are classified as core points, border points, or outliers:

• Core point: A point is a core point if there are at least minPts points (including the point itself) within its surrounding area of radius eps.

• Border point: A point is a border point if it is reachable from a core point and there are fewer than minPts points within its surrounding area.

• Outlier: A point is an outlier if it is not a core point and is not reachable from any core point.

DBSCAN does not require the number of clusters to be specified beforehand. It is robust to outliers and able to detect them.

In some cases, determining an appropriate neighborhood distance (eps) is not easy and requires domain knowledge.
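A brief sketch (my own, with arbitrary parameter values) of DBSCAN in scikit-learn, where min_samples plays the role of minPts and points labeled -1 are treated as outliers:

```python
import numpy as np
from sklearn.cluster import DBSCAN
from sklearn.datasets import make_moons

# Two arbitrary-shaped (crescent) clusters with a little noise
X, _ = make_moons(n_samples=300, noise=0.05, random_state=0)

# eps: neighborhood radius; min_samples: scikit-learn's name for minPts
db = DBSCAN(eps=0.2, min_samples=5)
labels = db.fit_predict(X)

# Points labeled -1 are noise (outliers)
print("clusters found:", len(set(labels)) - (1 if -1 in labels else 0))
print("outliers:", np.sum(labels == -1))
```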

11. Principal Component Analysis (PCA)

PCA is a dimensionality reduction algorithm that basically derives new features from the existing ones while keeping as much information as possible. PCA is an unsupervised learning algorithm, but it is also widely used as a preprocessing step for supervised learning algorithms.

PCA derives the new features by finding the relations among the features within a dataset.

Note: PCA is a linear dimensionality reduction algorithm. There are also non-linear methods available.

The aim of PCA is to explain as much of the variance within the original dataset as possible using fewer features (or columns). The new derived features are called principal components. The order of the principal components is determined according to the fraction of the variance of the original dataset that they explain. The principal components are linear combinations of the features of the original dataset.

The advantage of PCA is that a significant amount of the variance of the original dataset is retained using a much smaller number of features. The principal components are ordered according to the amount of variance they explain.
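To illustrate, a minimal PCA sketch in scikit-learn of my own; standardizing the features first and keeping 2 components are assumptions for this example, not the article's prescription:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X, _ = load_breast_cancer(return_X_y=True)

# PCA is scale-sensitive, so standardize the 30 original features first
X_scaled = StandardScaler().fit_transform(X)

# Keep only 2 principal components and check how much variance they retain
pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X_scaled)

print("reduced shape:", X_reduced.shape)
print("explained variance ratio:", pca.explained_variance_ratio_)
```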

Thank you for reading. Please let me know if you have any
feedback.
