Guide
● Figure 1-5
Figure 1-5 shows a supervised machine learning system; in this system, the training
set fed into the algorithm includes the desired outcomes (solutions), referred to
as "labels". A typical supervised learning task is classification; an example of
this task is spam email detection, whereby unwanted emails are labelled as
spam and wanted emails are labelled as ham. Another task is prediction, where
the system must predict a target numeric value; this is referred to as the regression
task.
Some regression algorithms can be used to carry out classification tasks as well (for example, Logistic Regression).
● Figure 1-7
Figure 1-7 shows an unlabelled training set for an unsupervised machine learning system.
● Figure 1-11
Figure 1-11 shows semi-supervised learning with two classes (triangles and
squares); the unlabelled instances help classify the new instance (X) into the triangle
class. Semi-supervised learning uses plenty of unlabelled instances and only a
few labelled instances. It is a combination of supervised and unsupervised
learning algorithms.
● Figure 1-12
Figure 1-12 shows reinforcement learning, which is typical of robots. The learning
system, called an agent, observes the environment, selects and performs
actions, and gets rewards (or penalties) in return.
● Figure 1-15
In instance-based learning, the machine learns the examples by heart and then
generalizes to new cases by using a similarity measure to compare the new
instances to the learned examples.
● Figure 1-16
This shows that the model is overfitting the training set: the model/algorithm is
too complex, the validation (generalization) error is high, and the training error is
low. To address this, regularization should be applied, more training data is
needed, and the noise in the data needs to be reduced.
● Figure 1-23
This image shows the effect of regularization on the model: constraining the model
makes it simpler, so it fits the training data slightly worse but generalizes better to new instances.
Chapter 2
Once the dataset is obtained, the first thing to do is to visualize the data via
plots. Afterwards, the data is split in two: the training data and the test data. The
split is carried out by picking instances randomly, as in the sketch below.
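The code the notes refer to is not included; a minimal sketch of such a random split, assuming a pandas DataFrame (here called housing, as in the book's running example) and NumPy:

import numpy as np

def split_train_test(data, test_ratio):
    # shuffle the row indices, then carve off the first test_ratio share as the test set
    shuffled_indices = np.random.permutation(len(data))
    test_set_size = int(len(data) * test_ratio)
    test_indices = shuffled_indices[:test_set_size]
    train_indices = shuffled_indices[test_set_size:]
    return data.iloc[train_indices], data.iloc[test_indices]

train_set, test_set = split_train_test(housing, 0.2)  # 20% test / 80% training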
The split is usually 20% test data and 80% training data. After training the model using
the training set, the model is evaluated using the test set. By evaluating the model
on the test set, we can estimate the generalization error, and this value tells us how
the model would perform on new instances.
Every time this function is run, it produces different sets because of its
randomness; a way to lock this is to set the random number generator's seed.
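For example, either by seeding NumPy before the split or, if Scikit-Learn is used, by passing a fixed random_state to train_test_split:

np.random.seed(42)   # makes np.random.permutation reproducible across runs

from sklearn.model_selection import train_test_split
train_set, test_set = train_test_split(housing, test_size=0.2, random_state=42)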
When creating a test set, it is important that it is representative of
the whole dataset. Random sampling is generally fine if the dataset is large enough
(especially relative to the number of features), but if it is not, one runs the risk of
introducing significant sampling bias. A way of avoiding this is to carry out stratified sampling:
dividing the dataset into homogeneous subgroups (strata) and sampling the right
number of instances from each stratum, so that the test set is representative of the entire dataset.
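A sketch of stratified sampling with Scikit-Learn's StratifiedShuffleSplit, assuming a categorical column (here income_cat, as in the book's housing example) has been created to stratify on:

from sklearn.model_selection import StratifiedShuffleSplit

split = StratifiedShuffleSplit(n_splits=1, test_size=0.2, random_state=42)
for train_index, test_index in split.split(housing, housing["income_cat"]):
    strat_train_set = housing.loc[train_index]   # each income category is represented proportionally
    strat_test_set = housing.loc[test_index]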
● Be able to answer questions pertaining to training and evaluating on the training set, as
discussed in "Training and Evaluating on the Training Set," pp. 72-73
When training on your data, different algorithms can be used and compared for accuracy. For a
prediction (regression) task we can use the Linear Regression model. After trying it on a few instances
from the training set, it was observed that the predictions are not very accurate, so we measured the
performance of the model using the RMSE (root mean square error). A Decision Tree model is then
trained and evaluated on the training set as well. Cross-validation techniques help to evaluate the
trained model better: there are cases where the model is overfitting the training data but this is not
evident from the training-set RMSE, which may even report a perfect model. The best thing is to
cover all bases by carrying out cross-validation.
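A sketch of that comparison, assuming the book's Chapter 2 variable names (housing_prepared for the prepared features, housing_labels for the targets):

from sklearn.linear_model import LinearRegression
from sklearn.tree import DecisionTreeRegressor
from sklearn.metrics import mean_squared_error
import numpy as np

lin_reg = LinearRegression()
lin_reg.fit(housing_prepared, housing_labels)
lin_rmse = np.sqrt(mean_squared_error(housing_labels, lin_reg.predict(housing_prepared)))

tree_reg = DecisionTreeRegressor()
tree_reg.fit(housing_prepared, housing_labels)
tree_rmse = np.sqrt(mean_squared_error(housing_labels, tree_reg.predict(housing_prepared)))
# tree_rmse can come out as 0.0 -- a sign of overfitting, not of a perfect model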
One way to start is to split the training set into a smaller training set and a validation set, train on
the former and evaluate on the latter. A better alternative is to use Scikit-Learn's K-fold cross-
validation feature, which splits the training set into several folds (e.g., 10 folds), sets one
aside for evaluation each time and trains on the other 9. The result is an array containing 10
evaluation scores. The benefit of this is that it shows whether the model is overfitting
the training data.
scoring="neg_mean_squared_error", cv=10)
Chapter 3
● The MNIST dataset is very (very) widely used. As such, you should be
able to describe the MNIST dataset, as well as explain Python code
segments in the section "MNIST," pp. 85-87
The first step is to fetch the data; the dataset can be downloaded through Scikit-Learn. It has a
dictionary-like structure, including: a DESCR key describing the dataset, a data key
containing an array with one row per instance and one column per feature, and a target key
containing the array of labels.
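The fetch code is not reproduced in these notes; a minimal sketch using Scikit-Learn's fetch_openml:

from sklearn.datasets import fetch_openml
import numpy as np

mnist = fetch_openml('mnist_784', version=1, as_frame=False)  # as_frame=False: ask for NumPy arrays
X, y = mnist["data"], mnist["target"]
y = y.astype(np.uint8)   # the labels come back as strings, so cast them to integers
X.shape                  # (70000, 784)
y.shape                  # (70000,)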
The shapes above show the data size: there are 70,000 images, each having
784 features, because each image is 28 × 28 pixels (each feature is simply one pixel's intensity).
The next code snippet reshapes one instance's feature vector into a 28 × 28 array and displays it
using Matplotlib's imshow() function.
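That snippet is not included either; a sketch of what it presumably looks like, plus the standard train/test split that the later snippets rely on:

import matplotlib.pyplot as plt

some_digit = X[0]                               # take the first instance
some_digit_image = some_digit.reshape(28, 28)   # back to a 28 x 28 pixel grid
plt.imshow(some_digit_image, cmap="binary")
plt.axis("off")
plt.show()

# MNIST is already arranged as a train/test split: first 60,000 images for training, last 10,000 for testing
X_train, X_test, y_train, y_test = X[:60000], X[60000:], y[:60000], y[60000:]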
A binary classifier handles a classification task with two class labels, e.g. a normal
state and an abnormal state. In this section we are trying to classify which images are 5s and which
are not 5s:
y_train_5 = (y_train == 5) # True for all 5s, False for all other digits
y_test_5 = (y_test == 5)
The SGD classifier guesses that the image represents a 5; to evaluate the model's
performance we use cross-validation.
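The training and prediction steps are not shown in the notes; a sketch with Scikit-Learn's SGDClassifier, using the variables defined above:

from sklearn.linear_model import SGDClassifier

sgd_clf = SGDClassifier(random_state=42)
sgd_clf.fit(X_train, y_train_5)
sgd_clf.predict([some_digit])   # array([ True]): the classifier guesses this image is a 5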
from sklearn.model_selection import StratifiedKFold
from sklearn.base import clone

skfolds = StratifiedKFold(n_splits=3, shuffle=True, random_state=42)

for train_index, test_index in skfolds.split(X_train, y_train_5):
    clone_clf = clone(sgd_clf)
    X_train_folds = X_train[train_index]
    y_train_folds = y_train_5[train_index]
    X_test_fold = X_train[test_index]
    y_test_fold = y_train_5[test_index]
    clone_clf.fit(X_train_folds, y_train_folds)
    y_pred = clone_clf.predict(X_test_fold)
    print(sum(y_pred == y_test_fold) / len(y_pred))  # ratio of correct predictions on this fold
The StratifiedKFold class performs stratified sampling to produce folds that contain
a representative ratio of each class.
At each iteration the code creates a clone of the classifier, trains that clone on the
training folds, and then makes predictions on the test fold. It then counts the
number of correct predictions and outputs the ratio of correct predictions.
Second code snippet
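The snippet itself is not reproduced in the notes; it presumably resembles:

from sklearn.model_selection import cross_val_score

cross_val_score(sgd_clf, X_train, y_train_5, cv=3, scoring="accuracy")
# returns one accuracy score per fold (an array of 3 values)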
The code above uses the cross_val_score() function to evaluate the
SGDClassifier model. The first thing it does is create k folds, in this case 3
folds; then it trains and evaluates the model 3 times, picking a different fold for evaluation
each time and training on the other 2 folds. It gives above 93% accuracy on all cross-
validation folds.
Next, look at a dumb classifier that classifies every image as not-5 and evaluate it with the same
cross-validation:
from sklearn.base import BaseEstimator
import numpy as np

class Never5Classifier(BaseEstimator):
    def fit(self, X, y=None):
        return self                                # nothing to learn
    def predict(self, X):
        return np.zeros((len(X), 1), dtype=bool)   # always predict "not a 5"
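The notes then jump straight to interpreting a confusion matrix; the computation itself is not shown, but it presumably uses out-of-sample predictions from cross_val_predict:

from sklearn.model_selection import cross_val_predict
from sklearn.metrics import confusion_matrix

y_train_pred = cross_val_predict(sgd_clf, X_train, y_train_5, cv=3)
confusion_matrix(y_train_5, y_train_pred)   # rows = actual classes, columns = predicted classes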
The rows are the actual classes and the columns are the predicted classes. The
interpretation of the result is that 53,057 instances were correctly classified as
non-5s, while 1,522 were wrongly classified as 5s. The second row
shows that 1,325 were wrongly classified as non-5s, while 4,095 were correctly
classified as 5s.
● Precision/Recall Trade-off, especially Figure 3-3 and Figure 3-4, along with
associated Python code, pp. 93-97
Precision is a measure of the accuracy of the positive predictions; recall is the ratio of
the positive instances that are correctly detected by the classifier.
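With Scikit-Learn these can be computed directly from the cross-validated predictions above:

from sklearn.metrics import precision_score, recall_score

precision_score(y_train_5, y_train_pred)   # TP / (TP + FP)
recall_score(y_train_5, y_train_pred)      # TP / (TP + FN)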
The image shows precision and recall at a few different decision thresholds; each threshold yields a different precision/recall pair.
The higher the precision, the lower the recall; you can't have both at their highest at the same
time, and that is what the trade-off means. You have to decide whether you want the model
to favour precision or recall. For each instance, the classifier computes a score based on its
decision function, and the trade-off comes from the threshold applied to that score.
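The code the next paragraph refers to is not reproduced; a sketch of thresholding the decision function manually (the 8000 value is just an illustrative large threshold):

y_scores = sgd_clf.decision_function([some_digit])   # raw score for this instance
threshold = 0
y_some_digit_pred = (y_scores > threshold)    # True: predicted as a 5

threshold = 8000                              # a much higher threshold
y_some_digit_pred = (y_scores > threshold)    # now False: this actual 5 is missed, so recall drops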
If the score is greater than the threshold, the instance is assigned to the positive class; if it is less
than the threshold, it is assigned to the negative class, as the code above shows. This confirms that
raising the threshold decreases the recall: the image is a 5, but when the threshold is raised
the classifier misses it, so it is important to know which threshold to use.
To do this, use the cross_val_predict() function to get the scores of all the instances in the
training set, specifying that you want it to return decision scores;
then use the precision_recall_curve() function to compute the precision and recall
for all possible thresholds; afterwards use Matplotlib to plot precision and
recall as functions of the threshold.
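A sketch of those three steps:

from sklearn.model_selection import cross_val_predict
from sklearn.metrics import precision_recall_curve

y_scores = cross_val_predict(sgd_clf, X_train, y_train_5, cv=3,
                             method="decision_function")
precisions, recalls, thresholds = precision_recall_curve(y_train_5, y_scores)

plt.plot(thresholds, precisions[:-1], "b--", label="Precision")  # one fewer threshold than precision values
plt.plot(thresholds, recalls[:-1], "g-", label="Recall")
plt.legend()
plt.show()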
This is the precision and recall versus decision threshold plot; it helps to decide which threshold to
use, since it shows precision and recall for all possible thresholds.
● Understand Error Analysis, especially the included Python code and graphs,
pp. 102-105
Error analysis: this is the process of analyzing the types of errors a model
makes.
I. Make predictions using the cross_val_predict() function, then compute the confusion
matrix using the confusion_matrix() function and plot it (e.g., with Matplotlib's matshow()):
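A sketch, assuming sgd_clf has by now been trained on the full 10-class problem and X_train_scaled is the standardized training set used in the book:

from sklearn.model_selection import cross_val_predict
from sklearn.metrics import confusion_matrix
import matplotlib.pyplot as plt

y_train_pred = cross_val_predict(sgd_clf, X_train_scaled, y_train, cv=3)
conf_mx = confusion_matrix(y_train, y_train_pred)

plt.matshow(conf_mx, cmap=plt.cm.gray)   # most images land on the main diagonal (correct classifications)
plt.show()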
The 5s look slightly darker than the other digits, which could mean that there
are fewer images of 5s in the dataset and/or that the classifier does not
perform as well on 5s as on other digits.
II. Divide each value by the number of images in the corresponding class (to compare error rates), fill the diagonal with zeros to keep only the errors, and plot the result:
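Continuing from the previous snippet:

import numpy as np

row_sums = conf_mx.sum(axis=1, keepdims=True)
norm_conf_mx = conf_mx / row_sums         # compare error rates rather than absolute counts
np.fill_diagonal(norm_conf_mx, 0)         # keep only the errors
plt.matshow(norm_conf_mx, cmap=plt.cm.gray)
plt.show()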
Results: The confusion matrix is not necessarily symmetrical. The errors
include:
I. The column for class 8 is quite bright, which tells you that many
images get misclassified as 8s.
II. However, the row for class 8 is not that bad, telling you that
actual 8s, in general, get properly classified as 8s.
III. 3s and 5s often get confused (in both directions).
Solution: Time should be spent on reducing the false 8s. For example:
I. Gather more training data for digits that look like 8s (but are not) so
that the classifier can learn to distinguish them from real 8s.
II. Engineer new features that would help the classifier, for example,
writing an algorithm to count the number of closed loops (e.g., 8 has two,
6 has one, 5 has none).
III. Preprocess the images (e.g., using Scikit-Image, Pillow, or OpenCV) to
make some patterns, such as closed loops, stand out more.
For example: plot examples of 3s and 5s (the plot_digits() function just uses
Matplotlib's imshow() function).
The reason they get confused is that the SGDClassifier is a simple linear model.
All it does is assign a weight per class to each pixel, and when it
sees a new image it just sums up the weighted pixel intensities
to get a score for each class. So since 3s and 5s differ only by
a few pixels, this model will easily confuse them.
Chapter 4
This (the Linear Regression model) is one of the simplest models there is. It is a supervised learning model.
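A minimal sketch of fitting a linear model with Scikit-Learn, on toy data in the spirit of the chapter's running example:

import numpy as np
from sklearn.linear_model import LinearRegression

X = 2 * np.random.rand(100, 1)              # one input feature
y = 4 + 3 * X + np.random.randn(100, 1)     # y = 4 + 3x + Gaussian noise

lin_reg = LinearRegression()
lin_reg.fit(X, y)
lin_reg.intercept_, lin_reg.coef_           # should come out close to 4 and 3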
● Gradient Descent: Explain Figure 4-3, Figure 4-4, Figure 4-5, Figure 4-6, Figure
4-7, pp. 118-121
How it works:
Gradient Descent measures the local gradient of the error function with regard
to the parameter vector θ, and it goes in the direction of descending gradient.
Once the gradient is zero, you have reached a minimum!
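A sketch of Batch Gradient Descent for the linear model above (X and y from the earlier toy example; X_b is X with a bias column of 1s prepended):

X_b = np.c_[np.ones((len(X), 1)), X]   # add x0 = 1 to each instance

eta = 0.1                              # learning rate
n_iterations = 1000
m = len(X_b)

theta = np.random.randn(2, 1)          # random initialization of the parameter vector
for iteration in range(n_iterations):
    gradients = 2 / m * X_b.T.dot(X_b.dot(theta) - y)   # gradient of the MSE cost function
    theta = theta - eta * gradients                     # step in the direction of descending gradient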
Figure 4-3:
In this depiction of GD, the algorithm starts by filling θ with random values (this is called
random initialization). Then it gradually tweaks the model parameters to minimize
the cost function (e.g., the MSE) over the training set, until the algorithm
converges to a minimum.
Also, the steps gradually get smaller as the parameters approach the minimum,
because the step size is proportional to the slope of the cost
function.
Figure 4-4:
In this depiction of GD, the learning rate is too small, so the algorithm has to go through many
iterations to converge, which takes a long time.
Figure 4-5:
In this depiction of GD, the learning rate is too high, so GD might jump
across the valley and end up possibly even higher than it was before.
This might make the algorithm diverge, with larger and larger values, failing to
find a good solution.
Figure 4-6:
In this depiction of GD, the cost function has holes, ridges, plateaus, and all
sorts of irregularities, making convergence to the minimum difficult.
Figure 4-7:
In this depiction of GD, the cost function has the shape of a bowl.
The figure shows GD on a training set where features 1 and 2 have the same scale (on the left),
and on a training set where feature 1 has much smaller values than feature 2 (on
the right); in the latter case the bowl is elongated, so GD takes much longer to converge,
which is why features should be scaled before training.
Polynomial Regression (PR) is a more complex model that can fit non-linear datasets.
How to do PR: add powers of each feature as new features, then train a linear model on this
extended set of features.
Demerit: a high-degree polynomial regression will fit the training data
much better than a plain linear model, but it is likely to severely overfit it.
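A sketch using Scikit-Learn's PolynomialFeatures (degree 2, as in the book's example), applied here to the earlier toy data just to show the mechanics (the book uses a quadratic dataset):

from sklearn.preprocessing import PolynomialFeatures

poly_features = PolynomialFeatures(degree=2, include_bias=False)
X_poly = poly_features.fit_transform(X)   # X_poly now contains both x and x^2 for each instance
lin_reg = LinearRegression()
lin_reg.fit(X_poly, y)                    # a plain linear model fit on the extended feature set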
Figure 4-15:
In this depiction of learning curves, the linear model's performance on the training set and the
validation set is shown as a function of the training set size. In conclusion, this
model underfits the training data.
When there are just one or two instances in the training set, the model
can fit them perfectly, which is why the curve starts at zero. But as new
instances are added to the training set, it becomes impossible for the
model to fit the training data perfectly, both because the data is noisy and
because it is not linear at all. So the error on the training data goes up
until it reaches a plateau, at which point adding new instances to the
training set doesn’t make the average error much better or worse.
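The learning curves in Figures 4-15 and 4-16 are typically produced by a helper such as the following sketch (the book defines a similar plot_learning_curves() function):

import numpy as np
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

def plot_learning_curves(model, X, y):
    X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2)
    train_errors, val_errors = [], []
    for m in range(1, len(X_train)):
        model.fit(X_train[:m], y_train[:m])            # train on the first m instances only
        y_train_predict = model.predict(X_train[:m])
        y_val_predict = model.predict(X_val)
        train_errors.append(mean_squared_error(y_train[:m], y_train_predict))
        val_errors.append(mean_squared_error(y_val, y_val_predict))
    plt.plot(np.sqrt(train_errors), "r-+", linewidth=2, label="train")   # RMSE on the training set
    plt.plot(np.sqrt(val_errors), "b-", linewidth=3, label="val")        # RMSE on the validation set
    plt.legend()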
Figure 4-16:
In this depiction of learning curves, the 10th-degree polynomial model's performance on
the training set and the validation set is shown as a function of the training set size. In
conclusion, this model overfits the training data.
Solution: feed the model more training data (increase the size of the training set)
until the validation error reaches the training error.
These learning curves look a bit like the previous ones, but there are two very important
differences:
I. The error on the training data is much lower (than with the linear model).
II. There is a gap between the curves. This means that the model performs
significantly better on the training data than on the validation data, which is the
hallmark of an overfitting model.
● Early Stopping, as illustrated in Figure 4-20, p. 141, as well as the Python code
on page 142
Figure 4-20:
In this depiction of learning curves, a complex model (high-degree Polynomial
Regression) being trained with Batch Gradient Descent demonstrates early stopping.
As the epochs go by the algorithm learns, and its prediction error (RMSE) on the
training set and validation set both decrease. After a while though, the validation
error stops decreasing and starts to go back up (this indicates that the model has
started to overfit the training data). With early stopping, you just stop training as soon as
the validation error reaches the minimum (this is indicated as the best model).
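The page-142 code is not reproduced in the notes; a sketch of basic early stopping with warm starts, assuming X_train_poly_scaled / X_val_poly_scaled are the preprocessed training and validation sets and y_train / y_val their targets:

from copy import deepcopy
from sklearn.linear_model import SGDRegressor
from sklearn.metrics import mean_squared_error

sgd_reg = SGDRegressor(max_iter=1, tol=None, warm_start=True,
                       penalty=None, learning_rate="constant", eta0=0.0005)

minimum_val_error = float("inf")
best_epoch, best_model = None, None
for epoch in range(1000):
    sgd_reg.fit(X_train_poly_scaled, y_train)       # warm_start=True: each fit continues where it left off
    y_val_predict = sgd_reg.predict(X_val_poly_scaled)
    val_error = mean_squared_error(y_val, y_val_predict)
    if val_error < minimum_val_error:               # keep the model with the lowest validation error so far
        minimum_val_error = val_error
        best_epoch = epoch
        best_model = deepcopy(sgd_reg)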
● Explain Decision Boundary, as illustrated in Figure 4-23, p. 146
This figure depicts a model’s (log_reg) estimated probabilities for flowers with petal
widths varying from 0 cm to 3 cm.
The petal width of Iris virginica flowers (represented by triangles) ranges from 1.4
cm to 2.5 cm, while the other iris flowers (represented by squares) generally have a
smaller petal width, ranging from 0.1 cm to 1.8 cm.
Notice that there is a bit of overlap:
Above about 2 cm the classifier is highly confident that the flower is an Iris virginica
(it outputs a high probability for that class), while below 1 cm it is highly
confident that it is not an Iris virginica (a high probability for the "Not Iris virginica" class).
In between these extremes, the classifier is unsure. However, if you ask it to predict
the class (using the predict() method rather than the predict_proba() method), it will
return whichever class is the most likely. Therefore, there is a decision boundary
at around 1.6 cm where both probabilities are equal to 50%: if the petal width is
higher than 1.6 cm, the classifier will predict that the flower is an Iris virginica, and
otherwise it will predict that it is not (even if it is not very confident).
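A sketch of the model behind this figure, based on the book's Iris example:

import numpy as np
from sklearn import datasets
from sklearn.linear_model import LogisticRegression

iris = datasets.load_iris()
X = iris["data"][:, 3:]                     # petal width (cm) only
y = (iris["target"] == 2).astype(int)       # 1 if Iris virginica, else 0

log_reg = LogisticRegression()
log_reg.fit(X, y)

X_new = np.linspace(0, 3, 1000).reshape(-1, 1)
y_proba = log_reg.predict_proba(X_new)
decision_boundary = X_new[y_proba[:, 1] >= 0.5][0]   # around 1.6 cm
log_reg.predict([[1.7], [1.5]])                      # array([1, 0])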