# Hyperparameters and Model Validation
In the previous section, we saw the basic recipe for applying a supervised machine learning model:

1. Choose a class of model.
2. Choose model hyperparameters.
3. Fit the model to the training data.
4. Use the model to predict labels for new data.
The first two pieces of this—the choice of model and choice of hyperparameters
—are perhaps the most important part of using these tools and techniques
effectively. In order to make an informed choice, we need a way to validate that
our model and our hyperparameters are a good fit to the data. While this may
sound simple, there are some pitfalls that you must avoid to do this effectively.
The following sections first show a naive approach to model validation and why
it fails, before exploring the use of holdout sets and cross-validation for more
robust model evaluation.
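We begin with a naive approach: training a model and evaluating it on the very same data. The cells that load the data and choose the model are not shown in this extract; a minimal sketch consistent with the surrounding discussion (the 150-sample Iris dataset and a one-nearest-neighbor classifier are both inferred from the text below, not confirmed by it) is:

from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier

# load the 150-sample Iris dataset as a feature matrix X and target array y
iris = load_iris()
X, y = iris.data, iris.target

# choose a model class and hyperparameters: a one-nearest-neighbor classifier
model = KNeighborsClassifier(n_neighbors=1)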
Then we train the model, and use it to predict labels for data we already know:
In [3]: model.fit(X, y)
y_model = model.predict(X)
In [4]: from sklearn.metrics import accuracy_score
        accuracy_score(y, y_model)

Out[4]: 1.0
We see an accuracy score of 1.0, which indicates that 100% of points were
correctly labeled by our model! But is this truly measuring the expected
accuracy? Have we really come upon a model that we expect to be correct 100%
of the time?
As you may have gathered, the answer is no. In fact, this approach contains a
fundamental flaw: it trains and evaluates the model on the same data.
Furthermore, the nearest neighbor model is an instance-based estimator that
simply stores the training data, and predicts labels by comparing new data to
these stored points: except in contrived cases, it will get 100% accuracy every
time!
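A better sense of a model's performance can be found using a holdout set: that is, we hold back some subset of the data from the training of the model, and then use this holdout set to check the model's performance. The cell producing the output below is not included in this extract; a minimal sketch using Scikit-Learn's train_test_split (the 50/50 split, the random seed, and the X1, X2, y1, y2 names are assumptions) is:

from sklearn.model_selection import train_test_split

# hold back half of the data as a validation set
X1, X2, y1, y2 = train_test_split(X, y, train_size=0.5, random_state=0)

# fit the model on one half, then score it on the unseen half
y2_model = model.fit(X1, y1).predict(X2)
accuracy_score(y2, y2_model)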
Out[5]: 0.90666666666666662
We see here a more reasonable result: the nearest-neighbor classifier is about 90% accurate on data it did not see during training. One disadvantage of using a holdout set, however, is that we have lost a portion of our data to the model training; in the case above, half the dataset does not contribute to the model fit at all. One way to address this is cross-validation: a sequence of fits in which each subset of the data is used both as a training set and as a validation set. In the simplest version, we do two validation trials, alternately using each half of the data as a holdout set. Using the split data from before, we could implement it like this:
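A sketch of these two trials, reusing the X1, X2, y1, y2 names from the holdout sketch above:

# each half serves once as the training set and once as the validation set
y2_model = model.fit(X1, y1).predict(X2)
y1_model = model.fit(X2, y2).predict(X1)
accuracy_score(y1, y1_model), accuracy_score(y2, y2_model)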
What comes out are two accuracy scores, which we could combine (by, say,
taking the mean) to get a better measure of the global model performance. This
particular form of cross-validation is a two-fold cross-validation—that is, one in
which we have split the data into two sets and used each in turn as a validation
set.
We could expand on this idea to use even more trials, and more folds in the
data—for example, here is a visual depiction of five-fold cross-validation:
Here we split the data into five groups, and use each of them in turn to evaluate
the model fit on the other 4/5 of the data. This would be rather tedious to do by
hand, and so we can use Scikit-Learn's cross_val_score convenience routine
to do it succinctly:
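The cell itself is not shown in this extract; a sketch of the call for the classifier and data used above would be:

from sklearn.model_selection import cross_val_score

# five-fold cross-validation: returns one accuracy score per fold
cross_val_score(model, X, y, cv=5)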
Repeating the validation across different subsets of the data gives us an even
better idea of the performance of the algorithm.
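Taken to its extreme, we can hold out a single point in each trial; this is known as leave-one-out cross-validation, and the output below comes from such a run. A sketch (the scores name is reused later in the text):

from sklearn.model_selection import LeaveOneOut

# one trial per sample: train on all points but one, test on the held-out point
scores = cross_val_score(model, X, y, cv=LeaveOneOut())
scores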
Out[8]: array([ 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 1., 1., 1., 0., 1., 0., 1., 1., 1., 1.,
1., 1., 1., 1., 1., 0., 1., 1., 1., 1., 1., 1.,
1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 0., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 0., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 1., 0., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 1., 1., 1., 1., 1.])
Because we have 150 samples, leave-one-out cross-validation yields scores for 150 trials, and each score indicates either a successful (1.0) or an unsuccessful (0.0) prediction. Taking the mean of these gives an estimate of the model's accuracy:
In [9]: scores.mean()
Out[9]: 0.95999999999999996
Other cross-validation schemes can be used similarly; for a description of what is available in Scikit-Learn, see the cross-validation documentation (http://scikit-learn.org/stable/modules/cross_validation.html).
# Selecting the Best Model

Fundamental to choosing a model and its hyperparameters is the bias-variance trade-off. Consider two regression fits to the same simple dataset: a straight line and a high-degree polynomial. It is clear that neither of these models is a particularly good fit to the data, but they fail in different ways.
The model on the left attempts to find a straight-line fit through the data.
Because the data are intrinsically more complicated than a straight line, the
straight-line model will never be able to describe this dataset well. Such a
model is said to underfit the data: that is, it does not have enough model
flexibility to suitably account for all the features in the data; another way of
saying this is that the model has high bias.
The model on the right attempts to fit a high-order polynomial through the
data. Here the model fit has enough flexibility to nearly perfectly account for
the fine features in the data, but even though it very accurately describes the
training data, its precise form seems to be more reflective of the particular
noise properties of the data rather than the intrinsic properties of whatever
process generated that data. Such a model is said to overfit the data: that is, it
has so much model flexibility that the model ends up accounting for random
errors as well as the underlying data distribution; another way of saying this is
that the model has high variance.
To look at this in another light, consider what happens if we use these two
models to predict the y-value for some new data. In the following diagrams, the
red/lighter points indicate data that is omitted from the training set:
From how each of these models performs on the training points versus the omitted points, we can make an observation that holds more generally:

- For high-bias models, the performance of the model on the validation set is similar to the performance on the training set.
- For high-variance models, the performance of the model on the validation set is far worse than the performance on the training set.
A plot of the training score and validation score as a function of model complexity is often called a validation curve, and it has the following essential features:
- The training score is everywhere higher than the validation score. This is generally the case: the model will be a better fit to data it has seen than to data it has not seen.
- For very low model complexity (a high-bias model), the training data is under-fit, which means that the model is a poor predictor both for the training data and for any previously unseen data.
- For very high model complexity (a high-variance model), the training data is over-fit, which means that the model predicts the training data very well, but fails for any previously unseen data.
- For some intermediate value, the validation curve has a maximum. This level of complexity indicates a suitable trade-off between bias and variance.
The means of tuning the model complexity varies from model to model; when
we discuss individual models in depth in later sections, we will see how each
model allows for such tuning.
To make this concrete, in the following we will use a polynomial regression model, in which the degree of the polynomial is a tunable hyperparameter. A degree-1 polynomial fits a straight line to the data; for model parameters $a$ and $b$:

$$y = ax + b$$

A degree-3 polynomial fits a cubic curve to the data; for model parameters $a, b, c, d$:

$$y = ax^3 + bx^2 + cx + d$$
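We can generalize this to a polynomial of any degree. The cell defining the PolynomialRegression model used below is not included in this extract; a minimal sketch, chaining Scikit-Learn's PolynomialFeatures preprocessor with a LinearRegression in a pipeline (the default degree here is an assumption), is:

from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline

def PolynomialRegression(degree=2, **kwargs):
    # polynomial feature expansion followed by an ordinary linear regression
    return make_pipeline(PolynomialFeatures(degree),
                         LinearRegression(**kwargs))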
Now let's create some data to which we will fit our model:
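The make_data helper called next is not defined in this extract; a sketch of a noisy nonlinear data generator consistent with the later plots (the functional form, noise level, and random seed are assumptions) is:

import numpy as np

def make_data(N, err=1.0, rseed=1):
    # sample N points in [0, 1) and apply a nonlinear function plus noise
    rng = np.random.RandomState(rseed)
    X = rng.rand(N, 1) ** 2
    y = 10 - 1. / (X.ravel() + 0.1)
    if err > 0:
        y += err * rng.randn(N)
    return X, y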
X, y = make_data(40)
We can now visualize our data, along with polynomial fits of several degrees:
import matplotlib.pyplot as plt
import numpy as np

# grid of x values over which to plot the model predictions
# (the exact grid used in the original notebook is an assumption here)
X_test = np.linspace(-0.1, 1.1, 500)[:, None]

plt.scatter(X.ravel(), y, color='black')
axis = plt.axis()
for degree in [1, 3, 5]:
    y_test = PolynomialRegression(degree).fit(X, y).predict(X_test)
    plt.plot(X_test.ravel(), y_test, label='degree={0}'.format(degree))
plt.xlim(-0.1, 1.0)
plt.ylim(-2, 12)
plt.legend(loc='best');
The knob controlling model complexity in this case is the degree of the
polynomial, which can be any non-negative integer. A useful question to answer
is this: what degree of polynomial provides a suitable trade-off between bias
(under-fitting) and variance (over-fitting)?
We can make progress in this by visualizing the validation curve for this
particular data and model; this can be done straightforwardly using the
validation_curve convenience routine provided by Scikit-Learn. Given a
model, data, parameter name, and a range to explore, this function will
automatically compute both the training score and validation score across the
range:
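The cell producing the validation-curve plot is not shown here; a sketch for the PolynomialRegression pipeline above (the degree range, the number of folds, and the use of the median score across folds are assumptions) is:

from sklearn.model_selection import validation_curve

degree = np.arange(0, 21)
train_score, val_score = validation_curve(
    PolynomialRegression(), X, y,
    param_name='polynomialfeatures__degree',
    param_range=degree, cv=7)

# plot the median training and validation score at each polynomial degree
plt.plot(degree, np.median(train_score, 1), color='blue', label='training score')
plt.plot(degree, np.median(val_score, 1), color='red', label='validation score')
plt.legend(loc='best')
plt.ylim(0, 1)
plt.xlabel('degree')
plt.ylabel('score');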
This shows precisely the qualitative behavior we expect: the training score is
everywhere higher than the validation score; the training score is monotonically
improving with increased model complexity; and the validation score reaches a
maximum before dropping off as the model becomes over-fit.
From the validation curve, we can read off that the optimal trade-off between bias and variance is found for a third-order polynomial; we can compute and display this fit over the original data as follows:
In [14]: plt.scatter(X.ravel(), y)
lim = plt.axis()
y_test = PolynomialRegression(3).fit(X, y).predict(X_test)
plt.plot(X_test.ravel(), y_test);
plt.axis(lim);
Notice that finding this optimal model did not actually require us to compute
the training score, but examining the relationship between the training score
and validation score can give us useful insight into the performance of the
model.
# Learning Curves
One important aspect of model complexity is that the optimal model will
generally depend on the size of your training data. For example, let's generate a
new dataset with a factor of five more points:
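A sketch, reusing the make_data helper assumed above (the X2, y2 names are likewise assumptions):

# 200 points: five times the size of the original 40-sample dataset
X2, y2 = make_data(200)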
We will duplicate the preceding code to plot the validation curve for this larger
dataset; for reference let's over-plot the previous results as well:
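A sketch of this comparison, reusing the names from the validation-curve sketch above; the fainter dashed styling for the smaller dataset's curves matches the description below:

degree = np.arange(21)
train_score2, val_score2 = validation_curve(
    PolynomialRegression(), X2, y2,
    param_name='polynomialfeatures__degree',
    param_range=degree, cv=7)

# new results as solid lines, previous (smaller-dataset) results as faint dashed lines
plt.plot(degree, np.median(train_score2, 1), color='blue', label='training score')
plt.plot(degree, np.median(val_score2, 1), color='red', label='validation score')
plt.plot(degree, np.median(train_score, 1), color='blue', alpha=0.3, linestyle='dashed')
plt.plot(degree, np.median(val_score, 1), color='red', alpha=0.3, linestyle='dashed')
plt.legend(loc='lower center')
plt.ylim(0, 1)
plt.xlabel('degree')
plt.ylabel('score');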
The solid lines show the new results, while the fainter dashed lines show the
results of the previous smaller dataset. It is clear from the validation curve that
the larger dataset can support a much more complicated model: the peak here
is probably around a degree of 6, but even a degree-20 model is not seriously
over-fitting the data—the validation and training scores remain very close.
Thus we see that the behavior of the validation curve has not one but two
important inputs: the model complexity and the number of training points. It is
often useful to explore the behavior of the model as a function of the number
of training points, which we can do by using increasingly larger subsets of the
data to fit our model. A plot of the training/validation score with respect to the
size of the training set is known as a learning curve.
The general behavior we would expect from a learning curve is this:

- A model of a given complexity will overfit a small dataset: this means the training score will be relatively high, while the validation score will be relatively low.
- A model of a given complexity will underfit a large dataset: this means the training score will decrease, but the validation score will increase.
We can use Scikit-Learn's learning_curve utility to compute and plot learning curves for a low-degree and a high-degree polynomial model on our data; a sketch (the particular degrees and the cross-validation setting are assumptions):

from sklearn.model_selection import learning_curve

fig, ax = plt.subplots(1, 2, figsize=(16, 6))
for i, degree in enumerate([2, 9]):
    # training/validation scores as a function of training-set size
    N, train_lc, val_lc = learning_curve(PolynomialRegression(degree), X, y, cv=7)
    ax[i].plot(N, np.mean(train_lc, 1), label='training score')
    ax[i].plot(N, np.mean(val_lc, 1), label='validation score')
    # dashed line: the score to which the two curves converge
    ax[i].hlines(np.mean([train_lc[-1], val_lc[-1]]), N[0], N[-1],
                 color='gray', linestyle='dashed')
    ax[i].set_ylim(0, 1)
    ax[i].set_xlim(N[0], N[-1])
    ax[i].set_xlabel('training size')
    ax[i].set_ylabel('score')
    ax[i].set_title('degree = {0}'.format(degree), size=14)
    ax[i].legend(loc='best')
The notable feature of the learning curve is that it converges to a particular score as the number of training samples grows; once the curves have converged, adding more training data will not meaningfully improve the fit. The only way to increase the converged score is to use a different (usually more
complicated) model. We see this in the right panel: by moving to a much more
complicated model, we increase the score of convergence (indicated by the
dashed line), but at the expense of higher model variance (indicated by the
difference between the training and validation scores). If we were to add even
more data points, the learning curve for the more complicated model would
eventually converge.
Plotting a learning curve for your particular choice of model and dataset can
help you to make this type of decision about how to move forward in improving
your analysis.
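# Validation in Practice: Grid Search

In practice, models generally have more than one knob to turn, so exploring validation and learning curves by hand quickly becomes tedious. Scikit-Learn automates this search with its GridSearchCV meta-estimator, which evaluates a model over a grid of hyperparameter values using cross-validation. The cell constructing the grid-search object is not included in this extract; a minimal sketch over the polynomial degree and the regression's intercept option (this particular grid and the cv value are assumptions) is:

from sklearn.model_selection import GridSearchCV

param_grid = {'polynomialfeatures__degree': np.arange(21),
              'linearregression__fit_intercept': [True, False]}

grid = GridSearchCV(PolynomialRegression(), param_grid, cv=7)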
Notice that like a normal estimator, this has not yet been applied to any data.
Calling the fit() method will fit the model at each grid point, keeping track of
the scores along the way:
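Continuing the sketch above:

grid.fit(X, y)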
Now that this is fit, we can ask for the best parameters as follows:
In [20]: grid.best_params_
Finally, if we wish, we can use the best model and show the fit to our data using
code from before:
# use the best estimator found by the grid search
model = grid.best_estimator_

plt.scatter(X.ravel(), y)
lim = plt.axis()
y_test = model.fit(X, y).predict(X_test)
plt.plot(X_test.ravel(), y_test);
plt.axis(lim);
The grid search provides many more options, including the ability to specify a
custom scoring function, to parallelize the computations, to do randomized
searches, and more. For information, see the examples in In-Depth: Kernel
Density Estimation (05.13-kernel-density-estimation.html) and Feature
Engineering: Working with Images (05.14-image-features.html), or refer to
Scikit-Learn's grid search documentation (http://Scikit-
Learn.org/stable/modules/grid_search.html).
# Summary
In this section, we have begun to explore the concept of model validation and
hyperparameter optimization, focusing on intuitive aspects of the bias–
variance trade-off and how it comes into play when fitting models to data. In
particular, we found that the use of a validation set or cross-validation
approach is vital when tuning parameters in order to avoid over-fitting for more
complex/flexible models.
In later sections, we will discuss the details of particularly useful models, and
throughout will talk about what tuning is available for these models and how
these free parameters affect model complexity. Keep the lessons of this section
in mind as you read on and learn about these machine learning approaches!