Additional Notes 3 - Forecasting Model Performance
There are many statistical measures that describe how well a model fits a given sample of data.
However, this goodness-of-fit approach often uses residuals and does not really reflect the
capability of the forecasting technique to successfully predict future observations. The user of the
forecasts is very concerned about the accuracy of future forecasts, not model goodness of fit, so it
is important to evaluate this aspect of any recommended technique.
Sometimes forecast accuracy is called out-of-sample forecast error, to distinguish it from the
residuals that arise from a model-fitting process.
NOTE: MAD = √((2/π) · MSE) if the forecast errors are normally distributed.
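As a quick numerical check of this relationship, the sketch below compares MAD with √((2/π) · MSE) using simulated normally distributed errors (no particular data set is assumed):

```python
# Check the MAD-MSE relationship for normally distributed forecast errors.
import numpy as np

rng = np.random.default_rng(42)
e = rng.normal(loc=0.0, scale=2.0, size=100_000)  # simulated forecast errors

mse = np.mean(e**2)                 # mean squared error
mad = np.mean(np.abs(e))            # mean absolute deviation
print(mad, np.sqrt(2.0 / np.pi * mse))   # the two values should be very close
```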
If a time series consists of uncorrelated observations and has constant variance, we say that it is white noise. If, in addition, the observations in this time series are normally distributed, the time series is Gaussian white noise. Ideally, forecast errors are Gaussian white noise.
If a time series is white noise, the distribution of the sample autocorrelation coefficient at lag k in large samples is approximately normal with mean zero and variance 1/T, i.e., r_k ~ N(0, 1/T).
Therefore we could test the hypothesis H0: ρ_k = 0 using the test statistic

$$Z_0 = \frac{r_k}{\sqrt{1/T}} = r_k \sqrt{T}$$
This procedure is a one-at-a-time test; that is, the significance level applies to the autocorrelations
considered individually.
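A minimal sketch of this one-at-a-time test is given below. The series y is a simulated placeholder (replace it with your own data), and the estimator shown is the usual sample autocorrelation coefficient:

```python
# One-at-a-time test of H0: rho_k = 0 using Z0 = r_k * sqrt(T).
import numpy as np

rng = np.random.default_rng(1)
y = rng.normal(size=200)            # placeholder series (white noise here)
T = len(y)

def sample_acf(y, k):
    """Sample autocorrelation coefficient r_k at lag k."""
    ybar = y.mean()
    num = np.sum((y[:-k] - ybar) * (y[k:] - ybar))
    den = np.sum((y - ybar) ** 2)
    return num / den

for k in range(1, 6):
    r_k = sample_acf(y, k)
    z0 = r_k * np.sqrt(T)           # approximately N(0, 1) under H0
    print(f"lag {k}: r_k = {r_k:+.3f}, Z0 = {z0:+.2f}, |Z0| > 1.96? {abs(z0) > 1.96}")
```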
We are often interested in evaluating a set of autocorrelations jointly to determine if they indicate
that the time series is white noise. Box and Pierce (1970) have suggested such a procedure.
Consider $Z_0^2 = T r_k^2$; it is approximately $\chi^2_{(1)}$. The Box-Pierce statistic

$$Q_{BP} = T \sum_{k=1}^{K} r_k^2$$

is distributed approximately as $\chi^2_{(K)}$ under the null hypothesis that the time series is white noise.
When this test statistic is applied to a set of residual autocorrelations, the statistic is $Q_{BP} \sim \chi^2_{(K-p)}$, where p is the number of parameters in the model. Box and Pierce call this procedure a portmanteau or general goodness-of-fit statistic – it is testing the goodness of fit of the autocorrelation function to the autocorrelation function of white noise.
A modification of this test that works better for small samples was devised by Ljung and Box (1978).
The Ljung-Box goodness-of-fit statistic is

$$Q_{LB} = T(T+2) \sum_{k=1}^{K} \left(\frac{1}{T-k}\right) r_k^2$$
The Ljung-Box statistic is very similar to the original Box-Pierce statistic, the difference being that the squared sample autocorrelation at lag k is weighted by (T + 2)/(T − k). For large T, these weights will be approximately unity, and so the Q_LB and Q_BP statistics will be very similar.
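Both portmanteau statistics are available in statsmodels via acorr_ljungbox. The sketch below uses simulated residuals as a placeholder, with model_df set to the number of fitted parameters p (zero here, since nothing was actually fitted):

```python
# Joint (portmanteau) tests of residual autocorrelations.
import numpy as np
from statsmodels.stats.diagnostic import acorr_ljungbox

rng = np.random.default_rng(7)
resid = rng.normal(size=144)        # placeholder residuals (white noise here)

# Q_LB and Q_BP up to K = 12 and K = 24 lags; chi-square df reduced by model_df = p.
result = acorr_ljungbox(resid, lags=[12, 24], boxpierce=True, model_df=0)
print(result)                       # columns: lb_stat, lb_pvalue, bp_stat, bp_pvalue
```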
CHOOSING BETWEEN COMPETING MODELS
Selecting the model that provides the best fit to historical data generally does not result in a
forecasting method that produces the best forecasts of new data. Concentrating too much on
the model that produces the best historical fit often results in overfitting, or including too many
parameters or terms in the model just because these additional terms improve the model fit.
In general, the best approach is to select the model that results in the smallest standard deviation
(or mean squared error) of the one-step-ahead forecast errors when the model is applied to data
that was not used in the fitting process. Some refer to this as an out-of-sample forecast error
standard deviation (or mean squared error). A standard way to measure this out-of-sample
performance is by utilizing some form of data splitting; that is, divide the time series data into
two segments – one for model fitting and the other for performance testing. Sometimes data
splitting is called cross-validation.
It is somewhat arbitrary as to how the data splitting is accomplished. However, a good rule of thumb is to have at least 20 or 25 observations in the performance testing data set.
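A minimal data-splitting sketch follows. The series and the one-step-ahead forecasts are placeholders (any forecasting method can supply them), and the accuracy measures are the usual ones:

```python
# Hold out the last 24 observations and summarize the one-step-ahead forecast errors.
import numpy as np

def forecast_error_summary(actual, forecast):
    """Common out-of-sample accuracy measures for one-step-ahead forecasts."""
    e = actual - forecast
    return {
        "ME":   np.mean(e),                          # mean error (bias)
        "MAD":  np.mean(np.abs(e)),                  # mean absolute deviation
        "MSE":  np.mean(e**2),                       # mean squared error
        "RMSE": np.sqrt(np.mean(e**2)),              # root mean squared error
        "MAPE": 100 * np.mean(np.abs(e / actual)),   # mean absolute percent error
    }

y = np.arange(144, dtype=float)          # placeholder series
train, test = y[:-24], y[-24:]           # fit on `train`, evaluate on `test`
one_step_forecasts = test + 1.0          # placeholder forecasts for illustration
print(forecast_error_summary(test, one_step_forecasts))
```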
The residual mean square is

$$s^2 = \frac{\sum_{t=1}^{T} e_t^2}{T - p}$$

where p is the number of parameters in the fitted model.
R-Squared Statistic
$$R^2 = 1 - \frac{\sum_{t=1}^{T} e_t^2}{\sum_{t=1}^{T} (y_t - \bar{y})^2}$$
Large values of R² suggest a good fit to the historical data. Because the residual sum of squares always decreases when parameters are added to a model, relying on R² to select a forecasting model encourages overfitting, or putting in more parameters than are really necessary to obtain good forecasts. A large value of R² does not ensure that the out-of-sample one-step-ahead forecast errors will be small.
$$R^2_{\text{Adj}} = 1 - \frac{\sum_{t=1}^{T} e_t^2 / (T - p)}{\sum_{t=1}^{T} (y_t - \bar{y})^2 / (T - 1)} = 1 - \frac{s^2}{\sum_{t=1}^{T} (y_t - \bar{y})^2 / (T - 1)}$$
The adjustment is a size adjustment – that is, it adjusts for the number of parameters in the model. Note that a model that maximizes the adjusted R² statistic is also the model that minimizes the residual mean square.
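The sketch below computes s², R², and the adjusted R² from the residuals of a fitted model; y, resid, and p are placeholders for the data, the residuals, and the parameter count:

```python
# Fit statistics from residuals of a model with p parameters.
import numpy as np

def fit_statistics(y, resid, p):
    """Return s^2, R^2, and adjusted R^2."""
    T = len(y)
    sse = np.sum(resid**2)                   # residual sum of squares
    sst = np.sum((y - np.mean(y))**2)        # total sum of squares about the mean
    s2 = sse / (T - p)                       # residual mean square
    r2 = 1.0 - sse / sst                     # R-squared
    r2_adj = 1.0 - s2 / (sst / (T - 1))      # adjusted R-squared
    return s2, r2, r2_adj

# Placeholder example: a "model" that simply uses the series mean (p = 1).
rng = np.random.default_rng(3)
y = rng.normal(loc=100.0, scale=5.0, size=60)
resid = y - y.mean()
print(fit_statistics(y, resid, p=1))
```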
Akaike Information Criterion (AIC)
These two criteria – the AIC and the Schwarz information criterion (SIC) – penalize the sum of squared residuals for including additional parameters in the model. Models that have small values of the AIC or SIC are considered good models.
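The formulas themselves are not reproduced in these notes; one common sum-of-squares formulation (an assumption here, not necessarily the exact form intended) is:

```latex
% One common formulation (assumed; T = sample size, p = number of model parameters):
\mathrm{AIC} = \ln\!\left(\frac{\sum_{t=1}^{T} e_t^{2}}{T}\right) + \frac{2p}{T}
\qquad
\mathrm{SIC} = \ln\!\left(\frac{\sum_{t=1}^{T} e_t^{2}}{T}\right) + \frac{p \ln T}{T}
```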
One way to evaluate model selection criteria is in terms of consistency. A model selection
criterion is consistent if it selects the true model when the true model is among those considered
with probability approaching unity as the sample size becomes large, and if the true model is not
among those considered, it selects the best approximation with probability approaching unity as
the sample size becomes large.
• All of s², the adjusted R² statistic, and the AIC are inconsistent, because they do not penalize for adding parameters heavily enough. Relying on these criteria tends to result in overfitting.
• The SIC, which carries a heavier size adjustment penalty, is consistent.
Consistency, however, does not tell the complete story. It may turn out that the true model and
any reasonable approximation to it are very complex. An asymptotically efficient model selection
criterion chooses a sequence of models as T (the amount of data available) gets large for which
the one-step-ahead forecast error variances approach the one-step-ahead forecast error
variance for the true model at least as fast as any other criterion. The AIC is asymptotically
efficient but the SIC is not.
Remarks:
• Sometimes we see the first term in the AIC, AICC, or SIC written as −2 ln L(β, σ²), where L(β, σ²) is the likelihood function for the fitted model evaluated at the maximum likelihood estimates of the unknown parameters β and σ². In this context, AIC, AICC, and SIC are called penalized likelihood criteria.
• When both AIC and SIC are available, we prefer using SIC. It generally results in smaller, and hence simpler, models, and so its use is consistent with the time-honored model-building principle of parsimony.
• Nevertheless, the best way to evaluate a candidate model's potential predictive
performance is to use data splitting. This will provide a direct estimate of the one-step-ahead
forecast error variance.
ADDITIONAL e-VIDEO RESOURCES:
Forecasting (7): Forecast accuracy measures (MSE, RMSE, MAD & MAPE) (youtube.com)
https://www.youtube.com/watch?v=0vtRKLVNhQ8
HANDS-ON EXERCISE
Using the airline.csv data, assess the forecast model performance of the best-fitting Holt-Winters
(multiplicative seasonals) model. *See Chapter 3 slide deck, last example.
RECALL: The airline.csv data contains the number of international passenger bookings (in thousands) per month on an airline (Pan Am) in the United States, obtained from the Federal Aviation Administration for the period 1949–1960 (Brown, 1963).
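A possible starting point for this exercise is sketched below, assuming airline.csv contains a single monthly series in a column named "passengers" (the file layout and column name are assumptions). It fits a Holt-Winters model with multiplicative seasonality to a training split and summarizes the hold-out forecast errors; a rolling one-step-ahead evaluation could be substituted for the multi-step forecast shown:

```python
# Out-of-sample evaluation of a Holt-Winters (multiplicative seasonality) model.
import numpy as np
import pandas as pd
from statsmodels.tsa.holtwinters import ExponentialSmoothing

data = pd.read_csv("airline.csv")
y = data["passengers"].astype(float).values       # monthly bookings, 1949-1960 (column name assumed)

# Data splitting: fit on all but the last 24 months, test on the final 24.
train, test = y[:-24], y[-24:]

# Holt-Winters with multiplicative seasonality (additive trend assumed here).
model = ExponentialSmoothing(train, trend="add", seasonal="mul",
                             seasonal_periods=12).fit()
forecasts = model.forecast(24)                     # forecasts for the hold-out period

errors = test - forecasts
print("RMSE:", np.sqrt(np.mean(errors**2)))
print("MAD: ", np.mean(np.abs(errors)))
print("MAPE:", 100 * np.mean(np.abs(errors / test)))
```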