0% found this document useful (0 votes)

21 views11 pages

Time Arima 002

Uploaded by

natthaweeilac

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views11 pages

Time Arima 002

Uploaded by

natthaweeilac

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 11

Certainly!

Here’s an extended version of the previous guide that includes steps for building and
forecasting with an ARIMA model.

### Steps to Make Time-Series Data Stationary and Forecast using ARIMA

1. Load the Data

2. **Preprocessing**
3. **Exploratory Data Analysis (EDA)**
4. **Time-Series Decomposition**
5. **Stationarity Testing**
6. **Differencing and Transformations**
7. **Autocorrelation and Partial Autocorrelation**
8. **Model Building and Forecasting using ARIMA**

Let's go through each step in detail.

### 1. Load the Data

Start by loading the Microsoft stock price data from a CSV file.

```python
import pandas as pd

# Load the data

df = pd.read_csv('MSFT_2015_2021.csv', parse_dates=['Date'], index_col='Date')

# Display the first few rows of the dataset

print(df.head())
```
### 2. Preprocessing

Handle missing values and ensure the data is sorted by date.

```python
# Check for missing values
print(df.isnull().sum())

# Sort the data by date

df = df.sort_index()
```

### 3. Exploratory Data Analysis (EDA)

Perform EDA to understand the data better.

```python
import matplotlib.pyplot as plt

# Plot the closing prices

plt.figure(figsize=(10, 6))
plt.plot(df['Close'], label='Close Price')
plt.title('Microsoft Stock Price (2015-2021)')
plt.xlabel('Date')
plt.ylabel('Close Price')
plt.legend()
plt.show()

# Calculate moving averages

df['MA20'] = df['Close'].rolling(window=20).mean()
df['MA50'] = df['Close'].rolling(window=50).mean()

# Plot moving averages

plt.figure(figsize=(10, 6))
plt.plot(df['Close'], label='Close Price')
plt.plot(df['MA20'], label='20-Day MA')
plt.plot(df['MA50'], label='50-Day MA')
plt.title('Microsoft Stock Price with Moving Averages')
plt.xlabel('Date')
plt.ylabel('Price')
plt.legend()
plt.show()
```

### 4. Time-Series Decomposition

Decompose the time series to observe the trend, seasonality, and residuals.

```python
from statsmodels.tsa.seasonal import seasonal_decompose

# Decompose the time series

decomposition = seasonal_decompose(df['Close'], model='multiplicative')

# Plot the decomposed components

plt.figure(figsize=(12, 8))
decomposition.plot()
plt.show()
```
### 5. Stationarity Testing

Check if the time series is stationary using the Augmented Dickey-Fuller (ADF) test.

```python
from statsmodels.tsa.stattools import adfuller

# Perform the ADF test

result = adfuller(df['Close'])
print('ADF Statistic:', result[0])
print('p-value:', result[1])
for key, value in result[4].items():
print('Critial Values:')
print(f' {key}, {value}')
```

### 6. Differencing and Transformations

If the series is not stationary, apply differencing and transformations.

```python
# Apply log transformation to stabilize variance
df['Price_log'] = np.log(df['Close'])

# Apply first-order differencing to remove trend

df['Price_log_diff'] = df['Price_log'].diff()

# Plot the differenced series

plt.figure(figsize=(10, 6))
plt.plot(df['Price_log_diff'], label='Log Transformed and Differenced Series')
plt.title('Log Transformed and Differenced Series')
plt.xlabel('Date')
plt.ylabel('Log Price Difference')
plt.legend()
plt.show()

# Perform the ADF test on the differenced series

result = adfuller(df['Price_log_diff'].dropna())
print('ADF Statistic after differencing:', result[0])
print('p-value after differencing:', result[1])
for key, value in result[4].items():
print(f'Critical Value ({key}): {value}')
```

### 7. Autocorrelation and Partial Autocorrelation

Plot ACF and PACF to identify the order of ARIMA models.

```python
from statsmodels.graphics.tsaplots import plot_acf, plot_pacf

# Plot ACF and PACF

plt.figure(figsize=(12, 6))
plt.subplot(121)
plot_acf(df['Price_log_diff'].dropna(), ax=plt.gca(), lags=30)
plt.subplot(122)
plot_pacf(df['Price_log_diff'].dropna(), ax=plt.gca(), lags=30)
plt.show()
```
### 8. Model Building and Forecasting using ARIMA

Build ARIMA models based on the ACF and PACF plots and use them for forecasting.

```python
from statsmodels.tsa.arima.model import ARIMA

# Determine the order of ARIMA(p, d, q) based on ACF and PACF plots

p = 1 # Example value
d = 1 # Example value
q = 1 # Example value

# Build and fit the ARIMA model

model = ARIMA(df['Price_log'].dropna(), order=(p, d, q))
fit_model = model.fit()

# Print model summary

print(fit_model.summary())

# Make predictions
start = len(df)
end = len(df) + 30 # Forecast for the next 30 days
pred = fit_model.get_forecast(steps=30)
pred_ci = pred.conf_int()

# Plot the results

plt.figure(figsize=(12, 6))
plt.plot(df['Close'], label='Actual')
plt.plot(np.exp(pred.predicted_mean), label='Forecast')
plt.fill_between(pred_ci.index, np.exp(pred_ci.iloc[:, 0]), np.exp(pred_ci.iloc[:, 1]), color='k',
alpha=0.1)
plt.title('Microsoft Stock Price Forecast')
plt.xlabel('Date')
plt.ylabel('Price')
plt.legend()
plt.show()
```

### Conclusion
This end-to-end guide covers loading, preprocessing, EDA, time-series decomposition,
stationarity testing, differencing, transformations, and model building with ARIMA for
forecasting Microsoft stock price data from 2015-2021.
### Why ARIMA Cannot Support Non-Stationary Time-Series Data

**ARIMA** stands for AutoRegressive Integrated Moving Average. The ARIMA model is a
widely used tool in time series forecasting that requires the data to be stationary. Let’s delve into
the reasons why ARIMA cannot support non-stationary time-series data and why stationarity is
crucial for ARIMA models.

#### 1. Model Assumptions

ARIMA models are based on several key assumptions, one of the most critical being stationarity.
Stationarity means that the statistical properties of the time series—such as mean, variance, and
autocorrelation—are constant over time. Here’s why this assumption is important:

1. Consistency in Parameters: For ARIMA models to make accurate predictions, the

parameters estimated from the historical data need to remain consistent over time. Non-
stationary data, which has trends or varying mean and variance, would violate this assumption,
leading to unreliable predictions.

2. Predictability: Stationary data is more predictable because it follows a consistent pattern.

Non-stationary data can have unpredictable changes, making it difficult for the ARIMA model to
identify a stable structure to base its predictions on.

#### 2. Mathematical Basis

The ARIMA model combines three components:

- **AutoRegressive (AR)**: Relies on past values to predict future values.
- **Integrated (I)**: Involves differencing the data to make it stationary.
- **Moving Average (MA)**: Relies on past forecast errors to predict future values.

- Autoregressive Component: The AR part assumes a linear relationship between an

observation and a specified number of lagged observations. For non-stationary data, this linear
relationship would not hold consistently.
- **Moving Average Component**: The MA part assumes that forecast errors are a linear
combination of past forecast errors. For non-stationary data, the patterns in the errors would not
be consistent, violating this assumption.
- **Integration Component**: The "I" in ARIMA represents differencing the data to achieve
stationarity. This step transforms non-stationary data into stationary data so that AR and MA
models can be effectively applied.

#### 3. Statistical Properties

For the statistical properties of ARIMA to hold, the data must be stationary:
- **Constant Mean**: The mean of the series should not be a function of time.
- **Constant Variance**: The variance of the series should not change over time.
- **Autocorrelation**: The autocorrelation structure should be constant over time.

When the time series is non-stationary, these properties are not constant, making it impossible for
ARIMA to model the data accurately.

#### 4. Example and Solution

**Non-Stationary Data Example**: Suppose you have a time series representing stock prices,
which typically show trends (upwards or downwards). Such data is non-stationary because the
mean and variance change over time.

**Solution**:
1. **Differencing**: Apply differencing to remove the trend and achieve stationarity.
2. **Transformation**: Use log transformation or other techniques to stabilize the variance.

```python
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from statsmodels.tsa.arima.model import ARIMA
from statsmodels.tsa.stattools import adfuller

# Load the data

df = pd.read_csv('MSFT_2015_2021.csv', parse_dates=['Date'], index_col='Date')

# Log transformation to stabilize variance

df['Price_log'] = np.log(df['Close'])

# First-order differencing to remove trend

df['Price_log_diff'] = df['Price_log'].diff().dropna()

# Check for stationarity

result = adfuller(df['Price_log_diff'])
print('ADF Statistic:', result[0])
print('p-value:', result[1])

# Build and fit the ARIMA model

model = ARIMA(df['Price_log_diff'], order=(1, 0, 1))
fit_model = model.fit()
print(fit_model.summary())
```

### Conclusion

- **Stationarity is Essential**: The ARIMA model assumes that the underlying data is stationary.
Non-stationary data violates this assumption, leading to unreliable model parameters and
predictions.
- **Differencing and Transformation**: To use ARIMA on non-stationary data, it is necessary to
apply differencing and possibly transformations to achieve stationarity.
- **Mathematical and Statistical Reasons**: The reliance on past values and errors, and the need
for constant statistical properties, make stationarity a prerequisite for ARIMA models.

LWSTSM51
No ratings yet
LWSTSM51
284 pages
Sarima Group 11
No ratings yet
Sarima Group 11
21 pages
HW2368-Chapter3
No ratings yet
HW2368-Chapter3
18 pages
Autoregressive Conditional Heteroskedasticity ARCH Family of Estimators
No ratings yet
Autoregressive Conditional Heteroskedasticity ARCH Family of Estimators
33 pages
Univariate Time Series Modelling and Forecasting: Introductory Econometrics For Finance' © Chris Brooks 2002 1
No ratings yet
Univariate Time Series Modelling and Forecasting: Introductory Econometrics For Finance' © Chris Brooks 2002 1
62 pages
Chapter 12 Part 2 - Arima Model Estimation - 2023
No ratings yet
Chapter 12 Part 2 - Arima Model Estimation - 2023
15 pages
ARIMA Model Python Example - Time Series Forecasting
No ratings yet
ARIMA Model Python Example - Time Series Forecasting
11 pages
ARDL Models-Bounds Testing
88% (8)
ARDL Models-Bounds Testing
17 pages
Module 3.1 Time Series Forecasting ARIMA Model
No ratings yet
Module 3.1 Time Series Forecasting ARIMA Model
19 pages
Assignment
100% (1)
Assignment
3 pages
Forecasting PDF
No ratings yet
Forecasting PDF
101 pages
Arima Model
No ratings yet
Arima Model
4 pages
Durbin Watson Tabel (Anwar)
No ratings yet
Durbin Watson Tabel (Anwar)
112 pages
Stata Time Series Reference Manual
No ratings yet
Stata Time Series Reference Manual
921 pages
Applied_Data_Science-MODULE-5-SEM8
No ratings yet
Applied_Data_Science-MODULE-5-SEM8
53 pages
MIS410-Chapter7
No ratings yet
MIS410-Chapter7
49 pages
Lect
No ratings yet
Lect
96 pages
Be A 65 Ads Exp 8
No ratings yet
Be A 65 Ads Exp 8
10 pages
AP SHAH ADS Notes Smote
No ratings yet
AP SHAH ADS Notes Smote
52 pages
Box-Jenkins Method: Time Series Analysis: Forecasting and Control
100% (1)
Box-Jenkins Method: Time Series Analysis: Forecasting and Control
4 pages
Regression With Time Series Data: Undergraduate Econometrics, 2 Edition-Chapter 16
100% (1)
Regression With Time Series Data: Undergraduate Econometrics, 2 Edition-Chapter 16
33 pages
Forecasting Session 2.2 2024
No ratings yet
Forecasting Session 2.2 2024
44 pages
Time Series Forecasting
No ratings yet
Time Series Forecasting
29 pages
Forecasting Session 2.0 2024
No ratings yet
Forecasting Session 2.0 2024
29 pages
Stock Price Prediction: By: Aarushi Sunderrajan (S0 Paridhi Deval (S0 Pranjal Gupta (S059)
No ratings yet
Stock Price Prediction: By: Aarushi Sunderrajan (S0 Paridhi Deval (S0 Pranjal Gupta (S059)
34 pages
Finance Workshop Honda
No ratings yet
Finance Workshop Honda
30 pages
Arima Time Series Stock Prediction
No ratings yet
Arima Time Series Stock Prediction
23 pages
Chapter - ARIMA Models For Time Series Data
No ratings yet
Chapter - ARIMA Models For Time Series Data
44 pages
Wipro
No ratings yet
Wipro
21 pages
Predicting S&P500 Prices Using ARIMA (LinkedIn - Ivan Hung)
No ratings yet
Predicting S&P500 Prices Using ARIMA (LinkedIn - Ivan Hung)
15 pages
The Relative Performance of VAR and VECM Model: Xzhang@business - Queensu.ca
No ratings yet
The Relative Performance of VAR and VECM Model: Xzhang@business - Queensu.ca
4 pages
00 Time Series Analysis_ Complete Study Guide
No ratings yet
00 Time Series Analysis_ Complete Study Guide
26 pages
Arima Modeling With R Listendata
No ratings yet
Arima Modeling With R Listendata
12 pages
Sta457 Week 2 Notes
No ratings yet
Sta457 Week 2 Notes
18 pages
Time Series Analysis Handbook 03
No ratings yet
Time Series Analysis Handbook 03
12 pages
Arima
No ratings yet
Arima
12 pages
Arma Model
No ratings yet
Arma Model
13 pages
Ema Theory Trading
No ratings yet
Ema Theory Trading
16 pages
Stata Ts Introduction To Time-Series Commands
100% (1)
Stata Ts Introduction To Time-Series Commands
6 pages
Report
No ratings yet
Report
16 pages
University of Zimbabwe: Authorized Materials: Calculator
No ratings yet
University of Zimbabwe: Authorized Materials: Calculator
11 pages
Project Documentation Doc_2
No ratings yet
Project Documentation Doc_2
9 pages
Managerial Economics in A Global Economy: Bab 5 Peramalan Permintaan (Demand Forecasting
No ratings yet
Managerial Economics in A Global Economy: Bab 5 Peramalan Permintaan (Demand Forecasting
14 pages
CH 16
100% (1)
CH 16
54 pages
Arima Word
No ratings yet
Arima Word
13 pages
Forecasting Economic Indicators Using Time Series Analysis
No ratings yet
Forecasting Economic Indicators Using Time Series Analysis
4 pages
Part Ii - Time Series Analysis: C5 ARIMA (Box-Jenkins) Models
No ratings yet
Part Ii - Time Series Analysis: C5 ARIMA (Box-Jenkins) Models
14 pages
Business Forecast Vishay Sood
No ratings yet
Business Forecast Vishay Sood
8 pages
Arima
No ratings yet
Arima
13 pages
TIME - ChatGPT Manual 001
No ratings yet
TIME - ChatGPT Manual 001
7 pages
Midterm Sample ADM 3301
100% (1)
Midterm Sample ADM 3301
19 pages
Auto-Regressive Integrated Moving Average Models I
No ratings yet
Auto-Regressive Integrated Moving Average Models I
12 pages
BTMMS1 2 Nisa Nadiah Binti Mohd Shaifolazham B062010240 Forecasting Exercise PDF
No ratings yet
BTMMS1 2 Nisa Nadiah Binti Mohd Shaifolazham B062010240 Forecasting Exercise PDF
10 pages
Dav 4
No ratings yet
Dav 4
6 pages
Econometrics I: Chapter 3: Two Variable Regression Model: The Problem of Estimation
No ratings yet
Econometrics I: Chapter 3: Two Variable Regression Model: The Problem of Estimation
35 pages
Arima Modelling by Ankit Bhandari
No ratings yet
Arima Modelling by Ankit Bhandari
6 pages
Business analytis C4
No ratings yet
Business analytis C4
10 pages
Group 11 PRAN Project
No ratings yet
Group 11 PRAN Project
11 pages
Statistics Project SEM1 Notes
No ratings yet
Statistics Project SEM1 Notes
5 pages
Assignment 1 Supplementary
No ratings yet
Assignment 1 Supplementary
5 pages
06-time-series-analysis
No ratings yet
06-time-series-analysis
9 pages
Algorithm ARMA
No ratings yet
Algorithm ARMA
1 page
Pigeon Pea Arima
No ratings yet
Pigeon Pea Arima
6 pages
Stationarity & AR, MA, ARIMA, SARIMA
100% (1)
Stationarity & AR, MA, ARIMA, SARIMA
6 pages
Comparison of Trend Forecast Using ARIMA and ETS Models For S&P500 Close Price
No ratings yet
Comparison of Trend Forecast Using ARIMA and ETS Models For S&P500 Close Price
4 pages
Practical Questions-Week 2 With Solution
No ratings yet
Practical Questions-Week 2 With Solution
6 pages
Arima
No ratings yet
Arima
2 pages
Code File
No ratings yet
Code File
4 pages
Autoregressive Integrated Moving Average
No ratings yet
Autoregressive Integrated Moving Average
3 pages
Time Series Practice P5
No ratings yet
Time Series Practice P5
4 pages
Theory Unit 4 TH
No ratings yet
Theory Unit 4 TH
2 pages
Time Series Analysis: C5 ARIMA (Box-Jenkins) Models
No ratings yet
Time Series Analysis: C5 ARIMA (Box-Jenkins) Models
14 pages
Module 4_ Time Series Analysis
No ratings yet
Module 4_ Time Series Analysis
6 pages
Period Data Naive Forecasts Abs. Error Error Abs. % Error
No ratings yet
Period Data Naive Forecasts Abs. Error Error Abs. % Error
3 pages
Autoregressive Integrated Moving Average
No ratings yet
Autoregressive Integrated Moving Average
2 pages
Lec 6
No ratings yet
Lec 6
1 page
Testing Endogeneity
No ratings yet
Testing Endogeneity
3 pages
Assigment-17
No ratings yet
Assigment-17
2 pages
Time Series Models. AR, MA, ARMA, ARIMA _ by Charanraj Shetty _ Towards Data Science
No ratings yet
Time Series Models. AR, MA, ARMA, ARIMA _ by Charanraj Shetty _ Towards Data Science
3 pages
Practice Problems: Chapter 4, Forecasting: Problem 1
No ratings yet
Practice Problems: Chapter 4, Forecasting: Problem 1
10 pages
ARIMA
No ratings yet
ARIMA
3 pages
ARIMA Modelling and Forecasting: by Shipra Mishra Intern
No ratings yet
ARIMA Modelling and Forecasting: by Shipra Mishra Intern
17 pages
Arima 1b
No ratings yet
Arima 1b
6 pages
School of Economics, Finance and Banking College of Business Beeq5113 Applied Econometrics SECOND SEMESTER 2017/2018 Exercise 3
No ratings yet
School of Economics, Finance and Banking College of Business Beeq5113 Applied Econometrics SECOND SEMESTER 2017/2018 Exercise 3
1 page
Class Notes
No ratings yet
Class Notes
6 pages
Arma Arima
No ratings yet
Arma Arima
10 pages
Stock Price Prediction Using ARIMA Model by Dereje Workneh Medium
No ratings yet
Stock Price Prediction Using ARIMA Model by Dereje Workneh Medium
1 page
Angular Portfolio App Development: Create your personal brand
From Everand
Angular Portfolio App Development: Create your personal brand
Abdelfattah Ragab
No ratings yet
"C Programming for Beginners: A Step-by-Step Guide"
From Everand
"C Programming for Beginners: A Step-by-Step Guide"
Lov kush
No ratings yet
Backtrader Essentials: Building Successful Strategies with Python
From Everand
Backtrader Essentials: Building Successful Strategies with Python
Ali AZARY
No ratings yet

Time Arima 002

Uploaded by

Time Arima 002

Uploaded by

Certainly!

1. **Load the Data**

Let's go through each step in detail.

### 1. Load the Data

# Load the data

# Display the first few rows of the dataset

Handle missing values and ensure the data is sorted by date.

# Sort the data by date

### 3. Exploratory Data Analysis (EDA)

Perform EDA to understand the data better.

# Plot the closing prices

# Calculate moving averages

# Plot moving averages

### 4. Time-Series Decomposition

# Decompose the time series

# Plot the decomposed components

# Perform the ADF test

### 6. Differencing and Transformations

If the series is not stationary, apply differencing and transformations.

# Apply first-order differencing to remove trend

# Plot the differenced series

# Perform the ADF test on the differenced series

### 7. Autocorrelation and Partial Autocorrelation

Plot ACF and PACF to identify the order of ARIMA models.

# Plot ACF and PACF

# Determine the order of ARIMA(p, d, q) based on ACF and PACF plots

# Build and fit the ARIMA model

# Print model summary

# Plot the results

#### 1. **Model Assumptions**

1. **Consistency in Parameters**: For ARIMA models to make accurate predictions, the

2. **Predictability**: Stationary data is more predictable because it follows a consistent pattern.

#### 2. **Mathematical Basis**

The ARIMA model combines three components:

- **Autoregressive Component**: The AR part assumes a linear relationship between an

#### 3. **Statistical Properties**

#### 4. **Example and Solution**

# Load the data

# Log transformation to stabilize variance

# First-order differencing to remove trend

# Check for stationarity

# Build and fit the ARIMA model

You might also like

1. Load the Data

#### 1. Model Assumptions

1. Consistency in Parameters: For ARIMA models to make accurate predictions, the

2. Predictability: Stationary data is more predictable because it follows a consistent pattern.

#### 2. Mathematical Basis

- Autoregressive Component: The AR part assumes a linear relationship between an

#### 3. Statistical Properties

#### 4. Example and Solution