Time Series Analysis

This document applies time series analysis techniques to airline passenger data. It loads and explores the data, plots the trend and seasonality, and decomposes the series. It then tests for stationarity, differences the data to make it stationary, and tests for stationarity again.

Uploaded by

Vardan

time-series-analysis

December 26, 2023

[1]: import pandas as pd

import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

from statsmodels.tsa.seasonal import seasonal_decompose

[2]: df = pd.read_csv('/kaggle/input/air-passengers/AirPassengers.csv')
df.head()

[2]: Month #Passengers

0 1949-01 112
1 1949-02 118
2 1949-03 132
3 1949-04 129
4 1949-05 121

[6]: df.rename(columns={"#Passengers": "Passengers"}, inplace=True)

df.head()

[6]: Month Passengers

0 1949-01 112
1 1949-02 118
2 1949-03 132
3 1949-04 129
4 1949-05 121

[7]: df.shape

[7]: (144, 2)

[9]: df['Month'] = pd.to_datetime(df.Month)

df = df.set_index(df.Month)

[10]: df.drop('Month', axis = 1, inplace = True)

print('Column datatypes= \n',df.dtypes)

Column datatypes=
Passengers int64
dtype: object

[11]: df

[11]: Passengers
Month
1949-01-01 112
1949-02-01 118
1949-03-01 132
1949-04-01 129
1949-05-01 121
… …
1960-08-01 606
1960-09-01 508
1960-10-01 461
1960-11-01 390
1960-12-01 432

[144 rows x 1 columns]
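The index preparation in cells [9] and [10] can also be folded into the load step. The following is a minimal sketch, assuming the CSV has the same `Month` and `#Passengers` columns shown above. The inline sample data and the helper name `load_passengers` are illustrative and not part of the original notebook.

```python
import io
import pandas as pd

# Illustrative stand-in for AirPassengers.csv (first rows as shown above).
SAMPLE_CSV = """Month,#Passengers
1949-01,112
1949-02,118
1949-03,132
"""

def load_passengers(source):
    """Read the CSV, parse Month as dates, and use it as the index."""
    df = pd.read_csv(source, parse_dates=["Month"], index_col="Month")
    # Drop the '#' so the column is easier to reference.
    return df.rename(columns={"#Passengers": "Passengers"})

df_sample = load_passengers(io.StringIO(SAMPLE_CSV))
print(df_sample.dtypes)
```

Passing a file path instead of the `StringIO` object should work the same way, since `pd.read_csv` accepts both.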

1 Time Series Characteristics

1.1 Trend
[14]: plt.figure(figsize= (10,6))
plt.plot(df, color="blue")
plt.xlabel('Years')
plt.ylabel('No of Air Passengers')
plt.title('Trend of the Time Series')

[14]: Text(0.5, 1.0, 'Trend of the Time Series')

1.2 Seasonality
[15]: # To plot the seasonality, create a temporary dataframe and add columns
# for the Month and Year values

df_temp = df.copy()
df_temp['Year'] = pd.DatetimeIndex(df_temp.index).year
df_temp['Month'] = pd.DatetimeIndex(df_temp.index).month

[16]: # Stacked line plot

plt.figure(figsize=(10,10))
plt.title('Seasonality of the Time Series')
sns.pointplot(x='Month',y='Passengers',hue='Year',data=df_temp)

[16]: <Axes: title={'center': 'Seasonality of the Time Series'}, xlabel='Month', ylabel='Passengers'>
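The same Year and Month columns can also feed a tabular view of the seasonality, with one row per year and one column per calendar month. This is a sketch on a synthetic series, since the Kaggle file is not available here. The names `demo` and `by_month` are illustrative.

```python
import numpy as np
import pandas as pd

# Synthetic monthly series standing in for the passenger data (assumption).
idx = pd.date_range("1949-01-01", periods=24, freq="MS")
demo = pd.DataFrame({"Passengers": np.arange(100, 124)}, index=idx)

# Same helper columns as in cell [15].
demo["Year"] = demo.index.year
demo["Month"] = demo.index.month

# One row per year, one column per calendar month.
by_month = demo.pivot_table(index="Year", columns="Month", values="Passengers")
print(by_month)
```

Reading down a column compares the same month across years, while reading across a row shows the within-year pattern.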

1.3 Decomposition of Time Series
[20]: # Decompose the time series into trend, seasonality, and residuals
decomposition = seasonal_decompose(df['Passengers'], model='additive')

[21]: fig = decomposition.plot()

2 Time Series Analysis
2.1 Check for Stationarity
[34]: from statsmodels.tsa.stattools import adfuller
timeseries = df['Passengers']

def stationarity_test(timeseries):
    # Rolling statistics over a 12-month window
    rolling_mean = timeseries.rolling(window=12).mean()
    rolling_std = timeseries.rolling(window=12).std()

    # Plot rolling statistics
    plt.figure(figsize=(10, 6))
    plt.xlabel('Years')
    plt.ylabel('No of Air Passengers')
    plt.title('Stationary Test: Rolling Mean and Standard Deviation')
    plt.plot(timeseries, color='blue', label='Original')
    plt.plot(rolling_mean, color='green', label='Rolling Mean')
    plt.plot(rolling_std, color='red', label='Rolling Std')
    plt.legend()
    plt.show()

    # Dickey-Fuller test
    print('Results of Dickey-Fuller Test')
    df_test = adfuller(timeseries)
    df_output = pd.Series(df_test[0:4],
                          index=['Test Statistic', 'p-value', '#Lags Used',
                                 'Number of Observations Used'])
    for key, value in df_test[4].items():
        df_output['Critical Value (%s)' % key] = value
    print(df_output)

    return rolling_mean, rolling_std

# Call the stationarity_test function with your time series
rolling_mean, rolling_std = stationarity_test(timeseries)

Results of Dickey-Fuller Test

Test Statistic 0.815369
p-value 0.991880
#Lags Used 13.000000
Number of Observations Used 130.000000
Critical Value (1%) -3.481682
Critical Value (5%) -2.884042
Critical Value (10%) -2.578770
dtype: float64
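The null hypothesis of the Dickey-Fuller test is that the series has a unit root, that is, it is non-stationary. A small p-value, or a test statistic below a critical value, counts as evidence against that. The helper below is a sketch of how the printed numbers might be read. The function name `interpret_adf` and the default `alpha=0.05` are assumptions, not part of the notebook.

```python
def interpret_adf(test_statistic, p_value, critical_values, alpha=0.05):
    """Summarise an ADF result. Null hypothesis: the series has a unit root."""
    # Significance levels at which the statistic falls below the critical value.
    rejected_at = [level for level, crit in critical_values.items()
                   if test_statistic < crit]
    return {
        "reject_unit_root": p_value < alpha,
        "rejected_at_levels": rejected_at,
    }

# Values copied from the output above.
summary = interpret_adf(
    0.815369,
    0.991880,
    {"1%": -3.481682, "5%": -2.884042, "10%": -2.578770},
)
print(summary)
```

For the original series the null is not rejected at any listed level, which is consistent with treating the data as non-stationary.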

3 Convert Non-Stationary Data to Stationary Data

3.1 Differencing
[44]: from statsmodels.tsa.stattools import adfuller

df_diff = df.diff(periods=1) # First-order differencing

# Plot differenced time series

plt.xlabel('Years')
plt.ylabel('No of Air Passengers')
plt.title('Convert Non-Stationary Data to Stationary Data using Differencing')
plt.plot(df_diff)

[44]: [<matplotlib.lines.Line2D at 0x7d9301531900>,
<matplotlib.lines.Line2D at 0x7d93013799f0>]

[46]: # Drop NA values
df_diff.dropna(inplace=True)

# Perform the Dickey-Fuller test on a specific column, e.g., 'Passengers'
column_name = 'Passengers'
stationarity_test(df_diff[column_name])

Results of Dickey-Fuller Test

Test Statistic -2.833426
p-value 0.053655
#Lags Used 12.000000
Number of Observations Used 129.000000
Critical Value (1%) -3.482088
Critical Value (5%) -2.884219
Critical Value (10%) -2.578864
dtype: float64

[46]: (Month
1949-03-01 NaN
1949-04-01 NaN
1949-05-01 NaN
1949-06-01 NaN
1949-07-01 NaN
…
1960-08-01 3.916667
1960-09-01 3.750000
1960-10-01 4.500000
1960-11-01 2.333333
1960-12-01 2.250000
Name: Passengers, Length: 142, dtype: float64,
Month
1949-03-01 NaN
1949-04-01 NaN
1949-05-01 NaN
1949-06-01 NaN
1949-07-01 NaN
…
1960-08-01 53.364030
1960-09-01 53.706483
1960-10-01 52.852281
1960-11-01 55.531045
1960-12-01 55.465182
Name: Passengers, Length: 142, dtype: float64)
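After first-order differencing the p-value (0.053655) is still slightly above 0.05, so the unit-root null is rejected at the 10% level but not at 5%. One common next step, sketched here on synthetic data rather than the original file, is to take logs before differencing and then add a 12-month seasonal difference. The series `demo` and the choice of these particular transforms are assumptions, not something the notebook itself does.

```python
import numpy as np
import pandas as pd

# Synthetic monthly series with growth and a 12-month pattern (assumption).
idx = pd.date_range("1949-01-01", periods=48, freq="MS")
t = np.arange(48)
demo = pd.Series(
    np.exp(0.01 * t) * (100 + 10 * np.sin(2 * np.pi * t / 12)),
    index=idx,
)

log_demo = np.log(demo)                              # dampen growing swings
log_diff = log_demo.diff(periods=1)                  # remove the trend
seasonal_diff = log_diff.diff(periods=12).dropna()   # remove the yearly cycle

print(seasonal_diff.abs().max())
```

On real data the result will not be exactly flat as it is here. The transformed series can be passed to `stationarity_test` to see whether the test statistic improves.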
