0% found this document useful (0 votes)

62 views

Magaama

This document discusses pairs trading and statistical arbitrage. It begins by defining cointegration and explaining how cointegrated stock prices can be exploited for trading. Specifically, pairs trading involves shorting the overvalued stock and buying the undervalued stock in a pair when their spread diverges too far from the mean, and then unwinding the position when the spread returns to its mean. Methods for selecting stock pairs, testing for cointegration, and determining optimal thresholds for entering and exiting trades are also examined.

Uploaded by

samadozous

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

62 views

Magaama

Uploaded by

samadozous

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 63

Pairs Trading

Prof. Daniel P. Palomar

MAFS5310 - Portfolio Optimization with R

MSc in Financial Mathematics
The Hong Kong University of Science and Technology (HKUST)
Fall 2020-21
Outline
1 Cointegration
2 Basic Idea of Pairs Trading
3 Design of Pairs Trading
Pairs selection
Cointegration test
Optimum threshold
4 LS Regression and Kalman for Pairs Trading
5 From Pairs Trading to Statistical Arbitrage (StatArb)∗
VECM
Optimization of mean-reverting portfolio (MRP)
6 Summary
Outline
1 Cointegration
2 Basic Idea of Pairs Trading
3 Design of Pairs Trading
Pairs selection
Cointegration test
Optimum threshold
4 LS Regression and Kalman for Pairs Trading
5 From Pairs Trading to Statistical Arbitrage (StatArb)∗
VECM
Optimization of mean-reverting portfolio (MRP)
6 Summary
Cointegration
Cointegration is a very interesting property that can be exploited in finance for trading.
Idea: While it may be diﬀicult to predict individual stocks, it may be easier to predict
relative behavior of stocks.
Illustrative example: A drunk man is wandering the streets (random walk) with a dog.
Both paths of man and dog are nonstationary and diﬀicult to predict, but the distance
between them is mean-reverting and stationary.

D. Palomar (HKUST) Pairs Trading 4 / 63

Correlation vs. cointegration

Everybody is familiar with the concept of correlation between two random variables:
correlation is high when they co-move
correlation is zero when they move independently
So what is cointegration?
cointegration is high when two quantities move together or remain close to each other
cointegration is inexistent if the two quantities do not stay together
Clear? You can see why this concept may be diﬀicult to grasp at first, but the truth is
that it’s easy.1
In the financial context:
Cointegration of (log-)prices yt refers to long-term co-movements.
Correlation of (log-)returns ∆yt = yt − yt−1 characterizes short-term co-movements in
(log-)prices yt .

1
Y. Feng and D. P. Palomar, A Signal Processing Perspective on Financial Engineering. Foundations and
Trends in Signal Processing, Now Publishers, 2016.
D. Palomar (HKUST) Pairs Trading 5 / 63
Correlation vs. cointegration
Example of high correlation with no cointegration:
5
ỹ1t
y2t
ỹ1t − y2t

−1
0 20 40 60 80 100 120 140 160 180 200

D. Palomar (HKUST) Pairs Trading 6 / 63

Correlation vs. cointegration
Indeed the returns are highly correlated, see scatter plot:
1

0.8

0.6

0.4

Log−returns of stock 2
0.2

−0.2

−0.4

−0.6

−0.8

−1
−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1
Log−returns of stock 1

D. Palomar (HKUST) Pairs Trading 7 / 63

Correlation vs. cointegration
Opposite example of high cointegration with no correlation:
1

0.5

−0.5

−1

−1.5

−2

−2.5

−3
y1t
y2t
y1t − y2t
−3.5
0 20 40 60 80 100 120 140 160 180 200

D. Palomar (HKUST) Pairs Trading 8 / 63

Correlation vs. cointegration
Indeed the returns are not correlated, see scatter plot:
1

0.8

0.6

0.4

Log−returns of stock 2
0.2

−0.2

−0.4

−0.6

−0.8

−1
−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1
Log−returns of stock 1

D. Palomar (HKUST) Pairs Trading 9 / 63

Cointegration

A time series is called integrated of order p, denoted as I(p), if the time series obtained
by differencing the time series p times is weakly stationary,
while by differencing the time series p − 1 times is not weakly stationary.
Example: stock log-prices yt are integrated of order I(1) because
log-prices are not stationary
but log-returns yt − yt−1 are stationary (at least for some period of time).
A multivariate time series is said to be cointegrated if it has at least one linear
combination being integrated of a lower order, e.g., yt is not stationary but wT yt is
stationary for some weights w.

D. Palomar (HKUST) Pairs Trading 10 / 63

Cointegration
Consider the following two nonstationary time series (e.g., log-prices of stocks):
y1t = γxt + w1t
y2t = xt + w2t
with a stochastic common trend defined as a random walk:
xt = xt−1 + wt
where w1t , w2t , wt are i.i.d. residual terms mutually independent.
The coeﬀicient γ is the secret ingredient here.
If γ is known, then we can define the so-called “spread”
zt = y1t − γy2t = w1t − γw2t
which is stationary and mean reverting.
Interestingly, the differences
( (i.e., log-returns) ∆y)1t and ∆y2t can have an arbitrarily small
√ √
correlation: ρ = 1/ 1 + 2σ12 /σ 2 1 + 2σ22 /σ 2 .
D. Palomar (HKUST) Pairs Trading 11 / 63
Cointegration
The log-prices y1t and y2t are cointegrated and the spread zt = y1t − γy2t is stationary
(assume γ = 1):
1

0.5

−0.5

−1

−1.5

−2

−2.5

−3
y1t
y2t
y1t − y2t
−3.5
0 20 40 60 80 100 120 140 160 180 200

D. Palomar (HKUST) Pairs Trading 12 / 63

Recall that if two time series are cointegrated, then in the long term they remain close to
each other.
In other words, the spread zt = y1t − γy2t is mean reverting.
This mean-reverting property of the spread can be exploited for trading and it is
commonly referred to as “pairs trading” or “statistical arbitrage”.
The idea behind pairs trading is to
short-sell the relatively overvalued stocks and buy the relatively undervalued stocks,
unwind the position when they are relatively fairly valued.

D. Palomar (HKUST) Pairs Trading 14 / 63

Trading the spread

Suppose the spread zt = y1t − γy2t is mean-reverting with zero mean.

Stat-arb trading:
if spread is low (zt < −s0 ), then stock 1 is undervalued and stock 2 overvalued:
buy the spread (i.e., buy stock 1 and short-sell stock 2)
unwind the positions when it reverts to zero after i time steps zt+i = 0
if spread is high (zt > s0 ), then stock 1 is overvalued and stock 2 undervalued:
short-sell the spread (i.e., short-sell stock 1 and buy stock 2)
unwind the positions when it reverts to zero after i time steps zt+i = 0
The profit, say, from buying low and unwinding at zero is zt+i − zt = s0 . So easy!
Indeed zt+i − zt = −γ(y2,t+i − y2t )[+ (y1,t+i
] − y1t ), so the whole process is like having
1
used a portfolio with weigths w = .
−γ
Recall that the return of a portfolio w is wT ∆yt .

D. Palomar (HKUST) Pairs Trading 15 / 63

Trading the spread
Illustration on how to trade the spread zt = y1t − γy2t :2

Sell Sell

Buy to unwind Buy to unwind

Sell to unwind

−s0

Buy

2
G. Vidyamurthy, Pairs Trading: Quantitative Methods and Analysis. John Wiley & Sons, 2004.
D. Palomar (HKUST) Pairs Trading 16 / 63
Pairs trading or statistical arbitrage
Statistical arbitrage can be used in practice with profits:3
0.5

Spread 0

−0.5
0 20 40 60 80 100 120 140 160 180 200
(a)
1

0.5
Position

−0.5

−1
0 20 40 60 80 100 120 140 160 180 200
(b)
40

30
P&L

0
0 20 40 60 80 100 120 140 160 180 200
(c)

3
M. Avellaneda and J.-H. Lee, “Statistical arbitrage in the US equities market,” Quantitative Finance,
vol. 10, no. 7, pp. 761–782, 2010.
D. Palomar (HKUST) Pairs Trading 17 / 63
But how to discover cointegrated pairs and γ?
One interesting approach is based on a VECM modeling of the universe of stocks: From
the parameter β contained in the low-rank matrix Π = αβ T one can extract a
cointegration subspace. After that, one can design some portfolio within that
cointegration subspace.4
A simpler approach to discover pairs is by brute force, i.e., try exhaustively different
combinations of pairs of stocks and see if they are cointegrated.
But, given a potential pair, how do we obtain the “secret” γ?
Easy! Just a simple LS regression!

Recall that
γ is needed to form the spread to be traded (i.e., portfolio)
the spread mean µ is needed to determine the thresholds for entering a trade and unwind
later the position.
4
Z. Zhao and D. P. Palomar, “Mean-reverting portfolio with budget constraint,” IEEE Trans. Signal
Process., vol. 66, no. 9, pp. 2342–2357, 2018.
D. Palomar (HKUST) Pairs Trading 18 / 63
Outline
1 Cointegration
2 Basic Idea of Pairs Trading
3 Design of Pairs Trading
Pairs selection
Cointegration test
Optimum threshold
4 LS Regression and Kalman for Pairs Trading
5 From Pairs Trading to Statistical Arbitrage (StatArb)∗
VECM
Optimization of mean-reverting portfolio (MRP)
6 Summary
Design of a pairs trading strategy

We first focus on pairs trading (i.e., statistical arbitrage between two stocks) as the
example to introduce the main steps of statistical arbitrage.
In practice, pairs trading contains three main steps5 :
Pairs selection: identify stock pairs that could potentially be cointegrated.
Cointegration test: test whether the identified stock pairs are indeed cointegrated or not.
Trading strategy design: study the spread dynamics and design proper trading rules.

5
G. Vidyamurthy, Pairs Trading: Quantitative Methods and Analysis. John Wiley & Sons, 2004.
D. Palomar (HKUST) Pairs Trading 20 / 63
Outline
1 Cointegration
2 Basic Idea of Pairs Trading
3 Design of Pairs Trading
Pairs selection
Cointegration test
Optimum threshold
4 LS Regression and Kalman for Pairs Trading
5 From Pairs Trading to Statistical Arbitrage (StatArb)∗
VECM
Optimization of mean-reverting portfolio (MRP)
6 Summary
Pairs selection: normalized price distance

Normalized price distance6 (as a rough proxy to measure cointegration):

∑
T
NPD ≜ (p̃1t − p̃2t )2
t=1

where the normalized price p̃1t of stock 1 is given by p̃1t = p1t /p10 . The normalized prices
of stock 2 defined similarly.
One can easily (i.e., cheaply) compute the NPD for all the possible combination of pairs
and select some pairs with smallest NPD as the potentially cointegrated pairs.
Later one can use a more refined measure of cointegration (more computationally
demanding).

6
E. Gatev, W. N. Goetzmann, and K. G. Rouwenhorst, “Pairs trading: Performance of a relative-value
arbitrage rule,” Review of Financial Studies, vol. 19, no. 3, pp. 797–827, 2006.
D. Palomar (HKUST) Pairs Trading 22 / 63
Outline
1 Cointegration
2 Basic Idea of Pairs Trading
3 Design of Pairs Trading
Pairs selection
Cointegration test
Optimum threshold
4 LS Regression and Kalman for Pairs Trading
5 From Pairs Trading to Statistical Arbitrage (StatArb)∗
VECM
Optimization of mean-reverting portfolio (MRP)
6 Summary
Least Squares (LS) regression
If the spread zt is stationary, it can be written as7

zt = y1t − γy2t = µ + ϵt

where
µ represents the equilibrium value and
ϵt is a zero-mean residual.
Equivalently, it can be written as

y1t = µ + γy2t + ϵt

which now has the typical form of linear regression.

Least squares (LS) regression over T observations:
∑
T
minimize (y1t − (µ + γy2t ))2
µ,γ
t=1
7
G. Vidyamurthy, Pairs Trading: Quantitative Methods and Analysis. John Wiley & Sons, 2004.
D. Palomar (HKUST) Pairs Trading 24 / 63
Cointegration test

LS regression is used to estimate the parameters µ and γ, obtaining the estimates µ̂ and
γ̂.
If y1t and y2t are I(1) and are cointegrated, then the estimates converge to the true values
as the number of observations goes to infinity8 .
Using the estimated parameters µ̂ and γ̂, we can compute the residuals

ϵ̂t = y1t − γ̂y2t − µ̂.

Then, one has to decide whether the spread is stationary, i.e., ϵt is stationary. In practice,
the estimated residuals are used ϵ̂t
There are many well-defined mathematical tests for the stationarity of ϵ̂t , e.g., augmented
Dicky-Fuller (ADF) test, Johansen test, etc.
8
R. F. Engle and C. W. J. Granger, “Co-integration and error correction: Representation, estimation, and
testing,” Econometrica: Journal of the Econometric Society, pp. 251–276, 1987.
D. Palomar (HKUST) Pairs Trading 25 / 63
Outline
1 Cointegration
2 Basic Idea of Pairs Trading
3 Design of Pairs Trading
Pairs selection
Cointegration test
Optimum threshold
4 LS Regression and Kalman for Pairs Trading
5 From Pairs Trading to Statistical Arbitrage (StatArb)∗
VECM
Optimization of mean-reverting portfolio (MRP)
6 Summary
Optimum threshold

Once some identified pairs have passed the cointegration test, one still needs to decide
the entry and exit thresholds to open and unwind the positions, respectively.
For the sake of concreteness, we focus on studying the entry threshold:
open positions when the spread diverges from its long-term mean by s0
unwind the position when it reverts to its mean
Thus, the key problem now is how to design the value of s0 such that the total profit is
maximized.
Total profit:
profit of each trade × number of trades
profit of each trade is s0
number of trades is related to the zero crossings, which can be analized theoretically as well
as empirically.
We focus now on estimating the number of trades.

D. Palomar (HKUST) Pairs Trading 27 / 63

Optimum threshold s0 : Parametric approach∗
Suppose the spread follows a standard Normal distribution.
The probability that the spread deviates above from the mean by s0 or more is

1 − Φ(s0 )

where Φ(·) is the c.d.f. of the standard Normal distribution.

For a path with T days, the number of tradable events is

T(1 − Φ(s0 )).

For each trade, the profit is s0 and then the total profit is s0 T(1 − Φ(s0 )).
Then the optimal threshold is s⋆0 = arg maxs0 {s0 T(1 − Φ(s0 ))}.
In practice, one cannot know the true distribution but can estimate the distribution
parameters.
Then one can compute the total profit based on estimated distribution.
D. Palomar (HKUST) Pairs Trading 28 / 63
Optimum threshold s0 : Parametric approach∗
Optimal threshold s⋆0 maximizes the total profit:
0.7 3
Theoretical
0.6 Parametric 2.5
Probability of trades

Profit of each trade

0.5
2
0.4
1.5
0.3
1
0.2

0.1 0.5

0 0
0 1 2 3 0 1 2 3
s0 s0
(a) (b)

0.25
Theoretical
0.2 Parametric
Total profit

0.15

0.1

0.05

0
0 0.5 1 1.5 2 2.5 3
s0
(c)

D. Palomar (HKUST) Pairs Trading 29 / 63

Optimum threshold s0 : Non-parametric approach∗

Suppose the observed sample path has length T: z1 , z2 , . . . , zT .

We consider J discretized threshold values as s0 ∈ {s01 , s02 , . . . , s0J } and the empirical
trading frequency for the threshold s0j is
∑T
t=1 1{zt >s0j }
f̄j = .
T

The empirical values f̄j may not be a smoothed enough and the resulted profit function
may not be accurate enough.
Smooth the trading frequency function by regularization:

∑
J ∑
J−1
minimize (f̄j − fj )2 + λ (fj − fj+1 )2
f
j=1 j=1

D. Palomar (HKUST) Pairs Trading 30 / 63

Optimum threshold s0 : Non-parametric approach∗
The problem can be rewritten as

minimize ∥f̄ − f∥22 + λ ∥Df∥22

where  
1 −1
 
 1 −1 

D=  ∈ R(J−1)×J .
.. .. 
 . . 
1 −1
Setting the derivative of the objective w.r.t. f to zero yields the optimal solution
f⋆ = (I + λDT D)−1 f̄.
The optimal threshold is the one maximizes the total profit:

s⋆0 = arg max {s0j fj } .

s0j ∈{s01 ,s02 ,...,s0J }

D. Palomar (HKUST) Pairs Trading 31 / 63

Optimum threshold s0 : Non-parametric approach∗
Optimal threshold s⋆0 maximizes the total profit:
0.5 3
Theoretical
0.4 NonParam: empirical 2.5
Probability of trades

Profit of each trade

NonParam: regularized
2
0.3
1.5
0.2
1
0.1 0.5

0 0
0 1 2 3 0 1 2 3
s0 s0
(a) (b)

0.2
Theoretical
NonParam: empirical
0.15 NonParam: regularized
Total profit

0.1

0.05

0
0 0.5 1 1.5 2 2.5 3
s0
(c)

D. Palomar (HKUST) Pairs Trading 32 / 63

Outline
1 Cointegration
2 Basic Idea of Pairs Trading
3 Design of Pairs Trading
Pairs selection
Cointegration test
Optimum threshold
4 LS Regression and Kalman for Pairs Trading
5 From Pairs Trading to Statistical Arbitrage (StatArb)∗
VECM
Optimization of mean-reverting portfolio (MRP)
6 Summary
LS regression for pairs trading
If the spread zt is stationary, it can be written as
zt = y1t − γy2t = µ + ϵt
where
µ represents the equilibrium value and
ϵt is a zero-mean residual.
Equivalently, it can be written as
y1t = µ + γy2t + ϵt
which now has the typical form for linear regression.
Least squares (LS) regression over T observations:
∑
T
minimize (y1t − (µ + γy2t ))2
µ,γ
t=1
By stacking the T observations in the vectors y1 and y2 , we can finally write:
minimize ∥y1 − (µ1 + γy2 )∥2
µ,γ
D. Palomar (HKUST) Pairs Trading 34 / 63
LS regression for pairs trading

Using the estimated parameters µ̂ and γ̂, we can compute the residuals
ϵ̂t = y1t − µ̂ − γ̂y2t .
Then, one has to decide whether the cointegration is acceptable or not so move to the
trading part.
There are many well-defined mathematical tests for the stationarity of ϵ̂t , e.g., augmented
Dicky-Fuller (ADF) test, Johansen test, etc.
Total profit:
profit of each trade × number of trades
profit of each trade is s0
number of trades is related to the zero crossings, which can be analized theoretically as well
as empirically.
Ideally, we want residuals with large amplitude (variance) as well as a strong mean
reversion because they directly affect the profit.

D. Palomar (HKUST) Pairs Trading 35 / 63

LS regression for pairs trading
One good case:
Z−score and trading signal for EWH vs EWZ 2000−08−01 / 2002−01−31
2 2

1 1

0 0

−1 −1

−2 −2
Z−score
−3 signal −3

Aug 01 Sep 01 Oct 02 Nov 01 Dec 01 Jan 02 Feb 01 Apr 02 May 01 Jun 01 Jul 02 Aug 01 Sep 04 Nov 01 Dec 03 Jan 02 Jan 31
2000 2000 2000 2000 2000 2001 2001 2001 2001 2001 2001 2001 2001 2001 2001 2002 2002

Cum P&L 2000−08−01 / 2002−01−31

2.5 2.5

2.0 2.0

1.5 1.5

1.0 1.0

D. Palomar (HKUST) Pairs Trading 36 / 63

LS regression for pairs trading
But also a bad case:
Z−score and trading signal for EWY vs EWT 2000−07−03 / 2001−12−31

8 Z−score 8
signal
6 6

4 4

2 2

0 0

−2 −2

Jul 03 Aug 01 Sep 01 Oct 02 Nov 01 Dec 01 Jan 02 Feb 01 Apr 02 May 01 Jun 01 Jul 02 Aug 01 Sep 04 Nov 01 Dec 03
2000 2000 2000 2000 2000 2000 2001 2001 2001 2001 2001 2001 2001 2001 2001 2001

Cum P&L 2000−07−03 / 2001−12−31

2.0 2.0

1.8 1.8

1.6 1.6

1.4 1.4

1.2 1.2

1.0 1.0

Jul 03 Aug 01 Sep 01 Oct 02 Nov 01 Dec 01 Jan 02 Feb 01 Apr 02 May 01 Jun 01 Jul 02 Aug 01 Sep 04 Nov 01 Dec 03
2000 2000 2000 2000 2000 2000 2001 2001 2001 2001 2001 2001 2001 2001 2001 2001

D. Palomar (HKUST) Pairs Trading 37 / 63

LS regression for pairs trading

The problem with the LS regression is that it assumes that µ and γ are constant.
In practice, they can change with time, resulting in a spread that drifts from equilibrim
never to revert back with huge potential losses.
Thus, in practice, µ and γ are time-varying and have to be tracked.
How to track time-varying parameters?

Of course… Kalman!!!
Well, you can also try a rolling regression or exponential smoothing, but Kalman works
better.

D. Palomar (HKUST) Pairs Trading 38 / 63

Kalman for pairs trading

Recall the previous static relationship for cointegrated series y1t and y2t :

y1t = µ + γy2t + ϵt

Let’s make it time-varying:

y1t = µt + γt y2t + ϵt
Let’s further assume that the parameters µt and γt change slowly over time:

µt+1 = µt + η1t
γt+1 = γt + η2t

Obviously, this fits nicely the Kalman framework!

D. Palomar (HKUST) Pairs Trading 39 / 63

Interlude: The Kalman filter

Kalman filter consist of two equations that model the time-varying hidden state xt and
the observations yt :
xt+1 = Tt xt + η t
yt = Zt xt + ϵt
The observation equation yt = Zt xt + ϵt relates the observation yt to the hidden state xt
as a linear relationship, where Zt is the time-varying observation matrix and ϵt is a
zero-mean Gaussian error ϵt ∼ N (0, R) with covariance matrix R.
The state transition equation xt+1 = Tt xt + η t expresses the transition of the hidden
state from xt to xt+1 as a linear relationship, where Tt is the time-varying transition
matrix and η t is a zero-mean Gaussian error η t ∼ N (0, Q) with covariance matrix Q.
The Kalman filter is extremely versatile in modeling a variety of real-life processes.9

9
J. Durbin and S. J. Koopman, Time Series Analysis by State Space Methods, 2nd Ed. Oxford University
Press, 2012.
D. Palomar (HKUST) Pairs Trading 40 / 63
Kalman for pairs trading
Kalman filter (state transition equation and observation equation):
xt+1 = Txt + η t
y1t = Zt xt + ϵt
where [ ]
µt
xt ≜ is the hidden state
γ
[ t ]
1 0
T≜ is the state transition matrix
0 1
[ ]
σ12 0
η t ∼ N (0, Q) is the i.i.d. state transition noise with Q =
0 σ22
[ ]
Zt ≜ (1 y2t) is the observation coeﬀicient matrix
ϵt ∼ N 0, σϵ2 is the i.i.d. observation noise
Note that this is a time-varying Kalman filter since Zt is time-varying.
Parameters σ12 , σ22 , σϵ2 can be estimated using the EM algorithm using historical data for
calibration.
The hidden state path xt gives the sought time-varying coeﬀicients.
D. Palomar (HKUST) Pairs Trading 41 / 63
Kalman for pairs trading
Log-prices of ETFs EWH and EWZ:
Log−prices 2000−08−01 / 2003−12−31

2.4 2.4

2.2 2.2

2.0 2.0

1.8 1.8

1.6 1.6

1.4 EWH 1.4

EWZ

Aug 01 Nov 01 Feb 01 Jun 01 Sep 04 Jan 02 Apr 01 Jul 01 Oct 01 Jan 02 Apr 01 Jul 01 Oct 01 Dec 31
2000 2000 2001 2001 2001 2002 2002 2002 2002 2003 2003 2003 2003 2003

D. Palomar (HKUST) Pairs Trading 42 / 63

Kalman for pairs trading
Tracking of µ and γ by LS, rolling LS, and Kalman:
Tracking of mu 2000−08−01 / 2003−12−31

mu.LS
1.0 mu.rolling.LS 1.0
mu.Kalman

0.8 0.8

0.6 0.6

0.4 0.4

Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 03 May 01 Aug 01 Nov 03
2000 2000 2001 2001 2001 2001 2002 2002 2002 2002 2003 2003 2003 2003

Tracking of gamma 2000−08−01 / 2003−12−31

gamma.LS
0.6 gamma.rolling.LS 0.6
gamma.Kalman
0.5 0.5

0.4 0.4

0.3 0.3

Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 03 May 01 Aug 01 Nov 03
2000 2000 2001 2001 2001 2001 2002 2002 2002 2002 2003 2003 2003 2003

D. Palomar (HKUST) Pairs Trading 43 / 63

Kalman for pairs trading
Spreads achieved by LS, rolling LS, and Kalman:
Spreads 2000−08−01 / 2003−12−31

LS
0.15 rolling.LS 0.15
Kalman
0.10 0.10

0.05 0.05

0.00 0.00

−0.05 −0.05

−0.10 −0.10

−0.15 −0.15

Aug 01 Nov 01 Feb 01 Jun 01 Sep 04 Jan 02 Apr 01 Jul 01 Oct 01 Jan 02 Apr 01 Jul 01 Oct 01 Dec 31
2000 2000 2001 2001 2001 2002 2002 2002 2002 2003 2003 2003 2003 2003

D. Palomar (HKUST) Pairs Trading 44 / 63

Kalman for pairs trading
Trading of spread from LS:
Z−score and trading on spread based on LS 2000−08−01 / 2003−12−31

2 Z−score 2
signal
1 1

0 0

−1 −1

−2 −2

−3 −3

Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 03 May 01 Aug 01 Nov 03
2000 2000 2001 2001 2001 2001 2002 2002 2002 2002 2003 2003 2003 2003

Cum P&L for spread based on LS 2000−08−01 / 2003−12−31

2.5 2.5

2.0 2.0

1.5 1.5

1.0 1.0

Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 03 May 01 Aug 01 Nov 03
2000 2000 2001 2001 2001 2001 2002 2002 2002 2002 2003 2003 2003 2003

D. Palomar (HKUST) Pairs Trading 45 / 63

Kalman for pairs trading
Trading of spread from rolling LS:
Z−score and trading on spread based on rolling−LS 2000−08−01 / 2003−12−31

2 Z−score 2
signal
1 1

0 0

−1 −1

−2 −2

−3 −3

Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 03 May 01 Aug 01 Nov 03
2000 2000 2001 2001 2001 2001 2002 2002 2002 2002 2003 2003 2003 2003

Cum P&L for spread based on rolling−LS 2000−08−01 / 2003−12−31

3.0 3.0

2.5 2.5

2.0 2.0

1.5 1.5

1.0 1.0

Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 03 May 01 Aug 01 Nov 03
2000 2000 2001 2001 2001 2001 2002 2002 2002 2002 2003 2003 2003 2003

D. Palomar (HKUST) Pairs Trading 46 / 63

Kalman for pairs trading
Trading of spread from Kalman:
Z−score and trading on spread based on Kalman 2000−08−01 / 2003−03−31

Z−score
2 signal 2

1 1

0 0

−1 −1

−2 −2

−3 −3

Aug 01 Oct 02 Dec 01 Feb 01 Apr 02 Jun 01 Aug 01 Oct 01 Dec 03 Feb 01 Apr 01 Jun 03 Aug 01 Oct 01 Dec 02 Feb 03 Mar 31
2000 2000 2000 2001 2001 2001 2001 2001 2001 2002 2002 2002 2002 2002 2002 2003 2003

Cum P&L for spread based on Kalman 2000−08−01 / 2003−03−31

4.0 4.0

3.5 3.5

3.0 3.0

2.5 2.5

2.0 2.0

1.5 1.5

D. Palomar (HKUST) Pairs Trading 47 / 63

Kalman for pairs trading
Wealth comparison:
Cum P&L 2000−08−01 / 2003−03−31

LS
4.0 4.0
rolling.LS
Kalman
3.5 3.5

3.0 3.0

2.5 2.5

2.0 2.0

1.5 1.5

1.0 1.0

Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 01 May 01 Aug 01 Nov 01 Feb 03
2000 2000 2001 2001 2001 2001 2002 2002 2002 2002 2003

D. Palomar (HKUST) Pairs Trading 48 / 63

Kalman filter in finance

The Kalman filter can and has been used in many aspects of financial time-series
modeling as one could expect.10
Examples of univariate time series: rate of inflation, national income, level of
unemployment, etc.
Typical models include: local model, trend-cycle decompositions, seasonality, etc.
Examples of multivariate time series: inflation and national income.
Multiple time series allows for more sophisticated models including common factors,
cointegration, etc.
Also data irregularities can be easily handled, e.g., missing observations, outliers, mixed
frequencies.
Plenty of applications for nonlinear and non-Gaussian models as well, e.g., GARCH
modeling and stochastic volatility modeling.

10
A. Harvey and S. J. Koopman, “Unobserved components models in economics and finance: The role of the
Kalman filter in time series econometrics,” IEEE Control Systems Magazine, vol. 29, no. 6, pp. 71–81, 2009.
D. Palomar (HKUST) Pairs Trading 49 / 63
Outline
1 Cointegration
2 Basic Idea of Pairs Trading
3 Design of Pairs Trading
Pairs selection
Cointegration test
Optimum threshold
4 LS Regression and Kalman for Pairs Trading
5 From Pairs Trading to Statistical Arbitrage (StatArb)∗
VECM
Optimization of mean-reverting portfolio (MRP)
6 Summary
From pairs trading to statistical arbitrage

Pairs trading focuses on finding cointegration between two stocks.

A more general idea is to extend this statistical arbitrage from two stocks to more stocks.

The idea is still based on cointegration:

Try to construct a linear combination of the log-prices of multiple (more than two)
stocks such that it is a cointegrated mean-reversion process.

] is zt = y1t − γy2t , which can be understood as a

In the case of two assets, the[ spread
1
portfolio with weights: w = .
−γ
In the general case of many assets, one has to properly design the portfolio w.

D. Palomar (HKUST) Pairs Trading 51 / 63

Denote the log-prices of multiple stocks as yt and the log-returns as rt = ∆yt = yt − yt−1 .
Most of the multivariate time-series models attempt to model the log-returns rt (because
the log-prices are nonstationary whereas the log-returns are weakly stationary, at least
over some time horizon).
However, it turns out that differencing the log-prices may destroy part of the structure.
The VECM11 tries to fix that issue by including an additional term in the model:
p−1
∑
rt = ϕ0 + Πyt−1 + Φ̃i rt−i + wt ,
i=1

where the term Πyt−1 is called error correction term.

11
R. F. Engle and C. W. J. Granger, “Co-integration and error correction: Representation, estimation, and
testing,” Econometrica: Journal of the Econometric Society, pp. 251–276, 1987.
D. Palomar (HKUST) Pairs Trading 53 / 63
VECM - Matrix Π

The matrix Π is of extreme importance.

∑p−1
Notice that from the model rt = ϕ0 + Πyt−1 + i=1 Φ̃i rt−i + wt one can conclude that
Πyt must be stationary even though yt is not!!!
If that happens, it is said that yt is cointegrated.
There are three possibilities for Π:
rank (Π) = 0: This implies Π = 0, thus yt is not cointegrated (so no mystery here) and the
VECM reduces to a VAR model on the log-returns.
rank (Π) = N: This implies Π is invertible and thus yt must be stationary already.
0 < rank (Π) < N: This is the interestinc case and Π can be decomposed as Π = αβ T with
α, β ∈ RN×r with full column rank. This means that yt has r linearly independent
cointegrated components, i.e., β T yt , each of which can be used for pairs trading.

D. Palomar (HKUST) Pairs Trading 54 / 63

] is zt = y1t − γy2t , which can be understood as a

In the case of two assets, the[ spread
1
portfolio with weights: w = .
−γ
In the general case of many assets, one has to properly design the portfolio w.
One interesting approach is based on a VECM modeling of the universe of stocks:
From the parameter β contained in the low-rank matrix Π = αβ T one can simply use any
column of β (even all of them)
Even better, β defines a cointegration subspace and we can then optimize the portfolio
within that cointegration subspace.12,13

12
Z. Zhao and D. P. Palomar, “Mean-reverting portfolio with budget constraint,” IEEE Trans. Signal
Process., vol. 66, no. 9, pp. 2342–2357, 2018.
13
Z. Zhao, R. Zhou, and D. P. Palomar, “Optimal mean-reverting portfolio with leverage constraint for
statistical arbitrage in finance,” IEEE Trans. Signal Process., vol. 67, no. 7, pp. 1681–1695, 2019.
D. Palomar (HKUST) Pairs Trading 56 / 63
Mean-reverting portfolio (MRP)

Consider the log-prices yt and use β to extract several spreads st = β T yt .

Let’s now use a portfolio w to extract the best mean-reverting spread from st as
zt = wT st .
To design the the portfolio w we have two main objectives (recall that total profit equals:
profit of each trade × number of trades):
we want[large variance (profit of each]trade): wT M0 w, where
T
Mi = E (st − E [st ]) (st+i − E [st+i ])
we want strong mean reversion (number of trades): many proxies exist like the Portmanteau
statistics or crossing statistics.

D. Palomar (HKUST) Pairs Trading 57 / 63

Mean-reverting portfolio (MRP)

For example, if we use the Portmanteau statistics as a proxy for the mean reversion, the
problem formulation becomes:
∑p ( )2
wT Mi w
minimize i=1 wT M0 w
w
subject to wT M0 w = ν
w ∈ W.

Using other proxies, the formulation can be expressed more generally as14
∑p ( )2
minimize wT Hw + λ i=1 wT Mi w
w
subject to wT M0 w = ν
w ∈ W.
14
Z. Zhao and D. P. Palomar, “Mean-reverting portfolio with budget constraint,” IEEE Trans. Signal
Process., vol. 66, no. 9, pp. 2342–2357, 2018.
D. Palomar (HKUST) Pairs Trading 58 / 63
MRP in practice
Observe several stock log-prices and the spreads obtained from β:

5.5
APA
AXP
CAT
5.0 COF
FCX
IBM
MMM

Log-prices
4.5

4.0

3.5

3.0

6.0
s1

5.5

5.4
5.2
s2

5.0

5.6
s3

5.4
5.2
2012 2013 2014

D. Palomar (HKUST) Pairs Trading 59 / 63

MRP in practice
Observe several stock log-prices and the spreads obtained from β:

0.4

Spreads
0.2
0 MRP-cro (prop.)
-0.2 Spread s 1
-0.4

0.02
MRP-cro (prop.) - SR=3.2471

ROI
0

-0.02

0.01
Spread s 1 - SR=2.8579
ROI

0
-0.01

8
MRP-cro (prop.)
Spread s 1
6
Cum. P&L

0
2012 2013 2014

D. Palomar (HKUST) Pairs Trading 60 / 63

First of all, we have discovered the concept of cointegration.

We have learned the basic idea of pairs trading for cointegrated assets:
searching for a cointegrated spread is the first step
making sure that the chosen spread remains cointegrated is key (cointegrated tests)
obtaining the cointegration ratio γ and the entering and exiting thresholds are important
details.
We have learned of the use of Kalman (initially developed for tracking missiles) filtering
for improved pairs trading.
We have briefly explored the extension of pairs trading (for two stocks) to statistical
arbitrage (for more than two stocks):
VECM modeling is an important multivariate time-series modeling tool
sophisticated portfolio designs on the cointegration subspace are possible.

D. Palomar (HKUST) Pairs Trading 62 / 63

Thanks

For more information visit:

https://www.danielppalomar.com

Definitive Guide To Pairs Trading
100% (2)
Definitive Guide To Pairs Trading
178 pages
Pairs Trading and Selection Methods is Co Integration Superior
No ratings yet
Pairs Trading and Selection Methods is Co Integration Superior
17 pages
Trading Pairs With Excel Python by Anjana Gupta @tradingpdfgratis2
No ratings yet
Trading Pairs With Excel Python by Anjana Gupta @tradingpdfgratis2
93 pages
Efficient Pair Selection For Pair-Trading Strategies
No ratings yet
Efficient Pair Selection For Pair-Trading Strategies
16 pages
Advanced Portfolio Management: A Quant's Guide for Fundamental Investors
From Everand
Advanced Portfolio Management: A Quant's Guide for Fundamental Investors
Giuseppe A. Paleologo
No ratings yet
Selection of A Portfolio of Pairs Based On Cointegration: A Statistical Arbitrage Strategy
No ratings yet
Selection of A Portfolio of Pairs Based On Cointegration: A Statistical Arbitrage Strategy
32 pages
Pairs Trading
No ratings yet
Pairs Trading
21 pages
@traderslibrary2 Gupta Trading Pairs Advance Statistical Tools For
100% (1)
@traderslibrary2 Gupta Trading Pairs Advance Statistical Tools For
68 pages
Positional Option Trading: An Advanced Guide
From Everand
Positional Option Trading: An Advanced Guide
Euan Sinclair
2.5/5 (2)
Aquaponics Literature Review
90% (10)
Aquaponics Literature Review
15 pages
META HPA - e - 02 User Manual
100% (1)
META HPA - e - 02 User Manual
16 pages
Pairs Trading - slides_pairs_trading
No ratings yet
Pairs Trading - slides_pairs_trading
63 pages
Finding The Optimal Pre-Set Boundaries For Pairs Trading Strategy
No ratings yet
Finding The Optimal Pre-Set Boundaries For Pairs Trading Strategy
27 pages
Cointegration (6)
No ratings yet
Cointegration (6)
32 pages
Selection of A Portafolio of Pairs On Cointegration - Statistical Arbitrage Strategy
No ratings yet
Selection of A Portafolio of Pairs On Cointegration - Statistical Arbitrage Strategy
28 pages
Pairs Trading: A Statistical Arbitrage Strategy
No ratings yet
Pairs Trading: A Statistical Arbitrage Strategy
21 pages
Pairs Trading: An Implementation of The Stochastic Spread and Cointegration Approach
No ratings yet
Pairs Trading: An Implementation of The Stochastic Spread and Cointegration Approach
29 pages
A New Approach To Modeling and Estimation For Pairs Trading
No ratings yet
A New Approach To Modeling and Estimation For Pairs Trading
31 pages
2014-Huck
No ratings yet
2014-Huck
16 pages
ssrn-4771108
No ratings yet
ssrn-4771108
13 pages
Statistical Arbitrage Pairs Trading Strategies: Review and Outlook
No ratings yet
Statistical Arbitrage Pairs Trading Strategies: Review and Outlook
33 pages
Trading Strategies
No ratings yet
Trading Strategies
5 pages
Pairs Trading Strategy For BANKNIFTY
100% (2)
Pairs Trading Strategy For BANKNIFTY
16 pages
AM The Profitability of Pairs Trading Strategies
No ratings yet
AM The Profitability of Pairs Trading Strategies
36 pages
Selecao de Uma Carteira de Par
No ratings yet
Selecao de Uma Carteira de Par
32 pages
Understanding Pairs Trading: Yuan Chen (Vincent)
No ratings yet
Understanding Pairs Trading: Yuan Chen (Vincent)
38 pages
Lecture23 Pairs Trading
100% (1)
Lecture23 Pairs Trading
24 pages
FULLTEXT01
No ratings yet
FULLTEXT01
44 pages
Huang 2018
No ratings yet
Huang 2018
18 pages
Institut Für Wirtschaftspolitik Und Quantitative Wirtschaftsforschung Diskussionspapier Discussion Papers No. 09/2015
No ratings yet
Institut Für Wirtschaftspolitik Und Quantitative Wirtschaftsforschung Diskussionspapier Discussion Papers No. 09/2015
62 pages
The Profitability of Pairs Trading Strategies: Distance, Cointegration, and Copula Methods
No ratings yet
The Profitability of Pairs Trading Strategies: Distance, Cointegration, and Copula Methods
35 pages
Optimizing Pairs Trading of US Equities in a High Frequency Setting
No ratings yet
Optimizing Pairs Trading of US Equities in a High Frequency Setting
28 pages
6 - Enhancing a Pairs Trading Strategy With ML
No ratings yet
6 - Enhancing a Pairs Trading Strategy With ML
10 pages
MLbootcamp
No ratings yet
MLbootcamp
54 pages
cvx_ccv_stat_arb
No ratings yet
cvx_ccv_stat_arb
20 pages
2018 WB 2869 - QuantConnect PairTradinginPython
100% (1)
2018 WB 2869 - QuantConnect PairTradinginPython
21 pages
Master Thesis
No ratings yet
Master Thesis
78 pages
Draft_Paper
No ratings yet
Draft_Paper
9 pages
Trading FX Toolz
100% (1)
Trading FX Toolz
155 pages
Pairs Trading G GR
No ratings yet
Pairs Trading G GR
31 pages
Issue 85 - Aug 2024 - Full Text Part 03
No ratings yet
Issue 85 - Aug 2024 - Full Text Part 03
1,291 pages
Some Quantitative Issues in Pairs Trading
No ratings yet
Some Quantitative Issues in Pairs Trading
6 pages
A Pairs Trading Strategy Based On Mixed Copulas
No ratings yet
A Pairs Trading Strategy Based On Mixed Copulas
32 pages
Bachelor Thesis
No ratings yet
Bachelor Thesis
55 pages
Some of Existing Method of Pair Trading
No ratings yet
Some of Existing Method of Pair Trading
11 pages
Pairs Trading - Performance of A Relative-Value Arbitrage Rule
No ratings yet
Pairs Trading - Performance of A Relative-Value Arbitrage Rule
31 pages
Stat Arb High Freq Eurostoxx
No ratings yet
Stat Arb High Freq Eurostoxx
31 pages
Statistical Arbitrage in The U.S. Equities Market
No ratings yet
Statistical Arbitrage in The U.S. Equities Market
47 pages
Thesis Schmidt
No ratings yet
Thesis Schmidt
130 pages
GR 1
No ratings yet
GR 1
12 pages
The Process of Finding and Traiding Pairs
No ratings yet
The Process of Finding and Traiding Pairs
2 pages
Econometircs Group Work
No ratings yet
Econometircs Group Work
4 pages
Pairs Trading Cointegration Approach
No ratings yet
Pairs Trading Cointegration Approach
82 pages
Statistical Arbitrage and High-Frequency Data With An Application To Eurostoxx 50 Equities
No ratings yet
Statistical Arbitrage and High-Frequency Data With An Application To Eurostoxx 50 Equities
16 pages
Basket Trading Under Co-Integration With The
No ratings yet
Basket Trading Under Co-Integration With The
14 pages
Strategic Risk Management: Designing Portfolios and Managing Risk
From Everand
Strategic Risk Management: Designing Portfolios and Managing Risk
Campbell R. Harvey
No ratings yet
The Unlucky Investor's Guide to Options Trading
From Everand
The Unlucky Investor's Guide to Options Trading
Julia Spina
3.5/5 (2)
Java Question Bank
No ratings yet
Java Question Bank
10 pages
Campbell Hausfeld Compressor Manual
No ratings yet
Campbell Hausfeld Compressor Manual
23 pages
OPSS - PROV 180 Nov16
No ratings yet
OPSS - PROV 180 Nov16
10 pages
4.ce417 Note Ch4 Part 1
No ratings yet
4.ce417 Note Ch4 Part 1
60 pages
Connotation Vs Denotation Dla 12-18-18
No ratings yet
Connotation Vs Denotation Dla 12-18-18
7 pages
June Vape Directory
No ratings yet
June Vape Directory
48 pages
Two-Phase Flow Venting From Reactor
No ratings yet
Two-Phase Flow Venting From Reactor
9 pages
Sanyo Em-C1900 SM
No ratings yet
Sanyo Em-C1900 SM
28 pages
Foraging Fashion: African American Influences on Cultural Aesthetics
No ratings yet
Foraging Fashion: African American Influences on Cultural Aesthetics
2 pages
Assignment 1
No ratings yet
Assignment 1
13 pages
Grade 11 Media Arts - Photography Unit
No ratings yet
Grade 11 Media Arts - Photography Unit
5 pages
Pamt311 Cornell Notes Dapilaga.docx
No ratings yet
Pamt311 Cornell Notes Dapilaga.docx
3 pages
Propagation Modelling For Wireless Local Loop Channel
No ratings yet
Propagation Modelling For Wireless Local Loop Channel
11 pages
ATC Metar
No ratings yet
ATC Metar
6 pages
Final Role and Competencies of Od Practitioner
100% (1)
Final Role and Competencies of Od Practitioner
31 pages
Design_and_delivery_of_Ireland_s_largest
No ratings yet
Design_and_delivery_of_Ireland_s_largest
6 pages
Analyzing Business Markets and Business Buying Behavior
No ratings yet
Analyzing Business Markets and Business Buying Behavior
4 pages
Stress Concentration
No ratings yet
Stress Concentration
9 pages
ST - ST Catalogue (TSINGSHAN) - Compressed
No ratings yet
ST - ST Catalogue (TSINGSHAN) - Compressed
64 pages
Phil G E P S: Ippine Overnment Lectronic Rocurement Ystem
No ratings yet
Phil G E P S: Ippine Overnment Lectronic Rocurement Ystem
11 pages
22 Improvised Weapons
No ratings yet
22 Improvised Weapons
1 page
Action Research Funded by BERF (By Abdullah)
100% (3)
Action Research Funded by BERF (By Abdullah)
36 pages
BCM 3 Lime Plaster N Plaster of Paris Plaster
No ratings yet
BCM 3 Lime Plaster N Plaster of Paris Plaster
10 pages
Su Jeet Kumar Gupta
No ratings yet
Su Jeet Kumar Gupta
3 pages
Gas Turbine Power Generation
No ratings yet
Gas Turbine Power Generation
14 pages
How To Trade Synthetic Indices
No ratings yet
How To Trade Synthetic Indices
25 pages
Circular Organizing and Triple Loop Learning
No ratings yet
Circular Organizing and Triple Loop Learning
10 pages
It'S The Detail That Counts!: Wabco Air Dryer Cartridges
No ratings yet
It'S The Detail That Counts!: Wabco Air Dryer Cartridges
1 page