Factor Model for Option Returns Analysis
Factor Model for Option Returns Analysis
Matthias Buechner
Bryan T. Kelly
For helpful comments and suggestions we thank Torben Andersen, Bjorn Eraker, Scott Murray, Simon
Schmickler, Peter van Tassel, Yinan Su, Kaushik Vasudevan, Steffen Windmueller, and conference
participants at the FMA Derivatives & Volatility 2019 Workshop, the WFA Annual Meet- ing 2020,
the Econometric Society World Congress 2020, and seminar participants at Warwick University. Matthias
Buechner gratefully acknowledges the support of the Centre for Endowment Asset Management (CEAM)
at Judge Business School, University of Cambridge. AQR Capital Management is a global investment
management firm, which may or may not apply similar investment techniques or methods of analysis
as described herein. The views expressed here are those of the authors and not necessarily those of
AQR or the National Bureau of Economic Research.
At least one co-author has disclosed additional relationships of potential relevance for this research.
Further information is available online at [Link]
NBER working papers are circulated for discussion and comment purposes. They have not been peer-
reviewed or been subject to the review by the NBER Board of Directors that accompanies official
NBER publications.
© 2021 by Matthias Buechner and Bryan T. Kelly. All rights reserved. Short sections of text, not to
exceed two paragraphs, may be quoted without explicit permission provided that full credit, including
© notice, is given to the source.
A Factor Model For Option Returns
Matthias Buechner and Bryan T. Kelly
NBER Working Paper No. 29369
October 2021
JEL No. G1,G12
ABSTRACT
Due to their short lifespans and migrating moneyness, options are notoriously difficult to study
with the factor models commonly used to analyze the risk-return trade-off in other asset classes.
Instrumented principal components analysis solves this problem by tracking contracts in terms of
their pricing-relevant characteristics via time-varying latent factor loadings. We find that a model
with three latent factors prices the cross-section of option returns and explains more than 85% of
the variation in a panel of monthly S&P 500 option returns from 1996 to 2017. In particular, we
show that the IPCA factors can be rationalized via an economically plausible three-factor model
consisting of a level, slope and skew factor. Finally, out-of-sample trading strategies based on
insights from the IPCA model have significant alpha over previously studied option strategies.
Matthias Buechner
Warwick University
Scarman Rd
Coventry CV4 7AL
United Kingdom
[Link].16@[Link]
Bryan T. Kelly
Yale School of Management
165 Whitney Ave.
New Haven, CT 06511
and NBER
[Link]@[Link]
A Factor Model for Option Returns
Matthias Büchnera , Bryan Kellyb,∗
a
Centre for Endowment Asset Management, Judge Business School, University of Cambridge, Trumpington St,
Cambridge, CB2 1AG, United Kingdom
b
Yale University, Yale School of Management, 165 Whitney Avenue, New Haven, CT 06511, USA
AQR Capital Management, Two Greenwich Plaza, Greenwich, CT 06830, USA
National Bureau of Economic Research, Cambridge MA 02138, USA
Abstract
Due to their short lifespans and migrating moneyness, options are notoriously difficult to study
with the factor models commonly used to analyze the risk-return trade-off in other asset classes.
Instrumented principal components analysis solves this problem by tracking contracts in terms of
their pricing-relevant characteristics via time-varying latent factor loadings. We find that a model
with three latent factors prices the cross-section of option returns and explains more than 85% of
the variation in a panel of monthly S&P 500 option returns from 1996 to 2017. In particular, we
show that the IPCA factors can be rationalized via an economically plausible three-factor model
consisting of a level, slope and skew factor. Finally, out-of-sample trading strategies based on
insights from the IPCA model have significant alpha over previously studied option strategies.
Keywords:
Option Return, Factor Model, Return Predictability, IPCA
JEL codes: G11; G12; G13
1. Introduction
Asset pricing aims to understand the risk-reward trade-off that investors face in financial mar-
kets. The most common empirical approach for evaluating this trade-off is to model returns with
a low-dimensional common factor structure. The structure of option contracts makes it difficult to
∗
Corresponding author
Email addresses: [Link]@[Link] (Matthias Büchner), [Link]@[Link] (Bryan
Kelly )
URL: [Link] (Matthias Büchner), [Link] (Bryan Kelly )
Preprint submitted to Journal of Financial Economics First draft: June 2019. This draft: September 28, 2021
model their returns in this way. Instead, the literature primarily studies the risk-return trade-off in
options using parametric no-arbitrage pricing models. These models require a full specification of
underlying distributions and dynamics. While the parametric approach benefits from arbitrage-free
pricing and mathematical elegance, it is prone to model misspecification and likely too simplistic
to describe empirically observed patterns of options returns. In practice, no-arbitrage models fail
to account for a large part of the empirically observed variation in option returns (Israelov and
Kelly, 2017).
Our objective is to develop an understanding for the risk-return trade-off in option markets
using a factor pricing approach commonly applied in other asset classes. The motivation for factor
modeling is independent of the asset class studied: from the asset pricing Euler equation and the
assumption of no-arbitrage, a stochastic discount factor, mt+1 , exists that satisfies Et [mt+1 ri,t+1 ] = 0,
and hence
Covt (mt+1 , ri,t+1 ) Vart (mt+1 )
Et [ri,t+1 ] = − . (1)
Vart (mt+1 ) Et [mt+1 ]
For the first ratio in Equation (1) we adopt the notation βi,t and note that it describes the conditional
exposure of asset i to systematic risk factors. For the second ratio we adopt the notation λt . It can
be interpreted as the conditional price of risk associated with factors. When mt+1 is linear in factors
ft+1 , as assumed in many asset pricing studies, the cross section of excess returns satisfies a linear
factor model:
ri,t+1 = αi,t + β′i,t ft+1 + i,t+1 , (2)
where for all i and t we have Et [i,t+1 ] = E[i,t+1 ft+1 ] = 0, Et [ ft+1 ] = λt , and αi,t = 0.
Typical factor models used in asset pricing are significantly less restrictive than parametric no-
arbitrage option pricing models since they forgo the need to fully specify underlying distributions
and dynamics. However, typical factor modeling approaches rely on pre-ordained factors and
require a long time series of asset returns to estimate betas. This approach is difficult with options
data for a few reasons. First, their short lives make it hard to estimate option betas with time
series regression. Second, rapid migration of option attributes (such as moneyness and maturity)
means that option risk exposures likewise migrate rapidly over time. Thus, it is important to
incorporate conditional betas in option factor models, which further limits the viability of time
2
series regression. Third, the factors are difficult to ascertain a priori. If instead one were to consider
estimating a latent factor model, the same complications that make it difficult to estimate option
betas via time series regression take the de facto latent factor method of principal components
analysis off the table due to its reliance on static betas.
In this paper we take a different tack by treating risk factors as latent, but allow for time
variation in the factor structure. We use the instrumented principal components analysis (IPCA)
methodology of Kelly, Pruitt and Su (2019), which explicitly accounts for time variation in indi-
vidual asset behavior by allowing risk factor loadings, βi,t , to depend on observable asset charac-
teristics. The characteristics serve as instrumental variables for conditional betas, which avoids the
limitations of static betas in time series regression. In fact, a simple Taylor series decomposition
of the option price demonstrates that option returns can be approximately explained via contract
specific coefficients related to option Greeks and common factors.1 However, any theoretically
motivated decomposition is ultimately dependent on a specific model of option prices. An advan-
tage of the IPCA approach is that it does not take a stand on the exact details of the underlying
model and instead infers the relevant sources of risk from a large sample of realized option returns.
Previous literature studying the cross section of option returns has dealt with the challenges
posed by options (short lifespans and time-varying risk attributes) by using empirical techniques
such as portfolio sorts to construct ad hoc prespecified factors. This approach has had some success
in describing option returns, but the resulting fraction of explained variation in option returns is
underwhelming.2 This is primarily due to standard equity-based asset pricing techniques being
ill-equipped to deal with time-varying factor loadings. For example, rolling regressions are too
slow to capture changes in contract risk. In contrast, IPCA offers an internally consistent approach
to estimate conditional loadings and factors simultaneously. In addition, unlike their underlying
assets which have an ambiguous set of asset characteristics, option contracts are special in the
sense that some of their characteristics, like moneyness and maturity, are easy to measure and
unambiguously relevant to the contract’s factor risk. Hence, IPCA is particularly well-suited to the
1
We discuss this decomposition in Section 4.7.
2
These papers include among others Coval and Shumway (2001); Goyal and Saretto (2009); Frazzini and Pedersen
(2012); Cao et al. (2021); Karakaya (2013)
3
analysis of option returns.
1.1. Findings
As the laboratory to study the efficacy of the IPCA methodology in the context of option re-
turns we focus on the panel of monthly S&P 500 option returns from 1996 to 2017. Most prior
academic literature restricts analysis to particular subsets of option contracts, for example focusing
on at-the-money contracts with one month to maturity, because it makes their analyses less sen-
sitive to misspecification biases that are exacerbated when looking across contracts with different
moneyness and maturities. In contrast, our goal is to achieve an accurate description of return
variation and risk compensation for a wide range of contracts in a single model.
Taking the IPCA model to our sample of option returns, we assess the fit performance of option
return factor models in two dimensions. The first is how well the estimated factors and betas cap-
ture the contemporaneous variation in realized returns. In particular, we evaluate candidate models
in terms of their “total R2 ” which measures the fraction of variance in individual option contract re-
turns ri,t+1 explained by β̂′i,t fˆt+1 , where β̂i,t are estimates of conditional loadings on estimated latent
risk factors fˆt+1 .
Second, we inspect a model’s match of differences in expected returns across assets, which
describes the accuracy of the model-implied risk-return trade-off. We measure this as a “predictive
R2 ” or the fraction of variance in realized returns ri,t+1 explained by the model-implied conditional
expected returns β̂′i,t λ̂, where λ̂ denotes the model’s estimated risk prices.3 We compute both mea-
sures by aggregating over all option contracts and time periods.
We begin with a one-factor IPCA specification and with a set of characteristics consisting of
option moneyness, time to maturity, implied volatility, embedded leverage, and Black-Merton-
Scholes (BMS) “Greeks.” A single latent factor is sufficient to explain around 72% of the return
variation in delta-hedged option returns. The prespecified factor models that we study, including
variations of the Fama-French-Carhart model and two models that additionally include option-
specific factors such as the Frazzini and Pedersen (2012) betting-against-beta (or embedded lever-
3
Alternatively, one might entertain time-varying λt . We find that the gains from incorporating time-varying risk
prices are small.
4
age) factor and the Coval and Shumway (2001) option straddle factor, fail to achieve a total R2
close to that of IPCA. Allowing for additional latent factors in IPCA further improves the model’s
ability to describe joint fluctuations in the panel of option returns. The total R2 for the model with
five factors reaches more than 90%.
Further, the IPCA model allows us to test whether average option returns are explained by
factor risk or if contracts earn additional compensation (i.e., alpha) above and beyond that war-
ranted by their risks. We find that at least three factors are necessary to price the cross-section of
option returns on average. This finding is consistent with models such as Carr and Wu (2020) and
Christoffersen et al. (2018) that advocate for low-dimensional multi-component models to capture
option returns. However, we note that this finding does not rule out existence of significant alphas
in subsets of contracts, similar to the findings of Jones (2006). Indeed, in an analysis of alphas in
subsets of our sample (i.e. characteristic-managed portfolios) we find evidence for some significant
managed portfolio alphas.
In terms of predictive R2 , the IPCA model generally outperforms prespecified factor models by
producing more accurate model-based option return forecasts (both in-sample and out-of-sample).
The best performing prespecified factor models from earlier literature include option specific fac-
tors such as the embedded leverage factor of Frazzini and Pedersen (2012) and the Coval and
Shumway (2001) straddle factor, though these underperform IPCA. When comparing IPCA with
its static principal components analysis (PCA), we find that PCA is incapable of matching the
behavior of option returns due to its counterfactual requirement of static betas for individual con-
tracts. This highlights the important role played by time-varying factor loadings in IPCA. Since the
individual option time series are short (only a handful of monthly observations) it is challenging
to estimate static time series regression betas. IPCA overcomes this issue by restating the cross
section in terms of managed portfolios which do not suffer from the extreme panel unbalancedness
that plagues individual contracts. Yet IPCA still exploits vital cross-sectional variation related to
individual contract characteristics. Compared to factor models with prespecified factors, we find
that IPCA provides a more accurate description of the risk-return trade-off in index options. IPCA
alphas are on average about one percentage point per annum smaller (i.e., less than half as large)
5
compared to competing models.
IPCA is a statistical factor model, thus an important (and challenging) task is interpreting the
factors that IPCA recovers. We find that a benchmark IPCA model can be roughly interpreted as
capturing three flavors of risk. The first is the overall level of the volatility surface, the second is
associated with maturity risk and summarized by the term structure slope of the volatility surface,
and the third is associated with the moneyness skew of the volatility surface (and hence associated
with index tail risk). In fact, when we construct observable factors by sorting options into level,
maturity slope, and moneyness skew factors, we find that these explain between 70% and 90%
of the time series variation in our set of three IPCA factors. However, we stress that this factor
interpretation is only a coarse approximation to IPCA. The fact that IPCA seeks statistically op-
timal factors, rather than relying on some ad hoc construction, is a key source of its superior and
robust performance. And, as we learn from the IPCA model, time-varying loadings on the three
observable factors are critical in order to come anywhere near the explanatory power of the IPCA
model.
We examine which characteristics are the most important contributors to the accurate fits of
IPCA. Option implied volatility and vega matter most for describing time variation in betas and
produce the largest contributions to total R2 . Option gamma, a measure of sensitivity to jump
risk, is another important driver of options’ factor betas. In addition to studying the importance of
characteristics in our baseline model specification, we assess our choice of characteristics vis-á-vis
an economically motivated list of characteristics from an option return decomposition. We find
that all of our results are robust to the inclusion of additional option characteristics.
While most of our analysis uses monthly data, we also evaluate the IPCA model performance
for daily returns. This analysis is motivated by Carr and Wu (2020), who propose a decomposition
of daily option returns using a BMS approximation that attributes return performance to each of the
BMS Greeks. Their attribution accurately describes returns of short-dated at-the-money contracts
(where the approximation is most appropriate), but it struggles to describe returns of out-of-the
money contracts and at longer maturities. We show that, in contrast to the Carr and Wu (2020)
attribution, IPCA offers a uniformly accurate description of risk and return for options throughout
6
the moneyness and maturity spectrum. A further robustness exercise demonstrates that IPCA still
performs well when including contracts with up to two years in maturity.
1.2. Literature
Our paper relates to three main strands of literature. The first seeks to understand option returns
by sorting options into portfolios based on a small number of characteristics, then measuring the
full sample average returns of these portfolios. This literature includes among others Coval and
Shumway (2001), Bakshi and Kapadia (2003), Goyal and Saretto (2009), Frazzini and Pedersen
(2012), Cao and Han (2013), Karakaya (2013), Cao, Han, Zhan and Tong (2021), and Vasquez
(2017).
A second strand of literature models option prices directly by enforcing no-arbitrage restric-
tions and specifying the full distributional properties and dynamics of the underlying asset. Start-
ing from the seminal Black and Scholes (1973) model for vanilla options, this literature develops
refinements that allow for various forms of stochastic volatility and jumps (Heston, 1993; Duffie
et al., 2000; Carr and Wu, 2004).4 While this approach has the advantage of imposing economically
meaningful no-arbitrage restrictions that guarantee consistent pricing across strikes and maturities,
they ultimately lead the researcher to sacrifice some realism for the sake of mathematical tractabil-
ity. In fact, earlier empirical research demonstrates that arbitrage opportunities in option markets
exist.5 As a result these models often have difficulty in matching the empirical behavior of option
returns (Israelov and Kelly, 2017).
The third, and most closely related, literature acknowledges the limitations of no-arbitrage op-
tion pricing models, and directly models option returns with prespecified or latent factors. Jones
(2006) estimates non-linear factor models for short-term deep out-of-the-money S&P 500 index
options, allowing for potentially latent factors that eventually manifest as volatility and jump risks
factors. Karakaya (2013) is focused on single-name equity options and proposes a three factor
model consisting of a level, slope and value factor that captures much of the variation in delta-
4
Further papers studying models with price and volatility jump specifications are Eraker et al. (2003); Eraker
(2004); Broadie et al. (2007).
5
For example, see Ofek and Richardson (2003); Ofek, Richardson and Whitelaw (2004); Constantinides, Jackwerth
and Perrakis (2009); Chambers, Foy, Liebner and Lu (2014)
7
hedged option returns. Israelov and Kelly (2017) propose an approach for estimating conditional
option return distributions using semi-parametric time series techniques. Carr and Wu (2020) de-
velop a valuation framework that attributes daily option returns to variation in the first and second
moments of the underlying price and underlying volatility. Brooks, Chance and Shafaati (2018)
employ the LASSO estimator in the context of individual equity options to uncover the character-
istics that provide independent information for the cross-section of option returns. Christoffersen,
Fournier and Jacobs (2018) investigate a factor structure in single-name equity options using prin-
cipal components of equity volatility, skews and term structures and document that these principal
components explain a sizable fraction of the cross-sectional variation in option returns. In an analy-
sis of single-name equity options Horenstein, Vasquez and Xiao (2020) use an asymptotic principal
components approach and find strong evidence for a factor structure in equity option returns that
is partly related to individual firm characteristics and partly to aggregate stock market risk.6
Our paper differs from previous literature by proposing an internally consistent factor-based
approach to modeling option returns. It tackles the challenges of estimating factor betas by lever-
aging contract characteristics as conditioning instruments. Apart from delivering a richer factor
model for returns through characteristics, our IPCA approach estimates latent factors without ex
ante knowledge of the cross-section of returns, hence eliminating the need for the researcher to
take a prior stance on the nature of the factors. And, in contrast to the prior literature, our ap-
proach models conditional factor loadings, which is critical due to the rapidly evolving risks at
the individual option level. Notably, our approach does not enforce no-arbitrage across contracts,
which allows us more flexibility in modeling the cross-section of index option returns and results
in a better fit for realized returns. Importantly, the benefits of this flexibility are not an artifact of
overfit. We show that our model continues to excel in purely out-of-sample assessments.
While IPCA confers many advantages for modeling options returns, it also has some limita-
tions. First, because factors are latent, it can be difficult to ascertain the interpretation of risks
captured by our factor estimates. Second, IPCA does not impose no-arbitrage restrictions with
6
Our paper also relates to recent papers that apply IPCA in different asset classes, including Windmüller (2021)
who studies international equities and Kelly, Palhares and Pruitt (2020a) who study corporate bonds.
8
the same stringency as more traditional option pricing models. Third, IPCA assumes that risk
exposures are linear in option characteristics. Model improvements in each of these dimensions
are interesting avenues for future work. For example, building in part on our paper, Fournier,
Jacobs and Orlowski (2021) propose a non-parametric conditional factor model of option returns
(assuming that factors are observable) to accommodate richer functional forms for risk exposures.
The paper proceeds as follows. Section 2 introduces the data and variables used in the analysis,
Section 3 recaps the instrumented principal components methodology, Section 4 summarizes our
empirical results, and Section 5 concludes.
2. Data
In this paper we focus on options on the S&P 500 index.7 Daily option data is obtained from
OptionMetrics for the period of January 1996 to December 2017. The information provided by
OptionMetrics includes contract specifications (exercise date, strike, etc.) as well as underlying
index values, historical dividend yields, and option sensitivity measures such as the BMS delta,
gamma, vega, and theta. Data for the VIX index is obtained through CBOE.
To introduce notation, each option contract i is defined by its strike price Ki and maturity date
T i (time-to-maturity is T i − t). To measure the moneyness of a contract we use the forward BMS
delta, ∆i,t
Fwd
= ∆i,t ⋅ edi,t (Ti −t)/365 , where ∆i,t is the standard BMS delta and di,t denotes the annualized
dividend yield at time t. The forward delta has the advantage that an exactly at-the-money option
(i.e., Ki = S t ) attains a forward delta of 0.5 in absolute value.
Our results focus on monthly holding period returns that are delta-hedged daily.8 Option returns
are computed for periods defined by the expiration date in a given month which usually is the third
Friday in a month, i.e. we compute monthly holding period returns from daily data over periods
starting on the next trading day after the expiration date and ending with the expiration date.
Variation in the price of the underlying is the most important driver of option returns. By delta-
7
Note, the framework employed here also lends itself to the study of single-name options in a characteristics-rich
environment. However, for the sake of simplicity and given their particular relevance to portfolio risk, in this paper,
we focus on index options.
8
In Section 4.8, we perform a robustness exercise and assess the performance of our model at daily frequency.
9
hedging the option contracts daily, we obtain the portion of the option return that is not explained
by local-linear exposure to the underlying index return. This choice is common in the academic
context (e.g., see Cao and Han, 2013). The delta-hedged profit-and-loss (P&L) for a contract with
value F over a period t = 1, . . . , T is given by
T −1 T −1 T −1
at,t+1 rt
Π[1,T ] = ∑ (Ft+1 − Ft ) − ∑ ∆t (S t+1 − S t ) − ∑ (Ft − ∆t S t ) , (3)
t=1 t=1 t=1 365
where the first term is the raw P&L, the second term captures the adjustment from delta-hedging
the position, and the last term adjusts for the cost of funding the delta-hedged portfolio at the
risk-free rate where at,t+1 is the number of days between trading dates t and t + 1.
We compute returns not against the prevailing option mid-price, but against the mid-price of
the underlying. This ensures that returns are well-behaved even in situations when the option mid-
price is close to zero as is common for deep out-of-the-money (OTM) options. We denote the
delta-hedged return against the spot price by rS∆ pot .
We apply a number of filters to the OptionMetrics data. We exclude observations in which i)
the bid price is negative, ii) the bid exceeds the ask, iii) no-arbitrage conditions are violated, or
iv) the OptionMetrics implied volatility is missing. To ensure the reliability of option price data,
we only study observations for contracts with positive open interest. Following Karakaya (2013),
we exclude observations with extreme embedded leverage by trimming data below (above) the 1st
(99th) percentile of the embedded leverage distribution, where embedded leverage is defined as
Ω = ∣∆ ⋅ S /F∣ . (4)
Finally, we restrict our sample to call options with forward delta of 0.01 to 0.5 and −0.5 to −0.01
for put options, and require time-to-maturity (TTM) of one to 12 months. Focusing on this range of
moneyness and maturity ensures the reliability of contract prices and follows the filters of (Israelov
and Kelly, 2017). Outside of this range, contracts become severely illiquid which introduces noise
in returns data. Figure 1 demonstrates how the trading activity in index options is concentrated in
our sample range. Also note that, by put-call parity, the boundaries of ±0.5 on our moneyness range
10
are not restrictive. Although, we “limit” our sample in the way described above, it encompasses
a much larger fraction of the outstanding contracts than many papers in the literature which often
limit their sample to ATM contracts. In robustness tests, we also assess the validity of our findings
for contracts with longer maturities up to two years (see Section 4.7).
To conclude the exposition of our data sample, Table 1 presents summary statistics. After
applying all aforementioned filters, we arrive at a sample of approximately 77,000 option-month
observations. Out of those observations a little more than two-thirds are put options. Since over
the course of our sample period the underlying S&P 500 index appreciates considerably OTM put
options are naturally more abundant than OTM calls. In addition, the put/call ratio for options on
the S&P 500 is commonly higher than one as market participants demand more put options for
insurance purposes.
Construction of the Implied Volatility Surface. For analysis of the Carr and Wu (2020) P&L at-
tribution in Section 4.8, we construct an implied volatility surface to obtain moments of the un-
derlying and volatility under the risk-neutral measure. We construct implied volatility surfaces
using the daily OptionMetrics volatility file that contains volatilities over a prespecified money-
ness and maturity grid. For the construction of the volatility surface we carry out a second order
smooth bivariate spline interpolation in the TTM vs. moneyness space.9 Since it is common for
implied volatilities of put and call options at the same moneyness and maturity to disagree as a
consequence of differences in price pressure, we construct three different volatility surfaces: the
first is constructed solely from observations of put options in the OptionMetrics volatility files, the
second is constructed from observations of call options, and the third is constructed using both put
and calls. In the latter case, the implied volatilities of put and call options are weighted by their
market capitalization (defined as open interest times option mid-price).
9
For details, see the SciPy function SmoothBivariateSpline, documented at [Link]
scipy/reference/generated/[Link].
11
3. Methodology
In this paper, we aim to uncover a factor structure in empirical option returns. To this end,
we use the instrumented principal components model of Kelly et al. (2019, 2020b). The model is
specified for a general excess return ri,t+1
The system is estimated over a total of N assets and T periods. The loadings, βi,t , are time-varying
and partially depend on an L × 1 vector of (option) characteristics zi,t . We assume that zi,t includes
a constant. The vector of factors, ft+1 , is dimension K × 1 where the number of factors. Following
Kelly et al. (2019), the IPCA model can be estimated by means of an alternating least squares
procedure that iterates between the first order conditions of Γ10 and ft+1 .11
There are a number of features that make IPCA ideal for analyzing the factor structure in op-
tions returns. First, observable characteristics feature centrally in the IPCA framework as they
make the estimation of factors more efficient and improve model performance. Contrary to com-
mon equity, option contracts are defined through a set of precisely measured characteristics such
as their strike, time-to-maturity, implied volatility, their location on the implied volatility surface,
and dividend yields. Thanks to the dimensionality reduction embedded in IPCA, a large number
of potential characteristics can be considered simultaneously. IPCA forms a linear mapping Γβ
between characteristics and factors that isolates the informative signal coming from characteristics
and averages out the noise. This eliminates the need for the researcher to take an ad hoc stance on
the characteristics that matter. Second, time-varying loadings βi,t that depend on characteristics zi,t
allow us to model the conditional behavior of option returns. The “identity” of an option contract
is to a large extent pinned down by its location in TTM/moneyness space. IPCA tracks the migra-
10
For brevity, we denote Γα and Γβ jointly as just Γ. The IPCA model can equivalently be stated as ri,t+1 = z′t Γ f˜t+1 +
i,t+1 where Γ = [Γβ , Γα ] is a horizontally stacked matrix and f˜t+1 = [ ft+1 , 1].
11
For the Python implementation used to carry out the estimation we refer the reader to [Link]
bkelly-lab/ipca.
12
tion of the asset in this space through the conditional betas. Third, by restricting the IPCA model
(5) such that Γα = 0, we can test whether risk compensation in option returns solely arises from
exposure to systematic factors, ft , or whether returns partially line up with characteristics directly
(i.e. Γα ≠ 0), hence constituting compensation without risk.
Asset Pricing Performance. To assess the performance of IPCA factors in pricing option returns,
we use two measures following Kelly, Pruitt and Su (2019). Similar to a common R2 goodness-of-
fit measure we compute
2
∑i,t (ri,t+1 − z′i,t (Γ̂α + Γ̂β fˆt+1 ))
R2total =1− . (6)
∑i,t ri,t+1
2
This total R2 measures how well the set of factors and loadings captures realized returns. As a
second measure, we compute
2
∑i,t (ri,t+1 − z′i,t (Γ̂α + Γ̂β λ̂))
R2pred =1− , (7)
∑i,t ri,t+1
2
where λ̂ denotes the unconditional time-series mean of the factors. This predictive R2 captures how
well differences in average returns are explained through the model’s description of conditional
expected returns, i.e. the models ability to describe risk.
In addition to studying the pricing performance for individual option contracts, it is insightful to
examine the pricing performance for portfolios. The IPCA methodology incorporates a portfolio
notion that circumvents a common problem in the asset pricing literature that is the choice of
relevant test assets. IPCA is tightly linked to the idea of characteristic-managed portfolios. Let Zt
be an N × L matrix of characteristics at time t. Then managed portfolios can be constructed via
Zt′ rt+1
xt+1 = , (8)
Nt+1
where Nt+1 is the number of outstanding options at time t + 1. The managed portfolios, xt+1 , are a
weighted average of option returns where the weights are determined by the characteristics in Zt .
Analogously to total and predictive R2 at the individual option level, we can define performance
13
measures for managed portfolios
2
∑l,t (xl,t+1 − z′l,t zl,t (Γ̂α + Γ̂β fˆt+1 ))
R2total,x =1− , (9)
∑l,t xl,t+12
2
∑l,t (xl,t+1 − z′l,t zl,t (Γ̂α + Γ̂β λ̂))
R2pred,x =1− , (10)
∑l,t xl,t+1
2
where l = 1, . . . , L.
4. Empirical Results
4.1. Data
From the data set described in Section 2, we collect a number of option characteristics and
market variables. Option-level variables include the BMS forward delta (delta), time-to-maturity
(ttm), embedded leverage (embed lev), BMS implied volatility (impvol), BMS theta (theta),
BMS gamma (gamma), BMS vega (vega), BMS volga (volga).12 In all specifications we interact
aforementioned variables with both a constant and an indicator variable that is equal one for put
options and equal zero for calls. This interaction allows for the possibility that characteristics
have different effects for calls versus puts. In order to limit the impact of outliers and to aid the
interpretation of the IPCA results, we re-scale all characteristics to the range [−0.5, 0.5]. By scaling
the characteristics in this manner, we focus on the spacing of characteristics in the cross section
which is the main determinant of differences in returns across contracts. In order to ensure similar
leverage properties of the resulting characteristic-managed portfolios (see Eq. 8) we enforce that
resulting portfolio weights sum to zero as is common for long-short portfolios studied in the asset
pricing literature.
To begin, we estimate IPCA using a baseline set of characteristics including delta, ttm,
embed lev, impvol, theta, gamma and vega, yielding a total of 15 characteristics after inter-
14
action with the put/call dummy and addition of a constant. The dependent variable is the monthly
holding-period return that is delta-hedged daily, rS∆ pot , as defined in Section 2.
Table 2 details the performance of the restricted (Γα = 0) and unrestricted (Γα ≠ 0) IPCA models
with K = 1, . . . , 5 factors. With a single factor, IPCA explains more than 72% of the observed
variation in option returns in the restricted model (Γα = 0) and around 74% in the unrestricted
model (Γα ≠ 0). When including an additional IPCA factor, the R2total increases to around 80% for
restricted and 81% for the unrestricted specification. We find that five factors are needed to explain
more than 90% of the variation in monthly delta-hedged index option returns.
IPCA’s least squares objective function directly targets the total R2 . However, the IPCA factors
do not explicitly target a high predictive R2 . We find that the monthly predictive R2 for a single
factor is 5.47% in the restricted model and 7.59% in the unrestricted model. When including
additional factors, the predictive R2 increases in the restricted model, while it decreases slightly in
the unrestricted model.13 A natural comparison for this result is a panel predictive regression of
option returns on the same set of characteristics as used in IPCA. These regressions are identical
to the estimation of IPCA with a single pre-specified constant factor. Table 3 details the results of
the panel regression. We find that characteristics on their own predict only around 3.15% of the
empirically observed variation in returns. Even adding time fixed effects, the R2 is only 6.23%.
In other words, IPCA simultaneously excels in explaining both realized option returns and future
expected option returns.
It is common practice in the empirical asset pricing literature to examine the explanatory power
of asset pricing models using portfolios such as, for example, the 5×5 Fama-French size and book-
to-market sorted portfolios as test assets.14 The IPCA framework can be approximately stated in
′
terms of managed portfolios xt = Zt−1 rt /Nt−1 (see Section 3), where Nt is the number of assets in
the cross-section at time t. This construction yields an L × 1 vector xt , where L corresponds to the
number of considered characteristics. We then compute performance measures for the managed
portfolios as test assets. We notice that the R2total is markedly higher for managed portfolios than
13
In Section 4.2.1 we put these findings in context by comparing with models from previous literature.
14
The asset pricing literature has entertained a large range of potential test assets. This is a sign for the struggle that
this literature has long faced in determining the set of assets that need to be priced in order for a model to be accepted.
15
for individual contracts. For example, using just a single factor the total R2 is 95.48% in the
unrestricted model and 94.41% in the restricted model. When moving from individual options
to managed portfolios we find that the predictive R2 increases to around 7% or 8% for both the
restricted and unrestricted models.
In Panel C of Table 2 we display p-values for the hypothesis test H0 ∶ Γα = 0 implemented
with the bootstrap procedure outlined in Kelly et al. (2019) using 1000 bootstrap draws. For
specifications with one and two factors, we reject the null hypothesis at the 90% confidence level,
suggesting that the risk space spanned by one or two factor models leaves significant option alpha.
However, for a model with three factors we find a pronounced jump in the p-value to 47.2%. This
suggests that at least three factors are needed for the IPCA model to price the cross section of index
option returns on aggregate. This is evidence that factor risk explains conditional expected option
returns when K = 3. Jones (2006) notes that mispricing is reduced by increasing the number
of factors, and we find evidence in a similar spirit: as we increase the number of factors the
discrepancy between the restricted model with Γα = 0 and the unrestricted model with Γα ≠ 0
vanishes. It is also important to point out that the bootstrap test captures the combined effects that
characteristics have on returns. Hence, it is still possible that certain subsets of contracts have some
alpha associated with characteristics. Jones (2006) finds that option alphas are on average close to
zero, but also documents significant mispricing of specific sub-groups of option contracts such as
deep OTM options. We examine mispricing of particular sub-groups of options next.
Performance in the Cross-Section of Contracts. The IPCA framework is designed to target the
aggregate R2total across all option contracts simultaneously. To understand the success of IPCA in
capturing heterogeneity, we study behavior of contracts with different risk profiles and in different
market regimes. In particular, we focus on sub-groups of contracts based on moneyness, TTM, and
market volatility. For this analysis, we focus exposition on the restricted IPCA model with K = 3
factors (though we report results for other choices of K as well). We choose this specification
based on a desire for a parsimonious model and because we fail to reject the null hypothesis of
zero alpha for K = 3.
Table 4 breaks down IPCA performance by bins sorted on moneyness, TTM, and VIX index
16
level. The results by bin are obtained by computing the total R2 only from those option-month
observations corresponding to a particular bin. Panel A of Table 4 shows that IPCA performance
is particularly strong in terms of both total and predictive R2 for ATM and slightly OTM options,
while the performance weakens to some extent for deep OTM options. By increasing the number
of factors in IPCA, the model is able to better track contracts with low absolute deltas between 0.01
and 0.1. Panel B of Table 4 reports results sorted on the option’s time-to-maturity. We find that
IPCA performs best for options with maturities greater than one month in terms of the total R2 . In
order to assess the model’s performance in different market environments (low vs. high volatility
regimes), Panel C dissects the performance by level of the CBOE VIX index. We find that the IPCA
model captures return variation across a range of market regimes, though low volatility regimes
(VIX below 10%) are somewhat more challenging for the IPCA model.
Overall, these results suggest that IPCA successfully captures observed return variation not
only in aggregate but also in specialized sub-groups of options. Due to their abundance in our
sample, ATM and OTM contracts have a relatively higher weight in the IPCA estimation and
in turn the model is especially well trained for describing these contracts. Nevertheless, option
contracts with more extreme moneyness and implied volatility are reasonably well captured in an
IPCA model using only standard option characteristics and a small number of factors. Furthermore,
we find that allowing for more than K = 3 factors improves performance in these relatively small
subsets of the sample, and indicating that additional factors serve to fit the differential behavior of
options in the more extreme bins.
17
are insufficient to price delta-hedged option returns.15 For this reason, we include two additional
factors constructed specifically from S&P 500 index options.
The first is the betting-against-beta (BAB) factor of Frazzini and Pedersen (2012) constructed
from embedded leverage, Ω, as defined in Equation (4). Specifically, BAB is constructed by sorting
options into a high- and low-embedded leverage portfolio using the cross-sectional median of
embedded leverage as the breakpoint. We weight returns in the two portfolios using the option’s
market capitalization (open interest times price). Then the self-financing, zero-beta factor return is
given by
BABt = rtL /Ωt−1
L
− rtH /Ωt−1
H
,
where ΩtL and ΩtH are the weighted average embedded leverage of the low and high embedded
leverage portfolios, respectively. We construct BAB separately for call and put options and then
average them.
The second option risk factor we include is the straddle factor of Coval and Shumway (2001)
as constructed in Frazzini and Pedersen (2012). For construction of the straddles we limit the set
of considered contracts to those with 1 month to expiry. Then, we construct straddles from all pairs
of put and call options with absolute BMS delta between 0.4 and 0.6. The straddle factor return
is obtained by weighting the delta-hedged straddle returns with the market capitalization of the
constituent options.
We add the BAB factor to the “FFC4” four factor model and refer to the resulting five factor
model as “FFCB5”. Finally, we add the straddle factor to the “FFCB5” model and refer to the
resulting model as “FFCBS6”.
To aid comparison of these observable risk factors with the IPCA latent factor model we allow
for characteristics as instruments for time-varying factor loadings similar to IPCA. Note that the
insight that dynamic betas are central to modeling option contract returns is an insight motivated by
IPCA, and without doing so the observable factor models become nearly infeasible at the contract
level and perform disastrously out-of-sample. We implement observable factor models by prespec-
15
See, e.g., Coval and Shumway (2001); Cao and Han (2013); Frazzini and Pedersen (2012); Karakaya (2013) for
factor models entertained in the earlier literature
18
ifying the factors in the IPCA setting and then estimating the loadings matrix Γβ by evaluating the
associated first order condition.
Another natural model comparison for IPCA is the static PCA model without time-varying
loadings. In order to estimate PCA for the individual option contracts where panel unbalancedness
is a challenge, we utilize an alternating least squares scheme.16
Table 5 compares the different implementations. To provide a baseline, Panel A restates the
performance for the restricted (Γα = 0) IPCA model with K = 1, . . . , 5 factors for both individual
option returns as well as characteristic-managed portfolios. Panel B reports results for the imple-
mentation of IPCA with prespecified observable risk factors and characteristics as instruments. We
find that the IPCA model with a single factor outperforms observable factor models in terms of the
total R2 for all combinations of observable factors that we examine. Adding the observable factors
constructed from option returns (models “FFCB5” and “FFCBS6”) considerably improves the ob-
servable factor model fit. The R2total increases by around 7% versus the Carhart model (“FFC4”)
when the embedded leverage factor is included and by another 17% when the straddle factor is
included, though still nearly half of the R2total from IPCA. The best performing observable factor
model, “FFCBS6,” generates a predictive R2 similar to that of IPCA.
In Panel C, we summarize the fits from running latent factor models estimated with PCA.
Similar to Kelly et al. (2019) we find that PCA is easily outperformed by IPCA when focusing
on individual assets, with the predictive R2 being negative for almost all specifications. When
running PCA at the managed portfolio level the model fit matches IPCA in terms of total R2 , while
IPCA generally produces a higher predictive R2 . This finding is plausible since at the managed
portfolio level IPCA and PCA estimate the same number of coefficients. Nevertheless, the result
demonstrates that IPCA is clearly superior in terms of tracking individual option contracts which
are challenging to track in classical factor models without time-varying loadings given their short
lifespans and swiftly changing risk attributes. Furthermore, IPCA provides the explicit modeling
link between the individual underlying assets and the managed portfolios.
16
In particular, we use the MATLAB function PCA with the flag ’Algorithm’ set to ’als’. To aid convergence we limit
the sample to contracts with at least 50% non-missing observations during their lifespan.
19
Taken together, this evidence suggests that the IPCA framework is superior to factor models
from earlier literature. This includes both observable, prespecified factors (which benefit here
from the inclusion of IPCA-like instrumented betas) and static latent factor models. IPCA better
captures time variation in individual option returns as well as differences in average returns across
assets. Comparing the IPCA and PCA models for individual contracts, we find clear evidence
that time-varying factor loadings are essential for accurately capturing the cross section of option
returns.
So far, when testing for the significance of Γα we have found that for a model with K = 3 factors
characteristics on their own do not line up with options returns. This suggests that our factors are
conditionally mean-variance efficient. We further investigate alphas in the IPCA model and its
competitors, both conditionally and unconditionally.
We test the model’s ability to price test portfolios unconditionally by running time-series re-
gressions of test portfolio returns on the set of IPCA factors. As a benchmark, we pick the
best performing observable factor model that includes the Fama-French three factors, momentum,
the straddle factor as well as the betting-against-beta factor constructed from embedded leverage
(“FFCBS6”). We study two sets of anomaly portfolios. First, the managed portfolios of IPCA im-
mediately provide us with a set of tests assets that weights assets by their characteristics. Second,
we use a set 20 of moneyness and maturity double-sorted portfolios of call and put options. We re-
leverage all portfolios to 10% annualized volatility for comparability. Note that by using portfolios
as test assets, we are giving observable factor models a reasonable chance at competing with IPCA
in terms of fit, while at the contract level observable factor models tend to perform dramatically
worse than IPCA.
Panels (a) and (b) of Figure 2 shows unconditional portfolio alphas from time series regressions
of managed portfolio returns on the set of factors. Significant alphas are denoted by filled in dia-
mond markers. We find that the average absolute portfolio alpha in the IPCA model is more than
a percentage point lower than that from the observable factor model FFCBS6. In panels (c) and
(d) we make the same comparison in terms of average conditional alphas from the conditional beta
20
version of each model (i.e., using IPCA in its intended form and allowing for instrumented betas on
observable factors). Portfolio alphas are obtained as the time-series averages of period-by-period
portfolio residuals in the conditional model. In this case we find that IPCA produces fewer signif-
icant portfolio alphas compared to FFCBS6, and in addition the average absolute alpha in IPCA
is more than two percentage points smaller. Table 9 breaks out alphas for each characteristic-
managed portfolio separately. This analysis shows that not only is the average IPCA alpha magni-
tude smaller than in the observable benchmark model, but also alphas are almost uniformly smaller
across each of the characteristic-managed portfolios. Exceptions to this are delta:put and gamma
portfolios, although even in these cases alphas are nearly the same as in the benchmark model.
We check the robustness of our previous results on the unconditional mean-variance efficiency
of our factors by using a set of 20 BMS delta/TTM double-sorted portfolios that could provide
a more challenging test case. Figure 3 shows the results of this test. IPCA still outperforms the
FFCBS6 model overall: average absolute alphas are around one percentage point lower using the
IPCA factors than the observable factors. Furthermore, when visually assessing the distribution of
alphas, we find that portfolio alphas from the FFCBS6 model exhibit a clear pattern: portfolio al-
phas increase with raw portfolio returns. This suggests a systematic shortcoming of the observable
factors in pricing the test portfolios.
In the previous analysis of mean-variance efficiency of our IPCA factors we have focused on the
model with K = 3 factors. In order to inspect the behavior of average absolute portfolio alphas for
varying number of latent factors, we now re-run the previous tests for the models with K = 1, . . . , 5
factors and compute the conditional and unconditional portfolio average absolute alphas as before.
Table 6 summarizes the results. We find that unconditional and conditional alphas decrease as we
increase the number of factors. This finding is not a guaranteed result of increasing the number
of factors. Furthermore, we find a clear drop in portfolio alphas when moving from two to three
latent factors in agreement with the evidence from our earlier bootstrap exercise, and again favors
a model with K = 3 factors.
21
4.4. Out-of-Sample Performance
So far, we have demonstrated that the IPCA model achieves a superior description of option
return variation in-sample. We now turn to analyzing IPCA’s out-of-sample fits.
We conduct a recursive estimation of IPCA in a backward-looking fashion. The model is
estimated for an expanding window starting at half the available sample length, i.e. the first forecast
is made in January 2007. The construction of out-of-sample realized factor returns, fˆt+1 , follows
Kelly et al. (2019, see Section 4.4).
We evaluate the out-of-sample fit of the restricted IPCA models with K = 1, . . . , 5 factors
using the same performance metrics introduced above, i.e. the total and predictive R2 using both
individual options as well as managed portfolios as test assets. Table 7 details the results of our
analysis. We find that the strong in-sample performance of IPCA in terms of capturing observed
return variation is also true on an out-of-sample basis. For the model with K = 3 factors that most
of our analysis focuses on, the total R2 only reduces from 85.07% in-sample to 82.98% out-of-
sample. Out-of-sample IPCA fits continue to dramatically outperform even the in-sample fits of
observable factor models such as FFCBS6. The predictive R2 for IPCA at the individual contract
level goes from 6.39% in-sample to 3.67% out-of-sample, while the predictive R2 at the portfolio
level drops from 7.90% in-sample to 3.30% out-of-sample.
Most of our analysis to this point has focused on the statistical performance of IPCA. In this
section we present results from a trading strategy that aims to optimally combine the IPCA fac-
tors in a maximum Sharpe ratio sense. To this end, we study the investment performance of the
tangency portfolio of IPCA factors. This provides a description of IPCA model performance in
economic terms. The dual of factor pricing implies that a set of factors that can price all assets in
the economy with zero alpha can also be combined to form the maximum Sharpe ratio portfolio in
the economy.
To construct the out-of-sample tangency portfolio, we recursively estimate the IPCA model us-
ing only backward looking information and, based on the estimated factors, we then run a portfolio
optimization with the Ledoit and Wolf (2004) covariance shrinkage estimator. This procedure pro-
22
duces tangency portfolio weights for each of the IPCA factors. Finally, we obtain the out-of-sample
tangency portfolio return and analyze its performance. For ease of interpretation we re-leverage
the tangency portfolio to target an annualized volatility of 10%. Further, we set the start of the
out-of-sample period such that the first portfolio optimization makes use of five-years of in-sample
data, hence leading to an out-of-sample period from January 2001 to December 2017.
We summarize the out-of-sample performance of the IPCA tangency portfolio based on K =
1, . . . , 5 factors in Table 8. The table documents performance statistics including annualized ex-
pected returns and Sharpe ratios as well as portfolio alphas versus two benchmark models. The
tangency portfolio for the IPCA model with three factors, which we focus on in many parts of our
analysis, attains a Sharpe ratio of 1.67. Furthermore, the tangency portfolio yields a highly signifi-
cant alpha of 15% annualized over and above a model consisting of the Fama-French three factors
plus the momentum factor (“FFC4”). Vis-à-vis a model that in addition includes the Frazzini and
Pedersen (2012) BAB factor and the Coval and Shumway (2001) straddle factor (“FFCBS6”) the
IPCA tangency model still has an alpha of 4% per annum, albeit insignificant. However, IPCA
models with four and five factors result in significant alphas versus the FFCBS6 model. These re-
sults show that not only is IPCA a successful model for pricing options returns, its trading strategy
performance improves over successful strategies proposed in prior literature.
Relation to the implied volatility surface. First, we study how the recovered IPCA factors relate to
different regions of the implied volatility surface. To do so, we double-sort option contracts into
portfolios for three maturity bins and five moneyness (absolute forward delta) bins each month,
yielding 15 equally-weighted portfolios. We then run regressions of these portfolio returns on
our three IPCA factors. Figure 6 plots the estimated regression betas. For Factor 1, we find a
23
clear separation of regression coefficients by maturity. Short-dated contracts up to one month
to maturity exhibit negative coefficients across different levels of moneyness, while longer dated
contracts with maturity between seven and 12 months exhibit positive coefficients. This spread
suggests that Factor 1 captures variation in the term structure slope of the volatility surface. For
Factor 2 we find two distinct patterns. The first is that longer dated contracts (maturities beyond
one month maturity) exhibit stable coefficients across different levels of moneyness, in line with
a volatility level effect. Second, for short-dated contracts (one month maturity), we find generally
increasing coefficients, with deep OTM contracts being most negatively related to Factor 2. This
second pattern is consistent with a short-dated moneyness skew factor and represents index tail risk
(or, similarly, BMS gamma risk). Patterns are similar for Factor 3, where we find coefficients that
are decreasing in absolute delta. The pattern is particularly pronounced for short-dated contracts,
while for long-dated contracts coefficients are more stable. This evidence suggests that Factor 2
and Factor 3 together are a combination of a volatility level factor and moneyness skew factor.
Relation to observable option return factors. In our next analysis, we attempt to build easily inter-
pretable observable factors that align with the estimated IPCA factors. We construct three prespec-
ified portfolios that are defined as a volatility level factor, and maturity slope factor, and moneyness
skew factor. The level factor is an equally-weighted portfolio that shorts options with absolute delta
between 0.4 and 0.5, and captures fluctuations in the overall level of the implied volatility surface.
The maturity slope factor is constructed by buying options with maturities between six and 12
months and shorting options months with one month to maturity. Therefore, this factor captures
fluctuations in option prices associated with shifts in the implied volatility term structure. Finally,
the moneyness skew factor is long OTM call options with delta between 0.1 and 0.2 and short
OTM put options with delta between -0.2 and -0.1. This factor captures variation in the implied
volatility skew, capturing shifts in the market’s expectation of tail events.
Before comparing these observable factors with our IPCA factor, we first analyze their perfor-
mance as a pricing model for individual contracts. Following the analysis of Table 5, we use the
same set of option characteristics to instrument conditional loadings in this model as we use in
the baseline IPCA specification. Among individual contracts, the observable factor model attains
24
a total R2 of 81.5% and predictive R2 of 6.2%. For the managed portfolios, the observable option
factor model attains a total R2 of 94.7% and predictive R2 of 7.8%. Comparing with the IPCA
model with K = 3 in Table 5, we find that the observable option factor model underperforms IPCA
by a small margin, but outperforms the benchmark observable factor models that we studied in
Table 5.
We compare the level, maturity slope, and moneyness skew factors to IPCA factors via mul-
tiple regression, and report results in Table 11. These observable factors explain close to 90% of
the variation in two of our three IPCA factors, and more than 70% of the variation in the third fac-
tor. Since the three observable factors are not orthogonal to each other, we follow Fournier et al.
(2021) and augment the regression evidence by computing Shapley-Owen R2 values to measure
the marginal contribution of each regressor. Shapley-Owen R2 compares the marginal increase in
model fit from adding a given regressor to all potential subsets of regression models which do not
include that regressor. The results indicate that the first IPCA factor is most closely related to the
maturity slope factor, the second IPCA factor is most closely related to the moneyness skew factor,
while the third IPCA factor is most closely related to the volatility level factor. Thus, an observable
option factor model embedding the ideas of level, maturity slope, and moneyness skew fluctuations
is useful for understanding the behavior of the IPCA model.
Factor behavior across market regimes. Our third interpretation analysis examines the condi-
tions under which each factor earns its risk compensation. We define market regimes following
Karakaya (2013), and Table 10 reports annualized factor means, standard deviations and Sharpe
ratios in each regime. Panel A collects the results for the three IPCA factors, while Panel B col-
lects the results for the level, maturity slope, and moneyness skew factors. The first and third IPCA
factors earn significant returns with Sharpe ratios of 1.04 and 1.42 respectively. Outside of volatil-
ity jump periods (period “Non Vol Jump”), we find that all three IPCA factors earn significant,
elevated returns consistent with the evidence of a volatility risk premium in delta-hedged option
returns documented in the literature. Comparing between Panels A and B we see a closer link
between the IPCA factors and the three pre-specified factors. As suggested above, IPCA Factor
1 exhibits properties of a slope factor and indeed we find that its regime-dependent risk compen-
25
sation broadly matches that of the maturity slope factor in Panel B. In particular, we find that the
risk compensation of the slope factor is related to the business cycle (“Recession” vs “Non Reces-
sion”). For IPCA Factor 2 we find a close resemblance in risk properties to the moneyness skew
factor. These share a significant negative price of risk during price jumps and negative compen-
sation during volatility jumps. Likewise, for IPCA Factor 3, we see agreement with the volatility
level factor in line with the regression evidence above.
IPCA factors, risk, and frictions. In our fourth approach to interpretation, we regress IPCA factors
on the realized return of the underlying, measures of aggregate risk, and a proxy for financial
frictions. Regressing on the underlying market return helps us understand the extent to which
option returns maintain residual exposure to the market after imperfect delta hedging (arising for
example from heavy-tailed behavior of the underlying). We also regress on realized variance and
implied variance (CBOE VIX index). As argued by Dew-Becker et al. (2021), contrasting realized
versus implied variance allows us to differentiate which factors reflect jump/gamma risk (better
captured through realized variance) versus the risk of shocks to future volatility (better captured
through implied variance). Finally, we also regress on the intermediary capital risk factor of He
et al. (2017), which measures financial frictions and thus constitute a form of limits-to-arbitrage.
Table 12 displays the results of the regression. IPCA factors relate significantly not only to the
implied volatility of option markets, but also to the dynamics of the underlying itself. The second
factor is most closely aligned with shocks to VIX, and the third factor most related to realized
variance. Most interestingly, we find that the intermediary capital risk factor is the dominant driver
of IPCA factor 1 according to the Shapley-Owen R2 ; it is responsible for half of the explanatory
power of this factor. This suggests that the empirical success of IPCA derives in part from its ability
to capture option price behavior associated with broad market frictions that are missing from more
traditional no-arbitrage option pricing models.
Which Characteristics Matter?. The IPCA model allows us to assess the statistical contribution of
individual characteristics, which provides an economic interpretation for the source of model gains.
Kelly et al. (2019) suggest assessing individual characteristic contributions by setting its associated
Γβ parameters to zero while keeping all other characteristic loadings fixed at their estimated values.
26
They also develop a bootstrap test of each characteristic’s statistical significance in the model. We
outline this procedure in Appendix A.
Table 13 summarizes the contribution of each characteristic to the IPCA model measured as
the reduction in total R2 from excluding that characteristic, holding all other model estimates fixed.
We find that the most important characteristics are implied volatility, BMS vega, and BMS gamma.
The bootstrapped p-values confirm that these characteristics are highly significantly contributors.
For comparison, Figure 4 plots the estimated factor loadings, Γβ , and confirms that the characteris-
tics identified as most important in the bootstrap analysis are also those that earn large coefficients
in the beta specification.
Next, we investigate the functional shape of the factor exposures instrumented with our set of
option characteristics. Although IPCA is a linear factor model at heart, non-linearities may be
captured via choosing a relevant set of asset characteristics. This is particularly relevant in the
context of options which have non-linear payoff profiles. To investigate this feature, we sort our
sample into 10 bins, ∆ j , by absolute option deltas and compute the unconditional factor exposures
β∆ j = z̃′j Γβ where we fix characteristics at their per-bin means, z̃ j . Figure 5 plots the unconditional
factor exposures for factors F1 to F3 against moneyness. Indeed, we find that unconditional factor
exposures exhibit a non-linear association with option moneyness.
Alternative Sets of Characteristics. While IPCA allows for a large number of characteristics to be
considered as instruments, the researcher must ultimately take a stance on the set of characteristics
in the model. In our analysis so far the set of characteristics included moneyness, time-to-maturity,
implied volatility, embedded leverage, gamma and vega, and all characteristics were interacted
with both a constant and the put/call dummy variable. In order to put our choice of characteristics
into perspective, we run two sets of analyses.
First, we compare the performance of the baseline model against a model in which the choice
of characteristics is motivated economically from an option return decomposition. One way to
27
approach the choice of characteristics problem is from the perspective of a no-arbitrage model
such as BMS. In that context, a standard option return decomposition is obtained via a simple
Taylor expansion of the option price Ft along its most relevant dimensions as follows
dFt dFt d2 F t
∆Ft = ∆S t + ∆IV t + (∆S t )2 + ... . (11)
dS t dIV t dS t 2
This can be re-arranged to yield the relative change in the option price
The terms ∆Decomp , V Decomp , and ΓDecomp can be understood as coefficients on a set of factors. Table
B.1 details the performance of an IPCA model that uses the three model-implied coefficients as
characteristics. We find that the model with characteristics following the return decomposition in
Eq. (12), achieves a slightly lower total R2 than the baseline model (e.g. for K = 3 and Γα = 0,
80.99% vs. 85.57%), while the predictive R2 of the decomposition based model is comparable to
the agnostic specification (e.g. for K = 3 and Γα = 0, 6.36% vs. 6.39%). Next, we estimate a bat-
tery of models with varying sets of characteristics to further assess the dependence of our baseline
model’s performance on the choice of characteristics. There are a number of plausible characteris-
tics that are available beyond our baseline set. Appendix Table B.3 contains an overview of IPCA
specifications using different sets of characteristics. The results demonstrate the robustness of our
main findings because the model fits do not change dramatically with small changes in the char-
acteristic set. One finding of this analysis is that higher order option Greeks such as BMS Volga
(i.e., sensitivity to changes in BMS gamma) add little in terms of total and predictive R2 . Overall,
this evidence shows that the choice of characteristics in our baseline specification stacks up well
against other plausible choices of characteristics.
Longer Maturities. Our baseline sample filters out contracts with maturities beyond one year due
to their low liquidity. We conduct a robustness analysis that includes contracts with up to two years
to maturity. We keep the same set as instruments as in our baseline specification and re-estimate
28
the IPCA model. Appendix Table B.2 presents the results. Focusing on the restricted IPCA model
with K = 3, we confirm that the total R2 is relatively similar to baseline specification (85.57% vs.
86.11%). However, we note that the predictive R2 is somewhat lower (5.41% vs. 6.39%) when
adding longer maturity contracts. This finding may be related to the illiquidity associated with
these kinds of contracts that makes precise measurement of expected returns more challenging.
Our analysis to this point focuses entirely on the monthly frequency. In this section, we estimate
the IPCA model using daily data on delta-hedged returns. This horizon is of particular interest to
market makers and traders that often only hold contracts for short periods of time and that place an
emphasis on risk managing a book of derivatives.
As a benchmark, we compare IPCA against the P&L attribution framework proposed by Carr
and Wu (2020) which is developed particularly with short horizon returns in mind. Their frame-
work attributes option returns to variation in the first and second moments of the underlying price
process and the implied volatility process, and is a formalization of the typical trader’s “rule-
of-thumb” model for managing option risk. It is obtained from a Taylor expansion of the BMS
pricing formula, taking expectations under the risk-neutral measure and enforcing a no-arbitrage
condition. A particularly neat version of their P&L attribution is obtained by scaling option gains
by cash gamma:
dF FS dS t dIVt
= + IVt2 S t τ ( − µt dt)
FS S S t FS S S t IVt
1 dS t 2
+ S t (( ) − σ2t dt)
2 St
(13)
1 dIVt 2
+ S t z+ z− (( ) − ω2t dt)
2 IVt
dS t dIVt
+ z+ S t (( ) − γt dt) ,
S t IV t
where FS denotes the BMS option delta, FS S denotes the BMS option gamma, τ = (T i − t)/365 is
the standardized time-to-maturity, and z± = (log(Ki /S t ) ± 12 IV 2t τ). Further, the moments under the
29
risk-neutral measure are
dIVt dS t 2
µt = E t [ ] /dt, σ2t = Et [( ) ] /dt,
IVt St
dIVt 2 dS t dIVt
ω2t = Et [( ) ] /dt, γt2 = Et [( , )] /dt,
IVt S t IVt
i.e. µt is the expected rate of change in the BMS implied volatility, σ2t and ω2t are the conditional
variance rate of underlying and implied volatility, respectively, and γt is the conditional covariance
between the two processes. Note that in contrast to the BMS framework itself, the Carr and Wu
(2020) framework does not require specifying the full dynamics of the underlying.
The risk-neutral moments above can be obtained from the implied volatility surface. However,
since we are not limiting the analysis to ATM options on a grid of fixed maturities as in Carr
and Wu (2020), a few generalizations are necessary. We estimate µt from the implied volatility
term-structure, but instead of using fixed maturity points to obtain the term structure slope, we
approximate the slope by taking a symmetric 60-day window around the location of a given con-
tract on the implied volatility surface. For contracts close to the edges of the surface, we shift the
window from which we compute the slope to ensure a window size of 60 days. The variance rate
ωt is estimated from a 63-day rolling window variance of daily implied volatility changes where
we ensure the availability of at least 21 observations. The covariance rate γt is estimated from
the rolling 21-day covariance between spot changes and implied volatility changes and we ensure
availability of at least 10 observations. Finally, we estimate σ2t from the ATM implied volatility at
a maturity of one month.
We measure the performance of the Carr and Wu (2020) approach individually for each of the
30
risk attributions. To this end we define the series
dF dS t
R0 = − FS
St St
dIVt
R1 = R0 − IVt2 S t FS S τ ( − µt dt)
IVt
1 dS t 2
R2 = R1 − S t FS S (( ) − σ2t dt)
2 St
1 dIVt 2
R3 = R2 − S t FS S z+ z− (( ) − ω2t dt)
2 IVt
dS t dIVt
R4 = R3 − z+ S t FS S (( ) − γt dt) .
S t IV t
We then compute the fraction of variation in delta-hedged option returns rS∆ pot explained by se-
quentially accounting for vega risk, gamma risk and so on as
Since there is no direct analogue of R2pred in this framework, we focus our analysis on the total R2 .
We summarize the results from a comparison of the Carr and Wu (2020) model and the re-
stricted IPCA frame work in Table 14. Before diving into the comparison of the two approaches,
we note that IPCA also performs well at daily frequency and delivers high total R2 across a large
range of option contracts. In analogy to our earlier exercise, we present results for K = 1, . . . , 5
factors to highlight how the IPCA performance increases as we allow for more factors to mirror
progressively hedging additional sources of risk in the Carr and Wu (2020) model. Table B.4
reports a more detailed set of results for the IPCA model at daily frequency.
With two or more factors, IPCA performs at least as well as Carr and Wu (2020) in describing
the behavior of daily option returns (Table 14, Panel A). This is further illustrated in sub-samples
of our data that isolate observations by moneyness, maturity, and the level of the VIX (Panels B, C,
and D). While the Carr and Wu (2020) attribution framework performs reasonably well for options
near the money, its comparative performance versus IPCA deteriorates for OTM options (Panel B)
and for short-dated options with one month to maturity (Panel C). Finally, Panel D demonstrates
31
that both models are able to capture option return variation over a wide range of market conditions
as proxied by the VIX index.
5. Conclusion
In this paper we study a latent factor model (IPCA) for option returns. We demonstrate that
a coherent factor-based description of option returns is possible in a model with time-varying
factor loadings. The problems typically posed by short option lifespans and rapidly changing
risk attributes for factor beta estimation is resolved by IPCA’s use of time-varying betas that are
instrumented by option contract characteristics.
We find that a low dimensional latent factor model is successful in capturing variation in option
returns and describing differences in risk across a wide range of options. It substantially outper-
forms observable factor models in terms of its ability to capture the variation in realized option
returns.
The model also provides an accurate description of the risk-return trade-off in options markets.
When using three IPCA factors, we do not reject the hypothesis of zero conditional alpha in our
data. We also find that a trading strategy designed to efficiently capture the risk-return trade-off
(as estimated from IPCA) earns an annualized Sharpe ratio as high as 1.8 and has positive alpha
versus previously proposed investment strategies using index options.
We test the robustness of our findings in an out-of-sample exercise and find that IPCA’s per-
formance remains strong, still outperforming the even in-sample results from observable factor
models. The risk factors recovered by IPCA can be interpreted as capturing fluctuations in the
level of the volatility surface, in the maturity slope, and in the short-dated moneyness skew. While
most of our analysis focuses on monthly data, we also find that the IPCA model matches the
behavior of options returns at the daily frequency across a wide range of option contracts.
Our model and findings suggest a number of promising directions for future research. One is
to extend this modeling scheme to the single name options markets. In particular, the IPCA frame-
work may present the solution to identifying a unified factor model that can accurately price all
single name options simultaneously, taking both stock characteristics and option contract attributes
32
as inputs to the beta specification. Recently, Kelly et al. (2020a) use IPCA to study the integration
of stock and bond markets. Likewise, our framework may be used to pursue a joint asset pricing
model of options and the underlying stocks.
33
Acknowledgements
For helpful comments and suggestions we thank Torben Andersen, Bjorn Eraker, Scott Murray,
Simon Schmickler, Peter van Tassel, Yinan Su, Kaushik Vasudevan, Steffen Windmüller, and con-
ference participants at the FMA Derivatives & Volatility 2019 Workshop, the WFA Annual Meet-
ing 2020, the Econometric Society World Congress 2020, and seminar participants at Warwick
University. Matthias Büchner gratefully acknowledges the support of the Centre for Endowment
Asset Management (CEAM) at Judge Business School, University of Cambridge. AQR Capital
Management is a global investment management firm, which may or may not apply similar invest-
ment techniques or methods of analysis as described herein. The views expressed here are those
of the authors and not necessarily those of AQR.
34
References
Bakshi, G., Kapadia, N., 2003. Delta-Hedged Gains and the Negative Market Volatility Risk
Premium. Review of Financial Studies 16, 527–566.
Black, F., Scholes, M., 1973. The Pricing of Options and Corporate Liabilities. Journal of Political
Economy 81, 637–654.
Broadie, M., Chernov, M., Johannes, M., 2007. Model specification and risk premia: Evidence
from futures options. Journal of Finance 62, 1453–1490. URL: [Link]
[Link]/doi/pdf/10.1111/j.1540-6261.2007.01241.x.
Brooks, R., Chance, D., Shafaati, M., 2018. The Cross-Section of Individual Equity Option Re-
turns. Unpublished working Paper. Lousiana State University.
Cao, J., Han, B., 2013. Cross-Section of Option Returns and Idiosyncratic Stock Volatility. Journal
of Financial Economics 108, 231–249.
Cao, J., Han, B., Zhan, X., Tong, Q., 2021. Option Return Predictability. The Review of Financial
Studies URL: [Link] doi:10.1093/rfs/hhab067.
Carr, P., Wu, L., 2004. Time-changed Lévy Processes and Option Pricing. Journal of Financial
Economics 71, 113–141.
Carr, P., Wu, L., 2020. Option Profit and Loss Attribution and Pricing: A New Framework. Journal
of Finance 75, 2271–2316.
Chambers, D.R., Foy, M., Liebner, J., Lu, Q., 2014. Index Option Returns: Still Puzzling. Review
of Financial Studies 27, 1915–1928.
Christoffersen, P., Fournier, M., Jacobs, K., 2018. The factor structure in equity options. Review
of Financial Studies 31, 595–637.
Constantinides, G.M., Jackwerth, J.C., Perrakis, S., 2009. Mispricing of S&P 500 Index Options.
Review of Financial Studies 22, 1247–1277.
35
Coval, J.D., Shumway, T., 2001. Expected Option Returns. Journal of Finance 56, 983–1009.
Dew-Becker, I., Giglio, S., Kelly, B., 2021. Hedging macroeconomic and financial uncertainty and
volatility. Journal of Financial Economics 142, 23–45.
Duffie, D., Pan, J., Singleton, K., 2000. Transform Analysis and Asset Pricing for Affine Jump-
Diffusions. Econometrica 68, 1343–1376.
Eraker, B., 2004. Do stock prices and volatility jump? Reconciling evidence from spot and option
prices. Journal of Finance 59, 1367–1404.
Eraker, B., Johannes, M., Polson, N., 2003. The Impact of Jumps in Volatility and Returns. Journal
of Finance 58, 1269–1300.
Fournier, M., Jacobs, K., Orlowski, P., 2021. Modeling Conditional Factor Risk Premia Implied
by Index Option Returns. Unpublished Working Paper. HEC Montreal.
Frazzini, A., Pedersen, L.H., 2012. Embedded Leverage. NBER Working Paper 18558. NBER
Working Paper Series.
Goyal, A., Saretto, A., 2009. Cross-Section of Option Returns and Volatility. Journal of Financial
Economics 94, 310–326.
He, Z., Kelly, B., Manela, A., 2017. Intermediary Asset Pricing: New Evidence from many Asset
Classes. Journal of Financial Economics 126, 1–35.
Heston, S.L., 1993. A Closed-Form Solution for Options with Stochastic Volatility with Applica-
tions to Bond and Currency Options. Review of Financial Studies 6, 327–343.
Horenstein, A.R., Vasquez, A., Xiao, X., 2020. Common Factors in Equity Option Returns. Un-
published Working Paper. University of Miami.
Israelov, R., Kelly, B., 2017. Forecasting the Distribution of Option Returns. Working Paper.
Jones, C.S., 2006. A Nonlinear Factor Analysis of S&P 500 Index Option Returns. Journal of
Finance 61, 2325–2363.
36
Karakaya, M., 2013. Characteristics and Expected Returns in Individual Equity Options. Unpub-
lished Working Paper. University of Chicago.
Kelly, B.T., Palhares, D., Pruitt, S., 2020a. Modeling Corporate Bond Returns. Unpublished
Working Paper. Yale School of Management.
Kelly, B.T., Pruitt, S., Su, Y., 2019. Characteristics Are Covariances: A Unified Model of Risk and
Return. Journal of Financial Economics, forthcoming 134.
Kelly, B.T., Pruitt, S., Su, Y., 2020b. Instrumented principal component analysis. Available at
SSRN 2983919 .
Ledoit, O., Wolf, M., 2004. Honey, I shrunk the sample covariance matrix. Journal of Portfolio
Management 30, 1–22. doi:10.3905/jpm.2004.110.
Ofek, E., Richardson, M., 2003. DotComMania: The Rise and Fall of Internet Stock Prices.
Journal of Finance 58, 1113–1137.
Ofek, E., Richardson, M., Whitelaw, R.F., 2004. Limited Arbitrage and Short Sales Restrictions:
Evidence from the Options Markets. Journal of Financial Economics 74, 305–342.
Vasquez, A., 2017. Equity Volatility Term Structures and the Cross-Section of Option Returns.
Journal of Financial and Quantitative Analysis 52, 2727–2754.
Windmüller, S., 2021. Firm Characteristics and Global Stock Returns: A Conditional Asset Pricing
Model. Unpublished Working Paper. Technische Universität München (TUM).
37
Tables & Figures
Table 1: Summary Statistics of Option Level Variables. This table provides summary statistics for the sample
of option returns used in the analysis. Panel A contains results for call option contracts, while Panel B contains
results for put option contracts. The sample period is 1996 to 2017. For option√level variables the table shows the
time-to-maturity ttm, moneyness mness computed as mness = ln(K/S )/(IV ⋅ ttm) with strike K and spot price
S , Black-Scholes implied volatility IV, option delta, option hamma, option vega, option theta, and the annualized
delta-hedged return versus spot price rS∆ pot .
38
Table 2: IPCA Performance. This table reports the IPCA fit performance as measured by R2total and R2pred using
both the restricted model (Γα = 0) and the unrestricted model (Γα ≠ 0) with K = 1, . . . , 5. The employed option
characteristics delta, ttm, embed lev, theta, impvvol, gamma, vega are each interacted with a constant and an
indicator variable that is equal one if the option is a put and is zero otherwise. Additionally, a constant is included in
the set of characteristics. In Panel A performance measures are computed with respect to individual option contracts,
while in Panel B performance measures are computed with respect to the characteristics-managed portfolios. Panel C
reports p-values for the test with null hypothesis H0 ∶ Γα = 0 from a bootstrap with 1000 draws for each time t.
No. Factors
1 2 3 4 5
Panel A: Individual Options
R2total Γα = 0 72.28% 79.65% 85.07% 88.90% 90.22%
Γα ≠ 0 74.03% 81.08% 85.57% 89.32% 90.46%
39
Table 3: Panel Regression of Option Returns on Option Characteristics. The dependent variables is the monthly
delta-hedged return versus the prevailing spot price rS∆ pot . Regression specification (1) uses a common intercept for
all observations, while specification (2) uses time-fixed effects. As regressors we include the Black-Merton-Scholes
(BMS) delta, gamma, vega, theta, time-to-maturity ttm, and the BMS implied volatility (impvol). We interact all
characteristics with a put dummy variable that has value one for put options and zero for call options. For example, the
interaction of delta and the put indicator is denoted delta:put. All predictors are standarized by their sample standard
deviation. Standard errors are clustered by option contract and in parentheses we report t-statistics.
rS∆ pot
(1) (2)
delta -0.0088 0.0560
(-0.7169) (6.4211)
ttm 0.0244 0.0327
(5.9405) (8.3471)
embed lev -2.011e-05 0.0063
(-0.0069) (2.8575)
gamma -0.0151 -0.0140
(-3.4482) (-4.1134)
vega -0.0091 -0.0254
(-1.8856) (-6.3281)
theta -0.0167 0.0118
(-2.1788) (1.8536)
impvol -0.0821 -0.1467
(-13.383) (-14.711)
delta:put 0.0635 0.0291
(3.7247) (2.6428)
ttm:put -0.0141 -0.0295
(-3.4176) (-7.2445)
embed lev:put 0.0151 0.0074
(4.4433) (2.1717)
gamma:put -0.0056 0.0080
(-0.8119) (1.7138)
vega:put -0.0015 0.0311
(-0.2155) (6.2716)
theta:put 0.0524 0.0424
(5.7418) (6.0961)
impvol:put 0.0543 0.1109
(11.800) (15.992)
Effects Time
R2 3.15% 6.23%
No. Obs. 77,090 77,090
40
Table 4: IPCA Performance by Bins of Option Delta, Maturity, and VIX. The table details the total R2 for restricted
IPCA models with K = 1, . . . , 5 by bins sorted on option characteristics. Note, the total R2 is computed using only
observations from the respective bin examined.
Panel C: VIX
Range K=1 K=2 K=3 K=4 K=5 No. Obs
0% to 10% 25.31% 63.86% 65.96% 69.91% 78.96% 2,783
10% to 20% 69.65% 77.56% 83.67% 87.63% 89.52% 47,061
20% to 30% 65.77% 73.93% 81.85% 86.66% 88.40% 20,736
30% to 90% 78.83% 84.98% 88.34% 91.47% 91.99% 6,510
41
Table 5: IPCA versus Observable Factor Models. The table compares the total and predictive R2 of the restricted
IPCA model with K = 1, . . . , 5 factors (Panel A), a number of observable factor models (Panel B), and canoncial
principal components analysis (Panel C). In Panel B we start from the Fama-French three factor model and add the
Carhart momentum factor (FFC4) the Frazzini and Pedersen (2012) ’betting against beta’ factor (FFCB5), and the
Coval and Shumway (2001) straddle factor (FFCBS6). Panel B contains the results when loadings are dynamic by
instrumenting with characteristics. In Panel C we report the result from a standard principal components analysis
(PCA) of the time-series of option returns. Given panel unbalancedness an alternating least squares algorithm is
used to estimate PCA. To aid convergence we limit the sample to those contracts with at least six valid observations
during the sample period. We report performance measures for individual options as test assets R2total / R2pred and for
characteristics managed portfolios as test assets R2total,x / R2pred,x .
Panel A: IPCA
K
1 2 3 4 5
R2tot 72.28% 79.65% 85.07% 88.90% 90.22%
R2pred 5.47% 5.54% 6.39% 6.59% 6.77%
R2tot,x 94.41% 96.64% 98.86% 99.39% 99.61%
R2pred,x 7.20% 7.44% 7.90% 7.99% 8.06%
42
Table 6: IPCA Portfolio Alphas. The table summarizes the conditional and unconditional portfolio average absolute
alphas when the factors come from the restricted (Γα = 0) IPCA model with K = 1, . . . , 5 factors. The test portfolios
are the characteristics managed portfolios. Unconditional portfolio alphas are obtained from time series regressions
of portfolio returns onto the set of factors. Conditional alphas are obtained as the time-series averages of period-by-
period portfolio residuals in the main IPCA model. The reported values are the average absolute alphas across the set
of test portfolios.
Table 7: Out-of-Sample Performance The table shows the out-of-sample performance of the restricted IPCA model
with K = 1, . . . , 5 factors. The out-of-sample exercise starts at 50% of our sample length, i.e. the first forecast is made
for January 2007. Panel A contains results for individual option contracts, while Panel B examines the out-of-sample
performance of managed portfolios.
No. Factors
1 2 3 4 5
Panel A: Individual Options
R2total 71.47% 76.18% 82.98% 86.53% 88.88%
R2pred 4.41% 3.31% 3.67% 4.34% 4.46%
Table 8: Out-of-Sample Factor Portfolio Sharpe Ratios. The table summarizes the out-of-sample performance of
the mean-variance optimal (tangency) portfolios formed from the IPCA factors. We assume a portfolio volatility target
of 10% per year and rescale the portfolio weights accordingly using only backwards looking information. The perfor-
mance measures are annualized expected return (ER), volatility, Sharpe ratio, skewness, and kurtosis. In addition, the
table shows the time series alphas versus the embedded leverage factor (BAB) and the straddle factor with t-statistics
in parentheses. We use an in-sample period of five years and then recursively expand the estimation window. The
out-of-sample period runs from January 2001 to December 2017.
43
Table 9: Managed Portfolio Alphas - IPCA vs. Observable Factors. This table details the individual managed
portfolios alphas in the conditional IPCA model with K = 3 and conditional observable factor model (FFCBS6)
instrumented with the same set of characteristics. The portfolio alphas are obtained as time-series averages of the
period-by-period model residuals. Absolute portfolio alphas with t-statistics greater than 2.0 are highlighted in bold
print.
IPCA FFCBS6
alpha t-stat alpha t-stat
delta 4.6% 5.01 6.9% 3.69
delta:put 2.4% 2.92 2.3% 1.23
ttm 0.4% 0.58 5.8% 3.11
ttm:put 1.0% 1.21 6.0% 3.25
embed lev 0.0% 0.02 3.5% 2.04
embed lev:put 0.2% 0.30 5.1% 2.74
theta 0.7% 1.21 2.7% 1.91
theta:put 3.3% 3.46 7.9% 5.36
impvol -0.9% -1.27 -3.0% -1.31
impvol:put 0.5% 0.64 0.3% 0.16
gamma -0.6% -2.42 -0.4% -0.27
gamma:put 2.2% 2.26 4.3% 3.08
vega 0.0% 0.10 -3.4% -1.91
vega:put 1.0% 1.45 -1.4% -0.70
const -0.1% -0.50 -1.8% -1.15
Avg. Abs. Alpha 1.2% 3.7%
44
Table 10: IPCA Factor Summary Statistics. This table reports summary statistics of the restricted IPCA model for the full sample period as well as
distinguished sub-periods. The sub-periods are: “Price Jump” indicated by S&P 500 return smaller than -4% in the past month, “Vol Jump” indicated by a
VIX index increase of greater than 4%, “Recession” as indicated by NBER recession periods, “Severe Bear Mkt.” indicated by S&P 500 return smaller than
-25% in the previous 12 months, “Rising Mkt.” constructed as the opposite of “Severe Bear Mkt.”, and the VIX regime (high, medium, low) constructed from
terciles of the VIX level distribution over our sample period. The IPCA model is estimated using three factors and characteristics delta, ttm, embed lev,
impvol, gamma, theta, vega. Reported measures are annualized. Significance of the Sharpe ratio is obtained from a regression of the factor return series
on a constant where the regression is using Newey-West standard errors with up to 4 lags. Sharpe ratios printed in bold font indicate an absolute t-statistic
exceeding 2.0. The sample period is from 1996 to 2017 covering 261 months.
45
Non Recession 235 0.06 0.05 1.32 0.02 0.03 0.52 0.02 0.01 1.42
High VIX 65 0.14 0.10 1.48 -0.02 0.06 -0.38 0.02 0.02 1.21
Medium VIX 130 0.04 0.04 0.94 0.02 0.03 0.85 0.02 0.01 1.49
Low VIX 66 0.02 0.02 1.22 0.02 0.01 1.39 0.02 0.01 1.96
F1 F2 F3
Level -0.10 0.32 1.13
(-1.58) (6.35) (27.16)
Maturity Slope 0.71 -0.38 0.59
(10.59) (-9.44) (12.82)
Moneyness Skew 0.53 0.58 -0.53
(8.80) (7.63) (-8.61)
R2ad j 72.3% 88.0% 89.3%
No. Obs. 261 261 261
Shapley-Owen R2
Level 9.1% 28.4% 49.4%
Maturity Slope 38.6% 26.2% 20.2%
Moneyness Skew 24.7% 33.4% 19.8%
46
Table 12: IPCA Factors and the Dynamics & Liquidity of the Underlying. This table shows the results of multi-
variate regressions of the IPCA Factors from the model with K = 3 onto liquidity proxies and measures of underlying
dynamics. The regressors are the changes in the CBOE VIX index, the changes in realized variance of the S&P 500
index, the realized return of the S&P 500 index, and the intermediary capital risk factor from He et al. (2017), The re-
gressions are using Newey-West standard errors with four lags. Both the IPCA factors and regressors are standardized
using their time series standard deviation. The bottom panel shows the the Shapley-Owen value based decomposition
of the adjusted R2 . The sample period is 2000 to 2017.
F1 F2 F3
VIX -0.17 -0.46 -0.02
(-0.88) (-3.17) (-0.22)
Realized Variance -0.23 -0.08 -0.40
(-4.19) (-0.74) (-3.27)
Realized Return -0.13 0.14 -0.18
(-1.29) (1.04) (-2.53)
Intermed. Cap. Risk -0.48 -0.08 0.25
(-2.45) (-1.07) (3.02)
R2ad j 22.1% 34.5% 19.4%
No. Obs. 215 215 215
Shapley-Owen R2
VIX 2.3% 16.2% 2.8%
Realized Variance 4.5% 6.3% 9.5%
Realized Return 4.7% 9.0% 1.4%
Intermed. Cap. Risk 10.6% 3.0% 5.8%
47
Table 13: IPCA Instrument Significance. This table contains results from the bootstrap test for individual charac-
teristics contributions to overall fit in the restricted IPCA specification with K = 3 factors. The test sets all elements
in Γβ related to a given characteristics equal to zero and compares the overall model fit against the model with fully
specified Γβ . The first column summarizes the absolute reduction in total R2 from setting the row in the matrix Γβ
pertaining to a given characteristic to zero. The second column contains p-values for the bootstrap test Wβ with 1000
draws that tests H0 ∶ Γβ = [γβ,1 , . . . , γβ,l−1 , 0K×1 , γβ,l+1 , . . . , γβ,L ] against the alternative H1 ∶ [γβ,1 . . . , γβ,L ]. The table is
sorted by the reduction in total R2 from largest to smallest.
48
Table 14: Comparison of IPCA against a No-Arbitrage Model at Daily Frequency. This table details the per-
formance of both the Carr and Wu (2020) model and the restricted IPCA model with K = 1, . . . , 5 factors at daily
frequency. The daily returns are delta-hedged. For the no-arbitrage model the total R2 is computed as follows: for a
series Ri , 1, . . . , 4 the R-Squared is computed as R2total,i = 1 − Var(Ri )/Var(R0 ) where R0 is the series of delta-hedged
daily returns. The computation of the total R2 in the IPCA model follows the usual form.
49
Figure 1: Open Interest and Volume of S&P 500 Options
This figure shows the average total open interest and average traded volume for contracts sorted on their absolute
forward BMS delta and time-to-maturity on the monthly rebalancing dates used in our study. The sample period is
1996 to 2017.
1e6
2.00 12M
24M
1.75 36M
1.50
Open Interest (Total)
1.25
1.00
0.75
0.50
0.25
0.00
(0.0, 0.1] (0.1, 0.2] (0.2, 0.3] (0.3, 0.4] (0.4, 0.5] (0.5, 0.6] (0.6, 0.7] (0.7, 0.8] (0.8, 0.9] (0.9, 1.0]
Abs. Delta
140000
12M
24M
120000 36M
100000
Volume (Avg.)
80000
60000
40000
20000
0
(0.0, 0.1](0.1, 0.2](0.2, 0.3](0.3, 0.4](0.4, 0.5](0.5, 0.6](0.6, 0.7](0.7, 0.8](0.8, 0.9](0.9, 1.0]
Abs. Delta
50
Figure 2: Test of Mean-Variance Efficieny using Managed Portfolios.
This figure plots portfolio alphas from a time series regression of portfolio returns on the set of factors from either
an observable factor model (FFCBS6) and the IPCA model with K = 3 factors. The test portfolios are the managed
portfolios constructed from the set of characteristics included in the estimated IPCA model: moneyness, maturity, im-
plied volatility, embedded leverage, gamma and vega interacted with both a constant and the put/call dummy variable.
This yields a total of 15 portfolios including the equally weighted portfolio. Portfolio alphas are plotted against the
raw portfolio returns. All portfolios are re-leveraged to yield 10% annualized volatility. Filled-in diamond markers
correspond to alphas with t-statistic greater than 2.0. Panels (a) and (b) show results from unconditional asset pricing
tests, while panels (c) and (d) show results from the conditional FFCBS6 / IPCA model, i.e. alphas are computed as
time-series averages of period-by-period portfolio residuals.
25.0% 25.0%
20.0% 20.0%
15.0%
Avg. Abs. Alpha = 5.40%
Alpha (p.a.)
Alpha (p.a.)
15.0%
Avg. Abs. Alpha = 3.92%
10.0%
10.0%
5.0%
5.0%
0.0%
0.0%
0.0% 5.0% 10.0% 15.0% 20.0% 25.0% 0.0% 5.0% 10.0% 15.0% 20.0% 25.0%
Raw Return (p.a.) Raw Return (p.a.)
25.0% 25.0%
20.0% 20.0%
15.0%
Avg. Abs. Alpha = 3.67%
Alpha (p.a.)
Alpha (p.a.)
15.0%
Avg. Abs. Alpha = 1.19%
10.0%
10.0%
5.0%
5.0%
0.0%
0.0%
0.0% 5.0% 10.0% 15.0% 20.0% 25.0% 0.0% 5.0% 10.0% 15.0% 20.0% 25.0%
Raw Return (p.a.) Raw Return (p.a.)
51
Figure 3: Test of Unconditional Mean-Variance Efficieny using Double Sorted Portfolios.
This figure plots portfolio alphas from a time series regression of portfolio returns on the set of factors from either an
observable factor model (FFCBS6) and the IPCA model with K = 3 factors. The test portfolios are constructed by
double sorting all option contracts on their time-to-maturity and option delta. Specifically, we construct portfolios of
options with 1 month, 2 months, 3 to 6 months, 7 to 12 months, and 13 to 24 months to maturity and absolute option
delta between zero and 0.1, 0.1 and 0.2, and so on up to an absolute delta of 0.5. This generates 20 double sorted
portfolios. Portfolio alphas are plotted against the raw portfolio returns. All portfolios are re-leveraged to yield 10%
annualized volatility. Filled-in diamond markers correspond to alphas with t-statistic greater than 2.0. Panels (a) and
(b) show result from unconditional asset pricing tests, while panel (c) shows results from the conditional IPCA model,
i.e. alphas are computed as time-series averages of period-by-period portfolio residuals.
17.5%
15.0% 15.0%
12.5%
10.0% Avg. Abs. Alpha = 2.46%
10.0% Avg. Abs. Alpha = 1.35%
Alpha (p.a.)
Alpha (p.a.)
7.5%
5.0%
5.0%
0.0% 2.5%
0.0%
-5.0%
-2.5%
4.0% 6.0% 8.0% 10.0% 12.0% 14.0% 16.0% 18.0% 4.0% 6.0% 8.0% 10.0% 12.0% 14.0% 16.0% 18.0%
Raw Return (p.a.) Raw Return (p.a.)
52
Figure 4: Plots of loadings Γβ
This figure shows the loadings, Γβ , that create the mapping between characteristics and factors. Each subplot corre-
sponds to one column of Γβ , i.e. one IPCA factor. The IPCA model fitted is the restricted model with K = 3. The colon
notation is used to indicate interaction with another variable, e.g. ttm:put refers to the interaction between an option’s
time-to-maturity with an indicator variable that is equal one for puts and zero for calls.
0.6
0.4
0.4
0.2
0.2
0.0 0.0
0.2 0.2
0.4 0.4
delta
delta:put
ttm
ttm:put
embed_lev
impvol
embed_lev:put
theta
theta:put
impvol:put
gamma
gamma:put
vega
vega:put
const
delta
delta:put
ttm
ttm:put
embed_lev
embed_lev:put
theta
theta:put
impvol
impvol:put
gamma
gamma:put
vega
vega:put
const
(a) Factor 1 (b) Factor 2
0.2
0.0
0.2
0.4
delta
delta:put
ttm
ttm:put
embed_lev
impvol
vega
embed_lev:put
theta
theta:put
impvol:put
gamma
gamma:put
vega:put
const
(c) Factor 3
53
Figure 5: IPCA Factor Exposure by Level of Moneyness
This figure shows the unconditional factor exposures as a function of contract moneyness measured by the absolute
value of the forward option delta. Specifically, we sort the sample of option returns into buckets on the absolute
value of the option delta and for each bucket ∆ j compute the average factor exposure β∆ j = z̃′j Γβ where z̃ j is the mean
of the characteristics in bucket ∆ j . The buckets of absolute forward delta are (0, 0.05], (0.05, 1], ..., (0.45, 5]. For
comparability, we scale the factor exposure by multiplying with the factor standard deviations.
0.001 F1
F2
F3
0.000
Factor Exposure
0.001
0.002
0.003
(0.0, 0.05] (0.1, 0.15] (0.2, 0.25] (0.3, 0.35] (0.4, 0.45]
Abs. Delta
54
Figure 6: IPCA Factor Regression Betas for Double Sorted Option Portfolios
This figure displays multiple regression betas of double sorted option portfolio returns onto the latent factors extracted
by the restricted (Γα = 0) IPCA model with K = 3 factors. The portfolios are constructed from equally weighting the
returns on all options that fall in a given bin on the portfolio formation date. The option portfolios are constructed by
double sorting all option contracts on their time-to-maturity and absolute forward option delta. The black overlayed
bars indicate 95% confidence intervals.
0.0
0.4 TTM 1M
TTM 2-6M
TTM 7-12M
0.2
0.2
0.4
0.0
Beta
Beta
0.6
0.2
0.8
0.4
TTM 1M
TTM 2-6M
1.0 TTM 7-12M
(0, 0.1] (0.1, 0.2] (0.2, 0.3] (0.3, 0.4] (0.4, 0.5] (0, 0.1] (0.1, 0.2] (0.2, 0.3] (0.3, 0.4] (0.4, 0.5]
Delta Delta
0.0
0.2
0.4
Beta
0.6
0.8
TTM 1M
TTM 2-6M
TTM 7-12M
1.0
(0, 0.1] (0.1, 0.2] (0.2, 0.3] (0.3, 0.4] (0.4, 0.5]
Delta
(c) Factor 3
55
Appendix A. Appendix: Bootstrap Test for Instrument Significance
Characteristics enter the IPCA model through the loadings matrix Γβ that contains the loadings
for specific characteristics in each of its rows. The bootstrap test essentially tests the magnitude
of the difference of a given row in Γβ from zero. We start by estimating the unrestricted model,
(i.e., the model that does not force the coefficients of a given characteristic in Γβ to zero) and
T
save the estimated model parameters for Γ̂β , { fˆt }t=1 , as well as the residuals from the fitted model
{dt }t=1 . Using the residuals we generate a bootstrap sample under the null hypothesis that the l-th
T
characteristic does not affect loadings.17 We can then compare a Wald-type test statistic of the form
′
Wβ,l = γ̂β,l γ̂β,l from the alternative model with unrestricted Γβ , to the analogous test statistic W̃β,l
b
for
the model estimated on the bootstrapped data, where the super-script b indexes the bootstrap draw.
b
Finally, p-values can be computed as the fraction of test statistics W̃β,l that exceed Wβ,l .
17
For details, we refer the reader to Kelly et al. (2019), Section 3.3.
56
Table B.1: IPCA Performance - Robustness: Characteristics This table reports the IPCA fit performance as mea-
sured by R2total and R2pred using both the restricted model (Γα = 0) and the unrestricted model (Γα ≠ 0) with K = 1, . . . , 5.
In contrast to Table 2, the model estimated in this table uses charachteristics as implied by a Taylor expansion of the
BMS model price as shown in Equation (12). The resulting characteristics are each interacted with a constant and an
indicator variable that is equal one if the option is a put and is zero otherwise. Additionally, a constant is included in
the set of characteristics. In Panel A performance measures are computed with respect to individual option contracts,
while in Panel B performance measures are computed with respect to the characteristics-managed portfolios.
No. Factors
1 2 3 4 5
Panel A: Individual Options
R2total Γα = 0 70.86% 77.44% 80.74% 83.24% 84.18%
Γα ≠ 0 71.65% 78.00% 80.99% 83.34% 84.23%
57
Table B.2: IPCA Performance - Robustness: Longer Maturities This table reports the IPCA fit performance as
measured by R2total and R2pred using both the restricted model (Γα = 0) and the unrestricted model (Γα ≠ 0) with
K = 1, . . . , 5. In contrast to Table 2, the model estimated in this table uses an expanded sample that includes options
with maturities up to two years. In Panel A performance measures are computed with respect to individual option
contracts, while in Panel B performance measures are computed with respect to the characteristics-managed portfolios.
No. Factors
1 2 3 4 5
Panel A: Individual Options
R2total Γα = 0 73.32% 81.27% 86.11% 89.26% 90.89%
Γα ≠ 0 75.54% 82.45% 86.61% 89.65% 91.16%
58
Table B.3: IPCA Performance for Varying Sets of Instruments. The table details the performance of different IPCA specifications for both the restricted
and unrestricted models where the number of factors is K = 1, . . . , 5. The performance measures include total and predictive R2 . The characteristics
and market variables included in a given specification are marked by an ”x”, while fields left empty indicate that a characteristic was not included in the
specification.
59
2 n 74.76 5.35 97.38 7.35 x x x
3 n 79.15 5.86 99.15 7.61 x x x
4 n 82.20 6.06 99.70 7.67 x x x
5 n 83.50 6.20 99.86 7.68 x x x
1 y 70.38 6.97 96.47 8.01 x x x x
2 y 76.99 6.91 97.37 8.02 x x x x
3 y 81.02 6.87 98.95 8.03 x x x x
4 y 84.63 6.84 99.61 8.03 x x x x
5 y 85.97 6.82 99.77 8.03 x x x x
1 n 68.98 5.61 95.72 7.32 x x x x
2 n 75.77 5.80 97.32 7.63 x x x x
3 n 80.73 6.67 98.84 8.01 x x x x
4 n 84.42 6.67 99.61 8.01 x x x x
5 n 85.82 6.69 99.78 8.02 x x x x
Table B.3 continued.
60
4 n 86.88 6.84 99.49 8.05 x x x x x
5 n 88.22 6.82 99.66 8.07 x x x x x
1 y 72.75 7.63 95.94 8.34 x x x x x x
2 y 79.86 7.60 96.97 8.33 x x x x x x
3 y 84.37 7.33 98.82 8.28 x x x x x x
4 y 88.20 7.34 99.43 8.28 x x x x x x
5 y 89.36 7.27 99.61 8.26 x x x x x x
1 n 71.07 5.62 94.92 7.34 x x x x x x
2 n 78.39 5.74 96.15 7.44 x x x x x x
3 n 84.07 6.37 98.79 8.01 x x x x x x
4 n 87.76 6.55 99.42 8.08 x x x x x x
5 n 89.03 6.71 99.71 8.15 x x x x x x
Table B.3 continued.
61
4 n 88.90 6.59 99.39 7.99 x x x x x x x
5 n 90.22 6.77 99.61 8.06 x x x x x x x
1 y 74.30 7.74 94.75 8.25 x x x x x x x x
2 y 81.23 7.64 96.89 8.23 x x x x x x x x
3 y 85.95 7.34 98.85 8.16 x x x x x x x x
4 y 89.58 7.31 99.36 8.14 x x x x x x x x
5 y 90.73 7.28 99.64 8.13 x x x x x x x x
1 n 72.50 5.42 93.68 7.15 x x x x x x x x
2 n 79.78 5.60 96.51 7.41 x x x x x x x x
3 n 85.28 6.20 98.83 7.79 x x x x x x x x
4 n 89.16 6.55 99.34 7.92 x x x x x x x x
5 n 90.55 6.65 99.59 7.94 x x x x x x x x
Table B.4: IPCA Performance - Daily Frequency. This table reports the performance as measured by R2total and R2pred
using both the restricted model (Γα = 0) and the unrestricted model (Γα ≠ 0) with K = 1, . . . , 5. Option characteristics
mness, ttm, embed lev, theta, iv, gamma, vega are each interacted with a constant and an indicator variable that is
equal one if the option is a put and is zero otherwise. In Panel A performance measures are computed with respect to
individual option contracts, while in Panel B performance measures are computed with respect to the characteristics
managed portfolios. Panel C reports p-values for the test Γα = 0 from a bootstrap with 1000 draws for each time t.
No. Factors
1 2 3 4 5
Panel A: Individual Options
R2total Γα = 0 69.98% 85.89% 91.53% 92.92% 93.76%
Γα ≠ 0 70.32% 86.03% 91.62% 93.02% 93.84%
62