Microstructure and Market Dynamics in Crypto Markets
Microstructure and Market Dynamics in Crypto Markets
April, 2024
We investigate the role of market microstructure metrics in explaining and predicting price
dynamics for 5 major cryptocurrencies. Using machine learning, we show how microstructure
measures of liquidity and price discovery have predictive power for price dynamics of interest
for electronic market making, dynamic hedging strategies and volatility estimation. We identify
important own market and cross-market effects for BTC and ETH Roll measures and VPINs.
Our results are little changed during crypto winter, demonstrating a stability to these effects. Our
findings suggest that market dynamics of cryptocurrencies can be viewed as similar to those of
other investible asset classes.
The challenge confronting cryptocurrencies has long been apparent: How to make crypto an
asset class investible by retail and institutional traders? The resurgence of crypto prices suggests
that some impediments to reaching this goal are receding, at least as they apply to retail investors.
Following the January 2024 SEC approval of spot-market based Bitcoin ETFs, inflows to the new
Bitcoin ETFs reached almost $70 billion in just 2 months. 1 This remarkable growth testifies to
the appeal that being able to buy bitcoin exposure through brokerage accounts rather than via
crypto exchanges or futures markets has for retail traders. But other obstacles remain for both retail
and institutional investors. 2 Whether these difficulties can be overcome remains to be seen, but
certainly fundamental to greater participation is the ability to understand what drives the market
dynamics of cryptocurrencies.
standard tools from other asset classes. Other approaches focus on crypto-specific factors such as
underlying users or mining costs to explain crypto valuations. Yet other researchers eschew
understanding the determinants of crypto value and instead turn to more “black box” machine
learning to simply predict future prices. While these approaches yield interesting insights, accurate
crypto valuation continues to prove elusive, underscoring the difficulty of making crypto more
1
AUM numbers as of March 28, 2024 from https://cryptonews.com/news/assets-invested-in-crypto-etfs-and-etps-
rise-359-in
2024.htm#:~:text=ETFGI%2C%20an%20independent%20research%20firm%2C%20reports%20assets%20invested,
increase%20from%20%2415.12%20billion%20at%20end%20of%202023. See also “Bitcoin ETFs on Better Win
Streak than 95% of Traditional Funds,” Forbes, March 25, 2024.
2
Vanguard, for example, will not offer crypto-products arguing bitcoin is “a speculation [rather] than an
investment”. JP Morgan and Goldman Sachs have expressed similar views. Institutional roadblocks include
regulatory issues (e.g. is crypto a security?, know your customer compliance), market manipulation concerns,
valuation issues, market fragmentation, excessive volatility, and lack of liquidity. For discussion, see
blog.ionixxtech.com/top-5-challenges-institutional-investors-face-in-crypto-trading/.
In this paper, we offer a different approach to investigating crypto market behavior. Our
particular focus is on a basic question: Can standard microstructure measures predict crypto
market dynamics? Unlike valuation-based analyses, our approach is based more on understanding
the liquidity and price discovery process involved in crypto trading. Microstructure theory
provides various measures related to liquidity (for example, the Amihud measure), asymmetric
information and toxicity (Kyle lambda, VPIN), and overall spreads and auto-correlations (Roll
measure and Roll Impact measure) that have been shown to matter for liquidity and price dynamics
in other asset markets. We estimate these variables for 5 major cryptocurrencies, giving insight
into how these metrics differ from those found in more standard market settings.
We then use these measures to predict 5 outcomes of market price dynamics of particular
interest for electronic market making, dynamic hedging strategies and volatility estimation. These
outcome variables are the signs of the change in realized volatility, the change in auto-correlation
of realized returns, the change in skewness of realized returns, the change in kurtosis of realized
returns, and the change in the Jarque-Bera statistic. We follow the approach taken in Easley, Lopez
de Prado, O’Hara, and Zhang (ELOZ) (2021) of using machine learning to ask if our
Some readers might find it odd to care about crypto market liquidity and price dynamics
rather than crypto valuation per se. Two reasons to do so are paramount. First, as noted in ELOZ,
in high frequency markets how the market is structured turns out to be critical in predicting where
the market is going. The less “efficient” the market, the more predictable it is, so understanding
the efficiency of crypto trading matters. The second reason is that institutional and high frequency
turn, rely on predicting market dynamics to determine the optimal path for executing trades.
Algorithmic trading is widely used in crypto markets, where trade bots allow both crypto “whales”
and retail traders alike the ability to trade dynamically. 3 But the algorithmic strategies employed
there generally rely on common market indicators, and not on the underlying microstructure
variables that may drive the more sophisticated trade execution strategies found in more standard
asset markets. 4 The labels we focus on are inputs to those quantitative strategies, so understanding
their predictability is fundamental to attracting such traders to the crypto space. Perhaps the
simplest way to characterize our interest is that we are asking: Are crypto markets really different,
Our research design uses high frequency data from Binance (the largest crypto exchange) for
cryptocurrencies of five major blockchain platforms: Bitcoin (BTC), Ethereum (ETH), Ripple
(XRP), Solano (SOL), and Cardano (ADA) 6. We are interested in both the effects of own
microstructure variables for prediction (e.g. does the ADA Roll metric predict the sign of changes
in future ADA volatility) and the cross effects for prediction (e.g. does Bitcoin VPIN predict the
sign of changes in future Ethereum autocorrelation). Our interest in these cross-asset effects
reflects the reality that high frequency trading behavior often involves complex multi-asset
3
For a discussion of trade bots see Coinbase, “How trade bots work? where it is noted that “some common
parameters bots use include price, time frame, and order volume, while common market indicators include moving
averages (MAs), relative strength index (RSI), and more”. Available at https://www.coindesk.com/learn/what-are-
crypto-trading-bots-and-how-do-they-work/
4
There is some discussion in the crypto industry of the potential use of microstructure measures, but no systematic
evaluation of microstructure measures and their ability to predict characteristics of the distribution of returns. See for
example, https://medium.com/@kryptonlabs/vpin-the-coolest-market-metric-youve-never-heard-of-e7b3d6cbacf1
5
A related approach is taken by Kogan, Makarov, Niessner, and Schoar [2024] who investigate whether retail
trading in crypto differs from retail trading in stocks and gold. These authors find substantial differences in retail
crypto trading.
6
Each blockchain has a native token in which transactions are charged. In addition, various non-native tokens can
be deployed on a blockchain. Here we focus the study only on the native tokens, which are represented by the
tickers in parentheses.
determine which features (i.e. microstructure variables) are most important for understanding
market dynamics. Our sample period spans January 2021 – July 2023, allowing us to investigate
whether the predictability of crypto markets behaved differently during the “crypto winter” period.
First, we find surprisingly high values for the Roll Measure and VPIN in crypto markets
relative to more standard equity and futures market settings. The greater serial correlation in crypto
prices is consistent with more momentum-based trading, while the higher levels of VPIN are
indicative of greater trade toxicity arising potentially from more asymmetric information.
Second, we find strong predictability of microstructure measures for future market price
dynamics. Averaging across all currencies and variables, we find an AUC > .55, a very strong
result consistent with deviations from market efficiency. 7 Focusing on the individual features, we
find similar predictability (AUC ranging from .54 - .61) with the exception of skewness where we
find no predictability (AUC=.50). Overall, we find that predictability is driven by the Roll measure
and by VPIN. Own measures of Roll and VPIN matter for most of our predicted features, with
other microstructure features having little or no importance for own market prediction. Cross
effects are also important: BTC Roll and VPIN and ETH Roll and VPIN have strong predictive
Third, a particular concern for institutional trading methods is the stability of these market
dynamics. We use the natural experiment of “crypto winter” to investigate how these market
dynamics are affected by different market regimes. Somewhat surprisingly, we find no effects of
7
A standard metric in random forest algorithms to measure the performance of classification models is ROC, or the
receiver operating classification curve. The AUC stands for “area under the ROC curve” and essentially captures
the predictive ability of the machine learning algorithm. For more discussion of evaluation techniques for financial
machine learning see Lopez de Prado (2018).
uncovered here.
Overall, we show that the price dynamics of the crypto market respond to its market
microstructure in ways similar to that of other financial assets. That the market exhibits
inefficiency is not too surprising – crypto markets are relatively young, exhibit large volatility
relative to other asset classes and are much less regulated, all features that undermine achieving
price efficiency. 8 But even in mature markets, there can be predictability from microstructure
variables. As ELOZ [2021] showed, futures markets also exhibit inefficiency, although not to the
degree found here in the crypto markets. What is true in both market settings is that over short
horizons liquidity is predictable, and thus potentially exploitable for optimizing trading strategies.
Moreover, our results on the strong cross effects of the Roll measure, a metric capturing
autocorrelation properties, and VPIN, a measure of market toxicity, convincingly support that there
Our research relates to several streams of research directed to understanding the market
drivers of cryptocurrencies. A large literature in computer science uses machine learning to predict
future crypto prices and returns (see, for example, Koker and Koutmos (2020); Jaquart, Kopke,
and Weinhardt (2022); Cortese, Kolm, and Lindstrom (2023); Filippou, Rapach, and Thimsen
(2024)). Our analysis also relies on machine learning but differs from the black box prediction
methods typically used in that we are testing for the effects of particular model-based metrics on
market dynamics. In common with these papers, we find strong predictability, and thus
8
See also Nimalendram et al [2021] who use variance tests to show cryptocurrency inefficiency and examine its
relationship to market regulation.
9
For a discussion of factor pricing in crypto see Cong, Karolyi, Tang and Zhao (2022).
Buraschi (2018), Cong, et al (2020), Bias et al (2022) for theoretical analyses). Much of the
Erb (2020) investigates linkages between gold valuation and bitcoin valuation but concludes that
“neither gold nor bitcoin are obvious inflation hedges, stores of value, or safe havens” and so are
hard to value. 10 Liu and Tsyvinski (2018) argue that crypto has no exposure to common equity
market or macro factors or to currency or commodity markets. 11 Liu, Tsyvinski and Wu [2021]
develop a three-factor model based solely on crypto market, size and momentum to explain returns.
Cong, Karolyi, Tang and Zhao (2021) augment these factors with value and network factors,
yielding a five-factor model of crypto returns. Our analysis suggests that inter-dependencies across
crypto currencies arising from information and liquidity dynamics may be potential factors to
Another substantial body of research focuses on the trading of crypto assets (for a general
survey see Hallaburda et al (2022)). Early papers here include Easley, O’Hara, and Basu (2018),
Huberman and Lasko (2021), and Cong He, and Li (2021), who analyzed the equilibrium role of
transaction fees and miners in Bitcoin. Recent work analyzes trading across crypto exchanges,
with Makorov and Schoar (2019; 2020) finding extensive arbitrage opportunities and Crepelliere
et al (2022) documenting a decreasing trend in such opportunities. Our results are consistent with
the market inefficiency found there, with our results on its persistence during crypto winter
Makorov and Schoar (2022) who present intriguing evidence on the trading and network structure
10
See Erb (2020) pg. 15.
11
Related research investigates the linkages of crypto markets to other markets, with Iyer (2022) establishing
increased interdependence between crypto and equity markets.
and concentrated players, raising anew the issue of whether crypto currencies can appeal to a wider
investment audience. Research directly addressing this issue includes Hardle, Harvey and Ruelle
consider, the market statistics we predict using these microstructure variables and the random
forest procedure we use to generate predictions. Section III describes the data set: Binance data on
prices and volumes of trade for the leading five crypto currencies for the period January 2021 to
July 2023. Section IV provides results about predictability and the relative importance of each
microstructure measure in generating predictions. Section V offers two robustness tests: the effect
of crypto winter; and using logistic regressions rather than random forests to generate predictions.
Section VI concludes.
We ask whether five standard microstructure variables are useful in predicting various
measures of market dynamics. The specific microstructure measures considered are: Roll
measure, Roll impact measure, Kyle’s lambda, Amihud measure and VPIN (Volume synchronized
illiquidity or the presence of information-based trade. Illiquidity and information should lead to
price volatility, and ELOZ (2021) demonstrate that these microstructure measures are successful
The price dynamics we focus on are measures of changes in the distribution of realized returns.
The specific price dynamics measures we predict are: the sign of the change in the sequential
correlation, the sign of the change in the Jarque-Bera statistic, the sign of the change in realized
measures have distinct implications for trading strategies. For example, if realized volatility is
expected to increase, then increasing the speed of algo execution would be expected to reduce fill
price uncertainty. An increase in predicted serial correlation can result in greater or less price
impact depending upon the trade side being executed. As ELO [2015] show, this should change
the optimal speed of trading. 12 The Jarque-Bera statistic captures normality of returns, so its
increase signals non-normal returns, suggesting that estimates of implementation shortfall may be
too small. If skewness is expected to increase, then the distribution of returns is shifted to one
side, perhaps consistent with toxicity in order flow. An increase in kurtosis means greater weight
in the tails, an outcome that may signal a withdraw of liquidity support by market makers.
These price variables are defined using Binance data on prices and volume. For each crypto
we first create one-minute time bars; that is, we split the data into segments of length one-minute
and we record the price at the beginning of each time bar, at the end of each time bar, and the
dollar volume of trade that occurs within each time bar. 13 Let t=1,2,… index time measured in
minutes. The basic variables we consider are the ending price pt in time bar t, the return
We next compute a realization of each of our microstructure measures for each time bar t. Each
of these microstructure measures are based on some amount of past data. For any time bar t, a
microstructure measure at t is computed using data in periods t, t-1, …, t-W where W is the
12
These authors also demonstrate how incorporating microstructure variables into trading strategies can improve
upon the outcomes provided by standard trade algorithms. In particular, an algorithm based on predicted VPIN
changes and volume participation dominates a VWAP trading strategy. See Lopez de Prado et al (2020) for more
discussion.
13
We use one-minute time bars because that is the highest granularity bar data the public exchanges offer. This time
period also seems appropriate for capturing market dynamics in a high frequency setting.
example, the Amihud measure at time bar t is the average ratio of absolute returns to dollar volume
where the average is computed over the past W time bars. Note that a microstructure measure at
time bar t and one at time bar t+1 are computed using W-1 overlapping bars of market data.
Our market microstructure variables are defined from these basic variables as follows:
2 �|𝑐𝑐𝑐𝑐𝑐𝑐(𝚫𝚫𝑷𝑷𝒕𝒕 , 𝚫𝚫𝑷𝑷𝒕𝒕−𝟏𝟏 )| ,
2. The Roll impact measure---the Roll measure divided by dollar volume over a certain
period, is
�|𝑐𝑐𝑐𝑐𝑐𝑐(𝚫𝚫𝑷𝑷𝒕𝒕 ,𝚫𝚫𝑷𝑷𝒕𝒕−𝟏𝟏 )|
2 .
𝑝𝑝𝑡𝑡 𝑉𝑉𝑡𝑡
4. Kyle’s 𝜆𝜆 is
𝑝𝑝𝑡𝑡 −𝑝𝑝𝑡𝑡−𝑊𝑊
∑𝑡𝑡𝑖𝑖=𝑡𝑡−𝑊𝑊 𝑏𝑏𝑖𝑖 𝑉𝑉𝑖𝑖
,
5. VPIN is
𝑡𝑡
1 |𝑉𝑉𝑖𝑖𝑆𝑆 − 𝑉𝑉𝑖𝑖𝐵𝐵 |
� ,
𝑊𝑊 𝑉𝑉𝑖𝑖
𝑖𝑖=𝑡𝑡−𝑊𝑊+1
10
We use these microstructure measures to predict the signs of changes in various market
statistics. These “signs of change” are represented by -1 for a negative change and +1 for a positive
change. Thus, our market statistics data is a sequence of -1 and +1, one for each time bar. Similar
to the way we compute microstructure measures, these market statistics are also computed using
some number of past observations. For example, the distribution of realized returns as of time bar
t is the empirical distribution of returns over some number of past time bars. The number of past
time bars used in computing each market statistic is also W (the lookback window). Note that this
procedure implies that the sign of change in a market statistic at time bar t and at time bar t+1 also
uses W-1 overlapping observations of data. So, when we want to predict the sign of the change in
a market statistic we predict it over enough future time bars to avoid using overlapping data. That
is, we use a substantial ``look ahead’’ window. We set this look ahead window to be h=1,500 bars;
𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠(𝜎𝜎𝑡𝑡+ℎ − 𝜎𝜎𝑡𝑡 ) ,
where 𝜎𝜎𝑡𝑡 is the realized volatility of one bar returns over a look back window of size W.
𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠�𝐽𝐽𝐽𝐽(𝑟𝑟𝑡𝑡+ℎ ) − 𝐽𝐽𝐽𝐽(𝑟𝑟𝑡𝑡 )� ,
𝑛𝑛 2 1
𝐽𝐽𝐽𝐽(𝑟𝑟) = �𝑆𝑆 + (𝐶𝐶 − 3)2 � ,
6 4
11
bars.
𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠(𝑠𝑠𝑠𝑠𝑡𝑡+ℎ − 𝑠𝑠𝑠𝑠𝑡𝑡 ),
should provide information about its price dynamics. But the literature provides little guidance
about exactly what form this relationship should take. From prior research it also seems reasonable
to expect cross-asset relationships, for example BTC or ETH microstructure measures may help
predict market statistics of other cryptos, but again the literature provides no guidance about what
form this relationship should take. 14 To avoid restricting ourselves to an arbitrary structure we use
A wide variety of machine learning procedures could be used to ask about the predictive
content of our market microstructure measures (see Lopez de Prado (2018) and Kelly and Xiu
(2023) for discussion of these approaches). We chose a random forest procedure primarily to make
it possible to compare our results to previous results about the predictive content of microstructure
measures. In ELOZ (2021) we used a random forest applied to futures data to ask if these same
14
See, for example, research on crypto currency interdependence by Kukacka and Kristofex (2023) and Qureshi et
al (2020).
12
market, the answer was yes; some own market measures matter and so do some cross-market
measures. Here we ask if these same microstructure measures provide insight into crypto price
dynamics.
In the language of random forests, our “features” (the microstructure measures) are used
to predict “labels” (the market statistics). Our random forest procedure begins by building, for each
crypto currency, decision trees based on repeated cuts of the time series of labels and features into
two pieces. The division of the data is determined by a two-step process. First, for each feature we
compute the information gain (gain in homogeneity of the data) that would be obtained by splitting
the data using that feature. Second, we cut the data into two pieces (two branches of the decision
tree) by randomly selecting two features and using the feature with the largest information gain.
We repeat this cutting process until no additional cut will yield an information gain. For each
crypto currency this process yields a decision tree that predicts labels for any assignment of
features. To avoid tying our predictions to a single tree that may be too heavily influenced by
randomness in the data, we create multiple decision trees for each crypto currency. This is done
by creating 100 trees using bootstrapped samples of the data from the actual data set. The
prediction given any list of features is then the majority prediction using the collection of trees. 15
Ultimately, we are interested in the accuracy of our predictions and in which features
the importance of any predictor. This is, however, an in-sample measure and its computation is
tied to the regression analysis. In contrast, the feature importance measure Mean Decreased
15
For more discussion of this procedure see ELOZ (2021).
16
For a discussion of how best to measure accuracy in financial machine learning applications see Lopez de Prado
(2018), Chapter 8.
13
importance of any predictive variable. Mean Decreased Accuracy for a feature represents how
much prediction accuracy we lose if we compute accuracy first using that feature and then using
shuffled values of the feature. To compute MDA, we first split the data into disjoint training and
test sets. We run our random forest procedure on the training set and compute accuracy of
predictions on the test set (the definition of accuracy is given below). We then rerun the random
forest procedure on the training set after randomizing the value of a feature and compute accuracy
again. The relative loss in accuracy is defined as the MDA measure for this feature.
III. Data
The data used in this study are obtained from the Binance public database for the period
January 2021 to July 2023. Historically, there have been a large number of crypto exchanges, some
of which were relatively short-lived. As of December 2023, active centralized crypto exchanges
with daily volume greater than one-hundred million dollars numbers in the dozens, according to
the crypto analytics site https://www.coingecko.com/. Among these exchanges, Binance is the
largest as measured by daily dollar volume so we chose it for the focus our study. Because each
exchange runs their own limit order book, we cannot directly apply our current analysis to data
The format of the data is the standard time-bar candle-stick with 1-minute intervals,
including open, close, high, low price and volume aggregated from tick data within each interval.
All features and labels in this study are derived from this data. Since the number of
cryptocurrencies has grown exponentially, and is quite volatile, it is infeasible to examine all of
the tokens traded on the exchange. We focus on the top 5 cryptocurrencies as measured by their
market-capitalization in January 2021: BTC, ETH, ADA, SOL, and XRP. A deep dive into how
14
IV. Results
Table 1 provides estimates of the mean values of the market microstructure variables over
our sample period. As is standard in crypto settings, we express each currencies’ microstructure
measures with respect to the U.S. dollar as given by Tethers (USDT). As expected, all variables
have positive signs. Averaging over longer time bars tends to reduce mean values for the Roll and
Kyle metrics, but has little impact on the Amihud and VPIN measures. The Roll measure is
markedly different across the currencies, with Bitcoin having substantially higher autocorrelation.
The presence of “whales” trading large quantities algorithmically could be one reason for such a
finding. The VPIN measure is remarkably stable across currencies, but its level is surprising: ELO
(2012) found (using a slightly different methodology) average VPINs for the E-mini S&P500 and
crude oil futures of 0.22 to 0.23 whereas these are range from .45 to .47. 17 Such high toxicity is
We now ask how much predictive accuracy we can achieve from our microstructure
measures and which microstructure measures contribute to this predictive accuracy. We consider
two ways to measure predictive accuracy. One measure (Accuracy) is the number of correct
predictions of the sign of change divided by the total number of predictions. The second measure
(Area Under the Curve or AUC) can be interpreted as the probability that our fitted random forest
will rank a randomly drawn +1 higher than it ranks a randomly drawn -1. We confine our
discussion in the text to the simple Accuracy measure as the two measures yield nearly identical
17
They are not as high as the highest VPIN levels of approximately 0.8 that ELO (2012) found for the E-mini during
the time period of the flash crash.
15
we divide the sample into two periods and ask about the effects of Crypto Winter.
For each crypto currency we use the random forest procedure described in Section II applied
to 25 features: the five microstructure measures for each of the five crypto currencies. So, we
include both the effect of own microstructure measures and cross-currency microstructure
measures. Random guessing should lead to predictive accuracy of approximately 0.5 as we are
predicting whether labels are positive (+1) or negative (-1). Anything above 0.5 suggests that our
features have some power in predicting our labels. We do not attempt to maximize prediction
accuracy. For our purposes, it is enough to demonstrate that our microstructure measures
contribute to substantial prediction accuracy here as they do in more established financial markets.
Previous research (ELOZ 2020) on futures markets using similar features to predict similar labels
We first examine average (averaged over the five crypto currencies) predictive accuracy.
The most important question is whether we can make useful predictions at all. A secondary
question is how the level of predictive accuracy compares to that found in other financial markets.
Note that we are predicting changes in market statistics one day ahead and we make predictions
every minute. So even small amounts of accuracy (above 0.5) can be valuable. Average (across all
cryptos in our sample and all labels) predictive accuracy is provided in Table 2. Depending on the
number of bars used as a lookback window in constructing our market microstructure measures,
If we exclude skewness, predictive accuracy for our entire sample period ranges (across
accuracy measures, the number of bars used and labels) from 0.52 to 0.58. These results are
18
Results for both accuracy measures are provided in the tables.
19
The results for AUC are also given in Table 1 and they range from 0.53 to 0.54.
16
statistic), 5(Kurtosis), and 6 (Realized Volatility). However, as the last two columns of Table 7
show, skewness is not predictable; for skewness we find predictive accuracy of 0.5 meaning that
for the crypto currencies we examine our microstructure measures are not at all useful in predicting
market statistics for skewness. This is similar to results for futures where skewness is the least
predictable label.
Predictive accuracy is greatest for the sign of the change in realized volatility. For both of
accuracy measures, the average accuracy of prediction ranges from 0.56 to 0.58 depending on the
number of bars used. This is a remarkably high accuracy level for random forest predictions in
The predictive accuracy results indicate that the crypto market exhibits inefficiencies. When
order flow is imbalanced, causing positive correlation in price changes and an increased Roll
measure, volatility increases and it remains high through our look-ahead window of 1,500 bars or
roughly a day. The ability to predict changes in realized volatility from microstructure measures
intended to measure imbalances or correlations in order flow suggests that there is trend following
We find it interesting that the level of predictive accuracy is not very different from that
found in futures markets using a similar approach to prediction. Both market settings are electronic
and trade almost continuously over a 24-hour day 20, but they do differ in that futures are well-
established markets widely used by institutional traders whereas crypto trading is more nascent
and dominated by crypto “natives”. Our results suggest that the market dynamics of crypto
CME, one of the leading futures markets offer products that trade up to 23 hours a day Monday through Friday.
20
17
contribute to prediction accuracy. Figures 1 through 4 provide average MDA scores (averaged
over our five crypto currencies) for prediction of the change in realized volatility (Figure 1), the
change in Kurtosis (Figure 2), the change of the JB statistic (Figure 3) and the change in auto-
correlation (Figure 4). 21 These results are derived from prediction of each market statistic for each
crypto currency using only that currencies own microstructure measures. These results are
remarkably consistent across the various market statistics. For prediction of each market statistic,
the Roll measure is the most important feature as measured by MDA. VPIN is the second most
important measure and the Roll impact measure is third most important. It is reasonable to expect
that microstructure measures for other crypto currencies could be useful in predicting market
statistics for a specific crypto. In particular, as BTC and ETH are the leading cryptos it could be
that trade in these cryptos leads trade, and thus changes in market statistics, for other cryptos.
Figures 5 through 9 provide the results for this analysis. Each figure provides MDA scores for one
of our price dynamics labels for each of our five crypto currencies. For example, Panel 1 of Figure
5 provides MDA scores for the 25 microstructure measures for the sign of the change in realized
volatility for ADA. Examining this panel of Figure 5 shows that ADA’s own Roll measure is the
most important feature. The next three most important features for ADA are, in order, ADA’s own
VPIN measure ,the Roll measure for BTC, and the Roll measure for ETH. and
Figures 5 through 9 tell a compelling story. In almost every case, each crypto’s own Roll
measure is the most important feature for predicting price dynamics. It is reassuring that some
21
We do not provide results for Skewness as there is no predictive accuracy for it.
18
those in futures markets. In futures markets, the Amihud measure and VPIN were the most
important own measures. In our crypto sample, own VPIN shows up frequently as an important
feature; and only occasionally do the own Roll impact measure or the Amihud measure have
importance. The own Kyle measure typically has a low, nearly-zero MDA score.
The cross-market result for Bitcoin and Ethereum reveal particularly interesting dynamics.
The BTC Roll measure and BTC VPIN are the most important features in driving Bitcoin price
dynamics, with the Ethereum Roll measure also playing an important but lesser role. However, the
other crypto currency cross-measures have virtually no influence. Similarly, price dynamics for
Ethereum are driven by the ETH Roll measure and the ETH VPIN with the Bitcoin Roll measure
playing again playing an important but lesser role. As is the case for Bitcoin, the other crypto
currency cross-measures have virtually no influence. Across all other cryptos and labels we study,
Roll measures for BTC and ETH have strong predictability signified by high MDA scores. All
other cross-crypto features typically have very low MDA scores. This suggests that trade in BTC
and ETH leads price changes and volatility in other cryptos; not a surprising result, but one that is
consistent with the notion that trade in other cryptos follows trade in these two large cryptos.
Although the level of predictability we find suggests that there are inefficiencies in the
crypto markets, the fact that market microstructure measures are important for price dynamics
demonstrates that these markets have much in common with more standard financial asset markets.
Our random forest analysis does not imply causation flowing from our features to our labels. But
it does suggest that trading tools based on own Roll and VPIN as well as cross BTC and ETH Roll
19
using the beginning of crypto winter as a date to split our sample. Second, we ask if a logistic
Prices and trading volumes of crypto currencies changed dramatically over our sample
period. Approximately the first half of our sample falls in a “boom” period for crypto currencies
with rising prices and increasing high trading volumes. During this period, overall daily trading
volume reached a peak of $158.64B on April 10, 2021, and remained high over much of 2021.
Emblematic of crypto prices, during this period the price of Bitcoin reached a high of $67,617 on
November 2021, where after prices began a steady decline, reaching a low of $15,742 in October
2022. 22 This latter period is generally referred to as “crypto winter”. A natural question is whether
our predictability results differ in the period before crypto winter and after it began. We selected
November 10, 2021 as the date to use in breaking our sample into two pieces as the total crypto
We reran our analysis separately on these before and after periods. Ex ante it is not obvious
whether predictability should increase or decrease as a result of crypto winter or if there should be
changes which microstructures contribute most to them. Crypto winter was a time of great
uncertainty in the value of crypto and this should make predicting more difficult. But the market
was also likely less efficient and this might make predicting easier.
Table 8 provides our results. Although valuations changed dramatically between these two
sub-periods, predictability is nearly unchanged and the importance of our various microstructure
measures is also unchanged. These results suggest that although valuations changed, the structure
22
Along with depressed prices, crypto winter also featured several spectacular failures of crypto currencies and
exchanges. For discussion of these aspects of crypto winter see Arner, Zetsche, Buckley, and Kirkwood (2023).
20
We regard this as good news for Crypto markets as it implies that trading tools based on these
microstructure measures should be robust to the extreme volatility of Crypto currency markets.
Furthermore, in Tables 9 and 10 we present the values calculated for before and after
November 2021 for the microstructure features used in our analysis. It is worth noting that out of
the five market microstructure variables that we consider, Roll measure, Amihud measure and
Kyle’s λ are proportional to the scale of price while Roll Impact and VPIN are not. As such,
because the price of most crypto tokens dropped significantly after the peak in November 2021,
there is a more pronounced change in the Roll measure, the Amihud measure and Kyle’s λ. On the
other hand, the values of Roll Impact and VPIN are comparably more stable before and after
November 2021.
An alternative to using a random forest to predict our binary labels is to use a logistic
regression. The logistic regression uses a liner regression to model the log odds of the two labels. 23
The aggregated accuracy and MDA results for the logistic regression are provided in Figure 10.
Overall, the results are similar to those obtained via a random forest. There is predictive accuracy
for all labels other than skewness. The ranking of features by MDA is unchanged, although the
actual MDA scores for the most important features, the Roll measure and VPIN, are increased
relative to the scores for the less useful features. This is reassuring as these results suggest that our
ability to predict and our ranking of the importance of various microstructure measures is not a
23
The specific approach we use is discussed in ELOZ (2022), Section 4.6.
21
The degree of predictability for market dynamics in leading crypto currencies indicates that
there are some inefficiencies in the crypto markets, and it’s reasonable to suspect that these
inefficiencies would be even greater for less well established cryptos and might differ across
exchanges. However, at least for trades of these five leading cryptos in Binance, the amount of
predictability is not particularly different from what we found in futures markets. And this
predictability remained almost unchanged during the crypto winter period, suggesting that market
dynamics on the largest crypto exchange exhibit substantial stationarity. Specifically, using
standard microstructure metrics, we find non-trivial prediction accuracy and AUC scores for
multiple return statistics measures such as volatility and auto-correlation. We also highlight the
Perhaps the most intriguing result is that the market microstructure measures we find to be
important for price dynamics in cryptos are similar to those that matter for prediction in futures
markets. This similarity suggests that these crypto markets have much in common with liquidity
and price dynamics in more standard financial asset markets. Our random forest analysis does not
imply causation flowing from our features to our labels. But it does suggest that trading tools based
on own Roll and VPIN as well as cross BTC and ETH Roll measures can be valuable. For
institutional and high frequency traders, whose trading relies on sophisticated algorithmic and
participation. Whether that tips the balance to crypto becoming an asset class in its own right
remains to be seen.
22
Ang, Andrew and Morris, Tom and Savi, Raffaele, 2022, Asset Allocation with Crypto:
Biais, Bruno and Bisiere, Christophe and Bouvard, Matthieu and Casamatta, Catherine and
Menkveld, Albert J., 2023, Equilibrium Bitcoin Pricing, Journal of Finance, 78(2) 967-
1014
Cong, Lin and He, Zhiguo and Li, Jiasun, 2021, Decentralized Mining in Centralized Pools,
Cong, Lin W. and Karolyi, George A. and Tang, Ke and Zhao, Weiyi, 2022, Value Premium,
Cong, Lin and Li, Ye and Wang, Neng, 2020, Tokenomics: Dynamic Adoption and Valuation
Cortese, Federico and Kolm, Petter N. and Lindstrom, Erik, 2023, What Drives Cryptocurrency
Crépellière, Tommy and Pelster, Matthias and Zeisberger, Stefan, 2022, Arbitrage in the Market
24
Easley, D., M. Lopez de Prado, and M. O’Hara, ELO, 2015, “Optimal Execution Horizon,”
Easley, D., M. Lopez de Prado, M. O’Hara, and Z. Zhang ELOZ, 2021. Microstructure in the
Easley, D., M. O’Hara, and S. Basu, 2019, From Mining to Markets: The Evolution of Bitcoin
Erb, Claude B., 2020, Bitcoin is Exactly Like Gold Except When it Isn't, Available at
Filippou, Ilias and Rapach, David and Thimsen, Christoffer, 2024, Cryptocurrency Return
https://ssrn.com/abstract=3914414 or http://dx.doi.org/10.2139/ssrn.3914414
Halaburda, Hanna and Haeringer, Guillaume and Gans, Joshua and Gandal, Neil, 2022 The
Hardle, W., C. Harvey and R. Reule, 2020, Understanding Cryptocurrencies, Journal of Financial
Harvey, C., T.A. Zeid, T. Draaisma, M. Luk, H., Neville, A. Ryzm, and O. Van Hemert, An
Economic Analysis of the Bitcoin Payment System,” Review of Economic Studies, 88 (6),
3011-3040;
25
Jaquart, P., S. Kopke, and C. Weinhardt, 2022, Machine learning for cryptocurrency market
prediction and trading, Journal of Finance and Data Science, 8, (2022) 331-352.
Kelly, Bryan T. and Xiu, Dacheng, 2023, Financial Machine Learning, Available at
SSRN: https://ssrn.com/abstract=4501707
Kogan, S., I. Makarov, M. Niessner, and A. Schoar, 2024Are Cryptos Different? Evidence
Koker, T. and D. Koutmos, Cryptocurrency trading using machine learning, 2020, J. Risk
Kukacka, Jiri and Kristoufek, Ladislav, 2023, Fundamental and Speculative Components of
Liu, Yukun, Tsyvinski, Aleh, and Wu, Xi, 2022, Common Risk Factors in Cryptocurrency,
Lopez de Prado, M., 2018, Advances in Financial Machine Learning, (Wiley; New York).
Makarov, Igor and Schoar, Antoinette, 2020, Trading and Arbitrage in Cryptocurrency Markets,
Makarov, Igor and Schoar, Antoinette, 2020, Blockchain Analysis of the Bitcoin Market
26
SSRN: https://ssrn.com/abstract=3818818
Qureshi, S., M. Aftab, E. Bouri, and T. Saeed, 2020, Dynamic interdependence of crypto
markets: An analysis across time and frequency, PhysicsA: Statistical Mechanics and its
Applications, 559(1).
27
Both accuracy measures are aggregated across all 5 cryptocurrencies, across the entire test
period, and across all 5 labels. The results provided in the table are the average values.
28
29
Table 8: Aggregated accuracy and AUC before and after Nov 2021.
Period Window Aggregated Accuracy Aggregated AUC
2021.1 – 2021.11 50 bars 0.537937 0.537763
100 bars 0.530761 0.530467
2021.11 – 2023.7 50 bars 0.53951 0.539162
100 bars 0.530105 0.52948
2021.1 – 2023.7 50 bars 0.538089 0.538134
100 bars 0.530436 0.530428
Both accuracy measures are aggregated across all 5 cryptocurrencies, and across all 5 labels. The
results in the table are the averages.
30
31
32
Figure 1: Average MDA own feature importance for change of realized volatility.
MDA feature importance using only own features for change of realized volatility. The
prediction window is 50 bars. Results are aggregated across all five cryptocurrencies.
33
MDA feature importance using only own features for the change of Kurtosis. The
prediction window is 50 bars. Results are aggregated across all five cryptocurrencies
MDA feature importance using only own features for the change in the JB statistic. The
prediction window is 50 bars. Results are aggregated across all five cryptocurrencies.
34
MDA feature importance using only own features for the change in return
autocorrelation. The prediction window is 50 bars. Results are aggregated across all five
cryptocurrencies.
35
36
37
38
39
40
41
42
43
44
45
46