We use extreme value theory to study idiosyncratic tail risk for a large panel of U.S. stocks. Surprisingly, calls (respectively, puts) contain important information about lower (respectively, upper) tails, and [...] high out-of-the-money call (put) prices negatively predict upper (lower) tail risks. Finally, our analysis emphasizes the importance of big data in the efficient estimation of non-linear predictive models: A linear (LASSO) model matches the performance of deep neural networks on a smaller sub-sample (S&P500 stocks), while on the extended dataset (about 6,000 stocks), the non-linearity of the neural network translates into higher out-of-sample predictability.
The dataset covers 5349 U.S. public firms from 2007-01-03 to 2020-09-28 and contains data for options bids, asks, volume, open interest, implied volatilities, as well as the price of the underlying asset .
All data are originally pulled from ORATS - Options Research & Technology Services.
In particular, we use the following ORATS resources:
- Historical options quotes and greeks
- Companies' dividends calendar
- Companies' earning calls calendar
- Companies' stock split history
To recreate the train and test sets please run:
./scripts/create_datasets.sh
Before running the script, make sure to export $DATASET a variable pointing to your ORATS data and that has the following folder structure:
$DATASET
├── raw_option_data
└── underlyings_info
├── business_days_to_maturity.csv
├── orats_dividends.csv
├── orats_earnings.csv
├── orats_price_history.csv
Where raw_option_data
is a directory that contains a CSV file for each company option quotes pulled from (1). orats_price_history.csv
is created using the stkpx field from (1) and adjust the close price using (4). Also, we compute options' monenyness using calendar business days to maturity, see create_datasets/create_bdays_to_maturity.py
. Finally, orats_dividends.csv
and orats_earnings.csv
are retrieved from (2) and (3), respectively.
(a) Firms | (b) Options | (c) Volume |
---|
The figures above describe the evolution of our dataset size and composition across time. Panel (a) shows the total number of different firms. Panel (b) shows the daily number of distinct options, while panel (c) shows the daily volume of the number of contracts for put options and call options. We smooth all results with a 12-month rolling average.
Balkema and De Haan (1974) and Pickands (1975) state that the distribution of tail risk exhibits a power law beyond a sufficiently high threshold. Formally there exists a sufficiently high threshold
for some
with a support on the whole
We define our sample of extreme returns as
where
the historical volatility stock
(b) Negative events | (b) Positive events |
---|
The figures above show the distribution of the log exceedances
To predict tail risk at the weekly horizon, two sets of predicting variables or features:
-
We define eleven distinct backward-looking
$X^{(b)}_{i,t}$ features: a) three historical moments: standard deviation, skewness, and kurtosis, which we estimate over three different rolling time windows: one week, one month and one year for a total of nine different predictors, and b) a variable containing the numbers of days until the next earning announcement, c) a variable including the number of days until the next dividend. -
Forward-looking predictors,
$X^{(f)}_{i,t},$ use options' implied volatilities. For the construction of these features, please refer to the PAPER.
The figure above shows the out-of-sample
(a) | (b) |
---|
The figures above show a comparison between our two forward-looking models, namely DNN and LASSO. In both figures, we show on the x-axis the dates, while the y-axis represents each model performance with the out of sample
The figure above shows that the large sample allows the DNN to find a non-linear pattern. We show the out-of-sample
(a) | (b) |
---|
The figures above show average predicted mean
(a) Full tail events | (b) Earnings only |
---|
The figures above show scatter plots of the average predicted mean of a negative tail event
For both negative and positive tails, we investigate the relative importance of
each feature. Following Gu et al. (2020), we compute the reduction in
(a) | (b) |
---|
The figures above show the normalized features importance (
(a) | (b) |
---|
The figures above show scatter plots of the predicted mean of a negative tail risk event
Run:
./run_full_paper.sh
First Version: February 2021. I thank Antoine Didisheim, Semyon Malamud, Simon Scheidegger, and Yuan Zhang for their helpful comments and suggestions. I also acknowledge the financial support of the Swiss National Science Foundation, Grant 100018_192692. and the Swiss Finance Institute. All errors are my own.