Testing several approaches for implied volatility modeling and forecasting.
Comparative empirical study with four traded options.
Non-parametric higher-order spline is better than parametric stochastic volatility inspired (SVI) in China.
Our results imply that even though popular on Wall Street, SVI seems not to be utilized by traders and market-makers in China.
Traders may consider higher-order spline as a better method for implied volatility modeling and forecasting.
Propose to model and forecast implied volatility via the fifth-order spline interpolation as a first; initiates studies of the empirical performance of SVI and the fifth-order spline models in implied volatility modeling and forecasting.
1. Introduction
Implied volatility, which primarily relies on the Black-Scholes-Merton (BSM) model (Black and Scholes, 1973; Merton, 1973), has both theoretical and practical implications. Theoretically, the study of implied volatility can lead to better pricing models for options and a better understanding of market information contents (Jiang and Tian, 2005) and completeness (Almeida and Freire, 2022). In practice, implied volatility estimating is vital in options trading for pricing and hedging (i.e. risk management) (Muzzioli, 2010). Market makers, who facilitate the smooth functioning of options markets, quote implied volatilities in trading and utilize them in hedging risks of short option positions (Hull, 2015).
Implied volatility has been studied extensively for the US options market (Breeden and Litzenberger, 1978; Gatheral, 2006; Figlewski, 2010; Derman and Miller, 2016). In contrast, such research for China’s options market, which has a relatively short history but is developing rapidly (Jiao et al., 2021), is just getting started. For modeling implied volatility, both industry-oriented and academia-oriented approaches exist (Homescu, 2011). There seems to be a disconnection between industry and academia, however, because the two groups seem to focus on different methods without too many overlaps. For these reasons, we propose to investigate the empirical performance of both practical-oriented and academic-oriented approaches in China’s options market.
Among many approaches to implied volatility modeling, our study chooses two archetypal representatives: the practical-oriented stochastic volatility inspired (SVI) model and the academically popular 5th-order spline interpolation. Originated at Merrill Lynch in 1999 and popular on Wall Street, SVI was initially publicized by Gatheral (2004). Subsequent developments are summarized in a seminar paper by Gatheral and Jacquier (2014). The SVI model, providing a functional form, is parametric [1]. However, most research on SVI focuses primarily on the arbitrage-free parametrization of the surface SVI (SSVI) (Zhao and Hodges, 2013; Guo et al., 2016; Corbetta et al., 2019; Martini and Mingone, 2022, 2023) and recently the extended SSVI (eSSVI) (Hendriks and Martini, 2019; Mingone, 2022), while overlooking the empirical performance of SVI in implied volatility modeling and forecasting.
In contrast, Figlewski (2010) proposes a nonparametric approach, the 5th-order spline interpolation. This spline approach, leading to smooth volatility curves, is used primarily to extract empirical risk-neutral densities. Later, it is employed to study empirical pricing kernels by Linn et al. (2018) and Jiao et al. (2021). In addition to extracting risk-neutral densities, we propose that the 5th-order spline interpolation can also be used to model and forecast implied volatilities, which seems to be overlooked by previous studies [2]. With a smooth volatility curve, we can compute both in-sample and out-of-sample pricing errors for options (a.k.a. implied volatility modeling and forecasting).
Based on the reasons given above, we initiate the study of the empirical performance of implied volatility modeling and forecasting. For our empirical analysis, we choose China’s options market [3]. Given that traders and market makers in China are relatively inexperienced and unsophisticated, it is an open question whether SVI, even though well-established and popular on Wall Street, is employed in practice. Therefore, we believe it is interesting to learn whether SVI is suitable for China, where the options market is emerging as a significant rival to the US options market. Further, it can be fascinating to compare SVI (industrial and parametric) with the 5th-order spline (academic and nonparametric).
Methodologically, we make several innovations. First, we propose to utilize risk-neutral tests to rule out possible arbitrage in the spline-smoothed volatility curves. We focus on curves instead of surfaces in this initial study because, arguably, it is more important for market makers to avoid being arbitraged for price quotes of options with the same maturity but different strike prices. Second and more importantly, we propose to gauge volatility modeling by in-sample pricing errors and volatility forecasting by out-of-sample pricing errors. It is worth noting that price-based errors, being model-independent, are better than model-dependent (implied) volatility-based errors. Innovatively, we use only arbitrage-free data for assessing pricing errors. It is meaningless, we argue, to compare pricing performance if prices in the dataset admit arbitrage. Finally, we propose a volatility arbitrage strategy and use the resultant P&L as an additional measure to evaluate volatility modeling. The arbitrage trade employs three out-of-the-money (OTM) calls and three OTM puts closest to at-the-money (ATM) and utilizes the Derman-Miller arbitrage (2016).
As a first, our paper carries out a comprehensive empirical study of implied volatility modeling and forecasting for China’s new market. We choose four representative options in China for the empirical study (for our reasons, see Section 3.1.1). They are China 50 ETF options (stock portfolio as underlying, European), CSI 300 index options (stock index with known cash dividends as underlying, European), soybean meal futures options (futures as underlying, American), and copper futures options (futures as underlying, European). We include daily data for 2021 and use only out-of-money options. To assess volatility modeling, we use in-sample pricing errors and P&Ls of a volatility arbitrage strategy (see Section 2.4 later). We find that with lower pricing errors and smaller arbitrage P&Ls, the 5th-order spline appears to be a better model for options traded in China. Next, we gauge volatility forecasting by out-of-sample pricing performance. In addition to the 5th-order spline, we forecast volatility by employing linear interpolation on implied variance and interpolating cubic spline on implied volatility. Empirically, we find that the three splines outperform SVI for all four options. Our results imply that even though popular on Wall Street, SVI seems not to be utilized by traders and market-makers in China.
Our study makes four contributions to the derivatives literature [4]. First, we propose to model and forecast implied volatility via the 5th-order spline interpolation, which has been used primarily to extract empirical risk-neutral densities (Figlewski, 2010). Note that implied volatility modeling was studied previously in Gatheral (2004) and Gatheral and Jacquier (2014), for example. Second, our paper initiates studies of the empirical performance of SVI and the 5th-order spline models in implied volatility modeling and forecasting for the options market in China. Previous studies mainly focus on the arbitrage-free parametrization of the surface SVI (Zhao and Hodges, 2013; Guo et al., 2016; Corbetta et al., 2019; Hendriks and Martini, 2019; Nagy and Ormos, 2019; Mingone, 2022; Martini and Mingone, 2022, 2023). Third, to measure the performance of implied volatility modeling, we propose to use in-sample pricing errors and volatility arbitrage P&Ls. Note that volatility errors are found in Koopman et al. (2005), Ederington and Guan (2005), Gospodinov et al. (2006), Becker (2007), Seo and Kim (2015), and Jeon et al. (2020). Fourth, we suggest assessing the performance of implied volatility forecasting by analyzing out-of-sample pricing errors. Most current literature relies on in-sample pricing errors, which may perform poorly in gauging the forecasting of implied volatility. Recently, Liu et al. (2021) and Vrontos et al. (2021) used machine learning to study the forecasting of implied volatility in the US; however, their out-of-sample pricing result is limited.
The remainder of the paper is structured as follows. Section 2 introduces the paper’s methodology, including the surface SVI model, the 5th-order spline model, two interpolating spline methods, and a volatility arbitrage strategy. Data and computational details are presented in Section 3. In Section 4, we discuss the empirical results, including in-sample errors, volatility arbitrage P&Ls, and out-of-sample errors. Finally, Section 5 concludes the paper with remarks.
2. Methodology
In this section, we briefly reintroduce two representative approaches to volatility modeling. The stochastic volatility inspired (SVI) model, which dates back to Merrill Lynch in 1999, is widely used on Wall Street (Gatheral and Jacquier, 2014). In academic studies, the 5th-degree spline smoothing of implied volatility curves is proposed (Figlewski, 2010) and utilized to construct the empirical risk-neutral density (Linn et al., 2018; Jiao et al., 2021).
2.1 SVI
According to Gatheral and Jacquier (2014), many arbitrage-free SVI surface parametrizations are possible. Our unreported exploratory study shows that the raw SVI and SVI jump-wings (SVI-JW) parameterizations are somewhat unstable numerically. Therefore, we choose to utilize the surface SVI in this paper.
SVI parametrization We reintroduce the surface SVI by directly following Gatheral and Jacquier (2014). First, the total implied variance is defined in Definition 4.1 of Gatheral and Jacquier (2014) as follows:
where is the log forward moneyness (with strike and forward price for stocks or futures price for futures ), is the at-the-money (ATM) [5] total implied variance [6], is the BSM implied volatility (with maturity ). Note that .
Further, Gatheral and Jacquier (2014) provide several functional forms for [7]. Our exploratory study shows that the following power-law function in Example 4.2 of Gatheral and Jacquier (2014),
(with and ) and the Heston-like function in their Example 4.1 are both robust. In this paper, we choose the above power-law function in our implementation.
Finally, Theorem 4.2 of Gatheral and Jacquier (2014) provides the following conditions for the absence of butterfly (convex) arbitrage in the surface SVI model:
This paper focuses on volatility curves instead of the surface, so we do not try to eliminate calendar spread arbitrages on the surface.
SVI calibration Given a group of BSM implied total variances, we interpolate linearly to obtain the at-the-money and calibrate the power-law parameter set by minimizing the following objective function:
where indexes the given moneyness of the implied total variances.
We propose a straightforward minimization scheme here. With a set of random guesses for as initial values, we minimize the above objective function via the Nelder-Mead simplex algorithm. Then, we apply the parameter constraints, , , and , and no butterfly arbitrage conditions (3–4). If an optimal parameter set satisfies both the parameter constraints and no-arbitrage conditions, it is deemed acceptable and kept. Following Jiao et al. (2021), to mitigate the trap of local minima, we generate 100 sets of random initial parameter values [8], carry about the minimization, check constraints and no-arbitrage conditions, and take the optimal set with the lowest value for the objective function to be our calibrating result.
2.2 The spline model
Figlewski (2010) proposes to use the 5th-order spline interpolation of implied volatility to recover the risk-neutral density from market option prices. This approach is preferred because, in addition to leading to smooth volatility curves, it can also yield continuous and smooth risk-neutral densities. This approach was later adopted by Linn et al. (2018) and adapted by Jiao et al. (2021) [9]. Our reintroduction follows Jiao et al. (2021).
The 5th-order spline Given a group of BSM implied volatilities, we employ the 5th-order spline interpolation to smooth the implied volatility curve on moneyness [10]. We place one knot at-the-money (i.e. moneyness of one) with boundary knots at the lowest and highest moneyness observed [11].
We briefly reintroduce the spline function for the convenience of readers. The fitted volatility function is as follows:
where is moneyness (i.e. strike price divided by stock price), is the jth B-spline basis function, and is the number of basis functions. The coefficients are obtained by the following minimization:
where is the market moneyness of ith traded options. In our case, .
Risk-neutral tests With the smoothed volatility curve, we compute finely-spaced BSM European call prices on the curve. Further, we numerically estimate the risk-neutral densities within the observed moneyness range, using Equation (4) of Jiao et al., (2021):
where is the risk-free rate, is the BSM call price, and m is the moneyness. Following Jiao et al. (2021), we set to be 0.0001.
If any of the estimated turns out to be negative, a 5th-order spline interpolation without knots is carried out again. Finally, the smoothed volatility curve is accepted if no negative densities exist (Figlewski, 2010; Linn et al., 2018; Jiao et al., 2021). For CSI 300 index options, ten curves are discarded out of 1,362 curves. For copper futures options, only one curve is removed out of 596 curves.
SVI is a parametric volatility model, while the spline-interpolated volatility curve is nonparametric. Therefore, the spline model can also be used to arbitrage volatility. When forecasting volatilities outside the observed moneyness range, we set the volatility to be the nearest observed volatility.
2.3 Interpolating splines
The 5th-order spline model, which does not require any observed data points to be on the spline curve, may perform poorly in forecasting volatilities. For forecasting, we include two other interpolating splines, which go through all the observed data points, as benchmarks. The simplest is undoubtedly the linear interpolation on implied variance (i.e. the squared implied volatility) [12]. Further, interpolating cubic spline on implied volatility is also widely used [13].
By definition, the in-sample pricing error of interpolating splines is zero. As a result, we only need to consider out-of-sample pricing errors for interpolating splines. Again, in forecasting volatilities outside the observed moneyness range, we set the volatility to be the nearest observed volatility.
The four approaches introduced above have pros and cons. With Formula (1), SVI is trivial to compute for any given moneyness and can be made arbitrage-free. For the 5th-order spline, it is not easy to estimate variance for moneyness outside the observed moneyness range. Finally, the two other interpolating splines can be used to forecast, but not model, volatilities.
2.4 Volatility-arbitrage strategy
Academic studies usually focus on volatility forecasting or out-of-sample pricing. In industry, traders may be more interested in volatility modeling-based arbitrage opportunities to make money. A simple volatility-arbitrage strategy is suggested by Derman and Miller (2016). If an option’s BSM implied volatility is higher (lower) than the model volatility, one could sell (buy) the option and hedge with the underlying at the model volatility. If the abnormal market volatility reverts to the mean (model), the trade is closed to make a profit.
In this paper, we propose a strategy utilizing only liquid options. For each trading day, we select three OTM calls and three OTM puts closest to at-the-money. Among these six options, we choose the option having the largest discrepancy between the BSM implied and model volatilities and apply the Derman-Miller arbitrage [14]. The arbitrage position is held for one trading day and then liquidated.
We compute the one-day P&L of this volatility-arbitrage strategy as is commonly done in the literature (Hull, 2015; Liu et al., 2022). Ignoring the time value of money, we can write the arbitrage P&L as follows:
where is 1 (−1) for longing (shorting) options, is the market option price on day 0, is the option delta computed with the model volatility on day 0, is the underlying price on day , is the maturity of the option on day 0, and equals one trading day. For simplicity, we assume that short sales are allowed and there are no transaction costs.
3. Empirical studies
3.1 Data
3.1.1 Data source
At the beginning of our research in 2021, the Chinese market traded 24 options. This paper chooses four representative options from four exchanges for the empirical studies [15]. They are European options on China 50 ETF (50ETF options), European CSI 300 stock index option, American soybean meal futures option, and European copper futures options. The 50ETF option from Shanghai Stock Exchange is the first exchange-traded option in China, while the CSI 300 index option from China Financial Futures Exchange is the first stock index option in China. The soybean meal futures option from Dalian Commodity Exchange is China’s first commodity futures option. Finally, the copper futures option from Shanghai Futures Exchange is China’s first European commodity futures option. Appendix provides additional details about these options.
The data in our paper are obtained from the WIND database. For consistency, we utilize daily data for 2021 for China 50 ETF options, CSI 300 index options, and soybean meal futures options. The sample interval for the copper futures options is from November 2020 to October 2021 however, because this option became American-style as of November 2021. The underlying closing prices [16] are also from WIND.
Note that the daily data provides only closing prices for options. Fortunately, WIND also provides daily bid-ask mid-price implied volatilities for options. With the implied volatility, we can recover the option’s mid-price. In our empirical study, we choose the option prices with fewer cases of butterfly arbitrage. As a result, the mid-prices for 50ETF and CSI 300 index options are utilized, while the closing prices for the two commodities futures options are employed.
3.1.2 Recovering mid-price
The bid-and-ask mid-price is commonly utilized in empirical studies of traded options (Bakshi et al., 1997; Barone-Adesi et al., 2008). Even though WIND does not provide bid or ask prices, it does the mid-price implied volatility. Therefore, to recover the mid-price, we need to reverse the computation of WIND.
There are three issues with the approach of WIND. First, WIND defines the days to maturity as , where is the expiry date, and is the trading date, which unfortunately counts the days to maturity by one more day. Second, WIND uses the yield to maturity of a one-year (Chinese) treasury bond as the risk-free rate, which ignores the different maturities of traded options. Third, For the CSI 300 index, WIND sets the dividend yield to zero. This may be incorrect because the components of the CSI 300 index pay cash dividends.
There are two more details to note. WIND prices American options via binomial trees with the Cox-Ross-Rubinstein parameterization (Cox et al., 1979) and 100 steps. Further, WIND sets the dividend yield to zero for China 50 ETF. We follow the WIND approach to compute the mid-price from the implied volatility in WIND.
3.1.3 Implied volatility
With either the closing prices or the recovered mid-prices of options, we can now estimate “better” implied volatilities and use them in our empirical study.
Unlike WIND, we correctly define the days to maturity as , where is the expiry date, and is the valuation date. Further, Following Jiao et al. (2021), we use Shibor rates as risk-free interest rates. Shibor rates are converted to the continuously-compounded zero-coupon rate. Then, for any given maturity, the risk-free rate is obtained by linear interpolating the converted continuously-compounded zero-coupon rate.
European options are priced via BSM formulas. Following Jiao et al. (2021), we do not consider dividends in pricing 50ETF options. According to Liu and Qiao (2017), the CSI 300 index is more appropriately treated as an investment asset with known cash dividends because companies in China pay dividends predominantly between May and August. As a result, we utilized the modified BSM formulas (2-3) from Guo and Liu (2019) to price CSI 300 index options. Please also see Appendix for in-depth information. For American options, we compute prices via binomial trees with the Trigeorgis (1991) parameterization, which is numerically better than the Cox-Ross-Rubinstein parameterization, and 500 steps.
3.1.4 Data filters
Following roughly common practice (Bakshi et al., 1997; Jiao et al., 2021), we filter the options data using the following seven criteria. First, delete options with zero volatility from WIND. Second, delete options with prices that are less than ten times the minimum tick of the option [17]. Third, delete options with a trading volume of 100 for 50ETF options and zero for the other three options. Fourth, delete options with less than ten days to maturity. Fifth, delete at-the-money and in-the-money options because typically, only out-of-the-money (OTM) options are used in SVI. Sixth, remove a trading day if there is any butterfly (or convex) arbitrage among the OTM calls or OTM puts [18]. Seventh, keep a trading day if there are at least six puts and at least one call, at least six calls and at least one put, or at least three calls and at least three puts after the previous six filters. Finally, note that for 50ETF options, we do not include options that are adjusted for dividends, as is done in Jiao et al. (2021).
We filter the mid-prices and closing prices separately. With mid-price, we end up with 7,333 50ETF options, 23,797 CSI 300 index options, 3,017 soybean meal futures options, and 5,004 copper futures options. With the closing price, we obtain 7,295 50ETF options, 17,563 CSI 300 index options, 14,074 soybean meal futures options, and 11,442 copper futures options. Arguably, the more options, the better for empirical studies. Therefore, we estimate implied volatilities for 50ETF and CSI 300 index options from mid-prices but copper and soybean meal futures options from closing prices.
For 50ETF and CSI 300 index options, a call auction session usually sets the closing price in the last three minutes of each trading day. Therefore, it stands to reason that traders can place any bid or ask price without worrying about being arbitraged. Consequently, one can expect more arbitrage opportunities in the closing prices, which seems to be confirmed by the results reported in the previous paragraph.
3.2 Computed variables
Following Bakshi et al. (1997), Barone-Adesi et al. (2008), and Liu et al. (2022), we categorize options by moneyness and maturity. Moneyness is defined as the ratio of the strike price to the underlying price. Moneyness between 0.85 and 1 for puts (between 1 and 1.15 for calls) is grouped into OTM, while moneyness below 0.85 for puts (above 1.15 for calls) does into deep OTM. Maturity is grouped by calendar days to maturity: 10–90 (short), 91–180 (medium), and 181 and up (long) for soybean meal futures options, or 10–90 (short) and 91 and up (long) for the three other options.
3.2.1 Implied volatility
Tables 1–4 present the four options’ descriptive statistics on price, implied volatility, moneyness, and maturity. Let’s discuss these tables one by one. Table 1 shows 3,361 50ETF puts and 3,972 50ETF calls. Overall, option prices are between 0.001 and 0.2742, implied volatilities range from 12.97% to 48.06%, minimum (maximum) moneyness is 0.7478 (1.3942), and maturities go from 12 to 237. Most puts and calls (3,034 and 2,982, respectively) belong to the OTM group. For the same moneyness groups, there are more options within the short maturity group. Noticeably, deep OTM-short maturity options have higher average implied volatilities; the average implied volatilities show smiles for both long and short maturities, but the curvature of the smile for short maturity is more pronounced. Similar patterns are reported for S&P 500 index option in Barone-Adesi et al. (2008).
For CSI 300 index options, there are 14,251 and puts 9,546 calls (Table 2). The option mid-prices, implied volatilities, and maturities range from 2 to 519.8, 9.98% to 56.58%, and 10 to 361, respectively. The minimum (maximum) moneyness is 0.5539 (1.3470). The average implied volatilities also show smiles.
Soybean meal futures options consist of 6,597 filtered puts and 7,477 filtered calls (Table 3). The closing prices, implied volatilities, and maturities vary widely between 5 and 221.5, 7.97% and 70.98%, and 10 and 324, respectively. The minimum (maximum) is 0.6570 (1.3140). Similar to Table 1, there are way more OTM calls and puts. The average implied volatilities again show smiles, and the smiles flatten with longer maturities.
Table 4 shows 7,682 puts and 3,760 calls for copper futures options. The closing prices, implied volatilities, and maturities change from 10 to 4,878, 4.66% to 94.10%, and 10 to 323, respectively. The minimum (maximum) moneyness is 0.5296 (1.2616). Again, there are more OTM puts and calls. The average implied volatilities once again show smiles.
3.2.2 SVI parameters
For each trading day and each maturity, we try to search for a set of surface SVI parameters; If no parameter set is found, we give up searching for an SVI model curve. The optimal surface SVI parameter set is then converted to the SVI-JW parameter set via Lemma 4.1 of Gatheral and Jacquier (2014):
These five parameters can be interpreted as follows. is the observed ATM variance, is the ATM skew, is the slope of the put (left) wing of the curve, is the slope of the call (right) wing of the curve, and is the minimum variance of the curve.
We average the SVI-JW parameters across trading days within maturity groups and summarize the resultant means and standard deviations in Table 5. Panel A of Table 5 shows the five parameters for 50ETF options. Only 84.8% of the trading-expiry days’ combinations in the long-maturity group yield optimal parameters. The mean minimum variances are very close to the mean ATM variances, implying that the parameter is small. The average skewnesses are negative, so the minimum variance corresponds to or is located on the right of the ATM variance. Further, the mean skewness for the long-maturity group is bigger absolutely, implying a steeper curve for the put wing.
Similar findings can be seen for CSI 300 index options (Panel B of Table 5), but the put slopes are higher than those in Panel A, implying stepper left wings. It is puzzling that only roughly one-third (37.1%) of the trading-expiry days’ combinations in the long-maturity group yield optimal parameters, which may be worth further investigation in another research.
Next, soybean meal futures options seem to differ from 50ETF and CSI 300 index options (Panel C of Table 5). The average skewnesses are positive, so the minimum variance corresponds to or is located on the left of the ATM variance. Further, the call slopes are bigger, implying steeper curves for the right wing. Finally, the long-maturity group for copper futures options is similar to soybean meal futures options, while the two wings for the short-maturity group are somewhat symmetrical.
4. Empirical results
We empirically test the performance of implied volatility modeling via in-sample pricing errors and the volatility arbitrage P&L. For testing volatility forecasting, we utilize out-of-sample pricing errors. The computational details are presented, and the results are discussed in this section.
4.1 Implied volatility modeling
4.1.1 In-sample pricing errors
In the empirical study of the SVI model, we only include options on days on which an optimal SVI parameter set is available (see Table 5). Following Bakshi et al. (1997) and Barone-Adesi et al. (2008), we propose to use three measures to assess pricing errors: the mean pricing error (MPE), the mean absolute pricing error (MAE), and the root mean square error (RMSE). It is worth noting that price-based errors, which are widely used in empirical studies of options, provide straightforward magnitudes for gauging pricing errors. Volatility-based errors, however, are challenging to interpret. More importantly, if we view the BSM model as merely a mapping tool, the BSM implied volatility depends on BSM, while an option price does not. Therefore, price-based errors are arguably better measures than volatility-based errors.
The in-sample pricing errors are shown in Tables 6–9 for the four kinds of options. For 50ETF options, three observations can be made (Table 6). First, except for deep OTM call - Long, spline has much smaller errors in all three measures. For MAE, the largest error is only 7.50% (deep OTM call - Short) for spline but 20.46% (deep OTM call - Short) for SVI. Second, SVI is particularly bad for the short-maturity groups. Third, the spline is more robust for pricing more options.
CSI 300 index options (Table 7) are quite different from 50ETF options. First, the MAE errors are larger overall for CSI 300 index options. The smallest errors are 9.68% (spline) and 13.97% (SVI), respectively. Second, the errors for the two maturity groups are roughly the same. Again, spline outperforms SVI in six out of eight categories. Further, spline prices much more options than SVI. Interestingly, SVI is more robust for put options, while spline is more robust for call options.
Table 8 presents the in-sample pricing errors for soybean meal futures options. The MAE errors are roughly the same as those of 50ETF options, with the largest errors at 13.17% (SVI) and 8.78% (spline), respectively. Amazingly, spline outperforms SVI in all twelve categories.
Finally, SVI shows the largest MAE, 44.49% (deep OTM put - Short), for copper futures options (Table 9). MAE errors are larger for the short-maturity groups. Finally, spline outperforms SVI in all eight categories.
We can see the difference between SVI and spline more clearly in Figure 1. As expected, SVI curves show nice “smiles.” On the other hand, the spline follows the data points more closely but can go only as far as the observed data. In all four graphs for the four kinds of options, spline does fit the market implied volatilities better. In accordance with previous discussions, SVI fits the observed implied volatilities rather poorly for CSI 300 index options.
In summary, spline outperforms SVI in in-sample pricing [19]. This implies that spline fits the market-implied volatilities better in most cases. Further, we can speculate that spline should lead to small arbitrage profits, which we will discuss next.
4.1.2 Arbitrage P&L
Because the prices of the four kinds of options vary widely, we follow Liu et al. (2022) and focus on Column 4 of Table 10, which are the average results of dividing the day-1 arbitrage P&L, , by the day-0 option price, (see Equation 4). For the long-maturity group of 50ETF options (Panel A), neither SVI nor spline makes money because the absolute P&Ls are less than 1%. The spline model is marginally profitable for the short-maturity group, however. CSI 300 index options seem similar to 50ETF options (Panel B).
In contrast, the SVI models with soybean meal futures options for all three maturity groups are substantially profitable and more profitable than the spline model (Panels C). For cooper futures options, the SVI model for the long-maturity groups is the most profitable, with a 16.23% profit. These results are robust if we use eight options that are closest to at-the-money to arbitrage.
In summary, the SVI model is significantly profitable in four out of nine groups, which is roughly consistent with the results of in-sample pricing errors. In theory, though, if a model leads to significant arbitrage P&L, the model is highly suspicious because the arbitrage P&L may be purely an artifact of the model. To conclude, the 5th-order spline is a better model for options traded in China.
4.2 Volatility forecasting
We use implied volatility (variance) curves to determine (i.e. forecast) the volatility (variance) for an option with the same expiry day that is traded on the immediate next trading day. Again, we work only with filtered options; i.e. we require that both the trading day for modeling and the trading day for forecasting are in our filtered dataset. As the data filtering shows, a high percentage of convex arbitrage exists in the price data. Arguably, we believe a model may be worse and more difficult to justify if it yields smaller errors for prices admitting arbitrage.
As was discussed in Section 2, we employ four approaches for forecasting volatility. For SVI, we can compute volatility via Equation (1) for any given log forward moneyness () of an option. For the 5th-order spline, we estimate volatility using Equation (3). For linear interpolation on implied variance and interpolating cubic spline on implied volatility, we interpolate volatility for any given moneyness of an option from those spline curves. Using such forecasted volatility, we can price options out-of-sample via the BSM formulas (for European) or binomial tree (for American).
Tables 11–14 summarize the out-of-sample errors using the same categories and three error measures as for in-sample pricing. Let’s again start with 50ETF options (Table 11) and focus on MAE. As is expected, the out-of-sample errors are larger than the corresponding in-sample errors for both SVI and 5th-order spline. Even though it outperforms SVI in all eight categories, spline underperforms the two interpolating splines. Overall, interpolating cubic spline turns out the best, with the largest MAE at 16.68% (deep OTM put - Short).
For CSI 300 index options (Table 12), the 5th-order spline outperforms SVI in six out of eight categories, with larger errors than in-sample. Again, both interpolating splines outperform the 5th-order spline; interpolating cubic spline, with the largest MAE of 15.86% (deep OTM call - Short), is slightly better than linear interpolation.
Table 13 shows the out-of-sample pricing errors for soybean meal futures options. In nine out of twelve categories, the 5th-order spline outperforms SVI. The performance of the two interpolating splines is mixed, however. Linear interpolation outperforms interpolating cubic spline in seven out of twelve categories and does SVI in ten out of twelve categories. The largest MAE errors are 16.57% (deep OTM put - Long) and 17.64% (deep OTM put - medium) for linear interpolation and interpolating cubic spline, respectively.
Finally, errors for copper futures options are summarized in Table 14. Clearly, the 5th-order spline outperforms SVI significantly in every category. The performance of the 5th-order spline, linear interpolation, and interpolating cubic spline, however, are hardly distinguishable. Finally, the largest MAE errors are over 30% and seem unacceptable.
Graphs can give us certain visual insight (Figure 2). SVI can significantly deviate from the observed implied volatilities, as is evident from the diagrams for China’s 50 ETF options, CSI 300 index options, and soybean meal futures options. Further, the three splines differ little for copper futures options.
In summary, we can draw two empirical conclusions. First, in-sample pricing appears to be an excellent indicator of out-of-sample pricing. Second, the three splines outperform SVI for all four kinds of options. This implies that even though SVI is quite popular on Wall Street, it was not employed by traders and market-makers in China.
The explanations for our conclusions are as follows [20]. First, compared to the 5th-order spline, SVI faces greater difficulty in parameter optimization, is less robust, and can be numerically unstable. As we write in Section 2.1, “to mitigate the trap of local minima, we generate 100 sets of random initial parameter values” in the calibration of SVI parameters. Further, we report in Section 3.2.2 that “[For CSI 300 index options], only roughly one-third (37.1%) of the trading-expiry days’ combinations in the long-maturity group yield optimal parameters.” Worse, for all four kinds of options, SVI prices fewer options in-sample (Tables 6–9) and out-of-sample (Tables 11–14) and hedges fewer options (Table 10). In extreme cases, only 1,067 (2,827) OTM puts with long maturity are priced by SVI (spline) (Table 12). Second, spline is more straightforward to understand and simpler to implement. In fact, Wind provides cubic-spline interpolated implied volatilities so that traders can use them directly. Consequently, it is reasonable to argue that if traders and market-makers had employed spline for pricing, trading, and hedging, the markets would have been more closely aligned with spline, resulting in more favorable empirical evidence in historical data in China for spline.
5. Conclusions
This paper initiates studies of the empirical performance of implied volatility modeling and forecasting. As two representative approaches, we choose the Wall Street parametric SVI model and propose utilizing the nonparametric 5th-order spline model for comparison. The empirical research includes four representative options from four exchanges in China: China 50 ETF European options, CSI 300 index European options, soybean meal futures American options, and copper futures European options.
We include the most recent whole-year daily data in 2021 and use only out-of-money options. With lower pricing errors and smaller volatility arbitrage P&Ls, the 5th-order spline is found to be a better model for the four options traded in China. In addition to the 5th-order spline, we employ linear interpolation on implied variance and interpolating cubic spline on implied volatility to forecast volatility. Empirically, we find that the three splines outperform SVI for all four options.
Further research can be carried out in at least two aspects in the future. The current research focuses on the SVI and spline curve without considering the time dimension or the volatility surface. Studying the performance of the whole volatility surface in China can be worthwhile. In addition, it could be an exciting and challenging project to extend the spline model to the surface.
We thank Ye Du for bringing SVI to our attention. All three authors contributed equally to this paper; their last names are in random order. This work was supported by the National Natural Science Foundation of China under Grant numbers 72371208, 71701171, 72073109, and 72071162, and by the Liberal Arts and Social Sciences Foundation of the Chinese Ministry of Education under Grant number 21XJC790003.
Notes
In addition to SVI, parametric models include polynomial (Dumas et al., 1998) and Vanna-Volga models (Castagna and Mercurio, 2007).
Nonparametric models include interpolation (Kahale, 2004; Lee, 2004), kernel density smoothing (Aït-Sahalia and Lo, 1998; Rosenberg, 2000; Cont and Fonseca, 2002), spline smoothing (Fengler, 2009), and local polynomial smoothing method (Benko et al., 2007).
Of course, the US options market is interesting to study. But unfortunately, we do not have access to US options data.
We thank Reviewer No. 2 for making the exposition of our contributions better.
Note that this definition of “at-the-money” is slightly different from moneyness of one later.
The square of BSM implied volatility will be termed implied variance.
We drop the subscript for convenience.
The initial values of {ρ,η,γ} are uniformly drawing from
{[−0.999,0.999],[0.001,100.0],[0.001,0.999]}.
Figlewski (2010) and Jiao et al. (2021) use the term “4th-order spline” unconventionally. They actually mean 5th-order spline, which is a more commonly used term.
We define moneyness as the ratio of the strike price to the underlying price.
We utilize the Matlab function spap2 with k (spline order) being equal to 5.
Arguably, it makes more sense to interpolate linearly on variance, because variance is additive.
Wind database utilizes linear and cubic spline interpolations for implied volatility modeling.
Ye Du suggested this arbitrage as a practical trading strategy.
With trading volumes of 629 (China 50 ETF option), 67 (CSI 300 option), and 38 (soybean meal futures option) million contracts within the sample periods, three chosen options out of four are most actively traded.
For the two futures options, our unreported results show that using the futures settlement prices in the empirical analyses does not change the conclusions.
The minimum ticks are 0.0001 (50ETF options), 0.2 (CSI 300 index options), 0.5 (soybean meal futures options), and 1 (copper futures options), respectively.
Following Jiao et al. (2021), arbitrage means that , where is the market price of an option with strike , is the minimum tick of the option, and .
Using 50ETF as an example, we show in the Appendix that Spline outperforms SVI in the dynamic (delta) hedge of options.
We thank Reviewer No. 1 for the suggestion of explaining our results.
Twenty-one, Twenty-six, Twenty-five, and Twenty-eight component stocks are replaced on June 15, 2020, December 14, 2020, June 15, 2021, and December 13, 2021, respectively.
References
Appendix
Contracts information for four options
China 50 ETF options European options. The minimum tick is 0.0001. The contract size is 10,000 units of 50 ETFs. The contract trades for the current month, the next month, and the following two march-cycle months. The contract expires on the fourth Wednesday of the contract month. The trading hours are between 9:30 a.m. and 11:30 a.m. and between 13:00 and 15:00 (Beijing time). The opening call auction is from 9:15 to 9:25 a.m., and the closing call auction is from 14:57 to 15:00.
CSI 300 index options European options. The minimum tick is 0.2 points, with a multiplier of 100 Chinese yuan. The contract trades for the current month, the next two months, and the following three march-cycle months. The contract expires on the third Friday of the contract month. The trading hours and call auctions are the same as those of China 50 ETF options.
Soybean meal futures options American options. The minimum tick is 0.5 Chinese yuan per ton. The contract size is 10 tons. Contract months are January, March, May, July, August, September, November, and December. The contract expires on the 5th trading day of the month before the delivery month of the futures contract. The trading hours are from 9:30 a.m. to 11:30 a.m. and from 13:00 to 15:00 (Beijing time).
Copper futures options European options before November 16, 2021. The minimum tick was one yuan per ton. The contract size was 5 tons. The contract was traded for all twelve months. The contract expired on the 5th last trading day of the month prior to the delivery month of the futures contract. The trading hours were the same as those of soybean meal futures options.
Dividends of the CSI 300 stock index
Chinese firms predominately pay one dividend per year between May and August (Table A1). Consequently, it makes more sense to treat the CSI 300 index as an investment asset with “known dollar dividends” in pricing derivatives on the CSI 300 index (Liu and Qiao, 2017).
In practice, traders “know” the cash dividend for a firm by using the prior-year dividend payment and ex-dividend date. We follow the same logic here. For any trading day, we obtain the prior-year dividend payment and ex-dividend date for a component stock of the current CSI 300 index from WIND. For pricing index options in 2021, we have to include all component stocks of the CSI 300 index in 2020 and 2021. Note that the CSI 300 index is adjusted twice a year [21].
Following Bos and Vandermark (2002) and Guo and Liu (2019), we partition a dividend payment into two parts, and . Similar to Liu and Qiao (2017), we estimate the two parts as follows:
where t is the valuation date, is the day-t closing point of the CSI 300 index, is the closing price of the kth stock in the CSI 300 index, is the number of shares used for computing the index for stock k (available from Wind), is the ith prior-year cash dividend for stock k with the ex-dividend date , T is the expiry date of an option, is the day-t term-T risk-free rate, and is the indicator function.
With and , we use Formulas (2-3) of Guo and Liu (2019), which are the BSM formulas in which stock price (strike price ) is replaced by (), to price CSI 300 index options. Note that for 2021, () is between 0.0 and 67.2 index points (between 0.0 and 67.3 index points).
The hedge of 50ETF options
It is complicated to test hedge empirically. We follow the procedure of Liu et al. (2022) as follows. First, we explicitly identify an option series by maturity and strike, which is a time series of trading days for a specific option. In order to ensure the totality of hedging, we include options that expire on or before December 22, 2021 (the last expiration date in 2021). The hedge is rebalanced weekly (i.e. seven calendar days). When a chosen hedge rebalancing day does not fall on a trading day, we decide to use the next, next second, or next third trading day instead.
We obtain the implied volatility for an option from either the SVI or spline models and compute the delta needed for hedging for each hedge-rebalancing day. On days when implied-volatility models are unavailable, we utilize the implied volatility for the immediate previous trading day. As a result, the number of options hedged is the same for both models.
To empirically assess the hedge effectiveness of SVI and spline, we compute the single-option hedging error for options according to Equation (15) in Liu et al. (2022), and report the mean hedging error (MHE) and mean absolute hedging error (MAHE).
Table A2 presents the hedging errors for China 50 ETF put and call options. For put options, the MHEs (MAHEs) exhibit significant variations across all eight categories, ranging from 0.2246 (0.5627) to 1.5247 (1.5279) for SVI and from 0.2015 (0.5513) to 1.4713 (1.4772) for spline model. The hedging errors for DOTM puts are much greater than those for OTM puts, and the errors for long-term puts are greater than those for short-term puts, which are consistent with the simulated results under BSM. Notably, The errors of the spline model are smaller than that of the SVI model in all four categories.
For calls, the MHEs (MAHEs) range from 0.0274 (0.1409) to 0.5530 (0.7301) for SVI and from 0.0488 (0.1422) to 0.5938 (0.7794) for the spline model. Except for OTM-long calls and DOTM-short calls, the errors of the spline model are smaller than those of the SVI model in the other two call categories.
In summary, the proposed spline outperforms in delta hedging of 50ETF OTM options in six out of eight categories. Economically speaking, this is expected and logically consistent with the superior performance of the spline model in pricing (both in-sample and out-of-sample), as extensively discussed in our paper. Therefore, our findings provide additional evidence that the spline model not only surpasses the SVI model in option pricing but also excels in option hedging for 50ETF options, which is highly encouraging.


