In this paper we develop a novel neural network model for predicting implied volatility surface. Prior financial domain knowledge is taken into account. A new activation function that incorporates volatility smile is proposed, which is used for the hidden nodes that process the underlying asset price. In addition, financial conditions, such as the absence of arbitrage, the boundaries and the asymptotic slope, are embedded into the loss function. This is one of the very first studies which discuss a methodological framework that incorporates prior financial domain knowledge into neural network architecture design and model training. The proposed model outperforms the benchmarked models with the option data on the S&P 500 index over 20 years. More importantly, the domain knowledge is satisfied empirically, showing the model is consistent with the existing financial theories and conditions related to implied volatility surface.
More than Just Lines on a Map: Best Practices for U.S Bike Routes
Incorporating Financial Domain Knowledge into Neural Networks for Implied Volatility Prediction
1. Incorporating Prior Financial Domain Knowledge into
Neural Networks for Implied Volatility Surface Prediction
Yu Zheng
1. School of Finance
Southwestern University of Finance
and Economics
2. Inboc Technologies Ltd.
zhengyu@swufe.edu.cn
Yongxin Yang
1. School of Informatics
University of Edinburgh
2. ArrayStream Technologies Ltd.
yongxin.yang@ed.ac.uk
Bowei Chen*
Adam Smith Business School
University of Glasgow
bowei.chen@glasgow.ac.uk
* Corresponding author. | DOI: 10.1145/3447548.3467115
KDD 2021
2. Option on S&P 500 index
Call option payoff
S&P 500 index
𝑋
max{0, 𝑋 − 𝐾}
Strike price 𝐾
Payoff
3. Stochastic underlying models
• Brownian motion (or Wiener process, or continuous-time random walk)
• Geometric Brownian motion
• Jump diffusion process
• Levy model
• Bachelier, L. (1900) Theorie de la speculation. Annales Scientifiques de l’Ecole Normale Superieure, 3(17):21–86.
• Samuelson, P. (1965) Proof that properly anticipated prices fluctuate randomly. Industrial Management Review, 41– 49.
• Kou, S. (2002) A jump-diffusion model for option pricing. Management Science, 48(8):1086–1101.
• Hobson, D. (2004) A survey of mathematical finance. Proceedings of the Royal Society: Mathematical, Physical and Engineering
Sciences. 460:2052, 3369-3401
4. Black-Scholes model
Black, F., & Scholes, M. (1973) The pricing of options and corporate liabilities. Journal of Political Economy, 81, 637–654.
5. Implied volatility
The implied volatility of an option is defined as the
inverse problem of option pricing, mapping
from the option price in the current market to a
single value. When it is plotted against the
option price strike price and the time to maturity,
it is referred to as the implied volatility surface.
Strike price
Tim
e to
m
aturity
Implied
volatility
6. To avoid dealing with interest rates and dividends, the forward measure is used
Cont, R., & Da Fonseca, J. (2002) Dynamics of implied volatility surfaces. Quantitative Finance, 2, 45–60.
7.
8. Absence of arbitrage conditions
Gulisashvili, A. (2012) Analytically Tractable
Stochastic Stock Price Models. Springer.
Boundaries conditions
Carr, P., & Wu, L. (2007) Stochastic
skew in currency options. Journal of
Financial Economics, 86, 213–247.
Asymptotic slope condition
Lee, R. (2004) The moment formula for
implied volatility at extreme strikes.
Mathematical Finance. 14(3), 469–480.
9. Volatility smile and smile function
Implied
volatility
Strike price 𝐾
ATM
OTM call
ITM call
10. Neural network architecture
Yang, Y, Zheng, Y, & Hospedales, T. (2017). Gated neural networks for option pricing: rationality by design. AAAI.
11. Embedding constraints into optimisation
Joint data loss
Monotonicity condition
Absence of the butterfly arbitrage condition
Boundary condition
Asymptotic condition
Regularization
16. Conclusion
• Technology wise, we propose a framework of incorporating prior financial domain
knowledge into neural network design and training. This is an important step for interpretable
machine learning, and we hope the framework can motivate many other investigations of
machine learning applications in finance.
• From the application perspective, we develop a best-performing prediction model, and the
conventional financial conditions and empirical evidence are met empirically. To the best of
our knowledge, this is one of the very first neural networks tailored for implied volatility
surface.