DeepOption: A novel option pricing framework based on deep learning with fused distilled data from multiple parametric methods

doi:10.1016/j.inffus.2020.12.010

Information Fusion

Volume 70, June 2021, Pages 43-59

https://doi.org/10.1016/j.inffus.2020.12.010 Get rights and content

Highlights

•
A new option pricing and delta-hedging framework based on deep learning is proposed.
•
The framework can process small and imbalanced datasets containing real option data.
•
Fused distilled data from conventional parametric option pricing methods are used.
•
The biased data problem is tackled with label augmentation and transfer learning.
•
We propose a single model applicable to all situations, regardless of moneyness.

Abstract

The remarkable performance of deep learning is based on its ability to learn high-level features by processing large amounts of data. This exceptionally superior performance has attracted the attention of researchers studying option pricing. However, option data are more expensive and less accessible than other types of data and are imbalanced because of the liquidity of options. This motivated us to propose a new option pricing and delta-hedging framework called DeepOption. This framework, which is based on deep learning, can improve the performance even when applying imbalanced real option data. In particular, the framework fuses simulated big data, known as distilled data, obtained using various traditional parametric methods. The proposed model employs the following three-stage training approach: Our model is pre-trained using big distilled data after it is fine-tuned using real option data through transfer learning. Finally, a delta branch is added to the model and trained. We experimentally evaluated the proposed method using three sets of real option data, namely S&P 500 European call options, EuroStoxx50 call options, and Hang Seng Index put options. Our experimental results on option pricing demonstrate that our proposed model outperforms parametric methods and other machine learning methods. Specifically, our model, which uses pre-training with distilled data, reduces the overall mean absolute percentage error (MAPE) by more than 50%, compared with that of a deep learning model using only real option data without pre-training.

Introduction

Options are one of the most important derivatives applied to risk management. An option offers the buyer the right, as opposed to an obligation, to trade the underlying asset at a fixed price. These rights enable buyers to trade at a certain price; thus, accurate pricing is especially crucial for investors by helping them to make the right decisions in terms of their perceived risk. However, option pricing is highly complicated owing to numerous internal and external factors, including stock price, strike price, option type, time-to-maturity, and macroeconomic factors. Considering the significance of risk management and the difficulty of accurate pricing incorporating the many crucial factors, option pricing is one of the most challenging and critical issues in finance.

Option pricing is a methodical process that entails the calculation of theoretically fair prices under certain conditions [1]. The basic factors that affect the pricing are the strike price, spot price of the underlying asset, volatility, time-to-maturity, and risk-free rate. The option price can be calculated using both parametric and non-parametric methods [2]. Black and Scholes initiated the application of parametric methods to option pricing [1]. Although the Black–Scholes (BS) model assumes constant volatility over the maturity period, it is widely used in practice because it is easy to apply and only slightly different from other methods with respect to the pricing of at-the-money options and options with short maturity. However, the BS model has limitations in that the method is based on constant volatility. Thus, advanced option pricing methods using stochastic volatility have been studied to reflect more realistic market environments and overcome the limitations of the BS model. Typical advanced models include Heston's random diffusion process [3] and Merton's jump diffusion process [4]. In addition, more advanced models, such as the Lévy process [5] and variance gamma process [6], have been actively studied.

Parametric methods that apply numerical approaches to option pricing have also been developed. Numerical methods are used when it is difficult to solve a model in a regular form owing to a large number of variables or when the BS model is difficult to apply when a closed-form does not exist. Typical numerical option pricing methods include the Monte Carlo method (MC), finite differential method (FDM), and binomial tree method (BI) [7]. The BI method, developed by Cox, Ross, and Rubinstein [8], is straightforward, converges to the BS model within the limit of a small period, and offers greater flexibility for incorporating other factors such as dividends and the interest rate, which can change prior to maturity [9]. Boyle [10] used the MC method for option pricing. As the dimensionality of the problem increases, this approach becomes more attractive than others because the error convergence rate is independent of the dimensionality of the problem. In addition, the restriction of the MC method is lower than those of other numerical approaches. Subsequently, more efficient methods of generating random numbers, such as the quasi-Monte Carlo method [11], the variance reduction method (e.g., the antithetic variates [12] and control variates [13] methods), and the importance sampling method [14] for error reduction have been proposed. The FDM, which was first employed by Schwartz [15], determines the option price by approximating the differential equations to difference equations. The FDM obtains accurate numerical solutions by solving the partial differential equations, where it quickly converges to a stable solution compared to other methods. Since the ultimate goal of the FDM is to solve partial differential equations, it is studied with the goal of solving new models developed through the progress of the BS model.

In addition to these approaches, deep neural networks (DNNs) with multiple hidden layers, which learn highly complex nonlinear relationships among input data, have demonstrated outstanding performances. Therefore, DNNs have been applied in many areas, including speech recognition, visual object recognition, object detection, drug discovery, and genomics [16]. Advances in neural networks have led to the development of non-parametric DNN-based option pricing methods [7,17,18]. Unlike parametric methods, neural network-based option pricing methods do not require many assumptions and are easy to implement, as long as sufficient data are available.

Option pricing models using an artificial neural network were first introduced by Hutchinson et al. [19]. Unlike their predecessors, such models do not specify the way in which inputs affect the option prices; instead, they allow the network to explore the relationships using data. The results showed that the neural network model for option pricing and delta hedging achieved superior performance compared to that of BS, which employed historical volatility. Lajbcygier and Connor [20] proposed a hybrid neural network to estimate the difference between the price obtained using conventional parametric methods and the real market option price by applying a bootstrap method for trading. Gençay and Qi [21] overcame one of the shortcomings of a neural network (i.e., overfitting) using Bayesian regulation, early termination, and bagging. This approach yields better performance when the proposed methods are applied to out-of-sample pricing and delta-hedging. Amilon [22] proposed a neural network model for predicting the bid price and ask price using daily call option data. Gradojevic et al. [18] proposed a method using a modular neural network consisting of three to nine modules based on moneyness and time-to-maturity; that is, they used different modules depending on these two factors. Liang et al. [7] forecasted option prices using an improved version of a conventional option pricing method. This allowed them to forecast option prices with the aid of neural networks and a support vector machine. Das et al. [23] proposed a hybrid parametric and machine-learning model to price European index options with homogeneity, for example, categorizing option data based on moneyness and time-to-maturity.

Unlike traditional machine-learning algorithms, deep learning (DL) tends to become more accurate as data availability increases [24]. Given sufficient data with a distribution similar to that of out-of-sample data, a DL model can learn more features based on the training data. Furthermore, it is possible to use large and deep networks, which have a superior representation capacity, and thus learn high-level representations without being concerned about the overfitting problem. Neural network architectures and hyperparameters are critical because basic DL algorithms learn features by themselves from the training data. In addition, large and good-quality training data are essential when using DL. However, the availability of daily financial data is rather limited, and a relatively small amount of data is available compared with other fields.

Option data are more expensive and less accessible than other types of data, including other financial data [25]. Option data are mainly obtained using databases such as Bloomberg and Wind Database, but these databases are expensive for individuals to use. Even if individuals can buy option data separately without using a database, the longer the period to acquire more data, the higher the purchase cost. In addition, owing to the liquidity of options, option data may be imbalanced with respect to moneyness, even if they are obtained at a high cost. Moneyness (i.e., in-the-money (ITM), at-the-money (ATM), or out-of-the-money (OTM) option) is an important issue in option pricing because it is the main indicator that helps option traders choose correct options based on specific situations. Imbalanced data also present a challenge in fields other than option pricing. A model trained using imbalanced data predominantly learns the patterns of classes that constitute the majority, whereas the patterns in small classes are insufficiently learned. Several methods have been developed to solve the problem presented by imbalanced data, including resampling, changes in evaluation metrics, and synthetic data augmentation. Synthetic data augmentation is one of the most efficient methods to tackle the imbalance problem. However, it is difficult to apply data augmentation to pattern-sensitive financial data because using the method incorrectly could cause more serious problems. Therefore, in an attempt to solve the problem of financial data scarcity, Baek and Kim [26] proposed ModAugNet, which enables feature augmentation such that the representation capacity of the DNN is sufficiently large and overfitting is prevented. Jeong and Kim [27] used pre-training methods with index component stocks to develop a trading system with a deep Q-network.

In the field of option pricing, previous approaches have employed moneyness to divide the data into sections to which different models can then be applied [19] or allow different modules to be used depending on the moneyness and maturity [18]. Other possible approaches involve the use of only a fraction of the data, OTM [17], or the use of data simulated based on the BS model [28]. However, methods applicable to all situations are more useful when applied as a single model rather than as a model that can be used in specific situations. This is because training a single model is more cost-effective. In addition, the division of data or the use of only a portion of the data is inappropriate in DL, where the amount of data is important. Therefore, models that use only data simulated by the BS model cannot be realistically applied to the pricing of real market data because the pricing obtained with this model are not close to real market prices as a result of the numerous assumptions made.

The motivation for our research is as follows. First, to improve the pricing performance over that of parametric methods and other machine learning methods, we consider the application of DL in the area of option pricing. Second, in situations in which the real market data are either insufficient or imbalanced, the use of a single model rather than multiple models will be helpful in reducing the computational cost and amount of training required. Third, combining various existing option pricing methods in the proposed framework and fusing the data they generate will enable the knowledge acquired by each method to be mimicked; this will both increase the data and ensure a more balanced dataset. Specifically, the distilled data will allow the DNN model to mimic the knowledge acquired by various parametric option pricing methods such as BS, BI, MC, and FDM. In other words, using supervised learning, the proposed framework will be able to imitate a function with fused input and output information from various traditional option pricing models. Thus, transferring knowledge from these various option pricing models can improve real option forecasts.

In addition, to estimate the option prices for all types of moneyness using a single model, the generation of distilled data enables us to create balanced data with respect to moneyness. This solves the problem caused by the imbalanced nature of real data. Furthermore, learning the outputs of diverse models simultaneously has a regularization effect, which improves the prediction performance. Ultimately, we developed a combined delta-hedging module and an option pricing module based on DL because delta can be used to measure the major risks associated with the options. In particular, delta is the ratio comparing the change in price of the underlying asset with the corresponding change in the option price, which is the first derivative of the option price with respect to the price of the underlying asset. Therefore, our delta-hedging model estimates differentiation through the use of a DNN.

In this study, a novel option pricing and delta-hedging framework called DeepOption is proposed, which is applicable to all situations with respect to moneyness. This framework consists of an option pricing module and a delta module, and both modules are designed using a DNN. We use real market option data and simulated big data. The latter are generated using four parametric option valuation methods—BS, BI, MC, and FDM—and are subsequently fused. We refer to these simulated big data as (big) distilled data. DeepOption is a three-stage training model that not only improves real option price forecasts but also overcomes the problem of imbalanced data. In addition to estimating option prices, DeepOption estimates the value of delta for hedging purposes. Specifically, the first stage of the proposed framework entails pre-training the DNN model for option pricing using the distilled data. This pre-training stage with the distilled big data is necessary to help the DNN improve the representation capacity. The second stage of DeepOption is the main training stage of the option pricing module. We fine-tune the model using real market option data to learn the patterns of real option prices. Since the amount of distilled data is about tens to hundreds of times the amount of real market option data, the process of training DeepOption will be less successful if we were to use these two types of data together. The final training stage of DeepOption is to train the delta module. Specifically, the delta module is trained to estimate delta by using the estimated option price, which is the output of the option pricing module, and the stock price as inputs. In addition, this module can learn gamma for the delta-gamma hedging strategy, and can be applied to other Greek estimation modules by changing the input variables. This is explained in Section 3.2.2.

We evaluated our model by analyzing its performance with respect to pricing and hedging. Empirical tests using out-of-sample data were conducted on S&P 500 call options, EuroStoxx50 call options, and Hang Seng Index put options, which are globally important benchmark indicators. We calculated the mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) to measure the pricing accuracy of our proposed method and compared it with those of other approaches. We verified the effectiveness of our model in terms of pricing performance in three different ways. First, we compared the error in the option price estimated by the models using various combinations of four types of distilled data to determine the best combinations. Second, we compared the proposed method with a baseline DNN model that is not pre-trained with distilled data and has the same architecture to verify the effect of distilled data. Finally, we compared the performance of our proposed method with those of the parametric methods and other machine learning methods. We evaluated the hedging performance using delta hedging, which is one of the most common hedging methods. The hedging performance was evaluated by calculating the tracking error, which was then compared with those of other parametric methods. In addition, we conducted delta-gamma hedging to confirm that our proposed framework is applicable to other Greeks.

The remainder of this paper is organized as follows. Section 2 discusses the related methods used in this study. Section 3 describes the data and our proposed methodology. Section 4 compares the performance of the proposed method with those of other approaches. Finally, Section 5 provides some concluding remarks and directions for future research.

Section snippets

Background

In this section, we first describe the best-known parametric option pricing methods—BS, BI, MC, and FDM—which we used to generate distilled option pricing data, and then briefly introduce the DNNs applied. In this paper, the following notations are used: $S$ is the price of the underlying asset, $K$ is the exercise price of the option, $σ$ is the volatility of the asset price, $r$ is the current risk-free interest rate, $Δ t$ is the time interval, $T$ is the expiration time, and $C$ is the option price.

Methodology

A DFNN is advantageous in that it can learn unrevealed relationships among input variables such as $S$ , $K$ , $r$ , $τ$ , $σ$ , and the target value, which is the option price. However, the learning process of a DFNN may be ineffective when insufficient or biased data are used. Real market call option data are typically biased data consisting mostly of ITM options, while real market put option data are biased for OTM options, as mentioned in the Introduction section. These skewed data may prevent the DFNN

Experiments and results

The option data for the S&P 500 call, EuroStoxx50 call, and Hang Seng Index put were used to comprehensively analyze the performance and robustness of the proposed framework. Instead of using real market put option data, we can theoretically calculate put option prices by put-call parity if we assume a perfectly efficient market. However, the prediction errors, MAE and RMSE, of the put option, which is calculated by the put-call parity theorem, are the same as those of the call option.

Conclusion

DL has recently received considerable attention owing to its outstanding performance and is being widely applied to solve financial problems such as forecasting the stock market index [26], credit card fraud detection [37], detection of the relationship between banking operations [38], and taxi demand prediction [39]. Since DL learns features from data, the quantity and quality of data have a crucial impact on performance. Real data is generally difficult to obtain, and the distribution of data

Financial support

This research was supported by a grant (NRF-2020R1F1A1071527) from the National Research Foundation of Korea (NRF), funded by the Korea government (MSIT; Ministry of Science and ICT). The funders played no role in the study design, data collection, and analysis, decision to publish, or preparation of the manuscript.

CRediT authorship contribution statement

Ji Hyun Jang: Writing - original draft, Methodology, Formal analysis, Software, Data curation, Investigation, Visualization. Jisang Yoon: Software, Writing - review & editing, Formal analysis, Methodology. Jungeun Kim: Software, Writing - review & editing, Formal analysis, Methodology. Jinmo Gu: Software, Writing - review & editing, Formal analysis. Ha Young Kim: Conceptualization, Methodology, Formal analysis, Investigation, Writing - review & editing, Supervision, Project administration,

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References (39)

H. Park et al.
Parametric models and non-parametric machine learning models for predicting option prices: Empirical comparison study over KOSPI 200 Index options
Expert Syst. Appl.
(2014)
R.C. Merton
Option pricing when underlying stock returns are discontinuous
J. Financ. Econ.
(1976)
X. Liang et al.
Improving option price forecasts with neural networks and support vector regressions
Neurocomputing
(2009)
J.C. Cox et al.
Option pricing: a simplified approach
J. Financ. Econ.
(1979)
P.P. Boyle
Options: a Monte Carlo approach
J. Financ. Econ.
(1977)
E.S. Schwartz
The valuation of warrants: implementing a new approach
J. Financ. Econ.
(1977)
Y. Baek et al.
ModAugNet: a new forecasting framework for stock market index value with an overfitting prevention LSTM module and a prediction LSTM module
Expert Syst. Appl.
(2018)
G. Jeong et al.
Improving financial trading decisions using deep Q-learning: predicting the number of shares, action strategies, and transfer learning
Expert Syst. Appl.
(2019)
I. González-Carrasco et al.
Automatic detection of relationships between banking operations using machine learning
Inf. Sci.
(2019)
F. Rodrigues et al.
Combining time-series and textual data for taxi demand prediction in event areas: A deep learning approach
Inf. Fusion.
(2019)

F. Black et al.

The pricing of options and corporate liabilities

J. Polit. Econ.

(1973)

S.L. Heston

A closed-form solution for options with stochastic volatility with applications to bond and currency options

Rev. Financ. Stud.

(1993)

P. Carr et al.

Stochastic volatility for Lévy processes

Math. Financ.

(2003)

D.B. Madan et al.

The variance gamma process and option pricing

Rev. Financ.

(1998)

Y. Tian

A flexible binomial option pricing model

J. Futur. Mark.

(1999)

H. Niederreiter

Quasi-Monte Carlo methods and pseudo-random numbers

Bull. Am. Math. Soc.

(1978)

J.M. Hammersley et al.

A new Monte Carlo technique: antithetic variates

Math. Proc. Cambridge Philos. Soc.

(1956)

R.Y. Rubinstein et al.

Efficiency of multivariate control variates in monte carlo simulation

Oper. Res.

(1985)

D. Siegmund

Importance sampling in the Monte Carlo study of sequential tests

Ann. Stat.

(1976)

Cited by (11)

A multi-attribute decision-making fusion model for stock trading with customizable investor personality traits in a picture fuzzy environment[Formula presented]
2023, Applied Soft Computing
In this paper, a fuzzy logic-based machine learning (ML) algorithm is introduced. This proposed ML algorithm accepts picture fuzzy sets (PFS) as the fuzzified input and incorporates genetic algorithm (GA) during the training process. The proposed ML algorithm is then incorporated into two well-known decision-making methods, namely the Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) and Evaluation Based on Distance from Average Solution (EDAS). These two decision-making methods and the proposed ML algorithm are then applied to solve a multi-attribute decision-making (MADM) problem related to the evaluation and ranking of public listed companies based on their stock performance, in accordance with investors’ personalities. The actual daily closing stock price of five public listed companies from the big market capitalization (Big Cap) category traded in the Kuala Lumpur Stock Exchange (KLSE) for a period of 10 years is used as the datasets for this study. Monte Carlo simulation is used to verify the accuracy of the results. In addition, a comprehensive comparative study of some recent PFS-based decision-making methods in the existing literature and the proposed methods is conducted, and all the typical instances of the investors’ personalities are observed. The results obtained through this comparative study corroborates the results obtained via the proposed methods, and this proves the effectiveness of the proposed methods. The differences in the results obtained via the different methods are analyzed and discussed, and this again proves that the results obtained via the proposed methods are effective and consistent with the judgments of human experts.
Attentive gated graph sequence neural network-based time-series information fusion for financial trading
2023, Information Fusion
Citation Excerpt :
Huang et al. [6] designed a novel sampling strategy to reduce the immense computational resources required for RL. Jang et al. [7] proposed a new option pricing and delta-hedging framework based on deep learning. Moews et al. [8] proposed the use of deep neural networks (DNNs) that apply stepwise linear regressions with exponential smoothing in preparatory feature engineering.
With the advances in financial technology (FinTech) in recent years, the finance industry has attempted to enhance the efficiency of their services through technology. The financial service most commonly desired by people is a robo-advisor, which is a virtual expert that advises people on how to make good decisions related to financial market trading. Some mathematical models may be difficult to apply to prediction problems involving multiple cross correlation. To overcome this issue, the rules of and implicit correlation between the collected data must be determined through algorithms to forecast the future market situation. By combining financial information from the financial market, we apply an information fusion approach to collect data. Also, a relational model can be developed using a deep neural network, which can be represented using a graph structure, to determine the real market situation. However, the structural information of the graph may be lost if a traditional deep learning module is used. Therefore, in this paper, we present a visual-question-answering-like deep learning fusion model based on the graph structure and attention mechanism. This model was used to study the interaction between the time-series data of various financial variables. The relationships among financial commodities were formulated, and the graph representation was learned through graph neural networks. Moreover, the importance of each commodity was determined through the attention mechanism. The proposed method improved the accuracy of trend and volatility prediction up to 6% on S&P500 and developed a pairs trading application.
An accurate and stable numerical method for option hedge parameters
2022, Applied Mathematics and Computation
Citation Excerpt :
Ivacu [15] examined various machine learning algorithms including “support vector regressions” to overcome the limitations of parametric models, and the algorithms outperformed classical methods such as Black-Scholes in pricing European call options. Jang et al. [16] proposed the “Deep Option” framework, which solves the inherent data shortage problem due to the liquidity of the option data by pre-training the deep learning model with distilled data. Based on a hybrid gated neural network (hGNN), [17] developed a model that effectively handles options on the boundary and option Greeks by applying no-arbitrage constraints to the input layer parameters for their differentiable pricing model. [18]
We propose an unconditionally stable numerical algorithm, which uses the Feynman-Kac formula of the Black-Scholes equation to obtain accurate option prices and hedge parameters. We discretize the asset and time using uniform grid points. We approximate the option values by piecewise quadratic polynomials for each time step and integrate them analytically over the log-normal distribution. The piecewise quadratic approximation gives the third-order convergence in the asset direction, and the analytic integration reduces truncation error in the time direction. The estimation errors are propagated backward in time following the convection and diffusion characteristics of the Black-Scholes equation, which assures the unconditional stability of our method. The vectorized code implementation reduces the time complexity. The convergence test shows that our approach outperforms the Crank-Nicolson scheme of the finite difference method in both time and asset directions, and the stability test verifies that our method is stable as the Crank-Nicolson. Furthermore, we show that our algorithm reduces the price errors and hedge parameter errors by more than 50% from the benchmark.
Neural Network Approach to the Problem of Predicting Interest Rate Anomalies under the Influence of Correlated Noise
2023, Doklady Mathematics
Artificial Intelligence in Accounting and Finance: Challenges and Opportunities
2023, IEEE Access
The efficient hedging frontier with deep neural networks
2021, ICAIF 2021 - 2nd ACM International Conference on AI in Finance

View all citing articles on Scopus

View full text

DeepOption: A novel option pricing framework based on deep learning with fused distilled data from multiple parametric methods

Highlights

Abstract

Introduction

Section snippets

Background

Methodology

Experiments and results

Conclusion

Financial support

CRediT authorship contribution statement

Declaration of Competing Interest

Expert Syst. Appl.

J. Financ. Econ.

Neurocomputing

J. Financ. Econ.

J. Financ. Econ.

J. Financ. Econ.

Expert Syst. Appl.

Expert Syst. Appl.

Inf. Sci.

Inf. Fusion.

The pricing of options and corporate liabilities

J. Polit. Econ.

A closed-form solution for options with stochastic volatility with applications to bond and currency options

Rev. Financ. Stud.

Stochastic volatility for Lévy processes

Math. Financ.

The variance gamma process and option pricing

Rev. Financ.

A flexible binomial option pricing model

J. Futur. Mark.

Quasi-Monte Carlo methods and pseudo-random numbers

Bull. Am. Math. Soc.

A new Monte Carlo technique: antithetic variates

Math. Proc. Cambridge Philos. Soc.

Efficiency of multivariate control variates in monte carlo simulation

Oper. Res.

Importance sampling in the Monte Carlo study of sequential tests

Ann. Stat.