Forecasting volatility with a stacked model based on a hybridized Artificial Neural Network
Eduardo Ramos-Pérez (1), Pablo J. Alonso-González (2), José Javier Núñez-Velázquez (2)

(1) Ph.D. Student (Economics and Management Program), Universidad de Alcalá. (2) Economics Department, Universidad de Alcalá. ∗†

Abstract
An appropriate calibration and forecasting of volatility and market risk are some of the main challenges faced by companies that have to manage the uncertainty inherent to their investments or funding operations, such as banks, pension funds or insurance companies. This has become even more evident after the 2007-2008 Financial Crisis, when the forecasting models assessing the market risk and volatility failed. Since then, a significant number of theoretical developments and methodologies have appeared to improve the accuracy of volatility forecasts and market risk assessments. Following this line of thinking, this paper introduces a model based on a set of Machine Learning techniques, such as Gradient Descent Boosting, Random Forest, Support Vector Machine and Artificial Neural Network, where those algorithms are stacked to predict S&P500 volatility. The results suggest that our construction outperforms other habitual models in its ability to forecast the level of volatility, leading to a more accurate assessment of the market risk.
Keywords:
Machine learning, Stacking algorithms, Risk assessment, Volatility forecasting, Hybrid models
AMS Subject Classification:
∗ Authors' address: (1) & (2): Economics Department, Universidad de Alcalá, Plaza de la Victoria 2, 28802 Alcalá de Henares, Spain. E-mails: P.J. Alonso-González, [email protected]; J.J. Núñez, [email protected]; E. Ramos, [email protected]

† Corresponding author: P. Alonso. Date: August 19, 2020. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/

1 Introduction

During the Financial Crisis of 2007-2008, unexpected falls in stock prices resulted in significant losses for individual investors and financial institutions. Since then, new regulations have entered into force in order to ensure the correctness of the market risk assessment provided by financial institutions and to allow individual market participants to be aware of the risk linked to financial products. As volatility is an indicator of the uncertainty associated with the asset profitability (Hull 2015 and Rajashree and Ranjeeta 2015), this variable tends to play a key role within the risk models. In fact, events like the bankruptcy of LTCM in 1998 (Lowenstein 2000), the dotcom crash in 2001 (Aharon et al. 2010) or, more recently, the aforementioned Financial Crisis of 2007-2008 were not foreseen by most of the risk models due to inaccurate estimates produced by the volatility forecasting models. It is worth mentioning that, as volatility is not directly observed, before estimating any statistical model it is necessary to select a volatility proxy (Poon and Granger 2003). In the following paragraphs, the proposed methodology and the main families of volatility forecasting models (GARCH, Stochastic and Machine Learning) are presented.

First of all, GARCH models are introduced, as this family of models is probably the most widely used in the literature due to its ability to fit the volatility clustering (Mandelbrot 1963) empirically observed in financial time series.
This auto-regressive approach and its generalization were developed by Engle (1982) and Bollerslev (1986) respectively. Classical GARCH models were found to be too rigid for fitting return series, especially over a long time span, because the estimated persistence of conditional variances is close to one (Bauwens et al. 2012). Therefore, more flexible GARCH models were developed in order to overcome this problem. Engle and Lee (1999) suggested a two-equation model where each equation represents the long-run and short-run components of volatility, respectively. Mixed-normal GARCH (Haas et al. 2004a) is a second way to deal with this problem. This kind of model allows choosing amongst several regimes at each instant of time t. The drawback of this methodology is that it assumes that the variables used to decide amongst regimes are all independent over time. To overcome this problem, Haas et al. (2004b) proposed a Markov-switching model where the parameters of a GARCH model change according to a Markov process. An extension of this kind of model can be found in Haas and Paolella (2012). Before concluding with the GARCH models, it is important to mention that volatility can behave differently depending on the trend of the market: bullish or bearish. To fit this behaviour, Nelson (1991) developed the EGARCH model, which allows the sign and the volume of previous values to have separate impacts on the volatility forecasts. In addition to the EGARCH model, Glosten et al. (1993) proposed the GJR-GARCH to replicate the aforementioned behaviour. Other developments within this family can be found in Engle and Kroner (1995) with their BEKK model, the factor model (Engle et al. 1990), the Constant Conditional Correlation model (Bollerslev 1990), the time-varying correlation model (Tse and Tsui 2002), the dynamic correlation model (Engle 2002) or the multivariate GARCH approach proposed by Kraft and Engle (1982) and Engle et al. (1984) and its financial implementation by Bollerslev et al.
(1988). More recently, Zhang et al. (2018) have proposed a first-order zero-drift GARCH (ZD-GARCH) to study heteroscedasticity and conditional heteroscedasticity together.

The second family is composed of those models which assume that the volatility is driven by its own stochastic process. This approach was introduced by Taylor (1982) as an Euler approximation of the underlying diffusion model. Assuming that stock prices follow a Brownian motion, Heston (1993) derived a model where the volatility follows an Ornstein-Uhlenbeck process. To derive the parameters of the Heston Model, two different strategies have been adopted in the literature: moments or simulation. For the first one, the Generalized Method of Moments was proposed by Melino and Turnbull (1990) and Andersen and Sorensen (1999), while the simulation approach has been used by Danielsson (2004), Durbin and Koopman (1997), Broto and Ruiz (2004) or Andersen (2009), amongst others.

The last family presented is Machine Learning, which comprises a set of techniques used to analyse the future evolution of stock prices and volatility. These algorithms try to learn automatically and recognize patterns in a large amount of data (Krollner et al. 2010). It is worth mentioning that the fitting of these algorithms is quite sensitive to the forecasting time-frame and the selected input variables. Armano et al. (2005) and de Faria et al. (2009) suggest using one day as a time-frame and lagged or technical indicators as input variables for the Machine Learning algorithms. Stock prices, volatilities and portfolio selection have been analysed using different methodologies based on Machine Learning, such as Support Vector Machine (Gestel et al. 2001), hidden Markov models (Gupta and Dhinga 2012 and Dias et al. 2019) or Artificial Neural Networks (ANN) (Hamid and Iqbid 2002). These last authors showed that volatility forecasts made by an ANN outperform the implied volatility derived from Barone-Adesi and Whaley options models.
Additionally, ANNs have been applied successfully to other financial series different from volatility and stock prices: bond rates (Surkan and Xingren 2001) and bank failures (Hutchinson et al. 1994). Deep learning (LeCun et al. 2015) is a framework closely related to ANNs which has been employed for predicting the evolution of the Korean stock market index (Chang et al. 2017).

Despite the high performance of ANNs, predictions derived from the use of this algorithm could be inaccurate when stock prices move sharply (Patel and Yalamalle 2014). To overcome this problem, ANNs were combined with other statistical models (Kristjanpoller et al. 2014), creating the so-called hybrid models. Hybridization can be defined as an approach in which several models are merged to form a new enhanced model in order to produce better forecasting results. Therefore, a hybrid model is a combination of artificial intelligence techniques with some components of the traditional forecasting models (like the ones presented within the GARCH family). Examples of this approach are discussed in Roh (2006), Hajizadeh et al. (2012), Lu et al. (2016), Monfared and Enke (2014) or Kristjanpoller et al. (2014), where different outputs from a GARCH-based model are used as inputs in an ANN. A more general picture of this type of hybrid models is provided by Bildirici and Ersin (2009), since they compared and combined an ANN with different types of GARCH models (GARCH, EGARCH, GJR-GARCH, TGARCH, NGARCH, SAGARCH, PGARCH, APGARCH and NPGARCH). In addition to the above-mentioned research, this type of hybrid model has been broadly used in other papers. Bildirici and Ersin (2014) proposed a MS-GARCH with an ANN to improve the forecasting accuracy, Bektipratiwi and Irawan (2011) combined a radial basis function with an EGARCH to model the stock returns of an Indonesian bank, and Arneric and Poklepovic (2016) developed an ANN model as an extension of a GJR-GARCH to forecast the market returns of six European emerging markets.
GARCH-based models have also been combined with ANNs to predict the volatility in commodity markets, such as gold (Kristjanpoller and Minutolo 2015) or oil (Kristjanpoller and Minutolo 2016). In this last case, the hybrid model included financial variables to improve the forecasts. This strategy can also be found in Kristjanpoller and Hernández (2017). Kim and Won (2018) propose a hybrid model that combines an LSTM with various GARCH-type models to forecast the volatility of the KOSPI index. A refinement of this model can be found in Back and Kim (2018). It should be mentioned that these models can be generated in both directions: some outputs of a GARCH model can be used as inputs of an ANN and vice versa (Lu et al. 2016). Finally, it should be noted that hybridisation cannot only be made with ANNs. Peng et al. (2018) proposed a structure combining traditional GARCH models with Support Vector Machine (SVM) (Cortes and Vapnik 1995).

The research carried out along this paper develops a volatility forecasting model that consists of two different levels, and which is based on the stacking algorithms methodology (Hastie et al. 2009) and statistical models of the Machine Learning family. Random Forest (RF) (Breiman 2001), Gradient Boosting (GB) with regression trees (Friedman 2000) and Support Vector Machine (SVM) (Cortes and Vapnik 1995) are used in the first level, while an ANN (Mcculloch and Pitts 1943) is incorporated within the second level of the stacked model (Stacked-ANN) in order to generate the volatility forecasts. A different two-level approach can be found in Kristjanpoller and Minutolo (2018). They use an ANN-GARCH model with a pre-processing based on principal components analysis to reduce the number of inputs employed in their network.
In contrast to the hybrid models defined previously, the proposed model merges the results arising from other machine learning algorithms which are free of some theoretical assumptions, like the use of a predefined distribution for the underlying asset returns or a constant level of unconditional variance. Because of this, and with the aim of building a more flexible model, GARCH-based models are not present in the Stacked-ANN architecture. The proposed model relies completely on the predictions made by machine learning algorithms and market data. Additionally, in the case of the Stacked-ANN the final forecasts made by the first level algorithms are directly used as inputs within the ANN while, in most of the hybrid models discussed in the previous paragraphs, sections of the GARCH-based models are inserted separately in the ANN.

The rest of the paper proceeds as follows: Section 2 presents the set of volatility forecasting models used for comparison purposes. Furthermore, the risk measures and tests used to validate the results are discussed. In Section 3 the theoretical background and architecture of the volatility forecasting model based on stacking algorithms (Stacked-ANN) are explained. The empirical results of the different forecasting models are shown in Section 4, where the accuracy and the risk measures arising from the proposed model are compared with the results obtained by the methodologies explained in Section 2. Finally, Section 5 presents the main conclusions of the results and comparisons carried out along Section 4.

2 Benchmark models, risk measurements and statistical tests
As stated above, this section is focused on explaining the benchmark models and the tests used to back-test the risk measurements. Thus, the first paragraphs are dedicated to the ANN, ANN-GARCH, ANN-EGARCH and Heston Model, while the end of this section is focused on the risk measurements and the tests performed to validate and compare the results of the benchmark models with the one proposed in Section 3.

The first benchmark model is a feed-forward ANN. Following the notation provided by Bishop (2006) and assuming that the algorithm has two hidden layers, the model would be defined by the following expression:

$$\hat{\sigma}_{t+1} = h^{(3)}\left(\sum_{k=1}^{T} w^{(3)}_{p,k}\, h^{(2)}\left(\sum_{j=1}^{M} w^{(2)}_{k,j}\, h^{(1)}\left(\sum_{i=1}^{D} w^{(1)}_{j,i} x_i + w^{(1)}_{j,0}\right) + w^{(2)}_{k,0}\right) + w^{(3)}_{p,0}\right) \qquad (1)$$

where $h^{(n)}$ is the activation function associated with layer $n$, $w^{(n)}_{z,v}$ is the $v$-th weight associated with neuron $z$ inside layer $n$ and $x_i$ refers to the $i$-th input variable of the database comprised of the explicative variables selected by the analyst.

The second benchmark model is an ANN-GARCH($p$, $q$). As briefly introduced in Section 1, the aim of this hybrid model is to combine the GARCH($p$, $q$) estimates with other input variables by using an ANN, which is a more flexible model than a GARCH($p$, $q$). Therefore, before starting with the fitting of the ANN, the parameters of the GARCH($p$, $q$) model need to be estimated:

$$\hat{\sigma}^2_t = \omega + \sum_{i=1}^{q} \alpha_i r^2_{t-i} + \sum_{i=1}^{p} \beta_i \sigma^2_{t-i}, \qquad \hat{r}_t = \hat{\sigma}_t \epsilon_t \qquad (2)$$

In this formulation $\omega$, $\alpha_i$ and $\beta_i$ are the parameters to be estimated, while $r_t$ and $\sigma_t$ refer to the return and volatility respectively. The returns distribution is determined by the distribution selected for $\epsilon_t$. If a standardized normal or standardized Student's t-distribution is selected, then the returns generated by the model follow a conditional normal (CND) or conditional t-distribution (CTD) respectively (Bauwens et al. 2012).
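As an illustration of the dynamics in Equation (2), the sketch below simulates a GARCH(1,1) with standardized Student-t innovations. This is not the paper's code; the parameter values (omega, alpha, beta, nu) are hypothetical and were chosen only so the process is stationary.

```python
import numpy as np

def simulate_garch(n, omega=1e-6, alpha=0.1, beta=0.85, nu=5, seed=0):
    """Simulate returns from a GARCH(1,1) with standardized Student-t shocks."""
    rng = np.random.default_rng(seed)
    r = np.zeros(n)
    sigma2 = np.zeros(n)
    sigma2[0] = omega / (1 - alpha - beta)          # unconditional variance
    for t in range(1, n):
        # Equation (2) with p = q = 1
        sigma2[t] = omega + alpha * r[t - 1] ** 2 + beta * sigma2[t - 1]
        # Student-t innovation rescaled to unit variance
        eps = rng.standard_t(nu) / np.sqrt(nu / (nu - 2))
        r[t] = np.sqrt(sigma2[t]) * eps
    return r, np.sqrt(sigma2)

returns, vol = simulate_garch(1000)
```

With alpha + beta = 0.95 the simulated series exhibits the volatility clustering and high persistence that the text attributes to fitted GARCH models.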
Once the GARCH($p$, $q$) parameters are estimated, $\sum_{i=1}^{q} \alpha_i r^2_{t-i}$ and $\sum_{i=1}^{p} \beta_i \sigma^2_{t-i}$ can be computed and used as inputs (together with the rest of the explicative variables) within the ANN.

The third benchmark model is an ANN-EGARCH. The architecture of this model and the previous one can be considered the same, with the unique difference that the first step consists of fitting an EGARCH($p$, $q$) instead of a GARCH($p$, $q$) model. The EGARCH($p$, $q$) can be defined as follows (Nelson 1991):

$$\log \hat{\sigma}^2_t = \omega + \sum_{i=1}^{p} \alpha_i \log \hat{\sigma}^2_{t-i} + \sum_{i=1}^{q} \left(\beta_i \epsilon_{t-i} + \gamma_i \left(|\epsilon_{t-i}| - E|\epsilon_{t-i}|\right)\right) \qquad (3)$$

Once the EGARCH is fitted, the following terms can be calculated and used as inputs within the ANN together with the rest of the explicative variables selected by the analyst:

$$\sum_{i=1}^{p} \alpha_i \log \hat{\sigma}^2_{t-i}, \qquad \sum_{i=1}^{q} \beta_i \epsilon_{t-i}, \qquad \sum_{i=1}^{q} \gamma_i \left(|\epsilon_{t-i}| - E|\epsilon_{t-i}|\right) \qquad (4)$$

The last benchmark is the Heston (1993) Model. Even though this approach belongs to the stochastic family and the proposed one to the Machine Learning family, this model is used as a benchmark throughout this paper, as it is the most widely used process within the family of stochastic volatility models. It assumes that changes in stock prices through time ($dX_t$) follow a Brownian diffusion process:

$$dX_t = \mu X_t\, dt + \sqrt{\sigma_t}\, X_t\, dB_t \qquad (5)$$

where $B_t \sim N(0, \sigma_t t)$. Therefore, if volatility follows an Ornstein-Uhlenbeck process, the changes in this variable are defined by the following expression:

$$d\sigma_t = \theta (\upsilon - \sigma_t)\, dt + \delta \sigma_t\, dB^*_t \qquad (6)$$

where $\upsilon$ is the long term volatility, $\theta$ is the rate of return to $\upsilon$, $\delta$ is the volatility of $\sigma_t$ and $B^*_t$ is a Wiener process that has a correlation of $\rho$ with $B_t$.

Once the four benchmark models have been explained, the section focuses on the risk measurements.
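The Heston dynamics of Equations (5)-(6) can be sketched with a plain Euler discretization, as follows. All parameter values below (theta, upsilon, delta, rho and the starting values) are hypothetical placeholders, not estimates from the paper.

```python
import numpy as np

def simulate_heston(n_steps, dt=1 / 252, mu=0.05, theta=2.0, upsilon=0.02,
                    delta=0.3, rho=-0.5, x0=100.0, sigma0=0.02, seed=1):
    """Euler scheme for Equations (5)-(6) with corr(B, B*) = rho."""
    rng = np.random.default_rng(seed)
    x = np.empty(n_steps + 1)
    sigma = np.empty(n_steps + 1)
    x[0], sigma[0] = x0, sigma0
    for t in range(n_steps):
        z1 = rng.standard_normal()
        z2 = rho * z1 + np.sqrt(1 - rho ** 2) * rng.standard_normal()
        # Equation (6): Ornstein-Uhlenbeck-type volatility dynamics
        sigma[t + 1] = sigma[t] + theta * (upsilon - sigma[t]) * dt \
            + delta * sigma[t] * np.sqrt(dt) * z2
        # Equation (5): price diffusion (volatility floored at 0 for safety)
        x[t + 1] = x[t] + mu * x[t] * dt \
            + np.sqrt(max(sigma[t], 0.0)) * x[t] * np.sqrt(dt) * z1
    return x, sigma

prices, vols = simulate_heston(252)
```

Averaging many such simulated paths per day is, in spirit, how the Heston benchmark forecasts are produced later in the paper (Section 4 uses 20,000 simulations per day).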
As stated before, volatility plays a key role in market risk assessment. Therefore, the models will not only be compared in terms of accuracy, but the risk measurements arising from every volatility model are also going to be tested. For this purpose, VaR and CVaR have been selected as risk measures. Even though VaR is probably the most used metric due to its simplicity and easy interpretation, CVaR has also been included as it is considered to be a coherent risk measure (Artzner et al. 1999). Consequently, for every volatility model the aforementioned risk measures are going to be computed and validated by means of the following tests:

• Kupiec (1995) introduced a test in order to check if the number of VaR excesses is aligned with the level of confidence selected.

• An extension of the previous test was developed by Christoffersen et al. (1997). The aim of this test is to validate that VaR excesses are independent, identically distributed and in line with the selected level of confidence.

• Acerbi and Szekely (2014) developed a test (AS1) to assess the appropriateness of the CVaR based on the assumption that VaR has already been tested and considered to be correct from a statistical point of view. The test is inspired by the following equation:

$$E\left[\left.\frac{r_t}{CVaR_{\alpha,t}} + 1 \,\right|\, r_t + VaR_{\alpha,t} < 0\right] = 0 \qquad (7)$$

As VaR needs to be previously validated, the result of this test has to be assessed together with the two aforementioned tests.

• In addition to the previous test, Acerbi and Szekely (2014) introduced another method (AS2) to validate the CVaR without making any assumption about the appropriateness of the VaR.
To do so, this test checks a CVaR expression that is not conditioned by the correctness of a previous VaR estimate.

Before beginning with the Stacked-ANN architecture, it is worth noticing that the first two tests are parametric while the last two are non-parametric; for further details about how to compute the statistics and their distributions, please refer to the aforementioned papers.
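As a rough illustration of the risk measures being back-tested, the sketch below computes a historical VaR/CVaR and the Kupiec (1995) proportion-of-failures likelihood-ratio statistic. The function names and the sample data are ours; the Christoffersen and Acerbi-Szekely statistics are omitted.

```python
import numpy as np
from math import log

def var_cvar(returns, alpha=0.99):
    """Historical VaR and CVaR at confidence level alpha (losses as positives)."""
    var = -np.quantile(returns, 1 - alpha)     # loss threshold
    tail = returns[returns <= -var]            # returns beyond the VaR threshold
    cvar = -tail.mean()                        # expected loss beyond VaR
    return var, cvar

def kupiec_lr(n_obs, n_exceed, alpha=0.99):
    """Kupiec proportion-of-failures LR statistic (~ chi2(1) under H0)."""
    p = 1 - alpha                              # expected exceedance rate
    phat = min(max(n_exceed / n_obs, 1e-12), 1 - 1e-12)
    return -2 * (n_exceed * log(p) + (n_obs - n_exceed) * log(1 - p)
                 - n_exceed * log(phat) - (n_obs - n_exceed) * log(1 - phat))

rng = np.random.default_rng(0)
r = rng.standard_t(5, 2500) * 0.01             # simulated heavy-tailed returns
var99, cvar99 = var_cvar(r, 0.99)
```

By construction CVaR is never smaller than VaR at the same level, which is one reason it is preferred as a coherent risk measure.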
3 Stacked-ANN model

This section has been divided into several sub-sections in order to explain sequentially the proposed volatility forecasting model. As the Stacked-ANN model is composed of two different levels, the first two sub-sections are dedicated to the input data and the algorithms within the first level of the Stacked-ANN model, while the third and fourth sub-sections are focused on the data required to generate the stacking procedure and the details of the ANN fitted with the aforementioned information. (Figure 1 briefly explains the process followed to estimate and test the Stacked-ANN model.)
3.1 Input data

The first step is concerned with the creation of the database containing the volatility proxy to be used as the response and the explanatory variables selected to fit the algorithms. As the aim of the study is to predict future volatilities, the True Realized Volatility (hereinafter TRV) is going to be used as the response variable (Roh 2006):
$$TRV_t = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(r_{t+i-1} - \bar{r}_t\right)^2} \qquad (8)$$

where $\bar{r}_t = \sum_{i=1}^{n} r_{t+i-1}/n$ and $n = 5$. The window has been selected to be large enough to compute a stable TRV and small enough to avoid, as much as possible, mixing different volatility regimes.

The variables given to the first level algorithms to forecast the TRV are the last 30 volatilities computed with returns already observed in the market:

$$V_t = \sqrt{\frac{1}{n}\sum_{i=0}^{n-1}\left(r_{t-n+i} - \bar{r}_t\right)^2} \qquad (9)$$

where $\bar{r}_t = \sum_{i=0}^{n-1} r_{t-n+i}/n$ and $n = 5$. Only the last 30 volatilities have been selected because the correlations between earlier volatilities and the TRV are residual and, therefore, their explanatory power is considered to be non-significant. The historical data to compute all the aforementioned variables are obtained by using the quantmod (Ryan and Ulrich 2017) package from the R project (R Core Team 2017) and, as suggested by Hastie et al. (2009), they will be scaled to the range [0, 1] to improve the training of the algorithms.

Before beginning with the section related to the algorithms included within the first level, it is important to mention that the first 25% of the data is used to fit the first level algorithms, the next 50% is dedicated to the ANN estimation and the last 25% is the test set. The comparison of the benchmark models with the proposed one in terms of accuracy and risk measurement will be made with a different set of data containing the information of the following year (e.g. if data from 2000 to 2007 are used to train and test the Stacked-ANN model, the out-of-sample data selected for comparison purposes would be the market movements that happened during 2008).

Figure 1: Stacked-ANN model structure
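Equations (8) and (9) can be sketched as follows; the function names and the simulated return series are ours, used only to show the forward-looking response window versus the backward-looking explanatory windows.

```python
import numpy as np

def trv(returns, t, n=5):
    """Equation (8): forward-looking True Realized Volatility at time t."""
    window = returns[t:t + n]                  # r_t, ..., r_{t+n-1}
    return np.sqrt(np.mean((window - window.mean()) ** 2))

def realized_vol(returns, t, n=5):
    """Equation (9): backward-looking volatility from already observed returns."""
    window = returns[t - n:t]                  # r_{t-n}, ..., r_{t-1}
    return np.sqrt(np.mean((window - window.mean()) ** 2))

rng = np.random.default_rng(0)
r = rng.normal(0, 0.01, 100)                   # placeholder daily returns
# At time t = 60: explanatory variables are the last 30 backward volatilities,
# the response is the forward-looking TRV.
features = np.array([realized_vol(r, t) for t in range(31, 61)])
target = trv(r, 60)
```

Note the asymmetry: $V_t$ only uses information available at time $t$, while $TRV_t$ looks five days ahead, which is what makes it a valid forecasting target.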
3.2 First level: algorithms and hyper-parameter optimization

The methods applied to optimize the hyper-parameters of the algorithms within the first level of the Stacked-ANN architecture are introduced below:

• Minimization of the Mean Square Error (hereinafter, MMSE) over the whole database used to train the first level algorithms.

• Circular Block Bootstrap (CBB). This method (Politis and Romano 1991) generates new samples by selecting random blocks from the original database. The length of these blocks is fixed and the procedure to calculate it was introduced by Politis and White (2004) and Patton et al. (2009). CBB can only be applied to stationary time series.

• Stationary Bootstrap (hereinafter, SB) (Politis and Romano 1994). As in the case of CBB, this method can only be used with stationary time series. However, the difference with the former method is that the length of the blocks, instead of being fixed, is randomly selected with a certain average that can be calculated using different approaches (see Politis and White 2004 and Patton et al. 2009).

• Maximum Entropy Bootstrap (hereinafter, MEB) (Vinod 2006 and Vinod and de Lacalle 2009). Unlike the two previous approaches, stationarity is not required, as the new samples are obtained from the maximum entropy distribution of the original time series.

• H Cross-Validation (HCV). This method, introduced by Chu and Marron (1991), tries to avoid the effect of the correlation that can exist between the response and the explanatory variables while dealing with time series by eliminating h data points between them. The bandwidth selection is obtained by minimizing the absolute autocorrelation between the response and explanatory variables, with a maximum width of 100 days.

The optimum hyper-parameter combination for each of the five previous methods is obtained by applying grid search. Then, these combinations are tested against out-of-sample data (the following 50% of the database) to choose the most accurate option for fitting the algorithm.

As stated before, the first level of the stacked model architecture is composed of three algorithms: Random Forest (RF) (Breiman 2001), Gradient Boosting with regression trees (GB) (Friedman 2000) and Support Vector Machine (SVM) (Cortes and Vapnik 1995).
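A minimal sketch of the CBB re-sampling step is shown below. The helper name is ours, and the block length is assumed to have been obtained beforehand with the Politis-White procedure referenced in the text.

```python
import numpy as np

def circular_block_bootstrap(series, block_length, seed=0):
    """Resample fixed-length blocks with wrap-around (Politis and Romano 1991)."""
    rng = np.random.default_rng(seed)
    n = len(series)
    n_blocks = int(np.ceil(n / block_length))
    starts = rng.integers(0, n, size=n_blocks)      # random block start points
    # Wrap indices around the end of the series ("circular")
    idx = np.concatenate([(s + np.arange(block_length)) % n for s in starts])
    return series[idx[:n]]                          # trim to original length

x = np.arange(100, dtype=float)                     # placeholder stationary series
sample = circular_block_bootstrap(x, block_length=28)
```

Resampling whole blocks rather than individual observations preserves the short-range serial dependence of the series, which is why CBB and SB are suited to the volatility data used here.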
3.3 Second level: input data

As explained in Section 3.1, the first 25% of the data is dedicated to fitting the first level algorithms, while the following 50% and 25% are used for fitting the ANN and testing the results respectively. The explanatory variables given to the ANN are:

• As with the first level algorithms, the last 30 volatilities ($V_t, V_{t-1}, \ldots, V_{t-29}$) scaled to the range [0, 1].

• The True Realized Volatility forecasts made by the first level algorithms: Random Forest ($\widehat{TRV}_{t,RF}$), Gradient Boosting ($\widehat{TRV}_{t,GB}$) and Support Vector Machine ($\widehat{TRV}_{t,SVM}$).

The response variable is $TRV_t$ as defined in Section 3.1.

3.4 Second level: Stacking algorithm

As stated previously, the last step of the Stacked-ANN model is the fitting of the ANN, which is the algorithm stacking the forecasts made by the RF, GB and SVM. Before starting with the details of the ANN architecture, notice that the methods and procedures related to the hyper-parameter optimization are the same as for the first level algorithms: grid search in combination with the methods explained in Section 3.2, with the final hyper-parameter decision based on the out-of-sample error (last 25% of the database).

Below, the main characteristics and details of the stacking algorithm are presented:

• The feed-forward ANN has two hidden layers with 20 and 10 neurons respectively. The sigmoid activation function has been selected for all the neurons within the hidden layers, while the linear activation function has been used in the output layer, which is comprised of one neuron.

• The optimization algorithm selected is Adaptive Moment Estimation (ADAM), which was created by Kingma and Ba (2014). This method consists of a progressive adaptation of the initial learning rate, taking into consideration current and previous gradients. The default calibration proposed by the authors is applied: $\beta_1 = 0.9$, $\beta_2 = 0.999$.

• The number of epochs is 10,000 and the batch size is equal to the length of the data used for training the ANN.

• The backward pass calculations are done according to the selection of the root mean squared error as loss function.

• As indicated in Section 3.1, 50% of the information is selected for training the ANN, while the following 25% of the data is the test set. Note that the first 25% of the data is used to fit the first level algorithms.
• The parameter adjusting the level of L2 regularization ($\phi$) and the initial learning rate $\lambda$ used within ADAM are the hyper-parameters to be optimized during the estimation process.

Taking into consideration all the above-mentioned details, the $TRV_t$ forecasted by the Stacked-ANN model (S-ANN) is obtained by means of the following expression:

$$\widehat{TRV}_{t,S\text{-}ANN} = \hat{f}\left(\widehat{TRV}_{t,RF}, \widehat{TRV}_{t,GB}, \widehat{TRV}_{t,SVM}, V_t, V_{t-1}, \ldots, V_{t-29}\right) = h^{(3)}\left(\sum_{k=1}^{10} w^{(3)}_{1,k}\, h^{(2)}\left(\sum_{j=1}^{20} w^{(2)}_{k,j}\, h^{(1)}\left(\sum_{i=1}^{33} w^{(1)}_{j,i} x_i + w^{(1)}_{j,0}\right) + w^{(2)}_{k,0}\right) + w^{(3)}_{1,0}\right) \qquad (10)$$

As explained in Section 3.3, the inputs $x_i$ are the last 30 volatilities scaled to the range [0, 1] together with the forecasts made by the first level algorithms.
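The forward pass in Equation (10) can be sketched directly. Only the 33-20-10-1 architecture with sigmoid hidden layers and a linear output is taken from the paper; the weights below are random placeholders rather than fitted values.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, W1, b1, W2, b2, w3, b3):
    """Forward pass of the stacking ANN in Equation (10)."""
    h1 = sigmoid(W1 @ x + b1)     # first hidden layer, 20 neurons
    h2 = sigmoid(W2 @ h1 + b2)    # second hidden layer, 10 neurons
    return float(w3 @ h2 + b3)    # linear output neuron: TRV forecast

rng = np.random.default_rng(0)
# 33 inputs: 30 scaled volatilities + 3 first-level TRV forecasts
x = rng.uniform(0, 1, 33)
W1, b1 = rng.normal(0, 0.1, (20, 33)), np.zeros(20)
W2, b2 = rng.normal(0, 0.1, (10, 20)), np.zeros(10)
w3, b3 = rng.normal(0, 0.1, 10), 0.0
forecast = forward(x, W1, b1, W2, b2, w3, b3)
```

In the actual model these weights would be fitted with ADAM against the root mean squared error, as described in the bullet points above.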
4 Results

In this section, the data used in the empirical analysis, the fitting process and the final comparison between the Stacked-ANN and the benchmark models are presented.
In order to analyse the models under different market conditions, the algorithms have been trained and tested five different times with the S&P 500 volatilities observed in the following periods: 2000-2007, 2001-2008, 2002-2009, 2009-2016 and 2010-2017. As stated in Section 3.1, during the training and testing of the models the first 25% of each period is dedicated to fitting the first level algorithms, the next 50% is used to optimize the ANN, while the last 25% is reserved for testing purposes. The year after the aforementioned periods (2008, 2009, 2010, 2017 and 2018 respectively) has been used to compare the out-of-sample results of the Stacked-ANN with the benchmark models. The first three data-sets have been selected in order to analyse the performance of the models during the years after the financial crisis, when the markets were dominated by a highly volatile regime. Although the years influenced by the financial crisis are valuable for testing the accuracy of the volatility forecasting models, the last two data-sets have been selected in order to analyse the models' performance with the most recent data. Additionally, the lower level of volatility during the last periods, especially in 2017, allows assessing the robustness of the models by analysing them under different market conditions. In order to support the explanations given in this paragraph, Table 1 summarizes the moments of the TRV during the different periods selected to compare the models:

Table 1: True Realised Volatility statistics

Period      Mean    STD     Skewness  Kurtosis
Year 2008   0.022   0.016   1.510     4.519
Year 2009   0.015   0.008   0.853     3.248
Year 2010   0.010   0.006   0.854     3.736
Year 2017   0.004   0.002   0.911     3.369
Year 2018   0.009   0.006   1.406     4.702
Source: own elaboration

In addition, the Kolmogorov-Smirnov test has been applied sequentially to the TRV in order to assess statistically whether the behaviour of the volatility changes over the different periods. As 2008 is the year when the most extreme events related to the crisis happened and the market changed from a low to a high volatility regime, the skewness and mean of that year's volatility are higher than those of 2009. Accordingly, the aforementioned test rejects that the volatilities of 2008 and 2009 belong to the same distribution.

As the critical values are −.63 and −.43 with a probability of 5% and 1% respectively, it can be concluded that the data meet the stationarity requirements imposed by the CBB and SB methods.

Prior to the fitting of the algorithms, the parameters needed for the different bootstrap and cross-validation methods are obtained by means of the methodologies presented in Section 3.2. As the Stacked-ANN architecture comprises two different levels, the length of the blocks for CBB, the average block length for SB and the distance, h, to be used within the HCV method are obtained both for the data-set used to fit the first level algorithms and for the one dedicated to the second level. Table 3 summarizes these parameters and shows non-significant changes over time for the different periods and levels:

Table 3: Calibration of the elements for bootstrap and CV

Method            Period       Data for training      Data for training
                               1st level algorithms   2nd level algorithm
CBB block         (2000-2007)  28                     63
CBB block         (2001-2008)  36                     58
CBB block         (2002-2009)  40                     56
CBB block         (2009-2016)  39                     58
CBB block         (2010-2017)  38                     30
SB block average  (2000-2007)  25                     55
SB block average  (2001-2008)  32                     51
SB block average  (2002-2009)  35                     49
SB block average  (2009-2016)  34                     51
SB block average  (2010-2017)  33                     27
HCV length        (2000-2007)  26                     31
HCV length        (2001-2008)  31                     51
HCV length        (2002-2009)  31                     40
HCV length        (2009-2016)  32                     55
HCV length        (2010-2017)  35                     27

Source: own elaboration
As explained in Section 3.2, different approaches have been followed to find the optimum hyper-parameter combination. Table 4 shows the method that minimizes the out-of-sample error for each algorithm and period:

Table 4: Methods optimizing OOS error

Period       Stacking algorithm (ANN)  Random Forest  Gradient Boosting  Support Vector Machine
(2000-2007)  MEB                       SB             CBB                SB
(2001-2008)  CBB                       CBB            CBB                SB
(2002-2009)  CBB                       CBB            CBB                CBB
(2009-2016)  HCV                       HCV            HCV                SB
(2010-2017)  SB                        CBB            SB                 SB
Source: own elaboration

Regardless of the period, the empirical results suggest that CBB and SB outperform the rest of the methods. These outcomes are expected, as these two methods, based on re-sampling blocks from the original database, are specifically prepared to work with stationary time series. Table 5 summarizes the hyper-parameters suggested by the methods shown in Table 4:

Table 5: Final hyper-parameters

Period       Stacking algorithm (ANN)  Random Forest      Gradient Boosting  Support Vector Machine
(2000-2007)  φ = 0, λ = 0.…            N = 10, Obs = 24   B = 1479, λ = 0.…  γ = 0.…, ε = 0.…
(2001-2008)  φ = 0.…, λ = 0.…          N = 10, Obs = 107  B = 3000, λ = 0.…  γ = 0.…, ε = 0.…
(2002-2009)  φ = 0, λ = 0.…            N = 1, Obs = 37    B = 3583, λ = 0.…  γ = 0.…, ε = 0.…
(2009-2016)  φ = 0.…, λ = 0.…          N = 30, Obs = 118  B = 1000, λ = 0.…  γ = 0.…, ε = 0.…
(2010-2017)  φ = 0.…, λ = 0.…          N = 7, Obs = 175   B = 1000, λ = 0.…  γ = 0.…, ε = 0.…

Source: own elaboration

Here λ is the learning rate of the ANN and GB, φ the parameter adjusting the level of L2 regularization of the ANN, B the number of iterations performed while fitting the GB, N the number of variables randomly selected by the RF and Obs the minimum number of observations to be kept in the terminal nodes of every fitted tree within the RF architecture. Finally, γ refers to the parameter included within the radial basis function kernel (the lower the parameter, the higher the non-linearity) and ε defines the threshold where the error begins to be penalized by the SVM.

Once the Stacked-ANN is fitted, its performance is compared with the benchmark models explained in Section 2 (ANN, ANN-GARCH(1,1), ANN-EGARCH(1,1) and Heston Model). Before beginning with the comparisons, the three following remarks about the benchmark models have to be made:

• Due to the nature of the Heston Model, 20,000 simulations per day have been computed and their daily average has been taken to assess its accuracy.
• The GARCH(1,1) and EGARCH(1,1) models (included in the ANN-GARCH(1,1) and ANN-EGARCH(1,1) architectures respectively) have been estimated assuming Student-t innovations.

• The fitting procedure and architecture of the ANNs included within the ANN-GARCH(1,1), ANN-EGARCH(1,1) and ANN models are the same as those explained for the Stacked-ANN (see Section 3.4).

Table 6 shows the out-of-sample error for the different periods selected to compare the performance and robustness of the Stacked-ANN against the benchmark models. The results shown in this table suggest the following conclusions:

Table 6: Accuracy analysis

Model         RMSE: 2008   RMSE: 2009   RMSE: 2010   RMSE: 2017   RMSE: 2018
Stacked-ANN   0.01192      0.00534      0.00494      0.00254      0.00544
ANN-EGARCH    0.01332      0.00588      0.00537      0.00276      0.00571
ANN-GARCH     0.01335      0.00584      0.00539      0.00263      0.00575
Heston        0.02066      0.00714      0.00547      0.00359      0.00610
ANN           0.01526      0.00615      0.00541      0.00274      0.00590
Source: own elaboration.

• Regardless of the period, the Stacked-ANN outperforms other hybrid models based on auto-regressive methodologies such as ANN-GARCH and ANN-EGARCH. In relative terms, minor deviations are observed between the different periods.

• All the hybridized models tend to outperform the pure ANN model.

• As expected, given the extremely high volatilities observed during the financial crisis, the results show that, regardless of the model, the 2008 forecasts are less accurate. All the models minimize their error in the year with the lowest level of volatility, 2017.

• The forecasts made by the Heston Model tend to be the least accurate due to the non-predictive nature of this model.

In addition to the above-mentioned analysis, the risk measures obtained by using each of the volatility models are tested. In order to do so, a returns distribution is selected for each of the volatility forecasting methods. As described in Section 2, the Heston Model requires the changes in stock prices to follow a Brownian diffusion process. For the rest of the benchmark models and the Stacked-ANN (which are free of assumptions about the returns), a Student t-distribution has been combined with the different volatility forecasts. The Student t-distribution has been selected where possible because returns tend to be leptokurtic and heavier-tailed than the Normal distribution (McNeil et al. 2015).

Before analysing the results of the tests presented in Section 2, it is worth mentioning that the confidence level (99%) and number of days (10) selected are based on those set by the Basel Directive, whose aim is to monitor, amongst others, market risk. Table 7 shows the p-values of the tests dedicated to VaR (Kupiec and Christoffersen) and CVaR (AS1 and AS2). If 95% is set as the confidence level, the Stacked-ANN in combination with the Student t-distribution is the only model that produces an appropriate p-value for the Kupiec, AS1 and AS2 tests in every period under analysis.
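The combination just described, a volatility forecast plugged into a Student t-distribution to obtain the 10-day 99% VaR and CVaR, can be sketched as follows. The forecast value and the degrees of freedom below are illustrative placeholders, not the parameters fitted in the paper; the 10-day horizon is obtained with the usual square-root-of-time rule.

```python
# Hedged sketch: 10-day 99% VaR and CVaR from a daily volatility forecast
# combined with a Student t-distribution (placeholder parameters).
import numpy as np
from scipy import stats

sigma_daily = 0.012   # illustrative daily volatility forecast
nu = 5.0              # illustrative Student-t degrees of freedom
alpha = 0.01          # tail probability of a 99% VaR
horizon = 10          # 10-day horizon, as in the Basel Directive

# Scale the t quantile to unit variance, then to the horizon with the
# square-root-of-time rule; VaR is reported as a positive loss.
unit_var = np.sqrt((nu - 2.0) / nu)
q = stats.t.ppf(alpha, df=nu)                 # left-tail quantile (< 0)
var_10d = -q * unit_var * sigma_daily * np.sqrt(horizon)

# CVaR (expected shortfall) estimated by simulation from the same
# scaled t-distribution: average loss beyond the VaR threshold.
rng = np.random.default_rng(0)
sims = stats.t.rvs(df=nu, size=200_000, random_state=rng) \
       * unit_var * sigma_daily * np.sqrt(horizon)
cvar_10d = -sims[sims <= -var_10d].mean()
```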
All the models show difficulties in producing a p-value higher than or equal to 0.05 for the Christoffersen test, because VaR exceedances tend to happen within a short period of time instead of being spread over the period analysed. It is worth mentioning that the hybrid models taken as benchmarks (ANN-EGARCH and ANN-GARCH) also fail to produce an appropriate value for the Kupiec test in several periods while, as stated before, the proposed hybrid model (Stacked-ANN) passes the test for every period. Finally, the Heston Model tends to produce less appropriate risk measures due to the distribution constraint mentioned previously.

Table 7: P-values of the VaR and CVaR tests

Model          Test      2008   2009   2010   2017   2018
Stacked-ANN    Kupiec    0.85   0.84   0.65   0.85   0.85
               Christ.   0.01   0.79   0.02   0.01   0.01
               AS1       0.66   0.85   0.61   0.90   0.91
               AS2       0.56   0.63   0.36   0.67   0.69
ANN-EGARCH     Kupiec    0.12   0.12   0.84   0.03   0.03
               Christ.   0.00   0.00   0.01   0.03   0.03
               AS1       0.52   0.85   0.61   1.00   1.00
               AS2       0.07   0.19   0.62   0.91   0.91
ANN-GARCH      Kupiec    0.12   0.03   0.01   0.03   0.03
               Christ.   0.00   0.03   0.00   0.03   0.03
               AS1       0.51   1.00   0.77   1.00   1.00
               AS2       0.08   0.92   0.05   0.85   0.89
Heston Model   Kupiec    0.00   0.00   0.65   0.03   0.00
               Christ.   0.00   0.00   0.59   0.03   0.00
               AS1       0.00   0.01   0.83   1.00   0.06
               AS2       0.00   0.00   0.36   0.92   0.00
ANN            Kupiec    0.65   0.04   0.65   0.30   0.29
               Christ.   0.02   0.00   0.00   0.00   0.00
               AS1       0.24   0.86   0.59   0.81   0.00
               AS2       0.29   0.11   0.35   0.24   0.00
Source: own elaboration.
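The Kupiec unconditional-coverage (proportion-of-failures) test behind the first row of each block in Table 7 can be sketched as follows. The toy exceedance series, its length and breach count are illustrative placeholders, not the paper's backtest data.

```python
# Hedged sketch of Kupiec's (1995) proportion-of-failures test for a
# 99% VaR series: a likelihood-ratio test of whether the observed breach
# frequency matches the nominal tail probability.
import numpy as np
from scipy.stats import chi2

def kupiec_pof(exceedances: np.ndarray, alpha: float = 0.01) -> float:
    """p-value of Kupiec's test.

    exceedances -- boolean array, True where the loss exceeded the VaR.
    alpha       -- nominal tail probability (1% for a 99% VaR).
    """
    n = exceedances.size          # number of forecast days
    x = int(exceedances.sum())    # observed VaR breaches
    pi_hat = x / n                # empirical breach frequency
    # Likelihood-ratio statistic; guard the degenerate 0/n and n/n cases.
    if x == 0:
        lr = -2.0 * n * np.log(1.0 - alpha)
    elif x == n:
        lr = -2.0 * n * np.log(alpha)
    else:
        lr = -2.0 * ((n - x) * np.log((1.0 - alpha) / (1.0 - pi_hat))
                     + x * np.log(alpha / pi_hat))
    return float(chi2.sf(lr, df=1))  # asymptotically chi-squared(1)

# Toy series: 3 breaches in 250 days, close to the 2.5 expected for a
# well-calibrated 99% VaR, so the test should not reject.
rng = np.random.default_rng(0)
hits = np.zeros(250, dtype=bool)
hits[rng.choice(250, size=3, replace=False)] = True
p_value = kupiec_pof(hits, alpha=0.01)
```

The Christoffersen test adds a second likelihood-ratio component checking that breaches are independent over time, which is why clustered exceedances fail it even when the breach count is correct.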
Conclusions
This paper introduces a Stacked-ANN model based only on Machine Learning techniques, with the aim of improving the accuracy of the volatility forecasts made by other hybrid models based on a combination of GARCH or EGARCH with ANNs. Its predictive power and performance have been tested in terms of RMSE, VaR and CVaR. Two main results have to be pointed out. Firstly, the Stacked-ANN has been able to generate more accurate volatility forecasts than other models in a highly volatile period like the one that occurred after the Financial Crisis of 2007-2008. The models outperformed by the Stacked-ANN during that time lapse are other hybrid models such as ANN-GARCH and ANN-EGARCH, the most widely used stochastic volatility theory (the Heston Model) and a feed-forward ANN without any combination with other algorithms or statistical models. Notwithstanding the Stacked-ANN's performance, it is observed for every model that the higher the volatility, the lower the accuracy. In addition to this analysis, the Stacked-ANN has been tested with the most recent data (2017 and 2018) in order to check its performance under current market conditions. As occurred with the tests carried out for the financial crisis, the proposed architecture outperforms the benchmark models in terms of accuracy. The superior performance shown by the Stacked-ANN in periods with different levels of volatility is due to the model's flexibility. In contrast with ANN-GARCH or ANN-EGARCH, the inputs introduced in the stacked ANN model do not follow any theoretical assumption about the returns distribution or volatility. As explained throughout Section 3, the proposed architecture uses previous volatilities and the forecasts made by a random forest, gradient boosting with regression trees and a support vector machine as inputs.
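A minimal sketch of the stacking idea summarized above: base forecasts from a random forest, gradient boosting and a support vector machine are fed, together with lagged volatilities, into an ANN meta-learner. The synthetic series, lag depth and hyper-parameters below are placeholders, not the architecture tuned in the paper.

```python
# Hedged sketch of a stacked volatility model: three base learners
# produce level-1 forecasts that an ANN combines with lagged volatility.
import numpy as np
from sklearn.ensemble import RandomForestRegressor, GradientBoostingRegressor
from sklearn.svm import SVR
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
vol = np.abs(rng.standard_normal(500)) * 0.01   # synthetic "volatility" proxy

# Features: the previous 5 volatilities; target: today's volatility.
window = 5
X = np.array([vol[t - window:t] for t in range(window, vol.size)])
y = vol[window:]
X_tr, y_tr = X[:400], y[:400]

# Level-0 (base) learners, fitted on the training sample.
base = [
    RandomForestRegressor(n_estimators=50, random_state=0),
    GradientBoostingRegressor(n_estimators=100, random_state=0),
    SVR(kernel="rbf"),
]
level1 = np.column_stack([m.fit(X_tr, y_tr).predict(X) for m in base])

# Level-1 meta-learner: an ANN taking lagged volatilities plus the
# three base forecasts as inputs.
meta_X = np.hstack([X, level1])
ann = MLPRegressor(hidden_layer_sizes=(16,), max_iter=2000, random_state=0)
ann.fit(meta_X[:400], y_tr)
forecast = ann.predict(meta_X[400:])              # out-of-sample forecasts
```

In a proper stacking setup, the base forecasts fed to the meta-learner would themselves be produced out-of-sample (for example out-of-fold); here they are in-sample predictions for brevity.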
Before moving on to the second point of the conclusions, it is worth mentioning that it has been empirically demonstrated that block bootstrap methods are of special interest when fitting algorithms to volatility, as these procedures are specifically designed to work with stationary time series.

Secondly, the forecasts made by the volatility models have been combined with a given distribution in order to compute the VaR and CVaR for all the periods analysed. The distribution selected has been the Student t-distribution for every model, with the exception of the Heston Model, which requires changes in asset prices to follow a Brownian diffusion process. The empirical results demonstrate that only the Stacked-ANN model is able to produce an appropriate p-value for the Kupiec, AS1 and AS2 tests in every period under analysis, including those related to the financial crisis.

The aforementioned flexibility and predictive power of the Stacked-ANN compared with other volatility models suggest further investigating the implications of using this model for derivative valuation purposes. As the price of these instruments is closely related to the volatility of the underlying assets, further research should be done in order to compare the implied volatilities observed in the market with those arising from the proposed model. If the volatility measured by the Stacked-ANN is more accurate than market expectations, it would be possible to identify under- and overvalued derivatives.

References
Acerbi, C. and B. Szekely (2014). Backtesting expected shortfall. Risk, 1–14.

Aharon, D., I. Gavious, and R. Yosef (2010). Stock markets bubble effects on mergers and acquisitions. The Quarterly Review of Economics and Finance 50(4), 456–470.

Andersen, T. (2009). Stochastic volatility. In Encyclopedia of Complexity and System Sciences. Springer Verlag.

Andersen, T. and B. Sorensen (1999). GMM estimation of a stochastic volatility model: A Monte Carlo study. Journal of Business and Economic Statistics 14, 329–352.

Armano, G., M. Marchesi, and A. Murru (2005). A hybrid genetic-neural architecture for stock indexes forecasting. Information Sciences 170(1), 3–33.

Arneric, J. and T. Poklepovic (2016). Nonlinear Extensions of Asymmetric GARCH Model within Neural Network Framework. AIRCC Publishing Corporation, Chennai, India.

Artzner, P., F. Delbaen, J.-M. Eber, and D. Heath (1999). Coherent measures of risk. Mathematical Finance 9(3), 203–228.

Back, Y. and H. Kim (2018). ModAugNet: A new forecasting framework for stock market index value with an overfitting prevention LSTM module and a prediction LSTM module. Expert Systems with Applications 113, 457–480.

Bauwens, L., C. Hafner, and S. Laurent (2012). Handbook of Volatility Models and Their Applications. Wiley Handbooks in Financial Engineering. Wiley.

Bektipratiwi, A. and M. Irawan (2011). A RBF-EGARCH neural network model for time series forecasting. pp. 1–8.

Bildirici, M. and O. Ersin (2009). Improving forecasts of GARCH family models with the artificial neural networks: An application to the daily returns in Istanbul Stock Exchange. Expert Systems with Applications 36(4), 7355–7362.

Bildirici, M. and O. Ersin (2014). Modelling Markov switching ARMA-GARCH neural networks models and an application to forecasting stock returns. The Scientific World Journal 2014.

Bishop, C. M. (2006). Pattern Recognition and Machine Learning (Information Science and Statistics). Berlin, Heidelberg: Springer-Verlag.

Bollerslev, T. (1986). Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics 31(3), 307–327.

Bollerslev, T. (1990). Modeling the coherence in short-run nominal exchange rates: A multivariate generalized ARCH model. Review of Economics and Statistics 72, 498–505.

Bollerslev, T., R. Engle, and J. Wooldridge (1988). A capital asset pricing model with time-varying covariances. Journal of Political Economy 96, 116–131.

Breiman, L. (2001). Random forests. Machine Learning 45(1), 5–32.

Broto, C. and E. Ruiz (2004). Estimation methods for stochastic volatility models: A survey. Journal of Economic Surveys 18, 613–649.

Chang, E., C. Han, and F. Park (2017). Deep learning networks for stock market analysis and prediction: Methodology, data representations and case studies. Expert Systems with Applications 83, 187–205.

Christoffersen, P. F., A. Bera, J. Berkowitz, T. Bollerslev, F. Diebold, L. Giorgianni, J. Hahn, J. Lopez, and R. Mariano (1997). Evaluating interval forecasts. International Economic Review 39, 841–862.

Chu, C.-K. and J. S. Marron (1991). Comparison of two bandwidth selectors with dependent errors. Annals of Statistics 19(4), 1906–1918.

Cortes, C. and V. Vapnik (1995). Support-vector networks. Machine Learning 20(3), 273–297.

Danielsson, J. (2004). Stochastic volatility in asset prices: Estimation by simulated maximum likelihood. Journal of Econometrics 64, 375–400.

de Faria, E., M. Albuquerque, J. González, J. Cavalcante, and M. Albuquerque (2009). Predicting the Brazilian stock market through neural networks and adaptive exponential smoothing methods. Expert Systems with Applications 36(10), 12506–12509.

Dias, F., R. Nogueira, G. Peixoto, and W. Moreira (2019). Decision-making for financial trading: A fusion approach of machine learning and portfolio selection. Expert Systems with Applications 115, 635–655.

Dickey, D. A. and W. A. Fuller (1979). Distribution of the estimators for autoregressive time series with a unit root. Journal of the American Statistical Association 74(366a), 427–431.

Durbin, J. and S. Koopman (1997). Monte Carlo maximum likelihood estimation for non-Gaussian state space models. Biometrika 84, 669–684.

Engle, R. (1982). Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica 50, 987–1007.

Engle, R. (2002). Dynamic conditional correlation: A simple class of multivariate generalized autoregressive conditional heteroskedasticity models. Journal of Business and Economic Statistics 20, 339–350.

Engle, R., C. Granger, and D. Kraft (1984). Combining competing forecasts of inflation with a bivariate ARCH model. Journal of Economic Dynamics and Control 8, 151–165.

Engle, R. and F. Kroner (1995). Multivariate simultaneous generalized ARCH. Econometric Theory 11, 122–150.

Engle, R. and G. Lee (1999). A permanent and transitory component model of stock return volatility, pp. 475–497. Oxford: Oxford University Press.

Engle, R., V. Ng, and M. Rotschild (1990). Asset pricing with a factor-ARCH covariance structure: Empirical estimates for Treasury Bills. Journal of Econometrics 45, 213–238.

Friedman, J. H. (2000). Greedy function approximation: A gradient boosting machine. Annals of Statistics 29, 1189–1232.

Gestel, T., J. Suykens, D. Baestens, A. Lambrechts, and G. Laneknet (2001). Financial time series prediction using least squares support vector machines within the evidence framework. IEEE Transactions on Neural Networks 12(4), 809–821.

Glosten, L., R. Jagannathan, and D. Runkle (1993). On the relation between the expected value and the volatility of the nominal excess return on stocks. The Journal of Finance 48(5), 1779–1801.

Gupta, A. and B. Dhinga (2012). Stock market prediction using hidden Markov models. pp. 1–4.

Haas, M., S. Mittnik, and M. Paolella (2004a). Mixed normal conditional heteroskedasticity. Journal of Financial Econometrics 2, 211–250.

Haas, M., S. Mittnik, and M. Paolella (2004b). A new approach to Markov-switching GARCH models. Journal of Financial Econometrics 2, 493–530.

Haas, M. and M. Paolella (2012). Mixture and regime-switching GARCH models, pp. 71–102. John Wiley and Sons.

Hajizadeh, E., A. Seifi, F. Zarandi, and I. Turksen (2012). A hybrid modeling approach for forecasting the volatility of S&P 500 index return. Expert Systems with Applications 39(1), 531–536.

Hamid, S. and Z. Iqbid (2002). Using neural networks for forecasting volatility of S&P 500 index futures prices. Journal of Business Research 57(10), 1116–1125.

Hastie, T., R. Tibshirani, and J. Friedman (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition. Springer Series in Statistics. Springer New York.

Heston, S. L. (1993). A closed-form solution for options with stochastic volatility with applications to bond and currency options. Review of Financial Studies 6, 327–343.

Hull, J. (2015). Risk Management and Financial Institutions, 4th edition. Wiley and Sons, London.

Hutchinson, J., A. Lo, and T. Poggio (1994). A nonparametric approach to pricing and hedging derivative securities via learning networks. Journal of Finance 49, 851–859.

Kim, H. and C. Won (2018). Forecasting the volatility of stock price index: A hybrid model integrating LSTM with multiple GARCH-type models. Expert Systems with Applications 103, 25–37.

Kingma, D. P. and J. Ba (2014). Adam: A method for stochastic optimization. CoRR abs/1412.6980.

Kraft, D. and R. Engle (1982). Autoregressive conditional heteroskedasticity in multiple time series. Department of Economics, UCSD.

Kristjanpoller, W., A. Fadic, and M. Minutolo (2014). Volatility forecast using hybrid neural network models. Expert Systems with Applications 41(5), 2437–2442.

Kristjanpoller, W. and E. Hernández (2017). Volatility of main metals forecasted by a hybrid ANN-GARCH model with regressors. Expert Systems with Applications 84, 290–300.

Kristjanpoller, W. and M. Minutolo (2015). Gold price volatility: A forecasting approach using the Artificial Neural Network-GARCH model. Expert Systems with Applications 42(20), 7245–7251.

Kristjanpoller, W. and M. Minutolo (2016). Forecasting volatility of oil price using an Artificial Neural Network-GARCH model. Expert Systems with Applications 65(15), 233–241.

Kristjanpoller, W. and M. Minutolo (2018). A hybrid volatility forecasting framework integrating GARCH, artificial neural network, technical analysis and principal components analysis. Expert Systems with Applications 109, 1–11.

Krollner, B., B. Vanstone, and G. Finnie (2010). Financial time series forecasting with machine learning techniques: A survey. European Symposium on Artificial Neural Networks: Computational and Machine Learning.

Kupiec, P. H. (1995). Techniques for verifying the accuracy of risk measurement models. The Journal of Derivatives 3(2), 73–84.

LeCun, Y., Y. Bengio, and G. Hinton (2015). Deep learning. Nature 521(7553), 436–444.

Lowenstein, R. (2000). When Genius Failed: The Rise and Fall of Long-Term Capital Management. Random House.

Lu, X., D. Que, and G. Cao (2016). Volatility forecast based on the hybrid artificial neural network and GARCH-type models. Procedia Computer Science 91, 1044–1049.

Mandelbrot, B. (1963). The variation of certain speculative prices. Journal of Business 36, 394–419.

McCulloch, W. and W. Pitts (1943). A logical calculus of ideas immanent in nervous activity. Bulletin of Mathematical Biophysics 5, 127–147.

McNeil, A. J., R. Frey, and P. Embrechts (2015). Quantitative Risk Management: Concepts, Techniques and Tools. Princeton, NJ, USA: Princeton University Press.

Melino, A. and S. Turnbull (1990). Pricing foreign currency options with stochastic volatility. Journal of Econometrics 45, 239–265.

Monfared, S. A. and D. Enke (2014). Volatility forecasting using a hybrid GJR-GARCH neural network model. Procedia Computer Science 36, 246–253.

Nelson, D. B. (1991). Conditional heteroskedasticity in asset returns: A new approach. Econometrica 59(2), 347–370.

Patel, M. and S. Yalamalle (2014). Stock price prediction using artificial neural network. International Journal of Innovative Research in Science, Engineering and Technology 3(6), 13755–13762.

Patton, A., D. N. Politis, and H. White (2009). Correction to "Automatic block-length selection for the dependent bootstrap" by D. Politis and H. White. Econometric Reviews 28(4), 372–375.

Peng, Y., P. Melo, J. Camboim de Sá, A. Akaishi, and M. Montenegro (2018). The best of two worlds: Forecasting high frequency volatility for cryptocurrencies and traditional currencies with support vector regression. Expert Systems with Applications 97, 177–192.

Politis, D. and J. Romano (1991). A Circular Block-Resampling Procedure for Stationary Data. Purdue University, Department of Statistics.

Politis, D. N. and J. P. Romano (1994). The stationary bootstrap. Journal of the American Statistical Association 89(428), 1303–1313.

Politis, D. N. and H. White (2004). Automatic block-length selection for the dependent bootstrap. Econometric Reviews 23(1), 53–70.

Poon, S. and C. Granger (2003). Forecasting volatility in financial markets: A review. Journal of Economic Literature 41(2), 478–539.

R Core Team (2017). R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing.

Rajashree, P. and B. Ranjeeeta (2015). A differential harmony search based hybrid interval type2 fuzzy EGARCH model for stock market volatility prediction. International Journal of Approximate Reasoning 59, 81–104.

Roh, T. (2006). Forecasting the volatility of stock price index. Expert Systems with Applications 33(4), 916–922.

Ryan, J. A. and J. M. Ulrich (2017). quantmod: Quantitative Financial Modelling Framework. R package version 0.4-12.

Surkan, A. and Y. Xingren (2001). Bond rating formulas derived through simplifying a trained neural network. Proceedings of the IEEE International Conference on Neural Networks 2, 1028–1031.

Taylor, S. (1982). Financial returns modelled by the product of two stochastic processes: A study of daily sugar prices 1961–79, Volume 1, pp. 223–226. North-Holland.

Tse, Y. and K. Tsui (2002). A multivariate GARCH model with time-varying correlations. Journal of Business and Economic Statistics 20, 351–362.

Vinod, H. (2006). Maximum entropy ensembles for time series inference in economics. Journal of Asian Economics 17(6), 955–978.

Vinod, H. D. and J. L. de Lacalle (2009). Maximum entropy bootstrap for time series: The meboot R package. Journal of Statistical Software 29(5), 1–19.

Zhang, L., K. Zhu, and S. Ling (2018). The ZD-GARCH model: A new way to study heteroscedasticity.