[PDF] Modeling Fractional Polytropic Gas Spheres Using Artificial Neural Network

Abstract

Lane-Emden differential equations describe different physical and astrophysical phenomena that include forms of stellar structure, isothermal gas spheres, gas spherical cloud thermal history, and thermionic currents. This paper presents a computational approach to solve the problems related to fractional Lane-Emden differential equations based on neural networks. Such a solution will help solve the fractional polytropic gas spheres problems which have different applications in physics, astrophysics, engineering, and several real-life issues. We used Artificial Neural Network (ANN) framework in its feedforward back propagation learning scheme. The efficiency and accuracy of the presented algorithm are checked by testing it on four fractional Lane-Emden equations and compared with the exact solutions for the polytopic indices n=0,1,5 and those of the series expansions for the polytropic index n=3. The results we obtained prove that using the ANN method is feasible, accurate, and may outperform other methods.

Full PDF

11 Modeling Fractional Polytropic Gas Spheres Using Artificial Neural Network

Mohamed I. Nouh , Yosry A. Azzam and Emad A.-B. Abdel-Salam Astronomy Department, National Research Institute of Astronomy and Geophysics (NRIAG), 11421 Helwan, Cairo, Egypt Department of Mathematics, Faculty of Science, New Valley University, El-Kharja 72511, Egypt e-mail: [email protected]

Abstract:

Keywords:

Neural Network; Stellar structure; Polytropic gas sphere; Fractional Lane-Emden equation; Conformable fractional derivatives. Introduction

The nonlinear Lane-Emden differential equations (polytropic and isothermal) have a singularity at origin and possess only exact solutions for the polytropic index n=0, 1, and 5 [1-2]. In astrophysics, these equations could be used to model many problems such that; spherical cloud of gas, stellar structure, and galactic structure. There are several methods proposed to solve the integer version of these equations i.e. homotopy perturbation method [3], variational iteration method [4], Sinc-Collocation method [5], an implicit series solution [6], accelerated series solution [7-8] and Adomian decomposition method [9]. Recently and due to their wide applications in science and engineering, analytical solutions to the fractional nonlinear differential equations acquired great attention, [10-11]. The Newtonian stellar polytrope's fractional version was investigated by [12] for the fractional white dwarf model, [13] for the incompressible gas sphere, [14] for the fractional isothermal gas sphere. Nouh and Abdel-Salam [15] and [16] constructed fractional polytropic stellar models for white dwarfs with n=1.5 and solar type stars n=3. For n =3, [15-16] showed that solar-like stars could be modeled by dividing the interior into two layers with different fractional parameters. Neural networks (NNs) have acquired a solid role in many areas of human activity over the past decades and have found application in a wide range of scientific issues, including astronomy, geology, geophysics, and the environmental sciences [17-21]. It was commonly used in the areas of pattern recognition, data classification, prediction, function approximation, signal processing, medical diagnosis, modeling, and control, etc. [22-26]. The Artificial Neural Networks (ANN) try to imitate the biological brain mathematically, in which linear or nonlinear processes information and neuron models are connected in a parallel and distributed style. ANN performs computations at a much higher speed because of its massively parallel nature. They have the capabilities of learning and self-organization that can memorize and pick up a mapping between an input and an output vector space and synthesize an associative memory which recovers the correct output when the input is introduced and generalizes when new inputs are introduced [27]. Because of their excellent properties of fault tolerance, self-learning, adaptivity, nonlinearity, ANNs are used mostly nowadays for function approximation in numerical models [28]. The finite-time and adaptive finite-time synchronization principle in graph theory perspectives have been investigated under two different control strategies by Pratab et al.[29]. Zhou et al. [30] consider the finite-time synchronization of dynamic networks with nonlinear coupling strength and stochastic perturbations by using intermittent control. Zhang et al. [31] investigate a cluster of limitless, delayed neural networks spread. Besides, ANNs have been extensively used to solve linear and nonlinear differential equation related problems with integer or non-integer derivative and used various paradigms for ANN architecture [32-39]. Besides, Ahmad et al. [40] computed the solution of Lane–Emden type equations by the use of artificial neural networks (ANNs). Based on active-set (AS), interior-point (IP), and sequential quadratic programming (SQP) algorithms, local optimization procedures have been used in this research to optimize the energy functions. Jalab et al. [41] introduced a neural network based numerical method, for solving the integer and fractional Lane-Emden type equations. In the present paper, we will formulate the fractional Lane-Emden equation of the polytropic gas sphere and solve it analytically and train the ANN algorithm using tables of the fractional Emden functions and mass-radius relation computed by means of the accelerated series expansion method. For the numerical simulation, we use the ordinary feed-forward neural network to approximate the solution of fractional Lane-Emden equation of the polytropic gas sphere and mass-radius relation which is proved to have more benefits when compared to other computational methods. The structure we used is a three-layer feed-forward neural network that is trained using the back-propagation learning algorithm based on the gradient descent rule. The rest of the paper is organized as follows: Section 2 deals with the principle of the conformable fractional derivatives. The derivation of the polytropic Lane-Emden equation is performed in section 3. Section 4 is devoted to the neural network algorithm. In section 5, the results are outlined. We give the conclusion reached in section 6. Conformable fractional derivatives

Different definitions of fractional derivatives exist. Examples include Riemann–Liouville, Kolwankar–Gangal, Caputo, modified Riemann–Liouville, Cresson’s, and Chen’s fractal derivatives, [42] and [10]. The conformable fractional derivative (CFD) introduced by Khalil et al. [43] used the limits in the form: ( ) ( )( ) lim 0, (0,1],     −→ + −=    f t t f tD f t t (1) ( ) ( )0 (0) lim ( ).   + → = t f f t (2) Here ( ) (0)  f is not defined. This fractional derivative reduces to the ordinary derivative when = . The following properties are found in the conformable fractional derivative: , , 0, ( ) ,   − =  =  = p p D t p t p D c f t c (3) ( ) , ,    + = +  

D a f b g a D f b D g a b , (4) ( ) ,    = +

D f g f D g f D g (5) ( ) , ( ) ,    − = = df dfD f g D g D f t tdg dg (6) where , f g are two − differentiable functions and c constant is an arbitrary constant. Equations (5) to (6) are demonstrated by [43]. The corresponding fractional derivative of certain functions could be given by: , sin( ) cos( ), cos( ) sin( ),, sin( ) cos( ), cos( ) sin( ). ct ctct ct D e c t e D ct c t ct D ct c t ctD e c e D ct c ct D ct c ct                  − − − = = = −= = = − (7) Fractional Polytropic Gas Spheres

The polytropic equation of state has the form p K  = ,

1 1 n  = + . (8) Where K is the pressure constant and n is the polytropic index A self-gravitating object's equilibrium structure is derived from hydrostatic equations. The simplest case is a spherical, non-rotating, static configuration in which all macroscopic properties for a given equation of state are parameterized by a single parameter, e.g. central density. The equation representing the conformable fractional form for mass conservation and hydrostatic equilibrium is given by: r D M r    = (9) and r G MD P r   = − . (10) Rearrange Equation (10) we get r r D P G M    = − , (11) By conducting the first fractional derivative of Equation (11) we will get r r r rD D P G D M      = −   , (12) Combining Equations (11) and (12) we get r r rD D P G r       = −   , (13) or r r rD D P Gr      = −   (14) Now, by defining the u function (Emden function), which is a dimensionless function, as: nc u  = , (15) where  and c  are the density and central density respectively. The dimensionless variable x could be written as rx a  = . (16) Inserting Equations (8) and (12) in Equation (14) we get ncnc d a x d K G ua x d a x u d a x           = −   , (17) ( )( ) 4( ) ( ) ( ) n n nc cnc d uK d a x G ua x d a x u d a x        +    = −    . (18) The Emden's fractional derivative u could be written as ( 1) n n d d uu n udx dx   + = + . (19) Inserting Equation (19) in Equation (18) we get ( 1) 4 nn nc cnc n x uK d d u G ua x d x u d x        +  +  = −    , (20) or ( 1) 4 nn ncX X cnc n x uK D D u G ua x u      +  +  = −    , (21) rearrange ( ) ( 1) 14 n nc X X K n D x D u uGa x     − + = − . (22) Now by taking ( 1)4 nc K na G  − += , (23) then the fractional form of Lane-Emden equation is given by: ( ) nX X D x D u ux    = − . (24) with the initial conditions: (0) 1, (0) 0 x u D u  = = (25) where ( ), u u x = is the Emden function and   . Assume the transform X x  = , the Emden function will take the form [16] ( ) , mmm u X A X = =  (26) the series expansion coefficients are written as , 2( 2)( 3) kk QA kk k  + = −  + + (27) and mm i m ii Q m in m i A Q mm A −= = − − +    , (28) We get the series coefficients of the integer LEE by putting = in Equations (27) and (28). If we insert

0, 1, 2, 3 k = in Equations (27) and (28) we will get:

11, 0, 1, 0, , 0, , 0.6 120 nA A Q Q A A A A  = = = = = − = = =

Consequently, the series solution at = is reduced to the integer version of LEE [7] as

1( ) 1 ............6 120 n nu x x x = − + − For n = 0, 1 and 5, the exact solutions are given by

1( ) 1 ,6 xu x   = −    (29) ( ) sin , x xu x     −    =        (30) and

12 2

1( ) 1 3 xu x   −    = +      . (31) The radius, mass, and density of the polytrope could be given by [15] and [16] ( 1)( ) 4 4 nnc x x K n d uM x xG d x      − =   + = −        , (32) ( 1)4 nnc

K nR xG    − + =    . (33) Neural network algorithm 4.1 Mathematical modeling of the problem

The neural network architecture proposed to simulate the fractional Lane-Emden equation is as shown in Figure 1. The fractional Lane-Emden equation could be written as nX X D u D u ux   + = − . (34) To obtain a neural network solution along with the initial conditions (0) 1 u = and (0) 0 D ux  = , we perform the following [44]: Figure 1. ANN architecture proposed to simulate the fractional Lane-Emden equation.

First, we consider ( , ) t u x p as the approximate solution of the neural network for Equation (34) and can be formulated to be of the following form: ( , ) ( ) ( , ( , )), t u x p A x f x N x p = + (35) where the first term satisfies the initial or boundary values and the second term represents feed forward neural network with input vector x and p is the corresponding vector of adjustable weight parameters. The neural network output ( , ) N x p is given by ( , ) ( ), H i ii

N x p v z  = =  (36) where nj ij j ii z w x  = = +  and ij w denotes the weight from the input unit j to the hidden unit i , i v represents weight from the hidden unit i to the output, i  is the bias of the i th hidden unit, and ( ) i z  is the sigmoid activation function that has the form

1( ) 1 x x e  − = + . Now the derivative of networks output N for input vector j x is ( ) ( )1 1 1 ( , ) , ( ), j j H n hx x i i ij j i i ij xi i i

D N x p D v z w x v w D x           = = =   = = + = =        (37) Similarly, the n th fractional derivative of N , Equation (36), is ... ( )1 1 ( , ) , , , ( ), n times kj nn nx i i i i ik i ii k D N x p v P P w z       = = = = =   (38) Hence, the proposed approximate solution for the fractional Lane-Emden equation is given as ( , ) 1 ( , ), t u x p x N x p = + (39) which satisfies the initial conditions as: (0, ) 1 0. (0, ) 1 t u p N p = + = , (40) and ( , ) ( , ) ( , ) x t x D u x p x N x p x D N x p   − = + , (41) so (0, ) (0) ( , ) 0. ( , ) 0, x t x D u p N x p D N x p   − = + = (42) Now, if we considered the approximate solution represented by Equation (39), the problem is converted into an unconstrained optimization problem and the error quantity to be minimized can be given by

2( ) ( , ) ( , ) ( , ( , )) x t i x t i i t ii

E x D u x p D u x p f x u x px    = + −    , (43) where ( , ) ( , ) ( , ), x t x D u x p x N x p x D N x p   − = + (44) and ( , ) (1 ) ( , ) 2 ( , ) ( , ), x t x x

D u x p x N x p x D N x p x D N x p       − − = − + + (45) where ( , ) x D N x p  and ( , ) x D N x p  is given by Equations (37-38). For network parameter updating, we compute the fractional derivative of the neural network for input as well as for parameters of the network and train the neural network for the optimized value of parameters. Once the network is trained set up the network with optimized network parameters and compute ( , ) t u x p from ( , ) 1 ( , ). t u x p x N x p = + The conformable fractional derivative with respect to any of its inputs is equivalent to a feed-forward neural network N with one hidden layer, having the same values for the weights ij w and thresholds i  and with each weight i v being replaced with i i v P where k ni ikk P w = =  . Moreover, the transfer function of each hidden unit is replaced with the n th order fractional derivative of the sigmoid function. Therefore, the conformable fractional gradient of N with respect to the parameters of the original network can be obtained as: ( )(( 1) ) 1(( 1) ) ( )1, ii j kij nv i i ni i i n nw i i i i i j ij ik ik k j D N PD N v PD N x v P v w w            + −+ =  ==  = +     . (46) The network parameters updating rule can be given as, ( 1) ( ) i i i v v x v x a D N  + = + , (47) ( 1) ( ) i i i x x b D N   + = + , (48) ( 1) ( ) ij ij ij w w x w x c D N  + = + , (49) where , , a b c are learning rates,

1, 2,... , , i n = and

1, 2,... , j h = . In ANN, the main processing unit which can carry out localized information and can process a local memory is the neuron. The net input (z) at each neuron is calculated by adding the weights it receives to get a weighted sum of those inputs and add it with a bias (  ). Then the net input (𝑧) is passed through an activation function, resulting in the output of the neuron j u (as is shown in Figure 1). Different algorithms for training the neural network are found in the literature. The traditional and the most famous algorithm is the steepest descent algorithm which is also known as the error backpropagation (BP) algorithm [45]. The BP training algorithm is a gradient algorithm designed to minimize the mean square error between the actual output of a feed-forward net and the desired output. It requires continuously differentiable non-linearity. Although the convergence rate of this algorithm is slow [46], its stability is high compared to other training algorithms [47]. The mathematics of the gradient algorithm has to guarantee that a particular node has to be adjusted in direct proportion to the error in the units to which it is connected. The BP algorithm performs the steepest descent on a surface in a weight space whose height at any point in weight space is equal to the error measure. The error function which has to reduce can be written in the following form: x D E F x UD  =  (50) Here, F is some signed error measure, D is a set of training patterns at which error is to be evaluated and U represents the neural network output [48]. We can define the state of the unit to be the weighted sum of the output of the previous layer as [49]: pj ji pii S W O =  (51) The output, ( ) pj j pj O f S = (52) uses the sigmoid function. To get the correct generalization of the delta rule, ji W is set as: pp ji ji EW W   −  (53) It may be useful to see this derivative to be resulting from the product of two parts: one part reflecting the change in the net input to the unit and the other part representing the effect of changing a particular weight on the net input. Thus, we can write p p pjji pj ji

E E SW S W   =   (54) From (51), we can see that the second factor is: pj jk pk pikji ji

S W O OW W  = =   (55) Now, we can define: ppj pj ES  = −  (55) Equation (54) thus has the equivalent form p pj piji

E OW − = (56) This tells that to implement a gradient descent in E , we should make the weight changes according to: p ji pj pi W O  = (57)

Where  is the learning rate factor. It is interesting to see that there is a simple recursive computation of these δ’s that can be implemented by propagating an error signal back through the network. To compute equation (55), the chain rule is applied to write this partial derivative as the product of two factors, one factor reflecting the change in error as a function of the output of the unit, and the other one reflecting the change in the output as a function of changes in the input, p p pjpj pj pj pj E E OS O S    = − = −   (58) By (52), we can see that ' ( ) pj j pjpj O f SS  = (59) Which is the derivative of the compressing function j f for the jth unit, evaluated at the net input pj S to that unit. To compute the first factor, there are two cases. First, assume that the unit i U is an output unit of the network. In this case, it follows from the definition of p E that: ( ) p pj pjpj E T OO  = − − (60)

Substituting for the two factors in (58), we can get: ' ( ) ( ) pj pj pj j pj T O f S  = − (61) for any output unit j U . If j U is not an output unit, the chain rule is used to write p pk p pki pi kj pk kjk k i k kpk pj pk pj pk E S E EW O W WS O S O S     = = = −          (62) In this case, substituting for the two factors in (58) yields ' ( ) pj j pj pk pjk f S W  =  (63) Whenever j U is not an output unit. Equations (61) and (63) give a recursive procedure for computing the  ’s for all units in the network, which are then used to compute the weight changes in the network according to (57). Figure 2 shows a flow chart of a back-propagation off-line learning algorithm [49]. As is seen, a comparison of the output j u at the output layer with the target output j t is implemented using an error function that has the following form: ( )(1 ) j j j j j u t u u  = − − . (64) For the hidden layer, the error function takes the form: (1 ) j j j k kk u u w  = −  . (65) where 𝛿 𝑗 is the output layer error term, and 𝑤 𝑘 is the weight between the hidden and output layers. To update the weight of each connection, the error is replicated backward from the output layer to the input layer as follows: ( 1) ( ) ( ( ) ( 1)) ji ji j j ji ji w t w t u w t w t  + = + + − − (66) Learning rate 𝜂 has to be chosen such that it is neither very small leading to a slow rate of convergence nor too large leading to overshooting. The last term in Equation (66) is called the momentum term and is added with the momentum constant  to speed up the convergence of the back-propagation learning algorithm error and to help in kicking the changes over local increases in the energy function and pushing the weights to follow the overall downhill direction [50]. This term has the effect of adding a fraction of the most recent weight adjustment to the current weight adjustments. Both  and  terms are assigned at the beginning of the training phase and decide the network stability and speed [21], [27]. Initialize weights and biases Present input and desired output

Calculate actual output of hidden and output neurons

Adust weights by: ( 1) ( ) ( ( ) ( 1)) ji ji j j ji ji w t w t u w t w t  + = + + − −

If unit j is an output unit: ( )(1 ) j j j j j u t u u  = − − If unit j is a hidden unit: (1 ) j j j k kk u u w  = −  Change the training pattern

Training pattern: End ( )

21 1 P Jrms pj pjp j

E t uPJ = = = −   >= End

Increment the number of iteration ≠ Star t Figure 2. Flowchart of ANN back-propagation off-line training algorithm

For each input pattern, the process is repeated until the network output error is reduced to a pre-assigned threshold value. The final weights are frozen and used to obtain the exact fractional values of Lane-Emden differential equations during the test session. To assess the success and quality of the training, an error is calculated for the whole batch of training patterns using the root-mean-square normalized error which is defined as: ( )

21 1 P Jrms pj pjp j

E t uPJ = = = −   (67) where P is the number of training patterns, J is the number of output units, pj t is target output at unit j , and pj u is the actual output at the same unit j . An error of zero would indicate that all the output patterns calculated by the ANN match the expected values perfectly and that the ANN is well trained. Internal unit thresholds are adapted similarly by assuming they are connection weights on links from auxiliary constant-valued input. We have programmed the previous algorithms using C++ programming language running on Windows 7 of a CORE i7 PC. Numerical results and discussion 5.1 Preparation of the input data

To prepare the input data for the training procedure of the proposed ANN algorithm, we compute Emden functions and the physical parameters at various polytropic indices and fractional parameters. What concerns us in the present calculations is the polytropes having exact solutions, namely, polytropes with n=0, 1, 5. Besides, we study the polytropic mass-radius relation for normal stars with n=3. As a result, we will have four versions of the fractional Lane-Emden differential equation being extracted from Equation (24) which represents the different polytropic indices under study. These equations are: X X

D u D ux   + = − , (68) X X

D u D u ux   + = − , (69) X X

D u D u ux   + = − , (70) X X

D u D u ux   + = − , (71) for the polytropic indices n=0, 1, 3, 5 respectively. The general series solutions of the above four equations (Equation (26)), and by implementing the two recurrence relations, Equations (27-28), could be written as

11 6 120 ( ) n nu x x x   = − + −  (72) The series presented by Equation (72) not converge to the surface of the polytope, so it may be used only to model the region near the center of the stars. To allow the series to reach the surface of the polytropic sphere and consequently to the surface of the star, we used the acceleration technique proposed by [7]. The first step is to compare and check the accuracy of the zeroth calculated (this value is equivalent to the radius of the star) from Equation (72) with that of the exact solution presented by Equations (29-31). We used the code developed by [16] to calculate the zeroth of the Emden function ( x ) at different fractional parameters. Tables (1-2) illustrate the results for the polytopes with n=0 and n=1 respectively. The third column is the zeroth computed by Equation (26) for the series expansion, with the aid of the recurrence relation, Equations (27) and (28) and the second column represents the zeroth computed from the exact solution of Equations (29-31). As is shown in the table, the maximum relative error is about 1.6 %. The stellar mass-radius relation could be computed using Equations (32-33) and the series expansion for the first fractional derivative of Equation (72) is listed in Table (3) for mass-radius relation of the fractional polytrope with n=3 [16]. In this table, the ratio * 0 / R R is the ratio of the radius of the star to the radius of the sun and /* M M is the ratio of the star mass to the solar mass. Table 1: Radius of convergence for the n = 0 fractional polytropes.  x (exact) x (series) Absolute relative error 1 2.44 2.44 0 % 0.99 2.424 2.435 0.45 % 0.98 2.400 2.415 0.62 % 0.97 2.376 2.405 1.2 % 0.96 2.351 2.385 1.27 % 0.95 2.327 2.365 1.63 %

Table 2: Radius of convergence for the n = 1 fractional polytrope.  x (exact) x (series) Absolute relative error

1 3.14 3.14 0 % 0.99 3.110 3.114 0 % 0.98 3.078 3.085 0.23 % 0.97 3.047 3.054 0.23 % 0.96 3.015 3.035 0.7 % 0.95 2.984 3.0 0.53

Table 3: Mass-radius relation for fractional polytrope with n=3 [16].  * 0 / R R /* M M

1 1 1 0.99 0.969 0.950 0.98 0.951 0.909 0.97 0.933 0.874 0.96 0.915 0.840 0.95 0.897 0.809

The training phase of the proposed neural network is implemented by computing the distributions of the Emden functions and mass-radius relation for the values listed in the second column of Tables (4-5). Table 4: Training and testing data for the polytrope.

Training phase Testing phase n  

0 0.96, 0.97, 0.98, 0.99, 1 0.95 1 0.95, 0.97, 0.98, 0.99, 1 0.96 3 0.95, 0.96, 0.98, 0.99, 1 0.97 5 0.95, 0.96, 0.98, 0.99, 1 0.97

Table 5: Training and testing data for mass-radius and density-radius relations.

Training phase Testing phase n  

3 0.95, 0.98, 0.99, 1 0.96

The neural network (NN) used in this article uses two different configurations. For the polytropic case, the input layer of the NN has three individual inputs which are the polytropic index n , the fractional parameter  and the dimensionless parameter x ( x takes values from 0 to x , where x is the first zero of the Emden function as listed in Tables 1-2), while the output layer has 1 node for the Emden function u calculated for the same values of the input fractional parameter and dimensionless parameter x . For the mass-radius relation with a polytropic index n=3 and various fractional parameters  , the input layer of the NN has two individual inputs which are the radius and mass of the star, whereas the output layer has 2 nodes which are the radius and mass at the same values of the input fractional parameters. After testing different configurations of hidden neurons of 80,120 and 200 neurons in the NN (shown in Figure 1), it was found that one hidden layer containing 120 neurons gives the best network model to compute accurately the exact fractional values of the Emden function. This is shown in Figure (3) and Figure (4) below for both polytropic and mass-radius relation cases respectively. In Figure (3), it is clear that the 120 neurons in the hidden layer case are giving the least RMS error compared to the other two configurations along the whole cycle of NN training cycles. The same remark is applied for Figure (4) for mass-radius relation case in which we can see oscillations for the RMS errors during NN training cycles after which the error for the 120 neurons case decreases to its final value better than the other 2 configurations. Figure 3. RMS errors for a different number of hidden layer neurons for the polytropic case.

Figure 4. RMS errors for a different number of hidden layer neurons for the case of mass-radius relation. After various adjustments and modifications to the network parameters, the network converged to a threshold RMS error of 0.000015 for the polytrope training case and of 0.000057 for the mass-radius relation training case. During those training, we used values of  = 0.03 for the learning rate and  = 0.5 for the momentum. Those values for  and  were found to speed up the convergence of the back-propagation learning algorithm of our ANN without over-shooting the solution. In order to demonstrate the stability and convergence of the computed values of weight parameters of the layers of the network, the convergence behaviors of the weights of the input layer, bias and the weights of output layer ( w i , β i and ν i ) for the polytropic case are displayed in Figure (5). Similarly, the stability and convergence behaviors of the computed values of weight parameters of the layers of the network for the input layer weights, bias, and output layer weights for the mass-radius relation case are displayed in Figure (6). As is shown in these figures, the weight values are initialized to some random values where they converge to stable values after somewhat large iteration values. By the end of the training phase of the ANN, its algorithm is ready to compute the Emden functions for the polytropic indices and the fractional parameters (third column) listed in Table (4). The results for the fractional polytropes are illustrated in Figures (7-10) for the following pairs of the polytropic indices and fractional parameters, (n=0,  =0.95), (n=1,  =0.96), (n=3,  =0.97) and (n=5,  =0.97). For n=0, 1, the Emden functions are computed using the exact solutions (Equations (29-31)), where for the polytropic index n=3 and due to the lack of the exact solution, the series solution is considered. To achieve the accuracy of the calculations, the ANN and the series solutions are plotted in the figures with different colors. As it is clear, they overlap and cannot be distinguished. (a) The convergence of weights of the input layer (w i ) (b) The convergence of bias (β i ) (c) The convergence of weights of output layer (v i ) Figure (5) Convergence of input, bias and output weights for the polytropic case -1-0.500.511.5 0 100000 200000 300000 400000 500000 600000 700000 800000 w i Iteration

Convergence of input layer weights (w i ) w[1] w[2] w[3]-1-0.8-0.6-0.4-0.20 0 100000 200000 300000 400000 500000 600000 700000 800000 β i Iteration

Convergence of bias ( β i ) -0.1-0.0500.050.10.150.2 0 100000 200000 300000 400000 500000 600000 700000 800000 v i Iteration

Convergence of output layer weights (v i ) (a) The convergence of weights of the input layer (w i ) (b) The convergence of bias (β i ) (c) The convergence of weights of output layer (v i ) Figure (6) Convergence of input, bias and output weights for the mass-radius relation case -3-2-1012345 0 100000 200000 300000 400000 500000 600000 700000 800000 w i Iteration

Convergence of input layer weights (w i ) w[1] w[2] w[3]-1-0.8-0.6-0.4-0.20 0 100000 200000 300000 400000 500000 600000 700000 800000 β i Iteration

Convergence of bias ( β i ) v i Iteration

Convergence of output layer weights (v i ) v[1] v[2] Figure 7: The fractional Emden function of the polytrope with n=0 and  =0.95. The maximum relative error is 3.5%. Figure 8: The fractional Emden function of the polytrope with n=1 and  =0.96. The maximum relative error is 5%. Figure 9: The fractional Emden function of the polytrope with n=3 and  =0.97. The maximum relative error is 5.4%. Figure 10: The fractional Emden function of the polytrope with n=5 and  =0.97. The maximum relative error is 1%. The maximum relative errors are, 3.5%, 5%, 5.4%, 1% respectively. The large errors that are appeared in some regions in the curves may be attributed to computer accuracy. In astrophysics, observational verification of the theoretical mass-radius relationship was a prime objective of numerous studies that considered individual stars and stellar associations with strong determinations of mass and radius individual stars and radius [51]. Following this motivation, we tried to model the mass-radius relation of the fractional polytrope with n=3. Figure (11) displays the series and the ANN models of the mass-radius relations. As we see, the two models are generally in good agreement except for the middle part of the distribution. One of the computational difficulties that may cause this numerical instability when computing the mass of the polytrope, is the fractional derivative appeared in Equation (32).

Figure 11: The fractional mass-radius relation for the polytropic star with n=3 and  =0.96. The maximum relative error is 9%. Conclusions

In this paper, we introduced an artificial neural network approach for solving the fractional Lane-Emden equation of the polytropic gas spheres. We used the ANN in its feedforward back propagation learning scheme. The input data for the training phase and that for testing were created using the code developed by [16], where the analytical solution is performed using the series expansion method. We have predicted the distribution of the Emden functions for the polytropic indices having exact solutions, n=0, 1, 5. Also, the approach was successfully applied to model the mass-radius relation of the polytropic star with n=3. The results reached, had been compared with the exact solution as well as the series expansion solution. These results show that the ANN and the series solutions can barely be distinguished; which consequently means that the neural network works very well in predicting values of fractional Lane-Emden functions and small errors were found in the outputs. A possible weakness of the obtained results may arise from the nature of the ANN method; it somewhat more rigid than the numerical and the analytical methods used to solve the fractional Lane-Emden equation. When a neural network is trained in looking for all the parameters, it requires to learn in a version of the question that can use additional knowledge. Conflict of Interest:

The authors declare that they have no conflict of interest.

References [1] Chandrasekhar, S. (1939) An Introduction to the Theory of Stellar Structure. University of Chicago Press, Chicago, IL. [2] Horedt, G. P. (2004) Polytropes - Applications in Astrophysics and Related Fields, Astrophysics and Space Science Library, 306, Kluwer Academic Publishers, Dordrecht, 2004. [3] Chowdhury, M., Hashim, I. (2009) Nonlinear Anal. 10, 104. [4] Ibrahim, R. W., Darus, M. (2008) J. Math. Anal. Appl. 345, 871. [5] Podlubny, I. (1999) Fractional Differential Equations (Academic Press, San Diego, CA, USA. [6] Momani, S. M., Ibrahim, R.W.(2008) J. Math. Anal. Appl. 339, 1210. [7] Nouh, M. I. (2004) New Astron. 9, 467. [8] Nouh, M. I., Saad, A. S. (2013) Int. Rev. Phys. 7, 1. [9] Wazwaz, A. (2001) Appl. Math. Comp. 118, 287. [10] Herrmann, R. (2014) Fractional Calculus: An Introduction for Physicists, 2nd ed. (World Scientific, Singapore). [11] Uchaikin, V. and Sibatov, R. (2018) Fractional Kinetics in Space, World Scientific. [12] El-Nabulsi, R. A. (2011) Applied Mathematics, and Computation, 218, 2837. [13] Bayin, S. S., Krisch, J. P. (2015) Astrophys. Space Sci. 359, 58. [14] Abdel-Salam, E.A.-B. and Nouh, M. I. (2016) Astrophysics 59, 398. [15] Nouh, M .I. and Abdel-Salam, E.A.-B. (2018) EPJP, 133, 149. [16] Abdel-Salam, E.A.-B. and Nouh, M. I. (2020) New Astronomy, 76, 101322. [17] Weaver, W. B. (2000) Spectral Classification of Unresolved Binary Stars With Artificial Neural Networks", The Astrophysical Journal, 541, 298-305. [18] Tagliaferri, R., Ciaramella, A., Milano, L., Barone, F. and Longo, G. (1999) " Spectral analysis of stellar light curves by means of neural networks", Astron. Astrophys. Suppl. Ser. 137, 391-405. [19] Tagliaferri, R., Longo, G., et. al. (2003) Neural networks in astronomy, ELSEVIER, Neural Networks, 16, 297-319. [20] Faris, H., Alkasassbeh, M., Rodan, A. (2014) Artificial neural networks for surface ozone prediction: models and analysis. Pol. J. Environ. Stud. 23, 341–348. [21] Hamdy K. Elminir, Yosry A. Azzam, Farag I. Younes (2007) Prediction of hourly and daily diffuse fraction using neural network, as compared to linear regression models, Energy, 32, 1513-1523. [22] El-Mallawany, R., Gaafar, M. S., Azzam, Y. A. (2014) " Prediction of ultrasonic parameters at low temperatures for tellurite glasses using ANN", Chalcogenide Letters, 11, 227 – 232. [23] Al-Shayea, Q.K. (2011) "Artificial neural networks in medical diagnosis", Int. J. Computer. Sci., 8, 150–154. [24] Leshno, M., Lin, V.Y., Pinkus, A., Schocken, S. (1993) Multilayer feedforward networks with a nonpolynomial activation function can approximate any function, Neural Network. 6, 861– 867. [25] Lippmann, R.P. (1989) "Pattern classification using neural networks", IEEE Commun. Mag. 27(11), 47–50. [26] Zhang, G.P. (2000) "Neural networks for classification: a survey", IEEE Trans. Syst. Man Cybern. C, 30(4), 451–462. [27] Basheer, I.A., Hajmeer, M. (2000)" Artificial neural networks: fundamentals, computing, design, and application", Journal of Microbiological Methods, 43, 3–31 [28] Oludare Isaac Abiodun, Aman Jantan, Abiodun Esther Omolara, Kemi Victoria Dada, Nachaat AbdElatif Mohamed, Humaira Arshad (2018) " State-of-the-art in artificial neural network applications: A survey", Heliyon 4, e00938. doi: 10.1016/j.heliyon.2018.e00938. [29] Pratap, A., Raja, R., Cao, J. et al. (2020) Finite-time synchronization criterion of graph theory perspective fractional-order coupled discontinuous neural networks. Adv Differ Equ ,

97 https://doi.org/10.1186/s13662-020-02551-x [30] Zhou, Y., Wan, X., Huang, C., Yang, X. (2020) Finite-time stochastic synchronization of dynamic networks with nonlinear coupling strength via quantized intermittent control. Appl. Math. Comput. 376, Article 125157,https://doi.org/10.1016/j.amc.2020.125157 [31] Zhang, J., Huang, C. (2020) Dynamics analysis on a class of delayed neural networks involving inertial terms. Adv Differ Equ 2020, 120, https://doi.org/10.1186/s13662-020-02566-4 [32] Muhammad Asif Zahoor Raja · Junaid Ali Khan · Ijaz Mansoor Qureshi (2010) " A new Stochastic approach for solution of Riccati differential equation of fractional order", Ann Math. Artif. Intell, 60, 229–250, DOI 10.1007/s10472-010-9222-x [33] Muhammad Asif Zahoor Raja, Ijaz Mansoor Qureshi and Junaid Ali Khan, (2011) " Swarm Intelligence Optimized Neural Networks for Solving Fractional Differential Equations", International Journal of Innovative Computing, Information and Control, Volume 7, Number 11. [34] Muhammad Asif Zahoor Raja a, Muhammad Anwaar Manzar b, Raza Samar (2015) " An efficient computational intelligence approach for solving fractional order Riccati equations using ANN and SQP", Applied Mathematical Modelling, 39 , 3075–3093 [35] Hadian-Rasanan, A.H., Rahmatic, D., Gorgind, S., Parand, K. (2020) "A single layer fractional orthogonal neural network for solving various types of Lane–Emden equation", New Astronomy 75, 101307. [36] Pakdaman, M., Ahmadian, A., Effati, S., Salahshour, S.,Baleanu, D. (2017) "

Solving differential equations of fractional order using an optimization technique based on training artificial neural network", Applied Mathematics and Computation 293, 81–95 [37] Zúñiga-Aguilar, C.J., Romero-Ugalde, H.M. , Gómez-Aguilar, J.F. , Escobar-Jiménez, R.F. , Valtierra-Rodríguez, M. (2017) "Solving fractional differential equations of variable-order involving operators with Mittag-Leffler kernel using artificial neural networks", Chaos, Solitons and Fractals 103, 382–403. [38] Muhammad Asif Zahoor Raja, Saleem Abbas, Muhammed Ibrahem Syam, Abdul Majid Wazwaz (2018) "Design of neuro-evolutionary model for solving nonlinear singularly perturbed boundary value problems", Applied Soft Computing 62, 373–394. [39] Muhammad Asif Zahoor Rajaa, ∗ , Raza Samarb, Muhammad Anwar Manzarc, Syed Muslim Shah (2017) "Design of unsupervised fractional neural network model optimized with interior point algorithm for solving Bagley–Torvik equation", Mathematics and Computers in Simulation 132, 139–158 [40] Ahmad, I., Raja, M. A., Bilal, M. and Ashraf, F. (2017) Neural Comput & Applic, 28 (Suppl 1):S929–S944 [41] Jalab,H. A., Ibrahim, R. W., Murad, S. A., Melhum, A. I. and Hadid, S. B. (2012) AIP Conf. Proc. 1482, 414; doi: 10.1063/1.4757505. [42] Mainardi, F.(2010) Fractional Calculus and Waves in Linear Viscoelasticity: An Introduction to Mathematical Models (Imperial College Press, London). [43] Khalil, R., Al-Horani, M., Yousef, A., and Sababheh, M. J. (2014) Comput. Appl. Math. 264, 65. [44] Yadav, N., Yadav, A. and Kumar, M. (2015) An Introduction to Neural Network Methods for Differential Equations, Springer Briefs in Applied Science and Technology, Springer. [45]

Rumelhart, D.E., Hinton, G.E., Williams, R.J., et al. (1988) Learning representations by back-propagating errors. Cognit. Model. 5 (3), 1 [46] Yu, H., Wilamowski, B.M. (2011) Levenberg ––