Nonlinear Processes in Geophysics | 2019

Generalization properties of feed-forward neural networks trained on Lorenz systems

 
 

Abstract


Abstract. Neural networks are able to approximate chaotic dynamical systems when provided with training data that cover all relevant regions of the system s phase space. However, many practical applications diverge from this idealized scenario. Here, we investigate the ability of feed-forward neural networks to (1)\xa0learn\nthe behavior of dynamical systems from incomplete training data\nand (2)\xa0learn the influence of an external forcing on the dynamics. Climate science is a real-world example where these questions may be relevant: it is concerned with a non-stationary chaotic system subject to external forcing and whose behavior is known only through comparatively short data series. Our analysis is performed on the Lorenz63 and Lorenz95 models. We show that for the Lorenz63 system, neural networks trained on data covering only part of the system s phase space struggle to make skillful short-term forecasts in the regions excluded from the training. Additionally, when making long series of consecutive forecasts, the networks struggle to reproduce trajectories exploring regions beyond those seen in the training data, except for cases where only small parts are left out during training. We find this is due to the neural network learning a localized mapping for each region of phase space in the training data rather than a global mapping. This manifests itself in that parts of the networks learn only particular parts of the phase space. In contrast, for the Lorenz95 system the networks succeed in generalizing to new parts of the phase space not seen in the training data. We also find that the networks are able to learn the influence of an external forcing, but only when given relatively large ranges of the forcing in the training. These results point to potential limitations of feed-forward neural networks in generalizing a system s behavior given limited initial information. Much attention must therefore be given to designing appropriate train-test splits for real-world applications.

Volume 26
Pages 381-399
DOI 10.5194/npg-26-381-2019
Language English
Journal Nonlinear Processes in Geophysics

Full Text