Articles | Volume 16, issue 2
https://doi.org/10.5194/esd-16-475-2025
Perspective | Highlight paper | 28 Mar 2025

Potential for equation discovery with AI in the climate sciences

Chris Huntingford, Andrew J. Nicoll, Cornelia Klein, and Jawairia A. Ahmad
Abstract

Climate change and artificial intelligence (AI) are increasingly linked sciences, with AI already showing capability in identifying early precursors to extreme weather events. There are many AI methods, and a selection of the most appropriate maximizes additional understanding extractable for any dataset. However, most AI algorithms are statistically based, so even with careful splitting between data for training and testing, they arguably remain emulators. Emulators may make unreliable predictions when driven by out-of-sample forcing, of which climate change is an example, requiring understanding responses to atmospheric greenhouse gas (GHG) concentrations potentially much higher than for the present or recent past. The emerging AI technique of “equation discovery” also does not automatically guarantee good performance for new forcing regimes. However, equations rather than statistical emulators guide better system understanding, as more interpretable variables and parameters may yield informed judgements as to whether models are trusted under extrapolation. Furthermore, for many climate system attributes, descriptive equations are not yet fully available or may be unreliable, hindering the important development of Earth system models (ESMs), which remain the main tool for projecting environmental change as GHGs rise. Here, we argue for AI-driven equation discovery in climate research, given that its outputs are more amenable to linking to processes. As the foundation of ESMs is the numerical discretization of equations that describe climate components, equation discovery from datasets provides a format capable of direct inclusion into such models where system component representation is poor. We present three illustrative examples of how AI-led equation discovery may help generate new equations related to atmospheric convection, parameter derivation for existing equations of the terrestrial carbon cycle, and (additional to ESM improvement) the creation of simplified models of large-scale oceanic features to assess tipping point risks.

1 Introduction

Addressing climate change caused by fossil fuel burning presents a three-fold challenge to society and science. The first is to determine what constitutes a broad “safe” maximum level of global warming, for which there are already proposals of 1.5 or 2.0 °C (UNFCCC2015) above preindustrial times. One guide is to constrain global warming to levels that avoid triggering large-scale tipping points (TPs) (e.g. Abrams et al.2023) (or even a self-perpetuating cascade of TPs; Wunderling et al.2021), where major changes would occur to Earth system components for relatively small additional temperature increases. The second challenge, once a warming threshold is adopted, is supporting adaptation planning by determining detailed local changes in near-surface meteorology corresponding to that global temperature rise. Third is deriving greenhouse gas (GHG) emissions profiles compatible with the stabilization of global warming at prescribed target levels. Knowledge of such profiles may encourage mitigation plans to develop non-fossil-fuel energy sources sufficiently fast to prevent key global warming threshold exceedance. All three challenges depend on an accurate knowledge and simulation of the global coupled climate–carbon cycle system in response to fossil fuel burning. The reports of the Intergovernmental Panel on Climate Change (IPCC), of which the latest is the sixth assessment (IPCC2021), present the current state of such understanding, while also highlighting the substantial remaining uncertainties in key climate components. Such uncertainties aggregate, preventing the constraining of summary global parameters such as equilibrium climate sensitivity (ECS), which is global warming in a stabilized climate for doubling of atmospheric CO2 concentration. The range of ECS values estimated by Earth system models (ESMs) remains substantial (Forster et al.2021). Also, for many regions, there remains large uncertainty in how hourly to annual rainfall levels will change as GHGs rise (e.g. Tebaldi et al.2021), including for extremes (Lenderink et al.2017; Lenderink and Fowler2017). Uncertainty in ECS leads to poor knowledge of CO2 emissions reductions compatible with keeping global warming below a target such as 2 °C. Uncertainty in future changes to rainfall statistics prevents adaptation planning for any altered future flood or drought frequencies.

ESMs are complex computer codes designed to estimate climate change for prescribed trajectories of potential future atmospheric GHG concentrations or emissions. The basis of ESMs is the numerical discretization (at scales of typically 100 km) of equations that describe all Earth system features, including the oceans, land surface, atmosphere, and cryosphere, and their feedbacks. Analysis of ESM diagnostics has enabled breakthroughs in climate system understanding, and a community achievement is that approximately 20 research centres contribute model output to a common database available for evaluation by researchers. The latest ensemble of models is the Coupled Model Intercomparison Project version 6 (Eyring et al.2016). However, the large uncertainties (as noted above) are derived from differences between ESMs. Hence, a key requirement for climate researchers is understanding and removing such differences to create refined projections with smaller uncertainty bounds. An interim approach to uncertainty reduction is the method of emergent constraints (e.g. Hall et al.2019; Williamson et al.2021; Huntingford et al.2023), which searches for inter-ESM regressions between simulated quantities that are also measured and changes in importance in the future. Measurements use any robust identified regressions to constrain bounds on future quantities. However, while emergent constraints provide a powerful methodology to lower inter-ESM spread, ultimately all ESMs need improved equation representation yielding more accurate simulations.

As climate science has progressed through ESM development in recent decades, so have artificial intelligence (AI) algorithms. The potential applications of AI in society are vast, including opportunities to advance scientific discovery (e.g. Wang et al.2023). As expected, there are calls to apply AI to climate science (Jones2017) and in detail (e.g. Schneider et al.2017; Huntingford et al.2019; Reichstein et al.2019; Eyring et al.2024). Already AI has been found to have a strong ability to warn about emerging extreme climate events (e.g. Bi et al.2023; Lam et al.2023) and the timing or onset of key oscillatory features of the climate system, such as the Madden–Julian oscillation (MJO) (Delaunay and Christensen2022). Yet most AI algorithms are statistically based, creating interest in applying newer physics-informed methods (Karniadakis et al.2021) to support understanding climate components (Kashinath et al.2021). Physics-informed approaches strive to retain at least some consistency with known underlying process differential equations. Examples of applications include the reconstruction of atmospheric properties of tropical cyclone events (Eusebi et al.2024) and characteristics of extreme precipitation (Kodra et al.2020).

Even more recently, a branch of AI has emerged, termed “AI-led equation discovery”, which derives candidates for the governing equations that describe any dataset under investigation. Unlike physics-led approaches, the technique instead uses AI to discover hitherto unknown equations, with the method proposed by Raissi et al. (2019, their Sect. 4) and Champion et al. (2019), Brunton et al. (2016), and Rudy et al. (2017). As described above, the advancement of ESMs implies the development of the equations encoded in them. Hence, we consider how this AI technique may support ESMs by discovering any required missing equations and parameters.

2 AI methods, including for equation discovery

2.1 Background climate analysis methods and existing AI methods

The range of AI methods is vast. Selecting the correct one depends on data attributes such as frequency, spatial size, and extent of system non-linearity. If there are “labels” describing the effects searched for, this suggests using supervised rather than unsupervised algorithms. Advances in the knowledge of geophysical processes and related mathematical models have traditionally driven the development of ESMs. However, climate research has also been influenced by statistically based approaches and methods, some of which are precursors to more modern AI techniques. In this section we (1) review some traditional statistical approaches to climate analysis; (2) describe the application of generic and currently available AI algorithms to climate science; (3) review currently available AI algorithms in a general non-climate context; and (4) consider newer techniques, including physics-informed calculations, again for the broader application background. To achieve this summary and for all four points, we point to and make summaries of four influential textbooks (Fig. 1). We select “keywords” from some section headings of Storch and Zwiers (1999) for an initial statistical analysis of climate attributes. Early applications of machine learning applied to environmental issues, including forecasting and components of the climate system, are presented in Hsieh (2009). For a general but extensive overview of available machine learning algorithms, we use Murphy (2013). Moving more towards the central theme of this perspective, Brunton and Kutz (2022) summarize very current methods of data-driven machine learning, including physics-led techniques.


Figure 1Schematic illustrating the evolution of the application of statistical methods to climate research, as well as more recent general developments in AI methods. The techniques illustrated and applied to environmental research are based on sections of the books by Storch and Zwiers (1999) and Hsieh (2009). More generic AI developments, not necessarily used in climate research, are linked to parts of the books by Murphy (2013) and Brunton and Kutz (2022), where the latter describes newer physics-led algorithms. The top bar suggests using recent advances in AI that are capable of deriving underlying process equations to determine better features of the climate system where uncertainty remains. The call for applying AI-led equation discovery to climate research is the main subject of this commentary. We retain the idea that AI may also support climate research in ways not yet considered, as shown by the question mark.


In more detail, Storch and Zwiers (1999) describe the initial application of statistical methods to climate-related research, including probability theory, time series analysis, Eigen techniques, and empirical orthogonal functions (EOFs). The EOF method is popular for spatio-temporal analyses of physical climate variables (Smith et al.1996; Mu et al.2004; Hannachi et al.2007). EOFs reduce the degrees of freedom of key variables (such as sea surface temperatures; Smith et al.1996), often presented as spatial patterns capturing geographical modes of variability multiplied by time series of their magnitudes. EOFs enable a simpler way to characterize climate models and therefore allow easier comparison against gridded datasets (e.g. Mu et al.2004). Inspired by how the human brain is believed to operate, early neural networks evolved from the perceptron model to hidden-layer models (Hsieh2009). Standard multivariate regression and EOF methods contain strong implicit assumptions of linearity, while neural networks in all forms contain non-linear elements. Also of importance is the widespread application of Bayesian statistics in climate science. Bayesian statistics provide information on state variables, including a priori knowledge about such quantities, and therefore hint at the newer physics-led approaches. The application of deriving Bayesian probability distributions for climate quantities matured during the first decade of the 21st century. For example, Berliner et al. (2000) employ such methods to detect and attribute human forcing of the climate system, as represented by near-surface temperature fields. Boulanger et al. (2006) determine the dominant features of the temperature variability in South American data, which is then used to compare to the performance of the ESMs at generating such variations and from this weight such models and hence their future projections. Similar Bayesian-based analyses for near-surface temperature but for multiple regions across the globe are performed by Smith et al. (2009).
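To make the EOF idea concrete, the following minimal Python sketch computes EOFs from a synthetic (time × space) anomaly matrix via a singular value decomposition; the data, array dimensions, and variable names are purely illustrative and are not taken from any of the studies cited above.

```python
import numpy as np

# Minimal EOF sketch: decompose a (time x space) anomaly matrix with an SVD.
# The random field is a synthetic stand-in for, e.g., monthly SST anomalies
# on a flattened grid.
rng = np.random.default_rng(0)
n_time, n_space = 240, 500
field = rng.standard_normal((n_time, n_space))

anomaly = field - field.mean(axis=0)          # remove the time mean at each grid point
U, s, Vt = np.linalg.svd(anomaly, full_matrices=False)

eofs = Vt                                     # rows: spatial patterns (EOFs)
pcs = U * s                                   # columns: principal-component time series
explained_var = s**2 / np.sum(s**2)           # fraction of variance per mode

print("Leading EOF explains %.1f%% of variance" % (100 * explained_var[0]))
```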

Most early studies with AI inputs focus on supervised learning, where model training uses a subset of labelled target data, and the remaining data are used to test predictive performance. However, in recent decades, clustering algorithms have substantially increased the popularity of unsupervised learning. Unsupervised learning, by definition, raises the exciting possibility of algorithms that interpret climate data or ESM-based outputs in new ways. The broad analysis of Steinbach et al. (2003) uses clustering to identify climate indices that characterize many behaviours of the oceans and the atmosphere. Similarly, Lund and Li (2009) use clustering of autocorrelated climate time series to identify the areal limits of distinct climate zones. Graphical models, also an unsupervised method, provide a novel technique to find links between variables of interest. Ebert-Uphoff and Deng (2012) use such models to explore causal relationships between atmospheric circulations and provide a framework with much potential to discover further relationships between climate variables.

Many scientific problems, including climate research, involve the interpretation of exceptionally large datasets. The advent of “big data” methods has enabled the analysis of climate models and data to generate better forecasting methods, supporting earlier or enhanced warnings of extreme events. For example, Liu et al. (2008) assessed different data-driven learning methods to downscale weather forecasts to provide statistics of near-surface meteorological conditions at single points or very small spatial scales and to include predictions of local extremes. Additionally, the climate system contains strong and complex non-linear interactions that operate over multiple timescales, including those relevant to forecasting. Yet, despite this complexity, robust underlying reduced-complexity non-linear dynamical system descriptions may await discovery. Traditional scale analysis of underlying equations can reveal such dynamical systems, which led to the famous paper of Lorenz (1963), providing a three-variable system of coupled ordinary differential equations (ODEs) that simulate aspects of atmospheric convection. One suggestion is that newer algorithms, such as the long short-term memory (LSTM) technique (a recurrent neural network algorithm; Vlachas et al.2018) designed to forecast high-dimensional chaotic systems, may routinely identify the dominant processes in complex systems. Recent studies also highlight deep learning methods that analyse key datasets to improve medium-range weather forecasting (Bi et al.2023; Lam et al.2023). Although these two examples are for much shorter timescales than emerging climatic signals on decade-to-century timescales, they illustrate the usefulness of AI in extracting additional information from complex atmospheric data.

Reduced order models (ROMs) project the dominant processes described by partial differential equations (PDEs) onto low-rank spaces. The method improves the computational speed and optimization of any fully parameterized PDE system. As such, ROMs enable behaviours of the full PDE model to be better evaluated and understood more quickly. Proper orthogonal decomposition (POD) is a key method for creating a ROM, used to study complex spatio-temporally dynamic systems in fluid dynamics (e.g. for oceanic circulations; San and Iliescu2015), and provides a viable way to interpret ESM diagnostics. EOFs are an earlier form and a subset of ROMs. We suggest that the full utility of ROMs and their many potential configurations and applications are largely unexplored for climate science. Brunton and Kutz (2022) make a case for deriving a process interpretation for the main components of any ROM-type decomposition.

Despite remarkable progress, most AI methods are statistical in construction. There is a growing view (again reflecting the Bayesian viewpoint) that AI needs to recognize there are often underlying processes for which substantial knowledge exists. Hence, physics-informed learning is gaining traction, providing methods of constraining machine-learning-based predictions using physical laws. Karniadakis et al. (2021) reviewed embedding physics in machine learning and concluded that combined data and physics-based model integration is achievable even in uncertain and high-dimensional contexts. That research discussed several applications of physics-informed learning for inverse and ill-posed problems in fluid dynamics, thermodynamics, and seismology, illustrating the possibility of increased process consistency but expressed via neural network architectures.
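As an illustration of the physics-informed idea (and not an example drawn from the works cited above), the toy PyTorch sketch below fits a neural network to a simple decay ODE by penalizing the residual of an assumed governing equation alongside its initial condition; the equation, parameter value, and network size are arbitrary choices for demonstration only.

```python
import torch
import torch.nn as nn

# Toy physics-informed network: fit y(t) for the assumed ODE dy/dt = -k*y, y(0) = 1,
# by penalizing the ODE residual at collocation points (no labelled y data needed).
k = 1.5  # assumed known rate constant (illustrative)
net = nn.Sequential(nn.Linear(1, 32), nn.Tanh(), nn.Linear(32, 32), nn.Tanh(), nn.Linear(32, 1))
opt = torch.optim.Adam(net.parameters(), lr=1e-3)

t_col = torch.linspace(0.0, 4.0, 200).reshape(-1, 1).requires_grad_(True)  # collocation times
t0 = torch.zeros(1, 1)

for step in range(5000):
    opt.zero_grad()
    y = net(t_col)
    dydt = torch.autograd.grad(y, t_col, torch.ones_like(y), create_graph=True)[0]
    loss_physics = ((dydt + k * y) ** 2).mean()   # residual of the governing equation
    loss_ic = ((net(t0) - 1.0) ** 2).mean()       # initial-condition constraint
    (loss_physics + loss_ic).backward()
    opt.step()
```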

We now turn to a new frontier in AI development, which we suggest is of potential extensive use in climate science. In this next step, where uncertainty exists in the underlying physical processes, AI derives the underlying descriptive equations. Such discoveries can constitute either a full equation set or a smaller reduced-complexity set that captures the dominant system responses. The upper row of Fig. 1 presents this as an emerging direction for machine learning (ML) or AI (we use the terminology of ML and AI interchangeably; see Kuehl et al.2022, for precise definitions and how the two differ).

Equation discovery using ML is well positioned to advance our understanding of Earth's climate, which contains non-linear features, given that the basis of much AI is to find underlying non-linearities. A specific approach is symbolic regression, the most common AI-based approach to discovering equations implicit in data. This form of regression procedure searches a space of mathematical expressions to find the optimal combination (i.e. a symbolic model) that best fits the data. Sparse regression is a symbolic regression method with the advantage of diminishing the search space of possible terms in the equation discovery process, substantially reducing the likelihood of over-fitting to observed data. Brunton and Kutz (2022) place a special emphasis on a sparse regression method, sparse identification of non-linear dynamics (SINDy), a data-driven approach to uncover ROMs of systems with unknown spatial-temporal dynamics.

2.2 Symbolic regression methods for equation discovery to uncover unknown dynamics

Equation discovery techniques can be categorized as data-driven or knowledge-driven discovery (Tanevski et al.2020). These approaches involve inferring the best possible derived model structure and parameter values by ensuring minimal error between observations and model predictions. The former approach, considered to be general AI-led equation discovery, is applicable for systems where there is very little or no understanding of the underlying dynamics and, therefore, no obvious model structure pre-exists. The latter approach relies on existing expert knowledge of the system, in which those developing the discovery process ensure features of the existing models remain in the new derived equations.

An already established field of AI is that of explainable AI (Linardatos et al.2021). This approach defines a set of methods and techniques that provide accessible and understandable justifications for predictions made with ML models, which are often “black-box” models such as neural networks. However, we make an important distinction: AI-led equation discovery can be considered more useful, as a form of interpretable AI, due to its inherent ability to produce human-readable and interpretable mathematical expressions as outputs. By default, the equations themselves are generally explainable. However, in some cases where the generated equations are complex and unintuitive, explainable AI methods may be needed to make the expressions more comprehensible (Aldeia and De França2021).

Where the behaviours of a dynamical system are largely or completely unknown, an emerging method to determine the underlying equations is that of symbolic regression. The data-driven symbolic regression algorithm does not depend on user-specified prior knowledge of a system. Hence, unlike a usual regression task that involves a predefined model structure, symbolic regression finds the optimal model and its parameters that best fit the data.

The usual form of symbolic regression, which can effectively minimize both model complexity and prediction error, is sparse regression, which is the main focus of this section. However, we first note other methods, such as a deep-learning-based symbolic regression model proposed by Petersen et al. (2019) that uses a recurrent neural network with a “risk-seeking policy gradient” to generate better fitting expressions. This approach has been shown to be robust against noisy data.

Deep neural networks have the inherent capability to approximate non-linear functions, and, in certain set-ups, can also accurately approximate non-linear operators. For instance, the DeepONet model developed by Lu et al. (2021) can approximate a diverse range of non-linear continuous operators from data such as integrals, as well as implicit operators that represent deterministic and stochastic differential equations. We briefly mention an issue that can arise with AI methods, known as the “closed-world assumption” (Chen and Liu2018). This issue arises if not all relevant knowledge is contained within the available data forming the training dataset. This may lead to a situation where previously unseen dynamics not captured during an AI training period may be present in the data held for testing and is therefore not recognized by the model. AI models operating with this assumption cannot update themselves with new information especially in open and dynamic environments, where new features in data continually appear. We also note a novel data-driven method for solving ODEs and PDEs rather than “discovering” them, as introduced by Cao et al. (2023). In Cao et al. (2023), the Laplace neural operator is utilized for solving differential equations that can account for non-periodic signals, unlike the more well-known Fourier neural operator. The Laplace neural operator is an alternative approach to the more traditional numerical solvers and can be advantageous since it has the capability to rapidly approximate solutions over a wide range of parameter values and without the need for further training.

Another type of symbolic regression method is that of genetic algorithms (GAs) (Keren et al.2023). GAs can include prior physical knowledge of the system in the optimization procedure, and they work particularly well for systems with strong non-linearity. This technique involves building “trees” of random symbolic expressions and using stochastic optimization to perform the replacement and recombination of tree subsamples. Ultimately, this finds the combination of terms that best fit the data.
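As a hedged illustration of such tree-based symbolic regression, the sketch below uses the open-source gplearn package (one of several genetic-programming implementations, assumed available here rather than a tool used in this paper) to recover a simple synthetic law from data.

```python
import numpy as np
from gplearn.genetic import SymbolicRegressor

# Genetic-programming symbolic regression on synthetic data where the
# "unknown" law is y = x0**2 + 3*sin(x1); all settings are illustrative.
rng = np.random.default_rng(1)
X = rng.uniform(-2, 2, size=(500, 2))
y = X[:, 0] ** 2 + 3.0 * np.sin(X[:, 1])

est = SymbolicRegressor(
    population_size=2000,
    generations=20,
    function_set=("add", "sub", "mul", "sin", "cos"),
    parsimony_coefficient=0.01,   # penalize long expressions
    random_state=0,
)
est.fit(X, y)
print(est._program)               # prints the best symbolic expression found
```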

Common to these three symbolic regression methods (sparse regression, deep learning, and GAs) is an optimization procedure which finds a linear combination of (potentially non-linear) functions from a large functional space which best fits the underlying system behaviour. The quickest and most general approach is to use sparse regression, which substantially reduces the search space of possible functions. Such speed is needed, compared to a computationally inefficient “brute-force” method of looping over all combinations of possible contributing functions. Sparse regression also reduces the likelihood of over-fitting, generating equations with limited terms, although sufficient to explain the features of the underlying datasets. A popular sparse regression algorithm developed by Brunton et al. (2016), known as sparse identification of non-linear dynamics (SINDy), identifies the simplest (parsimonious) model that describes the dynamics of non-linear systems implicit in data. SINDy investigates time series data to extract interpretable and generalizable models in the form of ordinary differential equations evolving in time. In the event of multiple time series spanning a spatial region, SINDy can then determine partial differential equations. A general dynamical system model takes the form ẋ(t) = f(x(t)), where the vector x = [x1(t), x2(t), …, xd(t)]^T ∈ ℝ^d represents the state of the system at a single time instance, t, consisting of d system variables. The SINDy algorithm finds a function f : ℝ^d → ℝ^d defining the dynamics and time evolution of the system. Collecting a time history of the state x(t) across the m set of times t1, t2, …, tm produces the complete m × d data matrix.

(1) $X = \begin{bmatrix} \mathbf{x}^{T}(t_{1}) \\ \mathbf{x}^{T}(t_{2}) \\ \vdots \\ \mathbf{x}^{T}(t_{m}) \end{bmatrix} = \begin{bmatrix} x_{1}(t_{1}) & x_{2}(t_{1}) & \cdots & x_{d}(t_{1}) \\ x_{1}(t_{2}) & x_{2}(t_{2}) & \cdots & x_{d}(t_{2}) \\ \vdots & \vdots & \ddots & \vdots \\ x_{1}(t_{m}) & x_{2}(t_{m}) & \cdots & x_{d}(t_{m}) \end{bmatrix}.$

The symbolic regression task is to find the form of f from a time series of the state X(t) that maps to the derivative Ẋ(t) and that is valid across the m set of times t1, t2, …, tm at which data are available. In order to find a sparse representation of f, we first construct an augmented library Θ(X) consisting of n candidate functions. The individual functions contributing to f may include polynomial and trigonometric terms. This construction gives a library of dimensions n × m × d. We show this construction below, where the horizontal direction (size n) holds the candidate functions, the vertical direction (size m) the time steps, and “out of the page” is size d, the different state variables. In the matrix Θ(X) below, functions can include “cross terms”, so, for instance, a quadratic term X^2 for d = 2 would have x1^2 and x2^2 terms and additionally an x1x2 term (see Eq. 2 of Brunton et al.2016).

(2) $\Theta(X) = \begin{bmatrix} | & | & | & | & | & | & | & | & \\ 1 & X & X^{2} & X^{3} & \sin X & \cos X & \sin 2X & \cos 2X & \cdots \\ | & | & | & | & | & | & | & | & \end{bmatrix}.$

The sparse regression problem is then set up as Ẋ(t) = Θ(X)Ξ, where we want to solve for the matrix Ξ ∈ ℝ^(n×d), which contains vectors of n coefficients corresponding to the linear expansion for each of the d state variables, Ξ = [ξ1 ξ2 ⋯ ξd]. For simplicity, looking at this regression problem for only one system variable, let y be a vector of data measurements (i.e. a column of Ẋ), where y ∈ ℝ^m. The fitting procedure then attempts to minimize the difference between y and Θξ under the model y = Θξ, where Θ(X) ∈ ℝ^(m×n) and ξ ∈ ℝ^n. In the case of multiple variables, this becomes a multi-regression problem as previously introduced, and the minimization is a single sweep across all state variables and so is not the best fit for each individual variable.

Various sparse regression optimizers can solve for ξ. A common algorithm known as LASSO introduces sparsity to the regression procedure via an L1 regularization term: ξ = argmin_ξ ‖Θξ − y‖_2 + λ‖ξ‖_1. The key result of solving for Ξ using sparse regression is that the coefficient vectors that it obtains are sparse (where most entries are set to zero) due to the optimization procedure. This means that only a few non-linear terms in the candidate library are active and therefore included in the right-hand side of one of the row equations ẋ_k = f_k(x). This leads to a sparse representation of f and therefore parsimonious dynamical models.
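The following minimal sketch illustrates the regression set-up of Eqs. (1) and (2) on the Lorenz (1963) system, using sequentially thresholded least squares (the optimizer of the original SINDy paper) rather than LASSO; derivatives are taken from the known right-hand side purely for simplicity, and all choices (library, threshold, trajectory) are illustrative rather than prescriptive.

```python
import numpy as np
from scipy.integrate import solve_ivp

# Sketch of Eqs. (1)-(2): build X and Theta(X) from Lorenz-63 data, then solve
# X_dot = Theta(X) Xi with sequentially thresholded least squares (STLSQ).
def lorenz(t, s, sigma=10.0, rho=28.0, beta=8.0 / 3.0):
    x, y, z = s
    return [sigma * (y - x), x * (rho - z) - y, x * y - beta * z]

t = np.arange(0, 20, 0.002)
sol = solve_ivp(lorenz, (t[0], t[-1]), [-8.0, 7.0, 27.0], t_eval=t)
X = sol.y.T                                    # (m x d) state matrix, as in Eq. (1)
X_dot = np.array([lorenz(0, s) for s in X])    # exact derivatives, for simplicity

# Candidate library Theta(X): constant, linear, and quadratic cross terms (cf. Eq. 2).
x, y, z = X[:, 0], X[:, 1], X[:, 2]
Theta = np.column_stack([np.ones(len(t)), x, y, z, x*x, x*y, x*z, y*y, y*z, z*z])

def stlsq(Theta, dXdt, threshold=0.1, n_iter=10):
    Xi = np.linalg.lstsq(Theta, dXdt, rcond=None)[0]
    for _ in range(n_iter):
        Xi[np.abs(Xi) < threshold] = 0.0                 # enforce sparsity
        for k in range(dXdt.shape[1]):                   # refit the surviving terms
            big = np.abs(Xi[:, k]) >= threshold
            if big.any():
                Xi[big, k] = np.linalg.lstsq(Theta[:, big], dXdt[:, k], rcond=None)[0]
    return Xi

Xi = stlsq(Theta, X_dot)
print(np.round(Xi, 2))   # the non-zero entries recover the Lorenz coefficients
```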

A particularly comprehensive verification of the capability of sparse identification to derive equations is presented in Chen et al. (2021). In that analysis, and pretending to have no knowledge beforehand of the underlying equations, five fundamental governing equations are reproduced purely from data. These equations are those of Burgers, Kuramoto–Sivashinsky, non-linear Schrödinger, Navier–Stokes, and a reaction–diffusion equation. Although we place an emphasis in this section on sparse regression for equation discovery where we might have little or no knowledge of underlying model structures beforehand, this method may also be used where there exists some process understanding. Such an application is closer to a physics-informed approach. In these circumstances, the library Θ is restricted to take only a limited set of functional forms based on such a priori knowledge, possibly allowing faster convergence of the optimization procedure as some components of the dynamical system are known.

The SINDy algorithm can additionally include forcing variables in the sparse representation of the dynamics, known as “SINDy with control” (Brunton et al.2016). This configuration gives the ability to simultaneously disambiguate the internal dynamics of a system and the effect of forcing variables. For climate modelling, an external forcing variable could be a time series of GHG emissions, their atmospheric concentration levels, or radiative forcing that integrates the effect of all changes in different GHG concentrations. One of the important properties of dynamical systems is stability, which is not guaranteed with the standard SINDy regression algorithm. For physical systems involving fluid flows where the underlying equations are known to be energy-preserving, although also non-linear (e.g. having quadratic terms), the “trapping SINDy” algorithm is available, based on the Schlegel–Noack trapping theorem (Kaptanoglu et al.2021). This algorithm offers necessary conditions for the discovered models to be globally stable and energy conserving. We note that the confirmation of basic conservation properties is a cornerstone of ESM development and testing. The SINDy algorithm was originally used to only discover systems of ODEs but was quickly extended to search for PDEs, using an algorithm known as “PDEFIND”, which fully captures the spatial-temporal behaviour of dynamical systems (Rudy et al.2017).

There are computer packages that implement the SINDy algorithm and its configurations (for example, trapping capability), for ODE and PDE systems, such as the Python-based PySINDy package (Brunton et al.2016).
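For orientation, a brief example of typical PySINDy usage is given below; the time series is synthetic, and the optimizer and library settings are arbitrary choices rather than recommendations.

```python
import numpy as np
import pysindy as ps

# Fit a SINDy model with the PySINDy package (synthetic two-variable example).
t = np.arange(0, 10, 0.01)
x = np.column_stack([np.sin(2 * t), np.cos(2 * t)])   # stand-in time series data

model = ps.SINDy(
    optimizer=ps.STLSQ(threshold=0.05),               # sparse regression optimizer
    feature_library=ps.PolynomialLibrary(degree=2),   # candidate functions Theta(X)
)
model.fit(x, t=t)
model.print()                                         # prints the discovered ODEs
# Forcing/control variables ("SINDy with control") can be passed via model.fit(x, u=forcing, t=t).
```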

3 Potential applications of AI-led equation discovery

We discuss the potential application of AI-led equation discovery to three Earth system components. In each example, there is presently a deficiency in understanding, causing uncertainty in the representation of processes by equations and their parameterization. Each application falls into one of three categories.

In the first example, we address the requirement to better parameterize small-scale convective events at the larger scale to enable their planet-wide representation in coarser-scale Earth system models. In this instance, arguably, we do not understand the form of the governing equations valid at larger scales.

In the second example, we consider closing the global carbon cycle, where the largest uncertainty is often the magnitude of atmosphere–land CO2 exchanges. We suggest seeking parameters valid at the large ESM gridbox scale, although initially for placement in existing land equations. Due to parameter uncertainties, global land–atmosphere CO2 exchange is often derived as a residual, after contemporary CO2 emissions and changes in atmospheric and oceanic carbon content are accounted for (e.g. Canadell et al.2007), circumventing the use of a land surface model. However, while this provides valuable contemporary information, it prevents predictions of future land changes. Land surface models are improving (Blyth et al.2021) with new key processes already represented by equations, but their parameterization may apply only at the field scale or smaller, depending on data used for calibration. Yet the land surface is heterogeneous, providing an opportunity for algorithms to determine equation parameters that instead aggregate fine-scale processes to ESM gridbox scale. In some other instances, terrestrial processes do remain poorly understood, and so equation discovery may also identify additional equation terms that capture such effects. Hence we focus on whether AI may advance existing equations by deriving parameters valid at large spatial scales but note that discovery methods might also characterize missing processes in equation form.

Our third example concerns ocean circulations where the governing equations are fully understood at the local scale, but of interest is how their internal interactions aggregate to create regional and global responses. Spatially upscaled computationally fast equations can generate key knowledge of oceanic response for a broad range of potential future GHG trajectories that ESMs have not simulated. Many reduced-complexity large-scale ocean models exist, but the equations are presently estimates. We conjecture that AI-generated spatial aggregation may refine such equations. An additional benefit is that comparing these simpler models with large-scale oceanic measurement datasets may provide summary information on the performance of ESMs from which AI has derived the large-scale equations.

3.1 Large-scale parameterization of fine-resolution convective events

The representation of convection remains a major shortcoming in traditional ESMs, where grid scales of 50–200 km cannot explicitly resolve convection, necessitating parameterization. These empirical parameterizations simulate sub-grid vertical displacement of mass, energy, and water on the ESM gridbox scale, producing modelled rainfall. However, common convective parameterizations often fail to capture typical diurnal cycles of cloud cover and rainfall (Fosser et al.2015; Prein et al.2013), with rainfall estimates that are too frequent and light. Such parameterizations also struggle to represent long-lived convection that propagates across multiple gridboxes, organizing the atmosphere on the mesoscale (Stephens et al.2010). Meanwhile, rainfall intensities are rising with global temperatures, scaling with the water-holding capacity of a warmer atmosphere at 7 % K−1 on average, following the Clausius–Clapeyron relationship (e.g. Westra et al.2014). However, this statistic does not account for complex meso-scale dynamics unresolved by ESMs. Thus, shortcomings in sub-grid convection representation in ESMs have significant implications for climate change preparedness, limiting the reliability of future rainfall intensification estimates.
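As a rough worked illustration of this scaling (assuming pure Clausius–Clapeyron behaviour of about 7 % K−1 and nothing else), 3 K of warming would imply a factor of roughly 1.07³ ≈ 1.23, i.e. an increase in rainfall intensity of a little over 20 %, before any of the meso-scale dynamical effects that convection-permitting models resolve are accounted for.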

Convection is complex, with governing equations not amenable to analytical solution. Therefore, the current approach involves discretizing these equations and conducting convection-permitting (CP) simulations on high-resolution (< 10 km) model grids, which better represent convective storms (Kendon et al.2017) (see Fig. 2). Unfortunately, due to the high computational requirements of fine-resolution calculations, global climate CP simulations have yet to routinely emerge. Consequently, CP climate simulations are currently limited to specific spatial domains and time periods (Kendon et al.2021; Stevens et al.2019).

Nevertheless, these individual CP simulations enable us to assess the added value of such high-resolution projections, particularly by comparing them with diagnostics from lower-resolution climate models (e.g. Fosser et al.2024). However, due to the small number of simulations and their limited temporal extent, CP models currently provide little information on projection uncertainty or transient climate behaviour. An alternative is to perform multiple CP calculations in parallel for specific target regions using ESM boundary conditions. Such limited-area downscaling provides valuable regional information but prevents the modelling of large-scale feedbacks (“upscale effects”), which are expected to change and modulate how climate evolves as GHGs rise.

A key challenge for climate science is to derive mean large-scale governing equations that accurately capture the statistical properties of convective storms for inclusion in ESMs. These equations need to simulate how storm properties will respond to higher levels of GHGs and, crucially, how any changes feed back to the large-scale climate system. A promising strategy is to use AI to analyse CP simulations, treating these as “true data” despite being computer-generated (Rasp et al.2018). Processes that could be extracted from CP models to enhance ESM convective parameterizations include the interaction of storm-scale circulations with large-scale wind, temperature, and humidity fields (O'Gorman and Dwyer2018); the effects of convective upscale growth (Bao et al.2024); entrainment variability due to wind shear (Mulholland et al.2021; Maybee et al.2024); and the relative importance of thermodynamic versus dynamical drivers of precipitation changes under global warming (Klein et al.2021). Key target variables would include gridbox-mean temperature, humidity, and momentum for direct ESM use, as well as cloud cover properties (Grundner et al.2024) and distributions of convective precipitation and extremes.


Figure 2Explicit representation of convective storm circulations in a CP model. We show a simulated convective storm cross-section and for a single timestamp, centred on a storm updraught in a 4.4 km gridbox resolution convection-permitting climate model simulation (CP4-Africa; Senior et al.2021). The organized storm is visible as an area of > 90 % relative humidity (shading), with extensive cloud anvil across 600–350 hPa pressure levels vertically and extending to a horizontal scale (“x” axis) of > 500 km. Wind vectors indicate a high vertical velocity at the cross-section centre point (0 km), extending across a horizontal scale of approximately 30 km, these being typical features resolvable in this high-resolution CP model but missing in climate models. The resolved updraught circulation is co-located with very high rainfall intensity locally at 0 km (white line, second bottom “y” axis), which rapidly decreases as a function of distance and is specifically linked to the correct representation of the internal updraught circulation. These features of storm processes are expected to be sensitive to the background global warming level (e.g. Prein et al.2017). A key possibility for AI is to derive equation sets, potentially with stochastic components, that broadly aggregate these complex processes to scales of order 100 km, and so appropriate for inclusion in ESMs


Deriving equation sets via AI brings challenges, first in the large number of input variables that influence convection. While temperature, humidity, wind, and pressure fields may serve as the baseline, derived quantities like convective available potential energy (CAPE) and other vertical profile descriptors, along with spatially variable land characteristics (e.g. topography, vegetation, land use, soil moisture) and oceanic features (e.g. sea surface temperature, surface roughness), significantly impact convective processes. Omitting these variables from new equations could hinder the transferability of knowledge from CP simulations. Moreover, equations must also accurately bridge the scale gap between CP and coarser models, necessitating an algorithm that discovers fundamental relationships transferable to ESM resolutions (e.g. Grundner et al.2024).

Secondly, while equation discovery offers promise in generating transferable equations for process descriptions beyond their training domain (Ross et al.2023), it is important to verify they maintain physical consistency and adhere to fundamental principles like moisture and energy conservation. Thus, a question is whether both transferability and physical consistency challenges can be overcome by equation discovery targeting interpretability in ways that other methods cannot achieve. Expert judgement can constrain equation parameters within realistic physical limits, enhancing trustworthiness for extrapolation beyond training conditions (Jebeile et al.2023). Reliable equations for ESMs that capture convective behaviours will enhance projections of future rainfall extremes in future GHG-enriched environments. Moreover, better constraints on upscale changes in circulation and radiative feedbacks could improve cloud cover modelling in ESMs. A major concern is that some ESMs project very high simulated ECS values, often depending on how they represent climate change feedbacks on cloud features (Bjordal et al.2020). Dufresne and Bony (2008) provide a detailed disaggregation of direct and feedback drivers (including changes to cloud characteristics) that contribute to simulated global warming as GHGs rise.

As computing power advances, century-long global climate model ensembles at kilometre-scale may become feasible (Slingo et al.2022), offering more robust projections of convection and related storms as GHG levels rise. However, given the urgent need to understand climate impacts at fine scales, an AI-supported approach is likely invaluable. Equation discovery that captures local effects within a structure available for global calculations may offer an interim solution, reducing resource costs for large ensembles and uncertainty estimation, while providing crucial insights into future rainfall patterns.

3.2 Improving models of terrestrial carbon cycling

We consider modelling large-scale land–atmosphere carbon dioxide (CO2) exchanges. A substantial fraction of CO2 emissions are currently absorbed by the ocean and land surface. Whether this continues affects global climate policy, as decreased future natural “drawdown” implies that fewer emissions are compatible with any societal goal to restrict global warming, for example, to 2 °C above preindustrial levels. However, the magnitude of these fluxes, even for the contemporary period, is highly uncertain, despite efforts to constrain it (e.g. Chandra et al.2022). Budget calculations comparing emissions and atmospheric concentration changes can reveal with high accuracy the combined global land plus ocean CO2 drawdown and hence offset of emissions. However, Chandra et al. (2022) note (citing Friedlingstein et al.2020) that the balance between the land and ocean components is unknown within a gigaton of carbon per year. Approaches to reducing uncertainty in regional-to-global land–atmosphere CO2 fluxes include using FLUXNET towers (e.g. Baldocchi et al.2001) above strategic representative biomes, atmospheric CO2 measurements merged with atmospheric transport models generating atmospheric inversions (e.g. Table 1 of Kondo et al.2020) and forward modelling with dynamic global vegetation models (DGVMs) (e.g. Sitch et al.2008). Robust forward modelling allows quantifying flux changes expected for any future altered climatic state.

The challenge of simulating the land surface is different from that of the atmosphere. One generalization is that the equations and parameters describing atmospheric processes are well understood but admit rich behaviours, including local convection (Sect. 3.1), which may not be fully understood at large scales. The land surface, however, is modelled with simpler equations, including some components that are algebraic (i.e. not differential equations), but instead, the complexity is substantial heterogeneity in their parameterization. Variation in parameters can be due to multiple factors, including that a typical transect of land will contain many biomes or plant functional types, all having slightly different responses to imposed environmental variations. We propose AI-led approaches that quantify similar processes but recognize different levels of response at finer scales. AI methods may also successfully aggregate such spatial behaviours to generate equation parameters valid at very large scales for inclusion in ESMs.

The eddy covariance method measures high-frequency (many times per second) simultaneous fluctuations in vertical wind speed and a scalar quantity of interest, where their covariance statistic is linearly related to the land–atmosphere exchange of the scalar. There are a growing number of towers with such measurement devices installed on top of them, estimating momentum, heat, vapour, and CO2 exchanges. The operation of eddy covariance systems and related measurement databases are undertaken by the expanding FLUXNET network (Baldocchi et al.2001). These measurements already provide training data for ML methods to map from global Earth observation (EO) data products (that record land attributes) to estimates of surface fluxes (Tramontana et al.2016). This approach, named FLUXCOM, also entrains meteorological measurements at towers as additional driving variables. FLUXCOM then extrapolates tower knowledge using EO and meteorological data, generating global historical estimates of surface energy fluxes (Jung et al.2019) and CO2 exchange (Jung et al.2020).
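To sketch the flavour of such tower-to-globe statistical mappings (and emphatically not the actual FLUXCOM pipeline), the example below trains a random forest on synthetic predictor columns standing in for EO and meteorological drivers; all feature meanings and the flux relationship are invented for illustration.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

# FLUXCOM-style sketch: learn a mapping from EO and meteorological predictors to
# tower CO2 flux (NEE). Data are synthetic; column meanings are illustrative only.
rng = np.random.default_rng(42)
n = 5000
predictors = np.column_stack([
    rng.uniform(0, 1, n),      # e.g. a vegetation index from EO
    rng.uniform(0, 35, n),     # e.g. air temperature (degC) at the tower
    rng.uniform(0, 1000, n),   # e.g. shortwave radiation (W m-2)
])
nee = -2e-3 * predictors[:, 0] * predictors[:, 2] + 0.1 * predictors[:, 1] + rng.normal(0, 0.5, n)

X_train, X_test, y_train, y_test = train_test_split(predictors, nee, random_state=0)
model = RandomForestRegressor(n_estimators=200, random_state=0)
model.fit(X_train, y_train)
print("Held-out R^2:", round(model.score(X_test, y_test), 2))
# The trained model could then be applied to gridded EO/meteorological fields
# to extrapolate tower fluxes in space, in the spirit of FLUXCOM.
```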

We consider a slightly different approach to FLUXCOM. Using AI-led equation discovery without prior information will likely generate equations similar to existing knowledge, e.g. for surface energy partitioning (Monteith1981) and photosynthesis (Farquhar et al.1980). However, two (or more) biomes are often in close proximity, which requires the development of “two-source” models (e.g. Huntingford et al.1995), or descriptions of biomes with complex canopy structures are needed (e.g. Mercado et al.2007). We suggest an AI-led approach to building models of land–atmosphere CO2 exchange that is valid at the ESM gridbox scale and accounts for local-scale complexities. We would first use equation discovery methods to model land behaviours at FLUXNET sites. Revised equations would map driving data from EO retrievals that fall within the flux tower footprint (Schmid1994), along with FLUXNET meteorological measurements, to the tower data of land–atmosphere CO2 exchange. Such use of EO supports the suggestions of Chen et al. (2011). AI would derive any equation terms and parameters in addition to standard formulations to capture surface heterogeneity factors. The explicit dependencies on meteorological conditions support creating equations valid for any altered future background climatic conditions.
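As a toy illustration of deriving parameters for existing equations from tower data (using standard textbook functional forms, not equations proposed in this paper), the sketch below fits a rectangular-hyperbola light response plus Q10 respiration model of net ecosystem exchange (NEE) to synthetic half-hourly measurements; in the approach suggested here, the fitted parameters would instead be sought at the scale of the EO footprint or ESM gridbox.

```python
import numpy as np
from scipy.optimize import curve_fit

# Toy "parameters for existing equations" example: fit an assumed NEE model
# (rectangular-hyperbola GPP light response minus Q10 respiration) to synthetic
# flux-tower data. The functional forms and values are illustrative only.
def nee_model(drivers, alpha, gpp_max, r10, q10):
    par, t_air = drivers
    gpp = alpha * par * gpp_max / (alpha * par + gpp_max)   # light-response curve
    resp = r10 * q10 ** ((t_air - 10.0) / 10.0)             # Q10 respiration
    return resp - gpp                                        # NEE (positive = source)

rng = np.random.default_rng(3)
par = rng.uniform(0, 2000, 2000)                             # photosynthetic radiation
t_air = rng.uniform(0, 35, 2000)                             # air temperature (degC)
nee_obs = nee_model((par, t_air), 0.05, 25.0, 2.0, 2.1) + rng.normal(0, 1.0, 2000)

popt, _ = curve_fit(nee_model, (par, t_air), nee_obs, p0=[0.01, 10.0, 1.0, 2.0])
print("Recovered parameters:", np.round(popt, 2))
```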


Figure 3 Sentinel-2 L2A images capturing complex land surface information down to 10 m spatial resolution, in this example for (a) deforestation in Rondônia, Brazil (10° S, 65.7° W), with visible wildfire plumes (please zoom in to see fine-scale land use detail) and (b) a permafrost landscape in Putorana State Nature Reserve, Russia (71° N, 96.3° E) (Copernicus Sentinel-2, 2021). A wealth of high-resolution imagery now documents processes acting on fine-scale land surface patterns and temporal changes therein, opening new avenues for AI-led mapping of sub-grid surface complexity onto physical variables typically used in climate models.

Once it is established at FLUXNET sites that AI-derived revised equations successfully translate EO measurements to CO2 exchanges, EO and meteorological data fields can then force these equations elsewhere. Such spatial aggregation of local tower-based calculations to large scales would provide equations with parameters that capture, implicitly, fine spatial heterogeneity in the land surface, offering better ESM-based predictions of CO2 exchanges. ML-derived spatial aggregation would be a form of the technique known as computer vision. In Fig. 3 we present two representative images, with panel (a) showing complexity in a tropical rainforest location in South America with extensive land use and panel (b) showing permafrost at high latitude, where there is substantial variation in land attributes. An additional requirement of computer vision algorithms would be that they ignore locations in EO imagery where there are clouds or other masking factors such as smoke from fires (e.g. Fig. 3a).

Our proposed approach would become increasingly accurate as the eddy covariance network extends, noting that Papale (2020) requests FLUXNET expansion to support better estimates of global annual land–atmosphere CO2 exchanges. With more FLUXNET sites, it will be possible to more routinely test discovered equations at a broader range of locations. Where there are discrepancies, this may imply further missing processes in the equation set or a strong regional dependency of parameters, which our techniques may then quantify. As an example, the introduction of geochemical cycles beyond carbon in land models is still in its early stages, with Davies-Barnard et al. (2022) noting major differences in nitrogen cycling representation between ESMs. Furthermore, available data over ever-increasing time periods would allow more testing of whether AI-discovered equations capture climate-induced trends.

3.3 Dynamical system models of ocean circulation

The study of major oceanic circulations is conducted mainly with high-resolution numerical simulations, e.g. ESMs. However, the large computational time of such simulations maintains interest in faster summary models, such as coupled ODEs. Reduced-form operationally fast spatially aggregated bulk ODEs that evolve in time allow researchers to more readily scan parameters and a broader range of future climate forcings, enabling a better assessment of potential features of circulation stability. These simpler dynamical systems can enable levels of understanding not possible with restricted computer power constraining the number of possible perturbed-parameter full-complexity simulations.

Early model attempts at simplified descriptions of oceanic behaviour exist, including for the Atlantic Meridional Overturning Circulation (AMOC) (e.g. Stommel1961). More recently, simplified models have emerged that include atmospheric drivers and their impact on the important El Niño–Southern Oscillation (ENSO) (e.g. Timmermann et al.2003). Here, we present the Timmermann model in Fig. 4, in both schematic form (panel a) and as a bifurcation diagram (panel b), with details in the caption.


Figure 4A schematic and bifurcation diagram of the equatorial coupled ocean–atmosphere system as represented in the Timmermann model of ENSO. Panel (a): T1 (K) and T2 (K) are the sea surface temperatures of the western Pacific and eastern Pacific respectively, and τ is the wind stress on the ocean surface due to easterly trade winds, given in newtons per metre squared. Tsub (K) is the temperature below the mixed layer of depth Hm (m). The ocean upwelling velocity is denoted w (m s−1), and u (m s−1) is the atmospheric zonal surface wind. Diagram adapted from Dijkstra (2013). Panel (b): bifurcation diagram of eastern Pacific temperature T2 (K) as a function of zonal advection efficiency ϵ (dimensionless), showing solutions to Eqs. (3) to (4) and their stability. Diagram adapted from Timmermann et al. (2003). We suggest that AI-led equation discovery is well-positioned to investigate oceanic datasets in order to determine if the simplified model presented here remains the most appropriate to maximally represent the ocean–atmosphere system at very large scales.

ENSO is hypothesized to occur as follows. There is a positive ocean–atmosphere feedback process that activates ENSO, first suggested by Bjerknes (1969). The feedback process may start with weakened easterly trade winds, which reduces the strength of the ocean current responsible for drawing surface water away from the western equatorial Pacific. This in turn reduces the ocean upwelling of colder water from the deep ocean, flattening the thermocline. A buildup of warmer surface water in the equatorial east Pacific (El Niño) then emerges. As a result, we now have a reduced east–west sea surface temperature (SST) gradient that further weakens the Walker circulation (a positive feedback mechanism). However, after the El Niño matures, a negative feedback mechanism emerges to turn El Niño into a cold phase known as La Niña. The characteristic timescale of the phase oscillation is 2 to 7 years. In addition, the tropical Pacific SST also exhibits decadal variability related to decadal variations in the amplitude of ENSO (Timmermann et al.2001; Choi et al.2012).

When modelling the processes that cause irregular inter-annual El Niño occurrences, there are usually two approaches. The first is deterministic, albeit one in which chaotic behaviour arises in the large-scale dynamics of the coupled ocean–atmosphere system due to non-linear interactions. The other viewpoint assumes that this behaviour is only weakly non-linear and that the irregular oscillations are mainly due to stochastic noise. The former approach is more suited to reduced-complexity models of ENSO, which may be obtained by using AI-led equation discovery methods to uncover chaotic non-linear dynamics without the need for stochastic terms in the equations.

Modelling the positive and negative feedback mechanisms of ENSO initially led to four basic linear deterministic oscillator models. Such models are known as the delayed oscillator (Suarez and Schopf1988), the recharge–discharge oscillator (Jin1997), the western Pacific oscillator (Weisberg and Wang1997), and the advective–reflective oscillator (Picaut et al.1997). However, observed ENSO behaviour exhibits non-linear and irregular oscillations which are not accounted for in these simpler models. In contrast, a non-linear ENSO model developed by Timmermann et al. (2003) captures both the inter-annual oscillations and the decadal variability in El Niño events seen in observations and climate models. The details of the physical set-up are given in Jin (1997) and Zebiak and Cane (1987). The heat budget of the Timmermann model is given by two coupled first-order ordinary differential equations in time,

(3) $\frac{\mathrm{d}T_{1}}{\mathrm{d}t} = -\alpha\,(T_{1} - T_{r}) - \frac{u\,(T_{2} - T_{1})}{L/2},$

(4) $\frac{\mathrm{d}T_{2}}{\mathrm{d}t} = -\alpha\,(T_{2} - T_{r}) - \frac{w\,(T_{2} - T_{\mathrm{sub}})}{H_{m}},$

where the variables T1, T2, Tsub, Hm, w, and u are described in the caption to Fig. 4. The additional variables in Eqs. (3) and (4) are as follows. Tr (K) is the radiation equilibrium temperature in kelvin, L (m) is the basin width, and 1/α (d−1) represents a typical inverse timescale of thermal damping. Furthermore, we have the physically derived relations u/(L/2) = ϵβτ and w/Hm = −ζβτ, where ϵ and ζ are the strengths of zonal and vertical advection respectively (model bifurcation parameters) and are both dimensionless quantities. The variable β (km−1 s−1) quantifies the strength of the influence of thermocline depth perturbations on SSTs.

The subsurface temperature, Tsub, is defined as

(5) $T_{\mathrm{sub}} = T_{r} - \frac{T_{r} - T_{r0}}{2}\left[1 - \tanh\!\left(\frac{H + h_{2} - z_{0}}{h^{*}}\right)\right],$

where Tr0 is the temperature beneath the thermocline; h2 is the east equatorial Pacific thermocline depth (relative to a depth H) in metres; z0 is the depth in metres, for which the upwelling velocity w, in metres per second, becomes its characteristic value; and h* is the sharpness of the thermocline. The thermocline depths are calculated as follows:

(6) $h_{2} = h_{1} + bL\tau,$

(7) $\frac{\mathrm{d}h_{1}}{\mathrm{d}t} = -r\left(h_{1} + \frac{bL\tau}{2}\right),$

where h1 is the west equatorial Pacific thermocline depth in metres, 1/r (d−1) represents a dynamical adjustment timescale, and b is the efficiency of wind stress τ to drive the thermocline tilt. Wind stress is given by

(8) $\tau = \frac{\mu\,(T_{2} - T_{1})}{\beta},$

where μ (K−1 d−1) is a dynamical coupling coefficient. Overall this leaves us with six equations (Eqs. 3 to 8) and six unknown variables (T1, T2, Tsub, h1, h2, and τ). For the original parameter values used in this study, see Timmermann et al. (2003). In Timmermann et al. (2003), the bifurcation parameter, ϵ, is varied to understand how the non-linear zonal advection term generates chaotic behaviour.
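For readers wishing to experiment, a minimal sketch of integrating Eqs. (3)–(8) is given below; the parameter values are illustrative placeholders only (the original values are listed in Timmermann et al. 2003), so the trajectories produced should not be taken as reproducing the published bifurcation behaviour.

```python
import numpy as np
from scipy.integrate import solve_ivp

# Sketch of integrating Eqs. (3)-(8). Parameter values below are illustrative
# placeholders, NOT the values of Timmermann et al. (2003).
Tr, Tr0 = 303.0, 285.0                 # radiative equilibrium / sub-thermocline temperatures (K)
alpha, r = 1.0 / 180.0, 1.0 / 400.0    # thermal damping and adjustment rates (d-1)
L, Hm, H, z0, hstar = 1.5e7, 50.0, 100.0, 75.0, 50.0   # basin width and depth scales (m)
mu, beta, b = 5e-3, 1.0, 8e-6          # coupling parameters (placeholders)
eps, zeta = 0.1, 1.3                   # bifurcation parameters (dimensionless)

def rhs(t, state):
    T1, T2, h1 = state
    tau = mu * (T2 - T1) / beta                                            # Eq. (8)
    h2 = h1 + b * L * tau                                                  # Eq. (6)
    Tsub = Tr - 0.5 * (Tr - Tr0) * (1 - np.tanh((H + h2 - z0) / hstar))    # Eq. (5)
    u_over_halfL = eps * beta * tau                                        # u/(L/2)
    w_over_Hm = -zeta * beta * tau                                         # w/Hm
    dT1 = -alpha * (T1 - Tr) - u_over_halfL * (T2 - T1)                    # Eq. (3)
    dT2 = -alpha * (T2 - Tr) - w_over_Hm * (T2 - Tsub)                     # Eq. (4)
    dh1 = -r * (h1 + b * L * tau / 2.0)                                    # Eq. (7)
    return [dT1, dT2, dh1]

sol = solve_ivp(rhs, (0.0, 365.0 * 50), [302.0, 298.0, 0.0], max_step=1.0)
# sol.y[1] is the eastern Pacific temperature T2; with the original parameter
# values, scanning eps and zeta maps out the steady, oscillatory, and chaotic
# regimes discussed in the text.
```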

This low-order model simulates strong decadal El Niño conditions (El Niño bursting) which have been observed in full-complexity simulations, where the ENSO mode grows in amplitude and then quickly resets, from which the amplitude variations grow again. For small values of the zonal advection efficiency bifurcation parameter, ϵ, and a fixed value of ζ, the system is in stable equilibrium with a cold “tongue” in the eastern Pacific and a warm pool in the western Pacific, as shown in Fig. 4a. However, this steady mean state becomes unstable for a larger critical value of ϵ, leading to a Hopf bifurcation, and a stable periodic orbit appears, as illustrated in Fig. 4b. Further analysis (e.g. Fig. 6 in Timmermann et al.2003) of the dynamical system shows rich behaviour when both ϵ and ζ are varied simultaneously, resulting in ENSO amplitude modulations, period-doubling bifurcations, and chaos.

Relating to the premise of this paper, although the existing simpler models such as that repeated above appear to reproduce many features of ENSO, utilizing AI-led equation discovery offers a potential way to verify this dynamical behaviour using observational datasets and model simulations as training data. Do these newer algorithms back out a model with implicit Hopf bifurcations and low-dimensional chaos that match that found in Timmermann et al. (2003), or will they suggest refinements? We propose three main potential outcomes of using equation discovery in this context. Firstly, equation discovery may back out simpler deterministic models of ENSO, such as the four basic linear oscillator models mentioned previously. Most of these models can be generalized into a recharge oscillator framework (Jin and An1999). Secondly, with careful consideration of the variables and the physical set-up involved, as summarized in Fig. 4, the derived equations may provide verification of the Timmermann model by exhibiting similar non-linear chaotic behaviour for different values of the bifurcation parameters. If this is the case, we expect the discovered equations to emulate (1) low-amplitude biennial ENSO oscillations at low values of ϵ, (2) amplitude-modulated and chaotic behaviour at intermediate values of ϵ, and (3) large-amplitude ENSO oscillations with periods of 3–4 years at higher values of ϵ. A third possibility is that the learned equations provide a new description of ENSO behaviour which may come with (1) a better parameterization of the ENSO system, (2) an enhanced understanding of physical mechanisms leading to the generation of non-linear ENSO behaviours which agree better with observations and models, and (3) improved predictive skill of ENSO.

Due to the shortness of available observational records and the level of noise in the data, it can be difficult to determine from observations alone whether ENSO amplitude modulations are a statistical manifestation or a result of deterministic processes (Timmermann et al.2003). AI-led equation discovery methods such as sparse regression require only relatively short time series to discover the underlying dynamics and can remain effective in the presence of noise, so they may be well suited to this situation.
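As an indication of how such sparse-regression discovery proceeds, the sketch below applies the sequentially thresholded least-squares idea underlying SINDy (Brunton et al.2016) to a synthetic, noisy time series from a toy oscillator. The toy system, candidate library, noise level, and threshold are all assumptions made purely for illustration.

```python
# Minimal sketch of SINDy-style sparse regression (sequentially thresholded
# least squares; cf. Brunton et al., 2016) on a synthetic, noisy time series.
import numpy as np
from scipy.integrate import solve_ivp

rng = np.random.default_rng(0)

# 1. Synthetic "truth": a weakly damped non-linear oscillator (assumed example).
def truth(t, y):
    x, v = y
    return [v, -0.1 * v - x - 0.2 * x**3]

t = np.linspace(0.0, 40.0, 4001)
sol = solve_ivp(truth, (t[0], t[-1]), [2.0, 0.0], t_eval=t, rtol=1e-9, atol=1e-9)
X = sol.y.T + 0.005 * rng.standard_normal(sol.y.T.shape)   # add measurement noise

# 2. Time derivatives by centred finite differences.
dXdt = np.gradient(X, t, axis=0)

# 3. Library of candidate terms: polynomials in (x, v) up to third order.
x, v = X[:, 0], X[:, 1]
library = np.column_stack([np.ones_like(x), x, v, x**2, x*v, v**2,
                           x**3, x**2*v, x*v**2, v**3])
names = ["1", "x", "v", "x^2", "xv", "v^2", "x^3", "x^2 v", "x v^2", "v^3"]

# 4. Sequentially thresholded least squares: small coefficients are pruned.
def stlsq(Theta, dx, threshold=0.05, n_iter=10):
    coef = np.linalg.lstsq(Theta, dx, rcond=None)[0]
    for _ in range(n_iter):
        small = np.abs(coef) < threshold
        coef[small] = 0.0
        big = ~small
        if big.any():
            coef[big] = np.linalg.lstsq(Theta[:, big], dx, rcond=None)[0]
    return coef

for k, label in enumerate(["dx/dt", "dv/dt"]):
    coef = stlsq(library, dXdt[:, k])
    terms = [f"{c:+.3f} {n}" for c, n in zip(coef, names) if c != 0.0]
    print(label, "=", " ".join(terms))
```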

State-of-the-art ESMs still contain biases in the eastern equatorial Pacific (Timmermann et al.2018), which lead to problems in representing key physical processes and feedback mechanisms and so may impact their ability to accurately predict future changes. ESMs struggle to converge in their simulations of ENSO characteristics such as amplitude, period, and the asymmetry between El Niño and La Niña phases (Jin et al.2020). Yet inter-ESM disagreements are an opportunity for the AI-led discovery of reduced-complexity equations, as such reduced models are computationally fast and thus allow substantial sampling of different parameter values. If those parameters have process interpretations and if ESM projections can be mapped onto different values of them, the differences and uncertainties between ESMs can be better understood.
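A minimal sketch of how each ESM (or ensemble member) might be characterized by the best-fitting parameters of a reduced model is given below. The function run_reduced_model and the dictionary esm_sst_anomalies are hypothetical placeholders standing in for a real reduced-model integration and for ESM output.

```python
# Minimal sketch: characterizing individual ESMs by fitting the bifurcation
# parameters (eps, zeta) of a reduced ENSO-type model to each model's SST output.
# `run_reduced_model` and `esm_sst_anomalies` are hypothetical placeholders.
import numpy as np
from scipy.optimize import least_squares

def run_reduced_model(eps, zeta, t):
    """Placeholder for integrating a reduced ENSO model and returning the
    east-Pacific SST anomaly on the time axis t (years)."""
    # Stand-in signal so the sketch runs; a real version would call an ODE solver.
    return eps * np.sin(2 * np.pi * t / 4.0) * np.exp(-zeta * 0.01 * t)

def residuals(params, t, target):
    eps, zeta = params
    return run_reduced_model(eps, zeta, t) - target

t = np.arange(0.0, 100.0, 1.0 / 12.0)            # 100 years, monthly steps
esm_sst_anomalies = {                             # hypothetical ESM output
    "ESM-A": 1.2 * np.sin(2 * np.pi * t / 4.0),
    "ESM-B": 0.8 * np.sin(2 * np.pi * t / 3.5),
}

for name, target in esm_sst_anomalies.items():
    fit = least_squares(residuals, x0=[1.0, 1.0], args=(t, target),
                        bounds=([0.0, 0.0], [5.0, 5.0]))
    print(f"{name}: eps = {fit.x[0]:.2f}, zeta = {fit.x[1]:.2f}")
```

The fitted parameter values, one pair per ensemble member, then summarize inter-ESM differences in a low-dimensional, process-interpretable form.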

4 Discussion and conclusions

AI plays an increasing role in society (e.g. Makridakis2017) and will likely influence multiple aspects of scientific research (e.g. Xu et al.2021). AI methods are proposed to support the mitigation of climate change (e.g. Kaack et al.2022; Chen et al.2023; Rolnick et al.2023) and climate research itself (e.g. Reichstein et al.2019; Huntingford et al.2019; Eyring et al.2024). The basis for most AI algorithms is statistical, frequently introducing non-linearity into forms of regression. For instance, Murphy (2013) contains an early chapter on linear regression, to be viewed in the context of subsequent chapters on key AI algorithms. Recently, there has been an emphasis on constraining AI-based discoveries to be compatible with known theory about underlying processes, referred to as “physics-informed AI” (e.g. Karniadakis et al.2021). An early form of physics-informed activity has been the development and maintenance of data assimilation methods to generate historical gridded datasets of meteorological conditions, e.g. the European Centre for Medium-Range Weather Forecasts ERA5 reanalysis (Hersbach et al.2020). Reanalysis products merge data with forecasts, balancing the need to simultaneously remain within the uncertainty bounds of measurements and broadly satisfy the equations of atmospheric dynamics.
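As a schematic of the balance a reanalysis strikes, the sketch below performs a single scalar analysis update, weighting a model forecast against an observation by their assumed error variances; all numerical values are illustrative, and real assimilation systems work with high-dimensional states and flow-dependent covariances.

```python
# Minimal sketch of the data-assimilation idea behind reanalyses: one scalar
# analysis step blending a model forecast with an observation, each weighted
# by its error variance.  All numbers are illustrative.
forecast, forecast_var = 287.1, 1.0**2   # background temperature (K) and variance
obs, obs_var           = 288.0, 0.5**2   # observation (K) and error variance

# Kalman-type gain: how far the analysis moves towards the observation.
gain = forecast_var / (forecast_var + obs_var)
analysis = forecast + gain * (obs - forecast)
analysis_var = (1.0 - gain) * forecast_var

print(f"analysis = {analysis:.2f} K (gain = {gain:.2f}), "
      f"variance reduced from {forecast_var:.2f} to {analysis_var:.2f}")
```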

We propose advancing climate change science with the even newer AI technique of “equation discovery”. Although existing AI methods provide powerful insights into the features of large datasets, they remain essentially statistical and therefore may not estimate well “out of sample”. Yet the main requirement of climate science is to predict environmental regimes different from those of the present day or the recent past. If, instead, AI reveals process equations, this opens the opportunity for their assessment and parameterization. Such an understanding of processes may confirm that the equations have the predictive capability needed to describe new climatic regimes. This supports a fundamental role of applied mathematics in determining data-led equations for a system that can robustly simulate responses to alternative forcings.

It has already been suggested that AI be used to emulate the few limited-area, very-high-resolution atmosphere simulations available, for the forcings under which they have been run (e.g. Schneider et al.2023). Such emulators may be placed in ESMs to predict high-resolution atmospheric features, including average storm characteristics, at other locations. Extending such analyses to generate governing equations, possibly with stochastic components, would more rigorously capture high-resolution effects at alternative places and atmospheric GHG levels. Discovered equation sets, amenable to discretization in ESMs, may also simulate small-to-large-scale feedbacks. AI may also enable extending basic equations, allowing the discovery of additional perturbation terms or location-specific parameters, e.g. for capturing ecological responses with strong spatial heterogeneity. A further application is AI-based discovery of equations that are instead valid over substantial areas (e.g. averaging ESM projections), aggregating geographical variation and reducing to ODEs in time only that respond to changing GHGs. ODEs are often amenable to more complete analysis; even if exact solutions are unavailable, they can be tested for linear stability about equilibrium states and, for non-linear systems, for how parameter perturbations may activate TPs. TPs in the Earth system are the subject of intense research (e.g. Mckay et al.2022), given their strong potential impacts on society, yet the GHG levels estimated to cause their occurrence are highly ESM-specific (Drijfhout et al.2015). Mapping each ESM to a common simpler model enables ESMs to be characterized by effective parameters in such reduced-complexity representations. This approach may identify parts of the Earth system that, if measured better, would better determine the GHG levels likely to trigger TPs. Together, these factors have led to our three illustrative examples in Sect. 3 of how AI-led “equation discovery” may support climate change science.
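As an indication of the kind of analysis such reduced ODE systems permit, the sketch below numerically estimates the Jacobian of a generic right-hand side at an equilibrium and inspects its eigenvalues to test linear stability as a forcing parameter changes. The example system, a simple normal form with a forcing parameter, is an illustrative stand-in rather than any specific Earth system component.

```python
# Minimal sketch: linear stability of an ODE equilibrium via the eigenvalues of
# a numerically estimated Jacobian.  The example right-hand side is an
# illustrative stand-in; g plays the role of a forcing (e.g. GHG) parameter.
import numpy as np
from scipy.optimize import fsolve

def rhs(y, g):
    """dy/dt for a one-dimensional tipping-style normal form plus a slow
    recovery variable."""
    x, z = y
    return np.array([g + x - x**3 - 0.5 * z, 0.1 * (x - z)])

def jacobian(f, y, g, h=1e-6):
    """Central-difference estimate of the Jacobian of f at state y."""
    n = len(y)
    J = np.zeros((n, n))
    for j in range(n):
        dy = np.zeros(n); dy[j] = h
        J[:, j] = (f(y + dy, g) - f(y - dy, g)) / (2 * h)
    return J

for g in [0.0, 0.3, 0.6]:
    y_eq = fsolve(rhs, x0=np.array([1.0, 1.0]), args=(g,))   # locate equilibrium
    eigvals = np.linalg.eigvals(jacobian(rhs, y_eq, g))
    stable = np.all(eigvals.real < 0)
    print(f"g = {g:.1f}: equilibrium {y_eq.round(3)}, "
          f"max Re(lambda) = {eigvals.real.max():+.3f}, stable = {stable}")
```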

https://esd.copernicus.org/articles/16/475/2025/esd-16-475-2025-f05

Figure 5 Schematic of the grid of discovered equations. For our three illustrative examples, represented by different rows, the left-hand side shows the numerical mesh of the original data within which AI methods may discover underlying equations. The right-hand side is the potential mesh of such equations. For atmospheric convection, the original data comprise very high-resolution simulated meteorological variables. Each variable is illustrated as different 3-D blocks (vertically) and at different times (horizontally). Derived equations characterizing high-resolution convection would be embedded in existing ESMs on coarser scales, as shown on the top right. Land carbon modelling has two stages. Initially, at specific locations of FLUXNET data (yellow marks), time series of variables related to land–atmosphere carbon exchange, such as net ecosystem productivity (NEP) and meteorological variations, are used to derive time-evolving ODEs. Computer vision methods then calibrate and extrapolate these equations to all locations using high-resolution Earth observation data, ready for placement in the land components of ESMs. Large-scale oceanic circulation modelling would first simply spatially average key depth-independent quantities, T1, T2, Tsub, h1, h2, and τ, and equations are then found that describe their evolution in time, yielding a reduced-complexity set of ODEs. Not all data would be used in the initial training exercises to determine governing equation sets. As with most AI methods, the remaining data would be used to test algorithms, which here would determine the performance of the proposed equations, hence the right-to-left arrow at the bottom of the diagram. In some instances, there may be repeated cycles around these arrows, with alternative sets of equations derived for consideration and appropriate methods selected to compare them (e.g. the Akaike information criterion, AIC, statistics).


The three examples are as follows. (i) The first is the simulation of atmospheric convection, which solves equations at very high resolution (∼ 1 km grid) to represent the spatial heterogeneity of individual storms. Such models have high computational demands, which limit the spatial extent and the range of GHG concentrations that can be modelled. The goal is to develop from these calculations “bulk” differential equations suitable for ESMs operating on coarser (∼ 100 km) grids. These new equations need to aggregate fine-scale dynamics and their interactions with boundary conditions, simulate feedbacks where appropriate on ESM-simulated large-scale dynamics, and may incorporate stochastic components to describe the intensity and duration of convective events at fine scales. (ii) The second is the simulation of terrestrial carbon cycling. The fundamental equations governing land–atmosphere CO2 exchange due to photosynthesis and respiration are known and routinely included in ESMs. However, at local scales, the parameterization of these equations strongly depends on biome type. Additional terms may need to be derived for complex canopy structures or where different biomes are close together and have key within-canopy interactions. These factors may aggregate to impact ESM-scale parameterizations; hence the challenge is two-fold. The first part is to calibrate and, where necessary, discover new equation terms for key or co-located biomes, possibly guided by eddy-covariance measurements. The second is to utilize AI methods to entrain Earth observation data, enabling spatial aggregation beyond flux towers to generate equations and parameters applicable at ESM gridbox scales. This suggestion would improve ESMs with interactive carbon cycle simulations, providing better assessments of the extent to which the land surface will partially offset future CO2 emissions. (iii) The last is the large-scale summary simulation of ocean currents. This proposed AI application would derive globally applicable reduced-complexity ocean models from ESMs, offering several uses. A simpler model can explore responses to wide ranges of future emissions trajectories not possible with full ESMs due to their computational constraints. Reduced-complexity models facilitate parameter scanning and, where non-linear, allow dynamical systems theory to be used to illustrate any potential for TP occurrence. Fitting simpler model parameters to each member within an ESM ensemble allows improved characterization of inter-ESM variations and thus uncertainties. Finally, newer AI-derived reduced-complexity equations, drawn from data or ESMs, may reveal whether current simpler models, such as the Timmermann ENSO model, continue to be appropriate or whether alternative versions of oscillator models are more valid. To capture at least some aspects of the potential sets of AI-derived equations, in Fig. 5 we present the numerical mesh of original models and data (left-hand side) and the grid for new equations (right-hand side).
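Figure 5 notes that alternative discovered equation sets could be compared with measures such as the AIC. A minimal sketch of that comparison, assuming Gaussian residuals and two hypothetical candidate models whose parameters are taken as already calibrated, is shown below.

```python
# Minimal sketch: ranking two candidate (discovered) models of the same data
# with the Akaike information criterion, assuming Gaussian residuals.
# Both candidate models and the synthetic data are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(1)
t = np.linspace(0.0, 20.0, 400)
data = np.sin(t) * np.exp(-0.05 * t) + 0.05 * rng.standard_normal(t.size)

def aic(residuals, n_params):
    """AIC = 2k + n ln(RSS/n) for least-squares fits with Gaussian errors."""
    n = residuals.size
    rss = np.sum(residuals**2)
    return 2 * n_params + n * np.log(rss / n)

# Candidate 1: pure sinusoid (2 calibrated parameters: amplitude and frequency).
model1 = 1.0 * np.sin(1.0 * t)
# Candidate 2: damped sinusoid (3 calibrated parameters: amplitude, frequency, decay).
model2 = 1.0 * np.sin(1.0 * t) * np.exp(-0.05 * t)

for name, model, k in [("sinusoid", model1, 2), ("damped sinusoid", model2, 3)]:
    print(f"{name}: AIC = {aic(data - model, k):.1f}")
```

The lower AIC identifies the candidate that balances goodness of fit against the number of free parameters, penalizing needlessly complex equation sets.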

It is easier to set aspirations for implementing AI methods in climate science than to perform the analysis itself. Some of the suggestions here are likely major research projects that could take multiple years to complete. However, with the rapid pace of algorithm development raising questions about applicability to climate research, we aim to highlight the particular method of equation discovery. We contend that equation discovery, a form of interpretable AI, may substantially enhance some aspects of climate research that traditional analytical, statistical, or other AI methods may not address. An emphasis on equation development, and the inherent description of processes that equations provide, allows moving on from the complaint that AI-developed models are purely statistical and may fail if extrapolated to make predictions for higher future GHG levels (although some capability of statistical AI methods to predict new forcings is noted by Scher and Messori2019). Although an initial set of equations might also be suspected to have poor “out-of-sample” performance, their existence provides a stronger basis for interpreting processes and interactions. Subsequent careful fitting of equation parameters may generate robust predictive capability.

ESMs will almost certainly remain the primary tool for advising climate policy, and two of our examples (modelling convective storms and terrestrial carbon cycling) offer the possibility of improving the reliability of such models. Better aggregation of subgrid storm process representation to ESM gridbox scale may remove known issues with existing cloud representations (Randall et al.2003). The current variation in proposed subgrid atmosphere and land parameterizations may contribute to the large inter-ESM differences in the projection of changes in rainfall patterns (e.g. Yazdandoost et al.2021) and the global carbon cycle (e.g. Huntingford et al.2009) respectively. Improving emulation of subgrid effects in ESM development aligns with the commentary of Wong (2024) on AI and climate, although we again stress the retention of process understanding allowed through equation representation.

Could AI replace conventional climate research? This question is already being asked about weather forecasting (e.g. Schultz et al.2021). AI has shown skill in predicting severe weather events (e.g. Bi et al.2023; Lam et al.2023), but McGovern et al. (2017) argue that it remains essential to continue developing, in parallel, a physical understanding of high-impact meteorological events. A deeper understanding of the balance of dominant equation terms, possibly determined by AI, may reveal causal links between processes during the preceding periods of extreme events (“storylines”; Shepherd2019) and therefore support earlier warnings. We suggest that AI employed for equation discovery will actually engage climate research scientists further rather than creating any form of replacement. Although climate change is mainly simulated with ESMs, their ability to offer ever more refined estimates of change is arguably at a plateau. For similar future pathways in GHGs, the spread of projections across models has not decreased between version 5 of the Coupled Model Intercomparison Project (CMIP5) and the more recent version 6 (CMIP6) for the basic quantities of changes in global mean temperature and global mean precipitation (e.g. Fig. 4 of Tebaldi et al.2021). Reflecting on our three illustrative but specific examples (Sect. 3.1, 3.2, and 3.3), we conjecture that the particular form of AI that is the discovery of equations may lower uncertainties in ESMs, for both their regional and global projections. These reductions would come from (i) providing new robust equations that capture subgrid processes, (ii) creating valid grid-scale parameterizations for existing equations that aggregate fine-scale processes, and (iii) disentangling complex processes into equation sets far simpler than ESMs but that capture the dominant processes. The latter simpler equations may guide measurement programmes towards constraining key parameters, and such knowledge may ultimately feed back by improving ESM parameter calibration.

Data availability

No new data are used in this paper. The paper contains modified Copernicus Sentinel data (2024) processed by Sentinel Hub: https://apps.sentinel-hub.com/eo-browser/ (Copernicus Sentinel Hub2023) and with processing details at https://doi.org/10.5270/S2_-znk9xsj (Copernicus Sentinel-22021).

Details of Fig. 3a are that it was provided by the European Space Agency (ESA) in 2023 (https://doi.org/10.5270/S2_-znk9xsj, Copernicus Sentinel-22021). Sentinel-2B MSI: MultiSpectral Instrument, Level-2A/B. Retrieved from https://browser.dataspace.copernicus.eu/ (Copernicus Dataspace Browser2023), Product ID: S2B_MSIL2A_20230821T143729_N0509_R096_T19LHK_20230821T184452.SAFE, acquired on 21 August 2023.

Details of Fig. 3b are that it was provided by ESA in 2022. Sentinel-3A OLCI: Level-1 Full Resolution. Retrieved from https://browser.dataspace.copernicus.eu/ (Copernicus Dataspace Browser2023), Product ID: S3A_OL_1_EFR____20221021T041216_20221021T041451_20221022T050042_0154_091_147_1800_PS1_O_NT_003.SEN3, Acquired on 21 October 2022.

Author contributions

CH devised the original concept and framework and selected the examples for the manuscript. AJN wrote the section on simplified oceanic modelling, CK wrote the section on aggregating atmospheric convective events, and JAA supported the description of existing ML and AI algorithms. All authors contributed to the general writing of the manuscript.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

Chris Huntingford and Jawairia A. Ahmad acknowledge the Natural Environment Research Council (NERC) National Capability award to the UK Centre for Ecology and Hydrology. Andrew J. Nicoll is grateful for NERC funding through the Oxford University Environmental Research Doctoral Training Partnership (NE/S007474/1). Cornelia Klein acknowledges funding from a NERC independent research fellowship (NE/X017419/1).

Financial support

This research has been supported by the UK Research and Innovation (CEH National Capability Award), Oxford University Environmental Research Doctoral Training Partnership, and NERC independent research fellowship (grant no. NE/X017419/1).

Review statement

This paper was edited by Gabriele Messori and reviewed by Sebastian Scher and one anonymous referee.

References

Abrams, J. F., Huntingford, C., Williamson, M. S., McKay, D. I. A., Boulton, C. A., Buxton, J. E., Sakschewski, B., Loriani, S., Zimm, C., Winkelmann, R., and Lenton, T. M.: Committed Global Warming Risks Triggering Multiple Climate Tipping Points, Earth's Future, 11, e2022EF003250, https://doi.org/10.1029/2022EF003250, 2023. a

Aldeia, G. and De França, F.: Measuring Feature Importance of Symbolic Regression Models Using Partial Effects, in: 2021 Genetic and Evolutionary Computation Conference (GECCO '21), 10–14 July 2021, Lille, France, ACM, New York, NY, USA, 9 pp., https://doi.org/10.1145/3449639.3459302, 2021. a

Baldocchi, D., Falge, E., Gu, L. H., Olson, R., Hollinger, D., Running, S., Anthoni, P., Bernhofer, C., Davis, K., Evans, R., Fuentes, J., Goldstein, A., Katul, G., Law, B., Lee, X. H., Malhi, Y., Meyers, T., Munger, W., Oechel, W., U, K. T. P., Pilegaard, K., Schmid, H. P., Valentini, R., Verma, S., Vesala, T., Wilson, K., and Wofsy, S.: FLUXNET: A new tool to study the temporal and spatial variability of ecosystem-scale carbon dioxide, water vapor, and energy flux densities, B. Am. Meteorol. Soc., 82, 2415–2434, https://doi.org/10.1175/1520-0477(2001)082<2415:FANTTS>2.3.CO;2, 2001. a, b

Bao, J., Stevens, B., Kluft, L., and Muller, C.: Intensification of daily tropical precipitation extremes from more organized convection, Science Adv., 10, 1–11, https://doi.org/10.1126/sciadv.adj6801, 2024. a

Berliner, L. M., Levine, R. A., and Shea, D. J.: Bayesian climate change assessment, J. Climate, 13, 3805–3820, https://doi.org/10.1175/1520-0442(2000)013<3805:BCCA>2.0.CO;2, 2000. a

Bi, K., Xie, L., Zhang, H., Chen, X., Gu, X., and Tian, Q.: Accurate medium-range global weather forecasting with 3D neural networks, Nature, 619, 533–538, https://doi.org/10.1038/s41586-023-06185-3, 2023. a, b, c

Bjerknes, J.: Atmospheric teleconnections from the equatorial Pacific, Mon. Weather Rev., 97, 163–172, https://doi.org/10.1175/1520-0493(1969)097<0163:ATFTEP>2.3.CO;2, 1969. a

Bjordal, J., Storelvmo, T., Alterskjaer, K., and Carlsen, T.: Equilibrium climate sensitivity above 5 °C plausible due to state-dependent cloud feedback, Nat. Geosci., 13, 718–721, https://doi.org/10.1038/s41561-020-00649-1, 2020. a

Blyth, E. M., Arora, V. K., Clark, D. B., Dadson, S. J., De Kauwe, M. G., Lawrence, D. M., Melton, J. R., Pongratz, J., Turton, R. H., Yoshimura, K., and Yuan, H.: Advances in Land Surface Modelling, Curr. Climate Change Rep., 7, 45–71, https://doi.org/10.1007/s40641-021-00171-5, 2021. a

Boulanger, J.-P., Martinez, F., and Segura, E. C.: Projection of future climate change conditions using IPCC simulations, neural networks and Bayesian statistics. Part 1: Temperature mean state and seasonal cycle in South America, Clim. Dynam., 27, 233–259, https://doi.org/10.1007/s00382-006-0182-0, 2006. a

Brunton, S. L. and Kutz, J. N.: Data-Driven Science and Engineering: Machine Learning, Dynamical Systems, and Control, Cambridge University Press, Cambridge, 2nd edn., https://doi.org/10.1017/9781108380690, 2022. a, b, c, d

Brunton, S. L., Proctor, J. L., and Kutz, J. N.: Discovering governing equations from data by sparse identification of nonlinear dynamical systems, P. Natl. Acad. Sci. USA., 113, 3932–3937, https://doi.org/10.1073/pnas.1517384113, 2016. a, b, c, d, e

Canadell, J. G., Le Quéré, C., Raupach, M. R., Field, C. B., Buitenhuis, E. T., Ciais, P., Conway, T. J., Gillett, N. P., Houghton, R. A., and Marland, G.: Contributions to accelerating atmospheric CO2 growth from economic activity, carbon intensity, and efficiency of natural sinks, P. Natl. Acad. Sci. USA., 104, 18866–18870, https://doi.org/10.1073/pnas.0702737104, 2007. a

Cao, Q., Goswami, S., and Karniadakis, G.: LNO: Laplace Neural Operator for Solving Differential Equations, arXiv [preprint], https://doi.org/10.48550/arXiv.2303.10528, 2023. a, b

Champion, K., Lusch, B., Kutz, J. N., and Brunton, S. L.: Data-driven discovery of coordinates and governing equations, P. Natl. Acad. Sci. USA., 116, 22445–22451, https://doi.org/10.1073/pnas.1906995116, 2019. a

Chandra, N., Patra, P. K., Niwa, Y., Ito, A., Iida, Y., Goto, D., Morimoto, S., Kondo, M., Takigawa, M., Hajima, T., and Watanabe, M.: Estimated regional CO2 flux and uncertainty based on an ensemble of atmospheric CO2 inversions, Atmos. Chem. Phys., 22, 9215–9243, https://doi.org/10.5194/acp-22-9215-2022, 2022. a, b

Chen, B., Coops, N. C., Fu, D., Margolis, H. A., Amiro, B. D., Barr, A. G., Black, T. A., Arain, M. A., Bourque, C. P.-A., Flanagan, L. B., Lafleur, P. M., McCaughey, J. H., and Wofsy, S. C.: Assessing eddy-covariance flux tower location bias across the Fluxnet-Canada Research Network based on remote sensing and footprint modelling, Agr. Forest Meteorol., 151, 87–100, https://doi.org/10.1016/j.agrformet.2010.09.005, 2011. a

Chen, L., Chen, Z., Zhang, Y., Liu, Y., Osman, A. I., Farghali, M., Hua, J., Al-Fatesh, A., Ihara, I., Rooney, D. W., and Yap, P.-S.: Artificial intelligence-based solutions for climate change: a review, Environ. Chem. Lett., 21, 2525–2557, https://doi.org/10.1007/s10311-023-01617-y, 2023. a

Chen, Z. and Liu, B.: Lifelong machine learning, volume 1, Springer, https://doi.org/10.2200/S00737ED1V01Y201610AIM033, 2018. a

Chen, Z., Liu, Y., and Sun, H.: Physics-informed learning of governing equations from scarce data, Nat. Commun., 12, 6136, https://doi.org/10.1038/s41467-021-26434-1, 2021. a

Choi, J., An, S. I., and Yeh, S. W.: Decadal amplitude modulation of two types of ENSO and its relationship with the mean state, Clim. Dynam., 38, 2632–2644, https://doi.org/10.1007/s00382-011-1186-y, 2012. a

Copernicus Dataspace Browser: Data browser within the application interface to the Copernicus Data Space Ecosystem, https://browser.dataspace.copernicus.eu/ (last access: 25 March 2025), 2023. a, b

Copernicus Sentinel-2 (processed by ESA): MSI Level-2A BOA Reflectance Product, Collection 1, European Space Agency [data set], https://doi.org/10.5270/S2_-znk9xsj, 2021. a, b, c

Copernicus Sentinel Hub: Application interface to the Copernicus Data Space Ecosystem, https://apps.sentinel-hub.com/eo-browser/ (last access: 25 March 2025), 2023. a

Davies-Barnard, T., Zaehle, S., and Friedlingstein, P.: Assessment of the impacts of biological nitrogen fixation structural uncertainty in CMIP6 earth system models, Biogeosciences, 19, 3491–3503, https://doi.org/10.5194/bg-19-3491-2022, 2022. a

Delaunay, A. and Christensen, H. M.: Interpretable Deep Learning for Probabilistic MJO Prediction, Geophys. Res. Lett., 49, e2022GL098566, https://doi.org/10.1029/2022GL098566, 2022. a

Dijkstra, H. A.: Nonlinear Climate Dynamics, Cambridge University Press, https://doi.org/10.1017/CBO9781139034135, 2013. a

Drijfhout, S., Bathiany, S., Beaulieu, C., Brovkin, V., Claussen, M., Huntingford, C., Scheffer, M., Sgubin, G., and Swingedouw, D.: Catalogue of abrupt shifts in Intergovernmental Panel on Climate Change climate models, P. Natl. Acad. Sci. USA., 112, E5777–E5786, https://doi.org/10.1073/pnas.1511451112, 2015. a

Dufresne, J.-L. and Bony, S.: An assessment of the primary sources of spread of global warming estimates from coupled atmosphere-ocean models, J. Climate, 21, 5135–5144, https://doi.org/10.1175/2008JCLI2239.1, 2008. a

Ebert-Uphoff, I. and Deng, Y.: Causal discovery for climate research using graphical models, J. Climate, 25, 5648–5665, https://doi.org/10.1175/JCLI-D-11-00387.1, 2012. a

Eusebi, R., Vecchi, G. A., Lai, C. Y., and Tong, M.: Realistic tropical cyclone wind and pressure fields can be reconstructed from sparse data using deep learning, Commun. Earth Env., 5, 8, https://doi.org/10.1038/s43247-023-01144-2, 2024. a

Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958, https://doi.org/10.5194/gmd-9-1937-2016, 2016. a

Eyring, V., Collins, W. D., Gentine, P., Barnes, E. A., Barreiro, M., Beucler, T., Bocquet, M., Bretherton, C. S., Christensen, H. M., Dagon, K., Gagne, D. J., Hall, D., Hammerling, D., Hoyer, S., Iglesias-Suarez, F., Lopez-Gomez, I., McGraw, M. C., Meehl, G. A., Molina, M. J., Monteleoni, C., Mueller, J., Pritchard, M. S., Rolnick, D., Runge, J., Stier, P., Watt-Meyer, O., Weigel, K., Yu, R., and Zanna, L.: Pushing the frontiers in climate modelling and analysis with machine learning, Nat. Clim. Change, 14, 916–928, https://doi.org/10.1038/s41558-024-02095-y, 2024. a, b

Farquhar, G. D., Caemmerer, S. V., and Berry, J. A.: A biochemical-model of photosynthetic co2 assimilation in leaves of C-3 species, Planta, 149, 78–90, https://doi.org/10.1007/BF00386231, 1980. a

Forster, P., Storelvmo, T., Armour, K., Collins, W., Dufresne, J.-L., Frame, D., Lunt, D. J., Mauritsen, T., Palmer, M. D., Watanabe, M., Wild, M., and Zhang, H.: The Earth’s Energy Budget, Climate Feedbacks, and Climate Sensitivity, Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, 923–1054, https://doi.org/10.1017/9781009157896.009, 2021. a

Fosser, G., Khodayar, S., and Berg, P.: Benefit of convection permitting climate model simulations in the representation of convective precipitation, Clim. Dynam., 44, 45–60, https://doi.org/10.1007/s00382-014-2242-1, 2015. a

Fosser, G., Gaetani, M., Kendon, E. J., Adinolfi, M., Ban, N., Belušić, D., Caillaud, C., Careto, J. A., Coppola, E., Demory, M. E., de Vries, H., Dobler, A., Feldmann, H., Goergen, K., Lenderink, G., Pichelli, E., Schär, C., Soares, P. M., Somot, S., and Tölle, M. H.: Convection-permitting climate models offer more certain extreme rainfall projections, npj Clim. Atmos. Sci., 7, 1–10, https://doi.org/10.1038/s41612-024-00600-w, 2024. a

Friedlingstein, P., O'Sullivan, M., Jones, M. W., Andrew, R. M., Hauck, J., Olsen, A., Peters, G. P., Peters, W., Pongratz, J., Sitch, S., Le Quéré, C., Canadell, J. G., Ciais, P., Jackson, R. B., Alin, S., Aragão, L. E. O. C., Arneth, A., Arora, V., Bates, N. R., Becker, M., Benoit-Cattin, A., Bittig, H. C., Bopp, L., Bultan, S., Chandra, N., Chevallier, F., Chini, L. P., Evans, W., Florentie, L., Forster, P. M., Gasser, T., Gehlen, M., Gilfillan, D., Gkritzalis, T., Gregor, L., Gruber, N., Harris, I., Hartung, K., Haverd, V., Houghton, R. A., Ilyina, T., Jain, A. K., Joetzjer, E., Kadono, K., Kato, E., Kitidis, V., Korsbakken, J. I., Landschützer, P., Lefèvre, N., Lenton, A., Lienert, S., Liu, Z., Lombardozzi, D., Marland, G., Metzl, N., Munro, D. R., Nabel, J. E. M. S., Nakaoka, S.-I., Niwa, Y., O'Brien, K., Ono, T., Palmer, P. I., Pierrot, D., Poulter, B., Resplandy, L., Robertson, E., Rödenbeck, C., Schwinger, J., Séférian, R., Skjelvan, I., Smith, A. J. P., Sutton, A. J., Tanhua, T., Tans, P. P., Tian, H., Tilbrook, B., van der Werf, G., Vuichard, N., Walker, A. P., Wanninkhof, R., Watson, A. J., Willis, D., Wiltshire, A. J., Yuan, W., Yue, X., and Zaehle, S.: Global Carbon Budget 2020, Earth Syst. Sci. Data, 12, 3269–3340, https://doi.org/10.5194/essd-12-3269-2020, 2020. a

Grundner, A., Beucler, T., Gentine, P., and Eyring, V.: Data‐Driven Equation Discovery of a Cloud Cover Parameterization, J. Adv. Model. Earth Sy., 16, e2023MS003763, https://doi.org/10.1029/2023MS003763, 2024. a, b

Hall, A., Cox, P., Huntingford, C., and Klein, S.: Progressing emergent constraints on future climate change, Nat. Clim. Change, 9, 269–278, https://doi.org/10.1038/s41558-019-0436-6, 2019. a

Hannachi, A., Jolliffe, I. T., and Stephenson, D. B.: Empirical orthogonal functions and related techniques in atmospheric science: A review, Int. J. Climatol., 27, 1119–1152, https://doi.org/10.1002/joc.1499, 2007. a

Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horanyi, A., Munoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Holm, E., Janiskova, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thepaut, J.-N.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, https://doi.org/10.1002/qj.3803, 2020. a

Hsieh, W. W.: Machine Learning Methods in the Environmental Sciences: Neural Networks and Kernels, Cambridge University Press, Cambridge, https://doi.org/10.1017/CBO9780511627217, 2009. a, b, c

Huntingford, C., Allen, S. J., and Harding, R. J.: An intercomparison of single and dual-source vegetation-atmosphere transfer models applied to transpiration from Sahelian savanna, Bound-Lay. Meteorol., 74, 397–418, https://doi.org/10.1007/BF00712380, 1995. a

Huntingford, C., Lowe, J. A., Booth, B. B. B., Jones, C. D., Harris, G. R., Gohar, L. K., and Meir, P.: Contributions of carbon cycle uncertainty to future climate projection spread, Tellus B, 61, 355–360, https://doi.org/10.1111/j.1600-0889.2009.00414.x, 2009. a

Huntingford, C., Jeffers, E. S., Bonsall, M. B., Christensen, H. M., Lees, T., and Yang, H.: Machine learning and artificial intelligence to aid climate change research and preparedness, Environ. Res. Lett., 14, 124007, https://doi.org/10.1088/1748-9326/ab4e55, 2019. a, b

Huntingford, C., Cox, P. M., Williamson, M. S., Clarke, J. J., and Ritchie, P. D. L.: Emergent constraints for the climate system as effective parameters of bulk differential equations, Earth Syst. Dynam., 14, 433–442, https://doi.org/10.5194/esd-14-433-2023, 2023. a

IPCC: Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, https://doi.org/10.1017/9781009157896, 2021. a

Jebeile, J., Lam, V., Majszak, M., and Räz, T.: Machine learning and the quest for objectivity in climate model parameterization, Climatic Change, 176, 1–19, https://doi.org/10.1007/s10584-023-03532-1, 2023. a

Jin, F.-F.: An Equatorial Ocean Recharge Paradigm for ENSO. Part I: Conceptual Model, J. Atmos. Sci., 54, 811–829, https://doi.org/10.1175/1520-0469(1997)054<0811:AEORPF>2.0.CO;2, 1997. a, b

Jin, F.-F. and An, S.-I.: Thermocline and Zonal Advective Feedbacks Within the Equatorial Ocean Recharge Oscillator Model for ENSO, Geophys. Res. Lett., 26, 2989–2992, https://doi.org/10.1029/1999GL002297, 1999. a

Jin, F.-F., Chen, H.-C., Zhao, S., Hayashi, M., Karamperidou, C., Stuecker, M. F., Xie, R., and Geng, L.: Simple ENSO Models, chap. 6, American Geophysical Union (AGU), 119–151, ISBN 9781119548164, https://doi.org/10.1002/9781119548164.ch6, 2020. a

Jones, N.: Machine learning tapped to improve climate forecasts, Nature, 548, 379–380, https://doi.org/10.1038/548379a, 2017. a

Jung, M., Koirala, S., Weber, U., Ichii, K., Gans, F., Camps-Valls, G., Papale, D., Schwalm, C., Tramontana, G., and Reichstein, M.: The FLUXCOM ensemble of global land-atmosphere energy fluxes, Scientific Data, 6, 74, https://doi.org/10.1038/s41597-019-0076-8, 2019. a

Jung, M., Schwalm, C., Migliavacca, M., Walther, S., Camps-Valls, G., Koirala, S., Anthoni, P., Besnard, S., Bodesheim, P., Carvalhais, N., Chevallier, F., Gans, F., Goll, D. S., Haverd, V., Köhler, P., Ichii, K., Jain, A. K., Liu, J., Lombardozzi, D., Nabel, J. E. M. S., Nelson, J. A., O'Sullivan, M., Pallandt, M., Papale, D., Peters, W., Pongratz, J., Rödenbeck, C., Sitch, S., Tramontana, G., Walker, A., Weber, U., and Reichstein, M.: Scaling carbon fluxes from eddy covariance sites to globe: synthesis and evaluation of the FLUXCOM approach, Biogeosciences, 17, 1343–1365, https://doi.org/10.5194/bg-17-1343-2020, 2020. a

Kaack, L. H., Donti, P. L., Strubell, E., Kamiya, G., Creutzig, F., and Rolnick, D.: Aligning artificial intelligence with climate change mitigation, Nat. Clim. Change, 12, 518–527, https://doi.org/10.1038/s41558-022-01377-7, 2022. a

Kaptanoglu, A. A., Callaham, J. L., Aravkin, A., Hansen, C. J., and Brunton, S. L.: Promoting global stability in data-driven models of quadratic nonlinear dynamics, Phys. Rev. Fluids, 6, 094401, https://doi.org/10.1103/PhysRevFluids.6.094401, 2021. a

Karniadakis, G. E., Kevrekidis, I. G., Lu, L., Perdikaris, P., Wang, S., and Yang, L.: Physics-informed machine learning, Nat. Rev. Phys., 3, 422–440, https://doi.org/10.1038/s42254-021-00314-5, 2021. a, b, c

Kashinath, K., Mustafa, M., Albert, A., Wu, J. L., Jiang, C., Esmaeilzadeh, S., Azizzadenesheli, K., Wang, R., Chattopadhyay, A., Singh, A., Manepalli, A., Chirila, D., Yu, R., Walters, R., White, B., Xiao, H., Tchelepi, H. A., Marcus, P., Anandkumar, A., Hassanzadeh, P., and Prabhat: Physics-informed machine learning: case studies for weather and climate modelling, Philos. T. R. Soc. A, 379, 20200093, https://doi.org/10.1098/rsta.2020.0093, 2021. a

Kendon, E. J., Ban, N., Roberts, N. M., Fowler, H. J., Roberts, M. J., Chan, S. C., Evans, J. P., Fosser, G., and Wilkinson, J. M.: Do convection-permitting regional climate models improve projections of future precipitation change?, B. Am. Meteorol. Soc., 98, 79–93, https://doi.org/10.1175/BAMS-D-15-0004.1, 2017. a

Kendon, E. J., Prein, A. F., Senior, C. A., and Stirling, A.: Challenges and outlook for convection-permitting climate modelling, Philos. T. R. Soc. A, 379, 20190547, https://doi.org/10.1098/rsta.2019.0547, 2021. a

Keren, L., Liberzon, A., and Lazebnik, T.: A computational framework for physics-informed symbolic regression with straightforward integration of domain knowledge, Sci. Rep.-UK, 13, 1249, https://doi.org/10.1038/s41598-023-28328-2, 2023. a

Klein, C., Jackson, L. S., Parker, D. J., Marsham, J. H., Taylor, C. M., Rowell, D. P., Guichard, F. F., Vischel, T. T., Famien, A. M. M. L., and Diedhiou, A.: Combining CMIP data with a regional convection-permitting model and observations to project extreme rainfall under climate change, Environ. Res. Lett., 16, 104023, https://doi.org/10.1088/1748-9326/ac26f1, 2021. a

Kodra, E., Bhatia, U., Chatterjee, S., Chen, S., and Ganguly, A. R.: Physics-guided probabilistic modeling of extreme precipitation under climate change, Sci. Rep.-UK., 10, 10299, https://doi.org/10.1038/s41598-020-67088-1, 2020. a

Kondo, M., Patra, P. K., Sitch, S., Friedlingstein, P., Poulter, B., Chevallier, F., Ciais, P., Canadell, J. G., Bastos, A., Lauerwald, R., Calle, L., Ichii, K., Anthoni, P., Arneth, A., Haverd, V., Jain, A. K., Kato, E., Kautz, M., Law, R. M., Lienert, S., Lombardozzi, D., Maki, T., Nakamura, T., Peylin, P., Rödenbeck, C., Zhuravlev, R., Saeki, T., Tian, H., Zhu, D., and Ziehn, T.: State of the science in reconciling top-down and bottom-up approaches for terrestrial CO2 budget, Glob. Change Biol., 26, 1068–1084, https://doi.org/10.1111/gcb.14917, 2020. a

Kuehl, N., Schemmer, M., Goutier, M., and Satzger, G.: Artificial intelligence and machine learning, Electronic Mark., 32, 2235–2244, https://doi.org/10.1007/s12525-022-00598-0, 2022. a

Lam, R., Sanchez-Gonzalez, A., Willson, M., Wirnsberger, P., Fortunato, M., Alet, F., Ravuri, S., Ewalds, T., Eaton-Rosen, Z., Hu, W., Merose, A., Hoyer, S., Holland, G., Vinyals, O., Stott, J., Pritzel, A., Mohamed, S., and Battaglia, P.: Learning skillful medium-range global weather forecasting, Science, 382, 1416–1421, https://doi.org/10.1126/science.adi2336, 2023. a, b, c

Lenderink, G. and Fowler, H. J.: Understanding rainfall extremes, Nat. Clim. Change, 7, 391–393, https://doi.org/10.1038/nclimate3305, 2017. a

Lenderink, G., Barbero, R., Loriaux, J. M., and Fowler, H. J.: Super-Clausius-Clapeyron Scaling of Extreme Hourly Convective Precipitation and Its Relation to Large-Scale Atmospheric Conditions, J. Climate, 30, 6037–6052, https://doi.org/10.1175/JCLI-D-16-0808.1, 2017. a

Linardatos, P., Papastefanopoulos, V., and Kotsiantis, S.: Explainable AI: A Review of Machine Learning Interpretability Methods, Entropy, 23, https://doi.org/10.3390/e23010018, 2021. a

Lorenz, E. N.: Deterministic Nonperiodic Flow, J. Atmos. Sci., 20, 130–141, https://doi.org/10.1175/1520-0469(1963)020<0130:DNF>2.0.CO;2, 1963. a

Lu, L., Jin, P., Pang, G., Zhang, Z., and Karniadakis, G. E.: Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators, Nat. Mach. Intell., 3, 218–229, https://doi.org/10.1038/s42256-021-00302-5, 2021. a

Lund, R. and Li, B.: Revisiting climate region definitions via clustering, J. Climate, 22, 1787–1800, https://doi.org/10.1175/2008JCLI2455.1, 2009. a

Makridakis, S.: The forthcoming Artificial Intelligence (AI) revolution: Its impact on society and firms, Futures, 90, 46–60, https://doi.org/10.1016/j.futures.2017.03.006, 2017. a

Maybee, B., Marsham, J. H., Klein, C. M., Parker, D. J., Barton, E. J., Taylor, C. M., Lewis, H., Sanchez, C., Jones, R. W., and Warner, J.: Wind Shear Effects in Convection–Permitting Models Influence MCS Rainfall and Forcing of Tropical Circulation, Geophys. Res. Lett., 51, e2024GL110119, https://doi.org/10.1029/2024GL110119, 2024. a

McGovern, A., Elmore, K. L., Gagne, II, D. J., Haupt, S. E., Karstens, C. D., Lagerquist, R., Smith, T., and Williams, J. K.: Using artificial intelligence to improve real-time decision-making for high-impact weather, B. Am. Meteorol. Soc., 98, 2073–2090, https://doi.org/10.1175/BAMS-D-16-0123.1, 2017. a

Mckay, D. I. A., Staal, A., Abrams, J. F., Winkelmann, R., Sakschewski, B., Loriani, S., Fetzer, I., Cornell, S. E., Rockstrom, J., and Lenton, T. M.: Exceeding 1.5 °C global warming could trigger multiple climate tipping points, Science, 377, 1171, https://doi.org/10.1126/science.abn7950, 2022. a

Mercado, L. M., Huntingford, C., Gash, J. H. C., Cox, P. M., and Jogireddy, V.: Improving the representation of radiation interception and photosynthesis for climate model applications, Tellus B, 59, 553–565, https://doi.org/10.1111/j.1600-0889.2007.00256.x, 2007. a

Monteith, J. L.: Evaporation and surface-temperature, Q. J. Roy. Meteor. Soc., 107, 1–27, https://doi.org/10.1256/smsqj.45101, 1981. a

Mu, Q., Jackson, C. S., and Stoffa, P. L.: A multivariate empirical-orthogonal-function-based measure of climate model performance, J. Geophys. Res-Atmos., 109, D15101, https://doi.org/10.1029/2004JD004584, 2004. a, b

Mulholland, J. P., Peters, J. M., and Morrison, H.: How does vertical wind shear influence entrainment in squall lines?, J. Atmos. Sci., 78, 1931–1946, https://doi.org/10.1175/JAS-D-20-0299.1, 2021. a

Murphy, K. P.: Machine learning: a probabilistic perspective, MIT Press, Cambridge, Mass., USA, https://doi.org/10.1080/09332480.2014.914768, 2013. a, b, c

O'Gorman, P. A. and Dwyer, J. G.: Using Machine Learning to Parameterize Moist Convection: Potential for Modeling of Climate, Climate Change, and Extreme Events, J. Adv. Model. Earth Sy., 10, 2548–2563, https://doi.org/10.1029/2018ms001351, 2018. a

Papale, D.: Ideas and perspectives: enhancing the impact of the FLUXNET network of eddy covariance sites, Biogeosciences, 17, 5587–5598, https://doi.org/10.5194/bg-17-5587-2020, 2020. a

Petersen, B. K., Landajuela, M., Mundhenk, T. N., Santiago, C. P., Kim, S. K., and Kim, J. T.: Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients, arXiv [preprint], https://doi.org/10.48550/arXiv.1912.04871, 2019. a

Picaut, J., Masia, F., and du Penhoat, Y.: An advective-reflective conceptual model for the oscillatory nature of the ENSO, Science, 277, 663–666, https://doi.org/10.1126/science.277.5326.663, 1997. a

Prein, A. F., Gobiet, A., Suklitsch, M., Truhetz, H., Awan, N. K., Keuler, K., and Georgievski, G.: Added value of convection permitting seasonal simulations, Clim. Dynam., 41, 2655–2677, https://doi.org/10.1007/s00382-013-1744-6, 2013. a

Prein, A. F., Liu, C., Ikeda, K., Trier, S. B., Rasmussen, R. M., Holland, G. J., and Clark, M. P.: Increased rainfall volume from future convective storms in the US, Nat. Clim. Change, 7, 880–884, https://doi.org/10.1038/s41558-017-0007-7, 2017. a

Raissi, M., Perdikaris, P., and Karniadakis, G. E.: Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations, J. Comput. Phys., 378, 686–707, https://doi.org/10.1016/j.jcp.2018.10.045, 2019. a

Randall, D., Khairoutdinov, M., Arakawa, A., and Grabowski, W.: Breaking the cloud parameterization deadlock, B. Am. Meteorol. Soc., 84, 1547–1564, https://doi.org/10.1175/BAMS-84-11-1547, 2003. a

Rasp, S., Pritchard, M. S., and Gentine, P.: Deep learning to represent subgrid processes in climate models, P. Natl. Acad. Sci. USA., 115, 9684–9689, https://doi.org/10.1073/pnas.1810286115, 2018. a

Reichstein, M., Camps-Valls, G., Stevens, B., Jung, M., Denzler, J., Carvalhais, N., and Prabhat: Deep learning and process understanding for data-driven Earth system science, Nature, 566, 195–204, https://doi.org/10.1038/s41586-019-0912-1, 2019. a, b

Rolnick, D., Donti, P. L., Kaack, L. H., Kochanski, K., Lacoste, A., Sankaran, K., Ross, A. S., Milojevic-Dupont, N., Jaques, N., Waldman-Brown, A., Luccioni, A. S., Maharaj, T., Sherwin, E. D., Mukkavilli, S. K., Kording, K. P., Gomes, C. P., Ng, A. Y., Hassabis, D., Platt, J. C., Creutzig, F., Chayes, J., and Bengio, Y.: Tackling Climate Change with Machine Learning, ACM Comput. Surv., 55, 1–96, https://doi.org/10.1145/3485128, 2023. a

Ross, A., Li, Z., Perezhogin, P., Fernandez-Granda, C., and Zanna, L.: Benchmarking of Machine Learning Ocean Subgrid Parameterizations in an Idealized Model, J. Adv. Model. Earth Sy., 15, e2022MS003258, https://doi.org/10.1029/2022MS003258, 2023. a

Rudy, S. H., Brunton, S. L., Proctor, J. L., and Kutz, J. N.: Data-driven discovery of partial differential equations, Sci. Advances, 3, e1602614, https://doi.org/10.1126/sciadv.1602614, 2017. a, b

San, O. and Iliescu, T.: A stabilized proper orthogonal decomposition reduced-order model for large scale quasigeostrophic ocean circulation, Adv. Comput. Math., 41, 1289–1319, https://doi.org/10.1007/s10444-015-9417-0, 2015. a

Scher, S. and Messori, G.: Generalization properties of feed-forward neural networks trained on Lorenz systems, Nonlin. Processes Geophys., 26, 381–399, https://doi.org/10.5194/npg-26-381-2019, 2019. a

Schmid, H. P.: Source areas for scalars and scalar fluxes, Bound-Lay. Meteorol., 67, 293–318, https://doi.org/10.1007/BF00713146, 1994. a

Schneider, T., Lan, S., Stuart, A., and Teixeira, J.: Earth System Modeling 2.0: A Blueprint for Models That Learn From Observations and Targeted High-Resolution Simulations, Geophys. Res. Lett., 44, 12396–12417, https://doi.org/10.1002/2017GL076101, 2017. a

Schneider, T., Behera, S., Boccaletti, G., Deser, C., Emanuel, K., Ferrari, R., Leung, L. R., Lin, N., Mueller, T., Navarra, A., Ndiaye, O., Stuart, A., Tribbia, J., and Yamagata, T.: Harnessing AI and computing to advance climate modelling and prediction, Nat. Clim. Change, 13, 887–889, https://doi.org/10.1038/s41558-023-01769-3, 2023. a

Schultz, M. G., Betancourt, C., Gong, B., Kleinert, F., Langguth, M., Leufen, L. H., Mozaffari, A., and Stadtler, S.: Can deep learning beat numerical weather prediction?, Philos. T. R. Soc. A, 379, 20200097, https://doi.org/10.1098/rsta.2020.0097, 2021. a

Senior, C. A., Marsham, J. H., Berthou, S., Burgin, L. E., Folwell, S. S., Kendon, E. J., Klein, C. M., Jones, R. G., Mittal, N., Rowell, D. P., Tomassini, L., Vischel, T., Becker, B., Birch, C. E., Crook, J., Dougill, A. J., Finney, D. L., Graham, R. J., Hart, N. C. G., Jack, C. D., Jackson, L. S., James, R., Koelle, B., Misiani, H., Mwalukanga, B., Parker, D. J., Stratton, R. A., Taylor, C. M., Tucker, S. O., Wainwright, C. M., Washington, R., and Willet, M. R.: Convection-Permitting Regional Climate Change Simulations for Understanding Future Climate and Informing Decision-Making in Africa, B. Am. Meteorol. Soc., 102, E1206–E1223, https://doi.org/10.1175/BAMS-D-20-0020.1, 2021. a

Shepherd, T. G.: Storyline approach to the construction of regional climate change information, Philos. T. R. Soc. A, 475, 20190013, https://doi.org/10.1098/rspa.2019.0013, 2019. a

Sitch, S., Huntingford, C., Gedney, N., Levy, P. E., Lomas, M., Piao, S. L., Betts, R., Ciais, P., Cox, P., Friedlingstein, P., Jones, C. D., Prentice, I. C., and Woodward, F. I.: Evaluation of the terrestrial carbon cycle, future plant geography and climate-carbon cycle feedbacks using five Dynamic Global Vegetation Models (DGVMs), Glob. Change Biol., 14, 2015–2039, https://doi.org/10.1111/j.1365-2486.2008.01626.x, 2008. a

Slingo, J., Bates, P., Bauer, P., Belcher, S., Palmer, T., Stephens, G., Stevens, B., Stocker, T., and Teutsch, G.: Ambitious partnership needed for reliable climate prediction, Nat. Clim. Change, 12, 499–503, https://doi.org/10.1038/s41558-022-01384-8, 2022. a

Smith, R. L., Tebaldi, C., Nychka, D., and Mearns, L. O.: Bayesian modeling of uncertainty in ensembles of climate models, J. Am. Stat. Assoc., 104, 97–116, https://doi.org/10.1198/jasa.2009.0007, 2009. a

Smith, T. M., Reynolds, R. W., Livezey, R. E., and Stokes, D. C.: Reconstruction of historical sea surface temperatures using empirical orthogonal functions, J. Climate, 9, 1403–1420, https://doi.org/10.1175/1520-0442(1996)009<1403:ROHSST>2.0.CO;2, 1996. a, b

Steinbach, M., Tan, P.-N., Kumar, V., Klooster, S., and Potter, C.: Discovery of climate indices using clustering, in: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, ACM Digital Library, 446–455, https://doi.org/10.1145/956750.956801, 2003. a

Stephens, G. L., L'Ecuyer, T., Forbes, R., Gettelman, A., Golaz, J.-C., Bodas-Salcedo, A., Suzuki, K., Gabriel, P., and Haynes, J.: Dreary state of precipitation in global models, J. Geophys. Res.-Atmos., 115, D24211, https://doi.org/10.1029/2010JD014532, 2010. a

Stevens, B., Satoh, M., Auger, L., Biercamp, J., Bretherton, C. S., Chen, X., Düben, P., Judt, F., Khairoutdinov, M., Klocke, D., Kodama, C., Kornblueh, L., Lin, S. J., Neumann, P., Putman, W. M., Röber, N., Shibuya, R., Vanniere, B., Vidale, P. L., Wedi, N., and Zhou, L.: DYAMOND: the DYnamics of the Atmospheric general circulation Modeled On Non-hydrostatic Domains, Prog. Earth Planet. Sci., 6, 61, https://doi.org/10.1186/s40645-019-0304-z, 2019. a

Stommel, H.: Thermohaline Convection with Two Stable Regimes of Flow, Tellus, 13, 224–230, https://doi.org/10.1111/j.2153-3490.1961.tb00079.x, 1961. a

Storch, H. v. and Zwiers, F. W.: Statistical Analysis in Climate Research, Cambridge University Press, https://doi.org/10.1017/CBO9780511612336, 1999. a, b, c

Suarez, M. J. and Schopf, P. S.: A delayed action oscillator for ENSO, J. Atmos. Sci., 45, 3283–3287, https://doi.org/10.1175/1520-0469(1988)045<3283:ADAOFE>2.0.CO;2, 1988. a

Tanevski, J., Todorovski, L., and Džeroski, S.: Combinatorial search for selecting the structure of models of dynamical systems with equation discovery, Eng. Appl. Artif. Intel., 89, 103423, https://doi.org/10.1016/j.engappai.2019.103423, 2020. a

Tebaldi, C., Debeire, K., Eyring, V., Fischer, E., Fyfe, J., Friedlingstein, P., Knutti, R., Lowe, J., O'Neill, B., Sanderson, B., van Vuuren, D., Riahi, K., Meinshausen, M., Nicholls, Z., Tokarska, K. B., Hurtt, G., Kriegler, E., Lamarque, J.-F., Meehl, G., Moss, R., Bauer, S. E., Boucher, O., Brovkin, V., Byun, Y.-H., Dix, M., Gualdi, S., Guo, H., John, J. G., Kharin, S., Kim, Y., Koshiro, T., Ma, L., Olivié, D., Panickal, S., Qiao, F., Rong, X., Rosenbloom, N., Schupfner, M., Séférian, R., Sellar, A., Semmler, T., Shi, X., Song, Z., Steger, C., Stouffer, R., Swart, N., Tachiiri, K., Tang, Q., Tatebe, H., Voldoire, A., Volodin, E., Wyser, K., Xin, X., Yang, S., Yu, Y., and Ziehn, T.: Climate model projections from the Scenario Model Intercomparison Project (ScenarioMIP) of CMIP6, Earth Syst. Dynam., 12, 253–293, https://doi.org/10.5194/esd-12-253-2021, 2021. a, b

Timmermann, A., Voss, H. U., and Pasmanter, R.: Empirical Dynamical System Modeling of ENSO Using Nonlinear Inverse Techniques, J. Phys. Oceanogr., 31, 1579–1598, https://doi.org/10.1175/1520-0485(2001)031<1579:EDSMOE>2.0.CO;2, 2001. a

Timmermann, A., Jin, F. F., and Abshagen, J.: A nonlinear theory for El Nino bursting, J. Atmos. Sci., 60, 152–165, https://doi.org/10.1175/1520-0469(2003)060<0152:ANTFEN>2.0.CO;2, 2003. a, b, c, d, e, f, g, h

Timmermann, A., An, S.-I., Kug, J.-S., Jin, F.-F., Cai, W., Capotondi, A., Cobb, K., Lengaigne, M., McPhaden, M., Stuecker, M., Stein, K., Wittenberg, A., Yun, K.-S., Bayr, T., Chen, H.-C., Chikamoto, Y., Dewitte, B., Dommenget, D., Grothe, P., and Zhang, X.: El Niño–Southern Oscillation complexity, Nature, 559, 535–545, https://doi.org/10.1038/s41586-018-0252-6, 2018. a

Tramontana, G., Jung, M., Schwalm, C. R., Ichii, K., Camps-Valls, G., Ráduly, B., Reichstein, M., Arain, M. A., Cescatti, A., Kiely, G., Merbold, L., Serrano-Ortiz, P., Sickert, S., Wolf, S., and Papale, D.: Predicting carbon dioxide and energy fluxes across global FLUXNET sites with regression algorithms, Biogeosciences, 13, 4291–4313, https://doi.org/10.5194/bg-13-4291-2016, 2016. a

UNFCCC: Adoption of the Paris Agreement, Report No. FCCC/CP/2015/L.9/Rev.1, https://unfccc.int/resource/docs/2015/cop21/eng/l09r01.pdf (last access: 1 December 2024), 2015. a

Vlachas, P. R., Byeon, W., Wan, Z. Y., Sapsis, T. P., and Koumoutsakos, P.: Data-driven forecasting of high-dimensional chaotic systems with long short-term memory networks, Philos. T. R. Soc. A, 474, 20170844, https://doi.org/10.1098/rspa.2017.0844, 2018. a

Wang, H., Fu, T., Du, Y., Gao, W., Huang, K., Liu, Z., Chandak, P., Liu, S., Van Katwyk, P., Deac, A., Anandkumar, A., Bergen, K., Gomes, C. P., Ho, S., Kohli, P., Lasenby, J., Leskovec, J., Liu, T.-Y., Manrai, A., Marks, D., Ramsundar, B., Song, L., Sun, J., Tang, J., Velickovic, P., Welling, M., Zhang, L., Coley, C. W., Bengio, Y., and Zitnik, M.: Scientific discovery in the age of artificial intelligence, Nature, 620, 47–60, https://doi.org/10.1038/s41586-023-06221-2, 2023. a

Weisberg, R. H. and Wang, C.: A western Pacific oscillator paradigm for the El Nino-Southern Oscillation, Geophys. Res. Lett., 24, 779–782, https://doi.org/10.1029/97GL00689, 1997. a

Westra, S., Fowler, H. J., Evans, J. P., Alexander, L. V., Berg, P., Johnson, F., Kendon, E. J., Lenderink, G., and Roberts, N. M.: Future changes to the intensity and frequency of short-duration extreme rainfall, Rev. Geophys., 52, 522–555, https://doi.org/10.1002/2014RG000464, 2014. a

Williamson, M. S., Thackeray, C. W., Cox, P. M., Hall, A., Huntingford, C., and Nijsse, F. J. M. M.: Emergent constraints on climate sensitivities, Rev. Mod. Phys., 93, 025004, https://doi.org/10.1103/RevModPhys.93.025004, 2021. a

Wong, C.: How AI is improving climate forecasts, Nature, 628, 710–712, https://doi.org/10.1038/d41586-024-00780-8, 2024. a

Wunderling, N., Donges, J. F., Kurths, J., and Winkelmann, R.: Interacting tipping elements increase risk of climate domino effects under global warming, Earth Syst. Dynam., 12, 601–619, https://doi.org/10.5194/esd-12-601-2021, 2021. a

Liu, X., Coulibaly, P., and Evora, N.: Comparison of data-driven methods for downscaling ensemble weather forecasts, Hydrol. Earth Syst. Sci., 12, 615–624, https://doi.org/10.5194/hess-12-615-2008, 2008. a

Xu, Y., Liu, X., Cao, X., Huang, C., Liu, E., Qian, S., Liu, X., Wu, Y., Dong, F., Qiu, C.-W., Qiu, J., Hua, K., Su, W., Wu, J., Xu, H., Han, Y., Fu, C., Yin, Z., Liu, M., Roepman, R., Dietmann, S., Virta, M., Kengara, F., Zhang, Z., Zhang, L., Zhao, T., Dai, J., Yang, J., Lan, L., Luo, M., Liu, Z., An, T., Zhang, B., He, X., Cong, S., Liu, X., Zhang, W., Lewis, J. P., Tiedje, J. M., Wang, Q., An, Z., Wang, F., Zhang, L., Huang, T., Lu, C., Cai, Z., Wang, F., and Zhang, J.: Artificial intelligence: A powerful paradigm for scientific research, Innovation, 2, 100179, https://doi.org/10.1016/j.xinn.2021.100179, 2021. a

Yazdandoost, F., Moradian, S., Izadi, A., and Aghakouchak, A.: Evaluation of CMIP6 precipitation simulations across different climatic zones: Uncertainty and model intercomparison, Atmos. Res., 250, 105369, https://doi.org/10.1016/j.atmosres.2020.105369, 2021. a

Zebiak, S. E. and Cane, M. A.: A Model El Niño–Southern Oscillation, Mon. Weather Rev., 115, 2262–2278, https://doi.org/10.1175/1520-0493(1987)115<2262:AMENO>2.0.CO;2, 1987. a

Chief editor
Artificial Intelligence (AI) is often criticised for being a "black-box" approach that provides no physical insights into the data being analysed. Very recently, a new branch of AI has emerged, called “AI-led equation discovery”. As the name suggests, it aims to reveal process equations underlying the data. This Perspective Article offers a path to align AI methods with climate research, with a focus on the use of AI-led equation discovery in support of Earth System Models.
Short summary
AI is impacting science, providing key data insights, but most algorithms are statistical requiring cautious "out-of-sample" extrapolation. Yet climate research concerns predicting future climatic states. We consider a new method of AI-led equation discovery. Equations offer process interpretation and more robust predictions. We recommend this method for climate analysis, suggesting illustrative application to atmospheric convection, land–atmosphere CO2 flux, and global ocean circulation models.