Past, present, and future variability of Atlantic meridional overturning circulation in CMIP6 ensembles

Coquereau, Arthur; Sévellec, Florian; Huck, Thierry; Hirschi, Joël J.-M.; Jamet, Quentin

doi:10.5194/esd-17-209-2026

Articles | Volume 17, issue 2

https://doi.org/10.5194/esd-17-209-2026

Articles | Volume 17, issue 2

Research article

04 Mar 2026

Research article |

| 04 Mar 2026

Past, present, and future variability of Atlantic meridional overturning circulation in CMIP6 ensembles

Arthur Coquereau, Florian Sévellec, Thierry Huck, Joël J.-M. Hirschi, and Quentin Jamet

Abstract

The Atlantic Meridional Overturning Circulation (AMOC) is a key component of the climate system, exhibiting strong variability across daily to millennial timescales and significantly influencing global climate. Sensitive to external conditions such as freshwater input, greenhouse gas concentrations, and aerosol forcing, important variations of the AMOC can be triggered by anthropogenic emissions. This study presents a comprehensive analysis of sources of AMOC variance in state-of-the-art climate ensemble models. By decomposing the effects of scenario, model, ensemble, and time variability, along with their interactions, through an Analysis of Variance (ANOVA) and by introducing a novel combination of the variance contributions based on physical considerations, we identify three distinct regimes of AMOC variability from 1850 to 2100. The first regime, spanning most of the historical period, is characterized by a relatively stable AMOC dominated by internal variability (i.e., ensemble spread). The second regime, initiated by AMOC decline at the end of the 20th century and lasting until mid-21st century, is governed by a transient increase of time variability. Notably, the direct effect of forcing scenario differences remains muted all along this regime, despite the start of emission-scenarios in 2015. The third regime, beginning around 2050, is marked by the emergence and rapid dominance of inter-scenario variability. Throughout the simulations, inter-model variability remains the primary source of uncertainty, influenced by aerosol forcing response, AMOC decline magnitude, and the physical variability. A key finding of this work is the evidence that internal variability decreases simultaneously with AMOC intensity and seems inversely proportional to emission-scenario intensity.

Download & links

How to cite.

Received: 02 Jan 2025 – Discussion started: 27 Jan 2025 – Revised: 19 Feb 2026 – Accepted: 23 Feb 2026 – Published: 04 Mar 2026

1 Introduction

The Earth's climate is a complex system made up of various intertwined components interacting with each other and varying over a wide range of temporal and spatial scales. Among the major components, the Atlantic Meridional Overturning Circulation (AMOC) plays a key role by controlling a substantial part of the poleward heat transport with major impacts on the surrounding regions (Srokosz et al., 2012). At 26° N, it transports about 1.2 PW (1 PW =10¹⁵ W) representing 60 % of the net poleward heat flux of the ocean and 30 % considering both ocean and atmosphere (Ganachaud and Wunsch, 2000; Trenberth and Fasullo, 2017; Johns et al., 2023). Several remote influences have also been identified, with the North East Brazilian and Sahel rainfalls, or the Atlantic hurricanes activity to only name a few (Knight et al., 2006). Finally, the AMOC contributes significantly to carbon sequestration through deep-water formation at high latitudes (Zickfeld et al., 2008).

Considering this leading role for the climate system, it appears critical to understand how the AMOC varies over time and identify the associated mechanisms. In this regard, a recent review provided a comprehensive description of the scales involved in AMOC variability (Hirschi et al., 2020). It shows that the variability unfolds over a large collection of temporal and spatial scales, driven by diverse physical processes – from daily to interannual and even decadal fluctuations. These include fast synoptic weather systems and near-inertial gravity waves, mesoscale eddies and large-scale baroclinic waves, as well as slower climate modes such as the North Atlantic Oscillation or the Atlantic Multi-decadal Oscillation. Accordingly, a common simplified distinction is made between intra-annual variability, primarily linked to Ekman wind forcing, and interannual-to-decadal variability, which is more strongly associated with geostrophic processes and large-scale density contrasts (see Buckley and Marshall, 2016, for a review). At the end of the spectrum, the AMOC has also been associated with abrupt climate events, such as Dansgaard-Oeschger and Heinrich events at millennial scale (Dansgaard et al., 1993; McManus et al., 2004; Böhm et al., 2015; Henry et al., 2016). These events, which are not yet fully understood, underscore another critical aspect of the AMOC: its multi-stability (e.g., Sévellec and Fedorov, 2014). Indeed, a long literature has studied this aspect and shown that the AMOC is subject to collapses during which the system shifts from a first state with an intense circulation to another one where the circulation considerably weakens (Stommel, 1961; Lenton et al., 2008; Armstrong McKay et al., 2022).

Beyond these natural, internal, and chaotic variations of the AMOC, the system is also sensitive to external forcings, such as the anthropogenic emissions of aerosols and carbon in the atmosphere. These emissions can either reduce the surface temperature by increasing the reflection of incoming solar radiation (for the aerosols), or on the contrary, increase the temperature by intensifying the greenhouse effect (in the case of carbon dioxide or methane). While carbon dioxide and methane are “well-mixed” in the atmosphere, resulting in a globally homogeneous effect, the concentration of aerosols is more heterogeneous, leading to localized radiative forcing. In Subpolar North Atlantic regions, these changes in surface temperatures can have a direct impact on the buoyancy of surface waters, and can therefore reduce or intensify the formation of deep-water that need sufficient density to sink. In addition, human activities can impact the freshwater input in the subpolar gyre, for instance by modifying precipitation regimes or by boosting the Greenland Ice Sheet melting rate, and thus further increase the surface buoyancy and decrease dense water formation consequently (Gierz et al., 2015).

For our understanding of the future evolution of the AMOC and the potential impacts on the climate, it therefore appears crucial to study the different sources of variability and their role in the past, present, and future state of the AMOC. Our objective, in this work, is precisely to tackle this challenge and provide a comprehensive analysis of the AMOC variability in state-of-the-art climate models from the 6th Phase of the Coupled Model Intercomparison Project (CMIP6). In particular, we aim to separate the internal signal induced by internal modes of variability from the forced signal associated with anthropogenic fingerprint. The analysis benefits from a large body of work, over the last decades, dedicated to partitioning the variability (or the uncertainty) in climate simulations (Hawkins and Sutton, 2009, 2011; Yip et al., 2011; Sévellec and Sinha, 2018; Lehner et al., 2020; Zhang et al., 2023).

In this study, we take advantage of an improvement over the 6th phase of the CMIP project, namely the presence of relatively large ensemble simulations. Each ensemble consists of several simulations of a single model covering the same time period with the same forcing, but starting with different initial conditions. This initial difference together with the chaotic nature of the system, then, allows the simulations to sample the internal variability by covering the phase space of the system. Traditionally, internal variability was assessed by computing the variance of the residuals of a climate variable (after removing trends) over a fixed period, and was therefore often assumed to be stationary (e.g., Hawkins and Sutton, 2009). However, this hypothesis had already been highlighted in the seminal article (Hawkins and Sutton, 2009), and after being challenged in numerous articles, the latter demonstrated that the climate phase space evolves over time and that internal variability is anything but stationary when a forcing is applied (e.g., Cheng et al., 2016; MacMartin et al., 2016; Coquereau et al., 2025). It is therefore necessary to detach ourselves from this temporal dimension in order to accurately estimate this internal variability and its evolution. To evaluate the role of forcing in climate variability, CMIP6 simulations provide different Shared Socio-Economic Pathways (SSP) allowing to investigate the response of the system under various forcing intensities (O'Neill et al., 2016). These scenarios extend after the 1850–2015 historical period up to 2100 (at least). The current analysis is based on 10 ensembles, each of them being produced with a different model, to draw a robust “model-independent” picture of the variability, and to investigate the sources of uncertainty associated with the representation of the AMOC in the different models.

To analyze this high-dimensional dataset and separate the different factors of variability, we used a proven and state-of-the-art method for climate datasets called Analysis of Variance (ANOVA, Zwiers, 1996; Wang and Zwiers, 1999; Hingray et al., 2007; Yip et al., 2011; Zhang et al., 2023). This method allows us to separate the total variability into the direct effect of each dimension of the data set (i.e., time, scenarios, models, and realizations; referred to as “main effects” in the ANOVA framework) and the interactions among them. The ANOVA approach also enables us to move beyond another key assumption of the widely-accepted methodology (Hawkins and Sutton, 2009, 2011; Lehner et al., 2020) – namely, the additivity of variability/uncertainty components – by explicitly exploring their interactions. Zhang et al. (2023) computed an ANOVA decomposition involving three dimensions (3-way; i.e., models, realizations, and scenarios) and focusing on ensemble simulations. They applied this decomposition to temperature and precipitation, with a separation between ensemble members, scenarios, and models and showed that interactions account for almost half of the variance in surface temperatures. In this study, we propose incorporating the time dimension to examine how the interannual-to-decadal variability of the AMOC, including long-term trends, changes over time, by analyzing successive 30-year climate periods. Previous climate studies using ANOVA did not include time because the method was mainly used to measure uncertainty. A trend or event that is common across all scenarios, models, and ensemble members does not contribute to uncertainty per se, but rather to variability. By explicitly including time, we generalize the ANOVA framework to study not just uncertainty, but how variability itself changes over time. In the historical period, the time dimension is particularly critical, as it enables the detection of externally driven changes in the climate state, for example those linked to volcanic eruptions, in the absence of forcing scenarios.

It should be noted, however, that despite continuous progress, models still exhibit significant biases in their representation of the AMOC. This stems in part from the fact that the AMOC arises from a complex interplay of processes, many of which involve small-scale dynamics – such as mesoscale and submesoscale eddies or narrow boundary currents – that remain unresolved in most climate models (Hirschi et al., 2020; Jackson et al., 2020, 2023; Gou et al., 2024). As a result, many simulations produce an overturning circulation that is too shallow and a western boundary current that is overly strong, while underestimating AMOC variability on interannual to decadal timescales (Jackson et al., 2023; Gou et al., 2024). Substantial model uncertainty also persists in estimates of the AMOC strength at 26.5° N, and models differ markedly in their depiction of overturning within the western subpolar gyre, as well as the water exchanges with the Indo-Pacific Ocean (Weijer et al., 2020; Jackson et al., 2023; Baker et al., 2023). Beyond mean-state biases, differences in properties such as Labrador Sea salinity – sensitive to model and resolution – further influence AMOC by modulating the intensity and variability of dense water formation and, consequently, the strength and variability of the overturning itself (Jackson et al., 2020, 2023). It is important to keep in mind the biases of these models when interpreting the results.

In the next section, we will present the ANOVA methodology and the data used for this study. Section 3 will be dedicated to the results. We will start with a general overview of the evolution of AMOC and the dominant sources of variance (Sect. 3.1). Then, we will focus on the evolution of the different physical components of this variance during three periods of the time series representing three different regimes (Sect. 3.2). The final part of the results section concerns inter-model variability and associated uncertainty (Sect. 3.3). Finally, in Sect. 4, we will summarize and discuss the results in order to shed light on possible future evolutions of the AMOC, and to discuss their meaning from the point of view of uncertainty and predictability.

2 Materials and Methods

2.1 Material

The present work is based on state-of-the-art climate simulations from CMIP6. The AMOC intensity is derived from the maximum meridional overturning streamfunction in the Atlantic at 26° N. Here, we focus on initial-condition ensemble simulations to sample the phase-space and investigate the spread of the different possible trajectory as a proxy of internal variability (Fig. 1). Initial conditions are derived following a predefined strategy for the CMIP6 framework, starting with different years of the multi-secular preindustrial control run (known as piControl), which is run under fixed external forcing conditions from the year 1850 (Eyring et al., 2016). This initialization strategy does not impose a specific clock to individual ensemble members such that it is not possible to statistically distinguish one ensemble member from another (although they might have dynamical peculiarities with important implications, e.g. Hawkins et al., 2016). This is true within an ensemble, but is also true across models and scenarios as a result of their common initialization procedure. When performing model averaging in ANOVA, nothing imposes that member #00 of ensemble A must be averaged with member #00 rather than member #01 of ensemble B. This provides a first insight that averaging involved in ANOVA will strongly impact ensemble statistics, as will be shown later on. Among the models, three offers relatively important sizes (with 25, 30, and 40 members, see Table 1), and we will mostly focus our analysis on these models, which we refer to as the “large ensemble models”. We, nonetheless, extend the analysis to seven smaller ensembles (3–6 members) to test the robustness of our results and improve the representation of inter-model variability. We refer to these as the “small ensemble models”. When performing inter-model statistics, we adopt a model democracy approach (Knutti, 2010), assigning equal weights to each model. The time series are separated in two parts: an historical period from 1850 to 2014 where the forcing are based on observations and a projection period from 2015 to 2100. For the 21st century, three major SSP are studied to estimate the forcing-scenario variability: SSP1-2.6 (“Sustainability”), SSP2-4.5 (“Middle of the road”) and SSP5-8.5 (“Fossil-fueled development”). In the future scenarios, volcanic eruptions, which constitute short-term external forcings common to all members and can induce substantial climate variations, are not included, thereby removing a source of time variability. For more information on sources of climate variability in the SSP, see O'Neill et al. (2016). Overall, four dimensions are investigated: model, scenarios, ensemble, and time.

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f01

Figure 1Evolution of AMOC intensity anomaly. Each subplot presents historical time series (black) followed by the three scenarios SSP1-2.6 (blue), SSP2-4.5 (orange), or SSP5-8.5 (red) for a given model. For each model, all realizations/members of the ensemble are shown, and the relative intensity uses as a reference the ensemble average between 1850 and 1900.

Download

Ziehn et al. (2020)Olonscheck et al. (2023)Swart et al. (2019)Yukimoto et al. (2019)Tatebe et al. (2019)Danabasoglu et al. (2020)Sellar et al. (2019)Voldoire et al. (2019)Boucher et al. (2020)Rind et al. (2020)

Table 1List of models and members used in the study.

Download Print Version | Download XLSX

2.2 Methods

2.2.1 Analysis of Variance

The Analysis of Variance (ANOVA) method allows to decompose and attribute the variance in a multidimensional data set. It enables us to investigate the explanatory power of several qualitative factors (or dimensions, e.g. the various time steps, ensemble members, models, scenarios) on a quantitative variable (here, the AMOC intensity). It returns the role of each individual effect associated with single dimensions and the role of interactions associated with two or more dimensions. As described in Berrington de González and Cox (2007), interaction occurs when the separate effects of the factors do not combine additively. The ANOVA is thus generally represented as a linear decomposition of the total variance, with main effects depending on single dimensions and interactions depending on multiple factors that cannot be separated. The ANOVA is particularly useful when investigating three or more dimensions, which is difficult with more classical analyzes as covariance or correlation. There are, however, connections between covariance and ANOVA 2nd order interactions. Both methods assess relationships between variables. The covariance measures how two variables change together, while the interaction in the ANOVA captures how a combination of two dimensions explains the variability in a dataset that would not be captured by either factor alone. Therefore, both refer to a joint variability.

Originally, the one-dimension ANOVA (referred to as 1-way ANOVA) was first employed by Yates (1938) in research on agriculture. Since then, it has been extended to the 2-way ANOVA and extensively used in climate science to characterize variability in simulations (e.g., Zwiers, 1996; Wang and Zwiers, 1999). In particular, different works used ANOVA to investigate the sources of uncertainty in climate projections, to separate model and scenario uncertainties in CMIP analyzes (Yip et al., 2011), but also in more regional contexts (Hingray et al., 2007). More recently, a study took advantage of the larger ensembles available in CMIP6 to introduce the ensemble dimension directly in a 3-way ANOVA decomposition (Zhang et al., 2023). It therefore provides a separated dimension for internal variability, whereas this latter was previously represented as the residual of the decomposition.

As explained in the introduction, in the present work, we build upon the previous 3-way ANOVA (Zhang et al., 2023) and extend it to a 4-way formulation by adding time dimension to investigate the changes of interannual-to-decadal time variability (i.e., dynamical adjustment). This is done by applying a 30-year rolling window (a typical climate period, discussed below) for the decomposition.

The main idea behind ANOVA is to separate the variance in a multidimensional data set by computing the variance after averaging the data over some of the dimensions. The importance of a given dimension is thus highlighted by the amount of variability removed when averaging this dimension. If a large fraction of the variability is removed therefore the dimension is an important factor of variability. Initially, the multidimensional data set is not averaged and varies over the four dimensions $X (s, m, r, t)$ , where s represents scenario dimension, m represents various models, r the different realizations associated with the ensemble dimension, and t represents time. Then, the running window method is implemented by successively extracting 30-year subsets x_τ from the original dataset, each centered around a given time point τ, as follows:

\begin{matrix} (1) & x_{τ} (s, m, r, t) = X (s, m, r, t) for t \in [τ - 15, τ + 15] . \end{matrix}

In the analysis, it is important to acknowledge the two time axes: t, on which statistical analyses are computed, and τ, which is 30 years shorter and represents the centers of the successive windows of analysis. When averaging over a dimension, the data set cannot vary in that dimension, this is signaled by the bar over the x and by removing the dimension from the parentheses: for instance, $\overline{x_{τ}} (s, r, t)$ represents an average among models, evaluated at time τ (see Sect. 2.2.3 for technical details). This central idea of ANOVA can be implemented for the four dimensions and lead to different remaining spread in the dataset after averaging some of the dimensions.

We will now use a few specific examples to detail the calculation of the main effect and interactions. The formulations can be applied to all individual dimensions or combinations of dimensions. The realizations/ensemble main effect ( $V_{r}^{main}$ or called R in the paper) is the variance of the dataset with respect to the ensemble of realizations, when averaging all dimensions but r. This corresponds to:

\begin{matrix} (2) & R (τ) = V_{r}^{main} (τ) = \frac{1}{N_{r}} \sum_{r = 1}^{N_{r}} {[\overline{x_{τ}} (r) - \overline{x_{τ}}]}^{2}, \end{matrix}

with N_a the number of samples in dimension a, e.g. N_r is the number of ensemble members. To compute the interactions, we use the total variance budget involving several dimensions. The total variance involving realizations and time (r and t), for instance, corresponds to:

\begin{matrix} (3) & V_{r t}^{total} (τ) = \frac{1}{N_{r} N_{t}} \sum_{r = 1}^{N_{r}} \sum_{t = 1}^{N_{t}} {[\overline{x_{τ}} (r, t) - \overline{x_{τ}}]}^{2} . \end{matrix}

This total variance budget is also equal to the sum of the main effects of r and t and their interaction:

\begin{matrix} (4) & V_{r t}^{total} (τ) = V_{r}^{main} (τ) + V_{t}^{main} (τ) + V_{r t}^{interaction} (τ) . \end{matrix}

Thus, the 2nd order interaction between realizations and time (called RT hereafter) can be obtain with:

\begin{matrix} (5) & \begin{aligned} R T (τ) & = V_{r t}^{interaction} (τ) = V_{r t}^{total} (τ) - V_{r}^{main} (τ) - V_{t}^{main} (τ) \\ = \frac{1}{N_{r} N_{t}} \sum_{r = 1}^{N_{r}} \sum_{t = 1}^{N_{t}} [\overline{x_{τ}} (r, t)^{2} - \overline{x_{τ}} (r)^{2} - \overline{x_{τ}} (t)^{2} - {\overline{x_{τ}}}^{2} \\ - 2 \overline{x_{τ}} [\overline{x_{τ}} (r, t) - \overline{x_{τ}} (r) - \overline{x_{τ}} (t)]] . \end{aligned} \end{matrix}

We note that by averaging, the three terms in the last parenthesis will all become equal to $\overline{x_{τ}}$ . Thus, the expression can be simplified in:

\begin{matrix} (6) & R T (τ) = \frac{1}{N_{r} N_{t}} \sum_{r = 1}^{N_{r}} \sum_{t = 1}^{N_{t}} [\overline{x_{τ}} (r, t)^{2} - \overline{x_{τ}} (r)^{2} - \overline{x_{τ}} (t)^{2} + {\overline{x_{τ}}}^{2}] . \end{matrix}

The more classical 2nd order interaction formulation of ANOVA (presented in Eq. 7 or in Zhang et al., 2023) is equal to Eq. (6), as the cross terms disappear when averaging:

\begin{matrix} (7) & R T (τ) = \frac{1}{N_{r} N_{t}} \sum_{r = 1}^{N_{r}} \sum_{t = 1}^{N_{t}} {[\overline{x_{τ}} (r, t) - \overline{x_{τ}} (r) - \overline{x_{τ}} (t) + \overline{x_{τ}}]}^{2} . \end{matrix}

In the analysis, these interactions are named after the initial of the involved dimensions (e.g., RT corresponds to the interaction between time and ensemble dimensions). For higher-order interactions, we use the same principle by removing lower order terms from the total variance budget. For 3^rd order interactions, SRT, for example, reads:

\begin{matrix} (8) & \begin{aligned} S R T = V_{s r t}^{interaction} & = V_{s r t}^{total} - V_{s r}^{interaction} - V_{s t}^{interaction} \\ - V_{r t}^{interaction} - V_{s}^{main} - V_{r}^{main} - V_{t}^{main}, \end{aligned} \end{matrix}

and for 4th order, SMRT, for example, reads:

\begin{matrix} (9) & \begin{aligned} V_{s m r t}^{interaction} & = V_{s m r t}^{total} - V_{s m r}^{interaction} - V_{s m t}^{interaction} - V_{s r t}^{interaction} \\ - V_{m r t}^{interaction} - V_{s m}^{interaction} - V_{s r}^{interaction} - V_{m r}^{interaction} \\ - V_{s t}^{interaction} - V_{m t}^{interaction} - V_{r t}^{interaction} \\ - V_{s}^{main} - V_{m}^{main} - V_{r}^{main} - V_{t}^{main} . \end{aligned} \end{matrix}

Other methods have also been used to investigate the different sources of variability/uncertainty in climate simulations, such as the one used in Hawkins and Sutton (2009, 2011) or Lehner et al. (2020). While their method has the advantage of incorporating more models as it does not require ensembles, the internal variability is estimated as the residual of a polynomial fit. Here, given the importance of internal variability for our study, we decided to focus on ensemble simulations and use ANOVA. Another difference is the fact that the method developed by Hawkins and Sutton (2009) does not evaluate the importance of interactions among sources of variance. While this simplifies the analysis, here we will see that interactions play an important role and must be considered to fully understand the evolution of AMOC variability.

Our results do not appear sensitive, at least qualitatively, to the size of the time window (Fig. B2). However, it should be noted that increasing the window size reinforces the main effect of temporal variance and reduces the ensemble variance main effect. The reason for this is explained in the next section. As 30 years is the typical period for studying the state of the climate (Arguez and Vose, 2011), we chose this duration to ensure a minimum coherence of the climate state and provide a good representation of temporal variance, while avoiding oversmoothing that would make transitions between regimes much more difficult to detect. Conversely, results concerning inter-model variability are sensitive to the AMOC reference chosen, particularly depending on whether we use AMOC anomalies or absolute intensities. This sensitivity is due to the persistent AMOC bias in climate models and their difficulty to capture the right AMOC intensity (Weijer et al., 2020). In our work, we consider the AMOC anomaly relative to the period 1850–1900 because we are interested in the model uncertainty/variability associated with the model response to historical and future emissions rather than the initial and overall AMOC differences between climate models. Results with absolute AMOC values are discussed in Sect. 3.3 and shown in Fig. B3.

2.2.2 Interpretation and combination of ANOVA components

The important number of separated components returned by the ANOVA decomposition requires an integrative approach if one wants to provide a conceptually coherent dynamical interpretation. Comparing the ANOVA with more standard method for studying the variability allows us to contextualize the meaning of each main effect and interaction. Sources of variability in a dataset are often characterized by computing the variance across the dimension of interest and averaging the results among other dimensions. An interesting property of the ANOVA is that this statistic can be directly retrieved by summing the main effects and the interactions associated with the given dimension (Fig. B1). As an example, ensemble studies on internal variability usually rely on the computation of ensemble variance (e.g., Coquereau et al., 2024), which can be reconstructed from the ANOVA with:

\begin{matrix} (10) & \begin{aligned} Mean Ensemble Variance & = R + R T + S R + M R + S R T \\ + S M R + M R T + S M R T . \end{aligned} \end{matrix}

This combination of components – referred to as variance reconstruction in our work – directly equals to the classical variance, allows to better understand the ANOVA decomposition. However, with this reconstruction, the interactions are embedded within each interacting dimension and, thus, are accounted multiple times when deriving all dimensions. For example, RT will be taken into account both in the mean variance of the ensemble and in the mean variance of the time. As a result, the sum of the variance components exceeds the total variability of the dataset, i.e.:

\begin{matrix} (11) & \begin{aligned} Total Variance & \leq Mean Ensemble Variance \\ + Mean Time Variance \\ + Mean Model Variance \\ + Mean Scenario Variance . \end{aligned} \end{matrix}

An alternative combination method have been proposed to provide an overall picture of the sources of variability without over-estimating the total variability. Zhang et al. (2023) proposed a statistical separation of the sources based on the division of interactions between the involved components. For example, the 2nd order interaction between time and ensemble dimensions (i.e., RT) is divided by two and each half is allocated to one dimension: time or ensemble. Following this statistical separation method, the calculation for ensemble dimension leads to:

\begin{matrix} (12) & \begin{aligned} ENSEMBLE & = R + \frac{1}{2} (R T + S R + M R) \\ + \frac{1}{3} (S R T + S M R + M R T) + \frac{1}{4} S M R T . \end{aligned} \end{matrix}

In the next section, we will employ this separation method to provide an overview of the variance distribution. However, this method does not allow for a direct comparison with traditional variance computation (e.g., Mean Ensemble Variance). Moreover, the statistical separation relies on a dimensional-wise approach rather than physical considerations. As a result, a single physical mechanism driving changes in variance may influence several variance components simultaneously. For instance, the ST components, which might highlight differences in trends between scenarios and therefore appear naturally linked to the scenario dimension, would nevertheless be partitioned equally into the time dimension. Finally, compensations can occur between ANOVA components and cause spurious results with the statistical separation method.

Beyond these combination methods (i.e., variance reconstruction or statistical separation), we will analyze, in the present study, each component individually, as individual components provide important insights on the evolution of variance in the dataset.

Main effects are the most intuitive components of the ANOVA, as they represent the variance across a single dimension when other dimensions are averaged. As an illustration, model main effect (M) corresponds to the differences between model averages, where the mean across ensemble, scenarios, and time is computed for each model. The interaction terms can be more challenging to interpret as they represent the variability located in multiple dimensions. Approaching them from the perspective of internal variability makes it easier to understand their physical significance as this variability represent the full variability of the physical system under steady external forcing conditions (considering a single ensemble model). Consistently with the variance reconstruction method (Eq. 10), the mean ensemble variance is exactly equal to the sum of the realizations/ensemble main effect (R) and all interactions involving the ensemble dimension. This highlights that each dimension includes a part of internal variability. For time and ensemble dimensions, this is somehow natural, but it is also true for scenarios. When the trajectories separate under different forcing scenarios, a slight phase-shift of internal variability can appear. This phase shift represents the fraction of internal variability associated with the scenario dimension. Averaging over scenarios removes part of this phase shift and, consequently, part of the internal variability, similarly to averaging over a time window or across realizations. If all dimensions are sufficiently large (many members, many scenarios, long time period), the interaction between scenarios, realizations, and time (SRT) should capture the entire internal variability. However, if the dimensions are too small, SRT will not capture this entire variability, and the importance of SRT decreases, compensated by an increase in other main effects or interactions. To illustrate this, we designed, and analyzed with ANOVA, a synthetic model providing AMOC time series that closely mimic CMIP6 simulations (see A). This especially allows us to modulate the number of scenarios, which is challenging with “real” AMOC simulations. Increasing the variety of scenarios, lead to an increase of SRT and a decrease of RT, which demonstrate the presence of internal variability in the scenario dimensions as explained previously (Fig. A1). This corresponds to a “relocation” (as used later) of variability from one component to another – i.e., a shift in the component where variability is detected by the ANOVA. If the number of scenario is too small, only a part of internal variability will be canceled by averaging scenarios.

The same logic applies for time dimension. Over the CMIP6 historical period, R represents the part of the internal variability that is not captured in RT, i.e., the internal variability with a period larger than the rolling time window of 30 years. The sensitivity test on time windows depicts an anti-correlation of R with RT, and with the length of the time windows (Fig. B2). Therefore, R represents the low-frequency internal variability, and when the size of the time window increases there is, logically, less variability at lower frequency. After the beginning of scenarios this low-frequency internal variability shifts from R to SR, if the number and variety of scenario are sufficient. The fact that all components involving S are null at the beginning of the time series is due to the fact that all scenarios are merged, making SR and SRT equal to zero because averaging over scenarios does not remove any internal variability. Finally, ST represents the part of the scenario dispersion removed by the inherent smoothing of the time averaging, as shown in the sensitivity test on time window size (Fig. B2), where increasing the window size decreases S and increases ST, and vice versa.

2.2.3 Bootstrapping

To obtain robust results and assess the model uncertainty it is important to combine the different models. As the ensemble models have different sizes, we used a Bootstrap methodology to aggregate them. Only models of comparable size are combined. Large ensemble models (with 25 to 40 members) are combined together and smaller ensemble models (from 3 to 6 members) are combined together. The bootstrapping procedure is relatively simple. For each large ensemble model, we select randomly 20 members (with replacement) which are assembled with 20 members of each other large ensemble model. The ANOVA decomposition is thus applied on the 60 selected members from the three models. The selection of 20 random members and the ANOVA decomposition is then replicated a given number of time. In our case we use 100 resamplings and we did not observe substantial improvements by further increasing this number (e.g. to 1000). The results of the 100 resamplings are then averaged and analyzed. For the smaller members, the procedure is similar but by selecting 3 members per model instead of 20.

A sensitivity test was carried out to assess the impact of sub-sample size for large ensemble models (Figs. B4a, c, e and B5a, c). The test indicates that reducing the subsample size from 20 to 3 members leads to a decrease of internal variability and an increase of time variability. Specifically, during the historical period, we observe a transfer from R and RT to T. While in the 21st century, the transfer occurs from RT and SRT to ST. The size of the ensemble has therefore a direct impact on the level of internal variability that can be detected. This is because when three ensemble members are used, the ensemble average does not completely eliminate internal variability. We also assessed the impact of a combination of large and small ensemble models versus the use of small ensemble models only. In this case, we applied 3-member resampling for all ensemble models. While the model-associated factors of variability are unaffected (Fig. B5), the combination leads to a decrease in all physical factors. The decrease is relatively homogeneous between components, as the variance distribution remains unchanged (Fig. B4f).

3 Results

Before delving into the various factors and contributions of the variance, we will start by drawing a general picture of the AMOC intensity evolution in the CMIP6 dataset to better understand the changes of variability.

The ensemble-averaged AMOC time series for different models and scenarios appear relatively stable during most of the historical period, from the mid-19th to the mid-20th (Fig. 2a and b). Some models then present a weak increase attributed in the literature to increasing aerosol concentration (Menary et al., 2020; Robson et al., 2022). In the last decades of the 20th century, the AMOC intensity initiates a substantial decrease, with no visible differences among emission pathways up to the middle of the 21st century. Afterward, the scenarios start to separate with the strongest forcing presenting a continuous decline, while the weakest forcing scenario stabilizes or even slowly recovers depending on the models. At the end of the projection period, while the large ensemble models seem to converge under a given scenario in terms of relative decrease, the small ensemble models present growing differences.

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f02

Figure 2General picture of the AMOC evolution at 26° N from CMIP6 ensemble models. (a, b) Ensemble-averaged AMOC anomaly time series over historical (1850–2015, black) and projection (2015–2100, colors) periods. Reference value is the individual model average from 1850–1900. (c, d) AMOC variance associated with each of the four general factors based on the statistical separation methodology of Zhang et al. (2023). Model contribution has been represented with a dashed line after 2040 to highlight the lack of consistency between (c) and (d) regarding this result. Small and large ensemble models have the same y-axis, and a zoom-out box is displayed for the variance of the small ensemble models to show the full increase in model contribution (d). (e, f) Contribution of each factor to the total variance. Results are presented for large ensemble models (a, c, e) with 25–40 members and small ensemble models (b, d, f) with 3–6 members.

Download

This brief overview reveals substantial AMOC variability across time, scenarios, and models – consistent with previous studies (e.g., Weijer et al., 2020; Jackson et al., 2023) – from the last decades of the 20th century. This behavior is expected to persist and even intensify throughout the 21st century.

3.1 A general picture of the variability

To analyze the evolution of the variability and its various contributions, we use the Analysis of Variance (ANOVA) method, described in Sect. 2.2.1. The total variance from the multidimensional dataset is thus split into different contributions separated into main effects and associated interactions. Again, given the number of components, it is helpful to start the analysis simple and progressively increase complexity. For this purpose, we take advantage of the statistical separation method proposed by Zhang et al. (2023, and summarized in Eq. 12). For each dimension of the dataset, we gather its main effect and redistribute, with equivalent weights, the interaction terms (Fig. 2c–f).

The inter-model variability (representing differences among models) appears to be the dominant factor of variance among most of the studied period. After a low-level of variability up to the beginning of the 20th century, due to the AMOC intensity reference taken as the 1850–1900 model average, the contribution rapidly increases to dominate both in large and small ensemble models. Two consecutive periods of increase detach from this time series. The first, which starts around 1940 in the larger ensembles and reaches its peak in the second half of the 20th century, is likely associated with the differences of AMOC response to aerosol forcing among models (Menary et al., 2013, 2020; Robson et al., 2022). In small ensemble models, this increase is temporally shifted, emerging only after 1970. The second increase, starting at the very end of the 20th century, seems due to the important AMOC decline and to the differences of decline magnitude among models. For example, it has been shown that the rate of AMOC weakening is linked to surface salinity and dense water formation in the Labrador sea and further influenced by model horizontal resolution (Jackson et al., 2020, 2023). Historical AMOC strengths and pathways detected in models also appear to correlate with their rate of weakening (Baker et al., 2023).

After 2040, the trajectory of the time series becomes increasingly uncertain, as evidenced by the growing divergence between small and large ensemble models (grey dashed line). This uncertainty partly arises from differences in AMOC sensitivity to external forcing between the two groups. Specifically, the three large ensemble models show similar AMOC sensitivity, while the seven small ensemble models exhibit greater diversity in their responses to external forcing (e.g., Fig. 2 in Weijer et al., 2020). This study, which focuses on AMOC anomalies relative to the historical state (defined as the 1850–1900 mean), emphasizes the AMOC's sensitivity to external forcing, i.e. its tendency to shift states as forcing deviates from the historical baseline. Consequently, the seven small ensemble models, representing a broader range of sensitivities, exhibit an increasingly larger inter-model spread over the second half of the 21st century. In addition to the distinct characteristics of the individual models within each group and the greater number of small ensemble models analyzed, this divergence is also potentially associated to the poorer separation of internal variability in small ensembles. This may contribute to differences between models (Bonnet et al., 2021), although selecting only three members from each large ensemble model does not increase model variability (Fig. B5).

Ensemble and time variability are the two other important factors of variance during the historical period, with comparable orders of magnitude. Nevertheless, they have very different evolutions. Time variability is relatively stable during most of the historical period. Then, at the end of the 20th century, it presents a transient increase for a few decades as a signature of the AMOC decline. It is particularly striking in large ensemble models, where the time variance bump is perfectly phased with the AMOC decline and the associated second increase of inter-model variability. In small ensemble models, the transient aspect of the increase is slightly less clear, but still visible. The ensemble spread is also very stable at the beginning of the historical period. Then, concomitantly to the AMOC decline, it initiates a slow and progressive decrease up to the end of the 21st century.

Finally, the last component is the scenario variability, mainly associated with the divergence of future emission pathways. By definition, it is zero at the outset, since the scenarios only start in 2015. At the beginning of the 21st century (due to the 30-year running window), it thus starts to increase but stabilizes at a low level until the mid-21st century. After which, differences between scenarios, particularly those linked to the levels of emissions and mitigation, rapidly increase to dominate over both internal and time variability in small ensemble models, and even inter-model variability in large ensemble simulations.

To move a step further in understanding these changes of variability and the various phenomena involved, we will go beyond the statistical separation and analyze each individual contribution, including the interactions. As we shall see, interaction terms can be responsible for a large part of the variability and provide crucial information that must be analyzed.

3.2 A focus on the three regimes of variability

In this section, we leave aside the model-associated factors of variance that do not refer to directly observable variability, and focus on analyzing the “physical factors” of variance. As it shows the robustness of the study, model-associated variability and interactions are discussed later in Sect. 3.3.

Three phases and regimes of variability can be isolated in the simulated time series from 1850 to 2100. Each of them will be analyzed in the following subsections.

Special attention is paid to the analysis of large ensemble models as they provide a better cover of the phase-space and thus a more accurate picture of the variability, especially associated with internal factors. However, the small ensemble models remain important both for assessing the consistency of the results and for the analysis of model-associated variability (since there are more models with a small ensemble) performed in Sect. 3.3.

3.2.1 First century of the historical period controlled by internal variability

Among the different physical factors (Fig. 3), the interaction between time and realizations (RT) dominates the variability during most of the historical period: from the mid-19th to the last decades of the 20th century. It is particularly striking in large ensemble models where it represents around 80 % of the physical factor variability (Fig. 3e). In small ensemble models, it is lower but still represents approximately 40 % of the total physical factor variability (Fig. 3f). The remainder is divided into the main effects of realizations/ensemble (R) and time (T).

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f03

Figure 3Evolution of the variability of the physical factors including ensemble, time, and scenarios dimensions. (a–d) Variance associated with each main effect (a, b) and their interaction (c, d). (e, f) Relative variance contribution of each main effect and their interactions to the total physical factor variability. Interactions are highlighted by hatches. Letters refer to the dimension involved in the variance calculation, with a single letter for main effects and a combination of letters for interactions: S refers to the scenario dimension, T to time, and R to realizations (see Sect. 2.2.1). Results are presented for large ensemble models (a, c, e) with 25–40 members and small ensemble models (b, d, f) with 3–6 members.

Download

In a relatively stable context, such as the beginning of the historical period, quasi-ergodicity can be assumed (Hingray and Saïd, 2014). Ergodicity defines a situation where the ensemble and time statistics tend toward the same values. Here, we cannot speak of ergodicity because of the impact of the trend on time variance. However, as the trend is small during this period so is its impact on time variance allowing to use the term quasi-ergodicity. This can be highlighted by computing the ensemble-to-time variance ratio (Fig. 4a). During this regime, the ratio appears close to one, indicating relatively similar ensemble and time variance and thus quasi-ergodic conditions. In this context, internal variability is the dominant component among the “physical factors” of variability. This reflects in the RT interaction that gathers most of the internal variability, since a large part of the variability is removed in this period if either time or realizations are averaged.

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f04

Figure 4Time and ensemble variance. (a) Comparison of their variability with the ensemble-to-time variance ratio. We use this ratio as a proxy of ergodicity. Here ensemble and time variance are derived from the sum of interactions according to the variance reconstruction method described in Sect. 2.2.2. Similar results are obtained using the statistical separation methodology proposed by Zhang et al. (2023). (b) Evolution of ensemble variability (i.e., inter-member spread) over time. Thick lines represent large ensemble models and thin lines small ensemble models.

Download

A small fraction of internal variability remains, however, present in the R main effect and corresponds to the internal variability with period larger than the filtering window, as explained in Sect. 2.2.2 and shown in Fig. B2, where the importance of R appears anti-correlated with the size of the time window. This fraction is relatively similar between the two categories of models.

The fraction of variability located in T originates from two sources: the weak non-ergodicity, and the limited ensemble size. The weak non-ergodic aspect of the evolution is associated with the weak linear trend over this period and the response of the AMOC to aerosol forcing (Menary et al., 2013, 2020; Robson et al., 2022). This effect corresponds approximately to the magnitude of T seen in large ensemble models. Alternatively, the size of ensembles is mostly observable in small ensemble models, where the number of realizations is too limited and do not allows to cover the entire phase space and completely remove the internal variability when averaging. Thus, a significant part of internal variability remains after averaging realizations as observed with the larger magnitude of T in small ensemble models. This impact of ensemble size is demonstrated by the sensitivity test on the number of realizations in Fig. B4. This test shows that when decreasing the number of realizations taken from large ensemble models, a significant fraction of internal variability from RT is relocated to T, leading to a repartition similar to small ensemble models.

Going into the details, the ensemble-to-time variance ratio appears slightly smaller than one for small ensemble models, whereas it is slightly greater than one for the large ensembles (Fig. 4a). This can be directly linked to the previous results showing that the repartition of variance between T and RT components is sensitive to the ensemble size. The statistical separation method used in Fig. 4, attributes the variance in RT to both ensemble and time dimensions, ant the variance in T to time dimension only. Thus, when smaller ensemble models present a relocation from RT to T, it results in a decrease of ensemble-to-time variance ratio. In a ideal case, with a sufficient large ensemble model, one would expect the ensemble variance to be slightly larger than the time variance, since it allows to capture all frequency of internal variability while time variance miss by definition the periods larger than the time window. This effect is highlighted by the long de-correlation timescale of AMOC trajectories in CMIP6 (calculated as the e-folding timescale of the auto-correlation) that appears to range between 2 and 40 years (Fig. B6) suggesting non-negligible low frequency variability in the data.

The decomposition of the total variance budget involving time and ensemble dimensions is particularly insightful to understand the role of these two dimensions (Fig. 5a and b). It corresponds to the sum of the main effects and their interaction, as shown in (Eq. 4). Following (Eq. 3), it is also equal to the variance of the initial dataset after averaging models and scenarios. The decomposition of the variance clearly highlights the dominant role of the interaction especially in large ensemble models (Fig. 5a). In small ensemble models, the interaction is much lower consistently with previous results on subsample size. The time main effect is larger than the ensemble main effect showing that quasi-ergodicity is not reached because of the too small ensemble size.

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f05

Figure 5Evolution of variance associated with time and ensemble (a, b) and time and scenarios (c, d). Grey shaded areas represent the total variance associated with the considered dimensions. It corresponds to the sum of the colored lines representing the different variance contributions including the two main effects and their interaction. Letters refer to the dimension involved in the variance calculation, with a single letter for main effects and a combination of letters for interactions: S refers to the scenario dimension, T to time, and R to realizations (see Sect. 2.2.1). Results are presented for large ensemble models (a, c) with 25–40 members and small ensemble models (b, d) with 3–6 members.

Download

The first century of the historical period is therefore characterized by strong, stable internal variability, with a major role for the RT interaction underlining the quasi-ergodicity of the regime. Scenario variability remains zero, by definition, since the scenarios only start in 2015.

3.2.2 AMOC decline insensitive to forcing-scenarios up to mid-21st century

After the first regime of relatively stable AMOC intensity and variance components, the AMOC enters a phase of intense decline reflected both in the statistical separation (Fig. 2) and by the substantial and transient increase of T main effect (Fig. 3e and f). This regime is thus associated with a clear loss of ergodicity (Fig. 4a), where the ensemble-to-time variance ratio presents a decrease of 40 % to 60 % for large and small ensemble models, respectively.

Around 2000, the increase in the time main effect caused it to exceed the realizations/ensemble main effect in the large ensemble models (Fig. 5a). In addition to the increase of T, this change of dominant factor is also due to the decline of R, that follows the same decreasing pathway as RT. During this regime R losses around $3 / 4$ of its variance in both ensembles (Fig. 3).

From 2015, scenarios start and the associated variability emerges (Fig. 2). This emergence is, however, not generated by the scenario main effect (S) but by the interaction between time, scenarios, and realizations (SRT, Fig. 3). This increase of scenario-associated variability before stabilizing, is therefore not the direct effect of the forcing but, rather, a sort of chaotic internal variability triggered by slight differences among scenarios allowing the simulations to spread. Again, during the historical period, the three time series corresponding to scenarios for each member of models are perfect replicas of the historical simulation. When the scenarios formally begin in 2015, small differences or perturbations in forcing emerge among them due to the progressive separation of scenario trajectories, for instance in terms of CO₂ emissions (O'Neill et al., 2016). Although these differences are initially minor, they are amplified by the chaotic nature of the system, causing the three scenario time series to spread similarly to pseudo-ensemble members, as evidenced by their appearance in SRT. This internal and chaotic nature of SRT variability is also underlined by the magnitude of this variability, which appears to be three times as large in large ensemble models as in small ones, consistently with RT that is greater in large ensembles. The absence of direct scenario main effect variability is also evident by S, that remains at zero before 2040–2050 (Fig. 5c and d). This particular behavior was highlighted by Weijer et al. (2020) and acknowledged in the latest Intergovernmental Panel on Climate Change (IPCC) report (AR6, Lee et al., 2021, p. 576).

This internal and chaotic aspect of SRT interaction provides a first explanation for the common decline of R and RT, due to the relocation of a substantial fraction of the internal variability originally located in these two components toward SRT (consistently with interpretation of interactions provided in Sect. 2.2.2). However, this decrease may also be driven by an overall decline of the total internal variability if the latter is sensitive to the AMOC intensity. We investigate this second factor by computing for each individual model and scenario the evolution of ensemble variance over time (Fig. 6). This analysis shows an overall decrease of the ensemble variance over time, illustrating a contraction of the phase-space. To assess this visual decline, we compute the average ensemble variance over 50-year time window in the historical and projection periods. In the historical period, we select the 1900–1950 window, located a few decades after the beginning of the historical simulations (allowing for possible AMOC adjustment from the control preindustrial forcing to the historical one) and before the peak of aerosol forcing that occurred in the latter half of the 20th century. For the projection, we select the last 50 years to target the potential maximum impact of scenarios. In all models, the 1900–1950 period presents a larger ensemble variance than the 2050–2100 one, with an intensifying decrease over time. If we compare now the different scenarios, the majority of models ( $7 / 10$ ), including the three large ensemble models, presents weaker ensemble variance under SSP5-8.5 than under SSP1-2.6; $6 / 10$ models present weaker variability under SSP5-8.5 than SSP2-4.5; and $8 / 10$ a weaker variability under SSP2-4.5 than under SSP1-2.6. For the three large ensemble models, the ensemble spread is sorted according to scenario intensity, except for CanESM5 where SSP2-4.5 shows a slightly weaker variance than SSP5-8.5. These results suggest a strong link between forced AMOC weakening and decrease in the AMOC ensemble variance. The decline of R and RT seems, therefore, driven by the combined effect of a declining total internal variability and a relocation of a part of this variability from R and RT toward SRT under the effect of the scenario-perturbed pseudo members.

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f06

Figure 6Evolution of internal variability, measured by the ensemble variance, over time and scenarios. Each subplot presents historical time series (black) followed by the three scenarios SSP1-2.6 (blue), SSP2-4.5 (orange), or SSP5-8.5 (red) for a given model. The values written in each subplots correspond to 50-year averaged period of internal variability: 1900–1950 for historical (black) and 2050–2100 for scenarios (colors). A 30-year rolling averaged is applied to isolate long term trends.

Download

Finally, the last component evolving during this period is the interaction between time and scenarios (ST) that represent in a way the scenario uncertainty on temporal trends. The very weak increase of ST when scenarios appear, remaining at a very low level up to 2050, especially in large ensemble, underline the previously discussed absence of inter-scenario differences before mid-21st century. The analysis of terms included in the total variance involving time and scenario dimensions also confirms that time is by far the leading factor of the associated variability during this phase, with S and ST negligible compared to T (Fig. 5c and d). During this regime, while the scenarios have started, the AMOC variability remain driven by the declining trend associated with a dynamical-adjustment without direct impact of forcing scenarios.

Beyond the AMOC decline associated with the transient increase in time variability, the T main effect initially decreases and then stabilizes together with SRT around 2050. RT interaction decrease rate also flattens, while ST only exhibits a very weak increase. Before the start of the third regime, we therefore observe an intermediate phase where all components of variability appear stable. This short phase around 2040–2050 is dominated by the SRT interactions associated with a “scenario-perturbed” internal variability. This is, in a way, another phase were internal variability is important despite the significant trend. In small ensemble models, while most of the components present a clear signature of this intermediate regime, it is less clear for T main effect. We suggest this is due to the less accurate diagnostic of internal variability in small ensemble models, hence impacting the separation of internal and dynamical-adjustment variability.

3.2.3 Late separation of anthropogenic emissions scenarios after the mid-21st century

The third regime starts with the emergence of the scenario main effect S. It presents a steep increase from the middle of the 21st century and becomes, in a few decades, the dominant factor of physical variability. In parallel, the ST interaction presents a weaker but still substantial increase, since the scenarios have an important role in AMOC evolution over time. As described in Sect. 2.2.2, the ST interaction also represent the smoothing of scenarios by the time window and can thus be directly associated to scenario variability and effect.

The inter-scenario variability becomes the first order factor of variance even above inter-model spread in large ensemble models over the last decades of the 21st century. The increasing and prominent role of scenarios/forced variability demonstrates that the system leaves a regime driven by past, historical forcings to enter in a phase of forced evolution driven at first order by the future emission scenarios.

After focusing on the physical factors of variability, we can investigate variability associated with inter-model differences and the drivers of these differences throughout the multisecular simulations. This will also measure the confidence and robustness of our physical factor results.

3.3 Evolution of inter-model variability and uncertainty

Overall, the dominant factor in model-associated variability lies in the model main effect (M, Fig. 7). It follows the same two-step increase of inter-model variability, described at the beginning with the statistical separation method (Fig. 2). As a reminder the two steps are associated with two uncertainties of AMOC response among models: (i) the response to the increase of aerosol concentration over the second half of the 20th century, and (ii) the magnitude of the AMOC decline beginning in the late 20th century. It is difficult to analyze the M main effect at the end of the simulations considering the differences between small and large ensemble models. Indeed, while large ensemble models present a decreasing inter-model variability due to the convergence of relative AMOC intensity decline, small ensemble models present an exponential increase of inter-model variability, linked to a divergence in terms of relative AMOC intensity decline. Taking aside potential inherent characteristics of small ensembles (which would break model democracy ; Knutti, 2010) and the impact of internal variability on the model trends, we interpret this as being also associated with the particular combination of models in the “small ensemble” and “large ensemble” category, which happen to exhibit different AMOC sensitivities to forcing. Although the processes underlying this sensitivity remain unclear, recent research highlights the potential roles of water mass pathways, overflow dynamics, eddy mixing, Gulf Stream separation, and model resolution (Fox-Kemper et al., 2019; Jackson et al., 2023, 2025). These results are also sensitive to the reference chosen for computing AMOC anomalies, as discussed in the method section. Indeed, when considering the absolute AMOC intensity, we observe a convergence in AMOC intensity among the small ensemble models and a divergence among the large ensemble models after 2040 (Fig. B3).

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f07

Figure 7Evolution of the model-associated factors of variability. (a, b) Variance associated to model main effect and interactions. Small and large ensemble models have the same y-axis, and a zoom-out box is displayed for the variance of the small ensemble models to show the full increase in model contribution (b). (c, d) Variance contribution of model main effect and interactions to the total model-associated variability. Letters refer to the dimension involved in the variance calculation, with a single letter for main effects and a combination of letters for interactions: S refers to the scenario dimension, T to time, M to model, and R to realizations (see Sect. 2.2.1). Results are presented for large ensemble models (a, c) with 25–40 members and small ensemble models (b, d) with 3–6 members.

Download

The second key property of the inter-model variability is the fact that it closely follows the evolution of physical components, especially internal variability. Most of the interactions involving models and realizations follow the same path as the corresponding interaction without the model dimension (Figs. 3 and 7). For instance, the interaction between models, time, and ensemble (MRT) evolves like the interaction between time and realizations (RT). This is the same for MR and R or SMRT and SRT. This illustrates that a fraction of internal variability is present in the model dimension and that averaging across models removes a part of internal variability – consistent with the logic previously discussed for the scenario dimension (see Sect. 2.2.2). This behavior reflects the fact that inter-model differences also manifest in terms of internal variability, which recent studies have linked to salinity biases and differences in dense water formation in the Labrador Sea, as well as to model horizontal resolution (Jackson et al., 2020, 2023).

In small ensemble models, MT present a substantial level of variability prior to the 21st century, which indicates that small ensemble models were presenting different time evolution across this period and that a part of time variability has been removed with inter-model averaging. These greater inter-model differences in terms of trends is consistent with the greater AMOC sensitivity observed in models of the “small ensemble” category (Fig. 2).

3.4 Physical attribution of the ANOVA components

To summarize the previous findings on the ANOVA components, we propose a third approach – complementary to statistical separation and variance reconstruction – that organizes the variance components following a physical attribution. This physical attribution method distributes the terms into four different sources of variance (Fig. 8), following the interpretation of the ANOVA components detailed in Sect. 2.2.2. The first source is the internal variability that corresponds to the mean ensemble variance (Eq. 10) and gathers all terms involving ensemble dimension (R, RT, SR, MR, SRT, SMR, MRT and SMRT). This allows the fraction of internal variability located in each dimension to be robustly separated and analyzed as a distinct physical phenomena. The second source is the scenario-forced variability that represents the effects of future forcing applied to the system and the AMOC uncertainty associated with future anthropogenic emissions. It is constructed from the scenario main effect and all interactions smoothing the scenario differences (associated with time and/or inter-model averaging: S, ST, SM, SMT). The third factor represents the dynamical adjustment associated with the response to the historical forcing located in the time main effect and the interaction between time and model that represents inter-model differences smoothing this response (T and MT). Finally, the inter-model variability represents the evolution of inter-model differences over time and is only constituted by the model main effect (M).

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f08

Figure 8Summary picture of the sources of AMOC variability. (a, b) AMOC variance associated with each of the four sources of variance based on the physical attribution methodology described in Sect. 4. Line style represents the size of the time window, either 30 (solid line), or 50 years (dashed line). Small and large ensemble models have the same y-axis, and a zoom-out box is displayed for the variance of the small ensemble models to show the full increase in model contribution. Results are presented for large ensemble models (a) with 25–40 members and small ensemble models (b) with 3–6 members.

Download

4 Summary and discussion

The analysis of AMOC variability at 26° N in CMIP6 ensembles based on a 4-way analysis of variance, depicts three successive phases associated with three distinct regimes of AMOC variability, each dominated by a particular factor, that can be summarized using the physical attribution method presented previously.

The first phase from 1850 to around 1990 presents a relatively stable AMOC, whose variability is driven by internal variability, associated with time and ensemble dimension given the quasi-ergodicity of this regime;
The second phase from 1990 to 2050 is characterized by a decline of the AMOC associated with an adjustment to past historical anthropogenic emissions. It is thus a regime where time changes dominate, the forcing-scenario effect remaining weak. The importance of this regime is particularly evident when employing a 50-year window for the ANOVA that allows to take better account of this declining trend (Fig. 8);
Finally the third phase starting around 2050 is a regime forced by emission scenarios, where the impact of anthropogenic emissions pathways takes control on the divergence of the simulated trajectories.

An important finding of this work is the existence of two increasing phases of scenario variability. The first one, after 2015, rapidly stabilizes in a few decades. The second one, emerging after 2050, presents an exponential increase of the variability. During the first phase, the scenarios do not induce substantial forced divergence between simulations. Instead, they introduce minor perturbations that effectively expand the ensemble spread, mimicking “larger ensembles”. This is particularly evident as the main signal associated with scenarios in this period lies in the interaction between scenarios, realizations, and time dimensions (SRT). This also leads us to detect an intermediate regime around 2040–2050, where most of the variability components stabilize and where the system is driven by this “scenario-perturbed” internal variability. In the second phase, the increase of variability is purely forced by the divergence of scenarios, and scenarios main effect takes the lead.

The AMOC intensity decline associated with the transient increase of time variability, could suggest that this evolution is not forced by scenarios nor synchronous to anthropogenic emissions, because of the absence of clear scenario separation before 2050. In this respect, AMOC behavior differs from global CO₂ concentration, anthropogenic radiative forcing, and temperature change, that already start diverging one or two decades earlier (O'Neill et al., 2016). Although this trend could be influenced by regional aerosol forcing (e.g., Deser et al., 2020; Menary et al., 2020; Robson et al., 2022), analyses of the local radiative imbalance over the North Atlantic (45–70° N, 60° W–20° E) also show an early and rapid separation among scenarios (Fig. B7).

This study also provides robust evidence that the AMOC internal variability in CMIP6 decreases in the projection period compared to historical reference. We also detected a potential link between this decline and emission pathways intensity. This is consistent with MacMartin et al. (2016) results with a single Earth System Model or Cheng et al. (2016) on interdecadal variability of the previous CMIP exercise. This also aligns with previous results on large-scale climate showing that internal variability is highly sensitive to forced variability and mean state (e.g., Coquereau et al., 2024). The decline of internal variability observed in the present work corresponds to a contraction of the phase-space and therefore an increasing predictability of the AMOC in the future from one time step to another. However, it does not substantially increase our ability to predict the future of the AMOC, since internal variability is a small component of the AMOC uncertainty, smaller than model or scenario-uncertainty after mid-century. Yet, internal variability seems to decrease concomitantly to the AMOC magnitude, thus it could be a potential proxy or warning signal of an AMOC decline, often hardly detectable because of the large AMOC interannual variability (Lobelle et al., 2020). Furthermore, as seen in the results section, this progressive decline of internal variability coincide with the relocation of variance from R and RT to SRT. The physical attribution method facilitates the isolation of the internal variability decline signal without being impacted by the relocation of variance between different components related to internal variability.

Finally, the physical attribution shows that model-only variability is someway over-estimated by the use of statistical separation and that model uncertainty, while being a major contributor, do not largely dominate the other factors. However, its contribution in AMOC projection uncertainty (mostly associated with AMOC absolute intensity and climate sensitivity, as discussed in Sect. 3.3 and in Weijer et al. (2020)) has appeared as the strongest independent source of uncertainty (i.e. with the largest main effect), suggesting that further progress in AMOC modeling is needed to obtain more reliable AMOC projections.The disagreement between small and large ensemble models at the end of the 21st century raises interesting questions about ensemble strategies and (i) the use of a small number of large ensemble models, versus (ii) the use of a large number of small ensemble models. There is thus a choice to be made between relying on few models (i) or not having a good separation of internal variability (ii). Here, all results are displayed separately between small and large ensemble models to give readers all the information. The fact that interactions involving model and ensemble dimensions follow the same evolution as the corresponding interactions without model dimension is interesting from an uncertainty perspective. Indeed, it underlines that when the internal variability – which represents the range of possible states the system can occupy under a given forcing – is large, it is more difficult for the models to accurately find the correct state. Furthermore, the observed decline of ensemble variance when the AMOC intensity decreases, corresponds to a contraction of the space of possible states, and consequently one expects the absolute difference between models concerning this phase-space location to reduce.

A limitation of this study, tied to modeling capacity, lies in the relatively small number of large ensemble climate models and the limited size of most ensembles. This constraint is evident, for example, in the analysis of ensemble variance within individual models (Fig. 6), where variability exhibits noisy behavior and notable differences across small ensemble models, despite a generally consistent pattern of decline. Additionally, while some high-resolution climate simulations for a single member are becoming available, large-scale ensemble simulations remain restricted to relatively coarse spatial resolutions – on the order of one degree in the ocean. In this context, different research groups have highlighted the importance of fine-scale processes, such as mesoscale eddies, overflows and convection, in influencing the AMOC, raising concerns about the reliability of AMOC trends in coarse-resolution (one-degree) models (e.g., Hirschi et al., 2020; Jackson et al., 2020; Hewitt et al., 2022; Jackson et al., 2023; Gou et al., 2024). However, regarding AMOC strength at 26° N, a recent study found strong agreement between high- and low-resolution simulations (with 0.1° resolution in the ocean, Gou et al., 2024), which is the central focus of this work.

The ANOVA method employed in this study enables a detailed analysis of interaction terms between sources of variance, which is not possible with the widely used approach of Hawkins and Sutton (2009). This approach allows us to go beyond the assumption of additivity of variance associated with each individual dimension and directly examine cross-terms and interdependencies between different dimensions. With the ANOVA, by combining the main effect of each dimension with their interactions, we achieve a full reconstruction of the total variance. This contrasts with the previous method, where total variance was simply the sum of explicitly computed components.

Appendix A: A stochastic model to interpret ANOVA interactions

To better understand the role and physical meaning of these interactions, we designed a minimalist synthetic model representing AMOC trajectories for different members and scenarios, with prescribed slopes and a superimposed random internal variability (see Fig. A1, first panel). The model is based on a stochastic mean-reverting process called Ornstein-Uhlenbeck process, similar to the model proposed in Hasselmann (1976). The evolution of our model is governed by the following equation:

\begin{matrix} (A1) & d ψ_{t} = - λ (ψ_{t} - F_{t}) d t + σ (ψ_{t}) d W_{t}, \end{matrix}

with ψ_t the AMOC intensity, λ a damping term (also known as mean-reversion rate term, here equal to 0.3 yr⁻¹). In this model, we used a slight variation of a classical Ornstein-Uhlenbeck process (as used in Hasselmann, 1976) since the system is not pull toward the mean, but toward F_t a given intensity that evolves over time. This term represents the forcing and drives the intensity decline. It is constant up to year 150 (i.e. $F_{t} = ψ_{0} = 1$ Sv) and then declines, following a common trend across scenarios up to year 200 (i.e. $F_{t} = ψ_{150} + β t$ , with $β = - 0.006$ Sv yr⁻¹), and with a scenario-dependent trend afterward (i.e. $F_{t} = ψ_{200} + γ t$ , with γ ranging from −0.01 to +0.002 Sv yr⁻¹). W_t is the Wiener process responsible for the stochastic fluctuations representing internal variability. The intensity of these fluctuations is set by σ that depends on the AMOC intensity such that σ(ψ_t)=αψ_t, with α a constant, here equal to 0.04 yr $^{- 1 / 2}$ . Parameters were chosen to mimic the evolution of the AMOC time series observed in CMIP6 models. 1000 members have been computed to ensure a robust evaluation of the internal variability and a 250-year spin-up has been performed to let the system adjust before the analysis.

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f09

Figure A1ANOVA decomposition of the synthetic data set. (Top panel) Synthetic AMOC trajectories for 1000 members and 3 scenarios, with a constant first period (years 0–150) where all scenarios are merged, a second period (years 150–200) with the same slope between scenarios but realizations show phase shifts in variability, and a third period (after year 200) where scenarios start to separate and follow different slopes. Internal variability is represented by a random Gaussian noise scaling with the AMOC intensity such that σ(ψ_t)=αψ_t, with α=0.04 yr $^{- 1 / 2}$ ; dt=1 years. (Middle and bottom panels) ANOVA decomposition of the synthetic data set with separation between the main effects of time (T), scenarios (S), and realizations (R), as well as the interactions between these dimensions. The mean variance of the ensemble (red dotted line) is calculated as the variance of the ensemble (across realizations) averaged over the scenarios and over a sliding window of 30 years. The total internal variability from the ANOVA is calculated from the sum of all factors associated with the R dimension (R, RT, SR, and SRT, black line). The sensitivity of the results to the number of scenarios is represented by different lines of the same color: 3 scenarios in solid line, 5 scenarios in dashed line, 50 scenarios in dash-dotted line, and 100 scenarios in dotted line.

Download

Appendix B: Additional figures

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f10

Figure B1Statistical variance reconstructed from ANOVA components. The mean variance computed across each dimension is compared to the sum of all components involving this dimensions. Results are presented for large ensemble models (left) with 25–40 members and small ensemble models (right) with 3–6 members. Results indicate that the two methods lead to exactly equal time series.

Download

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f11

Figure B2Sensitivity test on time window size. Variance associated to each main effect (a–d) and interaction (e, f). (c, d) Zoom on ensemble main effect. Line thickness represents the size of the time window from 10 (thin) to 50 years (thick). Results are presented for large ensemble models (a, c, e) with 25–40 members and small ensemble models (b, d, f) with 3–6 members.

Download

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f12

Figure B3Sensitivity test with AMOC absolute intensity. (a, b) AMOC absolute intensity time series over historical (1850–2015, black) and projection (2015–2100, colors) periods. (c, d) Variance associated with model main effect only (solid line) and with all model-associated factors combined with the separation methodology by Zhang et al. (2023) (dashed line). Values correspond to absolute AMOC intensity. Results are presented for large ensemble models (a, c) with 25–40 members and small ensemble models (b, d) with 3–6 members.

Download

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f13

Figure B4Sensitivity test on bootstrapping methodology: application to the variability of physical factors. (a–d) Variance associated to each main effect (a, b) and their interaction (c, d). (e, f) Relative variance contribution of each main effect and their interactions to the total physical factor variability. Left-hand panels show the sensitivity of the large ensemble results to the number of members in each resampling. Thin lines present the control experiment with 20 members of each model (identical to Fig. 3a, c, e) and thick lines present the results obtained using 3 members of each model per resampling. Right-hand panels show the sensitivity to combining large and small ensemble models. Thin lines present the control experiment with only small ensemble models (identical to Fig. 3b, d, f) and thick lines present the results obtained with all ensembles. 3-member resampling are used here.

Download

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f14

Figure B5Sensitivity test on bootstrapping methodology: application to the variability of model-associated factors. (a, b) Variance associated to model main effect and interactions. (c, d) Variance contribution of model main effect and interactions to the total model-associated variability. Left-hand panels show the sensitivity of the large ensemble results to the number of members in each resampling. Thin lines present the control experiment with 20 members of each model (identical to Fig. 3a, c, e) and thick lines present the results obtained using 3 members of each model per resampling. Right-hand panels show the sensitivity to combining large and small ensemble models. Thin lines present the control experiment with only small ensemble models (identical to Fig. 3b, d, f) and thick lines present the results obtained with all ensembles. 3-member resampling are used here.

Download

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f15

Figure B6Decorrelation timescale of AMOC intensity time series for each model. Evolution of the normalized auto-correlation with respect to the considered lag. Decorrelation timescale corresponds to the e-folding timescale, i.e. the lag when the normalized auto-correlation falls below $1 / e$ .

Download

https://esd.copernicus.org/articles/17/209/2026/esd-17-209-2026-f16

Figure B7Evolution of the net radiative imbalance averaged across the three large-ensemble models at the global scale (left) and regionally over the North Atlantic (45–70° N, 60° W–20° E; right). The historical period is shown in black, while future projections are shown in blue (SSP1-2.6), orange (SSP2-4.5), and red (SSP5-8.5). The multi-model mean is calculated using a bootstrapping approach, which is also used to estimate the 95 % confidence interval percentiles, displayed as shaded envelopes.

Download

Code and data availability

The program code (in Python) for computing the 4-way ANOVA is available at https://github.com/coquereau/ANOVA_4way, last access: 15 February 2026 (https://doi.org/10.5281/zenodo.18835920, Coquereau, 2026). CMIP6 ensemble model outputs can be downloaded from the various Earth System Grid Federation (ESGF) nodes. The article references for each model can be found in Table 1.

Author contributions

Conceptualization: A.C., F.S., and Q.J. Methodology: A.C., F.S., T.H., J.J.H, and Q.J. Investigation: A.C., F.S., and Q.J. Visualization: A.C. Supervision: F.S., T.H., and J.J.H. Writing – original draft: A.C., F.S., and Q.J. Writing – review and editing: A.C., F.S., T.H., J.J.H, and Q.J.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Views and opinions expressed are however those of the authors only and do not necessarily reflect those of the European Union or the European Climate Infrastructure and Environment Executive Agency (CINEA). Neither the European Union nor the granting authority can be held responsible for them.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

This work was supported by the ARVOR project funded by the LEFE IMAGO program, by the OceaniX project funded by the French ANR program, and by the ISblue project (Interdisciplinary graduate school for the blue planet, ANR-17-EURE-0015) funded by a grant from the French government under the program “Investissements d’Avenir”. This work was also supported by the EERIE project (Grant Agreement no. 101081383) funded by the European Union.

Financial support

This work was supported by the ARVOR project funded by the LEFE IMAGO program, by the OceaniX project funded by the French ANR program, and by the ISblue project (Interdisciplinary graduate school for the blue planet, ANR-17-EURE-0015) funded by a grant from the French government under the program “Investissements d’Avenir”. This work was also supported by the EERIE project (grant agreement no. 101081383) funded by the European Union.

Review statement

This paper was edited by Gabriele Messori and reviewed by four anonymous referees.

References

Arguez, A. and Vose, R. S.: The Definition of the Standard WMO Climate Normal: The Key to Deriving Alternative Climate Normals, B. Am. Meteorol. Soc., 92, 699–704, 2011. a

Armstrong McKay, D. I., Staal, A., Abrams, J. F., Winkelmann, R., Sakschewski, B., Loriani, S., Fetzer, I., Cornell, S. E., Rockström, J., and Lenton, T. M.: Exceeding 1.5 °C global warming could trigger multiple climate tipping points, Science, 377, eabn7950, https://doi.org/10.1126/science.abn7950, 2022. a

Baker, J. A., Bell, M. J., Jackson, L. C., Renshaw, R., Vallis, G. K., Watson, A. J., and Wood, R. A.: Overturning Pathways Control AMOC Weakening in CMIP6 Models, Geophys. Res. Lett., 50, e2023GL103381, https://doi.org/10.1029/2023GL103381, 2023. a, b

Berrington de González, A. and Cox, D. R.: Interpretation of interaction: A review, Ann. Appl. Stat., 1, 371–385, https://doi.org/10.1214/07-AOAS124, 2007. a

Böhm, E., Lippold, J., Gutjahr, M., Frank, M., Blaser, P., Antz, B., Fohlmeister, J., Frank, N., Andersen, M. B., and Deininger, M.: Strong and deep Atlantic meridional overturning circulation during the last glacial cycle, Nature, 517, 73–76, https://doi.org/10.1038/nature14059, 2015. a

Bonnet, R., Swingedouw, D., Gastineau, G., Boucher, O., Deshayes, J., Hourdin, F., Mignot, J., Servonnat, J., and Sima, A.: Increased risk of near term global warming due to a recent AMOC weakening, Nat. Commun., 12, 6108, https://doi.org/10.1038/s41467-021-26370-0, 2021. a

Boucher, O., Servonnat, J., Albright, A. L., Aumont, O., Balkanski, Y., Bastrikov, V., Bekki, S., Bonnet, R., Bony, S., Bopp, L., Braconnot, P., Brockmann, P., Cadule, P., Caubel, A., Cheruy, F., Codron, F., Cozic, A., Cugnet, D., D'Andrea, F., Davini, P., de Lavergne, C., Denvil, S., Deshayes, J., Devilliers, M., Ducharne, A., Dufresne, J.-L., Dupont, E., Éthé, C., Fairhead, L., Falletti, L., Flavoni, S., Foujols, M.-A., Gardoll, S., Gastineau, G., Ghattas, J., Grandpeix, J.-Y., Guenet, B., Guez, L. E., Guilyardi, E., Guimberteau, M., Hauglustaine, D., Hourdin, F., Idelkadi, A., Joussaume, S., Kageyama, M., Khodri, M., Krinner, G., Lebas, N., Levavasseur, G., Lévy, C., Li, L., Lott, F., Lurton, T., Luyssaert, S., Madec, G., Madeleine, J.-B., Maignan, F., Marchand, M., Marti, O., Mellul, L., Meurdesoif, Y., Mignot, J., Musat, I., Ottlé, C., Peylin, P., Planton, Y., Polcher, J., Rio, C., Rochetin, N., Rousset, C., Sepulchre, P., Sima, A., Swingedouw, D., Thiéblemont, R., Traore, A. K., Vancoppenolle, M., Vial, J., Vialard, J., Viovy, N., and Vuichard, N.: Presentation and evaluation of the IPSL-CM6A-LR climate model, J. Adv. Model. Earth Sy., 12, https://doi.org/10.1029/2019MS002010, 2020. a

Buckley, M. W. and Marshall, J.: Observations, inferences, and mechanisms of the Atlantic Meridional Overturning Circulation: A review, Rev. Geophys., 54, 5–63, https://doi.org/10.1002/2015RG000493, 2016. a

Cheng, J., Liu, Z., Zhang, S., Liu, W., Dong, L., Liu, P., and Li, H.: Reduced interdecadal variability of Atlantic Meridional Overturning Circulation under global warming, P. Natl. Acad. Sci. USA, 113, 3175–3178, https://doi.org/10.1073/pnas.1519827113, 2016. a, b

Coquereau, A.: coquereau/ANOVA_4way: Release of v1.0 (v1.0), Zenodo [code], https://doi.org/10.5281/zenodo.18835920, 2026. a

Coquereau, A., Sévellec, F., Huck, T., Hirschi, J. J., and Hochet, A.: Anthropogenic Changes in Interannual-to-Decadal Climate Variability in CMIP6 Multiensemble Simulations, J. Climate, 37, 3723–3739, https://doi.org/10.1175/JCLI-D-23-0606.1, 2024. a, b

Coquereau, A., Sévellec, F., Huck, T., and Fedorov, A. V.: Increase in ENSO frequency and intensity under 20th and 21st century warming: Insights from CMIP6 large ensembles, Geophys. Res. Lett., 52, e2025GL116541, https://doi.org/10.1029/2025GL116541, 2025. a

Danabasoglu, G., Lamarque, J.-F., Bacmeister, J., Bailey, D. A., DuVivier, A. K., Edwards, J., Emmons, L. K., Fasullo, J., Garcia, R., Gettelman, A., Hannay, C., Holland, M. M., Large, W. G., Lauritzen, P. H., Lawrence, D. M., Lenaerts, J. T. M., Lindsay, K., Lipscomb, W. H., Mills, M. J., Neale, R., Oleson, K. W., Otto-Bliesner, B., Phillips, A. S., Sacks, W., Tilmes, S., van Kampenhout, L., Vertenstein, M., Bertini, A., Dennis, J., Deser, C., Fischer, C., Fox-Kemper, B., Kay, J. E., Kinnison, D., Kushner, P. J., Larson, V. E., Long, M. C., Mickelson, S., Moore, J. K., Nienhouse, E., Polvani, L., Rasch, P. J., and Strand, W. G.: The Community Earth System Model Version 2 (CESM2), J. Adv. Model. Earth Sy., 12, https://doi.org/10.1029/2019MS001916, 2020. a

Dansgaard, W., Johnsen, S. J., clausen, H. B., Dahl-Jensen, D., Gundestrup, N. S., Hammer, C. U., Hvldberg, C. S., Steffensen, J. P., Sveinbjörnsdóttir, A. E., Jouzel, J., and Bond, G.: Evidence for general instability of past climate from a 250-kyr ice-core record, Nature, 364, 218–220, 1993. a

Deser, C., Phillips, A. S., Simpson, I. R., Rosenbloom, N., Coleman, D., Lehner, F., Pendergrass, A. G., DiNezio, P., and Stevenson, S.: Isolating the Evolving Contributions of Anthropogenic Aerosols and Greenhouse Gases: A New CESM1 Large Ensemble Community Resource, J. Climate, 33, 7835–7858, https://doi.org/10.1175/JCLI-D-20-0123.1, 2020. a

Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958, https://doi.org/10.5194/gmd-9-1937-2016, 2016. a

Fox-Kemper, B., Adcroft, A., Böning, C. W., Chassignet, E. P., Curchitser, E., Danabasoglu, G., Eden, C., England, M. H., Gerdes, R., Greatbatch, R. J., Griffies, S. M., Hallberg, R. W., Hanert, E., Heimbach, P., Hewitt, H. T., Hill, C. N., Komuro, Y., Legg, S., Le Sommer, J., Masina, S., Marsland, S. J., Penny, S. G., Qiao, F., Ringler, T. D., Treguier, A. M., Tsujino, H., Uotila, P., and Yeager, S. G.: Challenges and Prospects in Ocean Circulation Models, Front. Mar. Sci., 6, https://doi.org/10.3389/fmars.2019.00065, 2019. a

Ganachaud, A. and Wunsch, C.: Improved estimates of global ocean circulation, heat transport and mixing from hydrographic data, Nature, 408, 453–457, https://doi.org/10.1038/35044048, 2000. a

Gierz, P., Lohmann, G., and Wei, W.: Response of Atlantic overturning to future warming in a coupled atmosphere-ocean-ice sheet model, Geophys. Res. Lett., 42, 6811–6818, https://doi.org/10.1002/2015GL065276, 2015. a

Gou, R., Lohmann, G., and Wu, L.: Atlantic Meridional Overturning Circulation Decline: Tipping Small Scales under Global Warming, Phys. Rev. Lett., 133, 034201, https://doi.org/10.1103/PhysRevLett.133.034201, 2024. a, b, c, d

Hasselmann, K.: Stochastic climate models Part I. Theory, Tellus, 28, 473–485, https://doi.org/10.1111/j.2153-3490.1976.tb00696.x, 1976. a, b

Hawkins, E. and Sutton, R.: The Potential to Narrow Uncertainty in Regional Climate Predictions, B. Am. Meteorol. Soc., 90, 1095–1108, 2009. a, b, c, d, e, f, g

Hawkins, E. and Sutton, R.: The potential to narrow uncertainty in projections of regional precipitation change, Clim. Dynam., 37, 407–418, https://doi.org/10.1007/s00382-010-0810-6, 2011. a, b, c

Hawkins, E., Smith, R. S., Gregory, J. M., and Stainforth, D. A.: Irreducible uncertainty in near-term climate projections, Clim. Dynam., 46, 3807–3819, 2016. a

Henry, L. G., McManus, J. F., Curry, W. B., Roberts, N. L., Piotrowski, A. M., and Keigwin, L. D.: North Atlantic ocean circulation and abrupt climate change during the last glaciation, Science, 353, 470–474, https://doi.org/10.1126/science.aaf5529, 2016. a

Hewitt, H., Fox-Kemper, B., Pearson, B., Roberts, M., and Klocke, D.: The small scales of the ocean may hold the key to surprises, Nat. Clim. Change, 12, 496–499, https://doi.org/10.1038/s41558-022-01386-6, 2022. a

Hingray, B. and Saïd, M.: Partitioning Internal Variability and Model Uncertainty Components in a Multimember Multimodel Ensemble of Climate Projections, J. Climate, 27, 6779–6798, https://doi.org/10.1175/JCLI-D-13-00629.1, 2014. a

Hingray, B., Mezghani, A., and Buishand, T. A.: Development of probability distributions for regional climate change from uncertain global mean warming and an uncertain scaling relationship, Hydrol. Earth Syst. Sci., 11, 1097–1114, https://doi.org/10.5194/hess-11-1097-2007, 2007. a, b

Hirschi, J. J.-M., Barnier, B., Böning, C., Biastoch, A., Blaker, A. T., Coward, A., Danilov, S., Drijfhout, S., Getzlaff, K., Griffies, S. M., Hasumi, H., Hewitt, H., Iovino, D., Kawasaki, T., Kiss, A. E., Koldunov, N., Marzocchi, A., Mecking, J. V., Moat, B., Molines, J.-M., Myers, P. G., Penduff, T., Roberts, M., Treguier, A.-M., Sein, D. V., Sidorenko, D., Small, J., Spence, P., Thompson, L., Weijer, W., and Xu, X.: The Atlantic meridional overturning circulation in high-resolution models, J. Geophys. Res.-Oceans, 125, e2019JC015522, https://doi.org/10.1029/2019JC015522, 2020. a, b, c

Jackson, L. C., Roberts, M. J., Hewitt, H. T., Wood, R. A., Smith, R. S., Meccia, V., Park, W., Moat, B. I., and Keenlyside, N.: Impact of ocean resolution and mean state on the rate of AMOC weakening, Clim. Dynam., 55, 1711–1732, https://doi.org/10.1007/s00382-020-05345-9, 2020. a, b, c, d, e

Jackson, L. C., Hewitt, H. T., Bruciaferri, D., Calvert, D., Graham, T., Guiavarc’h, C., Menary, M. B., New, A. L., Roberts, M., and Storkey, D.: Challenges simulating the AMOC in climate models, Philos. T. R. Soc. A, 381, 20220187, https://doi.org/10.1098/rsta.2022.0187, 2023. a, b, c, d, e, f, g, h, i

Jackson, L. C., Chassignet, E. P., Danabasoglu, G., Treguier, A., and Zhang, R.: Toward Improving Representation of the Atlantic Meridional Overturning Circulation in Climate Models, B. Am. Meteorol. Soc., 106, E1032–E1036, https://doi.org/10.1175/BAMS-D-25-0078.1, 2025. a

Johns, W. E., Elipot, S., Smeed, D. A., Moat, B., King, B., Volkov, D. L., and Smith, R. H.: Towards two decades of Atlantic Ocean mass and heat transports at 26.5° N, Philos. T. R. Soc. A, 381, 20220188, https://doi.org/10.1098/rsta.2022.0188, 2023. a

Knight, J. R., Folland, C. K., and Scaife, A. A.: Climate impacts of the Atlantic Multidecadal Oscillation, Geophys. Res. Lett., 33, L17706, https://doi.org/10.1029/2006GL026242, 2006. a

Knutti, R.: The end of model democracy? An editorial comment, Climatic Change, 102, 395–404, 2010. a, b

Lee, J.-Y., Marotzke, J., Bala, G., Cao, L., Corti, S., Dunne, J., Engelbrecht, F., Fischer, E., Fyfe, J., Jones, C., Maycock, A., Mutemi, J., Ndiaye, O., Panickal, S., , and Zhou, T.: Future Global Climate: Scenario-based Projections and Near-term Information, in: Climate Change 2021 – The Physical Science Basis: Working Group I Contribution to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, Intergovernmental Panel on Climate Change (IPCC), Cambridge University Press, Cambridge, 553–672, ISBN 978-1-00-915788-9, https://doi.org/10.1017/9781009157896.006, 2021. a

Lehner, F., Deser, C., Maher, N., Marotzke, J., Fischer, E. M., Brunner, L., Knutti, R., and Hawkins, E.: Partitioning climate projection uncertainty with multiple large ensembles and CMIP5/6, Earth Syst. Dynam., 11, 491–508, https://doi.org/10.5194/esd-11-491-2020, 2020. a, b, c

Lenton, T. M., Held, H., Kriegler, E., Hall, J. W., Lucht, W., Rahmstorf, S., and Schellnhuber, H. J.: Tipping elements in the Earth's climate system, P. Natl. Acad. Sci., 105, 1786–1793, https://doi.org/10.1073/pnas.0705414105, 2008. a

Lobelle, D., Beaulieu, C., Livina, V., Sévellec, F., and Frajka-Williams, E.: Detectability of an AMOC decline in current and projected climate changes, Geophys. Res. Lett., 47, e2020GL089974, https://doi.org/10.1029/2020GL089974, 2020. a

MacMartin, D. G., Zanna, L., and Tziperman, E.: Suppression of Atlantic Meridional Overturning Circulation Variability at Increased CO₂, J. Climate, 29, 4155–4164, https://doi.org/10.1175/JCLI-D-15-0533.1, 2016. a, b

McManus, J. F., Francois, R., Gherardi, J.-M., Keigwin, L. D., and Brown-Leger, S.: Collapse and rapid resumption of Atlantic meridional circulation linked to deglacial climate changes, Nature, 428, 834–837, https://doi.org/10.1038/nature02494, 2004. a

Menary, M. B., Roberts, C. D., Palmer, M. D., Halloran, P. R., Jackson, L., Wood, R. A., Müller, W. A., Matei, D., and Lee, S.-K.: Mechanisms of aerosol-forced AMOC variability in a state of the art climate model, J. Geophys. Res.-Oceans, 118, 2087–2096, https://doi.org/10.1002/jgrc.20178, 2013. a, b

Menary, M. B., Robson, J., Allan, R. P., Booth, B. B. B., Cassou, C., and Gastineau, G.: Aerosol-forced AMOC changes in CMIP6 historical simulations, Geophys. Res. Lett., 47, e2020GL088166, https://doi.org/10.1029/2020GL088166, 2020. a, b, c, d

Olonscheck, D., Suarez-Gutierrez, L., Milinski, S., Beobide-Arsuaga, G., Baehr, J., Fröb, F., Ilyina, T., Kadow, C., Krieger, D., Li, H., Marotzke, J., Plésiat, É., Schupfner, M., Wachsmann, F., Wallberg, L., Wieners, K.-H., and Brune, S.: The New Max Planck Institute Grand Ensemble with CMIP6 forcing and high-frequency model output, J. Adv. Model. Earth Sy., 15, e2023MS003790, https://doi.org/10.1029/2023MS003790, 2023. a

O'Neill, B. C., Tebaldi, C., van Vuuren, D. P., Eyring, V., Friedlingstein, P., Hurtt, G., Knutti, R., Kriegler, E., Lamarque, J.-F., Lowe, J., Meehl, G. A., Moss, R., Riahi, K., and Sanderson, B. M.: The Scenario Model Intercomparison Project (ScenarioMIP) for CMIP6, Geosci. Model Dev., 9, 3461–3482, https://doi.org/10.5194/gmd-9-3461-2016, 2016. a, b, c, d

Rind, D., Orbe, C., Jonas, J., Nazarenko, L., Zhou, T., Kelley, M., Lacis, A., Shindell, D., Faluvegi, G., Romanou, A., Russell, G., Tausnev, N., Bauer, M., and Schmidt, G.: GISS Model E2.2: A climate model optimized for the middle atmosphere – Model structure, climatology, variability, and climate sensitivity, J. Geophys. Res.-Atmos., 125, https://doi.org/10.1029/2019JD032204, 2020. a

Robson, J., Menary, M. B., Sutton, R. T., Mecking, J., Gregory, J. M., Jones, C., Sinha, B., Stevens, D. P., and Wilcox, L. J.: The Role of Anthropogenic Aerosol Forcing in the 1850–1985 Strengthening of the AMOC in CMIP6 Historical Simulations, J. Climate, 35, 6843–6863, https://doi.org/10.1175/JCLI-D-22-0124.1, 2022. a, b, c, d

Sellar, A. A., Jones, C. G., Mulcahy, J. P., Tang, Y., Yool, A., Wiltshire, A., O'Connor, F. M., Stringer, M., Hill, R., Palmieri, J., Woodward, S., de Mora, L., Kuhlbrodt, T., Rumbold, S. T., Kelley, D. I., Ellis, R., Johnson, C. E., Walton, J., Abraham, N. L., Andrews, M. B., Andrews, T., Archibald, A. T., Berthou, S., Burke, E., Blockley, E., Carslaw, K., Dalvi, M., Edwards, J., Folberth, G. A., Gedney, N., Griffiths, P. T., Harper, A. B., Hendry, M. A., Hewitt, A. J., Johnson, B., Jones, A., Jones, C. D., Keeble, J., Liddicoat, S., Morgenstern, O., Parker, R. J., Predoi, V., Robertson, E., Siahaan, A., Smith, R. S., Swaminathan, R., Woodhouse, M. T., Zeng, G., and Zerroukat, M.: UKESM1: Description and evaluation of the U.K. Earth System Model, J. Adv. Model. Earth Sy., 11, 4513–4558, https://doi.org/10.1029/2019MS001739, 2019. a

Sévellec, F. and Sinha, B.: Predictability of Decadal Atlantic Meridional Overturning Circulation Variations, in: Oxford Encyclopedia of Climate Science, Oxford University Press, Oxford University Press, https://doi.org/10.1093/acrefore/9780190228620.013.81, 2018. a

Srokosz, M., Baringer, M., Bryden, H., Cunningham, S., Delworth, T., Lozier, S., Marotzke, J., and Sutton, R.: Past, Present, and Future Changes in the Atlantic Meridional Overturning Circulation, B. Am. Meteorological Society, 93, 1663–1676, https://doi.org/10.1175/BAMS-D-11-00151.1, 2012. a

Stommel, H.: Thermohaline Convection with Two Stable Regimes of Flow, Tellus, 13, 224–230, https://doi.org/10.1111/j.2153-3490.1961.tb00079.x, 1961. a

Swart, N. C., Cole, J. N. S., Kharin, V. V., Lazare, M., Scinocca, J. F., Gillett, N. P., Anstey, J., Arora, V., Christian, J. R., Hanna, S., Jiao, Y., Lee, W. G., Majaess, F., Saenko, O. A., Seiler, C., Seinen, C., Shao, A., Sigmond, M., Solheim, L., von Salzen, K., Yang, D., and Winter, B.: The Canadian Earth System Model version 5 (CanESM5.0.3), Geosci. Model Dev., 12, 4823–4873, https://doi.org/10.5194/gmd-12-4823-2019, 2019. a

Sévellec, F. and Fedorov, A. V.: Millennial Variability in an Idealized Ocean Model: Predicting the AMOC Regime Shifts, J. Climate, 27, 3551–3564, https://doi.org/10.1175/JCLI-D-13-00450.1, 2014. a

Tatebe, H., Ogura, T., Nitta, T., Komuro, Y., Ogochi, K., Takemura, T., Sudo, K., Sekiguchi, M., Abe, M., Saito, F., Chikira, M., Watanabe, S., Mori, M., Hirota, N., Kawatani, Y., Mochizuki, T., Yoshimura, K., Takata, K., O'ishi, R., Yamazaki, D., Suzuki, T., Kurogi, M., Kataoka, T., Watanabe, M., and Kimoto, M.: Description and basic evaluation of simulated mean state, internal variability, and climate sensitivity in MIROC6, Geosci. Model Dev., 12, 2727–2765, https://doi.org/10.5194/gmd-12-2727-2019, 2019. a

Trenberth, K. E. and Fasullo, J. T.: Atlantic meridional heat transports computed from balancing Earth's energy locally, Geophys. Res. Lett., 44, 1919–1927, https://doi.org/10.1002/2016GL072475, 2017. a

Voldoire, A., Saint-Martin, D., Sénési, S., Decharme, B., Alias, A., Chevallier, M., Colin, J., Guérémy, J.-F., Michou, M., Moine, M.-P., Nabat, P., Roehrig, R., Salas y Mélia, D., Séférian, R., Valcke, S., Beau, I., Belamari, S., Berthet, S., Cassou, C., Cattiaux, J., Deshayes, J., Douville, H., Ethé, C., Franchistéguy, L., Geoffroy, O., Lévy, C., Madec, G., Meurdesoif, Y., Msadek, R., Ribes, A., Sanchez-Gomez, E., Terray, L., and Waldman, R.: Evaluation of CMIP6 DECK experiments with CNRM-CM6-1, J. Adv. Model. Earth Sy., 11, 2177–2213, https://doi.org/10.1029/2019MS001683, 2019. a

Wang, X. L. and Zwiers, F. W.: Interannual Variability of Precipitation in an Ensemble of AMIP Climate Simulations Conducted with the CCC GCM2, J. Climate, 12, 1322–1335, 1999. a, b

Weijer, W., Cheng, W., Garuba, O. A., Hu, A., and Nadiga, B. T.: CMIP6 Models Predict Significant 21st Century Decline of the Atlantic Meridional Overturning Circulation, Geophys. Res. Lett., 47, e2019GL086075, https://doi.org/10.1029/2019GL086075, 2020. a, b, c, d, e

Yates, F.: Orthogonal Functions and Tests of Significance in the Analysis of Variance, Supplement to the Journal of the Royal Statistical Society, 5, 177–180, https://doi.org/10.2307/2983655, 1938. a

Yip, S., Ferro, C. A. T., Stephenson, D. B., and Hawkins, E.: A Simple, Coherent Framework for Partitioning Uncertainty in Climate Predictions, J. Climate, 24, 4634–4643, https://doi.org/10.1175/2011JCLI4085.1, 2011. a, b, c

Yukimoto, S., Kawai, H., Koshiro, T., Oshima, N., Yoshida, K., Urakawa, S., Tsujino, H., Deushi, M., Tanaka, T., Hosaka, M., Yabu, S., Yoshimura, H., Shindo, E., Mizuta, R., Obata, A., Adachi, Y., and Ishii, M.: The Meteorological Research Institute Earth System Model Version 2.0, MRI-ESM2.0: Description and Basic Evaluation of the Physical Component, J. Meteorol. Soc. Jpn. Ser. II, 97, 931–965, https://doi.org/10.2151/jmsj.2019-051, 2019. a

Zhang, S., Zhou, Z., Peng, P., and Xu, C.: A New Framework for Estimating and Decomposing the Uncertainty of Climate Projections, J. Climate, 37, 365–384, https://doi.org/10.1175/JCLI-D-23-0064.1, 2023. a, b, c, d, e, f, g, h, i, j, k

Zickfeld, K., Eby, M., and Weaver, A. J.: Carbon-cycle feedbacks of changes in the Atlantic meridional overturning circulation under future atmospheric CO₂, Global Biogeochem. Cy., 22, GB3024, https://doi.org/10.1029/2007GB003118, 2008. a

Ziehn, T., Chamberlain, M. A., Law, R. M., Lenton, A., Bodman, R. W., Dix, M., Stevens, L., Wang, Y.-P., and Srbinovsky, J.: The Australian Earth System Model: ACCESS-ESM1.5, Journal of Southern Hemisphere Earth Systems Science, 70, 193–214, https://doi.org/10.1071/ES19035, 2020. a

Zwiers, F.: Interannual variability and predictability in an ensemble of AMIP climate simulations conducted with the CCC GCM2, Clim. Dynam., 12, 825–847, https://doi.org/10.1007/s003820050146, 1996. a, b

Articles

Download

Article (11028 KB)
Full-text XML

Short summary

Using statistical methods and a set of ensemble climate models, we decompose the sources of Atlantic Meridional Overturning Circulation (AMOC) variance. Three distinct phases of physical variability are identified: from 1850 to 1990, internal variability dominates; from 1990 to 2050, dynamical adjustment related to AMOC decline takes over; after 2050, differences between forcing scenarios become dominant. Beyond these physical factors, model variability remains a major source of uncertainty.