the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Observation-based temperature and freshwater noise over the Atlantic Ocean
Henk A. Dijkstra
The ocean is forced at the surface by a heat flux and freshwater flux field from the atmosphere. Short-timescale variability in these fluxes, i.e., noise, can influence long-term ocean variability and might even affect the Atlantic meridional overturning circulation (AMOC). Often this noise is assumed to be Gaussian, but detailed analyses of its statistics appear to be lacking. Here we study the noise characteristics in reanalysis data for two fields that are commonly used to force ocean-only models: evaporation minus precipitation and 2 m air temperature. We construct several noise models for both fields, and a pointwise normal inverse Gaussian distribution model gives the best performance. An analysis of CMIP6 models shows that these models do a reasonable job at representing the standard deviation and skewness of the noise, but the excess kurtosis is more difficult to capture. The pointwise noise model performs better than the CMIP6 models and can be used as forcing in ocean-only models to study, for example, noise-induced transitions of the AMOC.
- Article
(41143 KB) - Full-text XML
- BibTeX
- EndNote
The ocean is forced at the surface by momentum, heat, and freshwater fluxes from the atmosphere. Since the ocean responds relatively slowly to the atmospheric forcing, anomalies in this forcing can be modeled as a noise process (Hasselmann, 1976). This study is motivated by the role of such noise in causing noise-induced transitions in the Atlantic meridional overturning circulation (AMOC). The AMOC has a major influence on the global and Northern Hemispheric climate and has been identified as one of the potential major tipping points in the Earth system (Lenton et al., 2008; McKay et al., 2022). A collapse or strong weakening of the AMOC has major consequences for the climate system by changing, e.g., global temperature patterns (van Westen et al., 2024b), atmospheric circulation (Orihuela-Pinto et al., 2022), Arctic sea ice cover (van Westen et al., 2024b), the global carbon cycle (Zickfeld et al., 2008; Boot et al., 2024), and marine ecosystems (Schmittner, 2005; Boot et al., 2025).
Simple box models have shown that the AMOC can show noise-induced transitions (Castellana et al., 2019; van Westen et al., 2024a) and that probabilities of such transitions can be obtained using rare-event techniques. In these types of studies, the noise is applied only in the freshwater flux and is often assumed to be white for simplicity. Recently, noise-induced transitions have also been studied in an intermediate complexity Earth system model (EMIC; Cini et al., 2024) using rare-event techniques. Ideally, one would want to study the transitions in full-complexity, CMIP6 type, Earth system models (ESMs). However, due to the complexity and cost of these models, it is not yet possible to systematically use these ESMs for these types of studies. Recently, a study did look at AMOC tipping in a 10-member ensemble of the NASA-GISS ESM, showing that under the same forcing some ensemble members simulate an AMOC recovery under future emissions, while others show a consistent weakening (Romanou et al., 2023). However, the AMOC does not show a complete collapse in these ensemble members.
To determine the probability of noise-induced transitions using rare event techniques, one is at the moment restricted to using ocean-only models, and hence the specification of the atmospheric noise is crucial. However, to our knowledge, a detailed study on the properties of the noise in the actual fields relevant in the forcing of ocean models is lacking. Here we focus on noise in the freshwater flux (E−P) and in the 2 m air temperature (T2 m). Noise in the momentum flux related to surface winds might also be important for the AMOC. However, we do not consider this here for two main reasons: the statistical properties of the surface winds have been studied more thoroughly before (Sura, 2003; Monahan, 2004, 2018), and the noise in the momentum flux is less important for simulating noise-induced transitions of the AMOC.
Such a study is also useful to determine whether EMICs and ESMs adequately capture these noise fields. We know that these types of models exhibit sometimes very large biases in their mean state and in variability on a whole range of timescales. For example, T2 m is biased too warm in the CMIP6 models over the Atlantic sector of the Southern Ocean and the eastern South Atlantic, while there is a cold bias over much of the North Atlantic and Arctic oceans (Eyring et al., 2021). The air temperature biases can also be seen in the sea surface temperatures (Zhang et al., 2023), thereby directly affecting the density structure of the ocean. For precipitation there is a consistent double Intertropical Convergence Zone (ITCZ) bias from CMIP3 to CMIP6 models (Tian and Dong, 2020). This means that in the Atlantic, the ITCZ, and therefore bands of high precipitation, extends too much towards the south. Following the double ITCZ bias (Tian and Dong, 2020; Li et al., 2020), there is a strong positive freshwater flux bias north of the Equator and a strong negative bias south of the Equator in the CMIP6 multi-model mean (MMM; Liu et al., 2022). Between 10 and 60° N and the Equator and 35° S, the freshwater flux is typically positively biased in the CMIP6 MMM (Liu et al., 2022). These biases are among the reasons why the AMOC is thought to be too stable in CMIP6 type models (Weijer et al., 2019; van Westen and Dijkstra, 2024).
In this study, we determine the statistical properties of the E−P and T2 m noise based on the ERA5 reanalysis data. We compare this observation-based noise with the noise simulated by coupled CMIP6 ESMs and identify relevant biases. Based on the ERA5 noise, we construct a noise model that can be used to force ocean-only models. This product can be used to study the influence of short-timescale atmospheric variability on long-term ocean variability and eventually to study noise-induced transitions of the AMOC.
2.1 ERA5 reanalysis data
We analyze the noise in E−P and T2 m over the Atlantic Ocean between 60° S and 80° N. For this we use ERA5 reanalysis data (Hersbach et al., 2020), the most recent reanalysis product of the European Center for Medium-Range Weather Forecasts (ECMWF) that replaces the ERA-Interim reanalysis product. The ERA5 product is created by combining both satellite and ground observations with a numerical model used for weather forecasting. For the freshwater flux we determine the net freshwater forcing by taking the sum of the variables “total precipitation” and “evaporation” (i.e., total precipitation + evaporation). Since evaporation is defined as being negative and total precipitation as being positive in ERA5 data, this results in a dataset for E−P where net evaporation is positive and net precipitation negative. The datasets contain monthly data from 1940 to 2022 on a 0.25° rectilinear grid. To determine the noise in both fluxes, we first detrend each grid point by subtracting a 5-year running mean. Next, we deseasonalize the data by subtracting a monthly climatology based on the detrended data. This results in a noisy dataset where each grid point has zero mean and no trend. We analyze the fields by looking at the standard deviation (σ), skewness, and excess kurtosis of the noise, where Gaussian white noise would have zero skewness and zero excess kurtosis.
2.2 CMIP6 models
We compare the noise in the ERA5 data to that found in CMIP6 ESMs. In total, we use 36 different models; note that we use two different realizations from the UKESM-1-0-LL model that are run by two different model groups (i.e., the Met Office Hadley Centre (MOHC) and the National Institute of Meteorological Sciences–Korea Meteorological Administration (NIMS-KMA), respectively). For each model we determine the evaporation minus precipitation using the variables “evspsbl” and “pr”, and we use “tas” for T2 m. We do this for the historical simulations between 1940 and 2014. We first regrid all models to a 1°×1° rectilinear grid. We next compute the noise in the models by following the same methodology as for the ERA5 data; i.e., we detrend and deseasonalize the data. We could also use simulations without forcing, i.e., the piControl simulations. However, for comparison with the ERA5 data it is better to use a similar methodology for both data sources, which includes the detrending in the ERA5 data.
When comparing the CMIP6 data to the ERA5 data, we use the same time period in the ERA5 data as in the CMIP6 data, i.e., 1940 to 2014, and regrid the ERA5 data to a 1° rectilinear grid. Due to its original higher resolution, the land mask in ERA5 captures small islands that are not captured by the CMIP6 land mask. To account for this, we mask out these small islands in both the ERA5 and the CMIP6 data. A full list of the models used and their citations can be found in Appendix A (Table A1).
3.1 ERA5 reanalysis data
Figure1a–c show the standard deviation, skewness, and excess kurtosis in the E−P noise. The highest standard deviation is found north of the Equator in the ITCZ, and other regions with relatively high standard deviations are those where the western boundary currents are present. In the Northern Hemisphere, the strongest negative skewness is found between 10 and 30° N (Fig. 1b). The negative skewness here indicates that the distribution is skewed towards extreme precipitation events, which is partially related to tropical storm activity in this region. In the Southern Hemisphere there is a strong negative skewness in the region of the South Equatorial Current, and there is a small region of moderate positive skewness to the south of this area. The rest of the ocean generally shows a slightly negative or near-zero skewness. The excess kurtosis shows a relatively similar pattern to the skewness but with the opposite sign (Fig. 1c). The strongest positive excess kurtosis is found over the entire latitudinal band 10° S to 30° N. This is also an indication of high extremes, and because of the negative skewness it indicates extreme precipitation events. The rest of the ocean has slightly positive or near-zero excess kurtosis. Due to the non-zero skewness and excess kurtosis in the noise at most grid points, the noise cannot be classified as Gaussian white noise in these grid points.
For T2 m (Fig. 1d–f), the largest standard deviation in the noise is found in the (seasonally) sea-ice-covered regions in the high-latitude North Atlantic (Fig. 1d). The Gulf Stream region also shows a relatively high standard deviation. Regions around the sea ice edge in both the Northern Hemisphere and Southern Hemisphere show a relatively strong negative skewness (Fig. 1e), which means the distributions in these regions are skewed towards more cooling events. The pattern for the skewness in the South Atlantic is relatively patchy, with both slightly negative and slightly positive values. In the North Atlantic, the regions around the trade winds show positive skewness, while the subtropical gyre shows negative skewness. The (seasonally) sea-ice-covered regions show strong negative skewness. For the excess kurtosis (Fig. 1f), most of the Atlantic region shows (strong) positive values, with the strongest signals being present over the sea -ice-covered regions, close to the seasonal sea ice edge (also in the South Atlantic), and in the Gulf of Mexico. The combination of negative skewness and positive excess kurtosis in the sea-ice-covered regions suggests that in these regions strong cooling events can take place, which is likely associated with strong increases in sea ice cover. Just as for the freshwater flux, the excess kurtosis deviates from zero in most regions of the ocean, which means that the noise in T2 m is also unlikely to be Gaussian white noise at most grid points.
To better understand the results, we look in the noise fields for regions with similar distributions. We do this by dividing both the E−P and T2 m noise fields into 12 different clusters (Fig. 2). For this we use a k-means clustering algorithm where we use the standardized standard deviation, skewness, and excess kurtosis as input. The decision for 12 clusters is based on several methods (Fig. A1), i.e., the elbow method, the silhouette score, the gap statistic, and a visual inspection of the clusters performed while the number of clusters is varied. The probability density function, standard deviation, skewness, and excess kurtosis values of the clusters are displayed in Figs. A2–A5
For E−P we find several relatively large clusters. The subpolar regions are divided into two clusters (clusters 1 and 4), where cluster 4 is more poleward. The main difference between the two clusters is the lower standard deviation in the higher-latitude cluster. The high standard deviation region of the ITCZ also clearly stands out as a separate cluster (cluster 6). The subtropical region is divided into nine different clusters. The six clusters that cover the North Equatorial Current and South Equatorial Current stand out with strong positive excess kurtosis and strong negative skewness (clusters 2, 3, 5, 9, 10, and 11). These are regions that experience tropical storms and hurricanes, which are recorded as very strong extreme precipitation events in the E−P noise fields. These extremes can be very local, explaining why six clusters are necessary for this region. The cluster closest to a Gaussian distribution is the cluster in the southeastern subtropical region (cluster 12) with a skewness of 0.16 and an excess kurtosis of 0.90 (Fig. 2b). For all clusters, kurtosis is larger than 1.5 times the square of the skewness (Fig. 2b), which is consistent with multiplicative noise (Sardeshmukh and Sura, 2009).
The 12 clusters for the T2 m noise do not show an overlap with the E−P clusters, and several of them appear to follow the general ocean circulation pattern. For example, cluster 12 is centered around the North Atlantic Current, while cluster 10 is centered around the North Equatorial Current and the North Brazil Current. While for the E−P noise several clusters are necessary for the subtropics, for T2 m several clusters are necessary for regions covered by sea ice or areas adjacent to those regions. Cluster 3 describes the regions in the Labrador and Greenland seas that experience sea ice annually. Clusters 2, 5, and 6 all cover regions close to the sea ice edge. The noise in these regions is likely affected by interannual variability in sea ice extent that leads to relatively strong positive excess kurtosis and relatively strong negative skewness. Two clusters (7 and 12) have near-zero area-weighted skewness and excess kurtosis and are therefore close to a Gaussian distribution (0.00 skewness for both areas and 0.06 and 0.13 for excess kurtosis, respectively; Fig. 2d). Cluster 7 covers the South Atlantic between 30 and 50° S and parts of the eastern North Atlantic between 30 and 60° N. Cluster 12 is the cluster around the North Atlantic Current, but it does show some variability in both skewness and excess kurtosis in the cluster. Just as for the E−P clusters, for all T2 m clusters kurtosis is larger than 1.5 times the square of the skewness (Fig. 2d), which is consistent with multiplicative noise.
3.2 CMIP6 data
In this section we analyze the results for the multi-model mean (MMM) of the CMIP6 models. We determine the MMM at the end of the analysis. For example, this means that we first determine the skewness for each model and then average it over the 2D skewness fields of all the models to create the MMM. Each model has been given the same weight. Results for individual models can be found in Appendix A (Figs. A10 to A15).
The MMM for the noise in the E−P flux does not always represent the amplitude in the statistics of the ERA5 noise well (Fig. 3a–f); however, the spatial patterns are relatively well resolved in the MMM. The standard deviation is underestimated over the entire ocean, with the strongest underestimation in the ITCZ regions and over the western boundary currents (Fig. 3d). The multi-model mean shows a stronger negative skewness over the South Equatorial Current that is also shifted more southward compared to ERA5 noise (Fig. 3e). Furthermore, the positive skewness over the eastern subtropical region is not captured by the CMIP6 MMM. The excess kurtosis is also positively biased in the CMIP6 MMM over the South Equatorial Current (Fig. 3f). In the region between 10° S and 25° N there is a patchy response, where most regions see an underestimation of the excess kurtosis compared to the ERA5 noise (red colors) while some regions see an overestimation of the excess kurtosis compared to the ERA5 noise (blue colors). The CMIP6 MMM does capture the spatial pattern and amplitude of the standard deviation of the noise in T2 m well compared to the ERA5 noise (Fig. 4a, b). The spatial pattern of the skewness is captured reasonably well in the Northern Hemisphere, but the amplitude is typically smaller than in the ERA5 noise (Fig. 4c, d). In the Southern Hemisphere the CMIP6 MMM shows mostly slightly positive skewness, whereas the ERA5 noise mostly shows slightly negative skewness. The absolute differences are not that large, but there is an important difference in sign. For excess kurtosis, the spatial pattern is also relatively similar in the CMIP6 MMM compared to the ERA5 noise; however, the regions in the ERA5 noise with slightly negative kurtosis are not captured by the CMIP6 MMM (Fig. 4e, f). The amplitude of the excess kurtosis, however, is not as well resolved as the spatial pattern. Most regions in the CMIP6 MMM show an underestimation of the excess kurtosis compared to ERA5.
The CMIP6 MMM appears to do a decent job in capturing the observation-based noise field of both E−P and T2 m. However, there is still a large spread in the model ensemble, meaning not all models are able to capture these noise fields adequately. Our aim in this section is to develop a statistical model of the noise in both E−P and T2 m that can be used as forcing in ocean models. We have tried several methods to construct such a model, and we will present four of these methods below. All of these models are based on the ERA5 reanalysis data.
Three of the methods are based on a principal component analysis (PCA) in which we base the noise model on the principal components (PCs) and corresponding empirical orthogonal functions (EOFs). The PCA is performed on the noise and is weighted to account for the grid cell areas. For all three methods we use the number of EOFs necessary to explain 90 % of the variance in the noise (i.e., 289 EOFs and PCs for E−P and 53 for T2 m). For the first two methods we directly sample (with replacement) from the PCs. The first method we name PC(1), as we select one random time step (i.e., month) for all PCs. For the PC(1) method we uniformly sample one integer from 1 to the length of the PCs, i.e., 996. We apply this integer for all PCs. For example, if our integer is 7, we sample the seventh month of each PC to construct the noise model. Using this method we therefore have in total 996 different realizations to sample from, meaning this method is not strictly stochastic. For the second method (PC(N)), we sample a random time step out of the PCs but a different time step for each PC. For the third method (PC(NIG)), we fit a normal inverse Gaussian (NIG) distribution to the individual PCs and next sample randomly from these distributions in a similar fashion as in the PC(N) method. The NIG distribution used in the PC(NIG) model has a probability density function determined by
where α is a tail heaviness parameter, β is an asymmetry parameter, μ regulates the shift in the distribution, and δ is the scale of the distribution. K1 represents a modified Bessel function of the second kind.
We choose to use three different PCA-based models. The PC(1) model is used to test whether the PCAs can in fact capture the statistics of the noise well. However, since this method is not fully stochastic, we also chose to use other models. The PC(N) model has a very similar setup, but it has a larger number of values to sample from than the PC(1) method. Since the PC(N) model also has a discrete number of values to sample from, we also used the PC(NIG) model, which does not have this problem. For all three methods, noise fields are constructed by multiplying the value sampled from the PCs with the spatial patterns captured by the EOFs and next summing over the number of PCs and EOFs. Results for the PC(1) and PC(N) models can be found in Appendix A (Figs. A6 to A9). The PC(NIG) model shows a good agreement with the noise diagnosed from the ERA5 data for the spatial patterns of the standard deviation (Fig. 5a, d), but it is unable to capture the spatial patterns of the skewness (Fig. 5b, e) and excess kurtosis (Fig. 5c, f). The standard deviation in the noise is captured reasonably well (Fig. 5d). Looking at the skewness (Fig. 5e) and the excess kurtosis (Fig. 5f), we can see that this model is unable to represent these metrics correctly since the PC(NIG) model simulates near-zero skewness and excess kurtosis. Just as for the E−P flux, the PC(NIG) model represents the spatial pattern of the noise in the T2 m well in the standard deviation (Fig. 6a, d) but not in the skewness (Fig. 6b, e) and excess kurtosis (Fig. 6c, f). The skewness and excess kurtosis are again near zero in all regions, with the exception of the excess kurtosis in the sea-ice-covered regions in the North Atlantic.
Since the models using the PCA show difficulty in representing the ERA5 noise, we have also fitted several statistical distributions directly to the noise for each grid cell as a fourth method. For both the E−P and T2 m, the normal inverse Gaussian (NIG) distribution appeared to be the best fit. Note that we have tried several other distributions as well, all of which performed worse than the NIG distribution. Other tested distributions mostly fail to capture the excess kurtosis well. A summary of the performance of a selection of the tested models can be seen in Fig. A18.
We have tested the goodness of fit with several metrics (Fig. A16). Firstly, we performed an Anderson–Darling normality test. We find that for the E−P noise only 8 % of the grid points pass this test (p<0.05) (Fig. A16b). For T2 m this is higher, i.e., 42 % (Fig. A16g). Next we have tested whether the NIG provides a better fit than a normal distribution for each grid point. For this we use the following equation:
where N is the number of bins used (i.e.m 50); fi the probability density function of the time series per grid point; and mi the fitted probability density function, which is fitted to either an NIG distribution or a normal distribution. We compute χn for both an NIG and Gaussian fit and compare the two. For 98 % of the grid points the NIG fit performs better (i.e., χn is smaller for the NIG fit) for the E−P noise (Fig. A16a), and this is the case for 94 % of the grid points for the T2 m noise (Fig. A16f). To test whether the NIG model is a good fit, we apply a Kolmogorov–Smirnov test. For the E−P only 27 grid points do not pass this test, while for T2 m there are 8 grid points that do not pass (out of 138 788 ocean grid points) (p<0.05). However, the Kolmogorov–Smirnov test is not well suited for heavy-tailed distributions such as those found in our data. Ideally, we would like to perform an Anderson–Darling test (or a similar test) as a goodness-of-fit test to check whether the NIG fits are statistically significant, but this is too computationally expensive. For the Anderson–Darling test we need to compute critical values, which is computationally demanding. Since these critical values are dependent on the parameters of the NIG distribution in Eq. (1) (i.e., α, β, μ, and δ), we would have to repeat the computations for each grid point, which leads to high computational cost. As an alternative, we computed the Akaike information criterion (AIC) and Bayesian information criterion (BIC) (Fig. A16c, h) scores for both the Gaussian and the NIG fits. For the E−P noise, the NIG fit is a better fit compared to a Gaussian fit for 98 % and 87 % of the grid points for the AIC and BIC metrics, respectively. For the T2 m noise this is 62 % and 35 %, respectively. Lastly, we test the significance of the skewness and kurtosis of the E−P and T2 m noise. We do this by fitting an AR(1) model to the data and subsequently generate sampling statistics from this model. The fitted (Gaussian) AR(1) model fails to represent the skewness and excess kurtosis in the E−P noise (p<0.05) for 93 % and 85 % of the grid points (Fig. A16d, e) and 38 % and 53 % of the grid points for the T2 m skewness and excess kurtosis (Fig. A16i, j).
Based on this collection of tests, we think that for most of the grid points the NIG model provides a good fit to the data. Furthermore, following the Anderson–Darling test on normality and the fitted AR(1) model, most of the E−P noise is non-Gaussian, and to a lesser degree this also applies to the T2 m noise. The grid points for the E−P noise that are likely Gaussian are located in clusters 4 and 11, which are indeed clusters with skewness and excess kurtosis close to 0 (Fig. 2a, b). For the T2 m noise, the grid points that show Gaussian behavior are mainly located in the sea-ice-free subpolar ocean. These grid points mainly belong to clusters 1, 7, and 12, which are also the clusters with approximately zero skewness and near-zero excess kurtosis (Fig. 2c, d).
Using the fitted NIG distribution, we can generate a fully stochastic noise field for each month using the four parameters for each grid cell. The model shows a good agreement with the noise diagnosed from the ERA5 data for the spatial patterns of the standard deviation (Fig. 7a, d), skewness (Fig. 7b, e), and excess kurtosis (Fig. 7c, f). The standard deviation in the noise is captured especially well, with only small deviations present between 10° S and 25° N (Fig. 7d). The NIG distribution underestimates the regions with strong negative skewness over the latitude bands 0° N to 10° S and 10 to 25° N (Fig. 7e). For excess kurtosis we see a similar underestimation in these regions, meaning that the excess kurtosis is higher in the ERA5 data (Fig. 7f). However, the region between these two latitude bands shows a much higher excess kurtosis in the NIG model compared to the ERA5 noise.
Just as for the E−P flux, the NIG model represents the spatial pattern of the noise in the T2 m well in the standard deviation (Fig. 8a, d), skewness (Fig. 8b, e), and excess kurtosis (Fig. 8c, f). Here the standard deviation is also captured very well by the NIG model, with only very small differences in the sea-ice-covered regions (Fig. 8d). The same applies to the skewness, where we also see some deviations in these same regions (Fig. 8e). For most regions the NIG model captures the excess kurtosis quite well (Fig. 8f). However, for regions with a high excess kurtosis in the ERA5 noise, such as the sea-ice-covered regions and the Gulf of Mexico, the NIG model strongly overestimates the excess kurtosis.
In this section, we compare the noise models and the CMIP6 models with the ERA5 noise using Taylor diagrams (Fig. 9) to provide a more in-depth discussion on the performance of the individual models. We compare how well the different models represent the standard deviation (Fig. 9a, b), the skewness (Fig. 9c, d), and the excess kurtosis (Fig. 9e, f) found in the ERA5 noise. Taylor diagrams are a good tool to better understand the performance of all the different models against the observation-based noise. In a Taylor diagram, three metrics are displayed: (1) the spatial correlation coefficient, (2) the variation in the data as represented by the standard deviation, and (3) the root-mean-square error (RMSE) between the observation-based data and the model. The spatial correlation coefficient is displayed on the outer circle and the straight dotted lines in Fig. 9, lines of constant correlation connect the origin with the outer circle. The standard deviation is displayed on both the x axis and y axis. Lines of equal standard deviation are circles with their center in the origin of the plot. The dashed black line in Fig. 9 displays the standard deviation in the observation-based noise. The RMSE is displayed using the black contour circles that have their center in the observation-based noise marker. The location of each of the markers therefore provides information about three important metrics and therefore the performance of the individual models compared to the observation-based noise. Ideally, a model will be in the lower part of the graph, since this indicates high spatial correlation, or close to the dashed black lines, since this indicates similar variability compared to the observation-based noise, and by combining these two the RMSE will consequently also be low. All three metrics are determined using weights considering the area of each grid cell.
For the E−P noise models, the PC(1) model performs best for the standard deviation, skewness, and excess kurtosis, while the PC(NIG) performs the worst (Fig. 9a, c and e). The PC(N) model performs equally well for the spatial correlation, but it strongly underestimates the variability in skewness and excess kurtosis. The NIG model has a lower spatial correlation, but it is much better in capturing the variability in all three statistical moments. All models have trouble representing the excess kurtosis in the latitudinal bands 10° S to 25° N. The PC(1) and PC(N) models overestimate the excess kurtosis in almost the entire region, whereas the NIG model underestimates the excess kurtosis over the ITCZ region and overestimates it in the other regions. This is because this region can experience very extreme rainfall episodes with a very low number of occurrences, which severely affects the excess kurtosis diagnosed from the ERA5 noise, as was also found with the clustering analysis (Fig. 2). Because these episodes only occur a few times in the time series, these are not represented well by the NIG model and are also difficult to represent in the PC(1) model.
We can explain the failure of the PC(NIG) model to accurately resemble the observation-based skewness and excess kurtosis using the central limit theorem. This theorem states that when summing over random variables, the distribution of this sum converges towards a Gaussian distribution, which by definition has zero skewness and excess kurtosis. In these PC models we sample values from the PCs, multiply them with the EOFs and sum the results, which following central limit theorem converge towards a Gaussian distribution. The same applies to the PC(N) model, which performs well for spatial correlation skill but (based on a time series of 10 000 realizations) underestimates the amplitude of the skewness and excess kurtosis. This underestimation increases when longer time series are used, and the model slowly converges to a Gaussian one in these cases. Methods based on a PCA, with the exception of the PC(1) model, will therefore be unable to represent the skewness and excess kurtosis in the observation-based noise. An alternative explanation as to why the PC-based models fail to capture the skewness and excess kurtosis is that the PCs might be (nonlinearly) dependent on each other. To test this, we have calculated the distance correlation (Székely et al., 2007) between the PCs, including whether the distance correlation is significant (p value <0.05) based on a permutation test of n=1000 (Fig. A17). For both the E−P and T2 m PCs, around 5 % of the possible PC combinations experience a significant dependence. However, the strongest distance correlation is only 0.14 for the PCs corresponding to the E−P noise and 0.11 for the PCs corresponding to the T2 m noise, meaning there is at best a very weak dependence between the PCs. We therefore do not expect that the weak nonlinear dependence between some of the PCs is the reason why the PC-based models fail, but we instead expect that the explanation mentioned before, i.e., the central limit theorem, is the main reason.
For T2 m, Fig. 9b, d, and f show that the NIG and PC(1) models consistently perform the best of all the models. All models capture the spatial pattern in the standard deviation of the noise as shown by the near-unity spatial correlation coefficient; however, the PC(N) and PC(NIG) models both overestimate the variability in the standard deviation of the noise as shown by the high RMSE and larger standard deviation (Fig. 9b). The spatial pattern of the skewness is captured reasonably well by the NIG, PC(1), and PC(N) models but not by the PC(NIG) model (Fig. 9d). The PC(N) shows a stronger underestimation of the variability compared to the PC(1) and NIG models. For the excess kurtosis a similar conclusion can be drawn, except that the NIG model strongly overestimates the variability (Fig. 9f). The worse performance for excess kurtosis can be explained by the overestimation of the sea-ice-covered regions and the Gulf of Mexico by the NIG model. In these regions, the distribution of the ERA5 noise has a relatively broad and flat peak or sometimes a slightly bimodal peak. This is the reason the NIG fit does not perform very well in these regions. Similar to the E−P noise, the PC(N) and PC(NIG) models are unable to explain the variability in the skewness and excess kurtosis as explained above.
As discussed in Sect. 3.2, the CMIP6 MMM captures the ERA5 E−P noise reasonably well, although the performance decreases for the higher statistical moments. This is likely related to strong biases over the South Equatorial Current where the skewness and the excess kurtosis are too negative and too positive, respectively, in the CMIP6 MMM compared to the ERA5 noise. This is potentially related to the double ITCZ bias present in most CMIP6 models (Tian and Dong, 2020). The latitudinal extent of the ITCZ is too southward in many models, which also causes a shift in the higher-order statistical moments in this region, resulting in relatively large biases. From Fig. 9 we see that the individual models that consistently perform the best are CESM2-WACCM (30), CESM2 (31), and NorESM2-MM (32) (except for excess kurtosis, where NorESM2-MM has quite a large RMSE). What these models have in common is that their atmospheric model is the Community Atmosphere Model 6 (CAM6) or, in the case of CESM2-WACCM, is based on CAM6 and run on a nominal 1° horizontal resolution. This suggests that this atmospheric model is able to capture the observation-based noise reasonably well.
Liu et al. (2022) also found that these models perform relatively well for precipitation biases, which they suggest is due to the specific two-moment prognostic cloud microphysics scheme (Gettelman and Morrison, 2015) used in CAM6. TaiESM1, which uses CAM5 and an earlier version of the prognostic cloud microphysics scheme, also performs relatively well. There are also two other CESM2 models that use a form of CAM6, i.e., CESM2-WACCM-FV2 (27) and CESM2-FV2 (29). These models perform less well compared to the other three, which might be explained by the fact that these models are run at a lower (i.e., 2°) resolution. The CMIP6 MMM has the same biases in the latitudinal band between 10° S and 25° N, although these biases are less strong in some regions. This is probably because the high-rainfall episodes in the ERA5 data are smoothed when regridded to a 1° grid, which is done before comparing it to the CMIP6 models and MMM.
For T2 m the CMIP6 MMM also performs reasonably well compared to the ERA5 noise, and the performance is lower for higher statistical moments as for E−P noise. The strongest biases (both positive and negative) in excess kurtosis are found over the sea-ice-covered regions. This might be related to biases in sea ice cover in the CMIP6 models (Watts et al., 2021). For the individual models it is more difficult to point towards models that consistently perform well. The UKESM1-0-LL (22) model simulations performed by the MOHC show the most consistency. Other models that perform relatively well in two out of three statistical moments are CESM2-FV2 (29) and CAS-ESM2-0 (8). Interestingly, the UKESM1-0-LL (33) simulations performed by the NIMS-KMA are among the worst-performing models. The only differences between the two models are the computer on which the model is run and the initial conditions. This suggests that there is also a dependency on initial conditions in the performance of the CMIP6 models.
Except for the excess kurtosis in the T2 m noise, the NIG model outperforms the individual CMIP6 models and MMM, which is due to the overestimation of the excess kurtosis over sea-ice-covered regions by the NIG model. The PC(NIG) model only outperforms the CMIP6 MMM for the standard deviation and is very poor for the skewness and excess kurtosis. The PC(1) model outperforms the CMIP6 models and MMM for the skewness and excess kurtosis. The PC(N) model outperforms the CMIP6 MMM for all moments with respect to the spatial correlation and is very similar to the CMIP6 MMM and the best CMIP6 models for RMSE. This means that we can capture “realistic” noise better with the statistical model than the fully coupled Earth system models. Among the PC-based models, the PC(1) model performs best and is similar to the NIG model, but this model is not fully stochastic like the other noise models.
In this study we have analyzed ERA5 evaporation minus precipitation (E−P) and 2 m air temperature (T2 m) fields to determine what observation-based noise is in these variables. We find that due to non-zero skewness and excess kurtosis, the noise in both variables typically cannot be classified as white, and studies that assume white noise in either of the two variables might not resolve the response of the ocean to atmospheric noise realistically. We have analyzed the noise in 36 different CMIP6 Earth system models and the CMIP6 multi-model mean (MMM) and compared these models to the ERA5 noise. There is quite a spread in the performance of the CMIP6 models, but the MMM performs relatively well compared to the individual models. Typically, the models perform best for standard deviation and worst for excess kurtosis. Furthermore, we have fitted a normal inverse Gaussian (NIG) distribution to the ERA5 noise of both variables. This results in a stochastic noise model that can be used as input in ocean general circulation models (OGCMs). We have shown that the NIG model captures the standard deviation, skewness, and excess kurtosis of the ERA5 noise reasonably well in both the E−P and T2 m data, with the exception of the excess kurtosis in the T2 m noise, where the NIG model strongly overestimates the positive excess kurtosis in sea-ice-covered regions. For most metrics and statistics, the NIG model performs better than the individual CMIP6 models and CMIP6 MMM.
Previous studies have looked into biases in CMIP6 models. However, these studies typically look into the biases in the mean state or the seasonality of the variables. Here, we have specifically looked at variability up to interannual timescales, the distribution, and related metrics (i.e., standard deviation, skewness and excess kurtosis). We found that biases in these quantities are still to some extent connected to biases in the mean state. For example, the biases in skewness and excess kurtosis in the E−P noise in the South Atlantic are likely to be related to the double ITCZ bias described in earlier studies (Tian and Dong, 2020; Li et al., 2020). Differences in the excess kurtosis in sea-ice-covered regions can also be related to the biases in Arctic sea ice thickness and cover (Watts et al., 2021).
In the development of a noise model, the best variant turned out to be a pointwise statistical fit of a normal inverse Gaussian (NIG) distribution. As shown in Sect. 3, the model performs relatively well at most grid points but can still deviate quite a bit (especially for excess kurtosis). One major drawback of fitting a statistical distribution to the data in a pointwise manner is that we lose spatially coherent structures and potentially auto-correlation in the noise for the individual noise fields (i.e., one random realization). We have constructed alternative models based on a principal component analysis (PCA) where the corresponding empirical orthogonal functions (EOFs) capture the spatial structures. The PCs contain nonlinear effects, but these are difficult to extract statistically. The PC-based models underestimate the skewness and excess kurtosis in the noise fields because of the central limit theorem or (for the PC(1) model) are not fully stochastic. Therefore, we eventually decided to fit a model to the data that can relatively accurately represent the standard deviation, skewness, and excess kurtosis in the ERA5 noise. However, when the spatially coherent structures captured by the EOFs are deemed more important than an accurate representation of the skewness and kurtosis of the noise, PC-based models can be used. The loss of spatially coherent structures can be important when studying noise-induced transitions of the AMOC. Noise that is spatially coherent influences larger areas of ocean. This could, for example, mean that a freshening of the surface ocean could happen over a larger area of the ocean and therefore might be more efficient in inhibiting deep convection in the North Atlantic. Whether it is actually important should be tested in an ocean model; however, such tests are outside the scope of this study.
Similar studies that look into the characteristics of E−P and T2 m noise are sparse. In Sura and Sardeshmukh (2008), they investigate the non-Gaussianity of daily sea surface temperature (SST) variability. The timescales assessed in Sura and Sardeshmukh (2008) are faster (i.e., daily versus monthly), and they look at SSTs, whereas we look at air temperatures. However, relatively similar results are achieved in our study compared to Sura and Sardeshmukh (2008). Skewness in daily SST variability is typically negative in the Atlantic Ocean, whereas the excess kurtosis is mostly positive, similar to what we find for the air temperature. They relate this to multiplicative noise in mixed-layer dynamics. However, they make the assumption that daily fluctuations in air temperature are Gaussian. Our study shows that this is not the case over most of the ocean (at least on monthly timescales). Whether the multiplicative noise signal we find in the T2 m noise originates from SST variability, atmospheric dynamics, or a combination of the two is left for further study.
To conclude, we have provided an analysis of observation-based noise from ERA5 reanalysis data. Based on this realistic noise, we have constructed a noise model based on a normal inverse Gaussian distribution fit to the ERA5 noise. This product is made publicly available in the repository related to this paper (Boot and Dijkstra, 2024). The noise model can, for example, be used as a forcing on ocean models to study noise-induced transitions of the AMOC under realistic noise forcing.
ERA5 data can be downloaded from the Copernicus Climate Data Store (CDS): https://doi.org/10.24381/cds.f17050d7 (Hersbach et al., 2023). CMIP6 data can be downloaded from the Earth System Grid Federation (ESGF) or using the scripts in the repository https://doi.org/10.5281/zenodo.14224062 (Boot and Dijkstra, 2024). Directions as to which exact data need to be downloaded and all scripts used for analyses and making the figures can be found at https://doi.org/10.5281/zenodo.14224062 (Boot and Dijkstra, 2024). A script that contains the noise models can also be found at the abovementioned DOI.
AAB and HAD conceptualized the study. AAB acquired the results. Both authors contributed to writing the manuscript.
The contact author has declared that none of the authors has any competing interests.
Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.
This research has been supported by the European Research Council through the ERC-AdG project TAOC (principal investigator Henk A. Dijkstra; project no. 101055096).
This paper was edited by Christian Franzke and reviewed by two anonymous referees.
Bentsen, M., Oliviè, D. J. L., Seland, Ø., Toniazzo, T., Gjermundsen, A., Graff, L. S., Debernard, J. B., Gupta, A. K., He, Y., Kirkevag, A., Schwinger, J., Tjiputra, J., Aas, K. S., Bethke, I., Fan, Y., Griesfeller, J., Grini, A., Guo, C., Ilicak, M., Karset, I. H. H., Landgren, O. A., Liakka, J., Moseid, K. O., Nummelin, A., Spensberger, C., Tang, H., Zhang, Z., Heinze, C., Iversen, T., and Schulz, M.: NCC NorESM2-MM model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.8040, 2019. a
Boot, A. A. and Dijkstra, H. A.: ESD-noise-2024, Zenodo [code], https://doi.org/10.5281/zenodo.14224062, 2024. a, b, c
Boot, A. A., von der Heydt, A. S., and Dijkstra, H. A.: Response of atmospheric pCO2 to a strong AMOC weakening under low and high emission scenarios, Clim. Dynam., 62, 7559–7574, https://doi.org/10.1007/s00382-024-07295-y, 2024. a
Boot, A. A., Steenbeek, J., Coll, M., von der Heydt, A. S., and Dijkstra, H. A.: Global marine ecosystem response to a strong AMOC weakening under low and high future emission scenarios, Earth's Future, 13, e2024EF004741, https://doi.org/10.1029/2024EF004741, 2025. a
Byun, Y.-H.: NIMS-KMA UKESM1.0-LL model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.8379, 2020. a
Cao, J. and Wang, B.: NUIST NESMv3 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.8769, 2019. a
Castellana, D., Baars, S., Wubs, F. W., and Dijkstra, H. A.: Transition Probabilities of Noise-induced Transitions of the Atlantic Ocean Circulation, Sci. Rep., 9, 20284, https://doi.org/10.1038/s41598-019-56435-6, 2019. a
Chai, Z.: CAS CAS-ESM1.0 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.3353, 2020. a
Choudhury, A. D., Raghavan, K., Gopinathan, P. A., Narayanasetti, S., Singh, M., Panickal, S., and Modi, A.: CCCR-IITM IITM-ESM model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.3708, 2019. a
Cini, M., Zappa, G., Ragone, F., and Corti, S.: Simulating AMOC tipping driven by internal climate variability with a rare event algorithm, npj Climate and Atmos. Sci., 7, 31, https://doi.org/10.1038/s41612-024-00568-7, 2024. a
Danabasoglu, G.: NCAR CESM2 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.7627, 2019a. a
Danabasoglu, G.: NCAR CESM2-WACCM-FV2 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.11298, 2019b. a
Danabasoglu, G.: NCAR CESM2-WACCM model output prepared for CMIP6 CMIP historical, https://doi.org/10.22033/ESGF/CMIP6.10071, 2019c. a
Danabasoglu, G.: NCAR CESM2-FV2 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.11297, 2019d. a
Danek, C., Shi, X., Stepanek, C., Yang, H., Barbi, D., Hegewald, J., and Lohmann, G.: AWI AWI-ESM1.1LR model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.9328, 2020. a
Dix, M., Bi, D., Dobrohotoff, P., Fiedler, R., Harman, I., Law, R., Mackallah, C., Marsland, S., O'Farrell, S., Rashid, H., Srbinovsky, J., Sullivan, A., Trenham, C., Vohralik, P., Watterson, I., Williams, G., Woodhouse, M., Bodman, R., Dias, F. B., Domingues, C. M., Hannah, N., Heerdegen, A., Savita, A., Wales, S., Allen, C., Druken, K., Evans, B., Richards, C., Ridzwan, S. M., Roberts, D., Smillie, J., Snow, K., Ward, M., and Yang, R.: CSIRO-ARCCSS ACCESS-CM2 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.4271, 2019. a
Eyring, V., Gillett, N. P., Achuta Rao, K. M., Barimalala, R., Barreiro Parrillo, M., Bellouin, N., Cassou, C., Durack, P. J., Kosaka, Y., McGregor, S., Min, S., Morgenstern, O., and Sun, Y.: Human Influence on the Climate System, in: Climate Change 2021: The Physical Science Basis, Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Masson-Delmotte, V., Zhai, P., Pirani, A., Connors, S. L., Péan, C., Berger, S., Caud, N., Chen, Y., Goldfarb, L., Gomis, M. I., Huang, M., Leitzell, K., Lonnoy, E., Matthews, J. B. R., Maycock, T. K., Waterfield, T., Yelekçi, O., Yu, R., and Zhou, B., Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, 423–552, https://doi.org/10.1017/9781009157896.005, 2021. a
Gettelman, A. and Morrison, H.: Advanced Two-Moment Bulk Microphysics for Global Models. Part I: Off-Line Tests and Comparison with Other Schemes, J. Climate, 28, 1268–1287, https://doi.org/10.1175/JCLI-D-14-00102.1, 2015. a
Hajima, T., Abe, M., Arakawa, O., Suzuki, T., Komuro, Y., Ogura, T., Ogochi, K., Watanabe, M., Yamamoto, A., Tatebe, H., Noguchi, M. A., Ohgaito, R., Ito, A., Yamazaki, D., Ito, A., Takata, K., Watanabe, S., Kawamiya, M., and Tachiiri, K.: MIROC MIROC-ES2L model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.5602, 2019. a
Hasselmann, K.: Stochastic climate models Part I. Theory, Tellus, 28, 473–485, https://doi.org/10.1111/j.2153-3490.1976.tb00696.x, 1976. a
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., Chiara, G. D., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.-N.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, https://doi.org/10.1002/qj.3803, 2020. a
Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., and Thépaut, J.-N.: ERA5 monthly averaged data on single levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], https://doi.org/10.24381/cds.f17050d7, 2023. a
Lee, W.-L. and Liang, H.-C.: AS-RCEC TaiESM1.0 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.9755, 2020. a
Lenton, T. M., Held, H., Kriegler, E., Hall, J. W., Lucht, W., Rahmstorf, S., and Schellnhuber, H. J.: Tipping elements in the Earth's climate system, P. Natl. Acad. Sci. USA, 105, 1786–1793, https://doi.org/10.1073/pnas.0705414105, 2008. a
Li, J.-L. F., Xu, K.-M., Richardson, M., Lee, W.-L., Jiang, J. H., Yu, J.-Y., Wang, Y.-H., Fetzer, E., Wang, L.-C., Stephens, G., and Liang, H.-C.: Annual and seasonal mean tropical and subtropical precipitation bias in CMIP5 and CMIP6 models, Environ. Res. Lett., 15, 124068, https://doi.org/10.1088/1748-9326/abc7dd, 2020. a, b
Li, L.: CAS FGOALS-g3 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.3356, 2019. a
Liu, Y., Cheng, L., Pan, Y., Tan, Z., Abraham, J., Zhang, B., Zhu, J., and Song, J.: How Well Do CMIP6 and CMIP5 Models Simulate the Climatological Seasonal Variations in Ocean Salinity?, Adv. Atmos. Sci., 39, 1650–1672, https://doi.org/10.1007/s00376-022-1381-2, 2022. a, b, c
Lovato, T. and Peano, D.: CMCC CMCC-CM2-SR5 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.3825, 2020. a
Lovato, T., Peano, D., and Butenschön, M.: CMCC CMCC-ESM2 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.13195, 2021. a
McKay, D. I. A., Staal, A., Abrams, J. F., Winkelmann, R., Sakschewski, B., Loriani, S., Fetzer, I., Cornell, S. E., Rockström, J., and Lenton, T. M.: Exceeding 1.5°C global warming could trigger multiple climate tipping points, Science, 377, eabn7950, https://doi.org/10.1126/science.abn7950, 2022. a
Monahan, A. H.: A Simple Model for the Skewness of Global Sea Surface Winds, J. Atmos. Sci., 61, 2037–2049, https://doi.org/10.1175/1520-0469(2004)061<2037:ASMFTS>2.0.CO;2, 2004. a
Monahan, A. H.: Temporal Filtering Enhances the Skewness of Sea Surface Winds, J. Climate, 31, 5695–5706, https://doi.org/10.1175/JCLI-D-17-0814.1, 2018. a
NASA/GISS: NASA-GISS GISS-E2-1-G-CC model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.11762, 2019. a
NASA/GISS: NASA-GISS GISS-E2.1H model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.7128, 2019a. a
NASA/GISS: NASA-GISS GISS-E2.2H model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.15871, 2019b. a
Neubauer, D., Ferrachat, S., Drian, C. S.-L., Stoll, J., Folini, D. S., Tegen, I., Wieners, K.-H., Mauritsen, T., Stemmler, I., Barthel, S., Bey, I., Daskalakis, N., Heinold, B., Kokkola, H., Partridge, D., Rast, S., Schmidt, H., Schutgens, N., Stanelle, T., Stier, P., Watson-Parris, D., and Lohmann, U.: HAMMOZ-Consortium MPI-ESM1.2-HAM model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.5016, 2019. a
Orihuela-Pinto, B., England, M. H., and Taschetto, A. S.: Interbasin and interhemispheric impacts of a collapsed Atlantic Overturning Circulation, Nat. Clim. Change, 12, 558–565, https://doi.org/10.1038/s41558-022-01380-y, 2022. a
Park, S. and Shin, J.: SNU SAM0-UNICON model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.7789, 2019. a
Ridley, J., Menary, M., Kuhlbrodt, T., Andrews, M., and Andrews, T.: MOHC HadGEM3-GC31-LL model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.6109, 2019a. a
Ridley, J., Menary, M., Kuhlbrodt, T., Andrews, M., and Andrews, T.: MOHC HadGEM3-GC31-MM model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.6112, 2019b. a
Romanou, A., Rind, D., Jonas, J., Miller, R., Kelley, M., Russell, G., Orbe, C., Nazarenko, L., Latto, R., and Schmidt, G. A.: Stochastic Bifurcation of the North Atlantic Circulation under a Midrange Future Climate Scenario with the NASA-GISS ModelE, J. Climate, 36, 6141–6161, https://doi.org/10.1175/JCLI-D-22-0536.1, 2023. a
Sardeshmukh, P. D. and Sura, P.: Reconciling Non-Gaussian Climate Statistics with Linear Dynamics, J. Climate, 22, 1193–1207, https://doi.org/10.1175/2008JCLI2358.1, 2009. a
Schmittner, A.: Decline of the marine ecosystem caused by a reduction in the Atlantic overturning circulation, Nature, 434, 628–633, https://doi.org/10.1038/nature03476, 2005. a
Scoccimarro, E., Bellucci, A., and Peano, D.: CMCC CMCC-CM2-HR4 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.3823, 2020. a
Semmler, T., Danilov, S., Rackow, T., Sidorenko, D., Barbi, D., Hegewald, J., Sein, D., Wang, Q., and Jung, T.: AWI AWI-CM1.1MR model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.2686, 2018. a
Song, Z., Qiao, F., Bao, Y., Shu, Q., Song, Y., and Yang, X.: FIO-QLNM FIO-ESM2.0 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.9199, 2019. a
Stouffer, R.: UA MCM-UA-1-0 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.8888, 2019. a
Sura, P.: Stochastic Analysis of Southern and Pacific Ocean Sea Surface Winds, J. Atmos. Sci., 60, 654–666, https://doi.org/10.1175/1520-0469(2003)060<0654:SAOSAP>2.0.CO;2, 2003. a
Sura, P. and Sardeshmukh, P. D.: A Global View of Non-Gaussian SST Variability, J. Phys. Oceanogr., 38, 639–647, https://doi.org/10.1175/2007JPO3761.1, 2008. a, b, c
Swart, N. C., Cole, J. N. S., Kharin, V. V., Lazare, M., Scinocca, J. F., Gillett, N. P., Anstey, J., Arora, V., Christian, J. R., Jiao, Y., Lee, W. G., Majaess, F., Saenko, O. A., Seiler, C., Seinen, C., Shao, A., Solheim, L., von Salzen, K., Yang, D., Winter, B., and Sigmond, M.: CCCma CanESM5-CanOE model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.10260, 2019a. a
Swart, N. C., Cole, J. N. S., Kharin, V. V., Lazare, M., Scinocca, J. F., Gillett, N. P., Anstey, J., Arora, V., Christian, J. R., Jiao, Y., Lee, W. G., Majaess, F., Saenko, O. A., Seiler, C., Seinen, C., Shao, A., Solheim, L., von Salzen, K., Yang, D., Winter, B., and Sigmond, M.: CCCma CanESM5 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.3610, 2019b. a
Székely, G. J., Rizzo, M. L., and Bakirov, N. K.: Measuring and testing dependence by correlation of distances, Ann. Stat., 35, 2769–2794, https://doi.org/10.1214/009053607000000505, 2007. a
Tang, Y., Rumbold, S., Ellis, R., Kelley, D., Mulcahy, J., Sellar, A., Walton, J., and Jones, C.: MOHC UKESM1.0-LL model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.6113, 2019. a
Tatebe, H. and Watanabe, M.: MIROC MIROC6 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.5603, 2018. a
Tian, B. and Dong, X.: The Double-ITCZ Bias in CMIP3, CMIP5, and CMIP6 Models Based on Annual Mean Precipitation, Geophys. Res. Lett., 47, e2020GL087232, https://doi.org/10.1029/2020GL087232, 2020. a, b, c, d
van Westen, R. M. and Dijkstra, H. A.: Persistent climate model biases in the Atlantic Ocean's freshwater transport, Ocean Sci., 20, 549–567, https://doi.org/10.5194/os-20-549-2024, 2024. a
van Westen, R. M., Jacques-Dumas, V., Boot, A. A., and Dijkstra, H. A.: The Role of Sea-ice Insulation Effects on the Probability of AMOC Transitions, J. Climate, 37, 6269–6284, https://doi.org/10.1175/JCLI-D-24-0060.1, 2024a. a
van Westen, R. M., Kliphuis, M., and Dijkstra, H. A.: Physics-based early warning signal shows that AMOC is on tipping course, Sci. Adv., 10, eadk1189, https://doi.org/10.1126/sciadv.adk1189, 2024b. a, b
Watts, M., Maslowski, W., Lee, Y. J., Kinney, J. C., and Osinski, R.: A Spatial Evaluation of Arctic Sea Ice and Regional Limitations in CMIP6 Historical Simulations, J. Climate, 34, 6399–6420, https://doi.org/10.1175/JCLI-D-20-0491.1, 2021. a, b
Weijer, W., Cheng, W., Drijfhout, S. S., Fedorov, A. V., Hu, A., Jackson, L. C., Liu, W., McDonagh, E. L., Mecking, J. V., and Zhang, J.: Stability of the Atlantic Meridional Overturning Circulation: A Review and Synthesis, J. Geophys. Res.-Oceans, 124, 5336–5375, https://doi.org/10.1029/2019JC015083, 2019. a
Wieners, K.-H., Giorgetta, M., Jungclaus, J., Reick, C., Esch, M., Bittner, M., Legutke, S., Schupfner, M., Wachsmann, F., Gayler, V., Haak, H., de Vrese, P., Raddatz, T., Mauritsen, T., von Storch, J.-S., Behrens, J., Brovkin, V., Claussen, M., Crueger, T., Fast, I., Fiedler, S., Hagemann, S., Hohenegger, C., Jahns, T., Kloster, S., Kinne, S., Lasslop, G., Kornblueh, L., Marotzke, J., Matei, D., Meraner, K., Mikolajewicz, U., Modali, K., Müller, W., Nabel, J., Notz, D., von Gehlen, K. P., Pincus, R., Pohlmann, H., Pongratz, J., Rast, S., Schmidt, H., Schnur, R., Schulzweida, U., Six, K., Stevens, B., Voigt, A., and Roeckner, E.: MPI-M MPI-ESM1.2-LR model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.6595, 2019. a
Wu, T., Chu, M., Dong, M., Fang, Y., Jie, W., Li, J., Li, W., Liu, Q., Shi, X., Xin, X., Yan, J., Zhang, F., Zhang, J., Zhang, L., and Zhang, Y.: BCC BCC-CSM2MR model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.2948, 2018. a
Yukimoto, S., Koshiro, T., Kawai, H., Oshima, N., Yoshida, K., Urakawa, S., Tsujino, H., Deushi, M., Tanaka, T., Hosaka, M., Yoshimura, H., Shindo, E., Mizuta, R., Ishii, M., Obata, A., and Adachi, Y.: MRI MRI-ESM2.0 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.6842, 2019. a
Zhang, J., Wu, T., Shi, X., Zhang, F., Li, J., Chu, M., Liu, Q., Yan, J., Ma, Q., and Wei, M.: BCC BCC-ESM1 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.2949, 2018. a
Zhang, Q., Liu, B., Li, S., and Zhou, T.: Understanding Models' Global Sea Surface Temperature Bias in Mean State: From CMIP5 to CMIP6, Geophys. Res. Lett., 50, e2022GL100888, https://doi.org/10.1029/2022GL100888, 2023. a
Zickfeld, K., Eby, M., and Weaver, A. J.: Carbon-cycle feedbacks of changes in the Atlantic meridional overturning circulation under future atmospheric CO2, Global Biogeochem. Cy., 22, GB3024, https://doi.org/10.1029/2007GB003118, 2008. a
Ziehn, T., Chamberlain, M., Lenton, A., Law, R., Bodman, R., Dix, M., Wang, Y., Dobrohotoff, P., Srbinovsky, J., Stevens, L., Vohralik, P., Mackallah, C., Sullivan, A., O'Farrell, S., and Druken, K.: CSIRO ACCESS-ESM1.5 model output prepared for CMIP6 CMIP historical, Earth System Grid Federation, https://doi.org/10.22033/ESGF/CMIP6.4272, 2019. a