the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Multi-annual predictions of hot, dry and hot-dry compound extremes
Paolo De Luca
Carlos Delgado-Torres
Balakrishnan Solaraju-Murali
Margarida Samso Cabre
Markus G. Donat
Hot-dry compound extremes have recently gained increasing attention due to their potential impacts on environments and societies. For these reasons, assessing climate predictions is essential to providing reliable information on such extremes. However, despite several studies focusing on compound extremes in the past and climate projections, little is known on a multi-annual timescale. In this regard, decadal climate predictions have been produced to provide useful information for this specific timescale. Thus, we evaluate the ability of the CMIP6 multi-model decadal climate hindcast to predict hot-dry climate extremes, as well as their hot and dry univariate counterparts, for the forecast years 2–5. The multi-model skillfully predicts hot-dry compound extremes and hot extremes over most land regions, while the skill is more limited for dry extremes. However, we find only minor and spatially limited improvements from the initialisation of the hindcasts, especially for the hot-dry compound extremes, with most of the skill coming from external forcings, especially long-term trends. Finally, we find that the decadal hindcast is able to reproduce the connections between the compound extremes and their hot and dry univariate components. Evaluations of decadal hindcasts, such as this, are an essential tool for establishing the potential and limitations of these products. In turn, they represent a necessary step in providing reliable and valuable information regarding such impactful extremes.
- Article
(27191 KB) - Full-text XML
- BibTeX
- EndNote
Extreme weather events, such as droughts, heatwaves, heavy precipitation and storms, often result from a combination of different physical processes that can interact on various spatial and temporal scales. The aggregation or sequence of these extreme events on the same or different temporal and spatial scales can be considered a compound extreme (Bevacqua et al., 2021). The combination of these hazards can amplify the impacts on climate-vulnerable sectors and society. However, analyzing them solely from a univariate perspective may lead to a severe underestimation of the risk (Leonard et al., 2014; Zscheischler and Fischer, 2020). Furthermore, the dependence among the univariate variables of a compound extreme can increase the likelihood of such events, compared to situations where the univariate variables show no dependence (Zscheischler and Seneviratne, 2017).
Among these compound extremes, hot and dry compound extremes have gained increasing attention due to the potential for elevated environmental, economic, and societal losses. In July and August 2010, Russia experienced strong dry and hot conditions, resulting in massive wildfires and a loss of 20 %–30 % of the grain production (Grumm, 2011). Other examples from recent years include the European summers of 2003, 2015, and 2018, all characterized by record-breaking hot and dry conditions (Sutanto et al., 2020; Vogel et al., 2020; Zscheischler and Fischer, 2020). Overall, the beginning of the 21st century has witnessed a rise in the frequency of droughts and heatwaves, leading to the belief that this trend will continue throughout the century (Ionita et al., 2017). Several studies have already focused on the variability of these extreme events in the past and how they are projected to change in the future (Zscheischler et al., 2018; Ridder et al., 2021; Bevacqua et al., 2022; De Luca and Donat, 2023). However, little is known about the predictability of these events on multi-annual time scales. Trustworthy predictions are crucial for developing strategic plans to mitigate potential impacts. From this perspective, predictions that cover a multi-annual range could be more useful for potential users than projections for the long-term future. For instance, multi-annual predictions of compound hot-dry extremes can inform strategic and preventive decisions across sectors. For instance, they can support agricultural planning such as irrigation investments, the multiplication of drought-resilient crop varieties, post-harvest management, and enhance the preparedness against pests and diseases (Delgado-Torres et al., 2025). Multi-annual climate information can also guide infrastructure and energy planning (Dunstone et al., 2022), including the enhancement of energy storage capacity or reinforcement of urban green areas to mitigate heat and dry stress, and anticipate impacts on society and environment.
Decadal predictions bridge the gap between short-term seasonal predictions and long-term climate projections. Unlike climate projections, climate predictions prescribe the initial state of the climate system to closely represent the observed climate state at the time of the prediction's initiation. This process is called model initialisation (Meehl et al., 2021). To account for the observational uncertainty in the initial conditions, these climate predictions are initialised from a certain number of slightly perturbed states. Each of the outcoming simulations is called an ensemble member. Decadal prediction systems initially contributed to the Coupled Model Intercomparison Project Phase 5 (CMIP5; Taylor et al. (2012)) and Phase 6 (CMIP6; Eyring et al. (2016)) and have been shown to produce skilful predictions for several climate variables such as near-surface temperature and, to a lesser extent, precipitation (Smith et al., 2019; Delgado-Torres et al., 2022). Multi-annual predictions, which focus on the initial period of the decadal predictions (2–5 years), demonstrate the potential to provide helpful information for various socio-economic sectors. These predictions demonstrate significant skill for the average of forecast years 2–5, which in some cases is comparable, especially in the prediction of temperatures, to the skill shown for seasonal predictions in the average of the corresponding forecast months (Kushnir et al., 2019; O'Kane et al., 2023; Delgado-Torres et al., 2023). Further studies have also investigated the predictability of extreme events on a multi-annual scale, providing comparable results: temperature-related extremes show stronger prediction skill compared to precipitation-related extremes (Solaraju-Murali et al., 2019; Delgado-Torres et al., 2023). To the best of our knowledge, decadal predictions have only been assessed for climate variables or univariate climate extremes. Here, we aim to assess the skill of hot-dry compound extremes in multi-annual predictions within a multi-model setup. We also investigate the improvement of the skill due to the initialisation of the forecast. Finally, given the importance of the interconnections between compound extremes and their univariate counterparts, we also analyse these correlations and compare the results between observations and models.
We analyse the prediction skill of hot-dry compound extremes for a multi-model ensemble from the Coupled Model Intercomparison Project Phase 6 (CMIP6) (Eyring et al., 2016). The list of the models, the number of ensemble members and the corresponding horizontal resolutions are presented in Table A1. For each model, we use two different sets of experiments. The first set, hereafter referred to as DCPP MME, consists of 10-year initialised hindcasts from the Decadal Climate Prediction Project (DCPP) (Boer et al., 2016). The DCPP, being a multi-system approach, does not specify standard data or methods regarding the initialisation process. For this reason, for any further information, we refer to the references in Table A1. This set of retrospective decadal predictions is produced every year from 1960 to 2009. For this study, we select and analyse the forecast years 2–5 from these decadal forecasts. Thus, in this specific setup, the initial forecast-year 2 is 1962, while the last forecast-year 5 is 2014. We select this subset from the decadal forecast because previous studies have shown that this timeframe benefits from the impact of initialization, thus providing added information compared to uninitialized simulations (Boer et al., 2016). The second set of experiments, referred to hereafter as Hist MME, consists of CMIP6 historical forcing simulations from 1962 to 2014. To estimate the skill of DCPP MME, we use two observation-based reference datasets. The first reference dataset, which will be referred to hereafter as GPCC-BEST, combines the Global Precipitation Climatology Centre (GPCC, Schneider et al. (2016)) for monthly total precipitation and the Berkeley Earth Surface Temperatures (BEST, Rohde and Hausfather (2020)) for daily maximum and minimum temperatures. However, observational datasets are influenced by the presence of missing data. To overcome this problem, and provide a second benchmark for our analyses, we use a second reference dataset that combines daily total precipitation, daily maximum and daily minimum temperature and is obtained from the European Centre for Medium-Range Weather Forecasts (ECMWF) Reanalysis v5, and will be referred to as ERA5 (Hersbach et al., 2020).
2.1 Univariate and compound extreme indicators
To identify hot-dry compound extreme days, we first define univariate hot and dry extremes. For temperature extremes, named hereafter TX90p, we select the days above the 90th percentile of daily maximum temperature, measuring the frequency of unusually warm days in each year, as per the definition of the Expert Team on Climate Change Detection and Indices (ETCCDI; Zhang et al. (2011)). The percentile is calculated with a distribution every day of the year, and with a 5 d running window. The use of a 5 d window enables the avoidance of seasonal biases associated with more extended window periods (Brunner and Voigt, 2024). For DCPP MME, this process is lead-time dependent; thus, we select every lead-day of the predictions (with its 5 d window) to build its corresponding distribution for that specific day. This method implicitly accounts for the drift of the decadal predictions. The percentile distributions are based on the entire period of the study, which, in the case of Hist MME, GPCC-BEST and ERA5 is the timeframe 1962–2014, while for DCPP MME, it depends on the forecast year, going from 1962–2011 for forecast year 2 to 1965–2014 for forecast year 5. In the case of the DCPP MME and Hist MME, the percentile is calculated using data from all the ensemble members for each model individually to account for the models' different variances.
For dry extremes, we use two different indicators: the Standardized Precipitation Index (SPI, McKee et al. (1993)) and the Standardized Precipitation-Evapotranspiration Index (SPEI, Vicente-Serrano et al. (2010)). The use of these two indicators allows us to explore two different types of drought. With the SPI, we explore a drought based only on precipitation (meteorological drought). In contrast, with the SPEI, we explore droughts based on the combination of precipitation and following evapotranspiration (hydrological droughts). These two indicators are calculated using a two-step method: an accumulation process and a standardization process. For the standardization process, we use a gamma distribution for the SPI and a log-logistic distribution for the SPEI, the latter utilising unbiased probability-weighted moments (Beguería et al., 2014; Stagge et al., 2015). The result is a standardized index for total precipitation (SPI) and one for the difference between precipitation and potential evapotranspiration (SPEI), each computed during selected accumulated periods, with potential evapotranspiration (PET) calculated using the Hargreaves method (Hargreaves, 1994). This specific method, which utilises precipitation, minimum and maximum temperatures, provides a good representation of potential evapotranspiration using relatively easy-to-access variables and minimal computational effort (De Luca and Donat, 2023). For computing these indices DCPP MME, we follow Solaraju-Murali et al. (2019). This method treats the accumulation and standardization steps differently due to the nature of decadal predictions. First, the accumulation is done from the first to the last forecast month of every initialisation. In the second step, the standardization is performed individually by building a distribution for each lead-month, using all initialisations together. As for TX90P, the lead-time dependent standardization process accounts for the drift in decadal predictions. Based on the results for the ensemble mean, the distributions show differences between lead years of about 5 %. The results for the single models show higher differences, but still considerably smaller compared to the absolute values obtained from the distributions, and limited to specific regions (not shown here). From the resulting time series (one for each month), we form the distribution used to calculate SPI and SPEI. Consequently, the start and end of the reference period will depend on the forecast month analysed. This process is done separately for each ensemble member. In this study, we examine a 3-month accumulation period. Thus, we analyse SPI3 and SPEI3, which are indicators of meteorological (SPI) and hydrological (SPEI) wet (positive values) or dry (negative values) conditions. Finally, at a grid-point level, we define as months with drought conditions all months with SPI or SPEI and name these dry indices as SPI3dry and SPEI3dry, respectively, where 3 represents the accumulation period in months (De Luca and Donat, 2023).
Hot-dry compound extremes are then identified following De Luca and Donat (2023), which retain all those TX90p days which fall into SPI3dry or SPEI3dry months. With this method, we obtain two types of hot-dry compound extremes: one based on SPI (HDSPI3, Hot-Dry selected by SPI3dry) and one on SPEI (HDSPEI3, Hot-Dry selected by SPEI3dry).
To assess the robustness of the extremes' thresholds selected in this study, we perform the same analysis of this study for hot extremes over the 95th percentile, SPI and SPEI with dry conditions below −1.5, and hot-dry compound extremes computed over these two previous conditions. In addition, different accumulation periods for the dry conditions represent different types of droughts. For this reason, considering the importance of assessing the influence of different types of droughts on hot-dry compound extremes, we also perform a sensitivity test for dry and hot-dry compound extremes for accumulation periods of 6 and 12 months.
2.2 Forecast quality assessment
We use the Spearman's rank correlation coefficient to assess the skill of the DCPP MME mean in predicting the annual TX90p, HDSPI3 and HDSPEI3, as well as the SPI3dry and SPEI3dry for the average of the forecast years 2–5. Even if the computation of the indicators is done at an ensemble-member level, we use the ensemble mean as in climate predictions, the ensemble mean is typically interpreted as the predictable component of the signal (Eade et al., 2014; Smith et al., 2019). To compare the reference datasets to the forecast average, we perform a running mean equal to the forecast years taken into account. This correlation coefficient measures the relationship between the forecasted and observed time series, respectively ranked from their lowest to their highest value (Wilks, 2011).
We also use the residual correlation to quantify the skill improvement due to initialisation over the skill that comes from the model responses to external forcing (Smith et al., 2019). This method involves linearly regressing out the ensemble mean of the Hist MME from the observations and the ensemble mean of the DCPP MME, obtaining the residual values. Finally, we calculate the correlation between the two residual time series, thereby measuring how much of the observed variability is captured by the DCPP MME forecast that is not already captured by the Hist MME forecast. To maintain consistency in the comparison between the DCPP MME and the Hist MME simulations, for the residual correlation analyses, we only use those models in the DCPP MME ensemble that have at least one member also in the Hist MME ensemble (see Table A1). For all the correlations shown here, we consider only values statistically significant at the 95 % confidence level (p-value < 0.05) using a one-tailed test for the prediction skill and a two-tailed test for the residual skill derived from initialisation, as well as the analysis on the correlations between univariate and hot-dry compound extremes.
3.1 Multi-annual prediction skill of hot, dry and hot-dry compound extremes
In this section, we present the forecast quality assessment of multi-annual averages for the DCPP MME's forecast years 2–5. The correlation analysis is performed on the HDSPI3 and HDSPEI3 indicators and their univariate counterparts (TX90p, SPI3dry and SPEI3dry) between the DCPP MME mean and GPCC-BEST (Fig. 1), as well as between DCPP MME mean and ERA5 as an alternative reference dataset (Fig. A1).
Figure 1Spearman's rank correlation for HDSPI3 (a), HDSPEI3 (b), TX90p (c), SPI3dry (d) and SPEI3dry (e) for the average forecast years 2–5 (1962–2014). Dots indicate regions where the correlation is not statistically significant at the 95 % confidence level. Grey areas indicate missing data.
For HDSPI3 (Fig. 1a), we observe significantly positive skill in the Mediterranean region, the Balkans, the Arabian Peninsula, southern and eastern Asia, the western coast of North America, Central America, most of South America, equatorial Africa, and southern Australia. On the other hand, HDSPEI3 shows significantly positive skill over most of the globe, with only the Arctic and sub-Arctic regions, the western Sahara, parts of South America and portions of equatorial Africa showing non-significant correlations (Fig. 1b).
When we analyse the skill of the univariate extremes that contribute to these compound extremes, we find that TX90p (Fig. 1c) shows statistically significant positive skill across the globe, with only a few regions, such as the lower latitudes of South America, Western Sahara and some areas of the sub-Arctic regions, showing non-significant values. These results are in accordance with Delgado-Torres et al. (2023).
On the other hand, SPI3dry and SPEI3dry show limited regions with significantly positive skill (Fig. 1d, e). For SPI3dry, most of the positive skill is found in northern and equatorial Africa, Scandinavia, certain areas of eastern Europe, and some northern and central Asian regions. Positive skill is also found in some parts of South America and Australia. Similarly, SPEI3dry exhibits positive and significant correlations over the Mediterranean region, large areas of Africa, the Arabian Peninsula, central and eastern Asia, Central America, South America, and the southern regions of Australia. In general, it appears that SPEI3dry shows higher skill than SPI3dry, especially over the mid-latitudes. If we compare the skill of hot-dry compound extremes to their univariate counterparts, in general, we see an enhanced skill compared to the univariate dry extremes, but not as high as shown by the hot ones. In addition, dry and especially hot-dry compound extremes computed from SPEI show increased skill compared to those calculated with SPI. This characteristic could be partly explained by the contribution of Potential Evapotranspiration, which, relying on temperature, positively contributes to the general skill. This explanation appears to be supported by the fact that the more weight temperature has in the definition of the hot-dry compound extremes (for example, HDSPEI3), the lower skill in dry condition (in this case SPEI3dry) seems to be a limiting factor in the overall skill.
By performing the analysis with ERA5 as the reference dataset (Fig. A1), we obtain some differences among the analysed indices. In the compound extremes (HDSPI3 and HDSPEI3), we find similar areas that show significant skill compared to GPCC-BEST. In addition, as in the previous results for GPCC-BEST, HDSPEI3 shows an enhanced skill compared to HDSPI3. Regarding the univariate extremes, TX90p still exhibits the strongest skill, and SPEI3dry shows stronger skill than SPI3dry, especially in the Northern Hemisphere. In general, compared to GPCC-BEST, the skill using ERA5 as a reference dataset shows a slight diminishment in the area where significant positive correlations are found. This diminishment is especially evident in the dry extremes (SPI3dry and SPEI3dry), where large areas, particularly at high latitudes and in equatorial Africa, show strong negative correlations.
Finally, in the sensitivity test, we find that changing the thresholds of the univariate and compound extremes does not change the overall results (Fig. A7). In fact, we find significant correlations in the same areas, with a slight difference between the strength of the correlations, especially in dry and compound extremes. On the other hand, a change in the accumulation period for the dry and compound extremes appears to bring significant differences (Figs. A8 and A9). Compared to extremes calculated with a 3-month accumulation, accumulating over 6 and 12 months generally shows a decrease in significant skill, the longer the accumulation period. However, this is not the case with Northeast Asia, where longer accumulations show higher correlations.
3.2 Residual correlations and trends
The skill of the decadal forecast shown above (Figs. 1 and A1) can be essentially divided into two components: the skill resulting from external forcings and the skill resulting from initialisation. In particular, the skill coming from the initialisation is beneficial in providing information regarding the variability of such extremes in the upcoming years. In this section, we analyse to what extent the two elements contribute to the overall skill.
To quantify the added skill contributed by the models' initialisation in decadal predictions, we perform the correlation between the residuals of GPCC-BEST and the DCPP MME ensemble mean (Fig. 2). The results for HDSPI3 globally show small amounts. Furthermore, these areas are scattered and spatially isolated, not showing a larger region where a clear added skill can be found (Fig. 2a). For HDSPEI3, on the other hand, despite showing a similar spatial pattern, areas of consistent added skill can be identified in central Asia and southern Australia, as well as in some regions of the high latitudes of North America (Fig. 2b).
Figure 2Spearman's rank residual correlation for HDSPI3 (a), HDSPEI3 (b), TX90p (c), SPI3dry (d) and SPEI3dry (e) for the average forecast years 2–5. Dots indicate areas where the residual correlation is not statistically significant at the 95 % confidence level. Grey areas indicate missing data.
On the other hand, between the univariate extremes, TX90p (Fig. 2c) appears to show the most benefit from the initialisation, with added skill located in central and eastern Asia, Greenland and southern Australia. Additionally, we also find some added value in small parts of North America, South America and Africa. For dry extremes, the areas of added skill are more limited. SPI3dry demonstrates added skill in some small areas of South America, Asia and Scandinavia (Fig. 2d). Moreover, northern Africa also shows a homogeneous area of added skill, which, at least in part, corresponds to the significant skill observed for SPI3dry (Fig. 1d). On the other hand, SPEI3dry shows slightly fewer areas of added skill, limited to southern Australia and central Asia (Fig. 2e), the latter being similar to the region where we find added skill for TX90p.
The residual correlation obtained with ERA5 as a reference dataset yields similar results, both in terms of general patterns and regions showing significant residual correlations (Fig. A2). However, for HDSPI3 and HDSPEI3, as well as in SPI3dry and SPEI3dry, an increase in added skill is noticeable in the tropical regions of South America and eastern Asia.
For different thresholds and accumulation, the residual skill yields results similar to those previously shown for the overall skill. Different thresholds lead to identical areas where we observe a significant contribution from initialization, with a slight reduction in size and strength of the correlation (Fig. A10). A change in the accumulation period instead leads to more substantial differences between the 3-month accumulation and the 6- and 12-month accumulations, respectively (Figs. A11 and A12). These differences, similar to those in the previous section, appear to intensify as the accumulation period is extended.
In general, the results in Figs. 2 and A2 show that, for the univariate hot and dry extremes, we have certain limited areas where the initialisation significantly adds skill to the forecast (Figs. 1, A1). However, the added skill detected in certain areas for the univariate extremes is mostly lost when analysing the compound ones. Indeed, the results suggest that the skill relies strongly on the models' response to external forcings, such as the strong warming trend in temperatures, especially in the case of lower latitudes for HDSPEI3, where we find strong overall skill (in terms of correlation with observations) but little residual correlation.
Figure 3Statistically significant Spearman's ank correlation against the trend sign for HDSPI3 (a), HDSPEI3 (b), TX90p (c), SPI3dry (d) and SPEI3dry (e) for the average forecast years 2–5 (as in Fig. 1). Pink (magenta) dots indicate areas where the correlation is positive and statistically significant, and where the trends of GPCC-BEST and DCPP MME show the same (opposite) sign. Light blue (Dark blue) dots indicate areas where the correlation is negative and statistically significant, and where the trends of GPCC-BEST and DCPP MME show the same (opposite) sign. Grey areas indicate grid points where the correlation is non-significant.
To better establish the role of external forcings and trends in predicting these extremes, we explore the connection between significant correlations and the signs of the trends (Fig. 3). In this analysis, we select the regions of DCPP MME which show significant positive or negative skill. For these specific regions, we then compare the trend signs for DCPP MME and GPCC-BEST, specifically examining whether the trend signs of the two datasets are in agreement. We can then classify regions into four categories: significant positive with a trend in agreement (or disagreement), and significant negative with a trend in agreement (or disagreement). From these results, a positive significant skill in most regions is tied to an agreement in trend sign between DCPP MME and GPCC-BEST. In addition, in most areas where the correlation is significantly negative, we find that the trends between the two datasets are discordant. An exception can be observed for SPI3dry, where, especially in Eastern Europe, the Western Sahel and regions of mid-latitude South America, the correlations are significantly positive, but the sign of the trend is discordant (Fig. 3d). Two factors could help explain this. The first is the weak trend in precipitation in those areas for both datasets (Fig. A4), which also explains the opposite scenario, a significantly negative correlation but coinciding trend signs, visible, for example, in South Africa and tropical South America. The second factor is the added skill from initialisation for those grid points, which, despite the discordant trend, leads to a significant positive skill (Fig. 2d).
On the contrary, all the extreme indices that include temperature show a strong agreement between positive (negative) significant correlations and agreeing (discordant) trend signs, specifically evident in HDSPEI3 and SPEI3dry (Fig. 3b and e). These results are in line with the findings of Donat et al. (2023). The same conclusions can be drawn concerning the results when using ERA5 as a reference dataset (Fig. A3). Here, the areas with opposite trends and significantly negative correlations are evident in tropical Africa for SPI3dry and SPEI3dry and in the high boreal latitudes for HDSPI3 and HDSPEI3. When examining the trends, we observe that these areas exhibit opposite trends in DCPP MME and ERA5 for the aforementioned extreme indices. However, in these regions, we can also observe areas with significant negative correlations and agreeing trend signs (Fig. A3d and A3e). These results could be explained by the weak trends in the region (Fig. A5), making it a non-significant factor in determining significant skill. For SPI3dry, we observe the same situation as for GPCC-BEST in tropical South America (Fig. A3d). In this region, in fact, we observe discordant trends between DCPP MME and ERA5, yet significant positive correlations. We can also see, between DCPP MME and ERA5, that in that area, the precipitation trends are not strong (Fig. A5) and that the forecast shows a large region of added skill for ERA5 (Fig. A2d).
In summary, the results discussed above indicate that decadal predictions exhibit little added skill compared to Hist MME, especially for dry and hot-dry compound extremes. However, most of the skill seen in Sect. 3.1 is still tied to the signal associated with external forcings. In particular, we observe that the presence of the skill, in the absence of added skill from the initialisation, is tied to the trends of the models and the reference datasets, and whether these two trends agree in their signs, especially if the selected variable is affected by strong temperature trends. The results presented in this section highlight the current limitations of multi-annual predictions, particularly in terms of providing information on the upcoming variability of such extreme events. In this regard, the question now lies in the extent to which the different univariate extremes influence the detection and the variability of the compound extremes in the model and the reference datasets.
3.3 Correlations between univariate and compound extremes
The method used in this study, which relies on the co-occurrence of hot and dry conditions at the same location, suggests that one of the two univariate extremes may play a more important role in the overall detection of the events. In addition, this co-occurrence may vary across different datasets, for example, between the DCPP MME and the observation-based datasets, resulting in different counts of compound extremes and, consequently, different interannual variations. Thus, we perform a correlation analysis between the annual sum of univariate and compound extremes to assess the representation of the underlying interconnections. Since the HDSPI3 and HDSPEI3 were estimated for every ensemble member, we calculated the correlations between the univariate and compound values for each ensemble member. We then selected the median correlation across DCPP MME. Finally, we consider a significant correlation in those areas where at least 50 % of the ensemble members show significant positive or negative correlations.
The results for the correlations between HDSPI3 and HDSPEI3 and the univariate extremes are shown in Figs. 4 and 5, respectively. If we look at the correlations between HDSPI3 and TX90p in GPCC-BEST and the DCPP MME (Fig. 4a, b), we can see that they both show significantly positive correlations across most of the globe. In addition, the DCPP MME can reproduce the large areas where the connections between HDSPI3 and TX90p are weaker, such as the high latitudes of the Northern Hemisphere. However, it is less effective in areas such as Africa or central Asia, where correlations in DCPP MME are still significant, unlike those in GPCC-BEST. On the other hand, when looking at the correlations between HDSPI3, SPI3dry and SPEI3dry, we see similar patterns (Figs. 4c, d and e, f, respectively). Both GPCC-BEST and DCPP MME show strong positive correlations between drought occurrence and hot-dry compound extremes in most regions globally, except mainly in northern Africa. However, as seen in the correlations with TX90p (Fig. 4a and b), DCPP MME partially fails to reproduce the high spatial differences in connections observed in GPCC-BEST, with only larger areas, such as northern Africa, showing differences in the correlation values (Fig. 4d and e).
Figure 4Spearman's rank correlation between the annual sum of HDSPI3 and TX90p (a, b), SPEI3dry (c, d) and SPI3dry (e, f), respectively. For (a), (c), and (e): Correlations are calculated with GPCC-BEST, and dots indicate correlations not statistically significant at the 95 % confidence level. For (b), (d) and (f): Correlations represent the median (ensemble-wise) correlation for the DCPP MME ensemble, and dots indicate grid points where more than 50 % of the ensemble members provide correlations not statistically significant at the 95 % confidence level. Grey areas indicate missing data.
For the correlations between HDSPEI3 and the univariate extremes (Fig. 5), we observe patterns similar to those in Fig. 4. In this case, the correlations between HDSPEI3 and TX90p appear more substantial and more widely significant in GPCC-BEST and the DCPP MME, with the high latitudes showing the weakest correlations (Fig. 5a, b). The correlations between HDSPEI3 and SPEI3dry in GPCC-BEST exhibit strong positive correlations globally, with DCPP MME reproducing them significantly well (Fig. 5c, d). On the other hand, the correlations between HDSPEI3 and SPI3dry in GPCC-BEST show high spatial differences, with large areas showing non-significant correlations. However, DCPP MME shows non-significant correlations uniquely across northern Africa, the Arabian Peninsula and central Asia (Fig. 5e, f).
Analysing the correlations between compound and univariate extremes in ERA5, we observe a pattern similar to the results seen above for GPCC-BEST (Fig. A6). In general, both reference datasets (GPCC-BEST and ERA5) show stronger correlations for TX90p with HDSPEI3, especially in the equatorial and mid-latitudes. Still, we find significant correlations between hot-dry compounds and dry extremes, which appear to be an essential driving factor in the variability of both HDSPI3 and HDSPEI3, and more specifically for the dry extreme with which the compound extreme is being calculated (i.e. SPI3dry for HDSPI3, SPEI3dry for HDSPEI3). The DCPP MME can reproduce these general patterns, particularly the weaker correlations in the high Northern Hemisphere latitudes for TX90p and in northern Africa for the dry extremes. However, it fails to represent the high spatial differences in the correlation values, which instead can be seen in the reference datasets. These results can be attributed to the smoothing that inevitably occurs when calculating the ensemble mean, causing the variability of the individual ensemble members to be lost.
This study provides an overall assessment of hot, dry and hot-dry compound extremes in multi-annual predictions. The results show that the predictive capabilities of the decadal forecast systems are not only limited to the univariate extremes but also show value for the compound extreme occurrences. Compared to the skill shown in the univariate extremes, compound extremes show higher or lower predictability: global compound skill is smaller compared to hot extremes, especially at high latitudes; on the other hand, they show higher skill compared to their dry univariate counterparts. These differences are likely associated with the fact that temperature-related extremes are more predictable than their precipitation-related counterparts, with the latter's predictability still representing a bottleneck that could damp the signal (Solaraju-Murali et al., 2019; Bellucci et al., 2015; Delgado-Torres et al., 2023; Smith et al., 2019; De Luca and Donat, 2023). From this perspective, the compound extremes partly benefit from the higher predictability of hot extremes.
However, the results from the residual correlation suggest that most of the skill of the compound extremes comes from external forcings, which cause long-term trends in some extreme indices. This conclusion is partially less true for dry extremes, where we can find specific regions with a weak trend, and where the initialisation actively plays a role in providing significant skill. In fact, we observe that in the variables where temperatures play a role in defining the extreme indices (i.e. TX90p, SPEI3dry, HDSPI3 and HDSPEI3), the tie with the trend is nearly ubiquitous (Donat et al., 2023). Thus, a skilful representation of the interannual variability still lacks in multi-annual predictions. This characteristic represents a limit to the usability of these forecasts and the information they can provide.
In addition to these results, the analysis of the interconnections between compound and univariate extremes shows that the decadal forecast systems are generally able to reproduce the connections between the compound extremes and their univariate counterparts based on observations, attributing a leading role to the dry conditions, and specifically to the one on which the compound extreme is calculated (e.g. drought defined based on SPI or SPEI). Thus, if we compare these results with the prediction skill, we can identify where the main limiting factor in the prediction of hot-dry compound extremes may lie: dry extremes show to be the leading factor determining the variability of hot-dry extremes, but on the other hand, it is the dry extremes that show the least prediction skill on a multi-annual time range.
Additionally, we confirm that using different reference datasets (GPCC-BEST and ERA5) yields results that differ slightly, but not significantly. The differences observed in the two reference datasets, especially for dry and hot-dry compound extremes, could be attributable to the different uncertainties in the datasets, such as the wet bias in Central Africa or the dry bias in the Northern Hemisphere continental areas for ERA5 (Hassler and Lauer, 2021).
Regarding the sensitivity analysis, a change in the threshold of the extreme events does not result in significant differences. The reason could be connected to the threshold of the hot (dry) extremes, which generally belong to the higher (lower) end of the distribution, and typically exhibit the same characteristics, especially in a normal distribution. On the other hand, we observe that a change in the accumulation period has a stronger impact on the skill of predicting dry and hot-dry compound extremes. By increasing the accumulation period of the dry conditions, we are observing different types of droughts and consequent impacts. 3-month accumulation SPI and SPEI are related to more short-term dry conditions, with an impact on stream-flow and soil moisture. On the other hand, 6-month accumulation represents medium-term dry conditions, related to anomalies in specific seasons, with impacts on stream-flow and reservoir levels. Finally, a 12-month accumulation reflects long-term precipitation patterns, with more long-lasting impacts on streamflows, reservoir levels and even groundwater levels (Svoboda et al., 2012). These differences in the type of drought also influence the predictability of such dry conditions. In fact, on one side, selecting a longer accumulation period leads to including a time range which is otherwise not considered, thus changing the nature of the variable in question. On the other side, longer accumulation periods tend to gravitate more towards zero, thus changing the predictability of the indicator. This interpretation could explain the increase and decrease in prediction skill between different accumulation periods in several regions and why the areas where we find the strongest changes between accumulation periods are common to both the overall and residual skills, underscoring the influence of the accumulation period on the internal variability of the dry condition. For this reason, the selection of the accumulation period when calculating SPI and SPEI should be carefully considered, depending on the type of drought one wants to observe.
We conclude that CMIP6 decadal forecast systems can skillfully predict the compound hot and dry extremes over large parts of the world at the multi-annual scale. However, this skill is strongly tied to external forcings, and a significant gap remains for improvement in predicting interannual variability. Finally, the decadal prediction systems can detect the underlying connections between compound and univariate extremes, even if at a large-scale level, showing the leading role of dry conditions. Similar forecast quality assessments should be conducted on a regional and seasonal scale, as well as for other compound indicators, to provide even more insightful and specific information on such extreme phenomena. These further steps could provide the information necessary to enable targeted climate information to support mitigation and adaptation policies against potential damages caused by these events.
Table A1List of models and number of ensemble members used for the DCPP MME and the Hist MME. Abbreviations used for the initialization schemes: atm,atmospheric component; ocn,ocean/sea-ice component; FF, full-field; AN,anomalies; EnKF,Ensemble Kalman Filter; IAU,incremental analysis updating. Where available, the datasets used in the initialization are shown in brackets. For further details we refer to the publications reported as references.
Figure A1Spearman rank correlation for TX90p (a), SPI3dry and SPEI3dry (b, c) and HDSPI3 and HDSPEI3 (d, e) for the average forecast years 2–5. Correlations are performed between the DCPP MME ensemble mean and the ERA5 dataset. Hatchings indicate correlations not statistically significant at the 95 % confidence level (p-value < 0.05) using a one-tailed test.
Figure A2Spearman rank residual correlation for TX90p (a), SPI3dry and SPEI3dry (b, c) and HDSPI3 and HDSPEI3 (d, e) for the average forecast years 2–5. Correlations are performed between the DCPP MME ensemble mean and the ERA5 dataset. Hatchings indicate correlations not statistically significant at the 95 % confidence level (p-value < 0.05) using a two-tailed test.
Figure A3Statistically significant Spearman Rank correlations against the trend sign for HDSPI3 (a), HDSPEI3 (b), TX90p (d), SPI3dry (c) and SPEI3dry (e) for the average forecast years 2–5 (as in Fig. 1). Correlations and trend sign comparison are performed between the DCPP MME ensemble mean and the ERA5 dataset. Pink (magenta) dots indicate areas where the correlation is positive and statistically significant at the 95 % confidence level (p-value < 0.05) and where the trend of GPCC-BEST and DCPP MME show the same (opposite) sign. Light blue (Dark blue) dots indicate areas where the correlation is negative statistically significant at the 95 % confidence level (p-value < 0.05) and where the trend of GPCC-BEST and DCPP MME show the same (opposite) sign. Grey areas indicate grid points where the correlation is non-significant.
Figure A4First row: Spearman rank correlation between the GPCC-BEST dataset and the DCPP MME ensemble mean for HDSPI3, HDSPEI3, TX90p, SPI3dry and SPEI3dry respectively. Hatchings indicate correlations not statistically significant at the 95 % confidence level (p-value < 0.05) using a one-tailed test. Second row: Normalized trends in the GPCC-BEST dataset for HDSPI3, HDSPEI3, TX90p, SPI3dry and SPEI3dry respectively. Third row: Median normalized trends, calculated ensemble-wise, in the DCPP MME ensemble for HDSPI3, HDSPEI3, TX90p, SPI3dry and SPEI3dry respectively. Grey areas indicate missing data.
Figure A5First row: Spearman rank correlation between the ERA5 dataset and the DCPP MME ensemble mean for HDSPI3, HDSPEI3, TX90p, SPI3dry and SPEI3dry respectively. Hatchings indicate correlations not statistically significant at the 95 % confidence level (p-value < 0.05) using a one-tailed test. Second row: Normalized trends in the ERA5 dataset for HDSPI3, HDSPEI3, TX90p, SPI3dry and SPEI3dry respectively. Third row: Median normalized trends, calculated ensemble-wise, in the DCPP MME ensemble for HDSPI3, HDSPEI3, TX90p, SPI3dry and SPEI3dry respectively.
Figure A6Spearman rank correlation between the annual sum of HDSPI3 (a, c, e) and HDSPEI3 (b, d, f) and TX90p, SPEI3dry and SPI3dry respectively, in the ERA5 dataset. Hatchings indicate correlations not statistically significant at the 95 % confidence level (p-value < 0.05) using a two-tailed test.
Figure A7First column: Spearman's rank correlation for calculated for different extreme thresholds (First column). Correlations are performed between the DCPP MME ensemble mean and the GPCC-BEST dataset. Second column: Difference between the thresholds used in the main study and the ones shown in this figure. Hatchings indicate correlations (or differences between correlations) not statistically significant at the 95 % confidence level. The significance of the difference has been computed using a Steiger z-test. Grey areas indicate missing data.
Figure A8First column: Spearman's rank correlation for calculated for dry and hot-dry compounds with a 6-month accumulation. Correlations are performed between the DCPP MME ensemble mean and the GPCC-BEST dataset. Second column: Difference between the accumulation used in the main study and the one shown in this figure. Hatchings indicate correlations (or differences between correlations) not statistically significant at the 95 % confidence level. The significance of the difference has been computed using a Steiger z-test. Grey areas indicate missing data.
Figure A9First column: Spearman's rank correlation calculated for dry and hot-dry compounds with a 12-month accumulation. Correlations are performed between the DCPP MME ensemble mean and the GPCC-BEST dataset. Second column: Difference between the accumulation used in the main study and the one shown in this figure. Hatchings indicate correlations (or differences between correlations) not statistically significant at the 95 % confidence level. The significance of the difference has been computed using a Steiger z-test. Grey areas indicate missing data.
Figure A10First column: Spearman's rank residual correlation calculated for different extreme thresholds. Correlations are performed between the DCPP MME ensemble mean and the GPCC-BEST dataset. Second column: Difference between the thresholds used in the main study and the ones shown in this figure. Hatchings indicate correlations (or differences between correlations) not statistically significant at the 95 % confidence level. The significance of the difference has been computed using a Steiger z-test. Grey areas indicate missing data.
Figure A11First column: Spearman's rank residual correlation calculated for dry and hot-dry compounds with a 6-month accumulation. Correlations are performed between the DCPP MME ensemble mean and the GPCC-BEST dataset. Second column: Difference between the accumulation used in the main study and the one shown in this figure. Hatchings indicate correlations (or differences between correlations) not statistically significant at the 95 % confidence level. The significance of the difference has been computed using a Steiger z-test. Grey areas indicate missing data.
Figure A12First column: Spearman's rank residual correlation calculated for dry and hot-dry compounds with a 6-month accumulation (First column). Correlations are performed between the DCPP MME ensemble mean and the GPCC-BEST dataset. Second column: difference between the accumulation used in the main study and the one shown in this figure. Hatchings indicate correlations (or differences between correlations) not statistically significant at the 95 % confidence level. The significance of the difference has been computed using a Steiger z-test. Grey areas indicate missing data.
Gridded ERA5 data ranging from 1961 to 2014 was downloaded from https://cds.climate.copernicus.eu/ (last access: 4 December 2025). Gridded GPCC data ranging from 1961 to 2014 were downloaded from https://opendata.dwd.de/climate_environment/GPCC/html/fulldata-monthly_v2020_doi_download.html (last access: 4 December 2025). Gridded BEST data ranging from 1961 to 2014 were downloaded from https://berkeleyearth.org/data/ (last access: 4 December 2025). CMIP6 dcppA hindcasts and historical simulations were downloaded from https://esg-dn1.nsc.liu.se/projects/esgf-liu (last access: 4 December 2025). The scripts used to support the results of this analysis are available on Zenodo: https://doi.org/10.5281/zenodo.17867448 (Aranyossy, 2025).
AA, MD and PDL conceptualized and designed the study. MSC provided the data for the simulations, observations and reanalysis. CDT and BSM contributed in the analyses. AA wrote the first draft. PDL, CDT and MD contributed to interpreting the results, discussing the findings and improving the final version of the paper.
The contact author has declared that none of the authors has any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
This research contributes to the Horizon Europe project ASPECT (grant no. 101081460). We are also grateful for support from the Departament de Recerca i Universitats de la Generalitat de Catalunya, via the Climate Variability and Change (CVC) Research Group at the Barcelona Supercomputing Center (reference 2021 SGR 00786), and support from the AXA Research Fund. The authors thank the three anonymous reviewers for their helpful comments.
This project has received support from the European Union's Horizon Europe research and innovation programme under grant agreement no. 101081460 (ASPECT). Alvise Aranyossy has received funding from MCIN/AEI/10.13039/501100011033 and ESF Investing in Your Future (grant no. PRE2022-104391). Paolo De Luca was funded by the EU Horizon Europe Marie Skłodowska-Curie Actions (grant no. 101059659).
This paper was edited by Olivia Martius and reviewed by three anonymous referees.
Aranyossy, A.: Multi-annual predictions of hot, dry and hot-dry compound extremes, Zenodo [code], https://doi.org/10.5281/zenodo.17867448, 2025. a
Beguería, S., Vicente-Serrano, S. M., Reig, F., and Latorre, B.: Standardized precipitation evapotranspiration index (SPEI) revisited: parameter fitting, evapotranspiration models, tools, datasets and drought monitoring, International Journal of Climatology, 34, 3001–3023, 2014. a
Bellucci, A., Haarsma, R., Gualdi, S., Athanasiadis, P. J., Caian, M., Cassou, C., Fernandez, E., Germe, A., Jungclaus, J., Kröger, J., Matei, D., Müller, W., Pohlmann, H., Melia, D. S. Y., Sanchez, E., Smith, D., Terray, L., Wyser, K., and Yang, S.: An assessment of a multi-model ensemble of decadal climate predictions, Climate Dynamics, 44, 2787–2806, 2015. a
Bethke, I., Wang, Y., Counillon, F., Keenlyside, N., Kimmritz, M., Fransner, F., Samuelsen, A., Langehaug, H., Svendsen, L., Chiu, P.-G., Passos, L., Bentsen, M., Guo, C., Gupta, A., Tjiputra, J., Kirkevåg, A., Olivié, D., Seland, Ø., Solsvik Vågane, J., Fan, Y., and Eldevik, T.: NorCPM1 and its contribution to CMIP6 DCPP, Geosci. Model Dev., 14, 7073–7116, https://doi.org/10.5194/gmd-14-7073-2021, 2021. a, b
Bevacqua, E., De Michele, C., Manning, C., Couasnon, A., Ribeiro, A. F. S., Ramos, A. M., Vignotto, E., Bastos, A., Blesić, S., Durante, F., Hillier, J., Oliveira, S. C., Pinto, J. G., Ragno, E., Rivoire, P., Saunders, K., Van Der Wiel, K., Wu, W., Zhang, T., and Zscheischler, J.: Guidelines for studying diverse types of compound weather and climate events, Earth's Future, 9, e2021EF002340, https://doi.org/10.1029/2021EF002340, 2021. a
Bevacqua, E., Zappa, G., Lehner, F., and Zscheischler, J.: Precipitation trends determine future occurrences of compound hot–dry events, Nature Climate Change, 12, 350–355, 2022. a
Bilbao, R., Wild, S., Ortega, P., Acosta-Navarro, J., Arsouze, T., Bretonnière, P.-A., Caron, L.-P., Castrillo, M., Cruz-García, R., Cvijanovic, I., Doblas-Reyes, F. J., Donat, M., Dutra, E., Echevarría, P., Ho, A.-C., Loosveldt-Tomas, S., Moreno-Chamarro, E., Pérez-Zanon, N., Ramos, A., Ruprich-Robert, Y., Sicardi, V., Tourigny, E., and Vegas-Regidor, J.: Assessment of a full-field initialized decadal climate prediction system with the CMIP6 version of EC-Earth, Earth Syst. Dynam., 12, 173–196, https://doi.org/10.5194/esd-12-173-2021, 2021. a
Boer, G. J., Smith, D. M., Cassou, C., Doblas-Reyes, F., Danabasoglu, G., Kirtman, B., Kushnir, Y., Kimoto, M., Meehl, G. A., Msadek, R., Mueller, W. A., Taylor, K. E., Zwiers, F., Rixen, M., Ruprich-Robert, Y., and Eade, R.: The Decadal Climate Prediction Project (DCPP) contribution to CMIP6, Geosci. Model Dev., 9, 3751–3777, https://doi.org/10.5194/gmd-9-3751-2016, 2016. a, b
Boucher, O., Servonnat, J., Albright, A. L., Aumont, O., Balkanski, Y., Bastrikov, V., Bekki, S., Bonnet, R., Bony, S., Bopp, L., Braconnot, P., Brockmann, P., Cadule, P., Caubel, A., Cheruy, F., Codron, F., Cozic, A., Cugnet, D., D’Andrea, F., Davini, P., de Lavergne, C., Denvil, S., Deshayes, J., Devilliers, M., Ducharne, A., Dufresne, J., Dupont, E., Éthé, C., Fairhead, L., Falletti, L., Flavoni, S., Foujols, M., Gardoll, S., Gastineau, G., Ghattas, J., Grandpeix, J., Guenet, B., Guez, L. E, Guilyardi, E., Guimberteau, M., Hauglustaine, D., Hourdin, F., Idelkadi, A., Joussaume, S., Kageyama, M., Khodri, M., Krinner, G., Lebas, N., Levavasseur, G., Lévy, C., Li, L., Lott, F., Lurton, T., Luyssaert, S., Madec, G., Madeleine, J., Maignan, F., Marchand, M., Marti, O., Mellul, L., Meurdesoif, Y., Mignot, J., Musat, I., Ottlé, C., Peylin, P., Planton, Y., Polcher, J., Rio, C., Rochetin, N., Rousset, C., Sepulchre, P., Sima, A., Swingedouw, D., Thiéblemont, R., Traore, A. K., Vancoppenolle, M., Vial, J., Vialard, J., Viovy, N., and Vuichard, N.: Presentation and evaluation of the IPSL-CM6A-LR climate model, Journal of Advances in Modeling Earth Systems, 12, e2019MS002010, https://doi.org/10.1029/2019ms002010, 2020. a
Brunner, L. and Voigt, A.: Pitfalls in diagnosing temperature extremes, Nature Communications, 15, 2087, https://doi.org/10.1038/s41467-024-46349-x, 2024. a
Delgado-Torres, C., Donat, M. G., Gonzalez-Reviriego, N., Caron, L., Athanasiadis, P. J., Bretonnière, P., Dunstone, N. J., Ho, A., Nicoli, D., Pankatz, K., Paxian, A., Pérez-Zanón, N., Cabré, M. S., Solaraju-Murali, B., Soret, A., and Doblas-Reyes, F. J.: Multi-model forecast quality assessment of CMIP6 decadal predictions, Journal of Climate, 35, 4363–4382, 2022. a
Delgado-Torres, C., Donat, M. G., Soret, A., González-Reviriego, N., Bretonnière, P.-A., Ho, A.-C., Pérez-Zanón, N., Cabré, M. S., and Doblas-Reyes, F. J.: Multi-annual predictions of the frequency and intensity of daily temperature and precipitation extremes, Environmental Research Letters, 18, 034031, https://doi.org/10.1088/1748-9326/acbbe1, 2023. a, b, c, d
Delgado-Torres, C., Octenjak, S., Marcos-Matamoros, R., Pérez-Zanón, N., Baulenas, E., Doblas-Reyes, F. J., Donat, M. G., Lwiza, L. M., Milders, N., Soret, A., Whittlesey, S., and Bojovic, D.: Supporting food security with multi-annual climate information: Co-production of climate services for the Southern African Development Community, Science of The Total Environment, 975, 179259, https://doi.org/10.1016/j.scitotenv.2025.179259, 2025. a
De Luca, P. and Donat, M. G.: Projected changes in hot, dry, and compound hot-dry extremes over global land regions, Geophysical Research Letters, 50, e2022GL102493, https://doi.org/10.1029/2022gl102493, 2023. a, b, c, d, e
Donat, M. G., Delgado-Torres, C., De Luca, P., Mahmood, R., Ortega, P., and Doblas-Reyes, F. J.: How credibly do CMIP6 simulations capture historical mean and extreme precipitation changes?, Geophysical Research Letters, 50, e2022GL102466, https://doi.org/10.1029/2022GL102466, 2023. a, b
Dunstone, N., Lockwood, J., Solaraju-Murali, B., Reinhardt, K., Tsartsali, E. E., Athanasiadis, P. J., Bellucci, A., Brookshaw, A., Caron, L., Doblas-Reyes, F. J., Früh, B., González-Reviriego, N., Gualdi, S., Hermanson, L., Materia, S., Nicodemou, A., Nicolì, D., Pankatz, K., Paxian, A., Scaife, A., Smith, D. and Thornton, H. E.: Towards useful decadal climate services, Bulletin of the American Meteorological Society, 103, E1705–E1719, 2022. a
Eade, R., Smith, D., Scaife, A., Wallace, E., Dunstone, N., Hermanson, L., and Robinson, N.: Do seasonal-to-decadal climate predictions underestimate the predictability of the real world?, Geophysical Research Letters, 41, 5620–5628, 2014. a
Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958, https://doi.org/10.5194/gmd-9-1937-2016, 2016. a, b
Grumm, R. H.: The central European and Russian heat event of July–August 2010, Bulletin of the American Meteorological Society, 92, 1285–1296, 2011. a
Hargreaves, G. H.: Defining and using reference evapotranspiration, Journal of Irrigation and Drainage Engineering, 120, 1132–1139, 1994. a
Hassler, B. and Lauer, A.: Comparison of reanalysis and observational precipitation datasets including ERA5 and WFDE5, Atmosphere, 12, 1462, https://doi.org/10.3390/atmos12111462, 2021. a
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz‐Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.: The ERA5 global reanalysis, Quarterly Journal of the Royal Meteorological Society, 146, 1999–2049, 2020. a
Ionita, M., Tallaksen, L. M., Kingston, D. G., Stagge, J. H., Laaha, G., Van Lanen, H. A. J., Scholz, P., Chelcea, S. M., and Haslinger, K.: The European 2015 drought from a climatological perspective, Hydrol. Earth Syst. Sci., 21, 1397–1419, https://doi.org/10.5194/hess-21-1397-2017, 2017. a
Kataoka, T., Tatebe, H., Koyama, H., Mochizuki, T., Ogochi, K., Naoe, H., Imada, Y., Shiogama, H., Kimoto, M., and Watanabe, M.: Seasonal to decadal predictions with MIROC6: Description and basic evaluation, Journal of Advances in Modeling Earth Systems, 12, e2019MS002035, https://doi.org/10.1029/2019ms002035, 2020. a
Kushnir, Y., Scaife, A. A., Arritt, R., Balsamo, G., Boer, G., Doblas-Reyes, F., Hawkins, E., Kimoto, M., Kolli, R. K., Kumar, A., Matei, D., Matthes, K., Müller, W. A., O'Kane, T., Perlwitz, J., Power, S., Raphael, M., Shimpo, A., Smith, D., Tuma, M., and Wu, B.: Towards operational predictions of the near-term climate, Nature Climate Change, 9, 94–101, 2019. a
Leonard, M., Westra, S., Phatak, A., Lambert, M., van den Hurk, B., McInnes, K., Risbey, J., Schuster, S., Jakob, D., and Stafford-Smith, M.: A compound event framework for understanding extreme impacts, Wiley Interdisciplinary Reviews: Climate Change, 5, 113–128, 2014. a
Mahmood, R., Donat, M. G., Bilbao, R., Ortega, P., Lapin, V., Tourigny, E., and Doblas-Reyes, F.: Multi-decadal initialized climate predictions using the EC-Earth3 global climate model, Earth Syst. Dynam., 16, 1923–1934, https://doi.org/10.5194/esd-16-1923-2025, 2025. a
McKee, T. B., Doesken, N. J., and Kleist, J.: The relationship of drought frequency and duration to time scales, in: Proceedings of the 8th Conference on Applied Climatology, Boston, vol. 17, 179–183, https://climate.colostate.edu/pdfs/relationshipofdroughtfrequency.pdf (last access: 4 December 2025), 1993. a
Meehl, G. A., Richter, J. H., Teng, H., Capotondi, A., Cobb, K., Doblas-Reyes, F., Donat, M. G., England, M. H., Fyfe, J. C., Han, W., Kim, H., Kirtman, B. P., Kushnir, Y., Lovenduski, N. S., Mann, M. E., Merryfield, W. E., Nieves, V., Pegion, K., Rosenbloom, N., Sanchez, S. C., Scaife, A. A., Smith, D., Subramanian, A. C., Sun, L., Thompson, D., Ummenhofer, C. C., and Xie, S. P.: Initialized Earth System prediction from subseasonal to decadal timescales, Nature Reviews Earth & Environment, 2, 340–357, 2021. a
Müller, W. A., Jungclaus, J. H., Mauritsen, T., Baehr, J., Bittner, M., Budich, R., Bunzel, F., Esch, M., Ghosh, R., Haak, H., Ilyina, T., Kleine, T., Kornblueh, L., Li, H., Modali, K., Notz, D., Pohlmann, H., Roeckner, E., Stemmler, I., Tian, F., and Marotzke, J.: A higher-resolution version of the max planck institute earth system model (MPI-ESM1. 2-HR), Journal of Advances in Modeling Earth Systems, 10, 1383–1413, 2018. a
Nicolì, D., Bellucci, A., Ruggieri, P., Athanasiadis, P. J., Materia, S., Peano, D., Fedele, G., Hénin, R., and Gualdi, S.: The Euro-Mediterranean Center on Climate Change (CMCC) decadal prediction system, Geosci. Model Dev., 16, 179–197, https://doi.org/10.5194/gmd-16-179-2023, 2023. a
O'Kane, T. J., Scaife, A. A., Kushnir, Y., Brookshaw, A., Buontempo, C., Carlin, D., Connell, R. K., Doblas-Reyes, F., Dunstone, N., Förster, K., Graça, A., Hobday, A. J., Kitsios, V., Van Der Laan, L., Lockwood, J., Merryfield, W. J., Paxian, A., Payne, M. R., Reader, M. C., Saville, G. R., Smith, D., Solaraju-Murali, B., Caltabiano, N., Carman, J., Hawkins, E., Keenlyside, N., Kumar, A., Matei, D., Pohlmann, H., Power, S., Raphael, M., Sparrow, M., and Wu, B.: Recent applications and potential of near-term (interannual to decadal) climate predictions, Frontiers in Climate, 5, 1121626, https://doi.org/10.3389/fclim.2023.1121626, 2023. a
Ridder, N. N., Pitman, A. J., and Ukkola, A. M.: Do CMIP6 climate models simulate global or regional compound events skillfully?, Geophysical Research Letters, 48, e2020GL091152, https://doi.org/10.1029/2020gl091152, 2021. a
Rohde, R. A. and Hausfather, Z.: The Berkeley Earth Land/Ocean Temperature Record, Earth Syst. Sci. Data, 12, 3469–3479, https://doi.org/10.5194/essd-12-3469-2020, 2020. a
Schneider, U., Becker, A., Finger, P., Meyer-Christoffer, A., Rudolf, B., and Ziese, M.: GPCC full data reanalysis version 7.0: Monthly land-surface precipitation from rain gauges built on GTS based and historic data, Geoscience Data Exchange (GDEX) [data set], https://doi.org/10.5065/d6000072, 2016. a
Sellar, A. A., Walton, J., Jones, C. G., Wood, R., Abraham, N. L., Andrejczuk, M., Andrews, M. B., Andrews, T., Archibald, A. T., De Mora, L., Dyson, H., Elkington, M., Ellis, R., Florek, P., Good, P., Gohar, L., Haddad, S., Hardiman, S. C., Hogan, E., Iwi, A., Jones, C. D., Johnson, B., Kelley, D. I., Kettleborough, J., Knight, J. R., Köhler, M. O., Kuhlbrodt, T., Liddicoat, S., Linova-Pavlova, I., Mizielinski, M. S., Morgenstern, O., Mulcahy, J., Neininger, E., O'Connor, F. M., Petrie, R., Ridley, J., Rioual, J., Roberts, M., Robertson, E., Rumbold, S., Seddon, J., Shepherd, H., Shim, S., Stephens, A., Teixiera, J. C., Tang, Y., Williams, J., Wiltshire, A., and Griffiths, P. T.: Implementation of UK Earth system models for CMIP6, Journal of Advances in Modeling Earth Systems, 12, e2019MS001946, https://doi.org/10.1029/2019MS001946, 2020. a
Smith, D. M., Eade, R., Scaife, A. A., Caron, L., Danabasoglu, G., DelSole, T. M., Delworth, T., Doblas-Reyes, F. J., Dunstone, N. J., Hermanson, L., Kharin, V., Kimoto, M., Merryfield, W. J., Mochizuki, T., Müller, W. A., Pohlmann, H., Yeager, S., and Yang, X.: Robust skill of decadal climate predictions, Npj Climate and Atmospheric Science, 2, https://doi.org/10.1038/s41612-019-0071-y, 2019. a, b, c, d
Solaraju-Murali, B., Caron, L.-P., Gonzalez-Reviriego, N., and Doblas-Reyes, F. J.: Multi-year prediction of European summer drought conditions for the agricultural sector, Environmental Research Letters, 14, 124014, https://doi.org/10.1088/1748-9326/ab5043, 2019. a, b, c
Sospedra-Alfonso, R., Merryfield, W. J., Boer, G. J., Kharin, V. V., Lee, W.-S., Seiler, C., and Christian, J. R.: Decadal climate predictions with the Canadian Earth System Model version 5 (CanESM5), Geosci. Model Dev., 14, 6863–6891, https://doi.org/10.5194/gmd-14-6863-2021, 2021. a
Stagge, J. H., Tallaksen, L. M., Gudmundsson, L., Van Loon, A. F., and Stahl, K.: Candidate distributions for climatological drought indices (SPI and SPEI), International Journal of Climatology, 35, 4027–4040, 2015. a
Sutanto, S. J., Vitolo, C., Di Napoli, C., D’Andrea, M., and Van Lanen, H. A.: Heatwaves, droughts, and fires: Exploring compound and cascading dry hazards at the pan-European scale, Environment international, 134, 105276, https://doi.org/10.1016/j.envint.2019.105276, 2020. a
Svoboda, M., Hayes, M., and Wood, D.: Standardized precipitation index: user guide, https://www.droughtmanagement.info/literature/WMO_standardized_precipitation_index_user_guide_en_2012.pdf (last access: 04 December 2025), 2012. a
Taylor, K. E., Stouffer, R. J., and Meehl, G. A.: An overview of CMIP5 and the experiment design, Bulletin of the American meteorological Society, 93, 485–498, 2012. a
Tian, T., Yang, S., Karami, M. P., Massonnet, F., Kruschke, T., and Koenigk, T.: Benefits of sea ice initialization for the interannual-to-decadal climate prediction skill in the Arctic in EC-Earth3, Geosci. Model Dev., 14, 4283–4305, https://doi.org/10.5194/gmd-14-4283-2021, 2021. a
Vicente-Serrano, S. M., Beguería, S., and López-Moreno, J. I.: A multiscalar drought index sensitive to global warming: the standardized precipitation evapotranspiration index, Journal of Climate, 23, 1696–1718, 2010. a
Vogel, M. M., Hauser, M., and Seneviratne, S. I.: Projected changes in hot, dry and wet extreme events’ clusters in CMIP6 multi-model ensemble, Environmental Research Letters, 15, 094021, https://doi.org/10.1088/1748-9326/ab90a7, 2020. a
Wilks, D. S.: Statistical methods in the atmospheric sciences, Academic Press, https://doi.org/10.1016/c2010-0-65519-2, 2011. a
Yukimoto, S., Kawai, H., Koshiro, T., Oshima, N., Yoshida, K., Urakawa, S., Tsujino, H., Deushi, M., Tanaka, T., Hosaka, M., Yabu, S., Yoshimura, H., Shindo, E., Mizuta, R., Obata, A., Adachi, Y., and Ishii, M.: The Meteorological Research Institute Earth System Model version 2.0, MRI-ESM2. 0: Description and basic evaluation of the physical component, Journal of the Meteorological Society of Japan. Ser. II, 97, 931–965, 2019. a
Zhang, X., Alexander, L., Hegerl, G. C., Jones, P., Tank, A. K., Peterson, T. C., Trewin, B., and Zwiers, F. W.: Indices for monitoring changes in extremes based on daily temperature and precipitation data, Wiley Interdisciplinary Reviews: Climate Change, 2, 851–870, 2011. a
Zscheischler, J. and Fischer, E. M.: The record-breaking compound hot and dry 2018 growing season in Germany, Weather and Climate Extremes, 29, 100270, https://doi.org/10.1016/j.wace.2020.100270, 2020. a, b
Zscheischler, J. and Seneviratne, S. I.: Dependence of drivers affects risks associated with compound events, Science Advances, 3, e1700263, https://doi.org/10.1126/sciadv.1700263, 2017. a
Zscheischler, J., Westra, S., Van Den Hurk, B. J. J. M., Seneviratne, S. I., Ward, P. J., Pitman, A., AghaKouchak, A., Bresch, D. N., Leonard, M., Wahl, T., and Zhang, X.: Future climate risk from compound events, Nature Climate Change, 8, 469–477, 2018. a