<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">ESD</journal-id><journal-title-group>
    <journal-title>Earth System Dynamics</journal-title>
    <abbrev-journal-title abbrev-type="publisher">ESD</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Earth Syst. Dynam.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">2190-4987</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/esd-11-807-2020</article-id><title-group><article-title>An investigation of weighting schemes <?xmltex \hack{\break}?> suitable for incorporating large ensembles <?xmltex \hack{\break}?> into multi-model ensembles</article-title><alt-title>An investigation of weighting schemes suitable for incorporating large ensembles</alt-title>
      </title-group><?xmltex \runningtitle{An investigation of weighting schemes suitable for incorporating large ensembles}?><?xmltex \runningauthor{A.~L.~Merrifield et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes">
          <name><surname>Merrifield</surname><given-names>Anna Louise</given-names></name>
          <email>anna.merrifield@env.ethz.ch</email>
        <ext-link>https://orcid.org/0000-0002-1081-5671</ext-link></contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Brunner</surname><given-names>Lukas</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-5760-4524</ext-link></contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Lorenz</surname><given-names>Ruth</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-3986-1268</ext-link></contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Medhaug</surname><given-names>Iselin</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-1115-8896</ext-link></contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Knutti</surname><given-names>Reto</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-8303-6700</ext-link></contrib>
        <aff id="aff1"><institution>Institute for Atmospheric and Climate Science, ETH Zurich, Zurich, Switzerland</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Anna Louise Merrifield (anna.merrifield@env.ethz.ch)</corresp></author-notes><pub-date><day>16</day><month>September</month><year>2020</year></pub-date>
      
      <volume>11</volume>
      <issue>3</issue>
      <fpage>807</fpage><lpage>834</lpage>
      <history>
        <date date-type="received"><day>11</day><month>November</month><year>2019</year></date>
           <date date-type="rev-request"><day>21</day><month>November</month><year>2019</year></date>
           <date date-type="rev-recd"><day>13</day><month>July</month><year>2020</year></date>
           <date date-type="accepted"><day>3</day><month>August</month><year>2020</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2020 Anna Louise Merrifield et al.</copyright-statement>
        <copyright-year>2020</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020.html">This article is available from https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020.html</self-uri><self-uri xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020.pdf">The full text article is available as a PDF file from https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020.pdf</self-uri>
      <abstract><title>Abstract</title>
    <p id="d1e118">Multi-model ensembles can be used to estimate uncertainty in projections of regional climate, but this uncertainty often depends on the constituents of the ensemble. The dependence of uncertainty on ensemble composition is clear when single-model initial condition large ensembles (SMILEs) are included within a multi-model ensemble. SMILEs allow for the quantification of internal variability, a non-negligible component of uncertainty on regional scales, but may also serve to inappropriately narrow uncertainty by giving a single model many additional votes. In advance of the mixed multi-model and SMILE Coupled Model Intercomparison Project Phase 6 (CMIP6) ensemble, we investigate weighting approaches to incorporate 50 members of the Community Earth System Model (CESM1.2.2-LE), 50 members of the Canadian Earth System Model (CanESM2-LE), and 100 members of the MPI Grand Ensemble (MPI-GE) into an 88-member Coupled Model Intercomparison Project Phase 5 (CMIP5) ensemble. The weights assigned are based on ability to reproduce observed climate (performance) and scaled by a measure of redundancy (dependence). Surface air temperature (SAT) and sea level pressure (SLP) predictors are used to determine the weights, and relationships between present and future predictor behavior are discussed. The estimated residual thermodynamic trend is proposed as an alternative predictor to replace 50-year regional SAT trends, which are more susceptible to internal variability.</p>
    <p id="d1e121">Uncertainty in estimates of northern European winter and Mediterranean summer end-of-century warming is assessed in a CMIP5 and a combined SMILE–CMIP5 multi-model ensemble. Five different weighting strategies to account for the mix of initial condition (IC) ensemble members and individually represented models within the multi-model ensemble are considered. Allowing all multi-model ensemble members to receive either equal weight or solely a performance weight (based on the root mean square error (RMSE) between members and observations over nine predictors) is shown to lead to uncertainty estimates that are dominated by the presence of SMILEs. A more suitable approach includes a dependence assumption, scaling either by <inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula>, the number of constituents representing a “model”, or by the same RMSE distance metric used to define model performance. SMILE contributions to the weighted ensemble are smallest (<inline-formula><mml:math id="M2" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> %) when a model is defined as an IC ensemble and increase slightly (<inline-formula><mml:math id="M3" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> %) when the definition of a model expands to include members from the same institution and/or development stream. SMILE contributions increase further when dependence is defined by RMSE (over nine predictors) amongst members because RMSEs between SMILE members can be as large as RMSEs between SMILE members and other models. 
We find that an alternative RMSE distance metric, derived from global SAT and hemispheric SLP climatology, is able to better identify IC members in general and SMILE members in particular as members of the same model. Further, more subtle dependencies associated with resolution differences and component similarities are also identified by the global predictor set.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<?pagebreak page808?><sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e165">Projections of regional climate change are both key to climate adaptation policy and fundamentally uncertain due to the nature of the climate system <xref ref-type="bibr" rid="bib1.bibx15 bib1.bibx39" id="paren.1"/>. In order to represent regional climate uncertainty to policy makers, scientists often turn to multi-model ensembles to provide a range of plausible outcomes a region may experience <xref ref-type="bibr" rid="bib1.bibx74" id="paren.2"/>. Uncertainty in a multi-model ensemble is commonly estimated from the ensemble spread, which can be represented, e.g., as the 5 %–95 % likely range of the distribution, and is usually presented with respect to the arithmetic ensemble mean <xref ref-type="bibr" rid="bib1.bibx13" id="paren.3"><named-content content-type="pre">e.g.,</named-content></xref>. This representation of uncertainty appears unambiguous but is perhaps deceptively so. It is influenced by choices made in multi-model ensemble construction, choices that are often overlooked <xref ref-type="bibr" rid="bib1.bibx35 bib1.bibx36" id="paren.4"/>.</p>
      <p id="d1e182">Multi-model ensembles, such as those constructed from Coupled Model Intercomparison Projects or CMIPs <xref ref-type="bibr" rid="bib1.bibx50" id="paren.5"/>, tend to comprise both different models and multiple members of the same model, all subject to the same radiative forcing pathway intended to reflect a plausible future emissions scenario <xref ref-type="bibr" rid="bib1.bibx77 bib1.bibx57" id="paren.6"/>. This choice allows the multi-model ensemble to represent two types of regional-scale uncertainty: model uncertainty and internal variability <xref ref-type="bibr" rid="bib1.bibx26 bib1.bibx15" id="paren.7"><named-content content-type="pre">e.g.,</named-content></xref>. Model uncertainty accounts for differences in how models simulate climate, from how the equations governing flow in the atmosphere are numerically solved to how sub-grid-scale processes in the climate system are parameterized. Sub-grid-scale processes are often the product of complex interactions and feedbacks between the land surface, ocean, cryosphere, and atmosphere, many of which cannot be directly measured <xref ref-type="bibr" rid="bib1.bibx66 bib1.bibx17" id="paren.8"><named-content content-type="pre">e.g.,</named-content></xref>. How models estimate these interactions can result in various advantages and limitations in regional climate representation and thus affect regional uncertainty estimates.</p>
      <p id="d1e201">By considering differences in regional “performance”, it becomes clear that uncertainty is affected by the assumption that each member of a multi-model ensemble is an equally plausible representation of observed climate. Known biases associated with cloud processes, land–atmosphere interactions, and sea surface temperature <xref ref-type="bibr" rid="bib1.bibx7 bib1.bibx43 bib1.bibx59 bib1.bibx52" id="paren.9"><named-content content-type="pre">e.g.,</named-content></xref> may result in more uncertainty in projections of future climate than is warranted given our understanding of the climate system <xref ref-type="bibr" rid="bib1.bibx78" id="paren.10"/>. Using expert judgment to weight or select multi-model ensemble members based on process- or region-specific metrics of performance has been shown to justifiably constrain uncertainty in other studies <xref ref-type="bibr" rid="bib1.bibx1 bib1.bibx38 bib1.bibx45" id="paren.11"><named-content content-type="pre">e.g.,</named-content></xref>.</p>
      <p id="d1e217"><?xmltex \hack{\newpage}?>The second type of uncertainty, internal variability, reflects the regional influence of the amalgamation of unpredictable fluctuations in the climate system <xref ref-type="bibr" rid="bib1.bibx26 bib1.bibx15 bib1.bibx34" id="paren.12"/>. Internal variability is ostensibly a feature of the climate system and manifests itself in climate variables, such as regional surface air temperature (SAT), through a complex set of controlling influences, chief among them being variability in the attendant atmospheric circulation <xref ref-type="bibr" rid="bib1.bibx80 bib1.bibx81 bib1.bibx10" id="paren.13"/>. The influence of internal atmospheric variability on SAT can be quantified and accounted for in projections of future climate using dynamical adjustment methods <xref ref-type="bibr" rid="bib1.bibx18 bib1.bibx68" id="paren.14"><named-content content-type="pre">e.g.,</named-content></xref>. Additionally, internal variability can be explicitly represented by sets of simulations from the same model, subject to identical forcing, wherein members differ only by initial conditions <xref ref-type="bibr" rid="bib1.bibx32 bib1.bibx46" id="paren.15"><named-content content-type="pre">e.g.,</named-content></xref>. These single-model initial condition large ensembles, or SMILEs, have become an indispensable tool to concisely represent uncertainty within a model, information that should be considered in a multi-model ensemble context <xref ref-type="bibr" rid="bib1.bibx62" id="paren.16"/>.</p>
      <p id="d1e241">The prospect of including SMILE members in a multi-model ensemble directly challenges another assumption that tends to be made when calculating probabilistic estimates from multi-model ensembles: each member is an independent representation of climate. Though all members of a multi-model ensemble describe the same climate system, differences in model structure and internal variability create a distribution of regional climate change estimates. Differences in model structure are often welcome; for many applications, distributions comprised of several models are hypothesized to reflect the range of possible climate outcomes better than distributions from a single model <xref ref-type="bibr" rid="bib1.bibx2" id="paren.17"/>. When different models (that are deemed independent from each other) agree, there is a notion of robustness and increased certainty in the outcome. Ultimately, there is also a notion that as models improve, there will be a “convergence to reality” with models independently simulating the same “right” outcome.</p>
      <p id="d1e247">In reality, however, members of a multi-model ensemble are often dependent entities. Narrowing of uncertainty comes through redundant representation of historical and future climate rather than through the independent simulation of the right outcome <xref ref-type="bibr" rid="bib1.bibx28" id="paren.18"/>. Redundancy within a multi-model ensemble can arise from different models having similar biases with respect to observations. Models have historically shared code, from parameterization schemes to full components, and tend to have the same limitations associated with resolution (e.g., simplified topography) <xref ref-type="bibr" rid="bib1.bibx47 bib1.bibx37 bib1.bibx8" id="paren.19"/>. These commonalities can cause similar climate trajectories amongst models with different names, complicating the notion of convergence to reality through dependence of differently named models. Another clear contributor to redundancy is multiple<?pagebreak page809?> initial condition (IC) ensemble members that project climate trajectories which only differ by internal variability; similar trajectories are likely to exist amongst the 50 to 100 members of a SMILE. It is therefore important when assembling a multi-model ensemble that uncertainty estimates reflect the fact that not every member is an independent entity <xref ref-type="bibr" rid="bib1.bibx58" id="paren.20"/>.</p>
      <p id="d1e259">What constitutes an independent entity within a multi-model ensemble remains a topic of debate <xref ref-type="bibr" rid="bib1.bibx4 bib1.bibx2" id="paren.21"/>. Independence can be decided a priori, i.e., that a model, as defined by its name, is an independent entity. This choice renders IC members dependent. It could also be decided that only models from different institutions of origin are independent entities, as in the “same-center hypothesis” explored by <xref ref-type="bibr" rid="bib1.bibx40" id="text.22"/>. In the absence of knowledge of model origin and development, independent entities could instead be defined using statistical properties of model outputs <xref ref-type="bibr" rid="bib1.bibx47 bib1.bibx6" id="paren.23"/>. In this a posteriori definition, models may have a degree of independence rather than simply an independent or dependent designation <xref ref-type="bibr" rid="bib1.bibx38" id="paren.24"/>.</p>
      <p id="d1e274">Regardless of how dependent and independent entities are defined, it is important that dependence is accounted for and redundancy mitigated in order to avoid an overconfident, inappropriately narrow distribution of future change <xref ref-type="bibr" rid="bib1.bibx40 bib1.bibx2" id="paren.25"/>. Dependent information can be reduced through subsetting, in which information deemed dependent is discarded, or through a weighting scheme, in which information is scaled by its degree of dependence. In this study, we evaluate whether a performance and independence weighting scheme <xref ref-type="bibr" rid="bib1.bibx38 bib1.bibx45 bib1.bibx11" id="paren.26"/> can be used to include three SMILEs in a CMIP5 multi-model ensemble and provide a justifiably constrained estimate of European regional end-of-century warming uncertainty. Northern European winter and Mediterranean summer SAT changes between the 1990–2009 and 2080–2099 mean states are considered. We discuss details of the weighting method, including emergent predictor relationships and optimal parameter choices for comprehensively characterizing member performance while separating independent information from information known to have a common origin (SMILE members). We highlight a new metric, the estimated residual thermodynamic trend, which can be used as an alternative to trend-based metrics that do not optimally reflect a model's performance on regional scales. We compare how five different weighting strategies, based on different dependence assumptions, constrain uncertainty in a CMIP5 multi-model ensemble with and without the SMILEs included. Weighted SMILE contributions in each CMIP5–SMILE “ALL” ensemble are explicitly computed.
The five weighting strategies come from the continuum of assumptions that can arise in multi-model ensemble construction: (1) all members are independent and equally plausible (equal weighting), (2) some members are more realistic than others (performance weighting), (3, 4) members from the same model are dependent (<inline-formula><mml:math id="M4" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> scaling, <inline-formula><mml:math id="M5" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> being the number of IC members or modeling center contributions), and (5) all members are dependent to some degree (RMSE distance metric scaling). For the last approach, we demonstrate that an RMSE independence scaling that groups SMILE members and distinguishes them from other models can be obtained using large-scale, long-term SAT and sea level pressure (SLP) climatology fields. The SMILEs, CMIP5, and observational datasets used in the weightings are described in Sect. <xref ref-type="sec" rid="Ch1.S2"/>, while the weighting schemes are detailed in Sect. <xref ref-type="sec" rid="Ch1.S3"/>. The influence of SMILE inclusion on the weighting under different dependence assumptions and the predictor set that identifies SMILE members as dependent entities based on RMSE distance are discussed in Sect. <xref ref-type="sec" rid="Ch1.S4"/>. To close, conclusions and a discussion are presented in Sect. <xref ref-type="sec" rid="Ch1.S5"/>.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Data</title>
      <p id="d1e319">The multi-model ensemble used in this study is comprised of members from the CMIP5 archive and three SMILEs: a 50-member ensemble generated using the Community Earth System Model version 1.2.2 (CESM1.2.2-LE), the 50-member Canadian Earth System Model version 2 large ensemble (CanESM2-LE), and the 100-member Max Planck Institute for Meteorology Grand Ensemble (MPI-GE). This combined CMIP5–SMILE ensemble is summarized in Table <xref ref-type="table" rid="Ch1.T1"/>, which lists the name of each model and the members used. A similar CMIP5 multi-model ensemble was used in <xref ref-type="bibr" rid="bib1.bibx45" id="text.27"/> and <xref ref-type="bibr" rid="bib1.bibx11" id="text.28"/> and features 88 members from 40 (named) model setups, including 13 initial condition ensembles ranging from 2 to 10 members. Additionally, for the GISS-E2-H and GISS-E2-R experiments, NASA GISS provides members from three physics-version (“p”) setups that differ in atmospheric composition (AC) and aerosol indirect effects (AIEs) <xref ref-type="bibr" rid="bib1.bibx55" id="paren.29"/>. We treat the three setups as follows: p1 (prescribed AC and AIE) and p3 (prognostic AC and partial AIE) members are treated as two-member IC ensembles, and the p2 member (prognostic AC and AIE) is treated as a single-member representation (Table <xref ref-type="table" rid="Ch1.T1"/>). In Table <xref ref-type="table" rid="Ch1.T1"/>, IC ensembles are indicated in italics and SMILEs are indicated in bold with a star. Horizontal lines denote modeling centers and/or known development streams that are grouped as dependent entities under the fourth independence assumption we investigated.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e341">Summary of the CMIP5 <inline-formula><mml:math id="M6" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> SMILE multi-model ensemble used in this study. IC ensembles within CMIP5 are indicated in italics. SMILEs are indicated in bold with a star. Modeling center and/or development stream groupings are separated by horizontal lines.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:colspec colnum="6" colname="col6" align="left"/>
     <oasis:colspec colnum="7" colname="col7" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Group</oasis:entry>
         <oasis:entry colname="col2">Model</oasis:entry>
         <oasis:entry colname="col3">Members used</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">Group</oasis:entry>
         <oasis:entry colname="col6">Model</oasis:entry>
         <oasis:entry colname="col7">Members used</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">ACCESS</oasis:entry>
         <oasis:entry colname="col2">ACCESS1-0</oasis:entry>
         <oasis:entry colname="col3">r1i1p1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">NASA GISS</oasis:entry>
         <oasis:entry colname="col6">GISS-E2-R-CC</oasis:entry>
         <oasis:entry colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">ACCESS1-3</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">r1i1p1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">(cont.)</oasis:entry>
         <oasis:entry colname="col6"><italic>GISS-E2-R</italic></oasis:entry>
         <oasis:entry colname="col7"><italic>r(1-2)i1p1</italic></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">BNU-ESM</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">r1i1p1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">GISS-E2-R</oasis:entry>
         <oasis:entry colname="col7">r1i1p2</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">NCAR</oasis:entry>
         <oasis:entry colname="col2"><italic>CCSM4</italic></oasis:entry>
         <oasis:entry colname="col3"><italic>r(1-6)i1p1</italic></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry rowsep="1" colname="col5"/>
         <oasis:entry rowsep="1" colname="col6"><italic>GISS-E2-R</italic></oasis:entry>
         <oasis:entry rowsep="1" colname="col7"><italic>r(1-2)i1p3</italic></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">CESM1-BGC</oasis:entry>
         <oasis:entry colname="col3">r1i1p1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">MOHC</oasis:entry>
         <oasis:entry colname="col6">HadGEM2-AO</oasis:entry>
         <oasis:entry colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><italic>CESM1-CAM5</italic></oasis:entry>
         <oasis:entry colname="col3"><italic>r(1-3)i1p1</italic></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">HadGEM2-CC</oasis:entry>
         <oasis:entry colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2"><bold>CESM1.2.2-LE</bold><inline-formula><mml:math id="M7" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry rowsep="1" colname="col3"><bold>r(0-49)i1p1</bold></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry rowsep="1" colname="col5"/>
         <oasis:entry rowsep="1" colname="col6"><italic>HadGEM2-ES</italic></oasis:entry>
         <oasis:entry rowsep="1" colname="col7"><italic>r(1-4)i1p1</italic></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">CMCC</oasis:entry>
         <oasis:entry colname="col2">CMCC-CESM</oasis:entry>
         <oasis:entry colname="col3">r1i1p1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">IPSL</oasis:entry>
         <oasis:entry colname="col6"><italic>IPSL-CM5A-LR</italic></oasis:entry>
         <oasis:entry colname="col7"><italic>r(1-3)i1p1</italic></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">CMCC-CMS</oasis:entry>
         <oasis:entry colname="col3">r1i1p1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">IPSL-CM5A-MR</oasis:entry>
         <oasis:entry colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">CMCC-CM</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">r1i1p1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry rowsep="1" colname="col5"/>
         <oasis:entry rowsep="1" colname="col6">IPSL-CM5B-LR</oasis:entry>
         <oasis:entry rowsep="1" colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2"><italic>CNRM-CM5</italic></oasis:entry>
         <oasis:entry rowsep="1" colname="col3"><italic>r(1,2,4,6,10)i1p1</italic></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">MIROC</oasis:entry>
         <oasis:entry colname="col6">MIROC-ESM</oasis:entry>
         <oasis:entry colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2"><italic>CSIRO-Mk3-6-0</italic></oasis:entry>
         <oasis:entry rowsep="1" colname="col3"><italic>r(1-10)i1p1</italic></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">MIROC-ESM-CHEM</oasis:entry>
         <oasis:entry colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">CCCma</oasis:entry>
         <oasis:entry colname="col2"><bold>CanESM2-LE</bold><inline-formula><mml:math id="M8" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><bold>r(1-50)i1p1</bold></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry rowsep="1" colname="col5"/>
         <oasis:entry rowsep="1" colname="col6"><italic>MIROC5</italic></oasis:entry>
         <oasis:entry rowsep="1" colname="col7"><italic>r(1-3)i1p1</italic></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2"><italic>CanESM2</italic></oasis:entry>
         <oasis:entry rowsep="1" colname="col3"><italic>r(1-5)i1p1</italic></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">MPI-M</oasis:entry>
         <oasis:entry colname="col6"><italic>MPI-ESM-LR</italic></oasis:entry>
         <oasis:entry colname="col7"><italic>r(1-3)i1p1</italic></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2"><italic>EC-EARTH</italic></oasis:entry>
         <oasis:entry rowsep="1" colname="col3"><italic>r(1,2,8,9,12)i1p1</italic></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">MPI-ESM-MR</oasis:entry>
         <oasis:entry colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">FGOALS-g2</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">r1i1p1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry rowsep="1" colname="col5"/>
         <oasis:entry rowsep="1" colname="col6"><bold>MPI-GE</bold><inline-formula><mml:math id="M9" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry rowsep="1" colname="col7"><bold>r(1-100)i1p3</bold><inline-formula><mml:math id="M10" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2"><italic>FIO-ESM</italic></oasis:entry>
         <oasis:entry rowsep="1" colname="col3"><italic>r(1-3)i1p1</italic></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">MRI</oasis:entry>
         <oasis:entry colname="col6">MRI-CGCM3</oasis:entry>
         <oasis:entry colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">NOAA GFDL</oasis:entry>
         <oasis:entry colname="col2">GFDL-CM3</oasis:entry>
         <oasis:entry colname="col3">r1i1p1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry rowsep="1" colname="col5"/>
         <oasis:entry rowsep="1" colname="col6">MRI-ESM1</oasis:entry>
         <oasis:entry rowsep="1" colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">GFDL-ESM2G</oasis:entry>
         <oasis:entry colname="col3">r1i1p1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">NCC</oasis:entry>
         <oasis:entry colname="col6">NorESM1-M</oasis:entry>
         <oasis:entry colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">GFDL-ESM2M</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">r1i1p1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry rowsep="1" colname="col5"/>
         <oasis:entry rowsep="1" colname="col6">NorESM1-ME</oasis:entry>
         <oasis:entry rowsep="1" colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">NASA GISS</oasis:entry>
         <oasis:entry colname="col2">GISS-E2-H-CC</oasis:entry>
         <oasis:entry colname="col3">r1i1p1</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">BCC</oasis:entry>
         <oasis:entry colname="col6">bcc-csm1-1-m</oasis:entry>
         <oasis:entry colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><italic>GISS-E2-H</italic></oasis:entry>
         <oasis:entry colname="col3"><italic>r(1-2)i1p1</italic></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry rowsep="1" colname="col5"/>
         <oasis:entry rowsep="1" colname="col6">bcc-csm1-1</oasis:entry>
         <oasis:entry rowsep="1" colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">GISS-E2-H</oasis:entry>
         <oasis:entry colname="col3">r1i1p2</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry rowsep="1" colname="col5"/>
         <oasis:entry rowsep="1" colname="col6">inmcm4</oasis:entry>
         <oasis:entry rowsep="1" colname="col7">r1i1p1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><italic>GISS-E2-H</italic></oasis:entry>
         <oasis:entry colname="col3"><italic>r(1-2)i1p3</italic></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"><bold>Total</bold></oasis:entry>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"><bold>288 members</bold></oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d1e1018">The CESM1.2.2-LE used in this study was derived from a 4700-year CESM control simulation with constant preindustrial forcing generated at ETH Zürich <xref ref-type="bibr" rid="bib1.bibx68" id="paren.30"/>. CESM1.2.2 uses the Community Atmosphere Model version 5.3 (CAM5.3) and has a horizontal atmospheric resolution of 1.9<inline-formula><mml:math id="M11" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> <inline-formula><mml:math id="M12" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 2.5<inline-formula><mml:math id="M13" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> with 30 vertical levels <xref ref-type="bibr" rid="bib1.bibx30" id="paren.31"/>. The preindustrial control run was branched at<?pagebreak page810?> 20-year intervals, starting from the year 580, to create an ensemble with “macro”-initial conditions, i.e., different coupled initial conditions picked from well-separated start dates <xref ref-type="bibr" rid="bib1.bibx70 bib1.bibx27" id="paren.32"/>. Members of the macro-initial-condition ensemble were run from 1850 to 1940 driven by historical CMIP5 forcing <xref ref-type="bibr" rid="bib1.bibx51" id="paren.33"/>. At year 1940, each macro-initial-condition member was branched into four different realizations, each subject to an atmospheric temperature perturbation of 10<inline-formula><mml:math id="M14" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">13</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> to create “micro”-initial-condition ensembles <xref ref-type="bibr" rid="bib1.bibx27" id="paren.34"/>. From these micro-initial-condition ensembles, 50 members were selected for the CESM1.2.2-LE (specifically, four micro-ensemble members from macro-ensemble members 1 through 12 and two micro-ensemble members from macro-ensemble member 13).</p>
      <p id="d1e1075">The MPI-GE was generated using the low-resolution setup of the MPI Earth System Model (MPI-ESM1.1) <xref ref-type="bibr" rid="bib1.bibx23" id="paren.35"/>. The 100-member ensemble has macro-initial conditions: a preindustrial control simulation was branched on 1 January for selected years between 1874 and 3524 to sample different states of a stationary and volcano-free 1850 climate <xref ref-type="bibr" rid="bib1.bibx46" id="paren.36"/>. The MPI-GE uses ECHAM6.3 run in a T63L47 configuration <xref ref-type="bibr" rid="bib1.bibx71" id="paren.37"/> as its atmospheric component for a horizontal resolution of approximately 1.8<inline-formula><mml:math id="M15" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>.</p>
      <?pagebreak page811?><p id="d1e1096">The CanESM2-LE <xref ref-type="bibr" rid="bib1.bibx5" id="paren.38"/> was initiated from the five CanESM2 members that contributed to CMIP5 (which are included in our CMIP5 basis multi-model ensemble). As with CESM1.2.2, the CanESM2 large ensemble has a combination of macro- and micro-initial conditions. Macro-initial conditions were taken from the year 1950 of the five original CanESM2 members. Each was then branched 10 times with micro-initial conditions (a random permutation to the seed used in the random number generator for cloud physics) to give a total of 50 members <xref ref-type="bibr" rid="bib1.bibx73" id="paren.39"/>. The CanESM2-LE uses the CanAM4 atmosphere model run at a T63 spectral resolution.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><label>Figure 1</label><caption><p id="d1e1107">Observational estimates (OBS; gray), the CMIP5 ensemble (blue), and the three SMILEs CESM1.2.2-LE (red), CanESM2-LE (yellow), and MPI-GE (green) evaluated in this study, shown in terms of area-averaged and seasonally averaged absolute surface air temperature time series (SAT; <inline-formula><mml:math id="M16" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C). The two OBS datasets, ERA-20C Temperature and the Berkeley Earth Surface Temperature (BEST) product, are shown in solid gray and dashed gray, respectively. Their average, used to determine member performance, is shown in solid black. For the CMIP5 ensemble and the three SMILEs, the ensemble means across members are shown in solid color; the shading indicates the 5th–95th percentile range of each distribution as a measure of ensemble spread. Note that the CMIP5 ensemble is a multi-model, multi-initial-condition member ensemble of 88 members from 40 (named) model setups, not the “one model, one vote” ensemble often used in multi-model ensemble studies. Panel <bold>(a)</bold> shows RCP8.5 projections for northern European winter (DJF NEU), and panel <bold>(b)</bold> shows RCP8.5 projections for Mediterranean summer (JJA MED) SAT. The number of members in each ensemble is indicated in parentheses in the legend.</p></caption>
        <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020-f01.png"/>

      </fig>

      <p id="d1e1131">The CMIP5 ensemble and three SMILEs are shown in terms of their respective ensemble means and spreads (represented by the 5th–95th percentile of each distribution) in Fig. <xref ref-type="fig" rid="Ch1.F1"/> for the two regions and seasons of interest: northern European (NEU) winter (December–January–February; DJF) SAT (Fig. <xref ref-type="fig" rid="Ch1.F1"/>a) and Mediterranean (MED) summer (June–July–August; JJA) SAT (Fig. <xref ref-type="fig" rid="Ch1.F1"/>b). The NEU and MED regions used are the SREX regions defined in <xref ref-type="bibr" rid="bib1.bibx65" id="text.40"/>. All models are forced with historical CMIP5 forcing from 1950 to 2005 followed by Representative Concentration Pathway 8.5 (RCP8.5) forcing for 2006–2099 <xref ref-type="bibr" rid="bib1.bibx51" id="paren.41"/>. The multi-model CMIP5 ensemble (Fig. <xref ref-type="fig" rid="Ch1.F1"/> blue) has a larger spread than the SMILEs, demonstrating that model uncertainty does rise above well-defined estimates of internal variability in the two European regions and seasons considered. The combined macro–micro perturbation CESM1.2.2-LE (Fig. <xref ref-type="fig" rid="Ch1.F1"/> red) has a larger ensemble spread than the CanESM2-LE (Fig. <xref ref-type="fig" rid="Ch1.F1"/> yellow) but, on average, warms less by the end of the century. The MPI-GE (Fig. <xref ref-type="fig" rid="Ch1.F1"/> green) has approximately the same amount of JJA MED warming as the CMIP5 ensemble average.</p>
      <p id="d1e1155">In addition to the multi-model ensemble, several observational estimates are used to assess model performance. Two global atmospheric reanalysis products, ERA-20C and NOAA-CIRES-DOE 20th Century Reanalysis V3 (NOAA-20C), represent observed SLP, while ERA-20C and a merged temperature dataset, Berkeley Earth Surface Temperature (BEST), represent observed SAT. ERA-20C was created by the European Centre for Medium-Range Weather Forecasts (ECMWF) and assimilates surface pressure and marine wind observations over the 20th century (1900–2010) into the IFS version Cy38r1 model <xref ref-type="bibr" rid="bib1.bibx60" id="paren.42"/>. NOAA-20C, a joint effort of the National Oceanic and Atmospheric Administration (NOAA), the Cooperative Institute for Research in Environmental Sciences (CIRES), and the US Department of Energy (DOE), assimilates surface pressure observations into the NCEP GFS v14.0.1 model to provide output from 1836 to 2015 <xref ref-type="bibr" rid="bib1.bibx14 bib1.bibx69" id="paren.43"/>. BEST was created to be an independent estimate of global temperature obtained through the spatiotemporal interpolation of in situ temperature measurements <xref ref-type="bibr" rid="bib1.bibx61" id="paren.44"/>.</p>
      <p id="d1e1168">The <xref ref-type="bibr" rid="bib1.bibx38" id="text.45"/> weighting scheme can comprehensively account for observational uncertainty <xref ref-type="bibr" rid="bib1.bibx11" id="paren.46"/>, but for this study, we chose to use the average of two observational estimates in order to have a simple and straightforward definition of climate within which the sensitivity of the weighting scheme can be interrogated. ERA-20C and NOAA-20C reanalyses were chosen because they provide temporally and spatially complete fields that extend back to 1950. Additionally, as reanalysis products are, after all, model-based, we chose a reanalysis product with both SLP and SAT available (ERA-20C), as well as SAT and SLP fields from different sources (NOAA-20C and BEST). We further used the SLP–SAT relationship to obtain the circulation-induced component of SAT, which is removed to obtain the estimated residual thermodynamic SAT trends (see Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>). Though all products are observational estimates, we henceforth refer to them as “observations” or “OBS” to distinguish them from members of the multi-model ensemble.</p>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Weighting schemes</title>
      <p id="d1e1187">The weighting strategies used to constrain uncertainty in this study are rooted in a combined performance and independence weighting metric developed by <xref ref-type="bibr" rid="bib1.bibx38" id="text.47"/>, following on the work of <xref ref-type="bibr" rid="bib1.bibx63 bib1.bibx64" id="text.48"/>. Summarized in the subsections below, the five strategies considered arise from common assumptions surrounding plausibility and similarity made about constituents of multi-model ensembles. With the exception of the first strategy, which assigns each member an equal weight, the basic principle of the weighting is as follows: a member will receive a performance weight based on how closely it resembles observed climate (based on nine chosen predictors; detailed in the following section). That performance weight will then be scaled by a measure of dependence that represents whether (or to what degree) a member is identified as a “duplicate” of another member over the historical period. It is important to note that dependence in this study is never determined by future behavior. Doing so would jeopardize the “agreement suggests robustness” paradigm by penalizing convergence. Rather, dependence is either a model property decided upon beforehand or determined through RMSE distances between historical aspects of climate.</p>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Equal weighting</title>
      <p id="d1e1203">In the first strategy, every member of the multi-model ensemble receives the same weight, <inline-formula><mml:math id="M17" display="inline"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, of 1.
            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M18" display="block"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></disp-formula>
          This equal weighting follows from the assumption that all multi-model ensemble members are independent and equally plausible and is sometimes referred to as a “model democracy” assumption <xref ref-type="bibr" rid="bib1.bibx35 bib1.bibx33" id="paren.49"/>. When SMILEs are incorporated into a multi-model ensemble, the equal weighting strategy is clearly flawed: 50–100 members from the same model hold a clear voting advantage within the model democracy. However, equal weighting serves as a baseline treatment of multi-model ensemble information against which the other weighting strategies can be compared.</p><?xmltex \hack{\newpage}?>
</sec>
<?pagebreak page812?><sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Performance weighting</title>
      <p id="d1e1248">The second weighting strategy builds upon the first in that all members are still assumed to be independent, but some members are identified as more realistic than others. Members are thus weighted (<inline-formula><mml:math id="M19" display="inline"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">II</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>) by a measure of performance, here based on the numerator of the <xref ref-type="bibr" rid="bib1.bibx38" id="text.50"/> weighting function.
            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M20" display="block"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">II</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mstyle scriptlevel="+1"><mml:mfrac><mml:mrow><mml:msubsup><mml:mi>D</mml:mi><mml:mi>i</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:msup></mml:mrow></mml:math></disp-formula>
          The term <inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> represents the RMSE distance between a multi-model ensemble member and observations; <inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">II</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> decreases exponentially as members increasingly differ from observations (<inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>≫</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>). A shape parameter <inline-formula><mml:math id="M24" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> dictates the width of the performance weight Gaussian, determining how far apart a member and observations must be to be down-weighted. For a smaller value of <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, models are more rapidly down-weighted as they diverge from observed climate, which often results in a weighting whereby few models receive weights of meaningful magnitude. For a larger value of <inline-formula><mml:math id="M26" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, models are not as strongly penalized for not resembling observations, which often results in a more even distribution of weights within the ensemble. 
Here, we select <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> to be 0.32 for the DJF NEU weighting and 0.4 for the JJA MED weighting (further discussion in Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/>).</p>
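      <p>As an illustration of Eq. (<xref ref-type="disp-formula" rid="Ch1.E2"/>), the following Python sketch (not the code used in this study) evaluates the performance weight for a few hypothetical RMSE distances. The function name <monospace>performance_weights</monospace>, the distance values, and the wider comparison value of 0.64 are assumptions made for the example; only the value 0.32 is taken from the text.</p>
      <preformat preformat-type="code" xml:space="preserve"><![CDATA[
```python
import numpy as np


def performance_weights(distances, sigma_d):
    """Unnormalized performance weights w_i^II = exp(-D_i^2 / sigma_D^2)."""
    d = np.asarray(distances, dtype=float)
    return np.exp(-(d ** 2) / sigma_d ** 2)


# Hypothetical RMSE distances for four members (illustrative values only).
D = [0.1, 0.3, 0.5, 0.9]

narrow = performance_weights(D, sigma_d=0.32)  # value quoted for DJF NEU
wide = performance_weights(D, sigma_d=0.64)    # arbitrary wider Gaussian

# After normalization, a smaller sigma_D concentrates weight on the
# members closest to observations; a larger one spreads it more evenly.
print(np.round(narrow / narrow.sum(), 3))
print(np.round(wide / wide.sum(), 3))
```
]]></preformat>
      <p>In this sketch, the member closest to observations holds a larger share of the normalized weight under the narrower Gaussian than under the wider one, which is the behavior described above.</p>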
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><?xmltex \opttitle{$1/N$~scaling, IC~members}?><title><inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> scaling, IC members</title>
      <p id="d1e1412">The third weighting strategy extends the performance weighting by including a dependence assumption, making it suitable for the combined CMIP5–SMILE ensemble we evaluate. Each model gets a unique weight. The independent entity, a model, is assumed to be determinable by name (as listed in Table <xref ref-type="table" rid="Ch1.T1"/>), which renders members of IC ensembles within the multi-model ensemble (the 13 within the CMIP5 ensemble and the three SMILEs) dependent entities. To achieve the model weighting, models that are represented by one member receive their performance weight <inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">II</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>. Models that are represented by IC members receive an average of the performance weights of their <inline-formula><mml:math id="M30" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> constituents
(<inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>N</mml:mi></mml:mfrac></mml:mstyle><mml:msubsup><mml:mi mathvariant="normal">Σ</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mi>N</mml:mi></mml:msubsup><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mi>D</mml:mi><mml:mi>j</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:mfrac></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>). That average performance weight, divided by <inline-formula><mml:math id="M32" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>, is assigned to each IC member. Therefore, the weight each member receives, <inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">III</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, is
            <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M34" display="block"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">III</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mfenced close="]" open="["><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:msubsup><mml:mi mathvariant="normal">Σ</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msubsup><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mi>D</mml:mi><mml:mi>j</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:mfrac></mml:mrow></mml:msup></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          Each IC member is assigned the average performance weight of the IC ensemble (rather than its individually computed performance weight) to reflect the assumption that all IC members represent an equally likely outcome of the model. This choice rectifies the fact that when computed by RMSE, performance weights differ between IC members due to internal variability.</p>
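      <p>A minimal Python sketch of Eq. (<xref ref-type="disp-formula" rid="Ch1.E3"/>) is given below for illustration; it is not the implementation used in this study. The function name <monospace>one_over_n_weights</monospace>, the model labels, the distances, and the shape parameter of 0.4 used in the example call are hypothetical.</p>
      <preformat preformat-type="code" xml:space="preserve"><![CDATA[
```python
import numpy as np


def one_over_n_weights(distances, model_labels, sigma_d):
    """Unnormalized w_i^III: model-mean performance weight divided by N_i."""
    d = np.asarray(distances, dtype=float)
    labels = np.asarray(model_labels)
    perf = np.exp(-(d ** 2) / sigma_d ** 2)  # member-level performance weights
    weights = np.empty_like(perf)
    for name in np.unique(labels):
        in_model = labels == name
        n = in_model.sum()                   # N_i: members sharing this name
        weights[in_model] = perf[in_model].mean() / n
    return weights


# Hypothetical case: a single-member model "A" and a three-member
# initial-condition ensemble "B".
labels = ["A", "B", "B", "B"]
D = [0.2, 0.1, 0.3, 0.5]
w = one_over_n_weights(D, labels, sigma_d=0.4)
print(w)
```
]]></preformat>
      <p>Summing the weights over the members of model B recovers the average performance weight of that model, so in this sketch an IC ensemble of any size contributes as a single model.</p>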
</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><?xmltex \opttitle{$1/N$~scaling, modeling center}?><title><inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> scaling, modeling center</title>
      <p id="d1e1593">The fourth weighting strategy is identical to the third but has a different definition of a model. The independent<?pagebreak page813?> entity is determined not by name, but by a conjecture about model origin. Similar to the same-center hypothesis <xref ref-type="bibr" rid="bib1.bibx40" id="paren.51"/>, we group all members provided by a modeling center and/or in a known development stream (e.g., the CESM1.2.2-LE is grouped with the NCAR models, though it was run at ETH Zürich) as dependent entities. The weight of each model, <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">IV</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, is computed as in the IC case, with averages taken over <inline-formula><mml:math id="M37" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>, the number of members that constitute a model.
            <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M38" display="block"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">IV</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mfenced open="[" close="]"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:msubsup><mml:mi mathvariant="normal">Σ</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msubsup><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mi>D</mml:mi><mml:mi>j</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:mfrac></mml:mrow></mml:msup></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula></p>
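      <p>The only quantity that changes between the third and fourth strategies is the group size <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. The short Python sketch below, given for illustration only, counts group sizes under both definitions for the three NOAA GFDL entries listed in Table <xref ref-type="table" rid="Ch1.T1"/>; the variable names are hypothetical.</p>
      <preformat preformat-type="code" xml:space="preserve"><![CDATA[
```python
from collections import Counter

# (modeling center, model name) pairs for the three NOAA GFDL entries,
# each of which contributes a single member (r1i1p1).
members = [
    ("NOAA GFDL", "GFDL-CM3"),
    ("NOAA GFDL", "GFDL-ESM2G"),
    ("NOAA GFDL", "GFDL-ESM2M"),
]

n_by_name = Counter(name for _, name in members)
n_by_center = Counter(center for center, _ in members)

# N_i seen by each member under the two definitions of a model.
group_size_name = [n_by_name[name] for _, name in members]
group_size_center = [n_by_center[center] for center, _ in members]

print(group_size_name)    # [1, 1, 1]
print(group_size_center)  # [3, 3, 3]
```
]]></preformat>
      <p>Under the name-based definition each of these members keeps its own performance weight, whereas under the center-based definition the three members share one averaged weight divided by 3.</p>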
</sec>
<sec id="Ch1.S3.SS5">
  <label>3.5</label><title>RMSE distance scaling</title>
      <p id="d1e1696">Finally, the fifth weighting strategy operates under the assumption that dependence cannot necessarily be determined by model name, but shared biases in simulating historical climate can give an idea of dependence that comes from differently named models sharing ideas and code. Instead of relying on knowledge of model origin, the RMSE weighting (<inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">V</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>) initially proposed by <xref ref-type="bibr" rid="bib1.bibx38" id="text.52"/> relies solely on model output to determine a model's overall weight. It features an independence scaling based on RMSE distance metrics in addition to the RMSE-derived performance weights. For results to be compatible with past assessments of this weighting scheme <xref ref-type="bibr" rid="bib1.bibx45 bib1.bibx11" id="paren.53"><named-content content-type="pre">e.g.,</named-content></xref>, we assign each member its own performance weight (as computed in <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">II</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>) even if it is an IC ensemble member. This puts the RMSE weighting in contrast to the <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> scaling approaches, which ensure that IC ensemble members have identical weights.
            <disp-formula id="Ch1.E5" content-type="numbered"><label>5</label><mml:math id="M42" display="block"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">V</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mi>D</mml:mi><mml:mi>i</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:mfrac></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="normal">Σ</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>≠</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mi>M</mml:mi></mml:msubsup><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:mfrac></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula>
          <inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> represents the distance between multi-model ensemble member <inline-formula><mml:math id="M44" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> and multi-model ensemble member <inline-formula><mml:math id="M45" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>. Unlike in the <inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> strategies, the RMSE independence scaling is based solely on <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, how far a member is from all the other members in the ensemble, and not on any prior knowledge of the multi-model ensemble member's origin. As with the performance weight, a shape parameter <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> dictates the width of the Gaussian that is applied to the member pairs. <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> represents how close a member must be to another member before they are considered dependent entities. 
For a member with no close neighbors (<inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>≫</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>), the independence scaling tends to 1, preserving the member's overall weight. For a member with many close neighbors (<inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>≪</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>), the independence scaling is greater than 1 and reduces its overall weight. For the CMIP5–SMILE ensemble, the goal is to select a <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> that is large enough such that members of a SMILE are considered dependent entities but not so large that the majority of multi-model ensemble members are considered dependent as well. Here, we select DJF NEU <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> to be 0.25 and JJA MED <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> to be 0.26. Sensitivity to the choice of <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and further details on selection strategies are discussed in Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/>. 
Upon computation of the weights in each strategy, each weight is normalized by <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> such that they sum to 1.</p>
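      <p>The combination of performance weight, independence scaling, and normalization described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation used in this study: the Gaussian form exp(−(x/σ)²), the helper name <monospace>combined_weights</monospace>, the toy distances, and the performance shape parameter <monospace>sigma_d</monospace> = 0.5 are hypothetical; only <monospace>sigma_s</monospace> = 0.25 follows the DJF NEU choice stated in the text.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math


def combined_weights(D, S, sigma_d, sigma_s):
    """Hypothetical helper: performance weight divided by independence scaling.

    D[i]    -- distance of member i from observations
    S[i][j] -- distance between member i and member j
    """
    n = len(D)
    raw = []
    for i in range(n):
        # Assumed Gaussian performance weight.
        performance = math.exp(-(D[i] / sigma_d) ** 2)
        # Independence scaling: 1 plus a Gaussian similarity to every other member.
        scaling = 1.0 + sum(
            math.exp(-(S[i][j] / sigma_s) ** 2) for j in range(n) if j != i
        )
        raw.append(performance / scaling)
    total = sum(raw)
    # Normalize so that the weights sum to 1.
    return [w / total for w in raw]


# Toy example (invented values): members 0 and 1 are near-duplicates,
# member 2 has no close neighbors.
D = [0.5, 0.5, 0.5]
S = [
    [0.0, 0.01, 2.0],
    [0.01, 0.0, 2.0],
    [2.0, 2.0, 0.0],
]
weights = combined_weights(D, S, sigma_d=0.5, sigma_s=0.25)
```
]]></preformat>
      <p>In this toy case the isolated member receives roughly twice the weight of either near-duplicate, because each duplicate has an independence scaling close to 2 while the isolated member's scaling stays close to 1.</p>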
</sec>
<sec id="Ch1.S3.SS6">
  <label>3.6</label><title>Defining “climate”: predictor selection</title>
      <p id="d1e2020">The performance weight used in weighting strategies two through five and the independence scaling used in strategy five are based on a chosen definition of climate. A model's performance is based on its ability to reproduce observed climate. Under assumption five, a member's independence is based on how much its climate differs from the climate in other members. When defining climate, the aim is to optimize the “fitness for purpose”, which should include choosing predictors that are physically associated with the target and will indicate if a model is biased in a way that renders it unsuitable for realistic simulation of the target. For example, in <xref ref-type="bibr" rid="bib1.bibx38" id="text.54"/>, aspects of climate relevant for September sea ice extent, such as the climatological mean and trend in hemispheric mean September Arctic sea ice extent, were chosen. These chosen predictors reflected the fact that models with almost no sea ice in the present day or significantly more sea ice in the future than presently observed were less suitable for the task of projecting changes in sea ice extent. It is also good practice to avoid using a single predictor to define climate to avoid an overconfident uncertainty estimate. No one model property can comprehensively reflect if the model is “good” for a particular purpose, and it is dangerous to constrain uncertainty by dismissing models that do not match observations by a particular statistical definition for those that happen to be tuned to match that definition. <xref ref-type="bibr" rid="bib1.bibx45" id="text.55"/> discuss a more holistic strategy for choosing predictors and ultimately selected from a set of 24 predictors deemed relevant for projecting North American maximum temperature based on known physical relationships, predictor–target correlations, and variance inflation considerations.</p>
      <p id="d1e2029">Here fitness for purpose is a relatively simple and straightforward definition of climate within which the sensitivity of the weighting scheme can be interrogated. We base the performance weighting and the RMSE independence scaling on nine predictors: the climatology and interannual variability (represented by standard deviation) of SAT and SLP during the periods of 1950–1969 and 1990–2009 and a 50-year derived SAT trend (estimated residual thermodynamic trend; described in more detail in subsequent paragraphs) for the period of 1960–2009. We chose predictors to be aspects of regional temperature and pressure in a domain that encompasses modes of atmospheric circulation variability relevant<?pagebreak page814?> to European climate because they are (1) physically associated with the target (end-of-century warming) and (2) fields that may reflect model biases that would affect realistic simulation of future climate. For example, a model with a warmer-than-observed mean state in the Mediterranean may experience an enhanced land–atmosphere feedback mechanism that amplifies drying and warming of the region <xref ref-type="bibr" rid="bib1.bibx12 bib1.bibx56 bib1.bibx78" id="paren.56"><named-content content-type="pre">e.g.,</named-content></xref>. SAT and SLP are found to be highly relevant predictors by earlier studies <xref ref-type="bibr" rid="bib1.bibx11" id="paren.57"/> and are among the most comprehensively measured atmospheric fields prior to the satellite era <xref ref-type="bibr" rid="bib1.bibx75" id="paren.58"/>. 
In terms of spatial domain, SAT climatology and variability predictors are computed over their corresponding ocean-masked SREX regions (i.e., NEU for DJF and MED for JJA), and SLP climatology and variability predictors are computed over a larger European sector domain that includes the North Atlantic (25–90<inline-formula><mml:math id="M57" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> N and 60<inline-formula><mml:math id="M58" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> W–100<inline-formula><mml:math id="M59" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> E). The derived SAT trend, or the estimated residual thermodynamic trend, is computed over the ocean-masked continental European domain (EUR; 30–76.25<inline-formula><mml:math id="M60" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> N and 10<inline-formula><mml:math id="M61" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> W–39<inline-formula><mml:math id="M62" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> E).</p>
      <p id="d1e2098">To compute the aggregate distance metrics from nine predictors, all predictor and observational fields are bilinearly interpolated to a shared 2.5<inline-formula><mml:math id="M63" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> <inline-formula><mml:math id="M64" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 2.5<inline-formula><mml:math id="M65" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> latitude–longitude grid. The predictors are then time-aggregated, with the mean or standard deviation computed over the periods 1950–1969 and 1990–2009 and the estimated residual thermodynamic trend computed over the period 1960–2009. For each time-aggregated predictor, the differences between the observed mean value and member value (or member value and member value in the case of the RMSE independence scaling) are computed at each grid point and subsequently squared. The squared differences are then area-averaged over the predictor domain and square-rooted to obtain an RMSE distance for observed–member and member–member pairs. For each predictor, the resulting distributions of observed–member and member–member RMSEs are then normalized by their midrange value ((maximum <inline-formula><mml:math id="M66" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> minimum)<inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula>) such that the distance for each of the nine predictors is on the same order of magnitude and can be combined into a single <inline-formula><mml:math id="M68" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (Fig. 
<xref ref-type="fig" rid="App1.Ch1.S2.F6"/>) or <inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> value (Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F7"/>) for each member.</p>
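      <p>The distance computation described in this paragraph (grid-point differences, squaring, area averaging, square root, and midrange normalization) might be sketched as follows. The function names are hypothetical, and the use of cosine-of-latitude weights for the area average on a regular latitude–longitude grid is an assumption rather than a detail stated in the text; the fields are toy values.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math


def area_weighted_rmse(field_a, field_b, lats_deg):
    """RMSE between two lat x lon fields; rows are weighted by cos(latitude).

    The cos(latitude) weighting is an assumed approximation of grid-cell area.
    """
    weighted_sq_sum = 0.0
    weight_sum = 0.0
    for row_a, row_b, lat in zip(field_a, field_b, lats_deg):
        w = math.cos(math.radians(lat))
        for a, b in zip(row_a, row_b):
            weighted_sq_sum += w * (a - b) ** 2
            weight_sum += w
    return math.sqrt(weighted_sq_sum / weight_sum)


def midrange_normalize(values):
    """Divide each RMSE by the midrange, (maximum + minimum) / 2."""
    midrange = (max(values) + min(values)) / 2.0
    return [v / midrange for v in values]


# Toy 2 x 2 fields on latitudes 30 and 60 degrees (invented values).
lats = [30.0, 60.0]
observed = [[1.0, 1.0], [1.0, 1.0]]
member = [[2.0, 2.0], [2.0, 2.0]]

rmse = area_weighted_rmse(observed, member, lats)
normalized = midrange_normalize([1.0, 3.0])
```
]]></preformat>
      <p>The same <monospace>area_weighted_rmse</monospace> call would serve for member–member pairs by passing two member fields instead of an observed field and a member field.</p>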
      <p id="d1e2173">A final consideration in predictor selection is one of relationships between past and future predictor behavior. A model's performance weight is based on its ability to reproduce observed climate, and this methodological choice follows from the concept of emergent constraints <xref ref-type="bibr" rid="bib1.bibx25 bib1.bibx3 bib1.bibx9" id="paren.59"><named-content content-type="pre">e.g.,</named-content></xref>. The assumption is that if a model accurately represents an aspect of historical climate, it is likely to realistically represent relevant physical processes and is therefore likely to provide a reliable future projection. If a model is significantly biased with respect to observed climate, its future representation of climate may be cause for concern <xref ref-type="bibr" rid="bib1.bibx38" id="paren.60"/>, particularly when a statistical relationship between the historical and future climate feature of interest exists. In the absence of a strong statistical relationship, predictors serve to add degrees of difference between members that helps to ward against overconfident weighting.</p>
      <p id="d1e2185">Statistical relationships between historical and future climate can be obscured by internal variability, and the inclusion of SMILEs in a multi-model ensemble highlights the need to understand the role of internal variability in the chosen predictors. In particular, internal variability is shown to influence trends in regional SAT even on the 50-year predictor timescales we have selected <xref ref-type="bibr" rid="bib1.bibx18" id="paren.61"/>. Because of this, a member may have a similar-to-observed SAT trend (and thus a higher performance weight) by chance, simply because it has similar-to-observed climate variability over the trend period (i.e., a similar set of El Niño and La Niña events or similar phasing of the Atlantic Multidecadal Oscillation). Because internal variability is inherently random in temporal phase <xref ref-type="bibr" rid="bib1.bibx15" id="paren.62"/>, a member's match to observations over one trend period does not guarantee a match in the future. This issue is demonstrated in Fig. <xref ref-type="fig" rid="Ch1.F2"/>ai, which shows that there is no discernible relationship (<inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>) between the DJF EUR SAT trend for 1960–2009 and for 2050–2099 in CMIP5 with (black line) or without (blue line) the SMILEs. Even the two observational estimates differ in the European winter trend by more than a degree over 50 years. In summer, a season with less midlatitude climate variation, a relationship emerges between 1960–2009 and 2050–2099 European SAT trends. The linear relationship between past and future trends is reinforced by the SMILEs in a model mean sense; i.e., the three new models added to the CMIP5 ensemble support the relationship (Fig. <xref ref-type="fig" rid="Ch1.F2"/>bi). 
It is not evident within the SMILEs themselves, which reflects the fact that the relationship is due to model differences and not the behavior of individual IC members.</p>
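      <p>The strength of a past–future relationship of this kind is summarized by the coefficient of determination of a least-squares fit. A minimal sketch of that calculation is given below; the function name and the toy trend values are hypothetical and are not data from this study.</p>
      <preformat preformat-type="code"><![CDATA[
```python
def least_squares_fit(x, y):
    """Return slope, intercept and R^2 of an ordinary least-squares line y = a*x + b."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r2 = sxy ** 2 / (sxx * syy)
    return slope, intercept, r2


# Invented toy values: a perfectly linear case and an unrelated case.
slope_lin, intercept_lin, r2_lin = least_squares_fit([0.0, 1.0, 2.0], [1.0, 3.0, 5.0])
slope_none, intercept_none, r2_none = least_squares_fit([-1.0, 0.0, 1.0], [1.0, 0.0, 1.0])
```
]]></preformat>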

      <?xmltex \floatpos{t}?><fig id="Ch1.F2" specific-use="star"><?xmltex \currentcnt{2}?><label>Figure 2</label><caption><p id="d1e2215">Predictor relationships of the domain-averaged 50-year trends of <bold>(a)</bold> DJF and <bold>(b)</bold> JJA European (EUR) SAT. 50-year raw trends are shown in (i), and 50-year estimated residual thermodynamic trends are shown in (ii). In each panel, 1960–2009 is shown on the abscissa and 2050–2099 is shown on the ordinate. ERA-20C (BEST) observational estimates of the 1960–2009 trends are indicated by the solid (dashed) vertical lines. Least-squares regression fits (solid lines) and <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values computed solely for the CMIP5 output are shown in blue, and those computed for ALL output (CMIP5 and the three SMILEs) are shown in black.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020-f02.png"/>

        </fig>

      <p id="d1e2241">The removal of the estimated influence of internal atmospheric variability from regional SAT, however, provides an alternative performance metric with which observations and models can be compared. Using a method of dynamical adjustment (described in Appendix A and in further detail in <xref ref-type="bibr" rid="bib1.bibx18" id="altparen.63"/>), we construct an estimate of the component of SAT variability induced by large-scale atmospheric circulation patterns, remove it from the SAT record, and obtain the estimated residual thermodynamic trend for 1960–2009 and 2050–2099. The estimated residual thermodynamic trend is an estimate of both the influence of surface processes (i.e., land–atmosphere interactions; <xref ref-type="bibr" rid="bib1.bibx41 bib1.bibx53" id="altparen.64"/>) and the influence of the radiative forcing, an influence often defined as the forced response. In the model world, the forced response of a field is often defined as the ensemble mean or average across multiple ensemble members. However, there is no observational equivalent to the ensemble mean; there is only one observed realization of climate. Therefore, we use the estimated residual thermodynamic trend as a predictor because it can be computed in the same manner through dynamical adjustment in both observations and each multi-model ensemble member.</p>
      <p id="d1e2250"><?xmltex \hack{\newpage}?>Internal atmospheric variability serves to amplify both observed SAT trends in winter by approximately 0.6 <inline-formula><mml:math id="M72" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C. Removing the influence of dynamics results in an average observed estimated residual thermodynamic trend that falls centrally within the CMIP5 and SMILE distributions (Fig. <xref ref-type="fig" rid="Ch1.F2"/>aii). In summer, dynamical adjustment also centers the estimated residual thermodynamic trend and slightly reduces the difference between observational datasets (Fig. <xref ref-type="fig" rid="Ch1.F2"/>bii). In terms of weighting, the shift of observed values to the center of the model distribution will lead<?pagebreak page816?> to more models “performing” in their simulation of trend, which will, in turn, allow more models to contribute to the uncertainty estimate. The estimated residual thermodynamic trend can also be thought of as a property of each model, a measure that includes the response to the shared forcing analogous to climate sensitivity <xref ref-type="bibr" rid="bib1.bibx38" id="paren.65"/>. We find that SMILE members, which share both model setup and forcing, also tend to have similar estimated residual thermodynamic trends (Fig. <xref ref-type="fig" rid="Ch1.F2"/>aii and bii). In winter, the clustering of the SMILEs' estimated residual thermodynamic trends is striking in comparison with their raw trends: CESM1.2.2-LE members tend to have the least EUR warming in both periods, while CanESM2-LE members tend to warm the most. The addition of the SMILEs then introduces a slightly positive relationship between past and future responses (Fig. <xref ref-type="fig" rid="Ch1.F2"/>aii, black trend line) not apparent in the CMIP5 ensemble (Fig. <xref ref-type="fig" rid="Ch1.F2"/>aii, blue trend line), though no strong relationship emerges from variability in either case. 
In summer, the positive relationship seen between past and future Mediterranean SAT trends (Fig. <xref ref-type="fig" rid="Ch1.F2"/>bi) is robust to the combination of removing internal atmospheric variability and adding the SMILEs (Fig. <xref ref-type="fig" rid="Ch1.F2"/>bii). CanESM2 has the most JJA MED warming in both the past and future periods, while MPI-GE has the least. Because estimated residual thermodynamic SAT trends in the broader European region are more comparable between members and observations due to the removal of an estimate of the influence of atmospheric variability that manifests on multidecadal timescales, we chose them as the ninth predictor in the definition of climate used in our performance weightings and RMSE independence scaling. Emergent relationships within the other eight predictors are discussed in Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/>.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Results</title>
      <p id="d1e2292">To assess the influence of the weightings, we evaluate the magnitude of regional European end-of-century warming in terms of the SAT change (<inline-formula><mml:math id="M73" display="inline"><mml:mi mathvariant="normal">Δ</mml:mi></mml:math></inline-formula>) from 1990–2009 climatology to 2080–2099 climatology.  Two ensembles are considered, one comprised solely of CMIP5 members (CMIP5; distribution of 88 values) and one comprised of all available members from CMIP5 and the three SMILEs (ALL; distribution of 288 values). The CMIP5 and ALL SAT <inline-formula><mml:math id="M74" display="inline"><mml:mi mathvariant="normal">Δ</mml:mi></mml:math></inline-formula> distributions are shown side by side as box-and-whisker elements in Fig. <xref ref-type="fig" rid="Ch1.F3"/>a and b for the five weighting strategies considered: equal, performance, <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> scaling of IC members, <inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> scaling of modeling center contributions, and RMSE distance scaling. Weighted ensemble mean values are shown by solid horizontal lines within the box elements. Weighted ensemble spread is illustrated by the box, which indicates the 25th and 75th percentiles, and the whisker, which indicates the 5th and 95th percentiles.</p>
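      <p>The weighted ensemble mean and the weighted 5th, 25th, 75th, and 95th percentiles used for the box-and-whisker elements can be computed along the following lines. This is only a sketch: the helper names are hypothetical, and the percentile definition used here (the smallest value at which the cumulative normalized weight reaches the requested level) is one of several possible conventions and is an assumption, not necessarily the one used for the figures.</p>
      <preformat preformat-type="code"><![CDATA[
```python
def weighted_mean(values, weights):
    """Mean of values with each value counted in proportion to its weight."""
    total = sum(weights)
    return sum(v * w for v, w in zip(values, weights)) / total


def weighted_percentile(values, weights, q):
    """Smallest value whose cumulative normalized weight reaches q (0-100).

    Assumed convention; interpolating definitions give slightly different results.
    """
    pairs = sorted(zip(values, weights))
    total = sum(weights)
    cumulative = 0.0
    for value, weight in pairs:
        cumulative += weight
        if cumulative / total >= q / 100.0:
            return value
    return pairs[-1][0]


# Toy warming values (degrees C) and unnormalized weights, both invented.
delta = [1.0, 2.0, 3.0, 4.0]
w_equal = [1, 1, 1, 1]
w_skewed = [1, 1, 1, 7]

mean_skewed = weighted_mean(delta, w_skewed)
median_equal = weighted_percentile(delta, w_equal, 50)
median_skewed = weighted_percentile(delta, w_skewed, 50)
```
]]></preformat>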

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><?xmltex \currentcnt{3}?><label>Figure 3</label><caption><p id="d1e2337"><bold>(a)</bold> Box-and-whisker plot showing how the five weighting strategies affect the distributions of DJF NEU SAT change (<inline-formula><mml:math id="M77" display="inline"><mml:mi mathvariant="normal">Δ</mml:mi></mml:math></inline-formula>, (2080–2099)–(1990–2009)) for the CMIP5 ensemble (blue) and ALL ensemble (CMIP5 with the three SMILEs; gray). The box element spans the 25th to 75th percentile of the distribution; mean SAT change is indicated by the horizontal line within the box. The whisker element spans the 5th to 95th percentile. <bold>(b)</bold> As in <bold>(a)</bold>, but for JJA MED SAT change. <bold>(c)</bold> The contribution of SMILE and CMIP5 members to the DJF NEU ALL ensemble under different weighting strategies in terms of fraction of total weight. <bold>(d)</bold> As in <bold>(c)</bold>, but for the JJA MED ALL ensemble.</p></caption>
        <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020-f03.png"/>

      </fig>

      <p id="d1e2371">For each weighting strategy, comparisons between the CMIP5 and ALL distributions help to elucidate (i) how the weighting constrains uncertainty in the magnitude of end-of-century regional European warming and (ii) how the inclusion of SMILE members influences the distribution. To explicitly determine the contribution of the SMILEs, we also show the fraction of total weight received by each SMILE and CMIP5 in Fig. <xref ref-type="fig" rid="Ch1.F3"/>c and d. Contributions are determined by summing the normalized weights of the 50 CESM1.2.2-LE members (red bar), 50 CanESM2-LE members (yellow bar), 100 MPI-GE members (green bar), and the remaining 88 CMIP5 members (blue bar).</p>
      <p id="d1e2377">For the most part, the weighting strategies introduce only modest distributional shifts; both northern European winters and Mediterranean summers are projected to warm, most likely by about 5–6 <inline-formula><mml:math id="M78" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C, by the end of the century (Fig. <xref ref-type="fig" rid="Ch1.F3"/>a and b). What is more at issue than the distributional statistics, though, is what the distribution actually represents. An equal weighting results in a distribution representative of warming in the models with the most votes, in this case the SMILEs. In both seasons, the equal weighting demonstrates why it is important to treat SMILE members as dependent entities within a multi-model ensemble. The CMIP5 ensemble projects an ensemble mean end-of-century warming of 5.9 <inline-formula><mml:math id="M79" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C and an interquartile spread of 2.2 <inline-formula><mml:math id="M80" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C for northern European winter (Fig. <xref ref-type="fig" rid="Ch1.F3"/>a), as well as an ensemble mean end-of-century warming of 5.5 <inline-formula><mml:math id="M81" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C and an interquartile spread of 1.5 <inline-formula><mml:math id="M82" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C for Mediterranean summer (Fig. <xref ref-type="fig" rid="Ch1.F3"/>b). The addition of 200 SMILE members to the 88-member CMIP5 ensemble shifts the end-of-century warming distributions towards less DJF NEU end-of-century warming and more JJA MED end-of-century warming; it also reduces the interquartile spread by approximately 25 % in both cases. 
The large contributions of the three added SMILEs artificially constrain uncertainty: the CESM1.2.2-LE and CanESM2-LE each receive 17.4 % of the total ALL ensemble weight, while the MPI-GE makes up the majority 34.7 % (Fig. <xref ref-type="fig" rid="Ch1.F3"/>c and d).</p>
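      <p>The equal-weighting contributions quoted above follow directly from the member counts given earlier in this section (50, 50, 100, and 88 members in a 288-member ensemble). The short check below reproduces that arithmetic; the variable names are illustrative only.</p>
      <preformat preformat-type="code"><![CDATA[
```python
# Member counts of the ALL ensemble as stated in the text.
sizes = {"CESM1.2.2-LE": 50, "CanESM2-LE": 50, "MPI-GE": 100, "CMIP5": 88}

total_members = sum(sizes.values())

# Under equal weighting every member has weight 1 / total_members, so each
# group's share of the total weight is simply its share of the members.
fractions = {name: 100.0 * n / total_members for name, n in sizes.items()}

for name, percent in fractions.items():
    print(f"{name}: {percent:.1f} %")
```
]]></preformat>
      <p>Under equal weighting the three SMILEs therefore hold about 69 % of the total weight between them, leaving about 31 % for the 88 CMIP5 members.</p>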
      <p id="d1e2434">Performance weighting results in a distribution representative of warming in the models that historically get things right. By diminishing the contribution of members that differ from observational estimates, the performance weighting acts to constrain uncertainty in both the CMIP5 and the ALL ensemble. For DJF NEU SAT change, the performance weighting shifts the CMIP5 ensemble mean downwards by 0.75 <inline-formula><mml:math id="M83" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C, the 75th percentile downwards by 1.2 <inline-formula><mml:math id="M84" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C, and the 25th percentile downwards by 0.44 <inline-formula><mml:math id="M85" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C. This distributional shift towards less end-of-century warming is due, in part, to members with SAT <inline-formula><mml:math id="M86" display="inline"><mml:mi mathvariant="normal">Δ</mml:mi></mml:math></inline-formula> greater than 8 <inline-formula><mml:math id="M87" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C receiving weights that are 2 orders of magnitude smaller than the average assigned weight. Uncertainty in the DJF NEU ALL ensemble is constrained both by the performance weighting diminishing the contribution of CMIP5 members and because MPI is one of the highest-performing models based on the chosen DJF predictors. The high-performing MPI-GE receives 65.8 % of the total ALL ensemble weight, though individual MPI-GE members only receive up to 3 times more weight than the average assigned weight. The aggregate impact of 100 high-performing members is outsized and results in<?pagebreak page817?> the narrowing of the performance-weighted end-of-century warming distribution. 
The narrowing does not reflect the increased certainty that comes from the agreement of independent entities within the ensemble. Instead, it exemplifies the fact that there is a need for a dependence assumption in order to avoid the outsized influence that comes from being both historically realistic and numerously represented in the ensemble. For JJA MED SAT change, the performance weight reduces the contribution of the three SMILEs to the ALL distribution in comparison to the equal weighting case, with the largest reduction made to the CanESM2-LE contribution (17.4 % to 7.4 %; Fig. <xref ref-type="fig" rid="Ch1.F3"/>d). However, the three SMILEs (three independent entities) still receive 51 % of the total JJA MED ALL ensemble weight, their contributions again augmented by numerous representations. As in the equal weighting case, the JJA MED ALL performance-weighted ensemble mean is still modestly shifted towards more end-of-century warming than its JJA MED CMIP5 counterpart. This reflects the above CMIP5 average SAT change of the CESM1.2.2-LE and the CanESM2-LE in Mediterranean summer.</p>
      <p id="d1e2483">In an effort to more appropriately handle the mix of models and IC members present in the ALL ensemble, we next explore three scalings that reflect different member dependence assumptions: that IC members are dependent (Fig. <xref ref-type="fig" rid="Ch1.F3"/>a and b; <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula>, IC members), that modeling center contributions are dependent (Fig. <xref ref-type="fig" rid="Ch1.F3"/>a and b; <inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula>, modeling center), and that members with similar historical climate are dependent (Fig. <xref ref-type="fig" rid="Ch1.F3"/>a and b; <inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula>, RMSE). The <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> IC member scaling is based on the widely accepted assumption that IC ensemble members are, by definition, dependent. Originating from the same model setup, differences in IC members are not due to differences in model skill. Therefore, it follows that IC members should all receive the same performance weight, which, in aggregate, reflects the skill of its basis model. We achieve this by averaging the performance weights of all members of a SMILE or CMIP5-based IC ensemble (Table <xref ref-type="table" rid="Ch1.T1"/>, in italics) and subsequently dividing this average performance weight by the number of members (<inline-formula><mml:math id="M92" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>). 
This reduces the number of unique weights in the CMIP5 ensemble from 88 (each member receives a unique weight) to 44 and the number of unique weights in the ALL ensemble from 288 to 47.</p>
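      <p>The averaging-and-dividing step described above can be illustrated with a short sketch. The function name, the model labels, and the performance weights are hypothetical; the final renormalization to a unit sum is assumed to follow the normalization applied in every strategy.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from collections import defaultdict


def ic_member_scaling(performance, model_of):
    """Average the performance weights within each IC ensemble, then divide by N.

    performance[i] -- performance weight of member i
    model_of[i]    -- label of the IC ensemble that member i belongs to
    """
    groups = defaultdict(list)
    for weight, model in zip(performance, model_of):
        groups[model].append(weight)

    scaled = []
    for model in model_of:
        members = groups[model]
        n = len(members)
        mean_weight = sum(members) / n
        # Every member of the same IC ensemble receives the same weight.
        scaled.append(mean_weight / n)

    total = sum(scaled)
    return [w / total for w in scaled]


# Toy example (invented): model "A" has two IC members, model "B" has one.
performance = [0.2, 0.4, 0.3]
model_of = ["A", "A", "B"]
scaled_weights = ic_member_scaling(performance, model_of)
```
]]></preformat>
      <p>Here the two members of model "A" each receive a quarter of the total weight and the single member of model "B" receives half, so the two models contribute equally because their mean performance weights are equal.</p>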
      <?pagebreak page818?><p id="d1e2550"><?xmltex \hack{\newpage}?>The scaling of IC ensemble member weight within the CMIP5 ensemble (blue element) decreases DJF NEU end-of-century warming uncertainty and slightly increases JJA MED end-of-century warming uncertainty with respect to equal weighting. It is therefore evident that the IC ensembles within CMIP5, which range from 2 to 10 members, exert an influence on the performance-weighted DJF NEU distribution in the same way the SMILEs influence the corresponding performance-weighted ALL distribution. While this is not seen in the corresponding JJA MED CMIP5 equal and performance weightings, it is important to note that even two or three extra votes for a high-performing model are enough to influence uncertainty. The reduction of IC member influence is even more striking in the ALL distribution; the three SMILEs contribute 11.4 % of the total weight in the DJF NEU and 3.1 % in the JJA MED, down from performance weight contributions of 81.6 % and 50.7 %, respectively. As with other strategies, the <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> IC-member-scaled DJF NEU ALL distribution is shifted towards less end-of-century warming with respect to its CMIP5 counterpart. The ALL and CMIP5 <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> IC-member-scaled JJA MED distributions are almost identical.</p>
      <p id="d1e2578">In addition to IC members, it is reasonable to assume that members of the same model that differ in resolution (i.e., MPI-ESM-LR and MPI-ESM-MR) or in the component module used (i.e., MIROC-ESM and MIROC-ESM-CHEM) are dependent entities. However, determining where to draw the line between dependence and independence is difficult; models from different modeling centers share components, while models in a modeling center's development chain can differ from each other in most major parameterizations <xref ref-type="bibr" rid="bib1.bibx37" id="paren.66"/>. Here, we chose to take a logical approach to the dependent entity grouping based largely on model name or knowledge of institution of origin (Table <xref ref-type="table" rid="Ch1.T1"/>, “Group” column). <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> modeling center weights are computed in the same manner as the <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> IC member weight within these broader groupings. The number of unique weights becomes 20 in both the CMIP5 and ALL ensembles because the CESM1.2.2-LE is grouped with the other NCAR models, the CanESM2-LE is grouped with the five members of CanESM2 in the CMIP5 ensemble, and the MPI-GE is grouped with MPI-ESM-LR and MPI-ESM-MR members. 
The <inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> modeling center scaling results in similar CMIP5 and ALL end-of-century warming distributions in both the DJF NEU and JJA MED, with distributions characterized by positive skewness and a narrower interquartile range than in the <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> IC member scaling case. The SMILE contributions all approximately double from their <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> IC member scaling levels to contribute a combined 22.3 % to the DJF NEU and 6.7 % to the JJA MED ALL distributions, respectively.</p>
      <p id="d1e2647">Finally, in the instance that dependence is not known a priori, an RMSE-based metric can be used to assign dependence. The idea is that because of model biases, dependent entities can be identified by their similar climates. Using the same set of predictors as used for performance, each member receives a unique weight: RMSE-based performance scaled by RMSE-based dependence.  The RMSE independence scaling allows for more SMILE contribution than the <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> independence scaling approaches (Fig. <xref ref-type="fig" rid="Ch1.F3"/>c and d) because internal variability distinguishes SMILE members from one another and thus allows them to be treated as separate entities. With more entities in the ensemble, it follows that the degree of dependence of the existing CMIP5 models increases (CMIP5 models become more dependent) in tandem with the SMILE member degree of dependence decreasing (SMILE members become less than fully dependent). In the DJF NEU, it is striking that the high-performing MPI-GE again contributes over 40 % of the total weight. In the JJA MED, the RMSE independence scaling leads to comparable CMIP5 and ALL distributions, with the ALL distribution projecting slightly less warming than the CMIP5 distribution. This is in contrast to the performance-weighted case in which the ALL distribution is narrower and features more warming than the CMIP5 distribution.</p>
<sec id="Ch1.S4.SSx1" specific-use="unnumbered">
  <?xmltex \opttitle{Reconciling the RMSE and $1/N$~scalings}?><title>Reconciling the RMSE and <inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> scalings</title>
      <p id="d1e2683">For the weighting approach introduced by <xref ref-type="bibr" rid="bib1.bibx38" id="text.67"/> to be suitable for incorporating large initial condition ensembles into a multi-model ensemble, there must be a demonstrable reconciliation between the <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> IC member and the RMSE independence scalings. The RMSE independence scaling has the ability to assign a degree of independence to all members. This addresses the issue that we may not truly know how independent a model is based on name or modeling center of origin alone. However, when dependent entities (i.e., SMILE members) are known, the RMSE metric must be able to identify them as dependent and scale their influence appropriately. In practice, this means we seek an RMSE scaling that approaches (or exceeds) <inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> for the SMILEs and the IC ensembles within the CMIP5 ensemble. The goal of an RMSE scaling proportional to ensemble size comes with the understanding that scaling may be larger if the IC ensemble is very similar to other models or smaller if the IC ensemble is not fully identified as one model (as was the case with the nine-predictor RMSE scalings).</p>
      <p id="d1e2713">One way to achieve an RMSE scaling that identifies IC members as dependent is to remove internal variability from the metric through predictor choice. While it would not be good practice to base member performance on few predictors because of overconfidence concerns, member dependence may be more accurately reflected by fewer predictors that distinguish models from one another. The advantage of choosing different sets of predictors for determining dependence and performance is twofold: first, by selecting for ability to distinguish models rather than realism, dependence predictors can achieve a more substantial separation between SMILE–SMILE distances and SMILE–model distances. This reduces reliance on and sensitivity to the independence shape parameter <inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/>). Second, the<?pagebreak page819?> convergence to reality paradox is no longer an issue; models will not be down-weighted for moving closer to observations (and thus each other) based on performance predictors.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><label>Figure 4</label><caption><p id="d1e2731">RMSE independence scalings (colored lines) of the SMILEs and CMIP5 ensemble members, grouped as listed in Table <xref ref-type="table" rid="Ch1.T1"/>. CESM1.2.2-LE members are shown in red, CanESM2-LE members are shown in yellow, and MPI-GE members are shown in green, while the remainder of CMIP5 members are shown in blue. The <inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> modeling center scaling is shown by gray bars behind each grouping as a point of reference. <bold>(a, b)</bold> The scalings computed from the nine predictors used in the original DJF and JJA RMSE distance weightings, respectively. <bold>(c)</bold> The scalings computed from global land SAT and Northern Hemisphere SLP climatology predictors.</p></caption>
          <?xmltex \igopts{width=455.244094pt}?><graphic xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020-f04.png"/>

        </fig>

      <p id="d1e2761">We find that large-scale, long-term climatological averages are the most suitable predictors for this purpose because, in general, the influence of internal variability increases on smaller spatial scales and shorter timescales <xref ref-type="bibr" rid="bib1.bibx26 bib1.bibx42" id="paren.68"/>. The climatological time aggregation (CLIM) was chosen because, of the nine original predictors utilized, 20-year climatological averages cluster in SMILEs more than 20-year variability or 50-year derived trend values on regional scales (Figs. <xref ref-type="fig" rid="Ch1.F2"/>, <xref ref-type="fig" rid="App1.Ch1.S3.F9"/>, and <xref ref-type="fig" rid="App1.Ch1.S3.F10"/>). We average over the entire historical period, 1950–2009, to obtain two long-term CLIM predictors: annually averaged Northern Hemisphere SLP and annually averaged global land SAT. The Northern Hemisphere region was selected for SLP to maintain the distinguishing characteristics of mean circulation biases in the target-relevant European sector (Figs. <xref ref-type="fig" rid="App1.Ch1.S3.F11"/> and <xref ref-type="fig" rid="App1.Ch1.S3.F12"/>), while global land was selected for SAT to avoid convergence associated with models having similar average ocean temperatures. The RMSE independence scalings derived from the nine predictors in DJF and JJA are shown alongside the scalings derived from global land SAT and Northern Hemisphere SLP climatology predictors in Fig. <xref ref-type="fig" rid="Ch1.F4"/>. RMSE independence scalings for each member of the ALL ensemble are indicated by a thin horizontal colored line within their respective modeling center groupings. For comparison, the RMSE independence scalings are superposed on the <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> modeling center scaling (gray bar).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><label>Figure 5</label><caption><p id="d1e2794">Scatter plot showing how ALL ensemble members distribute in the Northern Hemisphere SLP climatology and global land SAT climatology predictor space. Members and IC ensembles within the CMIP5 ensemble (blue) are labeled by model name. The CESM1.2.2-LE is indicated in red, the CanESM2-LE is indicated in yellow, and the MPI-GE is indicated in green, consistent with other figures.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020-f05.png"/>

        </fig>

      <p id="d1e2803">In contrast to the nine-predictor RMSE scalings (Fig. <xref ref-type="fig" rid="Ch1.F4"/>a and b), the global land SAT–Northern Hemisphere SLP RMSE scaling allows for SMILE members to distinguish themselves and to approach or exceed <inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> values (Fig. <xref ref-type="fig" rid="Ch1.F4"/>c). In both DJF and JJA, no member of the ALL ensemble has a nine-predictor RMSE scaling that exceeds <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">45</mml:mn></mml:mrow></mml:math></inline-formula>. Inter-member RMSE distances, shown in Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F7"/>a and b, reflect why this occurs; SMILE members can be as different from one another as CMIP5 models are from each other. The nine-predictor independence scaling is better able to distinguish SMILE members from CMIP5 members in JJA than in DJF (Fig. <xref ref-type="fig" rid="Ch1.F4"/>b). With the global land SAT and Northern Hemisphere SLP CLIM predictors, SMILE members are clearly closer to one another than to other models, with the exception of the CanESM2-LE. Because the CanESM2-LE is created using the five CanESM2 contributions to CMIP5, the SMILE and CMIP5 contributions cluster as a 55-member CanESM2 ensemble within the ALL ensemble (Fig. <xref ref-type="fig" rid="Ch1.F4"/>c). 
In terms of scaling, 55 CanESM2 members are scaled by an average of <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">55.0</mml:mn></mml:mrow></mml:math></inline-formula>, while the CESM1.2.2-LE and the MPI-GE are scaled by an average of <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">48.7</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">100.5</mml:mn></mml:mrow></mml:math></inline-formula>, respectively (Fig. <xref ref-type="fig" rid="Ch1.F4"/>c). In addition to the SMILEs, other IC ensembles within CMIP5, such as the 10-member CSIRO-Mk3-6-0 ensemble, also achieve a <inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> scaling. Individually represented models, such as FGOALS-g2, are considered more independent and are thus scaled by factors that approach unity. On the other end of the dependence continuum, the four MPI-M contributions to CMIP5 are identified to have a high degree of similarity to the MPI-GE and are scaled accordingly by factors exceeding <inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">60</mml:mn></mml:mrow></mml:math></inline-formula>.</p>
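      <p>The way a tight cluster of members approaches a <inline-formula><mml:math display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> scaling can be illustrated with a minimal sketch. It assumes a Gaussian-shaped independence term, 1 / (1 + sum over j != i of exp(-(S_ij / sigma_S)^2)), and uses synthetic points in a two-predictor space; the function name, the cluster geometry, and the value of sigma_S are hypothetical and are not taken from this study.</p>
<preformat preformat-type="code" xml:space="preserve">
```python
import numpy as np

def independence_scaling(distances, sigma_s):
    """Return 1 / (1 + sum_{j != i} exp(-(S_ij / sigma_s)^2)) for each member i."""
    similarity = np.exp(-(distances / sigma_s) ** 2)
    np.fill_diagonal(similarity, 0.0)  # exclude each member's self-similarity
    return 1.0 / (1.0 + similarity.sum(axis=1))

# Hypothetical two-predictor space: a tight 50-member cluster plus 3 distinct models.
rng = np.random.default_rng(0)
cluster = rng.normal(loc=[0.0, 0.0], scale=0.01, size=(50, 2))
others = np.array([[3.0, 0.0], [0.0, 3.0], [-3.0, -3.0]])
points = np.vstack([cluster, others])

# Pairwise Euclidean distances stand in for inter-member RMSE distances.
diff = points[:, None, :] - points[None, :, :]
distances = np.sqrt((diff ** 2).sum(axis=-1))

scaling = independence_scaling(distances, sigma_s=0.5)
print(scaling[:50].mean())  # close to 1/50 for the clustered members
print(scaling[50:])         # close to 1 for the isolated models
```
</preformat>
      <p>In this toy setting the clustered members are scaled by roughly the inverse of the cluster size, while isolated models keep a scaling near unity, which is the behavior sought from the two-predictor RMSE scaling.</p>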
      <p id="d1e2904">To understand why large-scale, long-term CLIM predictors are able to group SMILE members and set a degree of dependence for CMIP5 members, we investigate where each member falls in the global land SAT and Northern Hemisphere SLP climatology predictor space in Fig. <xref ref-type="fig" rid="Ch1.F5"/>. Each member is labeled either by color (SMILEs) or by model name (CMIP5), and IC ensembles within CMIP5 are circled. Circling IC ensembles within CMIP5 is possible because, along with the SMILEs, the IC members also tend to cluster. This phenomenon is in line with the assumption that IC members are dependent entities; the two large-scale, long-term CLIM predictors reflect this dependence. Notable IC clusters include MIROC5 (three members) and EC-EARTH (five members). The bifurcation in the GISS-E2-H and GISS-E2-R ensembles reflects the p3 (top) vs. the p1 and p2 (bottom) perturbations used for different members. CanESM2 CMIP5 members join the CanESM2-LE (as indicated by Fig. <xref ref-type="fig" rid="Ch1.F4"/>c) and the MPI-ESM-LR contributions fall near the MPI-GE.</p>
      <p id="d1e2911">The assumption that members from the same modeling center are dependent entities, however, is not as clear cut in the global land SAT and Northern Hemisphere SLP climatology predictor space. GISS contributions share a response (lower Northern Hemisphere average SLP and higher global land average SAT), while the contributions from CMCC, GFDL, and IPSL feature markedly different responses (Fig. <xref ref-type="fig" rid="Ch1.F5"/>). Another clustering feature present is that of several separate clusters for a modeling center. This can be seen for the NCAR modeling center grouping: CCSM4 and CESM1-BGC form a cluster separate from both the CESM1-CAM5 cluster and the CESM1.2.2-LE cluster. The NCAR case illustrates that new models in a modeling center's development stream can be distinct from their predecessors and should not necessarily be considered dependent based on their shared name. On the other hand, there are also instances in which models of different names are similar to each other. Bcc-csm1-1 falls within the CCSM4-CESM1-BGC cluster (Fig. <xref ref-type="fig" rid="Ch1.F5"/>), which suggests that with shared components <xref ref-type="bibr" rid="bib1.bibx33" id="paren.69"/>, models can have similar responses and be identified as more dependent than their name would suggest. Ultimately, discrepancies between model name and model response suggest that assigning each member a degree of dependence is a useful way to handle the continuum of dependence assumptions. Provided that care is taken to select an appropriate set of predictors for independence scaling, IC members cluster in an anticipatable way, while an interplay between named and unnamed model dependence remains.</p>
</sec>
</sec>
<?pagebreak page821?><sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Conclusions</title>
      <p id="d1e2931">We find that the performance and independence weighting scheme pioneered by <xref ref-type="bibr" rid="bib1.bibx38" id="text.70"/> can be used to incorporate regional climate information from three single-model initial condition large ensembles into a CMIP5 multi-model ensemble and return a justifiably constrained estimate of European regional end-of-century warming uncertainty. The performance weighting, which accounts for an ensemble member's ability to reproduce selected aspects of observed climate, is based on regional surface air temperature, sea level pressure climatology, and interannual variability over two 20-year intervals during the historical period (1950–1969 and 1990–2009) and a 50-year estimated residual thermodynamic SAT trend computed using a method of dynamical adjustment <xref ref-type="bibr" rid="bib1.bibx18" id="paren.71"/>. These predictors add two things to the definition of performance: emergent relationships between past and future climate, and aspects of climate that a model must simulate well historically in order to project future warming realistically. The principle of emergent constraints underpins the choice to use the estimated residual thermodynamic SAT trend over the SAT trend, as the former is an estimate of a model-specific property that can be compared with observations and the latter is influenced by internal variability even on 50-year timescales.</p>
      <p id="d1e2940">Five different strategies based on the <xref ref-type="bibr" rid="bib1.bibx38" id="text.72"/> performance and independence weighting are assessed for suitability of use in a CMIP5 and a combined CMIP5–SMILE ALL ensemble. While the different strategies introduce only modest distributional shifts (towards less end-of-century warming than in the equal weighting case), they give the distribution different meanings. RCP8.5 SAT change between 1990–2009 and 2080–2099 is projected to be about 5–6 <inline-formula><mml:math id="M114" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C in both northern European winter and Mediterranean summer when historical model performance is considered. Equal and performance-weighted ALL distributions are narrowed by a 50 %–82 % contribution from the SMILEs, which is an outsize contribution from three models to an ensemble comprised of 40 uniquely named models. The high-performing, numerously represented MPI-GE receives over 65 % of the total weight in the performance-weighted DJF NEU end-of-century warming distribution, demonstrating that an independence scaling is necessary so no one model defines the uncertainty range of a multi-model ensemble regardless of its historical realism.</p>
      <p id="d1e2955">Three plausible dependence assumptions are made to account for model contribution issues in an ensemble comprised of both known (i.e., IC members) and unknown (i.e., model component sharing) dependencies. By explicitly defining IC members as dependent entities, SMILE contributions drop to less than 10 % while maintaining a distributional shift tendency towards less end-of-century warming. Taking the definition of dependence a step further by considering all members from the same modeling center and/or development stream dependent introduces positive skewness and a narrower interquartile range to the distributions now containing 20 uniquely weighted entities. Finally, by acknowledging that dependencies may not always be clearly determinable a priori, the independence scaling based on inter-member RMSE distances from the same nine predictors used to determine performance allows for reasonable levels of SMILE contribution to Mediterranean summer end-of-century warming uncertainty. However, the high-performing MPI-GE contributes approximately 40 % of the total weight to the northern European winter distribution as a result of predictor internal variability distinguishing SMILE members as independent models.</p>
      <p id="d1e2958">The advantages of the RMSE-based independence scaling, which include allowing for degrees of dependence, are subverted somewhat by the inability of performance predictors to distinguish known dependent entities (i.e., IC members) from (presumed) independent ones. To address this issue, we show that a set of two predictors, 60-year annual average global land SAT climatology and 60-year annual average Northern Hemisphere SLP climatology, is capable of rendering an RMSE scaling of <inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> for SMILE members while assigning a degree of dependence to the rest of CMIP5. A notable achievement for these large-scale, long-term predictors is their ability to identify the CanESM2 members from CMIP5 as being from the same model version as the CanESM2-LE and scale the 55-member ensemble accordingly. A deeper look into groupings in the global land SAT and Northern Hemisphere SLP climatology predictor space reveals clustering of IC ensembles within the CMIP5 ensemble in addition to the SMILEs. MPI-ESM-MR and MPI-ESM-LR contributions cluster near the MPI-GE, while the NCAR model group separates into three distinct clusters consistent with NCAR's model development over time. The interplay between model name and model response does exhibit some complexity; models from the same center (e.g., GFDL) can have markedly different responses, and models from different centers (e.g., bcc-csm1-1 and CCSM4) can have similar responses. This suggests that assigning degrees of dependence is a useful way to represent the information in an ensemble of opportunity like CMIP5.</p>
      <p id="d1e2974">It is important to note that while the weighting has a relatively straightforward functional form, it requires application-specific sets of predictors and appropriate shape parameters. Strategies to select optimal shape parameters are discussed in Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/> of this study, and we advise that emergent predictor relationships be explored, as in Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/>, to provide justification for the performance metric. When defining model skill for performance, it is important to carefully consider whether predictors are relevant to a model's ability to project the future target realistically. Different targets, such as hydrological changes, may require predictors to capture a more complex set of physical processes. It is also important to assess RMSE distance to observations of known dependent entities such as SMILEs to ensure that internal variability in the selected set of predictors does not<?pagebreak page822?> assign them skill of different orders of magnitude. Because SMILE members had relatively similar RMSE distance to observations over the nine original predictors, we did not require members of the SMILE to have identical performance weights under the performance and RMSE case assumptions evaluated. We do, however, see the merit in fixing IC member performance to an ensemble average value to ensure that model skill is appropriately assigned. We also recommend that different sets of predictors be used for determining performance weight and independence scaling to avoid down-weighting independent models with historical climate that converges to reality. Independence predictors should be fields with minimal internal variability, such as large-scale, long-term averages, and ideally fields that model developers do not explicitly tune, such as absolute global temperature <xref ref-type="bibr" rid="bib1.bibx48 bib1.bibx29" id="paren.73"/>.</p>
      <p id="d1e2984"><?xmltex \hack{\newpage}?>We assess a relatively unconventional multi-model ensemble in this study, which is comprised of 200 members from 3 models and only 88 members from the remaining 40 named models. This is a deliberate choice made to test and improve the independence scaling, as determining best practices for representing uncertainty in a multi-model ensemble that includes initial condition ensemble members is necessary in advance of CMIP6. Modeling centers are slated to submit more ensemble members to the project than were submitted to CMIP5 <xref ref-type="bibr" rid="bib1.bibx22 bib1.bibx72" id="paren.74"/>. For more conventional multi-model ensembles that include just a few initial condition ensemble members amongst the models, results may be less sensitive to choices underpinning the independence scaling. When large ensembles are included, however, it becomes clear that an independence scaling that scales known dependencies appropriately (i.e., <inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> for IC ensemble members), such as the RMSE global predictor scaling presented here, is necessary. Such an independence scaling will be a useful tool with which to assess uncertainty in the combined multi-model, multi-initial-condition ensemble member CMIP6 ensemble.</p><?xmltex \hack{\clearpage}?>
</sec>

      
      </body>
    <back><app-group>

<?pagebreak page823?><app id="App1.Ch1.S1">
  <?xmltex \currentcnt{A}?><label>Appendix A</label><title>Dynamical adjustment</title>
      <p id="d1e3015">To obtain an estimated residual thermodynamic trend in SAT, a method of dynamical adjustment based on constructed circulation analogues is used <xref ref-type="bibr" rid="bib1.bibx18 bib1.bibx41 bib1.bibx53 bib1.bibx24" id="paren.75"/>. Dynamical adjustment provides an empirically derived estimate of the SAT trends induced by atmospheric circulation variability; removal of this circulation-driven component from an SAT record thus reveals an estimate of the SAT trend associated with thermodynamic processes and radiative effects. Dynamical adjustment relies on the ability to reconstruct a monthly mean circulation field, which we represent with sea level pressure (SLP) as in <xref ref-type="bibr" rid="bib1.bibx18" id="text.76"/>, from a large set of analogues. Here, SLP analogues are selected from 60 possible choices (from the period 1950–2010) excluding the target month, and the method is therefore referred to as the “leave-one-out” method of dynamical adjustment. SLP fields in SMILE members, CMIP5 ensemble members, and the observational estimates ERA-20C and NOAA-20C are constructed in this manner for target months in the 1950–2010 period. For model years 2011–2099, analogues are selected from the entire 1950–2010 period. No notable trends in SLP have been identified over this period in previous dynamical adjustment studies <xref ref-type="bibr" rid="bib1.bibx15 bib1.bibx18 bib1.bibx41" id="paren.77"/>.</p>
      <p id="d1e3027">It is important to acknowledge that because of the paucity of analogue choices in leave-one-out dynamical adjustment, the term “analogue” is a bit of a misnomer. The term evokes the idea of a match, though in practice, analogues may not closely resemble the target. For convenience, we will continue to refer to the months used in target SLP construction as analogues, but we do so with the understanding that target and analogue patterns may differ over the selection domain.</p>
      <p id="d1e3030">A month is determined to be an analogue of the target month if the Euclidean distance between target and analogue SLP is small. Euclidean distance is computed at each grid point and averaged over the European sector domain also used for SLP predictors in the nine-predictor RMSE weightings (25–90<inline-formula><mml:math id="M117" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> N, 60<inline-formula><mml:math id="M118" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> W–100<inline-formula><mml:math id="M119" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> E). This selection metric therefore does not require an analogue to match the target month spatially over the whole domain. This is necessary because, with 60 possible options, it is statistically unlikely that a “perfect” analogue will exist for a particular target month. The study by <xref ref-type="bibr" rid="bib1.bibx76" id="text.78"/> found that it would take on the order of 10<inline-formula><mml:math id="M120" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">30</mml:mn></mml:msup></mml:math></inline-formula> years to find two Northern Hemisphere circulation patterns that match within observational uncertainty. With this in mind, a smaller-than-hemispheric domain and an iterative averaging scheme are employed to make the most of the “imperfect” analogues available <xref ref-type="bibr" rid="bib1.bibx79 bib1.bibx16 bib1.bibx18" id="paren.79"/>.</p>
      <p id="d1e3076">Once the Euclidean distances are determined, the 50 closest SLP analogues are chosen, and the iterative process of selecting 30 of 50 SLP analogues and optimally reconstructing a target SLP field <inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mi>h</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> commences. The optimal reconstruction of the target SLP is mathematically equivalent to multivariate linear regression; each analogue is assigned a weight (<inline-formula><mml:math id="M122" display="inline"><mml:mi mathvariant="bold-italic">β</mml:mi></mml:math></inline-formula>) such that a weighted linear combination of analogues produces a least-squares estimate of the target SLP. <inline-formula><mml:math id="M123" display="inline"><mml:mi mathvariant="bold-italic">β</mml:mi></mml:math></inline-formula> is computed through a singular value decomposition of a column vector matrix <inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold">X</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> containing the 30 selected analogues and can also be estimated through a Moore–Penrose pseudo-inverse:
          <disp-formula id="App1.Ch1.S1.E6" content-type="numbered"><label>A1</label><mml:math id="M125" display="block"><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mo>=</mml:mo><mml:mfenced close="]" open="["><mml:mrow><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msubsup><mml:mi mathvariant="bold">X</mml:mi><mml:mi mathvariant="normal">c</mml:mi><mml:mi>T</mml:mi></mml:msubsup><mml:msub><mml:mi mathvariant="bold">X</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msubsup><mml:mi mathvariant="bold">X</mml:mi><mml:mi mathvariant="normal">c</mml:mi><mml:mi>T</mml:mi></mml:msubsup></mml:mrow></mml:mfenced><mml:msub><mml:mi mathvariant="bold">X</mml:mi><mml:mi>h</mml:mi></mml:msub><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
        The analogue weighting scheme ensures that analogues which are further from (closer to) the target, in a Euclidean distance sense, contribute less (more) to the constructed SLP field.</p>
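      <p>As a minimal sketch of the regression in Eq. (A1), the weights can be computed with NumPy either from the normal equations or from SVD-based routines. The array sizes and the synthetic random fields below are hypothetical stand-ins for flattened SLP maps, not data from this study.</p>
<preformat preformat-type="code" xml:space="preserve">
```python
import numpy as np

rng = np.random.default_rng(1)
n_grid, n_analogues = 200, 30  # hypothetical sizes

# Columns of X_c are flattened analogue SLP fields; x_h is the flattened target field.
X_c = rng.normal(size=(n_grid, n_analogues))
x_h = rng.normal(size=n_grid)

# Normal-equation form, as written in Eq. (A1).
beta_normal = np.linalg.inv(X_c.T @ X_c) @ X_c.T @ x_h

# SVD-based alternatives; these are typically more robust when analogues are nearly collinear.
beta_pinv = np.linalg.pinv(X_c) @ x_h
beta_lstsq, *_ = np.linalg.lstsq(X_c, x_h, rcond=None)

# Weighted linear combination of analogues: least-squares estimate of the target SLP.
x_h_reconstructed = X_c @ beta_lstsq
```
</preformat>
      <p>For a well-conditioned analogue matrix the three estimates agree to numerical precision; the SVD-based forms are the safer choice when analogue patterns resemble one another closely.</p>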
      <p id="d1e3162">After the target SLP field is constructed, the <inline-formula><mml:math id="M126" display="inline"><mml:mi mathvariant="bold-italic">β</mml:mi></mml:math></inline-formula> values derived for each SLP analogue are applied to their corresponding monthly averaged SAT fields. Prior to the application of weights, a quadratic trend representing anthropogenic warming is removed from the SAT record at each point in space. The purpose of this detrending is so that months picked from the end of the record do not contribute higher SAT anomalies simply because of the anthropogenically forced warmer background climate, even if the SLP patterns are the same <xref ref-type="bibr" rid="bib1.bibx41" id="paren.80"/>. Detrending strategies are further discussed in <xref ref-type="bibr" rid="bib1.bibx18" id="text.81"/>. The weighted, detrended SAT fields are then used to construct a dynamic SAT anomaly field for the target month. SLP, which is representative of low-level atmospheric circulation, and SAT are physically related; SLP-derived weights are applied to SAT to empirically construct that relationship. Conceptually, dynamic SAT anomalies are those that would occur given the attendant circulation pattern. The second through fourth steps of dynamical adjustment (selection of 30 of 50 SLP analogues, optimal reconstruction of target SLP, and construction of dynamic SAT) are then repeated 100 times, following <xref ref-type="bibr" rid="bib1.bibx41" id="text.82"/>. The dynamic component of SAT in the target month is the average of the 100 constructions. It is then subtracted from SAT in the target month to find the residual thermodynamic component of SAT, which is used as an estimate of the regional SAT response to surface processes and radiative forcing. The trend of the residual thermodynamic SAT component is used as a predictor in this study; the trend is computed at each land grid point in the predictor domain and subsequently area-averaged.</p>
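      <p>The iterative procedure described in this appendix can be sketched as follows. This is an illustration under stated assumptions rather than the implementation used in this study: the function name and argument layout are hypothetical, the 30 analogues are assumed to be drawn at random from the 50 closest, the distance is a simple root-mean-square difference over the flattened domain, and the SAT fields are assumed to have been detrended beforehand.</p>
<preformat preformat-type="code" xml:space="preserve">
```python
import numpy as np

def dynamic_sat(target_slp, slp_pool, sat_pool, n_closest=50, n_select=30,
                n_iter=100, seed=0):
    """Estimate the dynamic SAT anomaly for one target month.

    target_slp: (n_slp,) flattened target SLP field
    slp_pool:   (n_months, n_slp) candidate analogue SLP fields (target excluded)
    sat_pool:   (n_months, n_sat) detrended SAT fields for the same months
    """
    rng = np.random.default_rng(seed)
    # Step 1: rank candidates by distance to the target SLP and keep the closest.
    dist = np.sqrt(((slp_pool - target_slp) ** 2).mean(axis=1))
    closest = np.argsort(dist)[:n_closest]
    estimates = []
    for _ in range(n_iter):
        # Step 2: pick a subset of the closest analogues (random draw is an assumption).
        subset = rng.choice(closest, size=n_select, replace=False)
        # Step 3: least-squares weights that reconstruct the target SLP.
        beta, *_ = np.linalg.lstsq(slp_pool[subset].T, target_slp, rcond=None)
        # Step 4: apply the same weights to the detrended SAT fields.
        estimates.append(beta @ sat_pool[subset])
    # Average the constructions to obtain the dynamic component.
    return np.mean(estimates, axis=0)

# Synthetic example with 60 candidate months (hypothetical field sizes).
rng = np.random.default_rng(42)
slp_pool = rng.normal(size=(60, 40))
sat_pool = rng.normal(size=(60, 25))
target_slp = rng.normal(size=40)
dyn = dynamic_sat(target_slp, slp_pool, sat_pool)
```
</preformat>
      <p>Subtracting the returned field from the target month's SAT would then give the residual thermodynamic component in this simplified setting.</p>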
</app>

<app id="App1.Ch1.S2">
  <?xmltex \currentcnt{B}?><label>Appendix B</label><?xmltex \opttitle{Selecting~$\sigma _{{D}}$ and~$\sigma _{{S}}$}?><title>Selecting <inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></title>
      <p id="d1e3211">Determining the shape parameters <inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is an important step in the RMSE weighting process <xref ref-type="bibr" rid="bib1.bibx38" id="paren.83"/>. <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> can be set using a perfect model test, as described in <xref ref-type="bibr" rid="bib1.bibx45" id="text.84"/>. Here, a simplified perfect model test is performed on a 47-member ensemble, which includes only the first IC member from the SMILEs and each of<?pagebreak page824?> the CMIP5 model ensembles (40 named models with an additional four members from GISS-E2-R and GISS-E2-H physics-version ensembles). This is done because having multiple IC members (or a SMILE) in the ensemble could bias the perfect model test, which is based on predicting one member using a weighted distribution of the rest. We use member 1 for each IC ensemble because, often, when multiple IC members are available, the first member is selected <xref ref-type="bibr" rid="bib1.bibx44 bib1.bibx31 bib1.bibx67" id="paren.85"><named-content content-type="pre">e.g.,</named-content></xref>. During the perfect model test, each member is assumed to be the “truth” once, and a weighting is performed using the remaining members to predict the “true” SAT change. 
RMSE distances (based on nine predictors) are computed with respect to the truth for the remaining members and used in the performance weighting function <inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">II</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> described in Sect. 3.2. The performance weights are computed for <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values ranging between 0 and 2 (on 0.01 intervals). For each <inline-formula><mml:math id="M134" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, the weighted mean SAT change is computed and compared to the true SAT change. The optimal <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for each truth is chosen such that the difference between the weighted mean SAT change and the true SAT change is minimized. In the few cases when the weighted mean exhibits asymptotic behavior with no clear minimum difference prior to <inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula>, the <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> value is selected at the point at which the leveling-off begins (as determined by the intersection between a threshold value and the weighted mean curve). 
For the nine-predictor RMSE weightings, we set <inline-formula><mml:math id="M138" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values to the mean of the 47 optimal <inline-formula><mml:math id="M139" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values computed during the perfect model test. It is important to note that this choice is ultimately subjective, and further parameter sensitivity testing is recommended in studies focused on model performance.</p>
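      <p>The perfect model test described above can be sketched as follows. This is a minimal illustration rather than the published implementation: it assumes a Gaussian performance weight of the form exp(−(D_i/σ_D)²) (the definition actually used is given in Sect. 3.2), it replaces the ensemble with synthetic data, it starts the σ_D grid at 0.01 because the assumed weight is undefined at 0, and it omits the threshold rule applied to asymptotic cases. The function and variable names are hypothetical.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def rmse_distance(members, reference):
    """RMSE across predictors between each row of `members` and `reference`."""
    return np.sqrt(np.mean((members - reference) ** 2, axis=-1))


def perfect_model_sigma_d(predictors, sat_change, sigma_grid):
    """Return the optimal sigma_D for each member treated as the "truth".

    predictors : (n_members, n_predictors) array of normalized predictors
    sat_change : (n_members,) array of projected SAT change
    sigma_grid : (n_sigma,) array of strictly positive candidate values
    """
    n_members = predictors.shape[0]
    optimal = np.empty(n_members)
    for truth in range(n_members):
        rest = np.arange(n_members) != truth
        dist = rmse_distance(predictors[rest], predictors[truth])
        # Assumed Gaussian performance weight exp(-(D_i / sigma_D)^2),
        # evaluated for every candidate sigma_D at once.
        exponent = -(dist[None, :] / sigma_grid[:, None]) ** 2
        # Shift by the row maximum so small sigma_D cannot underflow to 0/0;
        # the normalized weighted mean is unaffected by this shift.
        exponent -= exponent.max(axis=1, keepdims=True)
        weights = np.exp(exponent)
        weighted_mean = (weights * sat_change[rest]).sum(axis=1) / weights.sum(axis=1)
        error = np.abs(weighted_mean - sat_change[truth])
        optimal[truth] = sigma_grid[np.argmin(error)]
    return optimal


# Synthetic stand-in for the 47-member, nine-predictor ensemble.
rng = np.random.default_rng(0)
predictors = rng.normal(size=(47, 9))
sat_change = predictors[:, 0] + rng.normal(scale=0.5, size=47)
sigma_grid = np.arange(1, 201) * 0.01  # 0.01, 0.02, ..., 2.00
optimal = perfect_model_sigma_d(predictors, sat_change, sigma_grid)
sigma_d = optimal.mean()  # ensemble-wide sigma_D: mean of the 47 optima
```
]]></preformat>
      <p>Evaluating all candidate values in one array operation keeps the sweep over the 0.01 grid inexpensive, so repeating it for alternative predictor sets or ensembles is a practical way to carry out the sensitivity testing recommended above.</p>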
      <p id="d1e3354">The RMSE distances between multi-model ensemble members and observations (<inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) are shown in Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F6"/>. Members of the ALL ensemble are plotted in ascending order with the position of SMILE members indicated in red for the CESM1.2.2-LE, in yellow for the CanESM2-LE, and in green for the MPI-GE. In winter (Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F6"/>a), distances between CMIP5 members and observations are positively skewed, with the mode of the distribution at approximately <inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.40</mml:mn></mml:mrow></mml:math></inline-formula> and a tail of larger <inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values. In contrast, CMIP5 distances in summer (Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F6"/>b) are approximately normally distributed about a mean of <inline-formula><mml:math id="M143" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.85</mml:mn></mml:mrow></mml:math></inline-formula>. The addition of the SMILEs to the distribution contributes to both of these distributional tendencies. <inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is set to 0.32 in DJF and 0.4 in JJA in both the CMIP5 and ALL ensembles to eliminate a degree of freedom of the method. 
Members are more strongly weighted by performance in winter than in summer due to the different distance distributions.</p>
      <p id="d1e3427"><inline-formula><mml:math id="M145" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> can be determined using IC ensembles present in the multi-model ensemble, including SMILEs. The inclusion of SMILE members in a multi-model ensemble emphasizes the need for <inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> to be carefully selected, as SMILEs add redundant information and the purpose of <inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is to reduce the influence of redundant information. However, not all information added by a SMILE is distinguishable from information in other models in the nine-predictor cases; inter-member distances in an initial condition ensemble can be as large as inter-model distances in the multi-model ensemble (Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F7"/>a and b). Checking inter-member vs. inter-model distances is an important first step in determining <inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>; too much overlap between the distributions can blur the line between known dependent entities (IC members) and likely independent entities (different models).</p>
      <p id="d1e3476">If <inline-formula><mml:math id="M149" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is too small or too large, there are implications for the nine-predictor RMSE-weighted ensemble mean and spread. This sensitivity to <inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is shown in Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F8"/>. We assess the characteristics of the nine-predictor RMSE-weighted CMIP5 distributions (Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F8"/>a and bi) and RMSE-weighted ALL distributions (Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F8"/>a and bii) for different values of <inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, varying from 0.05 to 0.8.</p>
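      <p>As a rough illustration of this sensitivity, the sketch below evaluates an independence scaling for several values of σ_S on a toy ensemble. It assumes that the scaling takes the form 1/(1 + Σ_{j≠i} exp(−(S_ij/σ_S)²)), which is the form commonly used in this family of weighting schemes; the definition used in this study is given in the main text. All names and the toy data are hypothetical.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def pairwise_rmse(predictors):
    """Symmetric (n, n) matrix of RMSE distances S_ij between members."""
    diff = predictors[:, None, :] - predictors[None, :, :]
    return np.sqrt(np.mean(diff ** 2, axis=-1))


def independence_scaling(dist, sigma_s):
    """Assumed scaling 1 / (1 + sum_{j != i} exp(-(S_ij / sigma_S)^2))."""
    similarity = np.exp(-(dist / sigma_s) ** 2)
    np.fill_diagonal(similarity, 0.0)  # exclude the j == i term
    return 1.0 / (1.0 + similarity.sum(axis=1))


# Toy ensemble: three near-identical members plus one distinct member.
predictors = np.array([[0.00], [0.02], [0.04], [1.00]])
dist = pairwise_rmse(predictors)
for sigma_s in (0.05, 0.2, 0.4, 0.8):
    # One scaling factor per member; values near 1 mean "treated as independent".
    print(sigma_s, np.round(independence_scaling(dist, sigma_s), 3))
```
]]></preformat>
      <p>Under this assumed form, the scaling approaches 1 for every member as σ_S shrinks, leaving only the performance weight, and approaches 1/N for an N-member ensemble as σ_S grows, so that all members are scaled down together.</p>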
      <p id="d1e3519">For small <inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, only members that are very close to each other in predictor space are considered dependent; most members of the multi-model ensemble will therefore be considered independent. In this case, the RMSE weighting tends toward the performance-weighted approach. If <inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is set on the order of the largest inter-member distances in a SMILE (<inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub><mml:mo>≥</mml:mo><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn></mml:mrow></mml:math></inline-formula>), few members of the multi-model ensemble will be considered independent from each other, despite coming from different models. The systematic scaling of performance weights in the ensemble at large tends to also lead to a narrowing of uncertainty. Only members that are very far from other members will not have a scaled performance weight, but these “independent” members tend to also be far from observations and therefore have little performance weight to begin with. For <inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> between approximately 0.2 and 0.4, uncertainty in the RMSE-weighted distributions increases in all but the JJA MED CMIP5 case. 
The JJA MED CMIP5 distribution is relatively insensitive to <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> because 50 % of the RMSE distances between CMIP5 members are between 0.56 and 0.71 (Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F7"/>b). For the ALL distributions, the RMSE-weighted mean shifts up modestly in DJF and down in JJA. In order to avoid an underestimate of uncertainty, either due to redundancy or from down-weighting independent information, we propose that <inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> should be set carefully. For the set of nine predictors, we set <inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> based on the <inline-formula><mml:math id="M159" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> distribution in IC ensembles present within the multi-model ensemble. 
We compute the <inline-formula><mml:math id="M160" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> within the three SMILEs and set <inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> at 2 standard deviations below the SMILE <inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> mean value (Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F7"/>). The three values are then averaged. By this metric, DJF NEU <inline-formula><mml:math id="M163" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is 0.26 and JJA MED <inline-formula><mml:math id="M164" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is 0.25.</p>
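      <p>The rule used for the nine-predictor case (two standard deviations below the mean intra-SMILE S_ij, averaged over the SMILEs) can be written compactly. The sketch below is illustrative only: the ensemble sizes and data are synthetic, the function names are hypothetical, and the use of the population standard deviation is an assumption.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def intra_ensemble_distances(predictors):
    """Unique pairwise RMSE distances S_ij (i < j) within one ensemble."""
    diff = predictors[:, None, :] - predictors[None, :, :]
    dist = np.sqrt(np.mean(diff ** 2, axis=-1))
    upper = np.triu_indices(predictors.shape[0], k=1)
    return dist[upper]


def sigma_s_from_smiles(smiles):
    """Average over ensembles of (mean S_ij - 2 * std S_ij)."""
    per_smile = []
    for predictors in smiles:
        s_ij = intra_ensemble_distances(predictors)
        # Population standard deviation (ddof=0) is an assumption here.
        per_smile.append(s_ij.mean() - 2.0 * s_ij.std())
    return float(np.mean(per_smile))


# Three synthetic ensembles with nine predictors each (sizes are arbitrary).
rng = np.random.default_rng(1)
smiles = [rng.normal(scale=0.3, size=(n, 9)) for n in (20, 30, 40)]
sigma_s = sigma_s_from_smiles(smiles)
```
]]></preformat>
      <p>Computing the statistic per ensemble before averaging gives each SMILE equal influence on the result, regardless of how many members it contains.</p>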
      <?pagebreak page825?><p id="d1e3686">Another, more robust option, as discussed in the main text, is to select a set of independence predictors that explicitly differentiate inter-IC-member distances from inter-model distances. In this case, <inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> should not be set to 2 standard deviations below the SMILE <inline-formula><mml:math id="M166" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> mean; rather, it should be set to a value greater than all IC member <inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> but less than inter-model <inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> (particularly for differently named models). For the large-scale CLIM predictor set explored in Fig. <xref ref-type="fig" rid="Ch1.F4"/>, <inline-formula><mml:math id="M169" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> can be computed based on inter-member distances within IC ensembles as described in <xref ref-type="bibr" rid="bib1.bibx11" id="text.86"/>; <inline-formula><mml:math id="M170" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in this instance is 0.22.</p>
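      <p>Whether a predictor set separates IC members from different models can be checked before choosing σ_S. The sketch below, which uses hypothetical names and toy data, returns the largest distance between members of the same model and the smallest distance between members of different models; a value between the two exists only if the first is smaller than the second. It assumes that at least one model contributes two or more members and that at least two models are present.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def separation_bounds(predictors, model_labels):
    """Return (largest intra-model S_ij, smallest inter-model S_ij)."""
    labels = np.asarray(model_labels)
    diff = predictors[:, None, :] - predictors[None, :, :]
    dist = np.sqrt(np.mean(diff ** 2, axis=-1))
    same_model = labels[:, None] == labels[None, :]
    off_diagonal = ~np.eye(len(labels), dtype=bool)  # drop self-distances
    max_intra = dist[same_model & off_diagonal].max()
    min_inter = dist[~same_model].min()
    return max_intra, min_inter


# Toy case: two models, each with two initial-condition members.
predictors = np.array([[0.0], [0.1], [1.0], [1.1]])
labels = ["A", "A", "B", "B"]
max_intra, min_inter = separation_bounds(predictors, labels)
separable = max_intra < min_inter  # True when a separating value exists
```
]]></preformat>
      <p>When the two bounds overlap, as they can for the nine-predictor sets, no single value cleanly separates dependent from independent members, which is the situation the alternative predictor set is intended to avoid.</p>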

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S2.F6"><?xmltex \currentcnt{B1}?><label>Figure B1</label><caption><p id="d1e3772">RMSE distance <inline-formula><mml:math id="M171" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, derived from nine predictors, between observations and the 288 members of the ALL ensemble – CMIP5 (blue) <inline-formula><mml:math id="M172" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> CESM1.2.2-LE (red) <inline-formula><mml:math id="M173" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> CanESM2-LE (yellow) <inline-formula><mml:math id="M174" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> MPI-GE (green). DJF NEU distances are shown in <bold>(a)</bold>, and JJA MED distances are shown in <bold>(b)</bold>.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020-f06.png"/>

      </fig>

<?xmltex \hack{\clearpage}?><?xmltex \floatpos{h!}?><fig id="App1.Ch1.S2.F7"><?xmltex \currentcnt{B2}?><label>Figure B2</label><caption><p id="d1e3826">Distributions of RMSE distance (<inline-formula><mml:math id="M175" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>) within the SMILEs, the CESM1.2.2-LE (red), the CanESM2-LE (yellow), the MPI-GE (green), and the CMIP5 ensemble (blue). The box element spans the 25th to 75th percentile of the distribution; the median <inline-formula><mml:math id="M176" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is indicated by the horizontal line within the box. The whisker element spans the full range of the <inline-formula><mml:math id="M177" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> distribution. The value of <inline-formula><mml:math id="M178" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> used for the weighting is indicated by the dashed line. DJF NEU distances based on the nine predictors are shown in <bold>(a)</bold>, JJA MED distances based on the nine predictors are shown in <bold>(b)</bold>, and distances based on annual global land SAT and Northern Hemisphere SLP climatology are shown in <bold>(c)</bold>.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020-f07.png"/>

      </fig>

<?xmltex \hack{\clearpage}?><?xmltex \floatpos{h!}?><fig id="App1.Ch1.S2.F8"><?xmltex \currentcnt{B3}?><label>Figure B3</label><caption><p id="d1e3903">The <inline-formula><mml:math id="M179" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> sensitivity of the nine-predictor RMSE weightings in Fig. <xref ref-type="fig" rid="Ch1.F3"/>; <inline-formula><mml:math id="M180" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>S</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> used for each weighting is indicated below each box element. Box-and-whisker plots show the SAT change distribution under the RMSE independence scaling weighting assumption (<inline-formula><mml:math id="M181" display="inline"><mml:mi mathvariant="normal">Δ</mml:mi></mml:math></inline-formula>, (2080–2099)–(1990–2009)) for the CMIP5 ensemble (i; blue) and ALL ensemble (ii; gray). The box element spans the 25th to 75th percentile of the distribution; mean SAT change is indicated by the horizontal line within the box. The whisker element spans the 5th to 95th percentile. DJF NEU SAT change is shown in <bold>(a)</bold>, and JJA MED SAT change is shown in <bold>(b)</bold>.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020-f08.png"/>

      </fig>

<?xmltex \hack{\clearpage}?>
</app>

<?pagebreak page828?><app id="App1.Ch1.S3">
  <?xmltex \currentcnt{C}?><label>Appendix C</label><title>Emergent predictor relationships</title>
      <p id="d1e3962">In addition to relationships between past and future (estimated residual thermodynamic) trends (Fig. <xref ref-type="fig" rid="Ch1.F2"/>), emergent relationships among the remaining predictors we use to represent climate are shown in Figs. <xref ref-type="fig" rid="App1.Ch1.S3.F9"/> and <xref ref-type="fig" rid="App1.Ch1.S3.F10"/>. Linear relationships are clear for climatological averages in both seasons; multi-model ensemble member climatological biases are more or less unchanged from past to future, with members that have a hotter mean state climate than other members during the historical period also tending to have a hotter mean state climate than other members in the future. Similarly, members with lower or higher domain-averaged SLP values in the historical period tend to remain lower or higher in the future. This relationship is explored spatially in Figs. <xref ref-type="fig" rid="App1.Ch1.S3.F11"/> and <xref ref-type="fig" rid="App1.Ch1.S3.F12"/>. Mean states within SMILEs tend to cluster together. With the exception of JJA MED SLP climatology (Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F10"/>b), the addition of the SMILEs does not change the linear relationship found in the CMIP5 multi-model ensemble.</p>
      <p id="d1e3978">For variability (standard deviation over the given period), members of SMILEs differ as much from each other as from other multi-model ensemble members in DJF (Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F9"/>c and d). In JJA (Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F10"/>c and d), several members of the CMIP5 multi-model ensemble have domain-averaged variability that falls outside the distribution of SMILE members. The addition of the SMILEs to the CMIP5 multi-model ensemble reduces correlations between historical and future variability for SAT and SLP in both seasons. This is particularly striking in JJA when the correlations tend to be due to the CMIP5 multi-model ensemble outliers.</p>
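      <p>The comparison between fits computed for CMIP5 alone and for all output can be illustrated with an ordinary least-squares line and its coefficient of determination. The sketch below uses synthetic historical and future values, not model output, and hypothetical names; it only shows how the two fits could be computed side by side.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def linear_fit_r2(historical, future):
    """Least-squares line future = slope * historical + intercept, with R^2."""
    slope, intercept = np.polyfit(historical, future, deg=1)
    residual = future - (slope * historical + intercept)
    total = future - future.mean()
    r2 = 1.0 - np.sum(residual ** 2) / np.sum(total ** 2)
    return slope, intercept, r2


# Synthetic example: a broad multi-model spread plus one tightly clustered
# large ensemble (all values are made up for illustration).
rng = np.random.default_rng(2)
cmip5_hist = rng.normal(loc=0.0, scale=2.0, size=40)
cmip5_fut = cmip5_hist + 4.0 + rng.normal(scale=0.5, size=40)
smile_hist = rng.normal(loc=1.0, scale=0.1, size=50)
smile_fut = smile_hist + 4.0 + rng.normal(scale=0.5, size=50)

fit_cmip5 = linear_fit_r2(cmip5_hist, cmip5_fut)
fit_all = linear_fit_r2(
    np.concatenate([cmip5_hist, smile_hist]),
    np.concatenate([cmip5_fut, smile_fut]),
)
```
]]></preformat>
      <p>Because a large ensemble adds many points with nearly the same historical value, it can change the share of variance explained by the fit without changing the underlying relationship, which is worth keeping in mind when comparing the two sets of values.</p>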
      <p id="d1e3985">Because the SLP predictor domain has a larger spatial extent than the SAT predictor domains, we also assess the spatial patterns of climatological SLP that average to the lowest and highest domain-average values in the 1990–2009 climatological period (Figs. <xref ref-type="fig" rid="App1.Ch1.S3.F11"/> and <xref ref-type="fig" rid="App1.Ch1.S3.F12"/>). The “end-members” illustrate the climatological emergent constraint relationship seen in Figs. <xref ref-type="fig" rid="App1.Ch1.S3.F9"/> and <xref ref-type="fig" rid="App1.Ch1.S3.F10"/> in terms of pattern, which is important for a field like SLP that tends to feature dipoles on basin and continental scales. For simplicity, we compare the end-members to one observational estimate from ERA-20C.</p>
      <p id="d1e3996"><?xmltex \hack{\newpage}?>In winter, multi-model ensemble members tend to feature similar-to-observed spatial patterns of climatological SLP in the predictor domain, with a low-pressure center over the high-latitude North Atlantic and a region of high pressure over the Eurasian continent (Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F11"/>). For the member with the lowest domain average, the difference arises from a further extension of the low-pressure center across northern Europe and a weaker high-pressure center than observed, especially in the vicinity of the Tibetan Plateau (Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F11"/>ii and v). For the member with the highest domain average, the difference arises from high-pressure features over high-altitude regions, such as Greenland and the Tibetan Plateau (Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F11"/>iii and vi).</p>
      <p id="d1e4007">In summer, members differ in spatial patterns of climatological SLP in the predictor domain, though most feature a high-pressure center over the subtropical North Atlantic and lower pressure over the Eurasian continent, as seen in ERA-20C (Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F12"/>). The member with the lowest domain average features the aforementioned spatial pattern but with a higher-than-observed amplitude, i.e., both a higher North Atlantic subtropical high-pressure center and a lower region of continental low pressure (Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F12"/>ii and v). In contrast, the member with the highest domain average has high pressure over the entire Atlantic basin as well as over Greenland and the Tibetan Plateau (Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F12"/>iii and vi). Most importantly, in all cases, the climatological behavior of the past continues into the future, which supports the primary tenet of an emergent constraint.</p><?xmltex \hack{\clearpage}?><?xmltex \floatpos{h!}?><fig id="App1.Ch1.S3.F9"><?xmltex \currentcnt{C1}?><label>Figure C1</label><caption><p id="d1e4018">Predictor relationships in DJF comparing domain-averaged climate in two historical periods, (i) 1950–1969 and (ii) 1990–2009, to a future period, 2080–2099, in all panels. Observational estimates in the respective historical periods are indicated with a solid vertical line (ERA-20C SAT and SLP) and dashed vertical black line (BEST SAT and NOAA-20C SLP) in each panel. 
<bold>(a)</bold> NEU SAT climatology (<inline-formula><mml:math id="M182" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C), <bold>(b)</bold> SLP climatology averaged over the predictor region (hPa), <bold>(c)</bold> NEU SAT standard deviation (<inline-formula><mml:math id="M183" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C), and <bold>(d)</bold> SLP standard deviation averaged over the predictor region (hPa), each aggregated over the two historical periods, are eight of the nine predictors used to determine RMSE member performance and independence in Fig. <xref ref-type="fig" rid="Ch1.F3"/>. Least-squares regression fits (solid lines) and <inline-formula><mml:math id="M184" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values computed solely for the CMIP5 output are shown in blue, and those computed for all output (CMIP5 and the three SMILEs) are shown in black.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020-f09.png"/>

      </fig>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S3.F10"><?xmltex \currentcnt{C2}?><label>Figure C2</label><caption><p id="d1e4075">As in Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F9"/>, but for JJA and the MED region in <bold>(a)</bold> and <bold>(c)</bold>.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020-f10.png"/>

      </fig>

<?xmltex \hack{\clearpage}?><?xmltex \floatpos{h!}?><fig id="App1.Ch1.S3.F11"><?xmltex \currentcnt{C3}?><label>Figure C3</label><caption><p id="d1e4097">The spatial pattern of DJF SLP climatology for 1950–1969 (i–iii), 1990–2009 (iv–vi), and 2080–2099 (vii–viii). The ERA-20C observational estimate of SLP climatology is shown in i and iv. The ensemble member with the lowest domain-average SLP climatology for the 1990–2009 historical period is shown in ii, v, and vii. The ensemble member with the highest domain-average SLP climatology for the 1990–2009 period is shown in iii, vi, and viii.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020-f11.png"/>

      </fig>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S3.F12"><?xmltex \currentcnt{C4}?><label>Figure C4</label><caption><p id="d1e4111">As in Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F11"/>, but for JJA SLP climatology.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://esd.copernicus.org/articles/11/807/2020/esd-11-807-2020-f12.png"/>

      </fig>

<?xmltex \hack{\clearpage}?>
</app>
  </app-group><notes notes-type="dataavailability"><title>Data availability</title>

      <p id="d1e4130">CMIP5 data were obtained from <uri>https://esgf-node.llnl.gov/projects/cmip5/</uri> (last access: July 2019) <xref ref-type="bibr" rid="bib1.bibx20" id="paren.87"/>. The CESM1.2.2 large ensemble was generated at ETH Zürich and is available upon request. The CanESM2 large ensemble was generated by Environment and Climate Change Canada's Canadian Centre for Climate Modelling and Analysis and is available at <uri>http://open.canada.ca/data/en/dataset/aa7b6823-fd1e-49ff-a6fb-68076a4a477c</uri> (last access: August 2019) <xref ref-type="bibr" rid="bib1.bibx21" id="paren.88"/>. The MPI Grand Ensemble was generated at the Max Planck Institute for Meteorology and is available at <uri>https://esgf-data.dkrz.de/projects/mpi-ge/</uri> (last access: August 2019) <xref ref-type="bibr" rid="bib1.bibx49" id="paren.89"/>. ERA-20C data are provided by the ECMWF and were obtained from <uri>https://apps.ecmwf.int/datasets/data/era20c-moda/levtype=sfc/type=an/</uri> (last access: May 2019) <xref ref-type="bibr" rid="bib1.bibx19" id="paren.90"/>.</p>
  </notes><notes notes-type="codeavailability"><title>Code availability</title>

      <p id="d1e4161">The weighting protocol is available as a Python package and can be obtained via GitHub (<ext-link xlink:href="https://doi.org/10.5281/zenodo.4028924" ext-link-type="DOI">10.5281/zenodo.4028924</ext-link>) <xref ref-type="bibr" rid="bib1.bibx54" id="paren.91"/> under the GPLv3 license.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e4173">RK, RL, and LB conceived of and wrote the weighting scheme Python package. ALM and LB implemented the weighting scheme with contributions from RL. ALM, LB, and IK analyzed the output. ALM wrote the paper with contributions from all co-authors.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e4179">The authors declare that they have no conflict of interest.</p>
  </notes><notes notes-type="sistatement"><title>Special issue statement</title>

      <p id="d1e4185">This article is part of the special issue “Large Ensemble Climate Model Simulations: Exploring Natural Variability, Change Signals, and Impacts”. It is a result of the EGU General Assembly 2019, Vienna, Austria, 7–12 April 2019.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e4191">We would like to thank Nicola Maher, Flavio Lehner, Angeline Pendergrass, Sebastian Sippel, and two anonymous reviewers for their helpful comments on this paper. This project was funded by the European Union's Horizon 2020 research and innovation program under grant agreements 641816 (CRESCENDO) and 776613 (EUCP). We acknowledge the World Climate Research Programme's Working Group on Coupled Modelling, which is responsible for CMIP, and we thank the climate modeling groups for producing and making available their model
output.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e4196">This research has been supported by the European Union's Horizon 2020 research and innovation program (grant nos. 641816 and 776613).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e4202">This paper was edited by Sebastian Milinski and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Abramowitz et al.(2008)Abramowitz, Leuning, Clark, and
Pitman</label><?label abramowitz08?><mixed-citation>Abramowitz, G., Leuning, R., Clark, M., and Pitman, A.: Evaluating the
performance of land surface models, J. Climate, 21, 5468–5481,
<ext-link xlink:href="https://doi.org/10.1175/2008JCLI2378.1" ext-link-type="DOI">10.1175/2008JCLI2378.1</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Abramowitz et al.(2019)Abramowitz, Herger, Gutmann, Hammerling,
Knutti, Leduc, Lorenz, Pincus, and Schmidt</label><?label abramowitz19?><mixed-citation>Abramowitz, G., Herger, N., Gutmann, E., Hammerling, D., Knutti, R., Leduc, M., Lorenz, R., Pincus, R., and Schmidt, G. A.: ESD Reviews: Model dependence in multi-model climate ensembles: weighting, sub-selection and out-of-sample testing, Earth Syst. Dynam., 10, 91–105, <ext-link xlink:href="https://doi.org/10.5194/esd-10-91-2019" ext-link-type="DOI">10.5194/esd-10-91-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Allen and Ingram(2002)</label><?label allen02?><mixed-citation>Allen, M. R. and Ingram, W. J.: Constraints on future changes in climate and
the hydrologic cycle, Nature, 419, 228–232, <ext-link xlink:href="https://doi.org/10.1038/nature01092" ext-link-type="DOI">10.1038/nature01092</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Annan and Hargreaves(2017)</label><?label annan17?><mixed-citation>Annan, J. D. and Hargreaves, J. C.: On the meaning of independence in climate
science, Earth Syst. Dynam., 8, 211–224, <ext-link xlink:href="https://doi.org/10.5194/esd-8-211-2017" ext-link-type="DOI">10.5194/esd-8-211-2017</ext-link>,
2017.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Arora et al.(2011)Arora, Scinocca, Boer, Christian, Denman, Flato,
Kharin, Lee, and Merryfield</label><?label arora11?><mixed-citation>Arora, V. K., Scinocca, J. F., Boer, G. J., Christian, J. R., Denman, K. L.,
Flato, G. M., Kharin, V. V., Lee, W. G., and Merryfield, W. J.: Carbon emission limits required to satisfy future representative concentration
pathways of greenhouse gases, Geophys. Res. Lett., 38, L05805,
<ext-link xlink:href="https://doi.org/10.1029/2010GL046270" ext-link-type="DOI">10.1029/2010GL046270</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Bishop and Abramowitz(2013)</label><?label bishop13?><mixed-citation>Bishop, C. and Abramowitz, G.: Climate model dependence and the replicate Earth paradigm, Clim. Dynam., 41, 885–900, <ext-link xlink:href="https://doi.org/10.1007/s00382-012-1610-y" ext-link-type="DOI">10.1007/s00382-012-1610-y</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Boberg and Christensen(2012)</label><?label boberg12?><mixed-citation>Boberg, F. and Christensen, J.: Overestimation of Mediterranean summer
temperature projections due to model deficiencies, Nat. Clim. Change, 2, 433–436, <ext-link xlink:href="https://doi.org/10.1038/nclimate1454" ext-link-type="DOI">10.1038/nclimate1454</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx8"><?xmltex \def\ref@label{{Bo\'{e}(2018)}}?><label>Boé(2018)</label><?label boe18?><mixed-citation>Boé, J.: Interdependency in multimodel climate projections: Component
replication and result similarity, Geophys. Res. Lett., 45, 2771–2779, <ext-link xlink:href="https://doi.org/10.1002/2017GL076829" ext-link-type="DOI">10.1002/2017GL076829</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Borodina and Knutti(2017)</label><?label borodina17?><mixed-citation>Borodina, A., Fischer, E. M., and Knutti, R.: Emergent Constraints in Climate Projections: A Case Study of Changes in High-Latitude Temperature Variability, J. Climate, 30, 3655–3670, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-16-0662.1" ext-link-type="DOI">10.1175/JCLI-D-16-0662.1</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Branstator and Teng(2017)</label><?label branstator17?><mixed-citation>Branstator, G. and Teng, H.: Tropospheric Waveguide Teleconnections and Their Seasonality, J. Atmos. Sci., 74, 1513–1532, <ext-link xlink:href="https://doi.org/10.1175/JAS-D-16-0305.1" ext-link-type="DOI">10.1175/JAS-D-16-0305.1</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Brunner et al.(2019)Brunner, Lorenz, Zumwald, and Knutti</label><?label brunner19?><mixed-citation>Brunner, L., Lorenz, R., Zumwald, M., and Knutti, R.: Quantifying uncertainty
in European climate projections using combined performance-independence
weighting, Environ. Res. Lett., 14, 124010, <ext-link xlink:href="https://doi.org/10.1088/1748-9326/ab492f" ext-link-type="DOI">10.1088/1748-9326/ab492f</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Christensen and Boberg(2012)</label><?label christensen12?><mixed-citation>Christensen, J. H. and Boberg, F.: Temperature dependent climate projection
deficiencies in CMIP5 models, Geophys. Res. Lett., 39, L24705,
<ext-link xlink:href="https://doi.org/10.1029/2012GL053650" ext-link-type="DOI">10.1029/2012GL053650</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Collins et al.(2013)Collins, Knutti, Arblaster, Dufresne, Fichefet,
Friedlingstein, Gao, Gutowski, Johns, Krinner, Shongwe, Tebaldi, Weaver, and Wehner</label><?label collins13?><mixed-citation>Collins, M., Knutti, R., Arblaster, J., Dufresne, J.-L., Fichefet, T.,
Friedlingstein, P., Gao, X., Gutowski, W., Johns, T., Krinner, G., Shongwe,
M., Tebaldi, C., Weaver, A., and Wehner, M.: Long-term Climate Change:
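Projections, Commitments and Irreversibility, in: Climate Change 2013: The Physical Science Basis. Contribution of Working Group I to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change, chap. 12, Cambridge University Press, Cambridge, UK and New York, NY, USA, 1029–1136, <ext-link xlink:href="https://doi.org/10.1017/CBO9781107415324.024" ext-link-type="DOI">10.1017/CBO9781107415324.024</ext-link>, 2013.</mixed-citation></ref>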
      <ref id="bib1.bibx14"><label>Compo et al.(2011)Compo, Whitaker, Sardeshmukh, Matsui, Allan, Yin,
Gleason, Vose, Rutledge, Bessemoulin, Brönnimann, Brunet, Crouthamel, Grant, Groisman, Jones, Kruk, Kruger, Marshall, Maugeri, Mok, Nordli, Ross, Trigo, Wang, Woodruff, and Worley</label><?label compo11?><mixed-citation>Compo, G., Whitaker, J., Sardeshmukh, P., Matsui, N., Allan, R., Yin, X.,
Gleason, B., Vose, R., Rutledge, G., Bessemoul<?pagebreak page832?>in, P., Brönnimann, S.,
Brunet, M., Crouthamel, R., Grant, A., Groisman, P., Jones, P., Kruk, M.,
Kruger, A., Marshall, G., Maugeri, M., Mok, H., Nordli, Ø., Ross, T., Trigo, R., Wang, X., Woodruff, S., and Worley, S.: The Twentieth Century Reanalysis Project, Q. J. Roy. Meteorol. Soc., 137, 1–28, <ext-link xlink:href="https://doi.org/10.1002/qj.776" ext-link-type="DOI">10.1002/qj.776</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Deser et al.(2012)Deser, Knutti, Solomon, and Phillips</label><?label deser12?><mixed-citation>Deser, C., Knutti, R., Solomon, S., and Phillips, A. S.: Communication of the
role of natural variability in future North American climate, Nat. Clim.
Change, 2, 775–779, <ext-link xlink:href="https://doi.org/10.1038/NCLIMATE1562" ext-link-type="DOI">10.1038/NCLIMATE1562</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Deser et al.(2014)Deser, Phillips, Alexander, and Smoliak</label><?label deser14?><mixed-citation>Deser, C., Phillips, A., Alexander, M. A., and Smoliak, B. V.: Projecting
North American climate over the next 50 years: Uncertainty due to internal variability, J. Climate, 27, 2271–2296, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-13-00451.1" ext-link-type="DOI">10.1175/JCLI-D-13-00451.1</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Deser et al.(2015)Deser, Tomas, and Sun</label><?label deser15?><mixed-citation>Deser, C., Tomas, R., and Sun, L.: The Role of Ocean–Atmosphere Coupling in the Zonal-Mean Atmospheric Response to Arctic Sea Ice Loss, J. Climate, 28, 2168–2186, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-14-00325.1" ext-link-type="DOI">10.1175/JCLI-D-14-00325.1</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Deser et al.(2016)Deser, Terray, and Phillips</label><?label deser16?><mixed-citation>Deser, C. A., Terray, L., and Phillips, A. S.: Forced and internal components
of winter air temperature trends over North America during the past 50 years: Mechanisms and implications, J. Climate, 29, 2237–2258, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-15-0304.1" ext-link-type="DOI">10.1175/JCLI-D-15-0304.1</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>ECMWF(2019)</label><?label ECMWF2019?><mixed-citation>ECMWF: ERA-20C Output, available at: <uri>https://apps.ecmwf.int/datasets/data/era20c-moda/levtype=sfc/type=an/</uri>, last access: May 2019.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>ESGF(2019)</label><?label ESGF2019?><mixed-citation>ESGF: WCRP Coupled Model Intercomparison Project (Phase 5), available at: <uri>https://esgf-node.llnl.gov/projects/cmip5/</uri>, last access: July 2019.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Environment and Climate Change Canada(2019)</label><?label Environment2019?><mixed-citation>Environment and Climate Change Canada: CanESM2 Large Ensembles Output, available at: <uri>https://open.canada.ca/data/en/dataset/aa7b6823-fd1e-49ff-a6fb-68076a4a477c</uri>, last access: August 2019.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Eyring et al.(2016)Eyring, Bony, Meehl, Senior, Stevens, Stouffer,
and Taylor</label><?label eyring16?><mixed-citation>Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer,
R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958, <ext-link xlink:href="https://doi.org/10.5194/gmd-9-1937-2016" ext-link-type="DOI">10.5194/gmd-9-1937-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Giorgetta et al.(2013)Giorgetta, Jungclaus, Reick, Legutke, Bader,
Böttinger, Brovkin, Crueger, Esch, Fieg, Glushak, Gayler, Haak, Hollweg,
Ilyina, Kinne, Kornblueh, Matei, Mauritsen, Mikolajewicz, Mueller, Notz,
Pithan, Raddatz, Rast, Redler, Roeckner, Schmidt, Schnur, Segschneider, Six, Stockhause, Timmreck, Wegner, Widmann, Wieners, Claussen, Marotzke, and Stevens</label><?label giorgetta13?><mixed-citation>Giorgetta, M. A., Jungclaus, J., Reick, C. H., Legutke, S., Bader, J.,
Böttinger, M., Brovkin, V., Crueger, T., Esch, M., Fieg, K., Glushak, K.,
Gayler, V., Haak, H., Hollweg, H., Ilyina, T., Kinne, S., Kornblueh, L.,
Matei, D., Mauritsen, T., Mikolajewicz, U., Mueller, W., Notz, D., Pithan,
F., Raddatz, T., Rast, S., Redler, R., Roeckner, E., Schmidt, H., Schnur, R.,
Segschneider, J., Six, K. D., Stockhause, M., Timmreck, C., Wegner, J.,
Widmann, H., Wieners, K., Claussen, M., Marotzke, J., and Stevens, B.:
Climate and carbon cycle changes from 1850 to 2100 in MPI-ESM simulations
for the Coupled Model Intercomparison Project phase 5, J. Adv. Model. Earth Syst., 5, 572–597, <ext-link xlink:href="https://doi.org/10.1002/jame.20038" ext-link-type="DOI">10.1002/jame.20038</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Guo et al.(2019)Guo, Deser, Terray, and Lehner</label><?label guo19?><mixed-citation>Guo, R., Deser, C., Terray, L., and Lehner, F.: Human influence on winter
precipitation trends (1921–2015) over North America and Eurasia revealed by dynamical adjustment, Geophys. Res. Lett., 46, 3426–3434, <ext-link xlink:href="https://doi.org/10.1029/2018GL081316" ext-link-type="DOI">10.1029/2018GL081316</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Hall and Manabe(1999)</label><?label hall99?><mixed-citation>Hall, A. and Manabe, S.: The Role of Water Vapor Feedback in Unperturbed Climate Variability and Global Warming, J. Climate, 12, 2327–2346,
<ext-link xlink:href="https://doi.org/10.1175/1520-0442(1999)012&lt;2327:TROWVF&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0442(1999)012&lt;2327:TROWVF&gt;2.0.CO;2</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Hawkins and Sutton(2009)</label><?label hawkins09?><mixed-citation>Hawkins, E. and Sutton, R.: The Potential to Narrow Uncertainty in Regional Climate Predictions, B. Am. Meteorol. Soc., 90, 1095–1108,
<ext-link xlink:href="https://doi.org/10.1175/2009BAMS2607.1" ext-link-type="DOI">10.1175/2009BAMS2607.1</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Hawkins et al.(2016)Hawkins, Smith, Gregory, and
Stainforth</label><?label hawkins16?><mixed-citation>Hawkins, E., Smith, R. S., Gregory, J. M., and Stainforth, D. A.: Irreducible
uncertainty in near-term climate projections, Clim. Dynam., 46, 3807–3819,
<ext-link xlink:href="https://doi.org/10.1007/s00382-015-2806-8" ext-link-type="DOI">10.1007/s00382-015-2806-8</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx28"><?xmltex \def\ref@label{{Herger et~al.(2018)Herger, Abramowitz, Knutti, Ang\'{e}lil, Lehmann,
and Sanderson}}?><label>Herger et al.(2018)Herger, Abramowitz, Knutti, Angélil, Lehmann,
and Sanderson</label><?label herger18?><mixed-citation>Herger, N., Abramowitz, G., Knutti, R., Angélil, O., Lehmann, K., and
Sanderson, B. M.: Selecting a climate model subset to optimise key ensemble
properties, Earth Syst. Dynam., 9, 135–151, <ext-link xlink:href="https://doi.org/10.5194/esd-9-135-2018" ext-link-type="DOI">10.5194/esd-9-135-2018</ext-link>,
2018.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Hourdin et al.(2017)Hourdin, Mauritsen, Gettelman, Golaz, Balaji,
Duan, Folini, Ji, Klocke, Qian, Rauser, Rio, Tomassini, Watanabe, and
Williamson</label><?label hourdin17?><mixed-citation>Hourdin, F., Mauritsen, T., Gettelman, A., Golaz, J., Balaji, V., Duan, Q.,
Folini, D., Ji, D., Klocke, D., Qian, Y., Rauser, F., Rio, C., Tomassini, L.,
Watanabe, M., and Williamson, D.: The Art and Science of Climate Model Tuning, B. Am. Meteorol. Soc., 98, 589–602, <ext-link xlink:href="https://doi.org/10.1175/BAMS-D-15-00135.1" ext-link-type="DOI">10.1175/BAMS-D-15-00135.1</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Hurrell et al.(2013)Hurrell, Holland, Gent, Ghan, Kay, Kushner,
Lamarque, Large, Lawrence, Lindsay, Lipscomb, Long, Mahowald, Marsh, Neale,
Rasch, Vavrus, Vertenstein, Bader, Collins, Hack, Kiehl, and
Marshall</label><?label hurrell13?><mixed-citation>Hurrell, J., Holland, M., Gent, P., Ghan, S., Kay, J., Kushner, P., Lamarque,
J., Large, W., Lawrence, D., Lindsay, K., Lipscomb, W., Long, M., Mahowald,
N., Marsh, D., Neale, R., Rasch, P., Vavrus, S., Vertenstein, M., Bader, D.,
Collins, W., Hack, J., Kiehl, J., and Marshall, S.: The Community Earth
System Model: A Framework for Collaborative Research, B. Am. Meteorol. Soc., 94, 1339–1360, <ext-link xlink:href="https://doi.org/10.1175/BAMS-D-12-00121.1" ext-link-type="DOI">10.1175/BAMS-D-12-00121.1</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Karlsson and Svensson(2013)</label><?label karlsson13?><mixed-citation>Karlsson, J. and Svensson, G.: Consequences of poor representation of Arctic
sea–ice albedo and cloud–radiation interactions in the CMIP5 model ensemble, Geophys. Res. Lett., 40, 4374–4379, <ext-link xlink:href="https://doi.org/10.1002/grl.50768" ext-link-type="DOI">10.1002/grl.50768</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Kay et al.(2015)Kay, Deser, Phillips, Mai, Hannay, Strand, Arblaster, Bates, Danabasoglu, Edwards, Holland, Kushner, Lamarque, Lawrence, Lindsay, Middleton, Munoz, Neale, Oleson, Polvani, and Vertenstein</label><?label kay15?><mixed-citation>Kay, J. E., Deser, C., Phillips, A., Mai, A., Hannay, C., Strand, G.,
Arblaster, J., Bates, S., Danabasoglu, G., Edwards, J., Holland, M., Kushner,
P., Lamarque, J. F., Lawrence, D., Lindsay, K., Middleton, A., Munoz, E.,
Neale, R., Oleson, K., Polvani, L., and Vertenstein, M.: The Community Earth System Model (CESM) Large Ensemble Project: A community resource for studying climate change in the presence of internal climate variability, B. Am. Meteorol. Soc., 96, 1333–1349, <ext-link xlink:href="https://doi.org/10.1175/BAMS-D-13-00255.1" ext-link-type="DOI">10.1175/BAMS-D-13-00255.1</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Knutti(2010)</label><?label knutti10c?><mixed-citation>Knutti, R.: The end of model democracy?, Climatic Change, 102, 395–404,
<ext-link xlink:href="https://doi.org/10.1007/s10584-010-9800-2" ext-link-type="DOI">10.1007/s10584-010-9800-2</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx34"><?xmltex \def\ref@label{{Knutti and Sedl\'{a}\v{c}ek(2013)}}?><label>Knutti and Sedláček(2013)</label><?label knutti13a?><mixed-citation>Knutti, R. and Sedláček, J.: Robustness and Uncertainties in the New
CMIP5 Climate Model Projections, Nat. Clim. Change, 3, 369–373,
<ext-link xlink:href="https://doi.org/10.1038/NCLIMATE1716" ext-link-type="DOI">10.1038/NCLIMATE1716</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Knutti et al.(2010a)Knutti, Abramowitz, Collins, Eyring,
Gleckler, Hewitson, and Mearns</label><?label knutti10a?><mixed-citation>
Knutti, R., Abramowitz, G., Collins, M., Eyring, V., Gleckler, P., Hewitson,
B., and Mearns, L.: Good Practice Guidance Paper on Assessing and Combining Multi Model Climate Projections, in: Meeting Report of the Intergovernmental Panel on Climate Change Expert Meeting on Assessing and Combining Multi Model Climate Projections, edited by: Stocker, T., Qin, D., Plattner, G.-K., Tignor, M., and Midgley, P., IPCC Working Group I Technical Support Unit, University of Bern, Bern, Switzerland, 2010a.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Knutti et al.(2010b)Knutti, Furrer, Tebaldi, Cermak, and
Meehl</label><?label knutti10b?><mixed-citation>Knutti, R., Furrer, R., Tebaldi, C., Cermak, J., and Meehl, G.: Challenges in
Combining Projections from Multiple Climate Models, J. Climate, 23, 2739–2758, <ext-link xlink:href="https://doi.org/10.1175/2009JCLI3361.1" ext-link-type="DOI">10.1175/2009JCLI3361.1</ext-link>, 2010b.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Knutti et al.(2013)Knutti, Masson, and Gettelman</label><?label knutti13b?><mixed-citation>Knutti, R., Masson, D., and Gettelman, A.: Climate model genealogy:
Generation CMIP5 and how we got there, Geophys. Res. Lett., 40, 1194–1199, <ext-link xlink:href="https://doi.org/10.1002/grl.50256" ext-link-type="DOI">10.1002/grl.50256</ext-link>, 2013.</mixed-citation></ref>
      <?pagebreak page833?><ref id="bib1.bibx38"><?xmltex \def\ref@label{{Knutti et~al.(2017)Knutti, Sedl\'{a}\v{c}ek, Sanderson, Lorenz,
Fischer, and Eyring}}?><label>Knutti et al.(2017)Knutti, Sedláček, Sanderson, Lorenz,
Fischer, and Eyring</label><?label knutti17?><mixed-citation>Knutti, R., Sedláček, J., Sanderson, B. M., Lorenz, R., Fischer, E., and Eyring, V.: A climate model projection weighting scheme accounting for
performance and interdependence, Geophys. Res. Lett., 44, 1909–1918, <ext-link xlink:href="https://doi.org/10.1002/2016GL072012" ext-link-type="DOI">10.1002/2016GL072012</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Kunreuther et al.(2013)Kunreuther, Heal, Allen, Edenhofer, Field, and Yohe</label><?label kunreuther13?><mixed-citation>Kunreuther, H., Heal, G., Allen, M., Edenhofer, O., Field, C. B., and Yohe, G.: Risk management and climate change, Nat. Clim. Change, 3, 447–450,
<ext-link xlink:href="https://doi.org/10.1038/NCLIMATE1740" ext-link-type="DOI">10.1038/NCLIMATE1740</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx40"><?xmltex \def\ref@label{{Leduc et~al.(2016)Leduc, Laprise, de~El\'{i}a, and \v{S}eparovi\'{c}}}?><label>Leduc et al.(2016)Leduc, Laprise, de Elía, and Šeparović</label><?label leduc16?><mixed-citation>Leduc, M., Laprise, R., de Elía, R., and Šeparović, L.: Is Institutional Democracy a Good Proxy for Model Independence?, J. Climate, 29, 8301–8316, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-15-0761.1" ext-link-type="DOI">10.1175/JCLI-D-15-0761.1</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Lehner et al.(2017)Lehner, Deser, and Terray</label><?label lehner17?><mixed-citation>Lehner, F., Deser, C., and Terray, L.: Towards a new estimate of “time of
emergence” of anthropogenic warming: insights from dynamical adjustment and a large initial-condition model ensemble, J. Climate, 30, 7739–7756,
<ext-link xlink:href="https://doi.org/10.1175/JCLI-D-16-0792.1" ext-link-type="DOI">10.1175/JCLI-D-16-0792.1</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Lehner et al.(2020)Lehner, Deser, Maher, Marotzke, Fischer, Brunner, Knutti, and Hawkins</label><?label lehner20?><mixed-citation>Lehner, F., Deser, C., Maher, N., Marotzke, J., Fischer, E. M., Brunner, L.,
Knutti, R., and Hawkins, E.: Partitioning climate projection uncertainty with
multiple large ensembles and CMIP5/6, Earth Syst. Dynam., 11, 491–508,
<ext-link xlink:href="https://doi.org/10.5194/esd-11-491-2020" ext-link-type="DOI">10.5194/esd-11-491-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Li and Xie(2012)</label><?label li12?><mixed-citation>Li, G. and Xie, S.: Origins of tropical-wide SST biases in CMIP multi-model
ensembles, Geophys. Res. Lett., 39, L22703, <ext-link xlink:href="https://doi.org/10.1029/2012GL053777" ext-link-type="DOI">10.1029/2012GL053777</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Liu et al.(2012)Liu, Allan, and Huffman</label><?label liu12?><mixed-citation>Liu, C., Allan, R. P., and Huffman, G. J.: Co-variation of temperature and
precipitation in CMIP5 models and satellite observations, Geophys. Res. Lett., 39, L13803, <ext-link xlink:href="https://doi.org/10.1029/2012GL052093" ext-link-type="DOI">10.1029/2012GL052093</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx45"><?xmltex \def\ref@label{{Lorenz et~al.(2018)Lorenz, Herger, Sedl\'{a}\v{c}ek, Eyring, Fischer,
and Knutti}}?><label>Lorenz et al.(2018)Lorenz, Herger, Sedláček, Eyring, Fischer,
and Knutti</label><?label lorenz18?><mixed-citation>Lorenz, R., Herger, N., Sedláček, J., Eyring, V., Fischer, E. M., and
Knutti, R.: Prospects and caveats of weighting climate models for summer
maximum temperature projections over North America, J. Geophys. Res.-Atmos., 123, 4509–4526, <ext-link xlink:href="https://doi.org/10.1029/2017JD027992" ext-link-type="DOI">10.1029/2017JD027992</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Maher et al.(2019)Maher, Milinski, Suarez-Gutierrez, Botzet,
Dobrynin, Kornblueh, Krüger, Takano, Ghosh, Hedemann, Li, Li, Manzini, Notz, Putrasahan, Boysen, Claussen, Ilyina, Olonscheck, Raddatz, Stevens, and Marotzke</label><?label maher19?><mixed-citation>Maher, N., Milinski, S., Suarez-Gutierrez, L., Botzet, M., Dobrynin, M.,
Kornblueh, L., Krüger, J., Takano, Y., Ghosh, R., Hedemann, C., Li, C., Li, H., Manzini, E., Notz, D., Putrasahan, D., Boysen, L., Claussen, M., Ilyina, T., Olonscheck, D., Raddatz, T., Stevens, B., and Marotzke, J.: The Max Planck Institute Grand Ensemble: Enabling the Exploration of Climate System Variability, J. Adv. Model. Earth Syst., 11, 1–21, <ext-link xlink:href="https://doi.org/10.1029/2019MS001639" ext-link-type="DOI">10.1029/2019MS001639</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Masson and Knutti(2011)</label><?label masson11?><mixed-citation>Masson, D. and Knutti, R.: Climate model genealogy, Geophys. Res. Lett., 38,
L08703, <ext-link xlink:href="https://doi.org/10.1029/2011GL046864" ext-link-type="DOI">10.1029/2011GL046864</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Mauritsen et al.(2012)Mauritsen, Stevens, Roeckner, Crueger, Esch,
Giorgetta, Haak, Jungclaus, Klocke, Matei, Mikolajewicz, Notz, Pincus,
Schmidt, and Tomassini</label><?label mauritsen12?><mixed-citation>Mauritsen, T., Stevens, B., Roeckner, E., Crueger, T., Esch, M., Giorgetta, M., Haak, H., Jungclaus, J., Klocke, D., Matei, D., Mikolajewicz, U., Notz, D., Pincus, R., Schmidt, H., and Tomassini, L.: Tuning the climate of a global model, J. Adv. Model. Earth Syst., 4, M00A01, <ext-link xlink:href="https://doi.org/10.1029/2012MS000154" ext-link-type="DOI">10.1029/2012MS000154</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Max Planck Institute for Meteorology(2019)</label><?label MPI2019?><mixed-citation>Max Planck Institute for Meteorology: MPI Grand Ensemble Output, available at: <uri>https://esgf-data.dkrz.de/projects/mpi-ge/</uri>, last access: August 2019.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>Meehl et al.(2000)Meehl, Boer, Covey, Latif, and Stouffer</label><?label meehl00?><mixed-citation>
Meehl, G. A., Boer, G. J., Covey, C., Latif, M., and Stouffer, R. J.: The
Coupled Model Intercomparison Project (CMIP), B. Am. Meteorol. Soc., 81, 313–318, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Meinshausen et al.(2011)Meinshausen, Smith, Calvin, Daniel, Kainuma, Lamarque, Matsumoto, Montzka, Raper, Riahi, Thomson, Velders, and van Vuuren</label><?label meinshausen11?><mixed-citation>Meinshausen, M., Smith, S. J., Calvin, K., Daniel, J. S., Kainuma, M. L. T.,
Lamarque, J.-F., Matsumoto, K., Montzka, S. A., Raper, S. C. B., Riahi, K.,
Thomson, A., Velders, G. J. M., and van Vuuren, D. P.: The RCP greenhouse gas concentrations and their extensions from 1765 to 2300, Climatic Change, 109, 213–241, <ext-link xlink:href="https://doi.org/10.1007/s10584-011-0156-z" ext-link-type="DOI">10.1007/s10584-011-0156-z</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx52"><label>Merrifield and Xie(2016)</label><?label merrifield16?><mixed-citation>Merrifield, A. and Xie, S.: Summer U.S. Surface Air Temperature Variability: Controlling Factors and AMIP Simulation Biases, J. Climate, 29, 5123–5139, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-15-0705.1" ext-link-type="DOI">10.1175/JCLI-D-15-0705.1</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>Merrifield et al.(2017)Merrifield, Lehner, Xie, and
Deser</label><?label merrifield17?><mixed-citation>Merrifield, A. L., Lehner, F., Xie, S.-P., and Deser, C.: Removing circulation effects to assess central U.S. land–atmosphere interactions in the CESM Large Ensemble, Geophys. Res. Lett., 44, 9938–9946,
<ext-link xlink:href="https://doi.org/10.1002/2017GL074831" ext-link-type="DOI">10.1002/2017GL074831</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Merrifield et al.(2020)</label><?label Merrifield2020?><mixed-citation>Merrifield, A. L., Brunner, L., and Lorenz, R.: ESD_weighting_large_ensembles: Paper Release (Version v1.0), Zenodo, <ext-link xlink:href="https://doi.org/10.5281/zenodo.4028924" ext-link-type="DOI">10.5281/zenodo.4028924</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx55"><label>Miller et al.(2014)Miller, Schmidt, Nazarenko, Tausnev, Bauer,
DelGenio, Kelley, Lo, Ruedy, Shindell, Aleinov, Bauer, Bleck, Canuto, Chen,
Cheng, Clune, Faluvegi, Hansen, Healy, Kiang, Koch, Lacis, LeGrande, Lerner, Menon, Oinas, Pérez García-Pando, Perlwitz, Puma, Rind, Romanou, Russell, Sato, Sun, Tsigaridis, Unger, Voulgarakis, Yao, and Zhang</label><?label miller14?><mixed-citation>Miller, R. L., Schmidt, G. A., Nazarenko, L. S., Tausnev, N., Bauer, S. E.,
DelGenio, A. D., Kelley, M., Lo, K. K., Ruedy, R., Shindell, D. T., Aleinov,
I., Bauer, M., Bleck, R., Canuto, V., Chen, Y., Cheng, Y., Clune, T. L.,
Faluvegi, G., Hansen, J. E., Healy, R. J., Kiang, N. Y., Koch, D., Lacis, A. A., LeGrande, A. N., Lerner, J., Menon, S., Oinas, V., Pérez García-Pando, C., Perlwitz, J. P., Puma, M. J., Rind, D., Romanou, A., Russell, G. L., Sato, M., Sun, S., Tsigaridis, K., Unger, N., Voulgarakis, A., Yao, M.-S., and Zhang, J.: CMIP5 historical simulations (1850–2012) with GISS ModelE2, J. Adv. Model. Earth Syst., 6,
441–478, <ext-link xlink:href="https://doi.org/10.1002/2013MS000266" ext-link-type="DOI">10.1002/2013MS000266</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx56"><label>Mueller and Seneviratne(2014)</label><?label mueller14?><mixed-citation>Mueller, B. and Seneviratne, S. I.: Systematic land climate and evapotranspiration biases in CMIP5 simulations, Geophys. Res. Lett., 41,
128–134, <ext-link xlink:href="https://doi.org/10.1002/2013GL058055" ext-link-type="DOI">10.1002/2013GL058055</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx57"><label>O'Neill et al.(2014)O'Neill, Kriegler, Riahi, Ebi, Hallegatte,
Carter, Mathur, and van Vuuren</label><?label oneill14?><mixed-citation>O'Neill, B. C., Kriegler, E., Riahi, K., Ebi, K. L., Hallegatte, S., Carter,
T. R., Mathur, R., and van Vuuren, D. P.: A new scenario framework for climate change research: the concept of shared socioeconomic pathways,
Climatic Change, 122, 387–400, <ext-link xlink:href="https://doi.org/10.1007/s10584-013-0905-2" ext-link-type="DOI">10.1007/s10584-013-0905-2</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx58"><label>Pennell and Reichler(2011)</label><?label pennell11?><mixed-citation>Pennell, C. and Reichler, T.: On the Effective Number of Climate Models, J. Climate, 24, 2358–2367, <ext-link xlink:href="https://doi.org/10.1175/2010JCLI3814.1" ext-link-type="DOI">10.1175/2010JCLI3814.1</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx59"><label>Pithan et al.(2014)Pithan, Medeiros, and Mauritsen</label><?label pithan14?><mixed-citation>Pithan, F., Medeiros, B., and Mauritsen, T.: Mixed-phase clouds cause climate
model biases in Arctic wintertime temperature inversions, Clim. Dynam., 43,
289–303, <ext-link xlink:href="https://doi.org/10.1007/s00382-013-1964-9" ext-link-type="DOI">10.1007/s00382-013-1964-9</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx60"><?xmltex \def\ref@label{{Poli et~al.(2016)Poli, Hersbach, Dee, Berrisford, Simmons, Vitart,
Laloyaux, Tan, Peubey, Th\'{e}paut, Tr\'{e}molet, H\'{o}lm, Bonavita, Isaksen, and Fisher}}?><label>Poli et al.(2016)Poli, Hersbach, Dee, Berrisford, Simmons, Vitart,
Laloyaux, Tan, Peubey, Thépaut, Trémolet, Hólm, Bonavita, Isaksen, and Fisher</label><?label poli16?><mixed-citation>Poli, P., Hersbach, H., Dee, D., Berrisford, P., Simmons, A., Vitart, F.,
Laloyaux, P., Tan, D., Peubey, C., Thépaut, J., Trémolet, Y., Hólm, E., Bonavita, M., Isaksen, L., and Fisher, M.: ERA-20C: An Atmospheric Reanalysis of the Twentieth Century, J. Climate, 29, 4083–4097,
<ext-link xlink:href="https://doi.org/10.1175/JCLI-D-15-0556.1" ext-link-type="DOI">10.1175/JCLI-D-15-0556.1</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx61"><label>Rohde et al.(2013)Rohde, Muller, Jacobsen, Perlmutter, Rosenfeld, and et al.</label><?label rohde13?><mixed-citation>Rohde, R., Muller, R., Jacobsen, R., Perlmutter, S., and Mosher, S.: Berkeley Earth Temperature Averaging Process, Geoinform. Geostat., 01, 1000103, <ext-link xlink:href="https://doi.org/10.4172/2327-4581.1000103" ext-link-type="DOI">10.4172/2327-4581.1000103</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx62"><label>Rondeau-Genesse and Braun(2019)</label><?label rg19?><mixed-citation>Rondeau-Genesse, G. and Braun, M.: Impact of internal variability on climate
change for the upcoming decades: analysis of the CanESM2-LE and CESM-LE
large ensembles, Climatic Change, 156, 299–314, <ext-link xlink:href="https://doi.org/10.1007/s10584-019-02550-2" ext-link-type="DOI">10.1007/s10584-019-02550-2</ext-link>, 2019.</mixed-citation></ref>
      <?pagebreak page834?><ref id="bib1.bibx63"><label>Sanderson et al.(2015a)Sanderson, Knutti, and
Caldwell</label><?label sanderson05a?><mixed-citation>Sanderson, B. M., Knutti, R., and Caldwell, P.: Addressing interdependency in a multimodel ensemble by interpolation of model properties, J. Climate, 28, 5150–5170, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-14-00361.1" ext-link-type="DOI">10.1175/JCLI-D-14-00361.1</ext-link>, 2015a.</mixed-citation></ref>
      <ref id="bib1.bibx64"><label>Sanderson et al.(2015b)Sanderson, Knutti, and
Caldwell</label><?label sanderson05b?><mixed-citation>Sanderson, B. M., Knutti, R., and Caldwell, P.: A representative democracy to
reduce interdependency in a multimodel ensemble, J. Climate, 28, 5171–5194, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-14-00362.1" ext-link-type="DOI">10.1175/JCLI-D-14-00362.1</ext-link>, 2015b.</mixed-citation></ref>
      <ref id="bib1.bibx65"><label>Seneviratne(2012)</label><?label seneviratne12?><mixed-citation>
Seneviratne, S. I.: Changes in climate extremes and their impacts on the
natural physical environment, in: Managing the Risks of Extreme Events and Disasters to Advance Climate Change Adaptation. A Special Report of Working Groups I and II of the Intergovernmental Panel on Climate Change (IPCC), edited by: Field, C. B., Barros, V., Stocker, T. F., Qin, D., Dokken, D. J., Ebi, K. L., Mastrandrea, M. D., Mach, K. J., Plattner, G. K., Allen, S. K., Tignor, M., and Midgley, P. M., Cambridge University Press, Cambridge, UK and New York, NY, USA, 109–230, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx66"><label>Seneviratne et al.(2010)Seneviratne, Corti, Davin, Hirschi, Jaeger,
Lehner, Orlowsky, and Teuling</label><?label seneviratne10?><mixed-citation>Seneviratne, S., Corti, T., Davin, E., Hirschi, M., Jaeger, E., Lehner, I.,
Orlowsky, B., and Teuling, A.: Investigating soil moisture–climate
interactions in a changing climate: a review, Earth-Sci. Rev., 99, 125–161,
<ext-link xlink:href="https://doi.org/10.1016/j.earscirev.2010.02.004" ext-link-type="DOI">10.1016/j.earscirev.2010.02.004</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx67"><label>Sillmann et al.(2013)Sillmann, Kharin, Zhang, Zwiers, and
Bronaugh</label><?label sillman13?><mixed-citation>Sillmann, J., Kharin, V. V., Zhang, X., Zwiers, F. W., and Bronaugh, D.: Climate extremes indices in the CMIP5 multimodel ensemble: Part 1. Model evaluation in the present climate, J. Geophys. Res.-Atmos., 118, 1716–1733, <ext-link xlink:href="https://doi.org/10.1002/jgrd.50203" ext-link-type="DOI">10.1002/jgrd.50203</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx68"><label>Sippel et al.(2019)Sippel, Meinshausen, Merrifield, Lehner,
Pendergrass, Fischer, and Knutti</label><?label sippel19?><mixed-citation>Sippel, S., Meinshausen, N., Merrifield, A., Lehner, F., Pendergrass, A.,
Fischer, E., and Knutti, R.: Uncovering the Forced Climate Response from a
Single Ensemble Member Using Statistical Learning, J. Climate, 32, 5677–5699, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-18-0882.1" ext-link-type="DOI">10.1175/JCLI-D-18-0882.1</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx69"><label>Slivinski et al.(2019)Slivinski, Compo, Whitaker, Sardeshmukh, Giese, McColl, Allan, Yin, Vose, Titchner, Kennedy, Spencer, Ashcroft, Brönnimann, Brunet, Camuffo, Cornes, Cram, Crouthamel, Domínguez-Castro, Freeman, Gergis, Hawkins, Jones, Jourdain, Kaplan, Kubota, Le Blancq, Lee, Lorrey, Luterbacher, Maugeri, Mock, Moore, Przybylak, Pudmenzky, Reason, Slonosky, Smith, Tinz, Trewin, Valente, Wang, Wilkinson, Wood, and
Wyszyński</label><?label slivinski19?><mixed-citation>Slivinski, L. C., Compo, G. P., Whitaker, J. S., Sardeshmukh, P. D., Giese,
B. S., McColl, C., Allan, R., Yin, X., Vose, R., Titchner, H., Kennedy, J.,
Spencer, L. J., Ashcroft, L., Brönnimann, S., Brunet, M., Camuffo, D.,
Cornes, R., Cram, T. A., Crouthamel, R., Domínguez-Castro, F., Freeman,
J. E., Gergis, J., Hawkins, E., Jones, P. D., Jourdain, S., Kaplan, A., Kubota, H., Le Blancq, F., Lee, T., Lorrey, A., Luterbacher, J., Maugeri, M.,
Mock, C. J., Moore, G. K., Przybylak, R., Pudmenzky, C., Reason, C., Slonosky, V. C., Smith, C. A., Tinz, B., Trewin, B., Valente, M. A., Wang, X. L., Wilkinson, C., Wood, K., and Wyszyński, P.: Towards a more reliable
historical reanalysis: Improvements for version 3 of the Twentieth Century
Reanalysis system, Q. J. Roy. Meteorol. Soc., 145, 2876–2908,
<ext-link xlink:href="https://doi.org/10.1002/qj.3598" ext-link-type="DOI">10.1002/qj.3598</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx70"><label>Stainforth et al.(2007)Stainforth, Allen, Tredger, and
Smith</label><?label stainforth07?><mixed-citation>Stainforth, D., Allen, M., Tredger, E., and Smith, L.: Confidence, uncertainty and decision-support relevance in climate predictions, Philos. T. Roy. Soc. A, 365, 2145–2161, <ext-link xlink:href="https://doi.org/10.1098/rsta.2007.2074" ext-link-type="DOI">10.1098/rsta.2007.2074</ext-link>, 2007.
</mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx71"><label>Stevens et al.(2013)Stevens, Giorgetta, Esch, Mauritsen, Crueger,
Rast, Salzmann, Schmidt, Bader, Block, Brokopf, Fast, Kinne, Kornblueh,
Lohmann, Pincus, Reichler, and Roeckner</label><?label stevens13?><mixed-citation>Stevens, B., Giorgetta, M., Esch, M., Mauritsen, T., Crueger, T., Rast, S.,
Salzmann, M., Schmidt, H., Bader, J., Block, K., Brokopf, R., Fast, I., Kinne, S., Kornblueh, L., Lohmann, U., Pincus, R., Reichler, T., and Roeckner, E.: Atmospheric component of the MPI-M Earth System Model: ECHAM6, J. Adv. Model. Earth Syst., 5, 146–172, <ext-link xlink:href="https://doi.org/10.1002/jame.20015" ext-link-type="DOI">10.1002/jame.20015</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx72"><label>Stouffer et al.(2017)Stouffer, Eyring, Meehl, Bony, Senior, Stevens, and Taylor</label><?label stouffer17?><mixed-citation>Stouffer, R., Eyring, V., Meehl, G., Bony, S., Senior, C., Stevens, B., and
Taylor, K.: CMIP5 Scientific Gaps and Recommendations for CMIP6, B. Am. Meteorol. Soc., 98, 95–105, <ext-link xlink:href="https://doi.org/10.1175/BAMS-D-15-00013.1" ext-link-type="DOI">10.1175/BAMS-D-15-00013.1</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx73"><label>Swart et al.(2018)Swart, Gille, Fyfe, and Gillett</label><?label swart18?><mixed-citation>Swart, N. C., Gille, S. T., Fyfe, J. C., and Gillett, N. P.: Recent Southern
Ocean warming and freshening driven by greenhouse gas emissions and ozone
depletion, Nat. Geosci., 11, 836–841, <ext-link xlink:href="https://doi.org/10.1038/s41561-018-0226-1" ext-link-type="DOI">10.1038/s41561-018-0226-1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx74"><label>Tebaldi and Knutti(2007)</label><?label tebaldi07?><mixed-citation>Tebaldi, C. and Knutti, R.: The Use of the Multi-Model Ensemble in Probabilistic Climate Change Projections, Philos. T. Roy. Soc. A, 365, 2053–2075, <ext-link xlink:href="https://doi.org/10.1098/rsta.2007.2076" ext-link-type="DOI">10.1098/rsta.2007.2076</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx75"><label>Trenberth and Paolino(1980)</label><?label trenberth80?><mixed-citation>Trenberth, K. and Paolino, D.: The Northern Hemisphere Sea-Level Pressure Data Set: Trends, Errors and Discontinuities, Mon. Weather Rev., 108, 855–872, <ext-link xlink:href="https://doi.org/10.1175/1520-0493(1980)108&lt;0855:TNHSLP&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0493(1980)108&lt;0855:TNHSLP&gt;2.0.CO;2</ext-link>,
1980.</mixed-citation></ref>
      <ref id="bib1.bibx76"><label>van den Dool(1994)</label><?label vandendool94?><mixed-citation>van den Dool, H. M.: Searching for analogues, how long must we wait?, Tellus A, 46, 314–324, <ext-link xlink:href="https://doi.org/10.1034/j.1600-0870.1994.t01-2-00006.x" ext-link-type="DOI">10.1034/j.1600-0870.1994.t01-2-00006.x</ext-link>, 1994.</mixed-citation></ref>
      <ref id="bib1.bibx77"><label>van Vuuren et al.(2011)van Vuuren, Edmonds, Kainuma, Riahi, Thomson, Hibbard, Hurtt, Kram, Krey, Lamarque, Masui, Meinshausen, Nakicenovic, Smith, and Rose</label><?label vanvuuren11?><mixed-citation>van Vuuren, D., Edmonds, J., Kainuma, M., Riahi, K., Thomson, A., Hibbard, K., Hurtt, G. C., Kram, T., Krey, V., Lamarque, J.-F., Masui, T., Meinshausen, M., Nakicenovic, N., Smith, S. J., and Rose, S. K.: The representative concentration pathways: an overview, Climatic Change, 109, 5–31, <ext-link xlink:href="https://doi.org/10.1007/s10584-011-0148-z" ext-link-type="DOI">10.1007/s10584-011-0148-z</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx78"><label>Vogel et al.(2018)Vogel, Zscheischler, and Seneviratne</label><?label vogel18?><mixed-citation>Vogel, M. M., Zscheischler, J., and Seneviratne, S. I.: Varying soil
moisture–atmosphere feedbacks explain divergent temperature extremes and
precipitation projections in central Europe, Earth Syst. Dynam., 9, 1107–1125, <ext-link xlink:href="https://doi.org/10.5194/esd-9-1107-2018" ext-link-type="DOI">10.5194/esd-9-1107-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx79"><label>Wallace et al.(2012)Wallace, Fu, Smoliak, Lin, and
Johanson</label><?label wallace12?><mixed-citation>Wallace, J., Fu, Q., Smoliak, B. V., Lin, P., and Johanson, C. M.: Simulated
versus observed patterns of warming over the extratropical Northern Hemisphere continents during the cold season, P. Natl. Acad. Sci. USA, 109, 14337–14342, <ext-link xlink:href="https://doi.org/10.1073/pnas.1204875109" ext-link-type="DOI">10.1073/pnas.1204875109</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx80"><label>Wallace et al.(1995)Wallace, Zhang, and Renwick</label><?label wallace95?><mixed-citation>Wallace, J. M., Zhang, Y., and Renwick, J. A.: Dynamic contribution to
hemispheric mean temperature trends, Science, 270, 780–783,
<ext-link xlink:href="https://doi.org/10.1126/science.270.5237.780" ext-link-type="DOI">10.1126/science.270.5237.780</ext-link>, 1995.</mixed-citation></ref>
      <ref id="bib1.bibx81"><label>Wallace et al.(2015)Wallace, Deser, Smoliak, and
Phillips</label><?label wallace15?><mixed-citation>
Wallace, J. M., Deser, C., Smoliak, B., and Phillips, A.: Attribution of climate change in the presence of internal variability, in: Climate Change:
Multidecadal and Beyond, edited by: Chang, C.-P., Ghil, M., Latif, M., and
Wallace, J. M., World Scientific Publishing, Singapore, 1–29, 2015.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>An investigation of weighting schemes suitable for incorporating large ensembles into multi-model ensembles</article-title-html>
<abstract-html><p>Multi-model ensembles can be used to estimate uncertainty in projections of regional climate, but this uncertainty often depends on the constituents of the ensemble. The dependence of uncertainty on ensemble composition is clear when single-model initial condition large ensembles (SMILEs) are included within a multi-model ensemble. SMILEs allow for the quantification of internal variability, a non-negligible component of uncertainty on regional scales, but may also serve to inappropriately narrow uncertainty by giving a single model many additional votes. In advance of the mixed multi-model and SMILE Coupled Model Intercomparison Project Phase 6 (CMIP6) ensemble, we investigate weighting approaches to incorporate 50 members of the Community Earth System Model (CESM1.2.2-LE), 50 members of the Canadian Earth System Model (CanESM2-LE), and 100 members of the MPI Grand Ensemble (MPI-GE) into an 88-member Coupled Model Intercomparison Project Phase 5 (CMIP5) ensemble. The weights assigned are based on ability to reproduce observed climate (performance) and scaled by a measure of redundancy (dependence). Surface air temperature (SAT) and sea level pressure (SLP) predictors are used to determine the weights, and relationships between present and future predictor behavior are discussed. The estimated residual thermodynamic trend is proposed as an alternative predictor to replace 50-year regional SAT trends, which are more susceptible to internal variability.</p><p>Uncertainty in estimates of northern European winter and Mediterranean summer end-of-century warming is assessed in a CMIP5 and a combined SMILE–CMIP5 multi-model ensemble. Five different weighting strategies to account for the mix of initial condition (IC) ensemble members and individually represented models within the multi-model ensemble are considered. 
Allowing all multi-model ensemble members to receive either equal weight or solely a performance weight (based on the root mean square error (RMSE) between members and observations over nine predictors) is shown to lead to uncertainty estimates that are dominated by the presence of SMILEs. A more suitable approach includes a dependence assumption, scaling either by 1∕<i>N</i>, the number of constituents representing a <q>model</q>, or by the same RMSE distance metric used to define model performance. SMILE contributions to the weighted ensemble are smallest ( &lt; 10&thinsp;%) when a model is defined as an IC ensemble and increase slightly ( &lt; 20&thinsp;%) when the definition of a model expands to include members from the same institution and/or development stream. SMILE contributions increase further when dependence is defined by RMSE (over nine predictors) amongst members because RMSEs between SMILE members can be as large as RMSEs between SMILE members and other models. We find that an alternative RMSE distance metric, derived from global SAT and hemispheric SLP climatology, is able to better identify IC members in general and SMILE members in particular as members of the same model. Further, more subtle dependencies associated with resolution differences and component similarities are also identified by the global predictor set.</p></abstract-html>
<ref-html id="bib1.bib1"><label>Abramowitz et al.(2008)Abramowitz, Leuning, Clark, and
Pitman</label><mixed-citation>
Abramowitz, G., Leuning, R., Clark, M., and Pitman, A.: Evaluating the
performance of land surface models, J. Climate, 21, 5468–5481,
<a href="https://doi.org/10.1175/2008JCLI2378.1" target="_blank">https://doi.org/10.1175/2008JCLI2378.1</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Abramowitz et al.(2019)Abramowitz, Herger, Gutmann, Hammerling,
Knutti, Leduc, Lorenz, Pincus, and Schmidt</label><mixed-citation>
Abramowitz, G., Herger, N., Gutmann, E., Hammerling, D., Knutti, R., Leduc, M., Lorenz, R., Pincus, R., and Schmidt, G. A.: ESD Reviews: Model dependence in multi-model climate ensembles: weighting, sub-selection and out-of-sample testing, Earth Syst. Dynam., 10, 91–105, <a href="https://doi.org/10.5194/esd-10-91-2019" target="_blank">https://doi.org/10.5194/esd-10-91-2019</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Allen and Ingram(2002)</label><mixed-citation>
Allen, M. R. and Ingram, W. J.: Constraints on future changes in climate and
the hydrologic cycle, Nature, 419, 228–232, <a href="https://doi.org/10.1038/nature01092" target="_blank">https://doi.org/10.1038/nature01092</a>, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Annan and Hargreaves(2017)</label><mixed-citation>
Annan, J. D. and Hargreaves, J. C.: On the meaning of independence in climate
science, Earth Syst. Dynam., 8, 211–224, <a href="https://doi.org/10.5194/esd-8-211-2017" target="_blank">https://doi.org/10.5194/esd-8-211-2017</a>,
2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Arora et al.(2011)Arora, Scinocca, Boer, Christian, Denman, Flato,
Kharin, Lee, and Merryfield</label><mixed-citation>
Arora, V. K., Scinocca, J. F., Boer, G. J., Christian, J. R., Denman, K. L.,
Flato, G. M., Kharin, V. V., Lee, W. G., and Merryfield, W. J.: Carbon emission limits required to satisfy future representative concentration
pathways of greenhouse gases, Geophys. Res. Lett., 38, L05805,
<a href="https://doi.org/10.1029/2010GL046270" target="_blank">https://doi.org/10.1029/2010GL046270</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Bishop and Abramowitz(2013)</label><mixed-citation>
Bishop, C. and Abramowitz, G.: Climate model dependence and the replicate Earth paradigm, Clim. Dynam., 41, 885–900, <a href="https://doi.org/10.1007/s00382-012-1610-y" target="_blank">https://doi.org/10.1007/s00382-012-1610-y</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Boberg and Christensen(2012)</label><mixed-citation>
Boberg, F. and Christensen, J.: Overestimation of Mediterranean summer
temperature projections due to model deficiencies, Nat. Clim. Change, 2, 433–436, <a href="https://doi.org/10.1038/nclimate1454" target="_blank">https://doi.org/10.1038/nclimate1454</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Boé(2018)</label><mixed-citation>
Boé, J.: Interdependency in multimodel climate projections: Component
replication and result similarity, Geophys. Res. Lett., 45, 2771–2779, <a href="https://doi.org/10.1002/2017GL076829" target="_blank">https://doi.org/10.1002/2017GL076829</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Borodina and Knutti(2017)</label><mixed-citation>
Borodina, A., Fischer, E. M., and Knutti, R.: Emergent Constraints in Climate Projections: A Case Study of Changes in High-Latitude Temperature Variability, J. Climate, 30, 3655–3670, <a href="https://doi.org/10.1175/JCLI-D-16-0662.1" target="_blank">https://doi.org/10.1175/JCLI-D-16-0662.1</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Branstator and Teng(2017)</label><mixed-citation>
Branstator, G. and Teng, H.: Tropospheric Waveguide Teleconnections and Their Seasonality, J. Atmos. Sci., 74, 1513–1532, <a href="https://doi.org/10.1175/JAS-D-16-0305.1" target="_blank">https://doi.org/10.1175/JAS-D-16-0305.1</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Brunner et al.(2019)Brunner, Lorenz, Zumwald, and Knutti</label><mixed-citation>
Brunner, L., Lorenz, R., Zumwald, M., and Knutti, R.: Quantifying uncertainty
in European climate projections using combined performance-independence
weighting, Environ. Res. Lett., 14, 124010, <a href="https://doi.org/10.1088/1748-9326/ab492f" target="_blank">https://doi.org/10.1088/1748-9326/ab492f</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Christensen and Boberg(2012)</label><mixed-citation>
Christensen, J. H. and Boberg, F.: Temperature dependent climate projection
deficiencies in CMIP5 models, Geophys. Res. Lett., 39, L24705,
<a href="https://doi.org/10.1029/2012GL053650" target="_blank">https://doi.org/10.1029/2012GL053650</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Collins et al.(2013)Collins, Knutti, Arblaster, Dufresne, Fichefet,
Friedlingstein, Gao, Gutowski, Johns, Krinner, Shongwe, Tebaldi, Weaver, and Wehner</label><mixed-citation>
Collins, M., Knutti, R., Arblaster, J., Dufresne, J.-L., Fichefet, T.,
Friedlingstein, P., Gao, X., Gutowski, W., Johns, T., Krinner, G., Shongwe,
M., Tebaldi, C., Weaver, A., and Wehner, M.: Long-term Climate Change:
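Projections, Commitments and Irreversibility, in: Climate Change 2013: The Physical Science Basis. Contribution of Working Group I to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change, chap. 12, Cambridge University Press, Cambridge, UK and New York, NY, USA, 1029–1136, <a href="https://doi.org/10.1017/CBO9781107415324.024" target="_blank">https://doi.org/10.1017/CBO9781107415324.024</a>, 2013.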
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Compo et al.(2011)Compo, Whitaker, Sardeshmukh, Matsui, Allan, Yin,
Gleason, Vose, Rutledge, Bessemoulin, Brönnimann, Brunet, Crouthamel, Grant, Groisman, Jones, Kruk, Kruger, Marshall, Maugeri, Mok, Nordli, Ross, Trigo, Wang, Woodruff, and Worley</label><mixed-citation>
Compo, G., Whitaker, J., Sardeshmukh, P., Matsui, N., Allan, R., Yin, X.,
Gleason, B., Vose, R., Rutledge, G., Bessemoulin, P., Brönnimann, S.,
Brunet, M., Crouthamel, R., Grant, A., Groisman, P., Jones, P., Kruk, M.,
Kruger, A., Marshall, G., Maugeri, M., Mok, H., Nordli, Ø., Ross, T., Trigo, R., Wang, X., Woodruff, S., and Worley, S.: The Twentieth Century Reanalysis Project, Q. J. Roy. Meteorol. Soc., 137, 1–28, <a href="https://doi.org/10.1002/qj.776" target="_blank">https://doi.org/10.1002/qj.776</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Deser et al.(2012)Deser, Knutti, Solomon, and Phillips</label><mixed-citation>
Deser, C., Knutti, R., Solomon, S., and Phillips, A. S.: Communication of the
role of natural variability in future North American climate, Nat. Clim.
Change, 2, 775–779, <a href="https://doi.org/10.1038/NCLIMATE1562" target="_blank">https://doi.org/10.1038/NCLIMATE1562</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Deser et al.(2014)Deser, Phillips, Alexander, and Smoliak</label><mixed-citation>
Deser, C., Phillips, A., Alexander, M. A., and Smoliak, B. V.: Projecting
North American climate over the next 50 years: Uncertainty due to internal variability, J. Climate, 27, 2271–2296, <a href="https://doi.org/10.1175/JCLI-D-13-00451.1" target="_blank">https://doi.org/10.1175/JCLI-D-13-00451.1</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Deser et al.(2015)Deser, Tomas, and Sun</label><mixed-citation>
Deser, C., Tomas, R., and Sun, L.: The Role of Ocean–Atmosphere Coupling in the Zonal-Mean Atmospheric Response to Arctic Sea Ice Loss, J. Climate, 28, 2168–2186, <a href="https://doi.org/10.1175/JCLI-D-14-00325.1" target="_blank">https://doi.org/10.1175/JCLI-D-14-00325.1</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Deser et al.(2016)Deser, Terray, and Phillips</label><mixed-citation>
Deser, C. A., Terray, L., and Phillips, A. S.: Forced and internal components
of winter air temperature trends over North America during the past 50 years: Mechanisms and implications, J. Climate, 29, 2237–2258, <a href="https://doi.org/10.1175/JCLI-D-15-0304.1" target="_blank">https://doi.org/10.1175/JCLI-D-15-0304.1</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>ECMWF(2019)</label><mixed-citation>
ECMWF: ERA-20C Output, available at: <a href="https://apps.ecmwf.int/datasets/data/era20c-moda/levtype=sfc/type=an/" target="_blank">https://apps.ecmwf.int/datasets/data/era20c-moda/levtype=sfc/type=an/</a>, last access: May 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>ESGF(2019)</label><mixed-citation>
ESGF: WCRP Coupled Model Intercomparison Project (Phase 5), available at: <a href="https://esgf-node.llnl.gov/projects/cmip5/" target="_blank">https://esgf-node.llnl.gov/projects/cmip5/</a>, last access: July 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Environment and Climate Change Canada(2019)</label><mixed-citation>
Environment and Climate Change Canada: CanESM2 Large Ensembles Output, available at: <a href="https://open.canada.ca/data/en/dataset/aa7b6823-fd1e-49ff-a6fb-68076a4a477c" target="_blank">https://open.canada.ca/data/en/dataset/aa7b6823-fd1e-49ff-a6fb-68076a4a477c</a>, last access: August 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Eyring et al.(2016)Eyring, Bony, Meehl, Senior, Stevens, Stouffer,
and Taylor</label><mixed-citation>
Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer,
R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958, <a href="https://doi.org/10.5194/gmd-9-1937-2016" target="_blank">https://doi.org/10.5194/gmd-9-1937-2016</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Giorgetta et al.(2013)Giorgetta, Jungclaus, Reick, Legutke, Bader,
Böttinger, Brovkin, Crueger, Esch, Fieg, Glushak, Gayler, Haak, Hollweg,
Ilyina, Kinne, Kornblueh, Matei, Mauritsen, Mikolajewicz, Mueller, Notz,
Pithan, Raddatz, Rast, Redler, Roeckner, Schmidt, Schnur, Segschneider, Six, Stockhause, Timmreck, Wegner, Widmann, Wieners, Claussen, Marotzke, and Stevens</label><mixed-citation>
Giorgetta, M. A., Jungclaus, J., Reick, C. H., Legutke, S., Bader, J.,
Böttinger, M., Brovkin, V., Crueger, T., Esch, M., Fieg, K., Glushak, K.,
Gayler, V., Haak, H., Hollweg, H., Ilyina, T., Kinne, S., Kornblueh, L.,
Matei, D., Mauritsen, T., Mikolajewicz, U., Mueller, W., Notz, D., Pithan,
F., Raddatz, T., Rast, S., Redler, R., Roeckner, E., Schmidt, H., Schnur, R.,
Segschneider, J., Six, K. D., Stockhause, M., Timmreck, C., Wegner, J.,
Widmann, H., Wieners, K., Claussen, M., Marotzke, J., and Stevens, B.:
Climate and carbon cycle changes from 1850 to 2100 in MPI-ESM simulations
for the Coupled Model Intercomparison Project phase 5, J. Adv. Model. Earth Syst., 5, 572–597, <a href="https://doi.org/10.1002/jame.20038" target="_blank">https://doi.org/10.1002/jame.20038</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Guo et al.(2019)Guo, Deser, Terray, and Lehner</label><mixed-citation>
Guo, R., Deser, C., Terray, L., and Lehner, F.: Human influence on winter
precipitation trends (1921–2015) over North America and Eurasia revealed by dynamical adjustment, Geophys. Res. Lett., 46, 3426–3434, <a href="https://doi.org/10.1029/2018GL081316" target="_blank">https://doi.org/10.1029/2018GL081316</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Hall and Manabe(1999)</label><mixed-citation>
Hall, A. and Manabe, S.: The Role of Water Vapor Feedback in Unperturbed Climate Variability and Global Warming, J. Climate, 12, 2327–2346,
<a href="https://doi.org/10.1175/1520-0442(1999)012&lt;2327:TROWVF&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0442(1999)012&lt;2327:TROWVF&gt;2.0.CO;2</a>, 1999.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Hawkins and Sutton(2009)</label><mixed-citation>
Hawkins, E. and Sutton, R.: The Potential to Narrow Uncertainty in Regional Climate Predictions, B. Am. Meteorol. Soc., 90, 1095–1108,
<a href="https://doi.org/10.1175/2009BAMS2607.1" target="_blank">https://doi.org/10.1175/2009BAMS2607.1</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Hawkins et al.(2016)Hawkins, Smith, Gregory, and
Stainforth</label><mixed-citation>
Hawkins, E., Smith, R. S., Gregory, J. M., and Stainforth, D. A.: Irreducible
uncertainty in near-term climate projections, Clim. Dynam., 46, 3807–3819,
<a href="https://doi.org/10.1007/s00382-015-2806-8" target="_blank">https://doi.org/10.1007/s00382-015-2806-8</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Herger et al.(2018)Herger, Abramowitz, Knutti, Angélil, Lehmann,
and Sanderson</label><mixed-citation>
Herger, N., Abramowitz, G., Knutti, R., Angélil, O., Lehmann, K., and
Sanderson, B. M.: Selecting a climate model subset to optimise key ensemble
properties, Earth Syst. Dynam., 9, 135–151, <a href="https://doi.org/10.5194/esd-9-135-2018" target="_blank">https://doi.org/10.5194/esd-9-135-2018</a>,
2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Hourdin et al.(2017)Hourdin, Mauritsen, Gettelman, Golaz, Balaji,
Duan, Folini, Ji, Klocke, Qian, Rauser, Rio, Tomassini, Watanabe, and
Williamson</label><mixed-citation>
Hourdin, F., Mauritsen, T., Gettelman, A., Golaz, J., Balaji, V., Duan, Q.,
Folini, D., Ji, D., Klocke, D., Qian, Y., Rauser, F., Rio, C., Tomassini, L.,
Watanabe, M., and Williamson, D.: The Art and Science of Climate Model Tuning, B. Am. Meteorol. Soc., 98, 589–602, <a href="https://doi.org/10.1175/BAMS-D-15-00135.1" target="_blank">https://doi.org/10.1175/BAMS-D-15-00135.1</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Hurrell et al.(2013)Hurrell, Holland, Gent, Ghan, Kay, Kushner,
Lamarque, Large, Lawrence, Lindsay, Lipscomb, Long, Mahowald, Marsh, Neale,
Rasch, Vavrus, Vertenstein, Bader, Collins, Hack, Kiehl, and
Marshall</label><mixed-citation>
Hurrell, J., Holland, M., Gent, P., Ghan, S., Kay, J., Kushner, P., Lamarque,
J., Large, W., Lawrence, D., Lindsay, K., Lipscomb, W., Long, M., Mahowald,
N., Marsh, D., Neale, R., Rasch, P., Vavrus, S., Vertenstein, M., Bader, D.,
Collins, W., Hack, J., Kiehl, J., and Marshall, S.: The Community Earth
System Model: A Framework for Collaborative Research, B. Am. Meteorol. Soc., 94, 1339–1360, <a href="https://doi.org/10.1175/BAMS-D-12-00121.1" target="_blank">https://doi.org/10.1175/BAMS-D-12-00121.1</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Karlsson and Svensson(2013)</label><mixed-citation>
Karlsson, J. and Svensson, G.: Consequences of poor representation of Arctic
sea–ice albedo and cloud–radiation interactions in the CMIP5 model ensemble, Geophys. Res. Lett., 40, 4374–4379, <a href="https://doi.org/10.1002/grl.50768" target="_blank">https://doi.org/10.1002/grl.50768</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Kay et al.(2015)Kay, Deser, Phillips, Mai, Hannay, Strand, Arblaster, Bates, Danabasoglu, Edwards, Holland, Kushner, Lamarque, Lawrence, Lindsay, Middleton, Munoz, Neale, Oleson, Polvani, and Vertenstein</label><mixed-citation>
Kay, J. E., Deser, C., Phillips, A., Mai, A., Hannay, C., Strand, G.,
Arblaster, J., Bates, S., Danabasoglu, G., Edwards, J., Holland, M., Kushner,
P., Lamarque, J. F., Lawrence, D., Lindsay, K., Middleton, A., Munoz, E.,
Neale, R., Oleson, K., Polvani, L., and Vertenstein, M.: The Community Earth System Model (CESM) Large Ensemble Project: A community resource for studying climate change in the presence of internal climate variability, B. Am. Meteorol. Soc., 96, 1333–1349, <a href="https://doi.org/10.1175/BAMS-D-13-00255.1" target="_blank">https://doi.org/10.1175/BAMS-D-13-00255.1</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Knutti(2010)</label><mixed-citation>
Knutti, R.: The end of model democracy?, Climatic Change, 102, 395–404,
<a href="https://doi.org/10.1007/s10584-010-9800-2" target="_blank">https://doi.org/10.1007/s10584-010-9800-2</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Knutti and Sedláček(2013)</label><mixed-citation>
Knutti, R. and Sedláček, J.: Robustness and Uncertainties in the New
CMIP5 Climate Model Projections, Nat. Clim. Change, 3, 369–373,
<a href="https://doi.org/10.1038/NCLIMATE1716" target="_blank">https://doi.org/10.1038/NCLIMATE1716</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Knutti et al.(2010a)Knutti, Abramowitz, Collins, Eyring,
Gleckler, Hewitson, and Mearns</label><mixed-citation>
Knutti, R., Abramowitz, G., Collins, M., Eyring, V., Gleckler, P., Hewitson,
B., and Mearns, L.: Good Practice Guidance Paper on Assessing and Combining Multi Model Climate Projections, in: Meeting Report of the Intergovernmental Panel on Climate Change Expert Meeting on Assessing and Combining Multi Model Climate Projections, edited by: Stocker, T., Qin, D., Plattner, G.-K., Tignor, M., and Midgley, P., IPCC Working Group I Technical Support Unit, University of Bern, Bern, Switzerland, 2010a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Knutti et al.(2010b)Knutti, Furrer, Tebaldi, Cermak, and
Meehl</label><mixed-citation>
Knutti, R., Furrer, R., Tebaldi, C., Cermak, J., and Meehl, G.: Challenges in
Combining Projections from Multiple Climate Models, J. Climate, 23, 2739–2758, <a href="https://doi.org/10.1175/2009JCLI3361.1" target="_blank">https://doi.org/10.1175/2009JCLI3361.1</a>, 2010b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Knutti et al.(2013)Knutti, Masson, and Gettelman</label><mixed-citation>
Knutti, R., Masson, D., and Gettelman, A.: Climate model genealogy:
Generation CMIP5 and how we got there, Geophys. Res. Lett., 40, 1194–1199, <a href="https://doi.org/10.1002/grl.50256" target="_blank">https://doi.org/10.1002/grl.50256</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Knutti et al.(2017)Knutti, Sedláček, Sanderson, Lorenz,
Fischer, and Eyring</label><mixed-citation>
Knutti, R., Sedláček, J., Sanderson, B. M., Lorenz, R., Fischer, E., and Eyring, V.: A climate model projection weighting scheme accounting for
performance and interdependence, Geophys. Res. Lett., 44, 1909–1918, <a href="https://doi.org/10.1002/2016GL072012" target="_blank">https://doi.org/10.1002/2016GL072012</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Kunreuther et al.(2013)Kunreuther, Heal, Allen, Edenhofer, Field, and Yohe</label><mixed-citation>
Kunreuther, H., Heal, G., Allen, M., Edenhofer, O., Field, C. B., and Yohe, G.: Risk management and climate change, Nat. Clim. Change, 3, 447–450,
<a href="https://doi.org/10.1038/NCLIMATE1740" target="_blank">https://doi.org/10.1038/NCLIMATE1740</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Leduc et al.(2016)Leduc, Laprise, de Elía, and Šeparović</label><mixed-citation>
Leduc, M., Laprise, R., de Elía, R., and Šeparović, L.: Is Institutional Democracy a Good Proxy for Model Independence?, J. Climate, 29, 8301–8316, <a href="https://doi.org/10.1175/JCLI-D-15-0761.1" target="_blank">https://doi.org/10.1175/JCLI-D-15-0761.1</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Lehner et al.(2017)Lehner, Deser, and Terray</label><mixed-citation>
Lehner, F., Deser, C., and Terray, L.: Towards a new estimate of “time of
emergence” of anthropogenic warming: insights from dynamical adjustment and a large initial-condition model ensemble, J. Climate, 30, 7739–7756,
<a href="https://doi.org/10.1175/JCLI-D-16-0792.1" target="_blank">https://doi.org/10.1175/JCLI-D-16-0792.1</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Lehner et al.(2020)Lehner, Deser, Maher, Marotzke, Fischer, Brunner, Knutti, and Hawkins</label><mixed-citation>
Lehner, F., Deser, C., Maher, N., Marotzke, J., Fischer, E. M., Brunner, L.,
Knutti, R., and Hawkins, E.: Partitioning climate projection uncertainty with
multiple large ensembles and CMIP5/6, Earth Syst. Dynam., 11, 491–508,
<a href="https://doi.org/10.5194/esd-11-491-2020" target="_blank">https://doi.org/10.5194/esd-11-491-2020</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Li and Xie(2012)</label><mixed-citation>
Li, G. and Xie, S.: Origins of tropical-wide SST biases in CMIP multi-model
ensembles, Geophys. Res. Lett., 39, L22703, <a href="https://doi.org/10.1029/2012GL053777" target="_blank">https://doi.org/10.1029/2012GL053777</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Liu et al.(2012)Liu, Allan, and Huffman</label><mixed-citation>
Liu, C., Allan, R. P., and Huffman, G. J.: Co-variation of temperature and
precipitation in CMIP5 models and satellite observations, Geophys. Res. Lett., 39, L13803, <a href="https://doi.org/10.1029/2012GL052093" target="_blank">https://doi.org/10.1029/2012GL052093</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Lorenz et al.(2018)Lorenz, Herger, Sedláček, Eyring, Fischer,
and Knutti</label><mixed-citation>
Lorenz, R., Herger, N., Sedláček, J., Eyring, V., Fischer, E. M., and
Knutti, R.: Prospects and caveats of weighting climate models for summer
maximum temperature projections over North America, J. Geophys. Res.-Atmos., 123, 4509–4526, <a href="https://doi.org/10.1029/2017JD027992" target="_blank">https://doi.org/10.1029/2017JD027992</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Maher et al.(2019)Maher, Milinski, Suarez-Gutierrez, Botzet,
Dobrynin, Kornblueh, Krüger, Takano, Ghosh, Hedemann, Li, Li, Manzini, Notz, Putrasahan, Boysen, Claussen, Ilyina, Olonscheck, Raddatz, Stevens, and Marotzke</label><mixed-citation>
Maher, N., Milinski, S., Suarez-Gutierrez, L., Botzet, M., Dobrynin, M.,
Kornblueh, L., Krüger, J., Takano, Y., Ghosh, R., Hedemann, C., Li, C., Li, H., Manzini, E., Notz, D., Putrasahan, D., Boysen, L., Claussen, M., Ilyina, T., Olonscheck, D., Raddatz, T., Stevens, B., and Marotzke, J.: The Max Planck Institute Grand Ensemble: Enabling the Exploration of Climate System Variability, J. Adv. Model. Earth Syst., 11, 1–21, <a href="https://doi.org/10.1029/2019MS001639" target="_blank">https://doi.org/10.1029/2019MS001639</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Masson and Knutti(2011)</label><mixed-citation>
Masson, D. and Knutti, R.: Climate model genealogy, Geophys. Res. Lett., 38,
L08703, <a href="https://doi.org/10.1029/2011GL046864" target="_blank">https://doi.org/10.1029/2011GL046864</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Mauritsen et al.(2012)Mauritsen, Stevens, Roeckner, Crueger, Esch,
Giorgetta, Haak, Jungclaus, Klocke, Matei, Mikolajewicz, Notz, Pincus,
Schmidt, and Tomassini</label><mixed-citation>
Mauritsen, T., Stevens, B., Roeckner, E., Crueger, T., Esch, M., Giorgetta, M., Haak, H., Jungclaus, J., Klocke, D., Matei, D., Mikolajewicz, U., Notz, D., Pincus, R., Schmidt, H., and Tomassini, L.: Tuning the climate of a global model, J. Adv. Model. Earth Syst., 4, M00A01, <a href="https://doi.org/10.1029/2012MS000154" target="_blank">https://doi.org/10.1029/2012MS000154</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Max Planck Institute for Meteorology(2019)</label><mixed-citation>
Max Planck Institute for Meteorology: MPI Grand Ensemble Output, available at: <a href="https://esgf-data.dkrz.de/projects/mpi-ge/" target="_blank">https://esgf-data.dkrz.de/projects/mpi-ge/</a>, last access: August 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Meehl et al.(2000)Meehl, Boer, Covey, Latif, and Stouffer</label><mixed-citation>
Meehl, G. A., Boer, G. J., Covey, C., Latif, M., and Stouffer, R. J.: The
Coupled Model Intercomparison Project (CMIP), B. Am. Meteorol. Soc., 81, 313–318, 2000.
</mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Meinshausen et al.(2011)Meinshausen, Smith, Calvin, Daniel, Kainuma, Lamarque, Matsumoto, Montzka, Raper, Riahi, Thomson, Velders, and van Vuuren</label><mixed-citation>
Meinshausen, M., Smith, S. J., Calvin, K., Daniel, J. S., Kainuma, M. L. T.,
Lamarque, J.-F., Matsumoto, K., Montzka, S. A., Raper, S. C. B., Riahi, K.,
Thomson, A., Velders, G. J. M., and van Vuuren, D. P.: The RCP greenhouse gas concentrations and their extensions from 1765 to 2300, Climatic Change, 109, 213–241, <a href="https://doi.org/10.1007/s10584-011-0156-z" target="_blank">https://doi.org/10.1007/s10584-011-0156-z</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Merrifield and Xie(2016)</label><mixed-citation>
Merrifield, A. and Xie, S.: Summer U.S. Surface Air Temperature Variability: Controlling Factors and AMIP Simulation Biases, J. Climate, 29, 5123–5139, <a href="https://doi.org/10.1175/JCLI-D-15-0705.1" target="_blank">https://doi.org/10.1175/JCLI-D-15-0705.1</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Merrifield et al.(2017)Merrifield, Lehner, Xie, and
Deser</label><mixed-citation>
Merrifield, A. L., Lehner, F., Xie, S.-P., and Deser, C.: Removing circulation effects to assess central U.S. land–atmosphere interactions in the CESM Large Ensemble, Geophys. Res. Lett., 44, 9938–9946,
<a href="https://doi.org/10.1002/2017GL074831" target="_blank">https://doi.org/10.1002/2017GL074831</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Merrifield et al.(2020)</label><mixed-citation>
Merrifield, A. L., Brunner, L., and Lorenz, R.: ESD_weighting_large_ensembles: Paper Release (Version v1.0), Zenodo, <a href="https://doi.org/10.5281/zenodo.4028924" target="_blank">https://doi.org/10.5281/zenodo.4028924</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Miller et al.(2014)Miller, Schmidt, Nazarenko, Tausnev, Bauer,
DelGenio, Kelley, Lo, Ruedy, Shindell, Aleinov, Bauer, Bleck, Canuto, Chen,
Cheng, Clune, Faluvegi, Hansen, Healy, Kiang, Koch, Lacis, LeGrande, Lerner, Menon, Oinas, Pérez García-Pando, Perlwitz, Puma, Rind, Romanou, Russell, Sato, Sun, Tsigaridis, Unger, Voulgarakis, Yao, and Zhang</label><mixed-citation>
Miller, R. L., Schmidt, G. A., Nazarenko, L. S., Tausnev, N., Bauer, S. E.,
DelGenio, A. D., Kelley, M., Lo, K. K., Ruedy, R., Shindell, D. T., Aleinov,
I., Bauer, M., Bleck, R., Canuto, V., Chen, Y., Cheng, Y., Clune, T. L.,
Faluvegi, G., Hansen, J. E., Healy, R. J., Kiang, N. Y., Koch, D., Lacis, A. A., LeGrande, A. N., Lerner, J., Menon, S., Oinas, V., Pérez García-Pando, C., Perlwitz, J. P., Puma, M. J., Rind, D., Romanou, A., Russell, G. L., Sato, M., Sun, S., Tsigaridis, K., Unger, N., Voulgarakis, A., Yao, M.-S., and Zhang, J.: CMIP5 historical simulations (1850–2012) with GISS ModelE2, J. Adv. Model. Earth Syst., 6,
441–478, <a href="https://doi.org/10.1002/2013MS000266" target="_blank">https://doi.org/10.1002/2013MS000266</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Mueller and Seneviratne(2014)</label><mixed-citation>
Mueller, B. and Seneviratne, S. I.: Systematic land climate and evapotranspiration biases in CMIP5 simulations, Geophys. Res. Lett., 41,
128–134, <a href="https://doi.org/10.1002/2013GL058055" target="_blank">https://doi.org/10.1002/2013GL058055</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>O'Neill et al.(2014)O'Neill, Kriegler, Riahi, Ebi, Hallegatte,
Carter, Mathur, and van Vuuren</label><mixed-citation>
O'Neill, B. C., Kriegler, E., Riahi, K., Ebi, K. L., Hallegatte, S., Carter,
T. R., Mathur, R., and van Vuuren, D. P.: A new scenario framework for climate change research: the concept of shared socioeconomic pathways,
Climatic Change, 122, 387–400, <a href="https://doi.org/10.1007/s10584-013-0905-2" target="_blank">https://doi.org/10.1007/s10584-013-0905-2</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Pennell and Reichler(2011)</label><mixed-citation>
Pennell, C. and Reichler, T.: On the Effective Number of Climate Models, J. Climate, 24, 2358–2367, <a href="https://doi.org/10.1175/2010JCLI3814.1" target="_blank">https://doi.org/10.1175/2010JCLI3814.1</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Pithan et al.(2014)Pithan, Medeiros, and Mauritsen</label><mixed-citation>
Pithan, F., Medeiros, B., and Mauritsen, T.: Mixed-phase clouds cause climate
model biases in Arctic wintertime temperature inversions, Clim. Dynam., 43,
289–303, <a href="https://doi.org/10.1007/s00382-013-1964-9" target="_blank">https://doi.org/10.1007/s00382-013-1964-9</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Poli et al.(2016)Poli, Hersbach, Dee, Berrisford, Simmons, Vitart,
Laloyaux, Tan, Peubey, Thépaut, Trémolet, Hólm, Bonavita, Isaksen, and Fisher</label><mixed-citation>
Poli, P., Hersbach, H., Dee, D., Berrisford, P., Simmons, A., Vitart, F.,
Laloyaux, P., Tan, D., Peubey, C., Thépaut, J., Trémolet, Y., Hólm, E., Bonavita, M., Isaksen, L., and Fisher, M.: ERA-20C: An Atmospheric Reanalysis of the Twentieth Century, J. Climate, 29, 4083–4097,
<a href="https://doi.org/10.1175/JCLI-D-15-0556.1" target="_blank">https://doi.org/10.1175/JCLI-D-15-0556.1</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>Rohde et al.(2013)Rohde, Muller, Jacobsen, Perlmutter, Rosenfeld, et al.</label><mixed-citation>
Rohde, R., Muller, R., Jacobsen, R., Perlmutter, S., and Mosher, S.: Berkeley Earth Temperature Averaging Process, Geoinform. Geostat., 01, 1000103, <a href="https://doi.org/10.4172/2327-4581.1000103" target="_blank">https://doi.org/10.4172/2327-4581.1000103</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>Rondeau-Genesse and Braun(2019)</label><mixed-citation>
Rondeau-Genesse, G. and Braun, M.: Impact of internal variability on climate
change for the upcoming decades: analysis of the CanESM2-LE and CESM-LE
large ensembles, Climatic Change, 156, 299–314, <a href="https://doi.org/10.1007/s10584-019-02550-2" target="_blank">https://doi.org/10.1007/s10584-019-02550-2</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>Sanderson et al.(2015a)Sanderson, Knutti, and
Caldwell</label><mixed-citation>
Sanderson, B. M., Knutti, R., and Caldwell, P.: Addressing interdependency in a multimodel ensemble by interpolation of model properties, J. Climate, 28, 5150–5170, <a href="https://doi.org/10.1175/JCLI-D-14-00361.1" target="_blank">https://doi.org/10.1175/JCLI-D-14-00361.1</a>, 2015a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>Sanderson et al.(2015b)Sanderson, Knutti, and
Caldwell</label><mixed-citation>
Sanderson, B. M., Knutti, R., and Caldwell, P.: A representative democracy to
reduce interdependency in a multimodel ensemble, J. Climate, 28, 5171–5194, <a href="https://doi.org/10.1175/JCLI-D-14-00362.1" target="_blank">https://doi.org/10.1175/JCLI-D-14-00362.1</a>, 2015b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>Seneviratne(2012)</label><mixed-citation>
Seneviratne, S. I.: Changes in climate extremes and their impacts on the
natural physical environment, in: Managing the Risks of Extreme Events and Disasters to Advance Climate Change Adaptation. A Special Report of Working Groups I and II of the Intergovernmental Panel on Climate Change (IPCC), edited by: Field, C. B., Barros, V., Stocker, T. F., Qin, D., Dokken, D. J., Ebi, K. L., Mastrandrea, M. D., Mach, K. J., Plattner, G. K., Allen, S. K., Tignor, M., and Midgley, P. M., Cambridge University Press, Cambridge, UK and New York, NY, USA, 109–230, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>Seneviratne et al.(2010)Seneviratne, Corti, Davin, Hirschi, Jaeger,
Lehner, Orlowsky, and Teuling</label><mixed-citation>
Seneviratne, S., Corti, T., Davin, E., Hirschi, M., Jaeger, E., Lehner, I.,
Orlowsky, B., and Teuling, A.: Investigating soil moisture–climate
interactions in a changing climate: a review, Earth-Sci. Rev., 99, 125–161,
<a href="https://doi.org/10.1016/j.earscirev.2010.02.004" target="_blank">https://doi.org/10.1016/j.earscirev.2010.02.004</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>Sillmann et al.(2013)Sillmann, Kharin, Zhang, Zwiers, and
Bronaugh</label><mixed-citation>
Sillmann, J., Kharin, V. V., Zhang, X., Zwiers, F. W., and Bronaugh, D.: Climate extremes indices in the CMIP5 multimodel ensemble: Part 1. Model evaluation in the present climate, J. Geophys. Res.-Atmos., 118, 1716–1733, <a href="https://doi.org/10.1002/jgrd.50203" target="_blank">https://doi.org/10.1002/jgrd.50203</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>Sippel et al.(2019)Sippel, Meinshausen, Merrifield, Lehner,
Pendergrass, Fischer, and Knutti</label><mixed-citation>
Sippel, S., Meinshausen, N., Merrifield, A., Lehner, F., Pendergrass, A.,
Fischer, E., and Knutti, R.: Uncovering the Forced Climate Response from a
Single Ensemble Member Using Statistical Learning, J. Climate, 32, 5677–5699, <a href="https://doi.org/10.1175/JCLI-D-18-0882.1" target="_blank">https://doi.org/10.1175/JCLI-D-18-0882.1</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>Slivinski et al.(2019)Slivinski, Compo, Whitaker, Sardeshmukh, Giese, McColl, Allan, Yin, Vose, Titchner, Kennedy, Spencer, Ashcroft, Brönnimann, Brunet, Camuffo, Cornes, Cram, Crouthamel, Domínguez-Castro, Freeman, Gergis, Hawkins, Jones, Jourdain, Kaplan, Kubota, Blancq, Lee, Lorrey, Luterbacher, Maugeri, Mock, Moore, Przybylak, Pudmenzky, Reason, Slonosky, Smith, Tinz, Trewin, Valente, Wang, Wilkinson, Wood, and
Wyszyński</label><mixed-citation>
Slivinski, L. C., Compo, G. P., Whitaker, J. S., Sardeshmukh, P. D., Giese,
B. S., McColl, C., Allan, R., Yin, X., Vose, R., Titchner, H., Kennedy, J.,
Spencer, L. J., Ashcroft, L., Brönnimann, S., Brunet, M., Camuffo, D.,
Cornes, R., Cram, T. A., Crouthamel, R., Domínguez-Castro, F., Freeman,
J. E., Gergis, J., Hawkins, E., Jones, P. D., Jourdain, S., Kaplan, A., Kubota, H., Blancq, F. L., Lee, T., Lorrey, A., Luterbacher, J., Maugeri, M.,
Mock, C. J., Moore, G. K., Przybylak, R., Pudmenzky, C., Reason, C., Slonosky, V. C., Smith, C. A., Tinz, B., Trewin, B., Valente, M. A., Wang, X. L., Wilkinson, C., Wood, K., and Wyszyński, P.: Towards a more reliable
historical reanalysis: Improvements for version 3 of the Twentieth Century
Reanalysis system, Q. J. Roy. Meteorol. Soc., 145, 2876–2908,
<a href="https://doi.org/10.1002/qj.3598" target="_blank">https://doi.org/10.1002/qj.3598</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>Stainforth et al.(2007)Stainforth, Allen, Tredger, and
Smith</label><mixed-citation>
Stainforth, D., Allen, M., Tredger, E., and Smith, L.: Confidence, uncertainty and decision-support relevance in climate predictions, Philos. T. Roy. Soc. A, 365, 2145–2161, <a href="https://doi.org/10.1098/rsta.2007.2074" target="_blank">https://doi.org/10.1098/rsta.2007.2074</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>Stevens et al.(2013)Stevens, Giorgetta, Esch, Mauritsen, Crueger,
Rast, Salzmann, Schmidt, Bader, Block, Brokopf, Fast, Kinne, Kornblueh,
Lohmann, Pincus, Reichler, and Roeckner</label><mixed-citation>
Stevens, B., Giorgetta, M., Esch, M., Mauritsen, T., Crueger, T., Rast, S.,
Salzmann, M., Schmidt, H., Bader, J., Block, K., Brokopf, R., Fast, I., Kinne, S., Kornblueh, L., Lohmann, U., Pincus, R., Reichler, T., and Roeckner, E.: Atmospheric component of the MPI-M Earth System Model: ECHAM6, J. Adv. Model. Earth Syst., 5, 146–172, <a href="https://doi.org/10.1002/jame.20015" target="_blank">https://doi.org/10.1002/jame.20015</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>Stouffer et al.(2017)Stouffer, Eyring, Meehl, Bony, Senior, Stevens, and Taylor</label><mixed-citation>
Stouffer, R., Eyring, V., Meehl, G., Bony, S., Senior, C., Stevens, B., and
Taylor, K.: CMIP5 Scientific Gaps and Recommendations for CMIP6, B. Am. Meteorol. Soc., 98, 95–105, <a href="https://doi.org/10.1175/BAMS-D-15-00013.1" target="_blank">https://doi.org/10.1175/BAMS-D-15-00013.1</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>Swart et al.(2018)Swart, Gille, Fyfe, and Gillett</label><mixed-citation>
Swart, N. C., Gille, S. T., Fyfe, J. C., and Gillett, N. P.: Recent Southern
Ocean warming and freshening driven by greenhouse gas emissions and ozone
depletion, Nat. Geosci., 11, 836–841, <a href="https://doi.org/10.1038/s41561-018-0226-1" target="_blank">https://doi.org/10.1038/s41561-018-0226-1</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>Tebaldi and Knutti(2007)</label><mixed-citation>
Tebaldi, C. and Knutti, R.: The Use of the Multi-Model Ensemble in Probabilistic Climate Change Projections, Philos. T. Roy. Soc. A, 365, 2053–2075, <a href="https://doi.org/10.1098/rsta.2007.2076" target="_blank">https://doi.org/10.1098/rsta.2007.2076</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib75"><label>Trenberth and Paolino(1980)</label><mixed-citation>
Trenberth, K. and Paolino, D.: The Northern Hemisphere Sea-Level Pressure Data Set: Trends, Errors and Discontinuities, Mon. Weather Rev., 108, 855–872, <a href="https://doi.org/10.1175/1520-0493(1980)108&lt;0855:TNHSLP&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0493(1980)108&lt;0855:TNHSLP&gt;2.0.CO;2</a>,
1980.
</mixed-citation></ref-html>
<ref-html id="bib1.bib76"><label>van den Dool(1994)</label><mixed-citation>
van den Dool, H. M.: Searching for analogues, how long must we wait?, Tellus A, 46, 314–324, <a href="https://doi.org/10.1034/j.1600-0870.1994.t01-2-00006.x" target="_blank">https://doi.org/10.1034/j.1600-0870.1994.t01-2-00006.x</a>, 1994.
</mixed-citation></ref-html>
<ref-html id="bib1.bib77"><label>van Vuuren et al.(2011)van Vuuren, Edmonds, Kainuma, Riahi, Thomson, Hibbard, Hurtt, Kram, Krey, Lamarque, Masui, Meinshausen, Nakicenovic, Smith, and Rose</label><mixed-citation>
van Vuuren, D., Edmonds, J., Kainuma, M., Riahi, K., Thomson, A., Hibbard, K., Hurtt, G. C., Kram, T., Krey, V., Lamarque, J.-F., Masui, T., Meinshausen, M., Nakicenovic, N., Smith, S. J., and Rose, S. K.: The representative concentration pathways: an overview, Climatic Change, 109, 5–31, <a href="https://doi.org/10.1007/s10584-011-0148-z" target="_blank">https://doi.org/10.1007/s10584-011-0148-z</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib78"><label>Vogel et al.(2018)Vogel, Zscheischler, and Seneviratne</label><mixed-citation>
Vogel, M. M., Zscheischler, J., and Seneviratne, S. I.: Varying soil
moisture–atmosphere feedbacks explain divergent temperature extremes and
precipitation projections in central Europe, Earth Syst. Dynam., 9, 1107–1125, <a href="https://doi.org/10.5194/esd-9-1107-2018" target="_blank">https://doi.org/10.5194/esd-9-1107-2018</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib79"><label>Wallace et al.(2012)Wallace, Fu, Smoliak, Lin, and
Johanson</label><mixed-citation>
Wallace, J., Fu, Q., Smoliak, B. V., Lin, P., and Johanson, C. M.: Simulated
versus observed patterns of warming over the extratropical Northern Hemisphere continents during the cold season, P. Natl. Acad. Sci. USA, 109, 14337–14342, <a href="https://doi.org/10.1073/pnas.1204875109" target="_blank">https://doi.org/10.1073/pnas.1204875109</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib80"><label>Wallace et al.(1995)Wallace, Zhang, and Renwick</label><mixed-citation>
Wallace, J. M., Zhang, Y., and Renwick, J. A.: Dynamic contribution to
hemispheric mean temperature trends, Science, 270, 780–783,
<a href="https://doi.org/10.1126/science.270.5237.780" target="_blank">https://doi.org/10.1126/science.270.5237.780</a>, 1995.
</mixed-citation></ref-html>
<ref-html id="bib1.bib81"><label>Wallace et al.(2015)Wallace, Deser, Smoliak, and
Phillips</label><mixed-citation>
Wallace, J. M., Deser, C., Smoliak, B., and Phillips, A.: Attribution of climate change in the presence of internal variability, in: Climate Change:
Multidecadal and Beyond, edited by: Chang, C.-P., Ghil, M., Latif, M., and
Wallace, J. M., World Scientific Publishing, Singapore, 1–29, 2015.
</mixed-citation></ref-html>--></article>
