<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" dtd-version="3.0">
  <front>
    <journal-meta>
<journal-id journal-id-type="publisher">ESD</journal-id>
<journal-title-group>
<journal-title>Earth System Dynamics</journal-title>
<abbrev-journal-title abbrev-type="publisher">ESD</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">Earth Syst. Dynam.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">2190-4987</issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>

    <article-meta>
      <article-id pub-id-type="doi">10.5194/esd-8-211-2017</article-id><title-group><article-title>On the meaning of independence in climate science</article-title>
      </title-group><?xmltex \runningtitle{Independence}?><?xmltex \runningauthor{J.~D.~Annan and J.~C.~Hargreaves}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Annan</surname><given-names>James D.</given-names></name>
          <email>jdannan@blueskiesresearch.org.uk</email>
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Hargreaves</surname><given-names>Julia C.</given-names></name>
          
        </contrib>
        <aff id="aff1"><institution>BlueSkiesResearch.org.uk, The Old Chapel, Albert Hill, Settle, BD24 9HE, UK</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">James D. Annan (jdannan@blueskiesresearch.org.uk)</corresp></author-notes><pub-date><day>21</day><month>March</month><year>2017</year></pub-date>
      
      <volume>8</volume>
      <issue>1</issue>
      <fpage>211</fpage><lpage>224</lpage>
      <history>
        <date date-type="received"><day>12</day><month>August</month><year>2016</year></date>
           <date date-type="rev-request"><day>19</day><month>August</month><year>2016</year></date>
           <date date-type="rev-recd"><day>27</day><month>January</month><year>2017</year></date>
           <date date-type="accepted"><day>22</day><month>February</month><year>2017</year></date>
      </history>
      <permissions>
<license license-type="open-access">
<license-p>This work is licensed under a Creative Commons Attribution 3.0 Unported License. To view a copy of this license, visit <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/3.0/">http://creativecommons.org/licenses/by/3.0/</ext-link></license-p>
</license>
</permissions><self-uri xlink:href="https://esd.copernicus.org/articles/8/211/2017/esd-8-211-2017.html">This article is available from https://esd.copernicus.org/articles/8/211/2017/esd-8-211-2017.html</self-uri>
<self-uri xlink:href="https://esd.copernicus.org/articles/8/211/2017/esd-8-211-2017.pdf">The full text article is available as a PDF file from https://esd.copernicus.org/articles/8/211/2017/esd-8-211-2017.pdf</self-uri>


      <abstract>
    <p>The concept of independence has been frequently mentioned in climate science
research, but has rarely been defined and discussed in a theoretically robust
and quantifiable manner. In this paper we argue that any discussion must
start from a clear and unambiguous definition of what independence means and
how it can be determined. We introduce an approach based on the statistical
definition of independence, and illustrate with simple examples how it can be
applied to practical questions. Firstly, we apply these ideas to climate
models, which are frequently argued to not be independent of each other,
raising questions as to the robustness of results from multi-model ensembles.
We explore the dependence between models in a multi-model ensemble, and
suggest a possible way forward for future weighting strategies. Secondly, we
discuss the issue of independence in relation to the synthesis of multiple
observationally based constraints on the climate system, using equilibrium
climate sensitivity as an example. We show that the same statistical theory
applies to this problem, and illustrate this with a test case, indicating how
researchers may estimate dependence between multiple constraints.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <title>Introduction</title>
      <p>Approximately 30 climate models contributed to recent iterations of the CMIP
databases, and they generally agree, at least on broad statements: the world
is warming, anthropogenic emissions of CO<inline-formula><mml:math id="M1" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> is the major cause of this, and
if we continue to emit it in large quantities then the world will continue to
warm at a substantial rate for the foreseeable
future <xref ref-type="bibr" rid="bib1.bibx34" id="paren.1"/>. The consensus across models is also
strong for more detailed statements regarding, for example, the warming rates
of land versus ocean, high versus low latitudes, and the likely changes in
precipitation over many areas. Even where models disagree qualitatively
amongst themselves (for example, concerning changes in ocean circulation and
some regional details of precipitation patterns), their range of results is
still quantitatively limited. Climate models are probably the most
widely used tool for predicting future climate changes, and their spread of
results is commonly used as an indication of what future changes might occur.</p>
      <p>But should this consensus between models really lead to confidence in these
results? If we were to re-run the same scenario with the same model 30 times,
we would get the same answer 30 times, whether it be a good or bad model.
This repetition of one experiment would not tell us how good the model is,
and the behaviour of the real climate system would almost certainly lie
outside this narrow range of results. Different model development teams share
code, and even if the code is rewritten from scratch, the underlying
algorithms and methods are often linked <xref ref-type="bibr" rid="bib1.bibx19" id="paren.2"/>.
Furthermore, many fundamental theories are common across all models. So how
much confidence can we draw from the fact that multiple models provide
consistent answers? How likely is it that common biases across all models are
greater than their spread of results, such that the ensemble range does not
provide trustworthy bounds on the behaviour of the climate system? These
questions have proved difficult to answer, and indeed there appears little
consensus as to how we can even address them. Further related issues arise
from the increasingly prevalent situation where a single modelling centre
contributes multiple simulations to the CMIP archive, some of which may only
differ in terms of the settings of uncertain parameters in the climate model,
or even just the initial state of the atmosphere–ocean system. A common
heuristic when performing multi-model analyses based on a generation of the
CMIP ensemble has been to use a single simulation from each modelling
centre <xref ref-type="bibr" rid="bib1.bibx21" id="paren.3"><named-content content-type="pre">e.g.</named-content></xref>, but it is not clear where to
draw the line when different centres may have shared a common core or
sub-models. Is there a better way to select models, and should we use a
weighted ensemble? In this case, further questions arise as to how the
weights should be defined, either in terms of model performance relative to
observations of the real climate or else in terms of their relationship to
other models, or some combination of both. Another related question that has
been posed in recent years is whether the scientific community could
collaboratively design or select ensemble members to contribute to CMIP in a
more rational and scientifically defensible way than the current ad hoc
“ensemble of opportunity”. It may be possible to address this issue in terms
of statistical sampling and experimental design, but appropriate methods and
even language do not yet appear to be well developed in this area.</p>
      <p>In Part 1 of this paper, we consider this question of model independence and
discuss how it may be addressed in a mathematically precise and well-founded
manner. We present an approach which links the usage in climate science to
the statistical definition of independence. We start by reviewing, in
Sect. <xref ref-type="sec" rid="Ch1.S2"/>, how the concept of independence has been discussed
in the recent literature. In Sect. <xref ref-type="sec" rid="Ch1.S3"/> we present a theoretical
and statistical viewpoint of independence within the Bayesian paradigm, which
we argue has direct relevance to this question. We consider how this
statistical viewpoint relates to the question of model independence in
Sect. <xref ref-type="sec" rid="Ch1.S4"/>, and also present some ideas for how to make
practical use of these ideas. We emphasise, however, that the purpose of our
paper is to provide a direction and motivation for future investigations
rather than attempting to present a complete solution.</p>
      <p>In Part 2, we illustrate how the theoretical basis for statistical
independence can also apply to the question of synthesising observational
constraints on the behaviour of the climate system, particularly the
equilibrium climate sensitivity. The equilibrium climate sensitivity <inline-formula><mml:math id="M2" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>
represents the equilibrium change in global mean surface temperature
following a doubling of atmospheric CO<inline-formula><mml:math id="M3" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, and while this parameter is far
from a comprehensive description of our future climate, it is commonly used
as a summary of the potential magnitude of changes which we might observe in
the long term. Different approaches have been proposed for constraining <inline-formula><mml:math id="M4" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>,
for example using data drawn from the modern instrumental period, or looking
to the palaeoclimate record and particularly the Last Glacial Maximum
(LGM; 19–23 ka), where global temperatures were far below those of the present
day for a sustained period, or searching for constraints that emerge when
process studies examine how well different models simulate various aspects of
the climate system such as seasonal and interannual variation. It has
previously been proposed that multiple constraints can be considered
“independent” and the resulting constraints combined into an overall
estimate <xref ref-type="bibr" rid="bib1.bibx3" id="paren.4"/>. However, the principles underlying this approach
have not be clearly investigated. In Sect. <xref ref-type="sec" rid="Ch1.S6"/> we consider how
this problem has been addressed in the previous literature, and in
Sect. <xref ref-type="sec" rid="Ch1.S7"/> we consider how the statistical principles apply
in both theoretical and practical terms by means of a simple example.</p>
</sec>
<sec id="Ch1.Sx1" specific-use="unnumbered">
  <title>Part 1 – Climate model independence</title>
</sec>
<sec id="Ch1.S2">
  <title>The literature on model independence in climate research</title>
      <p>The question of independence has featured widely in climate research, but the
research community has not yet arrived at a clear and unambiguous definition.
Different authors have approached the question of independence in different
ways, and their approaches are often mutually inconsistent.</p>
      <p>One common approach has been to interpret model independence as meaning that
the models can be considered as having errors which are independent,
identically distributed (i.i.d. in common statistical parlance) samples drawn
from some distribution (typically Gaussian) with zero
mean <xref ref-type="bibr" rid="bib1.bibx35" id="paren.5"/>. This is the so-called “truth-centred” or “truth
plus error” hypothesis. Although it has not generally been explicitly stated,
even a small ensemble of samples drawn from such a distribution would be an
incredibly powerful tool. If we could sample models from such a distribution,
then we could generate arbitrarily precise statements about the climate,
including future climate changes, merely by proceeding with the
model-building process indefinitely and taking the ensemble mean. This would
obviate the need both for computational advances and also for any additional
understanding of how to best simulate the climate system. As an illustration
of the power of such a (hypothetical) truth-centred ensemble, if the 19 CMIP3
models listed in Table 8.2 of <xref ref-type="bibr" rid="bib1.bibx28" id="text.6"/> provided independent (in
this sense) estimates of the equilibrium climate sensitivity <inline-formula><mml:math id="M5" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, then we
could immediately generate a 95 % confidence interval for the real value for
<inline-formula><mml:math id="M6" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> of <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.2</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.3</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M8" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C based on the assumption that the samples are drawn
from a Gaussian distribution of  a priori  unknown variance.</p>
      <p>However, the truth-centred hypothesis is clearly refuted by numerous analyses
of the ensemble. In particular, the errors of different models are observed
to be strongly related, as can be shown by the positive correlations between
spatial patterns of biases in
climatology <xref ref-type="bibr" rid="bib1.bibx18" id="paren.7"><named-content content-type="post">Fig. 3</named-content></xref>. As a corollary of this,
although the mean of the ensemble generally outperforms most if not all the
ensemble's constituent models <xref ref-type="bibr" rid="bib1.bibx6" id="paren.8"/>, it does not
actually converge to reality as the ensemble size grows. Rather, the ensemble
mean itself appears to have a persistent and significant bias. There have
been some attempts to compensate for this shared bias, for example by
estimating the number of “effectively independent” models contained in the
full ensemble <xref ref-type="bibr" rid="bib1.bibx15 bib1.bibx16 bib1.bibx27" id="paren.9"/>.
However, the theoretical basis for these calculations does not appear to be
clearly justified, and the results presented would, if valid, have startling
implications. For example, if we accept the arguments
of <xref ref-type="bibr" rid="bib1.bibx27" id="text.10"/> that the CMIP3 ensemble contains
eight “effectively independent” models then its full range of sensitivity
values, 2.1–4.4 <inline-formula><mml:math id="M9" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C, would still be a legitimate 99 % confidence
interval for the true sensitivity, as the probability of eight independent (in
this sense) estimates all simultaneously falling either below or above the
truth is only 1 part in <inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">7</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>. The same argument would apply to any other
output or derived parameter of the model climates. That is, we could be
“virtually certain” (to use the IPCC calibrated language) that the model
ensemble bounds multiple aspects of the behaviour of the climate system, even
with this very modest number of number of “effectively independent” models.
This confident conclusion does not seem very realistic when we consider the
limitations which are common to all climate models, and therefore we are
forced to question the appropriateness and validity of the assumptions
underlying such analyses.</p>
      <p><xref ref-type="bibr" rid="bib1.bibx1" id="text.11"/> define independence purely in terms of
inter-model differences and suggest down weighting models that are too
similar in outputs. This approach has the potential weakness that models that
agree <italic>because they are all accurate</italic> will be discounted, relative to
much worse models, without any allowance being made for their good
performance relative to reality. A challenge for this and related approaches
is that the use of a distance measure does not readily suggest a threshold at
which models can be considered <italic>absolutely</italic> independent. All models are
designed to simulate the real climate system, and are tuned towards
observations of it <xref ref-type="bibr" rid="bib1.bibx14" id="paren.12"/>. Therefore it should not be
surprising that climate models appear broadly similar, since the maximum
distance (in any relevant metric space) between a pair of models can be no
more than the sum of the distances between each of these models and reality.
<xref ref-type="bibr" rid="bib1.bibx8" id="text.13"/> use the pairwise correlations of model errors in
their analysis, but only after first debiasing the model simulations, and
thus exclude    a priori  one of the factors which is usually considered a
fundamental aspect of both model performance and model similarity.</p>
      <p>Some approaches to model independence have been less quantitative in nature.
<xref ref-type="bibr" rid="bib1.bibx25" id="text.14"/> define their interpretation as “independent in the
sense that every model contributes additional information”, but information
in this context is not further defined or quantified. In fact, the cluster
analysis presented by <xref ref-type="bibr" rid="bib1.bibx25" id="text.15"/> may be more precisely
described by the phrasing in the related paper by <xref ref-type="bibr" rid="bib1.bibx19" id="text.16"/>,
which states that independence is used “loosely to express that the
similarity between models sharing code is far greater than between those that
do not”. While that pair of papers certainly establishes that point
convincingly, there is again no indication of how much similarity should be
expected or tolerated between truly “independent” models, or whether absolute
independence is even a meaningful concept in their terms. The interesting
philosophical discussions of <xref ref-type="bibr" rid="bib1.bibx26" id="text.17"/> and <xref ref-type="bibr" rid="bib1.bibx24" id="text.18"/>
both consider the interpretation and implications of consensus across an
ensemble of models that are not independent, but the premise of model
dependence is adopted from the literature and these two authors do not
themselves attempt to further define this term in a quantifiable manner.</p>
      <p>Perhaps the most constructive and complete approach to date is that
of <xref ref-type="bibr" rid="bib1.bibx31" id="text.19"/>. In this work, dependence is again defined in terms
of inter-model differences in output, and this distance measure is used to
remove or downweight the models which are most similar to other models in
output. By comparing the inter-model distances both to model–data
differences and to what might be expected by chance with independent samples
from a Gaussian distribution that summarises the full distribution, the
authors introduce a threshold at which they argue model differences may be
considered appropriately large. However, the epistemic nature of their
resulting ensemble is unclear and the resulting reduced ensemble is still
only described in terms of <italic>reducing</italic> rather than eliminating dependency.</p>
      <p>To summarise, the literature presents a strong consensus that the models are
not independent, but does not appear to present such a clear viewpoint
concerning what to do about this, or even the precise meaning of this term.
Given this lack of clarity, it is perhaps unsurprising that the IPCC does not
address this topic in detail, while nevertheless acknowledging its
importance <xref ref-type="bibr" rid="bib1.bibx11" id="paren.20"><named-content content-type="post">Sect. 1.4.2</named-content></xref>. Thus, we see not only the
opportunity but also the necessity of making further progress.</p>
</sec>
<sec id="Ch1.S3">
  <title>The statistical context for independence</title>
      <p>In probability theory, independence has a straightforward definition. Two
events, <inline-formula><mml:math id="M11" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M12" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, are defined to be independent if the probability of <inline-formula><mml:math id="M13" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>,
<inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, is not affected by the occurrence of <inline-formula><mml:math id="M15" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, so that <inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> <xref ref-type="bibr" rid="bib1.bibx39" id="paren.21"><named-content content-type="pre">e.g.</named-content><named-content content-type="post">Sect. 2.4.3</named-content></xref>. Since the joint probability of
both events <inline-formula><mml:math id="M17" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is given by <inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, we see that two events
are independent if their joint probability is equal to the product of their
individual probabilities, i.e. if <inline-formula><mml:math id="M19" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>)</mml:mo><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. Independence is
therefore a symmetric property: <inline-formula><mml:math id="M20" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> is independent of <inline-formula><mml:math id="M21" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> if and only if <inline-formula><mml:math id="M22" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>
is independent of <inline-formula><mml:math id="M23" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>. The concept of independence can also be generalised to
the case of conditional independence: two events, <inline-formula><mml:math id="M24" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M25" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, are defined to
be conditionally independent given a third event, <inline-formula><mml:math id="M26" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, if their joint
probability conditional on <inline-formula><mml:math id="M27" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, is equal to the product of
their individual probabilities both conditional on <inline-formula><mml:math id="M29" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.
Independence and conditional independence generalise naturally both to
continuous distributions <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, which is more appropriate for the situations
considered in this paper, and also to more than two events.</p>
      <p><?xmltex \hack{\newpage}?>As we have seen in Sect. <xref ref-type="sec" rid="Ch1.S2"/>, much research on model
independence either ignores or explicitly disavows any direct link to this
mathematical/statistical definition. Conversely, the primary goal of this
paper is to argue that this definition must be central to any usable,
quantitative theory.</p>
      <p>Bayes' theorem tells us how to update a prior probabilistic estimate of an
unknown, <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, in light of some observations or event <inline-formula><mml:math id="M33" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> via the equation
          <disp-formula id="Ch1.E1" content-type="numbered"><mml:math id="M34" display="block"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>|</mml:mo><mml:mi>A</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
        <inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is known as the likelihood function (particularly when <inline-formula><mml:math id="M36" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> is held
fixed, and <inline-formula><mml:math id="M37" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> treated as a variable).</p>
      <p>If we have two events <inline-formula><mml:math id="M38" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M39" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> then the corresponding equation is
          <disp-formula id="Ch1.E2" content-type="numbered"><mml:math id="M40" display="block"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>|</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
        The first term on the right-hand side of this equation can be expanded by the
laws of probability, resulting in the equivalent formulation
          <disp-formula id="Ch1.E3" content-type="numbered"><mml:math id="M41" display="block"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>|</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>B</mml:mi><mml:mo>∩</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
        Either of these two equations can in principle be used to calculate the
posterior probability of <inline-formula><mml:math id="M42" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> conditional on both of the events <inline-formula><mml:math id="M43" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M44" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>,
though in practice it may not be straightforward to determine the terms on
the right-hand sides.</p>
      <p>If <inline-formula><mml:math id="M45" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M46" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> are conditionally independent given <inline-formula><mml:math id="M47" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, then <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>
can also be decomposed as <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. Thus, in this case,
          <disp-formula id="Ch1.E4" content-type="numbered"><mml:math id="M50" display="block"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>|</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
        In practice, the term “independent” is frequently used to refer to
conditional independence, especially when <inline-formula><mml:math id="M51" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M52" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> are being discussed
primarily as observations of, or evidence concerning, some unknown <inline-formula><mml:math id="M53" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>. The
significance of this conditional independence is that if we already have
likelihoods <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, then conditional independence allows us to
directly create the joint likelihood <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> by multiplication, rather
than requiring the construction of <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>B</mml:mi><mml:mo>∩</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> as an additional step.
Inspection of Eqs. (<xref ref-type="disp-formula" rid="Ch1.E3"/>) and (<xref ref-type="disp-formula" rid="Ch1.E4"/>) shows that the
conditional independence of <inline-formula><mml:math id="M58" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M59" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> given <inline-formula><mml:math id="M60" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> is equivalent to the
condition that <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>B</mml:mi><mml:mo>∩</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. This equation states that the
predictive probability of <inline-formula><mml:math id="M62" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>, given both <inline-formula><mml:math id="M63" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M64" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, is equal to the
predictive probability of <inline-formula><mml:math id="M65" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> given <inline-formula><mml:math id="M66" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>. In other words, if we know <inline-formula><mml:math id="M67" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, then
additionally learning <inline-formula><mml:math id="M68" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> does not change our prediction of <inline-formula><mml:math id="M69" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>. This
formulation can be a useful aid to understanding when independence does and
does not occur.</p>
<sec id="Ch1.S3.SS1">
  <title>The Bayesian perspective</title>
      <p>The above elementary probability theory applies equally to the frequentist
and Bayesian paradigms. Within the frequentist paradigm, the probability of
an event is defined as the limit of its relative frequency over a large
number of repeated but random trials. Within the Bayesian paradigm, the
probability calculus may be used to describe the subjective beliefs of the
researcher. In the remainder of this paper, we exclusively adopt this
paradigm, since all the relevant uncertainties discussed here are epistemic
in nature (relating to imperfect knowledge) and not aleatory (arising from
some intrinsic source of “randomness”). Thus, rather than considering “the
pdf of <inline-formula><mml:math id="M70" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>” it is more correct to refer to “my pdf of <inline-formula><mml:math id="M71" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>” or perhaps “our
pdf for <inline-formula><mml:math id="M72" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>” in the case that many researchers share a consensus view.</p>
      <p>It should be noted that Bayesian probabilities, being personal in nature, are
in general conditional on some personal “background” set of beliefs of the
researcher <inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="normal">Ω</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. Thus, <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> could be more precisely written as
<inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="normal">Ω</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. However, this background knowledge will usually be omitted
for convenience, and conditioning will usually be explicitly included only
when there is some specific information that may be considered particularly
relevant (and which is not assumed to be widely known).</p>
      <p>As we have seen, the question of (conditional) independence boils down to the
question of whether <inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>B</mml:mi><mml:mo>∩</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is equal to <inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. Our discussion of
the subjective nature of likelihood within the Bayesian probabilities should
make it clear that there is not an objectively correct answer to this
question, but rather it depends on the subjective view of the researcher in
question. Posing the question presupposes that the researcher already has
likelihoods <inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> in mind, or else the observations <inline-formula><mml:math id="M80" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and
<inline-formula><mml:math id="M81" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> would not be considered useful evidence on <inline-formula><mml:math id="M82" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>. Would knowing <inline-formula><mml:math id="M83" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> change
their predictive distribution for <inline-formula><mml:math id="M84" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> (i.e. the likelihood <inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>? If it
would not, then <inline-formula><mml:math id="M86" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M87" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> <italic>are</italic> conditionally independent given <inline-formula><mml:math id="M88" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>,
for this researcher. That is, if the researcher does not know how to use the
additional information <inline-formula><mml:math id="M89" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> in order to better predict <inline-formula><mml:math id="M90" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>, then <inline-formula><mml:math id="M91" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M92" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>
are conditionally independent to that researcher. Thus, ignorance implies
independence. If, conversely, <inline-formula><mml:math id="M93" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> does provide helpful information in
addition to <inline-formula><mml:math id="M94" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, then their improved prediction is the new likelihood
function <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>B</mml:mi><mml:mo>∩</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, which directly enables the joint likelihood <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> to also be created.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <title>Model independence in the Bayesian framework</title>
      <p>We now explore how this Bayesian framework can be applied to the question of
model independence. We first consider the “truth-centred” hypothesis which is
perhaps most clearly presented by <xref ref-type="bibr" rid="bib1.bibx36" id="text.22"/>. In that
work, the outputs of the models, <inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (where <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>&lt;</mml:mo><mml:mi>i</mml:mi><mml:mo>&lt;</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:math></inline-formula> indexes the different
models) are assumed to be samples from a multivariate Gaussian distribution
centred on the truth <inline-formula><mml:math id="M99" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>. The likelihood for each model <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>|</mml:mo><mml:mi>T</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is
therefore a Gaussian of the same width centred on the model outputs. The
joint likelihood for multiple models is equal to the product of their
individual likelihoods, which as we have seen above is equivalent to
considering that the models are independent conditional on the truth. The
joint likelihood will therefore be a Gaussian centred on the ensemble mean
and its width will narrow in proportion to the square root of the number of
models considered, which is the mathematical justification for the
supposition that the ensemble mean will converge to the truth. As we have
already mentioned, this behaviour is contradicted by analysis of the model
outputs <xref ref-type="bibr" rid="bib1.bibx18" id="paren.23"/>. Thus, although such a definition of the
concept of model independence could be presented in terms of the statistical
definition of independence, it does not describe the behaviour of the models
adequately because the models do in fact generally share common biases.</p>
      <p>In light of this failure of the truth-centred approach, we now present two
alternative interpretations of statistical independence that we believe could
be more relevant and appropriate in application to the ensemble of climate
models. We use CMIP3 here, rather than CMIP5, primarily in order that the
ideas developed here can in the future be tested against a somewhat new
sample, so as to defend against the risk of data mining.</p>
      <p>Consider firstly the case where the outputs of a subset of the models which
contributed to CMIP3 are labelled as <inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, so as to conceal the
underlying model names. If told that one of these models <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
was actually MIROC, say, then a researcher who was asked to identify which
outputs came from this specific model and who did not have unusually detailed
knowledge of this and other climate models would quite possibly assign
uniform probabilities across these sets of outputs. Now consider how the
situation would change if another set of outputs <inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:msup><mml:mi>M</mml:mi><mml:mo>∗</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> (not included in the
original set) was provided and identified as having been generated by the
MRI model. If the same researcher was again asked to predict which of
<inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> was from MIROC, then their answer would either change or
it would not, depending on their beliefs concerning the relationship between
these two climate models (which were contributed by neighbouring institutes
in Japan and have some common origins). In the case that their answer did not
change, this would imply that they considered the MRI and MIROC models to be
independent, conditional on the unlabelled ensemble of model outputs. If, on
the other hand, they thought MRI and MIROC were likely to be particularly
similar among the ensemble of climate models (due either to the legacy of
shared code or development methods), then it would be rational of them to
assign higher probabilities to the sets of outputs that were closer to
<inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:msup><mml:mi>M</mml:mi><mml:mo>∗</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> in some metric.</p>
      <p>While the subjective nature of Bayesian probability precludes a definitive
answer, we expect that, for most researchers and most model pairs where there
is no clear institutional or historical link, they will indeed believe the
models to be independent in this manner (i.e. conditional on the unlabelled
ensemble of outputs). Conversely, if the pair of models appear to differ in
only some very limited manner, such as being different resolutions of the
same underlying code (consider for example the T63 and T42 versions of CCMA
which were submitted to CMIP3) then it might be sensible for a researcher to
instead update their prediction of the unknown model, increasing
probabilities of outputs which were closer (according to some reasonable
measure) to the named model, and with decreasing probabilities assigned to
more distant outputs. The extent to which the probabilities are changed would
be a direct indication of the strength of the dependence between the models,
as judged by the researcher.</p>
      <p>An alternative but similar approach can be formulated if, instead of using
the discrete distribution of actual climate model outputs, we parameterise
their distribution, for example as a multivariate Gaussian. If given the
parameters of a Gaussian distribution based on the outputs of <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (i.e. <inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:msub><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>M</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>/</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:math></inline-formula> being the mean and
<inline-formula><mml:math id="M108" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> the standard deviation of the outputs), and asked to predict the
outputs of MIROC (knowing it to be one of the constituent models), a
researcher might reasonably decide that a reasonable answer would be to use
this Gaussian directly as the predictive distribution. Additionally, learning
the outputs and true name of an additional model <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:msup><mml:mi>M</mml:mi><mml:mo>∗</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> will leave their
prediction unchanged if and only if the researcher thinks that this model is
independent of MIROC, conditional on the ensemble distribution. If the
researcher thinks that the model <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msup><mml:mi>M</mml:mi><mml:mo>∗</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> is related to MIROC, then they might
plausibly modify their prediction, for example by shifting the original
Gaussian towards <inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:msup><mml:mi>M</mml:mi><mml:mo>∗</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> in some way. A numerical example is provided in
Sect. <xref ref-type="sec" rid="Ch1.S4.SS1"/> below.</p>
      <p>These approaches, we believe, encapsulate many of the same ideas as the model
similarity analyses of <xref ref-type="bibr" rid="bib1.bibx1" id="text.24"/>,
<xref ref-type="bibr" rid="bib1.bibx19" id="text.25"/>,
<xref ref-type="bibr" rid="bib1.bibx31" id="text.26"/> and others. However, our approaches have the advantage that
independence here can be defined in absolute terms (conditional on a clearly
defined background knowledge) and is not merely a measure of relative
difference. If a researcher does not know how to improve their prediction of
a particular model, in light of being given a particular set of outputs from
another named model, then this pair of models is in fact absolutely
independent to them in statistical terms.</p>
<sec id="Ch1.S4.SS1">
  <title>Example</title>
      <p>To provide a concrete demonstration of the previous ideas, we analyse the
models which contributed to the CMIP3 database. Several modelling centres
contributed more than one model version and we expect, based on the existing
literature such as <xref ref-type="bibr" rid="bib1.bibx19" id="text.27"/>, that these may be noticeably
more similar to each other than two models from different randomly selected
centres would be. In total, we use the outputs of 25 climate model
simulations and analyse two-dimensional climatological fields of surface air
temperature (TAS), precipitation (PREC) and sea level pressure (PSL) for
their pre-industrial control simulations. We can identify nine pairs of models
where both were contributed by the same institute and use these as examples
of models that we expect to show dependency, but note that this approach does
not make use of any detailed knowledge of model development or shared code
and other researchers might make different choices if asked to predict
dependence among the ensemble.</p>
      <p>We use as a simple distance metric the area-weighted root mean square (RMS)
difference between the climatological data fields (of commensurate variables)
after regridding to a common 5 <inline-formula><mml:math id="M112" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> Cartesian grid. For example, randomly
selected models from the ensemble have an area-weighted RMS difference of
around 3 <inline-formula><mml:math id="M113" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C. Given the model fields – or even just their pairwise RMS
differences – it would surely be difficult for most researchers to identify
with any confidence which field came from a specific model such as CSIRO3.0,
and if asked to provide a probabilistic prediction, they might reasonably
assign uniform probabilities across the set. However, if the researcher is
then given the outputs of a new model <inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:msup><mml:mi>M</mml:mi><mml:mo>∗</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> and told that it was in fact
CSIRO3.5, it would now be reasonable to expect that CSIRO3.0 was more likely
to be one of its near neighbours rather than relatively distant from it,
under the assumption that the changes arising from the development between
these model versions were relatively modest. A simple way to account for this
expected similarity, in terms of formulating a probabilistic prediction for
the outputs of CSIRO3.0, would be to assign probabilities to the unnamed sets
of outputs, in some way such that the probability decreases with distance
from CSIRO3.5. By way of demonstration, we order the unnamed models by
increasing distance from CSIRO3.5 and assign them probabilities that decrease
proportional to the sequence <inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:math></inline-formula>. The choice of this
particular sequence was of course highly subjective and many different
distributions could have been used instead.</p>
      <p>A researcher who applied this probabilistic strategy to each of the 9 pairs
of models identified as coming from the same centre would assign a typical
(geometric mean) probability of around 0.09 to the correct field of outputs,
when averaged over all model pairs and over the three types of fields TAS,
PREC and PSL. The naive uniform distribution would in contrast only assign a
probability of <inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">24</mml:mn><mml:mo>≃</mml:mo><mml:mn mathvariant="normal">0.04</mml:mn></mml:mrow></mml:math></inline-formula> to the correct field. Thus, taking account
of the shared origins can typically increase the probability of a correct
prediction by a factor of more than 2, and we may conclude that models from
the same institute are not independent, conditional on knowing the pairwise
distances between their outputs. This is of course little more than a
mathematical interpretation of the similarities noted
by <xref ref-type="bibr" rid="bib1.bibx25" id="text.28"/> and others. Thus, the result is not surprising,
but we believe it is worthwhile to demonstrate how those earlier empirical
investigations can be explained and expressed directly in terms of
statistical independence. The results for each pair of related climate
models, and for each of the three climate fields considered here, are
presented graphically in Fig. <xref ref-type="fig" rid="Ch1.F1"/>.</p>
      <p>Similar results can be obtained when the analysis is performed in parametric
terms, when rather than using the sets of model outputs, only a statistical
summary of the ensemble of outputs is provided in the form of multivariate
Gaussian approximation to their distribution <inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>,</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, where
<inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:msub><mml:mo>∑</mml:mo><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>M</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>/</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:math></inline-formula> is the ensemble mean and <inline-formula><mml:math id="M119" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> is the standard
deviation of the distribution. In this case, we consider a researcher who is
asked to predict the location of an additional model <inline-formula><mml:math id="M120" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. A natural
prediction is simply the distribution <inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>,</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. The question
of dependence then rests on whether, when told the location of a plausibly
related model <inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> already contained in the ensemble, the researcher changes
their prediction. One interesting detail to note is that for most model pairs
<inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> provided by a single modelling centre, the outputs of <inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
actually provide a marginally worse prediction of <inline-formula><mml:math id="M125" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> (in the sense of
being further away) than the ensemble mean <inline-formula><mml:math id="M126" display="inline"><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:math></inline-formula> does. However, this
very small increase in distance suggests that an interpolation almost
half-way from the ensemble mean to <inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> might provide a better prediction
still, and we find that this is indeed the case. Using <inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:msubsup><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mo>′</mml:mo></mml:msubsup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.6</mml:mn><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> as a predictor for <inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> generates a measurably
lower prediction error, typically by 10 % or so across the three data fields
used, than the original ensemble mean <inline-formula><mml:math id="M130" display="inline"><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:math></inline-formula> did. Therefore, the
original prediction of <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>,</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> can be replaced by
<inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mo>′</mml:mo></mml:msubsup><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn><mml:mo>×</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> to give a better prediction of the unknown
model <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. This result demonstrates empirically and numerically that
two models contributed by a single research centre are not conditionally
independent given <inline-formula><mml:math id="M134" display="inline"><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:math></inline-formula> and <inline-formula><mml:math id="M135" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>. These results are also
presented graphically in Fig. <xref ref-type="fig" rid="Ch1.F1"/>.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1"><caption><p>Analysis of CMIP3 models. The <inline-formula><mml:math id="M136" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis indexes the nine model pairs being
considered. Crosses represent TAS, circles PREC and diamonds PSL. Grey
symbols indicate RMS distances of all models from the target model of the
pair. Black symbols indicate distance of target model from mean of residual
ensemble, blue symbols indicate distance of target model from plausibly
related model, and red indicates distance of target model from interpolated
prediction described in the text. The blue symbols being closer to zero than
almost all grey symbols shows that related models are typically closer
together than randomly selected models, and comparing red and black symbols
shows that the interpolation improves as a predictor over the ensemble mean
in almost all individual cases, and overall.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://esd.copernicus.org/articles/8/211/2017/esd-8-211-2017-f01.pdf"/>

        </fig>

</sec>
<sec id="Ch1.S4.SS2">
  <title>Accounting for model dependence via weighting</title>
      <p>A natural question to ask is whether some weighting scheme could be developed
to account for model dependence of this type. If we anticipate that a pair of
models will be particularly similar, then including both in the ensemble
without downweighting either of them will tend to shift the ensemble mean
towards this pair of models. The correct weight to prevent this can easily be
calculated according to the interpolation formula in the following manner. If
we anticipate that a particular model <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> will help to predict a new model
<inline-formula><mml:math id="M138" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> via an interpolated prediction <inline-formula><mml:math id="M139" display="inline"><mml:mrow><mml:msubsup><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mo>′</mml:mo></mml:msubsup><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>)</mml:mo><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for some coefficient <inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:mn mathvariant="normal">0</mml:mn><mml:mo>&lt;</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, then adding <inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>
to the ensemble without any adjustment to weights (i.e. with all model
weights equal by default) will result in an a priori expectation that
the ensemble mean will be shifted towards <inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, with the effect being
stronger the closer <inline-formula><mml:math id="M143" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> is to one. One simple approach to counteract
this effect would be to discard the candidate new model, effectively giving
it a weight of zero. However, the resulting ensemble would be sensitive to
the order in which models are added, and the symmetry of the dependence
relationship suggests that it would be more reasonable to apply an equal
weight to each model in the dependent pair. If an equal weight of
<inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is applied to both models (relative to unit weighting on the
other models), then the prior expectation will be that the ensemble mean is
unchanged by the inclusion of the additional model. Perhaps the simplest way
to show this is to start from the identity that for the original ensemble
            <disp-formula id="Ch1.E5" content-type="numbered"><mml:math id="M145" display="block"><mml:mrow><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mi>i</mml:mi></mml:munder><mml:mo>(</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></disp-formula>
          due to the definition of the mean. If the additional model has an expected
output of <inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:msubsup><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mo>′</mml:mo></mml:msubsup><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>)</mml:mo><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and we apply the
same weight <inline-formula><mml:math id="M147" display="inline"><mml:mi>w</mml:mi></mml:math></inline-formula> to both models <inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M149" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, then our prior expectation
for the equivalent sum over the weighted, augmented ensemble is given by
            <disp-formula id="Ch1.E6" content-type="numbered"><mml:math id="M150" display="block"><mml:mrow><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>≠</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:munder><mml:mo>(</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>w</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>w</mml:mi><mml:mo>(</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>)</mml:mo><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mi>M</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          which simplifies to
            <disp-formula id="Ch1.E7" content-type="numbered"><mml:math id="M151" display="block"><mml:mrow><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>≠</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:munder><mml:mo>(</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>w</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>)</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>M</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:mi>w</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          This sum equals zero when <inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:mi>w</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
      <p>For example, if a second identical replicate of an existing model were to be
contributed to the ensemble (in which case <inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>) then both models
will receive a weight of 0.5, precisely cancelling out the duplication. In
the numerical example presented above, we have chosen <inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula> and thus
the appropriate weight would be <inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">1.4</mml:mn><mml:mo>≃</mml:mo><mml:mn mathvariant="normal">0.7</mml:mn></mml:mrow></mml:math></inline-formula>. Weighting the models will
not be expected a priori to affect ensemble spread, as we have no
expectation that the dependent models are systematically closer to, or
further away from, the ensemble mean when compared to the rest of the
ensemble. The effect of weighting on ensemble mean performance is also
expected to be very small as the change in effective ensemble size (which can
be defined as <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:msub><mml:mo>∑</mml:mo><mml:mi>i</mml:mi></mml:msub><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:mo>∑</mml:mo><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> are the relative weights)
will be modest. If we have an initial ensemble of, say, 15 independent models
and then 8 of these models are effectively assigned relatively higher weights
of 1.4 by addition of near-replicates to the ensemble, then the effective
ensemble size will only decrease from the original 15 to a new value of
14.6. This is a negligible difference that cannot be expected to affect
ensemble performance in any measurable way. Figure 3c and d
of <xref ref-type="bibr" rid="bib1.bibx18" id="text.29"/> shows that the typical performance of a
randomly selected sub-ensemble of, say, 20 models is only very marginally worse
than the full set of 23 used in that paper. However, if a future CMIP
ensemble were dominated by a large number of near-replicates of a small subset
of models, then this issue would undoubtedly become more important.</p>
</sec>
</sec>
<sec id="Ch1.S5">
  <title>Discussion</title>
      <p>We have presented a coherent statistical framework for understanding model
independence, and demonstrated how this framework can be applied in practice.
Climate models cannot sensibly be considered independent estimates of
reality, but fortunately this strong assumption is not required in order to
make use of them. A more plausible, though still optimistic, assumption
might be to interpret the ensemble as merely constituting independent samples
of a distribution which represents our collective understanding of the
climate system. This assumption is challenged by the near-replication of some
climate models within the ensemble, and therefore sub-sampling or
re-weighting the ensemble might be able to improve its usefulness. We have
shown how the statistical definition of (conditional) independence can apply
and how it helps in defining independence in a quantifiable and testable
manner.</p>
      <p>The definition we have presented is certainly not the only possible one and
we expect that others may be able to suggest improvements within this
framework. For instance, experts with knowledge of the model structures might
be able to predict more detailed similarities between the outputs of model
pairs. Moreover, there is no requirement that, in applying our principles, a
researcher would use the most naive ignorant prediction of uniform
probabilities across the ensemble of outputs, or the Gaussian summary of the
distribution, as their predictions of the target model.
However, our result here is sufficient to illustrate how the concept of
statistical independence can be directly applied in a quantitative
mathematical sense to the question of model independence, while encapsulating
much of what is discussed in the literature.</p>
      <p>An important point to note is that this interpretation of independence is
entirely unrelated to model and indeed ensemble
performance <xref ref-type="bibr" rid="bib1.bibx29 bib1.bibx4" id="paren.30"><named-content content-type="pre">e.g.</named-content></xref>. Here we consider these
questions to be separate topics, which require study in their own right.
Reality (e.g. observations of the real climate system) does not enter into
any of the calculations or definitions above. Thus, the two concepts of
performance and independence as used here are entirely unrelated. It remains
a challenge to develop some useful interpretation of (conditional)
independence which <italic>does</italic> use real data and which is informative
regarding both model performance and pairwise similarity. However, the
definition as presented here does have obvious applications in terms of
interpreting and using the model ensemble. It suggests that we may be able to
usefully reduce the full CMIP ensemble to a set which are independent
(conditional on the ensemble statistics, as above). This will provide a
smaller set of models for analysis and use in downstream applications
including downscaling to higher resolution regional simulations of climate
change. This is likely to be increasingly important and necessary given the
heterogenous nature of simulations which will likely be submitted to future
CMIP databases. An additional point which should not be overlooked is that
the numerical example presented here was undertaken purely in terms of the
modern climatologies of the models, and does not consider future climate
changes. However, the underlying principles of independence do apply more
broadly to any consideration of model outputs, and the conclusions reached
may be different depending on the data sets used.</p>
      <p>While the question of model similarity and ensemble member selection has
already been considered by others <xref ref-type="bibr" rid="bib1.bibx31" id="paren.31"><named-content content-type="pre">e.g.</named-content></xref>, the work here
provides a more clear-cut definition of what it means to be independent,
which is directly testable. If researchers can demonstrate dependence (in
terms of an improved prediction of model outputs as illustrated here) then
independence is violated, and if not, it may be reasonably assumed. Another
important difference between the approach presented here, and that of many
other authors, is that independence is determined a priori in terms of
the anticipated outputs of the models, rather than a posteriori in
light of the model outputs. Pairwise similarity between model outputs may
arise through convergence of different approaches to understanding the
climate system, and not merely through copying of ideas, and this would not
indicate any dependence as defined here. In fact, one pair of models which
exhibit unusually similar temperature fields in our analysis consists of the
model from CNRM and one of the GFDL models, which do not share any
particularly obvious relationship. We do not believe that coincidentally
similar behaviour should be penalised by downweighting of these models, as it
may represent a true “emergent constraint” on system behaviour. An obvious
future test of our ideas would be to apply this analysis to the CMIP5 and
CMIP6 ensembles of climate models, to check whether the interpolation and
dependence ideas presented here apply generally to ensembles of climate
models rather than being an example of over-enthusiastic data mining.</p>
</sec>
<sec id="Ch1.Sx2" specific-use="unnumbered">
  <title>Part 2 – Independence of constraints on climate system behaviour</title>
      <p>In Part 1, we discussed how the concept of independence applies to the sets
of models which form the CMIP ensembles of opportunity. In Part 2, we discuss
estimation of climate sensitivity, although the principles presented here
apply more generally to observational constraints on climate system
behaviour. While initially it may seem that this topic has little in common
with that of Part 1, we will show how the concept of probabilistic
independence also relates directly to this question. Thus, the probabilistic
background of Sect. <xref ref-type="sec" rid="Ch1.S3"/> is directly relevant and applicable here.</p>
</sec>
<sec id="Ch1.S6">
  <title>The literature concerning observational constraints on the climate sensitivity</title>
      <p>The magnitude of the equilibrium climate sensitivity <inline-formula><mml:math id="M158" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> (the
globally averaged equilibrium temperature response to a doubling of
atmospheric CO<inline-formula><mml:math id="M159" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>) has long been one of the fundamental questions of
climate change research <xref ref-type="bibr" rid="bib1.bibx9" id="paren.32"/>. A wide range of approaches
have been presented which attempt to estimate this number. Most commonly, a
Bayesian approach is used in which some prior estimate is updated by means of
an observationally based likelihood function to form a posterior estimate.
The observations frequently relate to the warming observed during the
instrumental period (which we refer to for convenience as the 20th century,
although the relevant observational data available does extend into the 19th
and 21st centuries) <xref ref-type="bibr" rid="bib1.bibx37 bib1.bibx12 bib1.bibx32" id="paren.33"/>, but analyses have also been presented which use longer-term
climate changes seen during the palaeoclimate
record <xref ref-type="bibr" rid="bib1.bibx7 bib1.bibx20" id="paren.34"/>, or short-term variations seen at
seasonal to interannual timescales <xref ref-type="bibr" rid="bib1.bibx38 bib1.bibx17" id="paren.35"/>. In
each case, however, the observations are not a direct measure of the
sensitivity <inline-formula><mml:math id="M160" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> per se but must be related to it through the use of a climate
model or models, which may be simple or complex. <xref ref-type="bibr" rid="bib1.bibx10" id="text.36"><named-content content-type="post">Box
12.2</named-content></xref> and <xref ref-type="bibr" rid="bib1.bibx2" id="text.37"/> survey and discuss some recent
analyses which use a variety of observational data sets and modelling
approaches, and <xref ref-type="bibr" rid="bib1.bibx30" id="text.38"/> cover the palaeoclimate field in
some detail.</p>
      <p>The question naturally arises as to whether these different constraints
could, and should, be synthesised. In most of the Bayesian analyses, the
prior is typically chosen to be vague, though there is some debate concerning
this choice <xref ref-type="bibr" rid="bib1.bibx5 bib1.bibx22" id="paren.39"/>. Irrespective of
the choice of prior, the posterior after updating with observations is
typically substantially narrower. One might reasonably wonder what the
results would look like if this resulting posterior was then used as the
prior in a new analysis in which it was updated by a <italic>different</italic> data
set. This question was first explicitly raised by <xref ref-type="bibr" rid="bib1.bibx3" id="text.40"/>, who made
an assumption of independence between the constraints and thus implemented a
straightforward process of sequential updating using Eq. (<xref ref-type="disp-formula" rid="Ch1.E4"/>)
which resulted in a substantially tighter constraint than had previously been
obtained. <xref ref-type="bibr" rid="bib1.bibx13" id="text.41"/> similarly updated a posterior arising from
an estimate based on the 20th century warming, with a separate data set
relating to climate changes over earlier centuries. However, the validity of
these analyses is not immediately obvious, as the independence of different
constraints has not been clearly explained or demonstrated. Nevertheless, we
always expect to learn from new observations <xref ref-type="bibr" rid="bib1.bibx23" id="paren.42"/>, so
it is reasonable to expect that an analysis which accounts for multiple lines
of evidence will generate a more precise and reliable result than analyses
that do not. It is therefore surprising that there has been very little
discussion of this topic in the climate science literature, and very few
recent attempts to combine diverse data sources, although this topic is now
receiving some fresh attention <xref ref-type="bibr" rid="bib1.bibx33" id="paren.43"/>.</p>
</sec>
<sec id="Ch1.S7">
  <title>Independence of constraints in the Bayesian context</title>
      <p>It should be clear from the discussion in Sect. <xref ref-type="sec" rid="Ch1.S3"/> that the
concept of independence in relation to multiple constraints on the
equilibrium climate sensitivity <inline-formula><mml:math id="M161" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> is more precisely expressed as
conditional independence of these constraints given <inline-formula><mml:math id="M162" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>. The issue is whether
it is valid to replace the term <inline-formula><mml:math id="M163" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> in Eq. (<xref ref-type="disp-formula" rid="Ch1.E3"/>) with
<inline-formula><mml:math id="M164" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> to form Eq. (<xref ref-type="disp-formula" rid="Ch1.E4"/>), or equivalently whether <inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>∩</mml:mo><mml:mi>A</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. This is essentially the same concept as the
“truth-centred” approach to model independence discussed briefly in
Sect. <xref ref-type="sec" rid="Ch1.S4"/>, although the skewed and asymmetric forms of
general likelihood functions means that it is not necessarily appropriate to
think of them as being centred on the true value of <inline-formula><mml:math id="M166" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>.</p>
      <p>In Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/>, we argued that ignorance of any dependency
implies independence. Given a likelihood <inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> we ask ourselves, how can
we change this by additionally including <inline-formula><mml:math id="M168" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> to form <inline-formula><mml:math id="M169" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>? If the
answer is that <inline-formula><mml:math id="M170" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> provides no additional information regarding <inline-formula><mml:math id="M171" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>
(conditional on knowing <inline-formula><mml:math id="M172" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>), then <inline-formula><mml:math id="M173" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M174" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> are conditionally independent
given <inline-formula><mml:math id="M175" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>. This answer may seem a little unsatisfactory, as it relies on a
dogmatically subjectivist and personal interpretation of probability. While
we emphasise that Bayesian probability is at its heart a fundamentally
subjective concept, it is quite usual to use numerical or mathematical models
as a tool to represent and understand our uncertainties.</p>
      <p><?xmltex \hack{\newpage}?>While the subjective nature of Bayesian priors (i.e. <inline-formula><mml:math id="M176" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="normal">Ω</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>,
where <inline-formula><mml:math id="M177" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="normal">Ω</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the researcher's personal background knowledge) has been
regularly discussed in the literature, it is less widely appreciated that the
likelihood <inline-formula><mml:math id="M178" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="normal">Ω</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>∩</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is also a fundamentally
subjective concept within the Bayesian paradigm. Even if <inline-formula><mml:math id="M179" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> is a
well-defined property of the real world (which is not always immediately
clear when <inline-formula><mml:math id="M180" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> is defined in sufficiently abstract terms), there is then no
alternative world in which <inline-formula><mml:math id="M181" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> takes a different value, with which we could
check to see which events take place in this case. Therefore, while the
likelihood should give a reasonable prediction of the evidence <inline-formula><mml:math id="M182" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> when the
correct value of <inline-formula><mml:math id="M183" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> is used, there is no objective constraint or check on
what the likelihood should predict for some alternative incorrect choice of
<inline-formula><mml:math id="M184" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>. The only practical way in which a likelihood can be constructed is via
some model which allows <inline-formula><mml:math id="M185" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> to vary, either as an explicit parameter in a
simple model or perhaps as an emergent property of a more complex model
which includes multiple sources of uncertainty. There can be no “correct” way
to vary <inline-formula><mml:math id="M186" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, again because there is no world in which <inline-formula><mml:math id="M187" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> takes a different
value against which to validate our choices. Within the Bayesian paradigm,
therefore, the likelihood can only reflect the researcher's subjective
beliefs and modelling choices rather than any physical truth. Different
models will in principle lead to different likelihoods, though in practice
there may be a reasonable level of agreement between researchers.</p>
<sec id="Ch1.S7.SS1">
  <title>Example</title>
      <p>Here we explore these ideas in a little more detail in order to illustrate how it is
possible to provide a credible basis for what are fundamentally subjective
judgements. Typically, a likelihood <inline-formula><mml:math id="M188" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is generated not as a purely
subjective matter of belief but instead justified via a model or ensemble of
models. For example, if the equilibrium sensitivity is varied across an
ensemble of energy balance models (along with other input parameters: <inline-formula><mml:math id="M189" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>
here may be used as a shorthand for a vector of relevant uncertainties) then
we will find that in simulations of the 20th century, the warming observed
will vary across the ensemble. This can then be used as the basis for the
likelihood function <xref ref-type="bibr" rid="bib1.bibx37 bib1.bibx12 bib1.bibx32" id="paren.44"><named-content content-type="pre">e.g.</named-content></xref>. Similarly, for another observable <inline-formula><mml:math id="M190" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> such as the cooling
during the LGM, which may require another set of
simulations using the same ensemble. We now outline how it is possible to
test whether <inline-formula><mml:math id="M191" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> is conditionally independent of <inline-formula><mml:math id="M192" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> given <inline-formula><mml:math id="M193" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, in the
context of this model.</p>
      <p>A simple example is used to illustrate the point. We use a zero-dimensional
energy balance model to simulate the climate changes of both the 20th century
and the LGM. For simplicity, we only consider a subset of the relevant
uncertain parameters: the equilibrium sensitivity <inline-formula><mml:math id="M194" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, the planetary
effective heat capacity <inline-formula><mml:math id="M195" display="inline"><mml:mi>C</mml:mi></mml:math></inline-formula>, the uncertainties in radiative forcing due to
aerosol forcing over the 20th century <inline-formula><mml:math id="M196" display="inline"><mml:mi>F</mml:mi></mml:math></inline-formula>, and atmospheric dust and the large
ice sheets which existed during the LGM, <inline-formula><mml:math id="M197" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M198" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>
respectively.</p>
      <p>For the warming of the 20th century, we assume the total forcing <inline-formula><mml:math id="M199" display="inline"><mml:mrow><mml:mi>G</mml:mi><mml:mo>=</mml:mo><mml:mi>G</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>
follows a linear forcing ramp from 0 in 1900 to <inline-formula><mml:math id="M200" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo><mml:mi>F</mml:mi></mml:mrow></mml:math></inline-formula> in 2000 (using a value
of 2 Wm<inline-formula><mml:math id="M201" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> to approximately represent the sum of all other forcings other
than aerosols, which are dominated by greenhouse gases). We simulate the
climate change with the zero-dimensional energy balance model which satisfies
the equation
            <disp-formula id="Ch1.E8" content-type="numbered"><mml:math id="M202" display="block"><mml:mrow><mml:mi>C</mml:mi><mml:mo>⋅</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi>T</mml:mi><mml:mo>/</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mi>G</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:mi>T</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>×</mml:mo><mml:mn mathvariant="normal">3.7</mml:mn><mml:mo>/</mml:mo><mml:mi>S</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M203" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the temperature anomaly (relative to 1900) at time <inline-formula><mml:math id="M204" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>. The
radiative forcing due to a doubling of CO<inline-formula><mml:math id="M205" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> is taken to be 3.7 Wm<inline-formula><mml:math id="M206" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. Our
first observable <inline-formula><mml:math id="M207" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> is the change in global mean surface air temperature over
the 20th century as estimated by the linear trend over this interval. During
the LGM, the climate can be assumed to be at a quasi-equilibrium and thus the
planetary heat capacity (which moderates transient changes) can be ignored.
The equilibrium temperature anomaly <inline-formula><mml:math id="M208" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> during this period is calculated as
            <disp-formula id="Ch1.E9" content-type="numbered"><mml:math id="M209" display="block"><mml:mrow><mml:mi>B</mml:mi><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>+</mml:mo><mml:mi>D</mml:mi><mml:mo>+</mml:mo><mml:mi>I</mml:mi><mml:mo>)</mml:mo><mml:mo>×</mml:mo><mml:mi>S</mml:mi><mml:mo>/</mml:mo><mml:mn mathvariant="normal">3.7</mml:mn><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where the total forcing <inline-formula><mml:math id="M210" display="inline"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>+</mml:mo><mml:mi>D</mml:mi><mml:mo>+</mml:mo><mml:mi>I</mml:mi></mml:mrow></mml:math></inline-formula> is the sum of greenhouse gases (3 Wm<inline-formula><mml:math id="M211" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>),
the uncertain dust forcing (<inline-formula><mml:math id="M212" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula>) and the uncertain effective forcing of the
ice sheet (<inline-formula><mml:math id="M213" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>) respectively. The ice sheet forcing uncertainty term used
here implicitly accounts for the nonlinearity of how this combines with the
other forcings. For simplicity, we do not consider observational
uncertainties for either the LGM or 20th century temperature changes, though
accounting for these would be straightforward. We use the following priors
which are all taken to be either uniform distributions <inline-formula><mml:math id="M214" display="inline"><mml:mrow><mml:mi>U</mml:mi><mml:mo>[</mml:mo><mml:mo>,</mml:mo><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> or Gaussian
<inline-formula><mml:math id="M215" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:mo>,</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>:

                <disp-formula specific-use="align"><mml:math id="M216" display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi>S</mml:mi><mml:mo>∼</mml:mo><mml:mi>U</mml:mi><mml:mo>[</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">6</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi>C</mml:mi><mml:mo>∼</mml:mo><mml:mi>U</mml:mi><mml:mo>[</mml:mo><mml:mn mathvariant="normal">10</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">30</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi>F</mml:mi><mml:mo>∼</mml:mo><mml:mi>N</mml:mi><mml:mo>[</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi>D</mml:mi><mml:mo>∼</mml:mo><mml:mi>N</mml:mi><mml:mo>[</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi>I</mml:mi><mml:mo>∼</mml:mo><mml:mi>N</mml:mi><mml:mo>[</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>]</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

            A plot of the simulated 20th century warming <inline-formula><mml:math id="M217" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> versus sensitivity <inline-formula><mml:math id="M218" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> is
shown in Fig. <xref ref-type="fig" rid="Ch1.F2"/>a, together with a linear regression fit to these
data. This relationship shown demonstrates the basis for a likelihood
function <inline-formula><mml:math id="M219" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>: for any specified sensitivity we can predict the resulting
temperature using the regression line (albeit with uncertainty), and
therefore we can calculate how the probability of any specific warming <inline-formula><mml:math id="M220" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>
varies with <inline-formula><mml:math id="M221" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>. In this example, the linear regression provides a good fit
to the data, though the uncertainty clearly grows towards larger sensitivity
values. Similarly, the LGM cooling <inline-formula><mml:math id="M222" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> is also linked to <inline-formula><mml:math id="M223" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>
(Fig. <xref ref-type="fig" rid="Ch1.F2"/>b), and this relationship can be used as the basis for a
likelihood function <inline-formula><mml:math id="M224" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
      <p><?xmltex \hack{\newpage}?>By construction, we already know that the two constraints are independent
given <inline-formula><mml:math id="M225" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, since the other uncertain parameters that relate to each
observations are disjoint. However, if we did not know this analytically a priori
but were instead merely able to use the model as a black box, we
could check for the independence of two sets of constraining evidence <inline-formula><mml:math id="M226" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and
<inline-formula><mml:math id="M227" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> in the following manner. Firstly, we would form the likelihood <inline-formula><mml:math id="M228" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> as
above and use this together with the known value of <inline-formula><mml:math id="M229" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> for each ensemble
member to generate a mean prediction (which we denote <inline-formula><mml:math id="M230" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mo>′</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>) of the
observation for each. On comparing to the actual observed value <inline-formula><mml:math id="M231" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> for each
ensemble member, there will typically be a residual (<inline-formula><mml:math id="M232" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>-</mml:mo><mml:msup><mml:mi>A</mml:mi><mml:mo>′</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>) between the
predicted and observed values, the magnitude of which indicates the limited
information which <inline-formula><mml:math id="M233" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> provides concerning <inline-formula><mml:math id="M234" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>. We can now explore whether an
additional observable <inline-formula><mml:math id="M235" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> is informative regarding these residuals, i.e.
whether it exhibits any systematic relationship with them. If it does not,
then we may reasonably conclude that <inline-formula><mml:math id="M236" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> provides no additional information
on, and is conditionally independent of, <inline-formula><mml:math id="M237" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> given <inline-formula><mml:math id="M238" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>. Conversely, if <inline-formula><mml:math id="M239" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> is
informative regarding the residuals, then this is proof that it is not
independent of <inline-formula><mml:math id="M240" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>.</p>
      <p>In the context of our example, we first create an ensemble with an arbitrary
but fixed value of <inline-formula><mml:math id="M241" display="inline"><mml:mrow><mml:mi>S</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3.5</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M242" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C, say, and simulate both the 20th
century warming and the LGM state for each member of this ensemble. The
likelihood function arising from Fig. <xref ref-type="fig" rid="Ch1.F2"/>a gives us a predicted
warming of <inline-formula><mml:math id="M243" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mo>′</mml:mo></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.85</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M244" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C (with uncertainty of 0.4 <inline-formula><mml:math id="M245" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C) for
these ensemble simulations. We now check the prediction errors to see whether they
exhibit any relationship with <inline-formula><mml:math id="M246" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>. Figure <xref ref-type="fig" rid="Ch1.F2"/>c indicates that they
do not, with the regression coefficients being insignificantly different from
zero. The conclusion is that the additional knowledge of <inline-formula><mml:math id="M247" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, once the
sensitivity <inline-formula><mml:math id="M248" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> is known to be 3.5 <inline-formula><mml:math id="M249" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C, does not provide any
additional help in predicting <inline-formula><mml:math id="M250" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>. <inline-formula><mml:math id="M251" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M252" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> are therefore independent,
conditional on <inline-formula><mml:math id="M253" display="inline"><mml:mrow><mml:mi>S</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3.5</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M254" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C. This
experiment can be repeated for as many different values of <inline-formula><mml:math id="M255" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> as is desired,
and the same negative result will be found. This is of course not surprising,
as the model has been constructed in this way.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2" specific-use="star"><caption><p>Outputs of ensemble simulations (red dots) and linear regression
fits (black lines): <bold>(a)</bold> 20th century warming (<inline-formula><mml:math id="M256" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>) versus equilibrium
sensitivity; <bold>(b)</bold> LGM cooling (<inline-formula><mml:math id="M257" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>) versus equilibrium sensitivity; <bold>(c)</bold> 20th
century prediction residuals (<inline-formula><mml:math id="M258" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>-</mml:mo><mml:msup><mml:mi>A</mml:mi><mml:mo>′</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>) versus LGM cooling (<inline-formula><mml:math id="M259" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>), independent
case; and <bold>(d)</bold> 20th century prediction residuals (<inline-formula><mml:math id="M260" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>-</mml:mo><mml:msup><mml:mi>A</mml:mi><mml:mo>′</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>) versus LGM cooling (<inline-formula><mml:math id="M261" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>),
dependent case.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://esd.copernicus.org/articles/8/211/2017/esd-8-211-2017-f02.png"/>

        </fig>

      <p>We now make a small change to the model, and substitute <inline-formula><mml:math id="M262" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> with <inline-formula><mml:math id="M263" display="inline"><mml:mi>F</mml:mi></mml:math></inline-formula> in
Eq. (<xref ref-type="disp-formula" rid="Ch1.E9"/>) to obtain <inline-formula><mml:math id="M264" display="inline"><mml:mrow><mml:mi>B</mml:mi><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mo>+</mml:mo><mml:mi>I</mml:mi><mml:mo>)</mml:mo><mml:mo>×</mml:mo><mml:mi>S</mml:mi><mml:mo>/</mml:mo><mml:mn mathvariant="normal">3.7</mml:mn></mml:mrow></mml:math></inline-formula>. This modified model
now makes the assumption that the magnitude of effective dust forcing at the
LGM is the same as that of the aerosol forcing during the 20th century. This
is of course again a very simplistic approach, but it is not completely
unreasonable to assume a link of some sort, as both forcings relate to the
effects of condensation nuclei on clouds. Importantly, the univariate
likelihood functions <inline-formula><mml:math id="M265" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M266" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are unchanged by this
substitution, as <inline-formula><mml:math id="M267" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M268" display="inline"><mml:mi>F</mml:mi></mml:math></inline-formula> are identically distributed. Therefore, we can
generate the same prediction for <inline-formula><mml:math id="M269" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>, conditional on a known
<inline-formula><mml:math id="M270" display="inline"><mml:mrow><mml:mi>S</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3.5</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M271" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C. However, with this change to the model, the prediction
errors are now strongly correlated with <inline-formula><mml:math id="M272" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, as is shown in
Fig. <xref ref-type="fig" rid="Ch1.F2"/>d. Therefore, a new distribution function <inline-formula><mml:math id="M273" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>
can be created which makes a more precise prediction of <inline-formula><mml:math id="M274" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> given knowledge
of both <inline-formula><mml:math id="M275" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M276" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>. Thus, it can be diagnosed from the model outputs alone,
without direct knowledge of the model's internal structure, that <inline-formula><mml:math id="M277" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M278" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>
are not independent conditional on <inline-formula><mml:math id="M279" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>. This result is of course easily
interpreted in terms of the known model structure: for a given sensitivity, a
smaller than expected cooling at the LGM suggests a low dust/aerosol forcing,
which then implies that the 20th century warming will be greater than would
be expected from knowledge of sensitivity alone.</p>
      <p>The linear regressions are not necessarily the best way to represent a
relationship that may in practice be more complex. However, such an approach
may be expected to capture any first-order effect. The central point of these
numerical experiments is to demonstrate that this dependence can in principle
be diagnosed from model outputs directly, without the need for detailed
knowledge or understanding of causal relationships embedded in the model
structure. Furthermore, a conditional likelihood <inline-formula><mml:math id="M280" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> can
subsequently be generated from the ensemble outputs. This then enables us to
generate the joint likelihood <inline-formula><mml:math id="M281" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>∩</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>|</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> as required
for a Bayesian inversion.</p>
      <p>Such analyses may be impractical for the outputs of small ensembles such as
those arising from the CMIP multi-model experiments which explore structural
uncertainties. However, they may well be plausible for larger ensembles where
parameters are varied within a single model structure. The key requirement is
that the simulations relating to different observables are performed with the
same model in order that any dependence between constraints can be explored.
The results obtained will of course depend on the model used, but this is as
expected: the likelihood is not a property of reality, but rather a
consequence of the modelling assumptions, as was discussed in
Sect. <xref ref-type="sec" rid="Ch1.S7"/>.</p>
</sec>
</sec>
<sec id="Ch1.S8">
  <title>Summary of Part 2</title>
      <p>The question of how to combine multiple constraints on climate sensitivity
has been occasionally raised, but more commonly ignored, in analyses of this
parameter. It is well known that combining constraints should lead to more
confident conclusions, but the difficulty of accounting for possible
dependency appears to have widely discouraged researchers from attempting
this <xref ref-type="bibr" rid="bib1.bibx10" id="paren.45"><named-content content-type="post">Box 12.2</named-content></xref>. This situation may start to
change <xref ref-type="bibr" rid="bib1.bibx33" id="paren.46"><named-content content-type="pre">e.g.</named-content></xref>, and we hope that the analysis
presented here will encourage others to consider the question of dependence
more directly. In particular, we have argued that independence is
fundamentally a subjective matter, but we have also shown how it may in
principle be diagnosed from an ensemble of models which purports to represent
our subjective uncertainties. A more widespread use of model ensembles which
simulate multiple observationally constrained periods (such as both modern
and palaeoclimate periods) may enable more progress to be made.</p>
</sec>
<sec id="Ch1.S9" sec-type="conclusions">
  <title>Conclusions</title>
      <p>We have discussed and presented a coherent statistical framework for
understanding independence, and explained how this applies in two distinct
applications. Climate models cannot sensibly be considered independent
estimates of reality, but fortunately this strong assumption is not required
in order to make use of them. A more plausible, though still optimistic,
assumption might be to interpret the ensemble as merely constituting
independent samples of a distribution which represents our collective
understanding of the climate system. This assumption is challenged by the
near-replication of some climate models within the ensemble, and therefore
re-weighting or sub-sampling the ensemble could improve its usefulness. We
have shown how the statistical definition of (conditional) independence can
apply and how it helps in defining independence in a quantifiable manner. The
definition we have presented is certainly not the only possible one and we
expect that others may be able to suggest improvements within this framework.</p>
      <p>When considering the use of observational evidence in constraining climate
system behaviour (including the specific example of the equilibrium climate
sensitivity), observational uncertainties themselves can generally be
regarded as independent. However, the independence of the resulting
likelihood functions is not so immediately clear, as it typically also rests
on a number of modelling assumptions and uncertainties. Here we have shown
how the question of independence can be readily interpreted and understood in
terms of the conditional prediction of observations. These ideas may be
useful in the design and analysis of ensemble experiments underpinning the
analysis of observational constraints.</p>
      <p>While our examples do not provide complete solutions to the questions raised,
we have shown how the statistical framework can be usefully applied. Further,
we see little prospect for progress to be made unless it is underpinned by a
rigorous mathematical framework. Therefore, we hope that other researchers
will be able to make use of these ideas in their future work.</p>
</sec>

      
      </body>
    <back><notes notes-type="dataavailability">

      <p>The CMIP3 data used in this
paper are available at <uri>http://www-pcmdi.llnl.gov/ipcc/about_ipcc.php</uri> (PCMDI, 2015).</p>
  </notes><notes notes-type="authorcontribution">

      <p>Both authors contributed to the research and writing.</p>
  </notes><notes notes-type="competinginterests">

      <p>The authors declare that they have no conflict of interest.</p>
  </notes><ack><title>Acknowledgements</title><p>We acknowledge the modelling groups, the Program for Climate Model Diagnosis
and Intercomparison (PCMDI) and the WCRP's Working Group on Coupled Modelling
(WGCM) for their roles in making available the WCRP CMIP3 multi-model
data set. Support of this data set is provided by the Office of Science, US
Department of Energy. We are also particularly grateful for the many helpful
suggestions made both by the reviewers and by many participants at a recent
meeting at NCAR concerning this topic.<?xmltex \hack{\newline}?><?xmltex \hack{\newline}?>
Edited by: F. Sun<?xmltex \hack{\newline}?>
Reviewed by: G. Abramowitz, B. Sanderson, and one anonymous referee</p></ack><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Abramowitz and Gupta(2008)</label><mixed-citation>Abramowitz, G. and Gupta, H.: Toward a model space and model independence
metric, Geophys. Res. Lett., 35, L05705, <ext-link xlink:href="http://dx.doi.org/10.1029/2007GL032834" ext-link-type="DOI">10.1029/2007GL032834</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Annan(2015)</label><mixed-citation>
Annan, J. D.: Recent Developments in Bayesian Estimation of Climate
Sensitivity, Current Climate Change Reports,   1–5, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Annan and Hargreaves(2006)</label><mixed-citation>Annan, J. D. and Hargreaves, J. C.: Using multiple observationally-based
constraints to estimate climate sensitivity, Geophys. Res. Lett.,
33, L06704, <ext-link xlink:href="http://dx.doi.org/10.1029/2005GL025259" ext-link-type="DOI">10.1029/2005GL025259</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Annan and Hargreaves(2010)</label><mixed-citation>Annan, J. D. and Hargreaves, J. C.: Reliability of the CMIP3 ensemble,
Geophys. Res. Lett., 37, L02703, <ext-link xlink:href="http://dx.doi.org/10.1029/2009GL041994" ext-link-type="DOI">10.1029/2009GL041994</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Annan and Hargreaves(2011a)</label><mixed-citation>
Annan, J. D. and Hargreaves, J. C.: On the generation and interpretation of
probabilistic estimates of climate sensitivity, Climatic Change, 104,
423–436, 2011a.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Annan and Hargreaves(2011b)</label><mixed-citation>
Annan, J. D. and Hargreaves, J. C.: Understanding the CMIP3 multimodel
ensemble, J. Climate, 24, 4529–4538, 2011b.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Annan et al.(2005)</label><mixed-citation>
Annan, J. D., Hargreaves, J. C., Ohgaito, R., Abe-Ouchi, A., and Emori, S.:
Efficiently constraining climate sensitivity with paleoclimate simulations,
Scientific Online Letters on the Atmosphere, 1, 181–184, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Bishop and Abramowitz(2013)</label><mixed-citation>
Bishop, C. H. and Abramowitz, G.: Climate model dependence and the replicate
Earth paradigm, Clim. Dynam., 41, 885–900, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Charney et al.(1979)</label><mixed-citation>
Charney, J. G., Arakawa, A., Baker, D. J., Bolin, B., Dickinson, R. E., Goody,
R. M., Leith, C. E., Stommel, H. M., and Wunsch, C. I.: Carbon dioxide and
climate: a scientific assessment, National Academy of Sciences, Washington,
DC, 1979.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Collins et al.(2013)</label><mixed-citation>Collins, M., Knutti, R., Arblaster, J., Dufresne, J.-L., Fichefet, T.,
Friedlingstein, P., Gao, X., Gutowski, W., Johns, T., Krinner, G., Shongwe,
M., Tebaldi, C., Weaver, A., and Wehner, M.: Long-term Climate Change:
Projections, Commitments and Irreversibility, in: Climate Change 2013: The
Physical Science Basis. Contribution of Working Group I to the Fifth
Assessment Report of the Intergovernmental Panel on Climate Change, edited
by:
Stocker, T., Qin, D., Plattner, G.-K., Tignor, M., Allen, S., Boschung, J.,
Nauels, A., Xia, Y., Bex, V., and Midgley, P., chap. 12,  1029–1136,
Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA,
<ext-link xlink:href="http://dx.doi.org/10.1017/CBO9781107415324.024" ext-link-type="DOI">10.1017/CBO9781107415324.024</ext-link>,
2013.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Cubasch et al.(2013)</label><mixed-citation>Cubasch, U., Wuebbles, D., Chen, D., Facchini, M., Frame, D., Mahowald, N., and
Winther, J.-G.: Introduction, in: Climate Change 2013: The Physical Science
Basis. Contribution of Working Group I to the Fifth Assessment Report of the
Intergovernmental Panel on Climate Change, edited by: Stocker, T., Qin, D.,
Plattner, G.-K., Tignor, M., Allen, S., Boschung, J., Nauels, A., Xia, Y.,
Bex, V., and Midgley, P., book section 1,  119–158, Cambridge University
Press, Cambridge, United Kingdom and New York, NY, USA,
<ext-link xlink:href="http://dx.doi.org/10.1017/CBO9781107415324.007" ext-link-type="DOI">10.1017/CBO9781107415324.007</ext-link>,
2013.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Forest et al.(2006)</label><mixed-citation>Forest, C. E., Stone, P. H., and Sokolov, A. P.: Estimated PDFs of climate
system properties including natural and anthropogenic forcings, Geophys.
Res. Lett, 33, L01705, <ext-link xlink:href="http://dx.doi.org/10.1029/2005GL023977" ext-link-type="DOI">10.1029/2005GL023977</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Hegerl et al.(2006)</label><mixed-citation>
Hegerl, G. C., Crowley, T. J., Hyde, W. T., and Frame, D. J.: Climate
sensitivity constrained by temperature reconstructions over the past seven
centuries, Nature, 440, 1029–1032, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Hourdin et al.(2016)</label><mixed-citation>Hourdin, F.,   Mauritsen, T.,   Gettelman, A.,   Golaz,  J.,   Balaji, V.,   Duan, Q.,
Folini, D.,   Ji,  D.,   Klocke, D.,   Qian, Y.,  Rauser,   F.,   Rio, C.,   Tomassini,  L.,
Watanabe,  M.,  and   Williamson, D.: The art and science of
climate model tuning, B. Am. Meteorol. Soc., <ext-link xlink:href="http://dx.doi.org/10.1175/BAMS-D-15-00135.1" ext-link-type="DOI">10.1175/BAMS-D-15-00135.1</ext-link>,  in press,
2016.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Jun et al.(2008a)</label><mixed-citation>
Jun, M., Knutti, R., and Nychka, D.: Local eigenvalue analysis of CMIP3
climate model errors, Tellus, 60, 992–1000, 2008a.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Jun et al.(2008b)</label><mixed-citation>
Jun, M., Knutti, R., and Nychka, D.: Spatial analysis to quantify numerical
model bias and dependence: how many climate models are there?, J. Am. Stat. Assoc., 103, 934–947, 2008b.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Knutti et al.(2006)</label><mixed-citation>
Knutti, R., Meehl, G., Allen, M., and Stainforth, D.: Constraining climate
sensitivity from the seasonal cycle in surface temperature, J. Climate, 19, 4224–4233, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Knutti et al.(2010)</label><mixed-citation>
Knutti, R., Furrer, R., Tebaldi, C., Cermak, J., and Meehl, G. A.: Challenges
in combining projections from multiple climate models, J. Climate,
23, 2739–2758, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Knutti et al.(2013)</label><mixed-citation>
Knutti, R., Masson, D., and Gettelman, A.: Climate model genealogy: Generation
CMIP5 and how we got there, Geophys. Res. Lett., 40, 1194–1199,
2013.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Köhler et al.(2010)</label><mixed-citation>
Köhler, P., Bintanja, R., Fischer, H., Joos, F., Knutti, R., Lohmann, G.,
and Masson-Delmotte, V.: What caused Earth's temperature variations during
the last 800 000 years? Data-based evidence on radiative forcing and
constraints on climate sensitivity, Quaternary Sci. Rev., 29, 129–145,
2010.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Leduc et al.(2016)</label><mixed-citation>
Leduc, M., Laprise, R., de Elía, R., and Šeparović, L.: Is
Institutional Democracy a Good Proxy for Model Independence?, J. Climate, 29, 8301–8316, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Lewis(2014)</label><mixed-citation>
Lewis, N.: Objective Inference for Climate Parameters: Bayesian,
Transformation-of-Variables, and Profile Likelihood Approaches, J. Climate, 27, 7270–7284, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Lindley(1956)</label><mixed-citation>
Lindley, D. V.: On a measure of the information provided by an experiment,
The Annals of Mathematical Statistics, 27, 986–1005, 1956.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Lloyd(2015)</label><mixed-citation>Lloyd, E. A.: Model robustness as a confirmatory virtue: The case of climate
science, Stud. Hist. Philos.  Sci. Part A, 49, 58–68,
<ext-link xlink:href="http://dx.doi.org/10.1016/j.shpsa.2014.12.002" ext-link-type="DOI">10.1016/j.shpsa.2014.12.002</ext-link>,
2015.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Masson and Knutti(2011)</label><mixed-citation>Masson, D. and Knutti, R.: Climate model genealogy, Geophys. Res. Lett, 38,
L08703, <ext-link xlink:href="http://dx.doi.org/10.1029/2011GL046864" ext-link-type="DOI">10.1029/2011GL046864</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Parker(2011)</label><mixed-citation>
Parker, W. S.: When Climate Models Agree: The Significance of Robust Model
Predictions, Philos. Sci., 78, 579–600, 2011.</mixed-citation></ref>
      <ref id="bib1.bib1"><label>1</label><mixed-citation>PCMDI (Program for Climate Model Diagnosis and Intercomparison): WCRP CMIP3 multi-model dataset, available at:
<uri>http://www-pcmdi.llnl.gov/ipcc/about_ipcc.php</uri>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Pennell and Reichler(2011)</label><mixed-citation>Pennell, C. and Reichler, T.: On the Effective Number of Climate Models,
J. Climate, 24, 2358–2367,  <ext-link xlink:href="http://dx.doi.org/10.1175/2010JCLI3814.1" ext-link-type="DOI">10.1175/2010JCLI3814.1</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Randall et al.(2007)</label><mixed-citation>
Randall, D. A., Wood, R., Bony, S., Colman, R., Fichefet, T., Fyfe, J.,
Kattsov, V., Pitman, A., Shukla, J., Srinivasan, J., Stouffer, R., Sumi, A.,
and Taylor, K.: Climate Models and Their Evaluation,  in: Climate Change
2007: The Physical Science Basis. Contribution of Working Group I to the
Fourth Assessment Report of the Intergovernmental Panel on Climate Change,
chap. 8, Cambridge University Press, Cambridge, United Kingdom and New York,
NY, USA, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Reichler and Kim(2008)</label><mixed-citation>
Reichler, T. and Kim, J.: How well do coupled models simulate today's
climate?, B. Am. Meteorol. Soc., 89, 303–311,
2008.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Rohling et al.(2012)</label><mixed-citation>
Rohling,   E. J.,  Sluijs, A.,   Dijkstra, H. A.,   Köhler, P.,   van de Wal, R. S. W.,
von der Heydt,   A. S.,  Beerling, D. J.,  Berger,  A.,   Bijl, P. K.,  Crucifix,  M.,
DeConto,  R.,   Drijfhout, S. S.,   Fedorov, A.,   Foster, G. L.,   Ganopolski, A.,  Hansen,
J.,   Hönisch,  B.,  Hooghiemstra,  H.,  Huber,  M.,   Huybers,  P.,  Knutti, R.,   Lea, D. W.,
Lourens,  L. J.,   Lunt, D.,  Masson-Delmotte,  V.,   Medina-Elizalde, M.,
Otto-Bliesner,  B.,  Pagani,  M.,   Pälike, H.,  Renssen,  H.,   Royer, D. L.,   Siddall, M.,
Valdes,   P.,   Zachos, J. C., and   Zeebe, R. E.: Making
sense of palaeoclimate sensitivity, Nature, 491, 683–691, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Sanderson et al.(2015)</label><mixed-citation>Sanderson, B. M., Knutti, R., and Caldwell, P.: A Representative Democracy to
Reduce Interdependency in a Multimodel Ensemble, J. Climate, 28,
5171–5194, <ext-link xlink:href="http://dx.doi.org/10.1175/jcli-d-14-00362.1" ext-link-type="DOI">10.1175/jcli-d-14-00362.1</ext-link>,
2015.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Skeie et al.(2014)</label><mixed-citation>Skeie, R. B., Berntsen, T., Aldrin, M., Holden, M., and Myhre, G.: A lower
and more constrained estimate of climate sensitivity using updated
observations and detailed radiative forcing time series, Earth Syst. Dynam.,
5, 139–175, <ext-link xlink:href="http://dx.doi.org/10.5194/esd-5-139-2014" ext-link-type="DOI">10.5194/esd-5-139-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Stevens et al.(2016)</label><mixed-citation>Stevens, B., Sherwood, S. C., Bony, S., and Webb, M. J.: Prospects for
narrowing bounds on earth's equilibrium climate sensitivity, Earth's
Future,  4,  512–522,  <ext-link xlink:href="http://dx.doi.org/10.1002/2016EF000376" ext-link-type="DOI">10.1002/2016EF000376</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Stocker et al.(2013)</label><mixed-citation>
Stocker, T. F., Qin, D., Plattner, G.-K., Tignor, M., Allen, S., Boschung, J.,
Nauels, A., Xia, Y., Bex, V., and   Midgeley, P. M.  (Eds.): IPCC 2013: Summary for
Policymakers, in: Climate Change 2013: The Physical Science Basis.
Contribution of Working Group I to the Fifth Assessment Report of the
Intergovernmental Panel on Climate Change, Cambridge University Press, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Tebaldi and Knutti(2007)</label><mixed-citation>
Tebaldi, C. and Knutti, R.: The use of the multi-model ensemble in
probabilistic climate projections, Philos. T. R.
Soc. A, 365, 2053–2075, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Tebaldi et al.(2005)</label><mixed-citation>
Tebaldi, C., Smith, R., Nychka, D., and Mearns, L.: Quantifying uncertainty in
projections of regional climate change: a Bayesian approach to the analysis
of multimodel ensembles, J. Climate, 18, 1524–1540, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Tol and De Vos(1998)</label><mixed-citation>Tol, R. S. and De Vos, A. F.: A Bayesian statistical analysis of the enhanced
greenhouse effect, Climatic Change, 38, 87–112, 1998.
 </mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx38"><label>Wigley et al.(2005)</label><mixed-citation>Wigley, T. M. L., Ammann, C. M., Santer, B. D., and Raper, S. C. B.: Effect of
climate sensitivity on the Response to Volcanic Forcing, J.
Geophys. Res., 110, D09107, <ext-link xlink:href="http://dx.doi.org/10.1029/2004JD005557" ext-link-type="DOI">10.1029/2004JD005557</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Wilks(1995)</label><mixed-citation>
Wilks, D. S.: Statistical methods in the Atmospheric sciences, Academic
Press, London, 1995.</mixed-citation></ref>

  </ref-list><app-group content-type="float"><app><title/>

    </app></app-group></back>
    <!--<article-title-html>On the meaning of independence in climate science</article-title-html>
<abstract-html><p class="p">The concept of independence has been frequently mentioned in climate science
research, but has rarely been defined and discussed in a theoretically robust
and quantifiable manner. In this paper we argue that any discussion must
start from a clear and unambiguous definition of what independence means and
how it can be determined. We introduce an approach based on the statistical
definition of independence, and illustrate with simple examples how it can be
applied to practical questions. Firstly, we apply these ideas to climate
models, which are frequently argued to not be independent of each other,
raising questions as to the robustness of results from multi-model ensembles.
We explore the dependence between models in a multi-model ensemble, and
suggest a possible way forward for future weighting strategies. Secondly, we
discuss the issue of independence in relation to the synthesis of multiple
observationally based constraints on the climate system, using equilibrium
climate sensitivity as an example. We show that the same statistical theory
applies to this problem, and illustrate this with a test case, indicating how
researchers may estimate dependence between multiple constraints.</p></abstract-html>
<ref-html id="bib1.bib1"><label>Abramowitz and Gupta(2008)</label><mixed-citation>
Abramowitz, G. and Gupta, H.: Toward a model space and model independence
metric, Geophys. Res. Lett., 35, L05705, <a href="http://dx.doi.org/10.1029/2007GL032834" target="_blank">doi:10.1029/2007GL032834</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Annan(2015)</label><mixed-citation>
Annan, J. D.: Recent Developments in Bayesian Estimation of Climate
Sensitivity, Current Climate Change Reports,   1–5, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Annan and Hargreaves(2006)</label><mixed-citation>
Annan, J. D. and Hargreaves, J. C.: Using multiple observationally-based
constraints to estimate climate sensitivity, Geophys. Res. Lett.,
33, L06704, <a href="http://dx.doi.org/10.1029/2005GL025259" target="_blank">doi:10.1029/2005GL025259</a>, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Annan and Hargreaves(2010)</label><mixed-citation>
Annan, J. D. and Hargreaves, J. C.: Reliability of the CMIP3 ensemble,
Geophys. Res. Lett., 37, L02703, <a href="http://dx.doi.org/10.1029/2009GL041994" target="_blank">doi:10.1029/2009GL041994</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Annan and Hargreaves(2011a)</label><mixed-citation>
Annan, J. D. and Hargreaves, J. C.: On the generation and interpretation of
probabilistic estimates of climate sensitivity, Climatic Change, 104,
423–436, 2011a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Annan and Hargreaves(2011b)</label><mixed-citation>
Annan, J. D. and Hargreaves, J. C.: Understanding the CMIP3 multimodel
ensemble, J. Climate, 24, 4529–4538, 2011b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Annan et al.(2005)</label><mixed-citation>
Annan, J. D., Hargreaves, J. C., Ohgaito, R., Abe-Ouchi, A., and Emori, S.:
Efficiently constraining climate sensitivity with paleoclimate simulations,
Scientific Online Letters on the Atmosphere, 1, 181–184, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Bishop and Abramowitz(2013)</label><mixed-citation>
Bishop, C. H. and Abramowitz, G.: Climate model dependence and the replicate
Earth paradigm, Clim. Dynam., 41, 885–900, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Charney et al.(1979)</label><mixed-citation>
Charney, J. G., Arakawa, A., Baker, D. J., Bolin, B., Dickinson, R. E., Goody,
R. M., Leith, C. E., Stommel, H. M., and Wunsch, C. I.: Carbon dioxide and
climate: a scientific assessment, National Academy of Sciences, Washington,
DC, 1979.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Collins et al.(2013)</label><mixed-citation>
Collins, M., Knutti, R., Arblaster, J., Dufresne, J.-L., Fichefet, T.,
Friedlingstein, P., Gao, X., Gutowski, W., Johns, T., Krinner, G., Shongwe,
M., Tebaldi, C., Weaver, A., and Wehner, M.: Long-term Climate Change:
Projections, Commitments and Irreversibility, in: Climate Change 2013: The
Physical Science Basis. Contribution of Working Group I to the Fifth
Assessment Report of the Intergovernmental Panel on Climate Change, edited
by:
Stocker, T., Qin, D., Plattner, G.-K., Tignor, M., Allen, S., Boschung, J.,
Nauels, A., Xia, Y., Bex, V., and Midgley, P., chap. 12,  1029–1136,
Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA,
<a href="http://dx.doi.org/10.1017/CBO9781107415324.024" target="_blank">doi:10.1017/CBO9781107415324.024</a>,
2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Cubasch et al.(2013)</label><mixed-citation>
Cubasch, U., Wuebbles, D., Chen, D., Facchini, M., Frame, D., Mahowald, N., and
Winther, J.-G.: Introduction, in: Climate Change 2013: The Physical Science
Basis. Contribution of Working Group I to the Fifth Assessment Report of the
Intergovernmental Panel on Climate Change, edited by: Stocker, T., Qin, D.,
Plattner, G.-K., Tignor, M., Allen, S., Boschung, J., Nauels, A., Xia, Y.,
Bex, V., and Midgley, P., book section 1,  119–158, Cambridge University
Press, Cambridge, United Kingdom and New York, NY, USA,
<a href="http://dx.doi.org/10.1017/CBO9781107415324.007" target="_blank">doi:10.1017/CBO9781107415324.007</a>,
2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Forest et al.(2006)</label><mixed-citation>
Forest, C. E., Stone, P. H., and Sokolov, A. P.: Estimated PDFs of climate
system properties including natural and anthropogenic forcings, Geophys.
Res. Lett, 33, L01705, <a href="http://dx.doi.org/10.1029/2005GL023977" target="_blank">doi:10.1029/2005GL023977</a>, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Hegerl et al.(2006)</label><mixed-citation>
Hegerl, G. C., Crowley, T. J., Hyde, W. T., and Frame, D. J.: Climate
sensitivity constrained by temperature reconstructions over the past seven
centuries, Nature, 440, 1029–1032, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Hourdin et al.(2016)</label><mixed-citation>
Hourdin, F.,   Mauritsen, T.,   Gettelman, A.,   Golaz,  J.,   Balaji, V.,   Duan, Q.,
Folini, D.,   Ji,  D.,   Klocke, D.,   Qian, Y.,  Rauser,   F.,   Rio, C.,   Tomassini,  L.,
Watanabe,  M.,  and   Williamson, D.: The art and science of
climate model tuning, B. Am. Meteorol. Soc., <a href="http://dx.doi.org/10.1175/BAMS-D-15-00135.1" target="_blank">doi:10.1175/BAMS-D-15-00135.1</a>,  in press,
2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Jun et al.(2008a)</label><mixed-citation>
Jun, M., Knutti, R., and Nychka, D.: Local eigenvalue analysis of CMIP3
climate model errors, Tellus, 60, 992–1000, 2008a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Jun et al.(2008b)</label><mixed-citation>
Jun, M., Knutti, R., and Nychka, D.: Spatial analysis to quantify numerical
model bias and dependence: how many climate models are there?, J. Am. Stat. Assoc., 103, 934–947, 2008b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Knutti et al.(2006)</label><mixed-citation>
Knutti, R., Meehl, G., Allen, M., and Stainforth, D.: Constraining climate
sensitivity from the seasonal cycle in surface temperature, J. Climate, 19, 4224–4233, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Knutti et al.(2010)</label><mixed-citation>
Knutti, R., Furrer, R., Tebaldi, C., Cermak, J., and Meehl, G. A.: Challenges
in combining projections from multiple climate models, J. Climate,
23, 2739–2758, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Knutti et al.(2013)</label><mixed-citation>
Knutti, R., Masson, D., and Gettelman, A.: Climate model genealogy: Generation
CMIP5 and how we got there, Geophys. Res. Lett., 40, 1194–1199,
2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Köhler et al.(2010)</label><mixed-citation>
Köhler, P., Bintanja, R., Fischer, H., Joos, F., Knutti, R., Lohmann, G.,
and Masson-Delmotte, V.: What caused Earth's temperature variations during
the last 800 000 years? Data-based evidence on radiative forcing and
constraints on climate sensitivity, Quaternary Sci. Rev., 29, 129–145,
2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Leduc et al.(2016)</label><mixed-citation>
Leduc, M., Laprise, R., de Elía, R., and Šeparović, L.: Is
Institutional Democracy a Good Proxy for Model Independence?, J. Climate, 29, 8301–8316, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Lewis(2014)</label><mixed-citation>
Lewis, N.: Objective Inference for Climate Parameters: Bayesian,
Transformation-of-Variables, and Profile Likelihood Approaches, J. Climate, 27, 7270–7284, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Lindley(1956)</label><mixed-citation>
Lindley, D. V.: On a measure of the information provided by an experiment,
The Annals of Mathematical Statistics, 27, 986–1005, 1956.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Lloyd(2015)</label><mixed-citation>
Lloyd, E. A.: Model robustness as a confirmatory virtue: The case of climate
science, Stud. Hist. Philos.  Sci. Part A, 49, 58–68,
<a href="http://dx.doi.org/10.1016/j.shpsa.2014.12.002" target="_blank">doi:10.1016/j.shpsa.2014.12.002</a>,
2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Masson and Knutti(2011)</label><mixed-citation>
Masson, D. and Knutti, R.: Climate model genealogy, Geophys. Res. Lett, 38,
L08703, <a href="http://dx.doi.org/10.1029/2011GL046864" target="_blank">doi:10.1029/2011GL046864</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Parker(2011)</label><mixed-citation>
Parker, W. S.: When Climate Models Agree: The Significance of Robust Model
Predictions, Philos. Sci., 78, 579–600, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>1</label><mixed-citation>
PCMDI (Program for Climate Model Diagnosis and Intercomparison): WCRP CMIP3 multi-model dataset, available at:
<a href="http://www-pcmdi.llnl.gov/ipcc/about_ipcc.php" target="_blank">http://www-pcmdi.llnl.gov/ipcc/about_ipcc.php</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Pennell and Reichler(2011)</label><mixed-citation>
Pennell, C. and Reichler, T.: On the Effective Number of Climate Models,
J. Climate, 24, 2358–2367,  <a href="http://dx.doi.org/10.1175/2010JCLI3814.1" target="_blank">doi:10.1175/2010JCLI3814.1</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Randall et al.(2007)</label><mixed-citation>
Randall, D. A., Wood, R., Bony, S., Colman, R., Fichefet, T., Fyfe, J.,
Kattsov, V., Pitman, A., Shukla, J., Srinivasan, J., Stouffer, R., Sumi, A.,
and Taylor, K.: Climate Models and Their Evaluation,  in: Climate Change
2007: The Physical Science Basis. Contribution of Working Group I to the
Fourth Assessment Report of the Intergovernmental Panel on Climate Change,
chap. 8, Cambridge University Press, Cambridge, United Kingdom and New York,
NY, USA, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Reichler and Kim(2008)</label><mixed-citation>
Reichler, T. and Kim, J.: How well do coupled models simulate today's
climate?, B. Am. Meteorol. Soc., 89, 303–311,
2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Rohling et al.(2012)</label><mixed-citation>
Rohling,   E. J.,  Sluijs, A.,   Dijkstra, H. A.,   Köhler, P.,   van de Wal, R. S. W.,
von der Heydt,   A. S.,  Beerling, D. J.,  Berger,  A.,   Bijl, P. K.,  Crucifix,  M.,
DeConto,  R.,   Drijfhout, S. S.,   Fedorov, A.,   Foster, G. L.,   Ganopolski, A.,  Hansen,
J.,   Hönisch,  B.,  Hooghiemstra,  H.,  Huber,  M.,   Huybers,  P.,  Knutti, R.,   Lea, D. W.,
Lourens,  L. J.,   Lunt, D.,  Masson-Delmotte,  V.,   Medina-Elizalde, M.,
Otto-Bliesner,  B.,  Pagani,  M.,   Pälike, H.,  Renssen,  H.,   Royer, D. L.,   Siddall, M.,
Valdes,   P.,   Zachos, J. C., and   Zeebe, R. E.: Making
sense of palaeoclimate sensitivity, Nature, 491, 683–691, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Sanderson et al.(2015)</label><mixed-citation>
Sanderson, B. M., Knutti, R., and Caldwell, P.: A Representative Democracy to
Reduce Interdependency in a Multimodel Ensemble, J. Climate, 28,
5171–5194, <a href="http://dx.doi.org/10.1175/jcli-d-14-00362.1" target="_blank">doi:10.1175/jcli-d-14-00362.1</a>,
2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Skeie et al.(2014)</label><mixed-citation>
Skeie, R. B., Berntsen, T., Aldrin, M., Holden, M., and Myhre, G.: A lower
and more constrained estimate of climate sensitivity using updated
observations and detailed radiative forcing time series, Earth Syst. Dynam.,
5, 139–175, <a href="http://dx.doi.org/10.5194/esd-5-139-2014" target="_blank">doi:10.5194/esd-5-139-2014</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Stevens et al.(2016)</label><mixed-citation>
Stevens, B., Sherwood, S. C., Bony, S., and Webb, M. J.: Prospects for
narrowing bounds on earth's equilibrium climate sensitivity, Earth's
Future,  4,  512–522,  <a href="http://dx.doi.org/10.1002/2016EF000376" target="_blank">doi:10.1002/2016EF000376</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Stocker et al.(2013)</label><mixed-citation>
Stocker, T. F., Qin, D., Plattner, G.-K., Tignor, M., Allen, S., Boschung, J.,
Nauels, A., Xia, Y., Bex, V., and   Midgeley, P. M.  (Eds.): IPCC 2013: Summary for
Policymakers, in: Climate Change 2013: The Physical Science Basis.
Contribution of Working Group I to the Fifth Assessment Report of the
Intergovernmental Panel on Climate Change, Cambridge University Press, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Tebaldi and Knutti(2007)</label><mixed-citation>
Tebaldi, C. and Knutti, R.: The use of the multi-model ensemble in
probabilistic climate projections, Philos. T. R.
Soc. A, 365, 2053–2075, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Tebaldi et al.(2005)</label><mixed-citation>
Tebaldi, C., Smith, R., Nychka, D., and Mearns, L.: Quantifying uncertainty in
projections of regional climate change: a Bayesian approach to the analysis
of multimodel ensembles, J. Climate, 18, 1524–1540, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Tol and De Vos(1998)</label><mixed-citation>
Tol, R. S. and De Vos, A. F.: A Bayesian statistical analysis of the enhanced
greenhouse effect, Climatic Change, 38, 87–112, 1998.

</mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Wigley et al.(2005)</label><mixed-citation>
Wigley, T. M. L., Ammann, C. M., Santer, B. D., and Raper, S. C. B.: Effect of
climate sensitivity on the Response to Volcanic Forcing, J.
Geophys. Res., 110, D09107, <a href="http://dx.doi.org/10.1029/2004JD005557" target="_blank">doi:10.1029/2004JD005557</a>, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Wilks(1995)</label><mixed-citation>
Wilks, D. S.: Statistical methods in the Atmospheric sciences, Academic
Press, London, 1995.
</mixed-citation></ref-html>--></article>
