44

This message in a Reuter's article from 25.02.2019 is currently all over the news:

Evidence for man-made global warming hits 'gold standard'

[Scientists] said confidence that human activities were raising the heat at the Earth’s surface had reached a “five-sigma” level, a statistical gauge meaning there is only a one-in-a-million chance that the signal would appear if there was no warming.

I believe that this refers to this article "Celebrating the anniversary of three key events in climate change science" which contains a plot, which is shown schematically below (It is a sketch because I could not find an open source image for an original, similar free images are found here). Another article from the same research group, which seems to be a more original source, is here (but it uses a 1% significance instead of $5\sigma$).


The plot presents measurements from three different research groups: Remote Sensing Systems, the Center for Satellite Applications and Research, and the University of Alabama at Huntsville.

The plot displays three rising curves of signal to noise ratio as a function of trend length.

anthropogenic signal

So somehow scientists have measured an anthropogenic signal of global warming (or climate change?) at a $5\sigma$ level, which is apparently some scientific standard of evidence.

For me such graph, which has a high level of abstraction, raises many questions$^{\dagger}$, and in general I wonder about the question 'How did they do this?'. How do we explain this experiment into simple words (but not so abstract) and also explain the meaning of the $5\sigma$ level?

I ask this question here because I do not want a discussion about climate. Instead I want answers regarding the statistical content and especially to clarify the meaning of such a statement that is using/claiming $5 \sigma$.


$^\dagger$ What is the null hypothesis? How did they set up the experiment to get a anthropogenic signal? What is the effect size of the signal? Is it just a small signal and we only measure this now because the noise is decreasing, or is the signal increasing? What kind of assumptions are made to create the statistical model by which they determine the crossing of a 5 sigma threshold (independence, random effects, etc...)? Why are the three curves for the different research groups different, do they have different noise or do they have different signals, and in the case of the latter, what does that mean regarding the interpretation of probability and external validity?

amoeba
  • 104,745
  • (+1) Thanks for asking this. I have many of the same and similar questions ! – Robert Long Feb 27 '19 at 10:26
  • The article in Nature has Supplementary Information; sections 6 and 7 in it are about the detection time and give the authors' comments on this issue. However, their words are not simple, and it wouldn't be easy to extract the statistical content from the climate modeling. See https://static-content.springer.com/esm/art%3A10.1038%2Fs41558-019-0424-x/MediaObjects/41558_2019_424_MOESM1_ESM.pdf – Matt F. Mar 06 '19 at 08:39
  • @MattF. I have been searching for simplistic explanations by others, and left the supplementary material asside for the moment, only skim reading it. I will do that after the bounty ends but it is not really straightforward and I would need to dig trough some additional literature as well (I found an explanation by Ross mcKitrick but that got criticised by the lead author with a bombardment of arguments). I almost start to believe that climate researchers are deliberately vague it is 'proof by intimidation'. – Sextus Empiricus Mar 06 '19 at 09:27
  • You seem to expect: 1) that an article published two weeks ago will have simple expositions elsewhere that satisfy all your questions, 2) that the statistical decisions, including the choice of null hypothesis, will all be justified without reference to the scientific context, 3) that the supplementary information will be skimmable and self-contained even for someone not in that field. I do not have those expectations. I agree that climate modeling is complicated, but I don’t think the goal of the researchers is intimidation; they’re just writing for people with different graduate training. – Matt F. Mar 06 '19 at 14:06
  • 3
    @MattF. My expectation is that it will be possible to make a simple exposition that explains the statistical concept of the $5\sigma$ threshold that has been used here (at least the high energy particle physicists, who also use $\sigma$ discrepancies/effects to describe signal to noise ratio's in counts of events, have no problem with this). With simple I mean something stripped away from the climatology jargon, but sophisticated enough to contain the essence. Say, it would be something written for professional statisticians and mathematicians such that they can understand the $5\sigma$ here. – Sextus Empiricus Mar 06 '19 at 14:50
  • 1
  • the article (the supplementary material) relates to an older article from last summer and the concept is not so fresh as two weeks 2) not my expectation, but it should at least be clear what they did, and how it can be put into simpler words. I don't expect to understand the scientific context fully, but good enough to understand why and how they applied the statistics 3) I normally have little problems understanding a different scientific piece of work, at least superficially. This requires however a specific type of presentation.
  • – Sextus Empiricus Mar 06 '19 at 14:57
  • 2
    To stress the contrast with high energy physics: for this field statisticians can understand that the $5\sigma$ level is basically meaningless and the bar is set high because the computation is technically wrong (1. the look elsewhere effect 2. wrong assumptions about the error distribution ignoring systematic effects 3. implicitly doing a Bayesian analysis, 'extraordinary claims require extraordinary evidence'). – Sextus Empiricus Mar 06 '19 at 15:11
  • 3
    The question is how much these three effects are present in the case of this man-made global warming article. I think it is important to make this clear, to demystify the sciency claims. It is so common to just throw some numbers into an argument to make it sound rigorous, and most people stop questioning it. – Sextus Empiricus Mar 06 '19 at 15:15
  • This link http://www.digitaljournal.com/news/environment/evidence-for-man-made-global-warming-hits-gold-standard/article/544095, combined with the in-article link (here: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.143.4030&rep=rep1&type=pdf) to the 1999 report on anthropogenic warming (the second of the three articles referenced,) may answer some of your questions. The tl;dr: $5\sigma$ comes from particle physics; the "gold standard" for signal detection (e.g., the Higgs boson.) The linked report is a summary of other reports, and does go into some detail of interest. – jbowman Mar 07 '19 at 03:21
  • Note that the article you reference is not the original research itself, it's highlighting three milestones in the history of research on this topic. – jbowman Mar 07 '19 at 03:24
  • 1
    @jbowman I will look into your links. Note that I also linked to the original research itselve in another link. And yes, I know about $5\sigma $ in physics. That is exactly the reason for the question, since the $5\sigma $ in physics is an extremely crude (and criticised) measure to capture multiple problems. – Sextus Empiricus Mar 07 '19 at 07:41
  • 3
    Have you seen this critique: https://judithcurry.com/2019/03/01/critique-of-the-new-santer-et-al-2019-paper/ ? – Robert Long Mar 07 '19 at 20:29
  • @RobertLong yes I read that critique. I mentioned it in an earlier comment. – Sextus Empiricus Mar 11 '19 at 08:12
  • @MartijnWeterings I don't think I saw that comment and can't seem to find it - am i missing something ? I am very interested in this topic and like you, suspect there are some in climatology research that are deliberately obfuscating the analysis of their research. – Robert Long Mar 11 '19 at 09:52
  • @RobertLong I mentioned it Mar 6 9:27. At the same point where I mentioned my suspicions about the deliberate obfuscation (although I feel now I should scrap the word 'deliberate', that is a bit of a hard conviction, for such weak and unfounded suspicion. That was a bit overdrawn, maybe to get some more attention or to stress the importance of the current knowledge gap between climate scientists and other scientist and the media). – Sextus Empiricus Mar 11 '19 at 10:04
  • I hope, when I get the time, to be able myselve to get a nice and simple summary that will help to understand the statistical principles that have been used. I like Nino Rode's answer but I find it not enough to fully answer the question. I would like to flesh out the description of signal and noise he gives there. – Sextus Empiricus Mar 11 '19 at 10:08
  • 3
    Coincidentally I was reading these papers just a few days ago, and now noticed your new bounty. I might write something up now. – amoeba Oct 06 '19 at 14:46
  • But can you be more specific what exactly you find lacking in the existing answer? – amoeba Oct 06 '19 at 15:20
  • @amoeba The current answer is a bit superficial. It mostly explains that the $\sigma$ relates to signal/noise, but I get that principle. What I am wondering about is how the derivation goes for this article. To be honest I have not taken the time to read in detail the supplementary materials, but this stuff is so much deep referencing several layers that it becomes very difficult to trace back whether the principles are standing strong or not...... – Sextus Empiricus Oct 06 '19 at 15:55
  • .....so I am looking for some simpler and quick introduction that describes the measurements the data (and possibly some data analysis when it comes to the climate models, but not the principle of 5 sigma) that form the basis for these kind of results. (the people in this link - judithcurry see next message - seem to be able to even reproduce the graph, that would be my golden aim to be able to do; and then break it down to see how it intuitively works and what it basically means; I guess you may have a bit of an idea about how my intuition works and what sort of explanations I like) – Sextus Empiricus Oct 06 '19 at 16:03
  • the link: https://judithcurry.com/2019/03/01/critique-of-the-new-santer-et-al-2019-paper/amp/ – Sextus Empiricus Oct 06 '19 at 16:04
  • Note to myself: this document explains how to model work in R with data from CMIP temperature models https://journal.r-project.org/archive/2017/RJ-2017-032/RJ-2017-032.pdf and here is a r-package with this goal: https://cran.r-project.org/web/packages/RCMIP5/README.html – Sextus Empiricus Oct 11 '19 at 11:59
  • 1
    obviously ridiculous statement. nothing can be at 5 sigma level in climate research – Aksakal Aug 09 '21 at 18:40