I've been arguing at WUWT, eg here. More and more I find people saying that all surface measures are totally corrupt. Of course, they give no evidence or rational argument. And when sceptics do mount an effort to actually investigate, eg here, it falls in a heap. BEST was actually one such effort that was followed through, but ended up confirming the main indices. So of course that is corrupt too.
As linked, I do sometimes point out that I have been tracking for six years with an index, TempLS, which uses unadjusted GHCN and gets very similar results to GISS and others. I have posted the code, which is only about 200 lines, and I have posted monthly predictions (ahead of GISS and all) for about six years. But no, they say, GHCN unadjusted is corrupted too. All rigged by Hansen or someone before you see it.
The proper way to deal with this is for some such sceptic to actually follow through the quite transparent recording process, and try to find some error. But I see no inclination there to do that. Just shout louder.
So here I'll track through the process whereby readings in my country, from BoM, go through the WMO collection in CLIMAT forms, and so into the GHCN repository. That's partly to show how it can be done, if a sceptic ever was inclined to stop ranting and start investigating.
I'll illustrate from my home town, Melbourne. For the state (and others), BoM posts half-hourly AWS readings, within a few minutes of measurement. The statewide site is here. Here is the line (as of now) which shows current temps, and ringed min/max.
Actually, this isn't quite so useful for my demo, because while the max is OK for today, the min shown is only the minimum so far; it will show the proper min in the morning. But in a day or so, you'll be able to check. If you drill down to the Melbourne page, you'll see the last few days of half-hour readings (and also daily max/min). These are the numbers that are quoted in news reports, etc. If someone says it was 35° yesterday, that is what they are quoting.
So, on the "corrupted" issue, that doesn't work here. Firstly, it would go against other people's experience and measurement if it were fiddled. And secondly, there is just no way that the process could have human intervention. There are thousands of figures posted every half hour.
Those numbers are entered into the current month file. Here is a brief extract:
Today's numbers aren't yet entered, but will appear in a few hours. When the month is done, a page for that month is posted - the last 13 months are available. Here is the page for October for Melbourne Airport. The summary numbers are here; I have red-ringed the relevant numbers:
Now those are the numbers that are sent off on the CLIMAT form to WMO. If you follow through there, you'll see 100 entries for Australia. Here is an extract for Melbourne Airport for October 2016. I have red-ringed the min/max, which you can see correspond to the BoM posted file. It adds a calculation of the average (13.2), which I have brown-ringed.
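To make the arithmetic of that chain concrete, here is a minimal R sketch (synthetic data, not BoM's code) of the aggregation the post walks through: half-hourly readings to daily max/min, then monthly means, then a CLIMAT-style monthly average. The (mean max + mean min)/2 convention is my assumption about how a figure like the 13.2 is formed.

# Synthetic half-hourly "observations" for a 31-day month
set.seed(1)
days     <- rep(1:31, each = 48)                      # day of month
halfhour <- rep(0:47, times = 31)                     # 48 half-hourly slots per day
air_temp <- 13 + 6 * sin(2 * pi * halfhour / 48) + rnorm(31 * 48, sd = 1.5)

daily_max <- tapply(air_temp, days, max)              # the daily max column
daily_min <- tapply(air_temp, days, min)              # the daily min column

mean_max <- mean(daily_max)                           # monthly mean of daily maxima
mean_min <- mean(daily_min)                           # monthly mean of daily minima
tavg     <- (mean_max + mean_min) / 2                 # CLIMAT-style monthly mean
round(c(mean_max = mean_max, mean_min = mean_min, TAVG = tavg), 1)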
If you are really hankering for authenticity, you can scroll down to see the actual code they send. And finally, you can find the unadjusted GHCN data here. It's a big file, and you have to gunzip and untar. You can also get a file for max and min. Then you have a text file, which, if you search for 501948660002016TAVG you see this line:
It has the monthly numbers for 2016 (as integer, multiplied by 100) for Melb Airport, with some letter flags. I have ringed the corresponding number 1320. These are the data I use in TempLS unadjusted.
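For anyone who would rather script that last check than eyeball the text file, here is a hedged R sketch of decoding the line, assuming the fixed-width layout documented in the GHCN v3 README (11-character station ID, year, element, then twelve 8-character value/flag groups; values in hundredths of a degree, -9999 for missing). The file name below is illustrative - the dated part changes with each release.

dat  <- readLines("ghcnm.tavg.v3.x.x.qcu.dat")                  # unpacked from the qcu tar.gz
line <- grep("^501948660002016TAVG", dat, value = TRUE)[1]      # Melbourne Airport, 2016

vals  <- sapply(0:11, function(i) substr(line, 20 + i * 8, 24 + i * 8))
temps <- as.numeric(vals)
temps[temps == -9999] <- NA
setNames(temps / 100, month.abb)                                # October should show as 13.2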
So you can see for Australia at least, the numbers can be followed through from the posted reports every 1/2 hour to the GHCN unadjusted monthly. Of course, it is just one month. But the pattern is there, and anyone who wants to say the process is corrupted should follow through other months to see if they can find a discrepancy. I bet they won't.
Sunday, December 4, 2016
How public is the GTS data? BAS gets it (via the UKMO I think) and the web interface I wrote years ago still seems to be going (https://legacy.bas.ac.uk/met/metlog/).
I haven't come across any way of accessing it. It looks as if it would take a lot of work to make it easy for home use.
Small caveat: there can be a difference between the monthly average you could compute from daily data from the near real-time GTS and the monthly CLIMAT values. The meteorological GTS data has to go fast to be used for weather prediction. The CLIMAT messages for climatology undergo better quality control, often including manual QC, which may flag values as faulty that could not be detected as such in the near real-time GTS data.
Nice work Nick. Very interesting. But not all countries are so well automated. I suspect quite a bit of data is still hand-coded in many areas. And if you want to check old data, you would have to look for paper copies, or scans of paper copies, to check it. VERY TEDIOUS indeed.
It's my understanding that only monthly averages are subjected to homogenization. I seriously doubt that the raw monthly data you use for TempLS is likely to have been tampered with, although I'm sure there is bound to be a very small percentage of coding errors in the older data (but that's a different issue).
It was interesting to see the BoM data reports. It appears that the min/max temperatures are not temporally resolved at more than half hour intervals, which is probably not ideal, but should not likely lead to any substantial bias over longer averaging times. Similarly, with automated data systems, we can now easily calculate monthly averages directly from all of the raw 1-minute, 5-minute, half hour, and/or hourly data instead of computing the mean of the minimum and maximum each day and using that to calculate the monthly average. However, my guess is that the differences are small and probably random, so that little or no substantial bias is introduced by doing it the old fashioned way for monthly and annual averages.
In my mind, the factors that contribute most to uncertainty in trying to estimate a global average surface temperature are consistency and representativeness of the constituent measurements over time as well as lack of spatial coverage over large areas. My guess is that how these issues are handled may have the greatest effect on the final estimates and derived trends. Oceans, deserts, remote mountains, and polar regions are probably the most problematic areas in this regard.
Conceptually, in the future I suspect that using the initializations for global weather models will be the best way to estimate and track global and large regional surface temperatures and temperature anomalies, as well as other modeled met parameters. In practice, there is probably still plenty of room for improvement, but what we have now may be an adequate start with the CFSV2 and ERAI.
Ideally, maybe one of these days there will be global weather models that work well enough to be used to forecast out to months and years, and maybe even decades with reasonable accuracy. It would also be interesting to look backward in time to peak glacial periods to use the same weather forecast models to look at hypothetical daily weather patterns as might have existed say 20,000 years ago or to model very warm periods like 50 million years ago. I'd really like to see a climate model that can predict the next glacial period. If humanity survives for a few more hundreds of thousands of years, maybe this will come to pass.
Nick wrote: "Of course, they give no evidence or rational argument. And when sceptics do mount an effort to actually investigate, eg here, it falls in a heap. BEST was actually one such effort that was followed through, but ended up confirming the main indices. So of course that is corrupt too."
I'll agree with you that most of the complaints about temperature data are poorly informed. I'll suggest some that may have some validity.
You've demonstrated that Australia sends raw data to the unadjusted GHCN database. That doesn't mean that other countries do the same. Victor notes that even Australia may do some QC, which is good.
No one ever sees raw GHCN data. They see processed output without transparency or explanation. Complete transparency would show the global average for all stations and say that this number is worthless for monitoring long-term climate change, because temperature also changes with the season, seasonal changes are much bigger than year-to-year changes, and there are far more thermometers in the NH than the SH. Then we could see average temperature for November for Station X and the anomaly for that month. And the average global temperature anomaly for that month - with a clear warning that this number is also not useful for monitoring climate change because stations are not spread equally around the globe and the number and location of stations is constantly changing. Then you could show the average temperature in each grid cell (or whatever method one uses to deal with station location inhomogeneity) and the global average grid cell anomaly.
Then we get to the most difficult problem, homogenization of apparent breakpoints - a process I think is scientifically dubious. Without metadata, one doesn't know if a breakpoint has been caused by a sudden change to new measuring conditions (TOB, equipment change or station move) OR by a gradual deterioration in measuring conditions that is corrected by maintenance (screen albedo, ventilation, changes near the station). If you have metadata - especially for a change in TOB (which can be corrected by a validated method in the US) - correction is necessary. However, even a documented station move - say from a gradually urbanizing site to a large nearby park - may restore conditions similar to earlier measurements. Anytime you homogenize a breakpoint caused by correcting gradually deteriorating conditions (maintenance), you introduce bias into the record. Since we have no idea why there are so many apparent undocumented breakpoints in our records, I suspect the best answer is to show both the unhomogenized and homogenized results; the truth probably lies between these two values. That won't change the conclusion that the planet is warming.
Because BEST splits records at breakpoints, they keep any bias in the trend that may arise from deteriorating conditions and discard the correction. The net result is probably the same as homogenization. I'd prefer to see them report a record with and without splitting records at undocumented breakpoints. By creating two records from one by splitting, they are discarding useful information.
Transparency would show the following for a select group of stations, grid cells, countries, and the world:
Daily temps
Raw monthly averages
Monthly anomalies
Grid cell anomalies and temperatures before homogenization
Grid cell anomalies and temperatures after homogenization
Is this transparency worth the effort? I don't know. Given all of the revelations about fake news during the last election and how it was spread among Trump supporters via social media, I'm skeptical.
Frank
Frank,
"No one ever sees raw GHCN data."
Well, they could. It's there. But of course they want to see a calculated average. It's true that the published ones are homogenised. That's why I do one unhomogenised. Anyone else could do that too. I do it just for diversity - I think homogenisation is right in principle, but in this case makes little difference in practice.
"Then we could see average temperature for November for Station X and the anomaly for that month."
Well, you can. I have a lookup map facility here. NOAA doesn't give easily accessible numbers, but they do give a graph page for each station, with and without adjustment, and the difference (example here). You need to know the station number to access it directly, but I have given a names portal here. Just click the center button ("GHCN Stations").
"I suspect the best answer is to show both the unhomogenized and homogenized results"
GHCN does. It's true that the indices are all homogenised, but again, I do both. The thing about homogenising - yes, you may make adjustments when there was no problem. But you can test whether that is introducing a bias, with synthetic data. In effect, you replace data which may have been biased with data that has the bias removed, but which carries some noise from spurious adjustments. But many readings go into the average. Unbiased noise is greatly attenuated - bias is not.
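A toy numerical illustration of that last point (synthetic numbers only, nothing to do with the actual GHCN adjustments): random, zero-mean adjustment noise largely cancels in a many-station average, while a common bias passes straight through.

set.seed(42)
nstations <- 1000
true_anom <- rnorm(nstations, mean = 0.5, sd = 0.3)       # "true" station anomalies

noisy  <- true_anom + rnorm(nstations, 0, 0.5)            # spurious adjustments, zero mean
biased <- true_anom + 0.2                                 # a systematic 0.2 C bias

round(c(truth        = mean(true_anom),
        noisy_adjust = mean(noisy),     # close to the truth: noise attenuates as 1/sqrt(n)
        biased_data  = mean(biased)),   # offset by the full 0.2 C: bias does not attenuate
      3)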
"By creating two records from one by splitting, that are discarding useful information. "
I agree with that.
I should also mention that NOAA lets you select stations on a form and see all sorts of data. I have to say that the NOAA site is clunky, and I currently don't have a link. But clunkiness does not mean a lack of transparency. You just have to find where to look.
Anonymous alias Frank on December 8, 2016 at 3:42 PM
Nick has already formulated a very good answer to Frank's comment. But there are some remarks I nevertheless miss in this answer, which is so typically smooth for him.
I'll start at the same place as Nick did:
1. No one ever sees raw GHCN data.
When looking at such sentences I really ask myself: how is it possible, in 2016, for somebody to make such a claim instead of simply searching for what (s)he thinks is not visible, let alone available, even to homo sapiens illiteratus?
Googling for "raw GHCN data" immediately gives you the most important link, to sources with information at any depth required:
- GHCN - National Climatic Data Center:
https://www.ncdc.noaa.gov/data-access/land-based-station-data/land-based-datasets/global-historical-climatology-network-ghcn
So here you are at the heart of the matter. And even if you have no knowledge about what you see, you intuitively understand that this is the best place to learn.
It doesn't take you very much time to land here, by clicking on no more than three links:
ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/v3/
You don't need to be a specialist in anything to understand the GHCNM-v3.2.0-FAQ.pdf and the README file, and to have a more and more specific look at the metadata and the data files it accurately describes, once you have managed to unzip the data:
- ghcnm.tavg.latest.qcu.tar.gz (unadjusted)
- ghcnm.tavg.latest.qca.tar.gz (adjusted)
And now you can see the differences between unadjusted and adjusted GHCN data - those differences about which most WUWT and other sites' commenters produce nonsense inversely proportional to their real knowledge.
2. Complete transparency would show the global average for all stations...
This is exactly what you can see, ad nauseam, by searching for country identifiers or station identifiers, or by isolating the data that interests you according to various criteria, e.g. latitude, longitude, name, environmental characteristics, etc., or any mix of them.
It is evident that having some accurate tools to search for information is important. Best is of course the UNIX/Linux Swiss-army-knife toolkit, which helps you, for example, to isolate in a few simple steps all stations above 80° N (there are three), or all "very rural" US stations within CONUS.
Exactly such a toolkit helps you obtain, for any subset of the GHCN dataset, the differences between unadjusted and adjusted data.
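As an R equivalent of that kind of filtering (an illustrative sketch, not the shell toolkit the comment refers to; the file name is made up and the column widths follow the GHCN v3 README), here is how one might pull the stations north of 80° N out of the metadata file that ships with the data:

inv <- read.fwf("ghcnm.tavg.v3.x.x.qcu.inv",
                widths = c(11, 1, 8, 1, 9, 1, 6, 1, 30),
                col.names = c("id", "s1", "lat", "s2", "lon", "s3", "elev", "s4", "name"),
                stringsAsFactors = FALSE, quote = "", comment.char = "")

arctic <- inv[inv$lat > 80, c("id", "lat", "lon", "name")]
arctic    # the comment above reports three such stations

The same read-and-subset step is the natural starting point for any unadjusted-versus-adjusted comparison over a chosen set of stations.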
But sometimes you first discover what "unadjusted" really means by processing the data you selected, using e.g. Excel or a similar tool. You suddenly see here and there tremendous anomalies resulting from outliers you otherwise certainly wouldn't have managed to discover. (Such tools moreover allow for a perfect graphical display of the extracted data, and even help you in computing trend estimates of any kind.)
And BTW you learn to trust a system which does not correct invalid readings by overwriting the values, but solely marks them as invalid.
Unfortunately, the "adjusted" data also contains corrections due to biases not originating from reading errors. But it is not homogenized in the sense of complex outlier reductions, like adapting readings to the altitude or to the environmental characteristics of the stations' neighborhood.
The difference between "adjusted" and "homogenized" is best seen by comparing the linear trend estimates for the two time series between 1880 and today:
- GHCN V3 adjusted: 2.29 °C / century (unadjusted: 2.14)
- GISS land only: 0.71 °C / century
And this is what the GISTEMP result looks like:
http://fs5.directupload.net/images/161210/a2fz4aw4.jpg
3. I conclude with the hope that Frank will soon be heavily busy requesting from e.g. UAH the same transparency he expects from surface measurements :-)
Thanks for taking the time to reply, Nick.
You wrote: "I think homogenisation is right in principle, but in this case makes little difference in practice."
This is exactly why scientists should publish both: If there is little difference, show both answers. If there is a modest difference (an increase of 0.2 K in 20th-century warming is my understanding), show that answer too. It is still warming. Recent disclosures about social media suggest reliable information is being overwhelmed by trash, but relying on experts saying "trust me" hasn't been working for a while. Transparency couldn't hurt.
Which of your records is homogenized and which isn't?
Homogenizing data is never appropriate when you don't know the cause of the inhomogeneity. You are making a hypothesis about the cause of the inhomogeneity - a sudden shift in measurement conditions at a station - and ignoring an equally likely hypothesis - that a gradual bias crept into observing conditions that was CORRECTED by maintenance. Without evidence, one never modifies data*. We have evidence about TOB, but most breakpoints aren't caused by TOB nor corrected by the method that is validated for US TOB. (TOB is a big deal in the US.)
* Suppose I were running a clinical trial for a new blood-pressure-lowering drug, and on day 10 all of the readings at one trial site averaged 2 psi higher than on day 9, while the other sites showed no major change between day 9 and day 10. Someone obviously must have started using a new instrument to measure blood pressure at that site on day 10, and all the readings after day 9 at that site should be lowered by 2 psi. Right? Try submitting that corrected data to statisticians at the FDA! At best, they might let you show the data analyzed with and without correction. However, if your conclusion about efficacy depends on a correction that you can't PROVE is justified, your drug probably won't get approved. If you want to publish, the abstract needs to include both possible analyses.
Frank
"Homogenizing data is never appropriate when you don't know the cause of the inhomogeneity. "
I think GHCN do confuse the issue by releasing a file showing altered records by station. It's a convenient way of recording the changes. But in fact homogenisation is a step on the way to compiling an index, which is a spatial integral. In that average, a station is used as a representative data point for a region. You assign its value to the region, multiply by area and add.
In homogenising, you say that the station value has behaviour that you think makes it not a good representative of the region. Basically, the arithmetic says that for the duration of that period, you'll prefer data from other nearby stations to estimate that region. People think that NOAA is asserting that the value at that location was really something else. But really, it is about the region. That is why blood-pressure analogies, for example, aren't right.
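A schematic R sketch of that "spatial integral" arithmetic (made-up grid values; this is not TempLS or the NOAA code): each cell anomaly stands in for its region, weighted by relative area, which on a latitude-longitude grid is proportional to the cosine of latitude.

set.seed(0)
lat_centres <- seq(-87.5, 87.5, by = 5)                   # 5-degree latitude bands
lon_centres <- seq(-177.5, 177.5, by = 5)
grid <- outer(lat_centres, lon_centres,                   # made-up cell anomalies
              function(lat, lon) 0.8 + 0.01 * lat + rnorm(length(lat), 0, 0.2))

w <- matrix(cos(lat_centres * pi / 180),                  # relative cell area by latitude
            nrow = length(lat_centres), ncol = length(lon_centres))

sum(grid * w) / sum(w)                                    # area-weighted global mean anomaly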
It's true that there is a possibility that the process will have a bias in recognising sudden drops while missing a gradual rise. I think that is what happened in the ballyhooed Reykjavik case. They can do some tests for that, provided they have a systematic algorithm, as they do. The effect can go both ways.
In this context, the obligation on scientists is to provide their best estimate. Legalistic rules like you propound conflict with that. I do sometimes wonder why they don't provide both adjusted and unadjusted indices, as I do. But I think they are right. The unadjusted wouldn't be their best estimate.
Homogenizing data is never appropriate when you don't know the cause of the inhomogeneity.
Frank, you should let the astronomy and the electron microscopy and X-ray crystallography communities know about this. We've clearly been doing it wrong. In the case of crystallography some Nobel prizes will need to be returned, including the recent one for the structure of the Ribosome.
Kevin: I found what you wrote here:
http://www-users.york.ac.uk/~kdc3/papers/homogenization2015/review1.pdf
I do know a little bit about X-ray crystallography and worked with antibiotic binding to the ribosome. The final product from an X-ray crystallography study is a structure - a set of coordinates for atoms in a molecule. That structure can be used to predict the diffraction pattern that should have been observed. In other words, the guesses that were made during homogenization can be validated - unlike temperature homogenization. As I understand it, the process of solving a protein crystal structure is partly a matter of trial and error in building a structure that fills the electron density map extracted from the diffraction data. The agreement between the predicted and observed diffraction patterns confirms that the guesses that were made along the way were correct. (And X-ray crystal structures of proteins occasionally do contain some mistakes, because it is impossible to test all alternative bonding arrangements to see which one fits the data best.)
Therefore I'm not sure there is a good analogy between the use of data homogenization in protein structure determination and climate science.
Frank
Kevin and Nick: Let's take a set of stations whose data has been homogenized so that they contain typical noise but no breakpoints, and then detrend each station so it has no long-term trend. Let's call this "the true trend". Now let's assume that the albedo of all stations gradually drops with time due to the accumulation of dirt. This adds a gradual upward bias to all station readings of 1-2 K/century (normally distributed with an average of 1.5 K/century). Washing the exterior of the station (maintenance) removes the bias and restores "normal" measurement conditions (the truth). About half of the stations are randomly washed an average of once every 5 years and the other half are randomly washed once every 3 decades. Now what happens if you homogenize the data again to remove breakpoints caused by maintenance? Is the trend still zero?
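Here is a minimal R simulation of roughly that scenario for a single station (synthetic data; no real pairwise homogenization algorithm is run - the breakpoint "correction" is idealised as detecting and removing every maintenance step perfectly, which is an assumption, not a claim about PHA). The true trend is zero, soiling adds 1.5 C/century of drift, and a wash every 5 years resets the bias.

set.seed(7)
years  <- seq(1900, 2000, by = 1/12)                    # monthly time axis
drift  <- 1.5 / 100                                     # C per year of soiling bias
washes <- seq(1900, 2000, by = 5)                       # maintenance dates
seg    <- findInterval(years, washes)                   # which inter-wash segment

bias <- (years - washes[seg]) * drift                   # sawtooth bias, reset at each wash
raw  <- rnorm(length(years), 0, 0.3) + bias             # recorded series (true trend = 0)

# Idealised adjustment: every downward step at a wash is treated as spurious
# and all later data are shifted up to remove it.
step     <- 5 * drift
adjusted <- raw + (seg - 1) * step

c(raw_trend_per_century      = unname(coef(lm(raw ~ years))[2]) * 100,       # close to zero
  adjusted_trend_per_century = unname(coef(lm(adjusted ~ years))[2]) * 100)  # close to 1.5

Under that idealisation the answer to the question is no: the adjusted record inherits the full drift. Whether a real algorithm would actually flag steps this small relative to the noise is a separate question.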
I've read that algorithms are finding as many as one breakpoint per decade in many station records. This suggests to me that station moves and equipment changes may not be the cause of many breakpoints. Perhaps maintenance can be. I can imagine a variety of maintenance tasks that might restore original measuring conditions and cause breakpoints that shouldn't be corrected: deteriorating screen albedo; deteriorating screen ventilation caused by accumulating debris; encroaching shadows from growing trees; encroaching urbanization followed by a move to a site very similar to the original site. Unfortunately, it is probably difficult, if not impossible, to detect the difference between a breakpoint caused by a sudden and permanent shift and a breakpoint caused by a gradual bias followed by correction of that bias.
Frank
Just to jump in here: I'm one of the pioneers of inferring nanostructure from stochastic and/or deterministic patterns in diffraction data. For example, my algorithms are being used at the Argonne labs X-ray server http://x-server.gmca.aps.anl.gov/TRDS_sl.html
Since I have been doing this kind of forensic work my entire research career, I started working on picking out the patterns behind QBO and ENSO and presented those results yesterday at the AGU meeting.
http://contextearth.com/2016/11/21/presentation-at-agu-2016-on-december-12/
It went pretty well, and it's really only a matter of time before these ideas get picked up by the larger community.
All this discussion of homogenizing and infilling data seems pretty milquetoast compared to the real progress in climate science that you guys could be making. And that goes for Frank especially -- it's really pathetic that he thinks his punching-down criticisms matter at all.
IMO, the key is to compensate for the fluctuations in the temperature data due to ENSO etc., and then go from there. I just don't understand why you don't agree that this is the path to follow.
Yet I also agree with Kevin that these crystallographers are worthy. The reconstruction work done by the Cornell group is amazing http://uuuuuu.lassp.cornell.edu/
Crystallographers don't think like other people. They can reason in reciprocal space which is a huge advantage in studying periodic and quasi-periodic data.
Web - congratulations on your presentation.
Yes, congratulations from me too. I'm reading the doc you linked to. Is there a poster?
One question - Is it possible that ENSO is impacting the earth's wobble through ocean and atmosphere motions and not vice-versa? Googling I see an old NASA pub suggesting this.
DeleteChubbs
"One question - Is it possible that ENSO is impacting the earth's wobble through ocean and atmosphere motions and not vice-versa? Googling I see an old NASA pub suggesting this."
Chubbs, You are absolutely right. Richard Gross at NASA JPL suggested this several years ago.
http://www.jpl.nasa.gov/releases/2000/chandlerwobble.html
The thing we have to remember is that Newton's First Law is ultimately at work here. The wobble may be a cooperative phenomenon, in that the (mainly) solid earth responds in one way while the fluidic oceans can compensate in another way. It doesn't have to be entirely (1) ocean sloshing causing the wobble or (2) the wobble causing the sloshing, but rather some balanced mixture. The sole requirement is that the overall angular momentum of the system has to be conserved.
Now, what hasn't been acknowledged much in the literature is the fact that the Chandler wobble frequency happens to be very close to a seasonally aliased frequency of the draconic (nodal) lunar tide. If that is truly the case, the argument is much simpler, because the moon becomes the pacemaker for both the Chandler wobble and the ENSO sloshing (and of course QBO).
Read this discussion here on my blog
The Chandler wobble then is no longer a resonant phenomenon associated with a free nutation of a non-solid and non-spherical earth, but a response function to a periodic driver described by the combined lunisolar orbit. Needless to say, this is a significant change of thinking from the consensus. You can find scores of papers trying to deduce the 433-day Chandler wobble from estimates of the dynamics of the molten state within the earth's interior, but virtually nothing based on the moon and sun, except for a recent unpublished piece by NASA's Robert Grumbine (who replaces the lunar influence with a planetary influence). I noticed that there are also some AGW deniers who have noticed this connection, which is quite quaint, imho.
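The arithmetic can be checked in a couple of lines of R (taking the fortnightly draconic tide and its beat against 26 cycles per year - the nearest annual harmonic - as the relevant alias; that choice is an assumption about which alias is meant):

draconic_month <- 27.21222                    # days
year           <- 365.24219                   # days

f_fortnightly <- year / (draconic_month / 2)  # about 26.84 cycles per year
f_alias       <- f_fortnightly - 26           # residual against 26 cycles per year
year / f_alias                                # about 433 days, close to the Chandler period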
Thanks
"Is there a poster?"
Didn't get a poster, only a 15-minute PowerPoint presentation. Here is a condensed 1-minute YouTube animation of the presentation.
Hi Nick,
My question seems to have disappeared. Glitch? Should I try again?
Peter,
Yes. I get email notification of comments, even spam, but nothing has arrived. Do try again. Obviously one comment got through.
Hello Peter Green,
I have a similar problem when trying to send comments to Nick in Firefox: you type quite a lot in, select something under "Reply as", click on "Publish" and... nothing happens.
Thus to communicate with moyhu I use Chrome and everything goes well.
The same happens with Nick's Globe viewer:
https://s3-us-west-1.amazonaws.com/www.moyhu.org/maps/webgl/grid.html
Firefox displays nothing, Chrome does.
I guess it has to do with one or more of my ad and spam blockers integrated as add-ons in Firefox.
"I use Chrome and everything goes well"
Probably a safe choice, since Blogger is Google software (one could dream up a conspiracy theory).
OK, I am using Chrome, but most likely I did something silly (like inattentively clicking the button on the bottom right which looks like a submit button but is actually a sign-out button).
Anyway, the question related specifically to processing the files from BoM, in relation to possible missing records, or duplicated or additional records: what mechanisms or formulae do you use for infilling or otherwise dealing with any missing records, and how do you deal with additional or duplicated records (if you do)? I have looked at your R code (thanks) but do not know enough R to be able to answer that question (I am much more conversant with Perl).
The typical AWS files look to have one record every half hour.
Peter
Nick: When discussing Sheldon's trend viewer (with you and Sheldon) at WUWT, I spent a lot of time using your trend viewer. I've always been concerned that different colors for trends don't mean that a statistically significant difference between trends exists, leading users to over-interpret differences in trend. And it is a great tool for cherry-picking. Nevertheless, I found it partially useful for what I thought was an extremely important question: Has the rate of warming slowed down since 1998 or 2001, or between 2001 and 2013? So I tried to use it for this.
The starting point that produces the greatest long-term warming is about 1975, and the warming rate since then is currently 0.18 K/decade (0.16-0.20) for HadCRUT global. So, if I am looking for a slowdown or speedup in warming within this period, I can make use of the trend viewer to find the lowest warming rate (locally) and its confidence interval. The appropriate next step would be to see if these differences in trend were meaningful, by using the standard formula for the statistical significance of the difference in two means (given their standard deviations). Then I'd like to select or highlight or code the regions of the triangle where the trend is significantly different from the overall trend for the triangle. That would tell us for what periods the Pause was and was not statistically significantly different from the past six decades. That doesn't appear to be very often.
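A hedged sketch of that test in R (illustrative synthetic series, not HadCRUT numbers): treat the two OLS slopes as approximately normal with their standard errors and form a z statistic. For real monthly temperature data the standard errors should be inflated for autocorrelation (e.g. an AR(1) correction), and a sub-period is not independent of the full period, so this is only indicative.

trend_with_se <- function(y, t) {
  fit <- summary(lm(y ~ t))$coefficients
  c(slope = fit["t", "Estimate"], se = fit["t", "Std. Error"])
}

compare_trends <- function(a, b) {                  # a, b: outputs of trend_with_se
  z <- (a["slope"] - b["slope"]) / sqrt(a["se"]^2 + b["se"]^2)
  c(z = unname(z), p = unname(2 * pnorm(-abs(z))))
}

set.seed(3)
t_long  <- seq(1975, 2016, by = 1/12)
y_long  <- 0.018 * (t_long - 1975) + rnorm(length(t_long), 0, 0.1)
t_pause <- seq(2001, 2013, by = 1/12)
y_pause <- 0.002 * (t_pause - 2001) + rnorm(length(t_pause), 0, 0.1)
compare_trends(trend_with_se(y_long, t_long), trend_with_se(y_pause, t_pause))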
However, I didn't want to stop there. If you are using a 95% confidence interval, one expects to see about 5% of the trends be significantly different from the overall trend by chance. So I'd like to know what fraction of the triangle of trends is statistically significantly different from the overall trend. (It might be interesting to be able to choose your confidence level. The Pause may be significant at 0.01 or even lower.)
In any case, this might be an interesting way to address the significance of The Pause. My level of skepticism about it was enhanced by my amateur efforts with your trend viewer. The 2015/16 El Nino raised the trend since 2001 from 0.02 K/decade to about 0.12 K/decade. There are some periods beginning in 2001 with negative trends, but I focus on the upper confidence interval for low trends, which gets down to 0.06 K/decade if you cherry-pick and below 0.10 K/decade over a reasonable area. However, that is exactly what one expects for normally distributed data - about 5% of the area significantly different at 0.05.
I hope this makes some sense. Frank
I think GHCN do confuse the issue by releasing a file showing altered records by station. It's a convenient way of recording the changes. But in fact homogenisation is a step on the way to compiling an index, which is a spatial integral. In that average, a station is used as a representative data point for a region. You assign its value to the region, multiply by area and add.
In homogenising, you say that the station value has behaviour that you think makes it not a good representative of the region.
Nick, this is correct.
On the page
https://www.ncdc.noaa.gov/ghcnm/v3.php?section=homogeneity_adjustment
we clearly can see a twofold, really confusing use of the word 'homogenization'.
But we should not forget how small the difference nevertheless is between GHCN's unadjusted and adjusted data.
You explained that years ago (in 2012!), and I recently tried to do a similar job to yours, by computing, from the two datasets, the linear trend for each of the 7,280 stations having contributed to the data, and the trend differences.
The mean of these trend differences (adjusted minus unadjusted) is no more than 0.04 °C / decade.
And it would be far lower if we eliminated all the nonsense data produced by stations like Tocumen (Panama) or Elliott (Australia) during only about ten years of activity, most of it dropped in the adjusted record.
And the average trends computed over all stations for the period 1880-2016 show as follows:
- unadjusted: 0.214 °C / decade;
- adjusted: 0.229 °C / decade.
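A hedged R sketch of that per-station comparison (file names are illustrative; the fixed-width layout follows the GHCN v3 README; annual means here are plain means of the available months, and no area weighting is done, so like the numbers above this is an average over stations, not an index):

read_ghcn <- function(path) {
  widths <- c(11, 4, 4, rep(c(5, 1, 1, 1), 12))
  d <- read.fwf(path, widths = widths, stringsAsFactors = FALSE,
                quote = "", comment.char = "")
  vals <- as.matrix(d[, seq(4, 48, by = 4)])                 # the 12 monthly values
  vals[vals == -9999] <- NA
  data.frame(id = d[[1]], year = d[[2]], tann = rowMeans(vals / 100, na.rm = TRUE))
}

station_trends <- function(d) {                              # OLS trend per station, C/decade
  sapply(split(d, d$id), function(s)
    if (sum(!is.na(s$tann)) > 30) unname(coef(lm(tann ~ year, data = s))[2]) * 10 else NA)
}

qcu <- station_trends(read_ghcn("ghcnm.tavg.v3.x.x.qcu.dat"))   # unadjusted
qca <- station_trends(read_ghcn("ghcnm.tavg.v3.x.x.qca.dat"))   # adjusted
common <- intersect(names(qcu), names(qca))
mean(qca[common] - qcu[common], na.rm = TRUE)    # the comment above reports about 0.04 C/decade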
Thus, despite the legitimate critique of performing spatial homogenization within a set of single stations, the difference between the unadjusted and adjusted records remains incredibly small when compared with further homogenization steps, e.g. at GISS, which for the same 1880-2016 period gives a trend of:
- GISTEMP: 0.071 °C / decade.
And this latter trend shows how meaningless some criticisms of homogenization are anyway, as it in fact shows a dramatic reduction in comparison with the GHCN trends.