moyhu: Why homogenise data?

Friday, February 6, 2015

Why homogenise data?

I've been writing a lot lately (eg here and here) about homogenisation. The main message is that adjustment has relatively small effects and is not, as some say, the arithmetic basis of AGW. But sometimes the answer is, well, if it has little effect, why do it? So I thought to make some general remarks on homogenisation and adjustment.

Firstly, some common mis-characterisations. It isn't "altering the record". The true record sits with the national met offices, but organisations like NOAA have online unadjusted records, which are as accessible as the adjusted.

Secondly, records aren't adjusted in the belief that thermometers were read incorrectly. They are usually adjusted specifically for the calculation of a regional average. Station records are used as representative of a region. When something happens that is not climate-based, then it isn't representative of the region.

Let's look at a specific case. In 1928, the station for Wellington, NZ, was moved from Thorndon, at sea level, to Kelburn, at about 100m altitude. The temperature dropped by nearly 1°C, and the NOAA algorithm picked this up and reduced the pre-1928 readings.

Now the Thorndon readings aren't "wrong". What is wrong is the 1°C drop. There is no reason to believe that the region experienced that. So the adjustment is to make the record consistent ("homogeneous"). The convention is to leave the present unchanged and adjust the past. It doesn't matter which.

Averaging

There is a lot of pixels wasted on some alleged mis-adjustment of some individual stations. That misses the big picture. When you calculate a global annual average, you take over a million readings to get a single number. The averaging drastically reduces the noise. White noise would reduce by a factor of 1000. There are various dependences, but still, the reduction is huge. Noise isn't much of a problem. What is a problem is bias. That doesn't reduce.

But there is a counter to bias too - the use of anomalies. Consistent bias goes out with the mean. So the remaining problem is inconsistent bias. Inhomogenieties.

We saw this with TOBS in the US. There is a bias depending on which time of day you read. And there is no "correct" time. The bias doesn't matter unless the time of observation changes. And still, for trend, that wouldn't matter unless there was a bias in the direction of changes. Otherwise they would cancel. But there is a bias. In the US, volunteers used to ne enjoined to observe in late afternoon. That weakened, and morning obs became popular. That created a cooling bias.

Correction introduces noise

So NOAA and others try to recognise inhomogenieties and correct them. Sometimes that goes wrong. A real change is wrongly corrected. Or a real but temporary change gets corrected but by a wrong amount, or not fixed when the cause goes away.

But this comes back to the tradeoff between noise and bias. Correcting bias is important. If you create noise in the process, that may be acceptable. And a fixed algorithm can be tested with synthetic data to see if it introduces bias.

Resisting ad-hoccery.

Most of the fusses about adjustment are totally spurious. "Look, they are altering the record! data must not be touched!". But sometimes a definite error shows up. There may have been one at Reykjavik. Part of a temperature dip was wrongly considered an inhomogeneity. So should it be fixed?

No! As noted above, a good algorithm has been tested for lack of bias. If you start intervening, it loses that property. Noise won't hurt the average, but taking out the bits that displease naysayers certainly will.

So does it matter?

In TempLS, I use unadjusted data. I satisfied myself a while ago that there was little difference in outcome. But I could only do that because someone had identified and corrected the inhomogeneities, to give me something to check against. And it may just be good luck that this time the inhomogeneities cancelled.

Naysayers bang the drum about adjustments. But if none were done, then I bet we'd be hearing stories about how some station was moved, and ignored, so now the record is unreliable.

Appendix

Beset by people who couldn't shed the idea that data should only be adjusted if fault is proved, I gave this analogy. Here is a table of BHP share prices. Note the final column - price adjusted for dividends and splits. It's not that the old prices were defective; it's just that dividends and splits do not alter the productivity of the company, or to what you get in total from your shareholding. If you want to make historic sense of the raw prices, you have to keep making allowances. But the adjusted prices give a continuous picture of what a holding is worth.
If you were compiling a stock index (eg Dow, cf GISS) that is what you would use.

23 comments:

AnonymousFebruary 7, 2015 at 9:19 AM
Hello Nick
You say: "Station records are used as representative of a region"

What if, at some point of the calculation of an average temperature, regions are used to infer station temperatures? How would that affect your thinking? Do you think such a step should not occur in the calculation?

You then say: "Let's look at a specific case. In 1928, the station for Wellington, NZ, was moved from Thorndon, at sea level, to Kelburn, at about 100m altitude. The temperature dropped by nearly 1°C, and the NOAA algorithm picked this up and reduced the pre-1928 readings."

That's good. But what if algorithms pick up 1C or ~3C drops in temperature and make local station changes, but without the corresponding station history to back up algorithm-detected drops? What if, following this, the next step in the algorithm involves saying 'well, since there is a drop that looks like it could have come from a move, it's a move' and makes corresponding changes?

How would you feel about it, methodologically speaking?

ReplyDelete
Replies
AnonymousFebruary 7, 2015 at 12:14 PM
There are two points. First, if stations are used to calculate average and averages are used to calculate stations, it is clear there is non-independent computation.

1) Why use non-independent, i.e., circular analysis? 'The answer doesn't change' is not a good answer, and there is a possibility it is a wrong answer.

The problems do not stop there. For example, in Paraguay, almost all stations are adjusted, presumably using neighbouring non-Paraguayan stations. Which means, the data represented as Paraguayan temperature does not represent Paraguay.

Why create information out of non-information?

Secondly, changes made to local station record, en-route to calculation of a global average *have to be compatible with the actual local history of the place AND the global average*. Why is this not followed? Again, the 'answer doesn't change' logic doesn't work. If you go to the BEST website, they advertise their product for local stations. "Did you know? Berkeley Earth gives you historical temperature data for your home town, state, and country" - they say.

Except we know that your home town is likely an alteree who has been estimated by its neighbours.

Are alternative methods not possible? Of course they are. Take SA. Can 'true' temperatures be calculated for the larger geographic region of Eastern South America, alternatively, excluding Paraguayan temperatures? Yes. But the error margins would be wider. Is this correct?
ReplyDelete
Replies
CarrickFebruary 7, 2015 at 10:54 PM
Nick, do I understand you correctly that your code doesn't perform any sort of homogenization?

By the way, that['s a very nice explanation of the trade off between measurement bias and measurement noise. I'd add that once the measurement noise is sufficiently large compared to the bias, there's no real advantage to correcting bias any further. I've encountered that trade off in localization algorithms;

Also, homogenization algorithms can reduce the spatial resolution of the reconstructed temperature field (we saw this with BEST for example).

I know you've been discussing the global mean signal, defined as an integral over the surface temperature field, but with recent interest in regional scale climate change, having a higher spatial resolution might be more important than bias free measurements of trend. When we see large deviations from the global mean value in e.g. the Arctic, it might be more important to not smear out this variation in order to achieve a minor increase in accurate of the long term trend.
ReplyDelete
Replies
AnonymousFebruary 7, 2015 at 11:34 PM
"It's certainly not circular."

If local stations are adjusted to match their neighbours - with the evidence supporting the need for adjustment being that local stations don't match their neighbours - the argument is circular. There is no question about circularity itself. My question is why such methods are preferred. After all, you could leave out such stations and yet derive a regional or global average.

"But "averages are used to calculate stations" suggests more than what happens..."

Take Puerto Casado, a now-familiar example. BEST turns the local C/century trend from -0.89 to +1.36. BEST knows nothing about the station's field history. At best the altitude change is about ~10 meters. Clearly, more rather than less has been done to the station's record.

"I don't know where that rule came from or what it means."

It's quite simple. When you makes adjustments to a station to 'correct' for something that is not climate-based and isn't representative of the region. the product needs to be representative of the station too.

Otherwise you just use the existence of a station to synthesize a record that has no connection to the station. Taken another way, you have done more to a station than you declare.

"Maybe it hurts Paraguayan national pride, but borders mean nothing for a global average"

Paraguay is not a large country. But it is not a small one either. If all stations in Paraguay needed adjustment based on rural stations from neighbouring countries - which I presume you would agree are no better developed than the Paraguayan towns themselves - the magnitude of adjustments is over a large region and not confined to a 'few local stations'.

What is the impact of synthesized temperature records over such large regions on the error in individual years? Do we know?
ReplyDelete
Replies
CarrickFebruary 8, 2015 at 4:24 AM
nigguraths, if you use Metadata as the basis homogenization corrections, then it certainly isn't circular.

You are right if they use regionalize data series to spot and correct other series, there is a circularity issue. It will produce a spatial smearing. The bias introduced by this smearing is probably minor on the global mean temperature but as I pointed out above, when researchers are trying to lean on the regional scale to be accurate, this is still a problem for the series.

Put another way, if the only measurement that you think is valid from your series is the global value, that is all you should publish. You certainly shouldn't publish the individual cities reconstructed temperature, if that is expected to actually be meaningless (due to smearing).

Using Nick's trend plotter, for example, we can see there is a clear spatial smoothing in the BEST approach.

Forget about Paraguay, the entire continent of South America virtually gets spatially homogenized to the same temperature.

Your question about the impact of this homogenization on global mean temperature is I think a valid issue.

If you want to look for an effect of homogenization, see if the temperature for BEST is running hotter than other series. We can also compared GISS 1200-km to GISS 250-km smoothing.

Comparing series with only meta data corrections to ones that internally data mine would be useful (what Zeke alluded to), but of course that won't tell you very much if the homogenization algorithm is guilty over smoothing for other reasons.

If I get a chance today, I'll perform an analysis on some of these series and see what I can come up with.
ReplyDelete
Replies
CarrickFebruary 9, 2015 at 9:00 PM
A couple of images showed up on twitter. I'm taking the liberty to repost them here:

Effect of homogenization on global temperature index

The impact of homogenization on the recent global temperature index seems minimal.

Effect of homogenization on spatial resolution.

No breakpoint adjustments and only metadata adjustments look to have similar spatial patterns. Adding empirical seems to produce substantial smearing.
ReplyDelete
Replies
jeffMarch 10, 2020 at 1:57 PM
here's a concept leave the records alone until it is absolutely necessary .Not believing a reading is not a prerequisite to change it
ReplyDelete
Replies

Add comment

An interactive topic index for all Moyhu posts.
Latest Ice and Temperature data
Climate Data Portals
A gallery of Javascript-enhanced graphics
Temperature trend viewer
Google Maps and GHCN
WebGL map of past GHCN/SST station temperatures
WebGL map of GHCN/SST station temperature trends
HiRes NOAA OI SST with WebGL and Movie
Regional Hi-Res SST movies
WebGL Facility
TempLS Guide
More pages, and blog glossary

moyhu

Friday, February 6, 2015

Why homogenise data?

Why homogenise data?

Averaging

Correction introduces noise

Resisting ad-hoccery.

So does it matter?

Appendix

23 comments:

Search This Blog

Maintained Pages

Recent Comments

Blogroll

Blog Archive

Translate

Resources

About Me

moyhu

Friday, February 6, 2015

Why homogenise data?

Why homogenise data?

Averaging

Correction introduces noise

Resisting ad-hoccery.

So does it matter?

Appendix

23 comments:

Search This Blog

Maintained Pages

Recent Comments

Blogroll

Subscribe To

Blog Archive

Translate

Resources

About Me