moyhu: Coverage, Hadcrut 4 and trends

Monday, November 18, 2013

Coverage, Hadcrut 4 and trends

This post is inspired by the new paper by Kevin Cowtan and Robert Way in QJRoyMetSoc, which I wrote about in my previous post. The key issue that they identified was bias due to lack of coverage in regions that were warming rapidly, specifically near the poles.

HADCRUT gathers temperature anomalies each month in a 5°x 5° lat/lon grid. Where gridcells have no data, they are omitted. As I said then, this is not a neutral decision. Whenever you average over a continuum with fixed divisions and have missing values which you omit, that is equivalent to replacing those points by the average of the data you have. That is often not a good choice, and if there is anyway of better estimating the missing values, it should be used. I did my own analysis of coverage here and here.

C&W use a quite elaborate scheme for deriving those infills, involving satellite data and kriging. I wondered how much could be achieved by a very simple improvement. The main bias is believed to be latitude-based; specifically, that polar regions behave differently to the rest. So I sought to replace the missing cells by a latitude band average rather than global. I'm not using kriging or satellite data.

I think this is useful because the new paper has been greeted as a "pausebuster" because it shows a much less reduced trend in recent years. So I'm focussing on the 16 year global trend since Jan 1997 (to end 2012), also treated in C&W. I think a simple demonstration of the coverage correction would reinforce C&W's much more thorough and elaborate treatment.

Coverage and latitude averages

This image from the C&W site gives an idea of the coverage issue. There is a lot of missing data outside the polar regions, but it is not clear whether that biases the trend. But the polar regions are warming rapidly, and to in effect treat the missing cells as global average does create a bias.

I formed a latitude average for each month for a 5° band using the following weighting rules. Cells in that band with data have weight 1. Cells in adjacent bands have weight r, where r is typically 0.1-0.2. Cells in the band adjacent to that have weight r^2. Others are not used.

The point of this is that where there is good coverage, the average will be close to the band average. But if the central band has few data cells, the adjacent band cells, though downweighted, will be more significant by their numbers, avoiding the high variance that would come from relying on just the few cells in the central band. And if both those bands have few entries, then the third level comes into play. This is really only relevant to the N pole band, where the two bands above 80° are sparse.

I then simply infill missing data for each month with the latitude band average value, and compute trends for the resulting complete set.

I expect the result to vary little with r - this will be shown.

Results

The trend over the period 1997-2012, in °C/decade was:

HAD 4 cited C&W	0.046
HAD 4 with global average infill	0.0539
HAD 4 with lat av infill r=0.05	0.0854
HAD 4 with lat av infill r=0.1	0.0846
HAD 4 with lat av infill r=0.2	0.0821
GISS cited by C&W	0.080
C&W hybrid	0.1187

So this simple infill almost doubles the trend, but does not go as far as the C&W hybrid method. It is, however, close to GISS, which interpolates to avoid missing cells.

The graph by latitude band is

Here is a graph to show the small variations with different r (parameter for spreading estimate of latitude band average)

Conclusion

This shows that the trend is indeed biased by coverage. Using a latitude average estimate to replace missing values is at least as justifiable as the default global average. No special interpolation techniques are used, nor any alternative datasets. The change is substantial, though not as complete as C&W. However, the plot of trend by latitude bands is quite similar to the hybrid method.

6 comments:

sunobaNovember 19, 2013 at 9:45 AM
Your simple but judicious modelling contributes significantly to understanding and a better portrayal of reality. Nice work Nick!
ReplyDelete
Replies
AnonymousNovember 20, 2013 at 3:01 AM
I'm guessing part of the remaining difference comes from the rebaselining. Using a baseline which is distant in time and norms from the trend period means that changes in coverage have a bigger effect on trends. Couple this with the fact that over the trend period HadCRUT4 land coverage declines and SST coverage increases. It works a bit like a product rule in calculus, with the terms being the contrast between the observed and unobserved regions, and the coverage mask. Both can change the bias.

Another way to address this would be latitude averages of the land and sea data separately - I haven't tried this, but it might shed some light one way or the other. More generally I haven't come up with a good way of attributing causes of bias in the case of changing coverage.

Kevin
ReplyDelete
Replies
AnonymousNovember 22, 2013 at 8:35 AM
OK, I've now got some evidence that the increase in trend on rebaselining comes from reducing the impact of declining land and increasing SST coverage. (We originally had to rebaseline to match the satellite data.)

Using the blended data, rebaselining increases the trend by nearly 0.02C/decade, and this increase survives through kriging. On the downside it means throwing away some observations because they don't have enough coverage on 1981-2010.

However, if I do separate reconstructions on the land and ocean data and then blend, the effect goes away and the results are similar whether or not you rebaseline. That is what you would expect if the rebaselining is mitigating the bias due to a shift in the land/ocean coverage balance. This is a strong additional justification for reconstructing the unblended data.

This should be trivial to do with your method too - with the exception that you then need to make an explicit decision about how to treat sea ice when you blend.

Kevin
ReplyDelete
Replies

Add comment

An interactive topic index for all Moyhu posts.
Latest Ice and Temperature data
Climate Data Portals
A gallery of Javascript-enhanced graphics
Temperature trend viewer
Google Maps and GHCN
WebGL map of past GHCN/SST station temperatures
WebGL map of GHCN/SST station temperature trends
HiRes NOAA OI SST with WebGL and Movie
Regional Hi-Res SST movies
WebGL Facility
TempLS Guide
More pages, and blog glossary

moyhu

Monday, November 18, 2013

Coverage, Hadcrut 4 and trends

Coverage, Hadcrut 4 and trends

Coverage and latitude averages

Results

Conclusion

6 comments:

Maintained Pages

Search This Blog

Recent Comments

Blogroll

Blog Archive

Translate

Resources

About Me

moyhu

Monday, November 18, 2013

Coverage, Hadcrut 4 and trends

Coverage, Hadcrut 4 and trends

Coverage and latitude averages

Results

Conclusion

6 comments:

Maintained Pages

Search This Blog

Recent Comments

Blogroll

Subscribe To

Blog Archive

Translate

Resources

About Me