moyhu: Adjusting Amberley

Tuesday, August 26, 2014

Adjusting Amberley - as it must be

In my last post, I commented briefly on a kerfuffle about adjustments at Amberley. An issue was being made, at WUWT and elsewhere, of the fact that the minimum had been adjusted so that a small cooling trend turned into quite a large warming trend. This made it into the Australian, and the BOM was prressed for an explanation. They pointed out, as I noted there, that there was a large change in 1980, with no associated metadata, which was presumably due to a move within the site.

Now WUWT, following JoNova, is pouring scorn on BoM, saying basically that they are making it up, since they don't have a record. But there is a very good reason why they don't have a record. It wasn't then a BoM site. It was Air Force, and they get the records from them. And the RAAF has its own priorities.

However, the need for the change, and the amount, is obvious if you just look at neighboring stations, and the BoM program did. I'll show this below the jump.

BoM has all the unadjusted data you need, starting on this page. Ask for the kind of data (monthly, mean min etc), with Amberley in the matching towns. Under Nearest Bureau stations, unset the "only show open" button, and it gives the nearest stations with that data. I want stations with data from 1975 to 1985; enough to see what is happening around 1980. The nearest with this data are Ipswich, Samford and Brisbane. I didn't include Mt Glorious because it is on a mountain top; the others are pretty much on the level.

I then subtracted the monthly means for that decade, to remove seasonality. So here is the graph:

It shows the fitted lines, with trends over the decade shown in the legend. Note that the red Amberley curve is pretty much above everything before 1980, and below soon after. The other stations track each other well. Here is tabulated the trends in °C/century. Notice that one of these is not like the others.

Amberley	-15.56
Ipswich	2.23
Samford	5.85
Brisbane RO	2.31

Update. To show the contrast more clearly, and following a suggestion from Victor Venema, here is a plot of the same data, but with the Ipswich values subtracted - ie relative to Ipswich (chosen because it is by far the closest to Amberley):

The fact that Amberley crosses the axis during 1980 is now much clearer.So I added an adjustment of 1.4°C after Jan 1980. Let's see what it looks like now:

Interesting. Amberley is now back tracking the others, except for some disturbance during 1980. And the slope matches the others. But it's irregular during 1980. That means I haven't guessed the date quite right. I've adjusted upward too early. About August 1980 looks right. Here we go:

Yes, that did it. Now tracking the three neighbors very well. And yes, the slope has gone from -15.6 to 3.51, right in the mid-range of the neighbors.

Where did I get 1.4°C? It's a linear calc - to bring the slope back to mid-range. How does this affect the whole trend for Amberley? It increases the trend by 2.86 °C/Cen. Apparently the BoM adjustment was 3.5°C/cen. I'm sure they looked at more stations.

So there you go. The change happened in August 1980, and it dropped temperatures artificially by ~~2.8~~ 1.4°C.

57 comments:

DavidRAugust 26, 2014 at 10:45 PM
Nick,

How do I obtain the 'raw' data from the BOM page. The link I get seems to be to the adjusted data.

Thanks.
ReplyDelete
Replies
AnonymousAugust 26, 2014 at 11:39 PM
how did adding 1.4C August 1980+ decrease the values pre August 1980? Looking at the value in your chart for Amberley raw data the first dip appears to be approx -1C and in the last two graphs it appears to be closer to -1.6C
ReplyDelete
Replies
AnonymousAugust 27, 2014 at 12:17 AM
How do these adjustments take into account any micro-climate differences? It would seem these differences are lost when the data is Homogenised.
ReplyDelete
Replies
AnonymousAugust 27, 2014 at 3:30 AM
So the adjustment you showed here was not the best way to do it, you needed to take one on the "sites" and adjust the data from the other so that you can then join the 2 together to give a more represenattive view for the period. Does that preserve the trend, irrespective of which series is kept intact?
ReplyDelete
Replies
Victor VenemaAugust 27, 2014 at 3:33 AM
The atmosphere at JoNova seems to be even worse than at WUWT. Or is this post a bad example? Very unpleasant.

You could compute the difference time series of Amberley and the other three stations. That would visualize the change even clearer and also make it easier for you to estimate the right date of the break.
ReplyDelete
Replies
AnonymousAugust 27, 2014 at 8:48 AM
Ipswich is a major growing urban centre.

Brisbane and Samford are nowhere near the same type of climate type.
ReplyDelete
Replies
AnonymousAugust 27, 2014 at 8:50 AM
There is NO RECORD of any site move at Amberley. You are adjusting the data using a whim and a fabrication.

The AGW way !
ReplyDelete
Replies
AnonymousAugust 27, 2014 at 9:30 AM
Good morning Nick

Looks like you have done almost exactly the same analysis of local sites as I did (except I looked at UQ Gatton as well, and used 1961-1990), plus I also checked vs nearest Acorn sites. Yes I see the big change in 1980- some adjustment appears warranted. However, how big should this adjustment be, and should it be applied to every year previous to 1980? There is another discontinuity around 1968 when Amberley minima rises above the mean of its close neighbours until 1980 when it drops below. The Acorn adjustment creates a trend that is greater than any of the Acorn neighbours, which is questionable.
Ken
ReplyDelete
Replies
AnonymousAugust 27, 2014 at 9:53 AM
Nick,

The only doubt I have about the homogenization algorithm is the (apparent) potential for circularity. If adjustments are made to individual stations (like the example above) based only on raw data for surrounding sites, then there can be no circularity. But if the surrounding neighbors have themselves been subjected to homogenization (with other neighbors), then there would seem at least some potential for adjustments to become exaggerated via multiple cycles of homogenization. Do you have any insight on this? Is homogenization based only on raw data for surrounding sites?

Steve Fitzpatrick
ReplyDelete
Replies
AnonymousAugust 27, 2014 at 10:35 AM
Hi I'm Dr. Bill Johnston and I was mentioned in Graham Lloyd's article in today's Australian.

I have analysed Rutherglen's and Amberley's minimum temperature series using annually resolved data. For Amberley, few people realise there are 3 datasets. The RAW; an early high-quality set (HQ), said to be fully homogenised; and ACORN-Sat, a homogenised daily set, which I summarised into annual averages.
Amberley RAW shows a negative trend; HQ a trend of 0.05 deg.C/decade and ACORN, 0.26 degC/decade.
For the RAW data, statistically significant step-changes were evident in 1973 (+0.71 deg.C) and 1981 (down 1.23 degrees) giving a net change of minus0.52 deg.C between 1972 and 1981. In ''73 the Vietnam war was raging and there were many changes at Amberley. The data suggests a temporary station move. The 1981 shift was consistent with establishment of a new site; presumably the current one.
Importantly, with those step-changes removed (deducted from the actual data) there was no residual trend. I used sequential t-tests (STARS), an Excel addin freely available from http://www.beringclimate.noaa.gov/regimes/ (Version 3.2).

ReplyDelete
Replies
UnknownAugust 27, 2014 at 12:10 PM
Further to what Bill said about the HQ data. The HQ data was created in 1996 when the Australian Temperature series was first homogenized.

Torok, S. and Nicholls, N., 1996. An historical
temperature record for Australia. Aust. Met. Mag. 45,
251-260

Here is the homogenize method used in 1966 for Rutherglen Station 82039.

Key
~~~
Station
Element (1021=min, 1001=max)
Year
Type (1=single years, 0=all previous years)
Adjustment
Cumulative adjustment
Reason : o= objective test
f= median
r= range
d= detect
documented changes : m= move
s= stevenson screen supplied
b= building
v= vegetation (trees, grass growing, etc)
c= change in site/temporary site
n= new screen
p= poor site/site cleared
u= old/poor screen or screen fixed
a= composite move
e= entry/observer/instrument problems
i= inspection
t= time change
*= documentation unclear

Station Number - Max or Min - Year - all years prior or single year - change - Adjustment - Cumulative adjustment - reason as per chart above.

Minimum
82039 1021 1948 0 -0.4 -0.4 odp
82039 1021 1926 0 -0.5 -0.9 odm*
82039 1021 1920 1 -1.0 -1.9 frd
82039 1021 1912 0 -2.1 -3.0 oda

Maximum
82039 1001 1980 0 -0.2 -0.2 od
82039 1001 1950 0 +0.2 +0.0 odp
82039 1001 1939 0 -0.6 -0.6 odu
82039 1001 1912 0 -1.0 -1.6 orda

There was a move in 1912 so all min prior to 1912 were dropped - 2.1 and Max -1.0. Apparently a fixed screen in 1939 caused all max prior to 1939 to be lowered -0.6.

They started with 1418 stations and broke it down to 224 for the HQ data set. The ACORN data set is from 112 stations and only post 1910.
The data available via - http://www.bom.gov.au/climate/data/ - is the HQ data so you are comparing homogenized with homogenized.

To obtain the Raw data set you must de-construct the HQ data and the data above shows you how.
ReplyDelete
Replies
UnknownAugust 27, 2014 at 4:36 PM
Yes Nick.
ReplyDelete
Replies
coheniteAugust 27, 2014 at 4:50 PM
Hi Nick. Ain't life grand?
ReplyDelete
Replies
tonybAugust 27, 2014 at 5:12 PM
Well Nick

I think this is murkier and more confused than ever. We have a number of factors in play here.

Raw data is taken. It may or may not be accurate for the dozens of reasons I cited in a previous article. But it IS the raw data

Then we have interpolation of data from sometimes many miles away that might use an average of wrong data, adjusted data or raw data.

Then we have an algorithm applied a la Mosh whereby the data of the past is adjusted to take into account new data.

Where do we end up? With an end product that now has added value and has limited relationship to the original product..

Camuffo was provided with 7milion euros to examine half a dozen long lived European data sets. If you really want to see the numerous factors that can be identified to 'justify' adjustments please let me know and I will link to his book. However, it is a safe bet that the changes being made to the type of data in your post has only received a fraction of the investigation that Camuffo carried out.

In the Met Office library are year books of temperatures for the UK, America and I am pretty sure that of Australian stations, going back to the end of the 19th Century.

It would be interesting to publish the original printed data AND the modern interpretation and see just where (and why) the changes have been made and whether the type of adjustment being made at such places as Amberley are common. Do you ever go back to the archived printed sources?

tonyb
ReplyDelete
Replies
AnonymousAugust 27, 2014 at 7:04 PM
Surely the take away message here is that the raw data appear to be from several different sources, which need to be viewed as different sites for the analysis. Presumably one of these sites can be taken as reliable, and the other sites need to be adjusted by comaprison with nearest neighbouring similar sites over the same period, as long as they have not been adjusted. I cannot see that Nick's post above has much relationship to the adjustments that needed to be done here, it's probably a good example of what not to do in this case, but a good starting point for a discussion on the subject.
ReplyDelete
Replies
AnonymousAugust 29, 2014 at 7:07 PM
Nick: Have you ever read this passage from Feynman's Cargo Cult Science?

"We have learned a lot from experience about how to handle some of the ways we fool ourselves. One example: Millikan measured the charge on an electron by an experiment with falling oil drops, and got an answer which we now know not to be quite right. It's a little bit off because he had the incorrect value for the viscosity of air. It's interesting to look at the history of measurements of the charge of an electron, after Millikan. If you plot them as a function of time, you find that one is a little bit bigger than Millikan's, and the next one's a little bit bigger than that, and the next one's a little bit bigger than that, until finally they settle down to a number which is higher.

Why didn't they discover the new number was higher right away? It's a thing that scientists are ashamed of--this history--because it's apparent that people did things like this: When they got a number that was too high above Millikan's, they thought something must be wrong--and they would look for and find a reason why something might be wrong. When they got a number close to Millikan's value they didn't look so hard. And so they eliminated the numbers that were too far off, and did other things like that. We've learned those tricks nowadays, and now we don't have that kind of a disease."

If you have ever done the Millikan oil drop experiment, you'll know that it produces noisy data with some outliers. It is easy to see how knowledge of the "correct answer" caused this to happen.

What you have done in this post is take data from four independent experiments and used the data from Ipswich to correct the data from Amberley - without any evidence that a problem actually exists with its data. (No studies with reliable equipment that doesn't require homogenization have been done showing how much trends actually vary within a region, but perhaps the USCRN will someday provide such information.) There are legitimate ways of identifying outliers in data, but one DISCARDS the bad data and reports the result from three experiments. One NEVER adjusts the outlier so it duplicates one experiment and reports results for four experiments! That is what you did when you produced your last graph. If you thought of this data as four replicates from a physics experiment, you would never process the data the way you did above. You would never guess which data was right or wrong - or use knowledge or guesses about the "right answer" to decide. Feynman's anecdote explains why.

Homogenization algorithms start by identifying breakpoints with a high degree of statistical certainty. Unlike your effort above, they have evidence that a problem exists in the data. Unfortunately, most stations have many breakpoints in their data, an average of one a decade if my memory is correct. No one can tell if a breakpoint should be corrected unless they have metadata documenting new observing conditions: a station move, a change in TOB, a change in equipment. Some breakpoints may be due to maintenance that restores earlier observing conditions. The latter breakpoints shouldn't be corrected.

Frank
ReplyDelete
Replies
DocMartynAugust 31, 2014 at 3:31 AM
Nick, if I have tubs of ice-cream and tubs of dog shit; what is the ratio of scoops of ice-cream to scoops of dog shit whereby the composite changes from being dog shit ice-cream into ice-cream?
ReplyDelete
Replies
AnonymousSeptember 2, 2014 at 8:11 AM
Nick: If you compare Ipswich and Amberley anomalies over the full record, you will find that Amberley gradually warmed vs Ipswich about 0.5-1.0 degC between 1965 and 1980 and before cooling about 1.5 degC in 1980. Some of the sudden cooling in 1980 may represent a return to a more normal relationship between these two stations. Whenever you correct a breakpoint that was created by GRADUAL change, you risk biasing the trend.

Frank
ReplyDelete
Replies

Add comment

An interactive topic index for all Moyhu posts.
Latest Ice and Temperature data
Climate Data Portals
A gallery of Javascript-enhanced graphics
Temperature trend viewer
Google Maps and GHCN
WebGL map of past GHCN/SST station temperatures
WebGL map of GHCN/SST station temperature trends
HiRes NOAA OI SST with WebGL and Movie
Regional Hi-Res SST movies
WebGL Facility
TempLS Guide
More pages, and blog glossary

moyhu

Tuesday, August 26, 2014

Adjusting Amberley - as it must be

Adjusting Amberley - as it must be

57 comments:

Search This Blog

Maintained Pages

Recent Comments

Blogroll

Blog Archive

Translate

Resources

About Me

moyhu

Tuesday, August 26, 2014

Adjusting Amberley - as it must be

Adjusting Amberley - as it must be

57 comments:

Search This Blog

Maintained Pages

Recent Comments

Blogroll

Subscribe To

Blog Archive

Translate

Resources

About Me