Friday, May 10, 2013

Climate of the Past fails Fourier test


This is a belated post. I'm writing about a paper by Ludecke et al which was accepted in February by the EGU online journal "Climate of the Past". Eli wrote about several aspects, including data quality and how the paper made it to acceptance. Tamino gave a definitive mathematical takedown. Primaklima has a thread with some of the major local critics chiming in.

So what's left to say? And why now? Well, Ludecke had a guest post at WUWT a few days ago, promoting the paper. While joining in the thread, I re-read the online discussion, and was struck by the lack of elementary understanding of Fourier analysis on display. Surprisingly, the guest post was not well received at WUWT, at least by those with math literacy.

I expect that notwithstanding this negativity, the paper's memes will continue to circulate. It comes from EIKE, a German contrarian website. And they have been pushing it for a while. Just pointing out its wrongness won't make it go away.

So here my plan is to redo a similar Fourier analysis, pointing out that the claimed periodicities are just the harmonics on which Fourier analysis is based, and not properties of the data. Then I'll do a similar analysis of a series which is just constant trend; no periodicities at all. Ludecke et al claim that their analysis shows that there is no AGW trend, but I'll show the contrary, that trend alone not only gives similar periodicities, but is reconstructed successfully in the same way.

Data

Ludecke et al use an average of six long-term European temperature series, dating back to 1757. As Eli says, the reliability of the early years is doubtful. They also, to try to get more information on longer periodicities, used a single stalagmite series and some Antarctic ice core data. But the thermometer series is central, and the only one I'll consider here.

At CotP, they were taken to task by Dr Mudelsee, then a reviewer, for not making the series available. Its components are not easy to access. However, Dr Mudelsee ceased to be a reviewer (story to come) and his requirement that they be available was not enforced. I will look at just one of the series, Hohenpeissenberg. L et al say the series are similar.

The analysis

To anyone who understands Fourier analysis, it was trivial. They took a DFT (Discrete Fourier Transform) of the series, truncated the spectrum to the first six terms, inverted, and showed that the result was similar to a smoothed version of the original series. They identified the peaks with claimed natural periodicities.
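
To make the procedure concrete, here is a minimal sketch in R of that truncate-and-invert step as I understand it - my own reconstruction, not the authors' code. It assumes a numeric vector temp holding the annual means:

  N    <- length(temp)
  X    <- fft(temp - mean(temp))        # DFT of the de-meaned series
  keep <- rep(0+0i, N)
  idx  <- 2:7                           # harmonics 1..6 (element 1 is the mean)
  keep[idx] <- X[idx]
  keep[N + 2 - idx] <- X[N + 2 - idx]   # matching negative-frequency terms
  recon  <- Re(fft(keep, inverse = TRUE)) / N
  boxcar <- stats::filter(temp - mean(temp), rep(1/15, 15), sides = 2)  # 15-yr running mean
  ok <- !is.na(boxcar)
  cor(recon[ok], boxcar[ok])            # the kind of r the paper reports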

There was a complication. They added zero padding to the original series. They never said how much; not even if it was a lot or a little. The result was the panel on the left, in the image below:


Note the annotated periods on the peaks. The panel on the right was the stalagmite series, not discussed here.

When it came to truncating and inverting, they read the amplitudes off this graph, but used the original harmonics as frequencies. That still gave a good comparison with a 15-year moving average of the temperature series.

The claims

These were stretched. They say of the peaks marked on their transform:
"Four of our six selected frequencies in M6 have a confidence level over 95% and only one over 99 %. We find for SPA roughly the periods corresponding to 250, 80, 65, and 35 yr from M6."
They compare the peaks from their padded DFT to a noise level derived from the signal. This involved Hurst exponents, DFA, Monte Carlo analysis etc. The referees complained that this was inadequately described. But it is nonsense. The peaks have little to do with the noise, or with the data at all. I will show below a trend-only case where the peaks are still there and there is no noise at all.

The inverse of the truncated frequency series was indeed close, as it must be. They say in the abstract of the final version:
"The Pearson correlation between the mean, smoothed by a 15-yr running average (boxcar) and the reconstruction using the six significant frequencies, yields r = 0.961. This good agreement has a > 99.9% confidence level confirmed by Monte Carlo simulations. It shows that the climate dynamics is governed at present by periodic oscillations. We find indications that observed periodicities result from intrinsic dynamics."
So the fact that the transform/inversion gets back to the starting point is proof, apparently, that "observed periodicities result from intrinsic dynamics".

The original version made the even more absurd claim:
"The excellent agreement of the reconstruction of the temperature history, using only the 6 strongest frequency components of the spectrum, with M6 would leave, together  with the agreement of temperatures in the Northern and Southern Hemisphere, no room for any influences of CO2 or other anthropogenic emissions or effects on the Earth’s climate."
They were prevailed on to take that out, but replaced it with this claim:
"The agreement of the reconstruction of the temperature history using only the six strongest components of the spectrum, with M6, shows that the present climate dynamics is dominated by periodic processes."
Of course it shows nothing of the sort.

They then went on to predict. In the original it said:
"We note that the prediction of a rather substantial temperature drop of the Earth over the next decades (dashed blue line in Fig. 5) results essentially from the ~64 yr 10 periodicity, of which 4 cycles are clearly visible in Fig. 5; and which, consequently, can be expected to reliably repeat in the future.
They are, of course, just predicting on the basis that the fundamental periodicity of the Fourier analysis will repeat outside the range. There is no scientific basis for that. But worse, as a prediction, it just says you'll repeat from wherever you started the data. Start at a different point, and you get a different prediction.

The journal

The managing editor was Prof Zorita. He (presumably) chose the referees. Referee #1 clearly knew nothing of the maths involved. He began:
"The authors appear to be statistical experts in this form of analysis, although not mainly publishing in climate science, and it is refreshing to see new approaches to an old problem."
They aren't experts, and elementary FA is not a new approach. He raised minor objections, and one which he said was major:
"I simply do not see how what the authors have done has any relevance
on excluding GHGs as a major factor on the climate."

Indeed, and that does seem to have led to some excesses being removed.

The second reviewer seems to have been Dr Mudelsee, who was quite critical, making the essential points, although not as clearly as I would have liked. He fell into the trap of making good suggestions (eg detrending) which would not have rescued the analysis if followed. However, they were ignored anyway, so he wrote a more critical response. This seems to have been enough to have him removed from the process. He subsequently asked to be removed from the journal's list of reviewers.

His replacement, referee #2, was more upbeat ("This article is interesting and it deserves publication after some revisions."). His one major objection was to the periodic prediction, marked in blue on Fig 5, on the correct grounds that Fourier analysis will always make a periodic prediction. The blue line remains.

So what was Zorita's role? He wrote a curious, semi-critical editor comment, which didn't come to grips with the technical objections, but recommended some steps such as using part of the data to predict the remainder. None of this was done.

Then with no further public discussion the paper appeared, with just a few changes implemented.

My Fourier analysis of temperature

As mentioned, I'll do just the 231 years of Hohenpeissenberg, ending in 2011. The power spectrum of the DFT without zero padding looks like this:

It is discrete, consisting of the red lines in this low frequency view. The effect of zero padding to a total length of 8192 years is, in effect, to convolve those lines with the sinc function associated with the 231-year data period:

I've kept the original red lines; they are just the harmonics of frequency 1/231. The padding has the effect of shifting the apparent peaks. The reason is that each convolution sinc function has a zero at the neighbouring harmonics but a non-zero slope there, so it shifts the peak. I'll illustrate with a single frequency below. It is of course an artefact - no new information is discovered by zero padding.
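
As a cross-check, here is a minimal R sketch (mine, and only a sketch) of the two spectra, again assuming temp is the 231-year annual series:

  N  <- length(temp)                     # 231
  x  <- temp - mean(temp)
  P1 <- Mod(fft(x))^2                    # power at the harmonics k/231
  xp <- c(x, rep(0, 8192 - N))           # zero-pad to 8192
  P2 <- Mod(fft(xp))^2                   # the same envelope, finely sampled
  f1 <- (0:(N - 1)) / N
  f2 <- (0:8191) / 8192
  plot(f2, P2, type = "l", xlim = c(0, 0.05), xlab = "frequency (1/yr)", ylab = "power")
  segments(f1, 0, f1, P1, col = "red")   # the original harmonics (red lines)

At the harmonic frequencies the padded and unpadded values coincide, which is why the red lines still touch the envelope.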

So the logical thing to do is to reconstruct using the first six harmonics of the DFT. I've done that, with this result:

This is all similar to Ludecke et al.

Peak shift - a single frequency

If we take a single sinusoid (cos) of period 231 years (one period) then the power spectrum of course has a single spike:

But if it is also zero-padded to 8192 years, the result is:

Note that all the side lobes of the sinc function appear, and the peak is shifted. In fact, the (non-power) spectrum has two spikes, one at positive frequency and one at negative, and each shifts the peak of the other - outwards in this case. But the displacement is an artefact; the data was the sinusoid marked by the red line.
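
A minimal sketch of this single-frequency case in R (mine): one cosine over exactly one 231-year period, then zero-padded.

  N  <- 231
  x  <- cos(2 * pi * (0:(N - 1)) / N)
  P0 <- Mod(fft(x))^2                    # a single spike at harmonic 1 (and its mirror image)
  xp <- c(x, rep(0, 8192 - N))
  Pp <- Mod(fft(xp))^2                   # the sinc side lobes appear
  f  <- (0:8191) / 8192
  f[which.max(Pp[2:500]) + 1]            # apparent peak frequency; not exactly 1/231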

Pure trend data

Here is an example which will show what nonsense this all is. I take a series in which the temperature rises with absolute regularity, 0.01°C/year for 231 years. There is no periodicity or even noise. Here is the DFT with no padding:

This is an undergraduate-level problem; the series is given here. The DFT amplitudes are approximately 231*2.31/(2π*n), n = 1, 2, 3, …
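
The same calculation in a few lines of R (my sketch), which reproduces that 1/n fall-off:

  N <- 231
  x <- 0.01 * (0:(N - 1))                # pure trend, 0.01 °C/yr
  A <- Mod(fft(x - mean(x)))             # DFT amplitudes
  A[2:4]                                 # harmonics n = 1, 2, 3
  231 * 2.31 / (2 * pi * (1:3))          # the approximation quoted above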

And here is the DFT with padding to 8192 years.

Just as with the temperature data, there are peaks, in about the same places. We can't do a "significance" test, because there is no noise. They are "infinitely significant". Note the prominence of the base frequency, which Ludecke et al were promoting as a discovery.

Reconstruction

Here is the 6-harmonic reconstruction:

That is pure trend reappearing. But of course, the DFT makes it periodic, so there is oscillation at the ends, where the reconstruction has to drop back sharply. This is very well known - the Gibbs effect, named after the 19th century physical chemist.

The boast in the abstract was:
"The Pearson correlation between the mean, smoothed by a 15-yr running average (boxcar) and the reconstruction using the six significant frequencies, yields r = 0.961. This good agreement has a > 99.9% confidence level confirmed by Monte Carlo simulations. It shows that the climate dynamics is governed at present by periodic oscillations."
My corresponding r-value for Hohenpeissenberg was 0.9496. But for the pure trend case it was 0.988 - higher still, and with no periodicity at all.
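
Here is the trend-only comparison as an R sketch (mine; the exact r depends on details of the smoothing, so treat the number as indicative):

  N <- 231
  x <- 0.01 * (0:(N - 1)); x <- x - mean(x)
  X <- fft(x)
  keep <- rep(0+0i, N)
  keep[2:7] <- X[2:7]
  keep[N + 2 - (2:7)] <- X[N + 2 - (2:7)]
  recon  <- Re(fft(keep, inverse = TRUE)) / N   # 6-harmonic reconstruction
  boxcar <- stats::filter(x, rep(1/15, 15), sides = 2)
  ok <- !is.na(boxcar)
  cor(recon[ok], boxcar[ok])                    # high r, with no periodicity in the data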

Prediction

Ludecke et al used their reconstruction for prediction. They mark the continuation in blue, and find a reason based on the 4th harmonic:
"The prediction of a temperature drop in the near future results essentially from the ~64-yr cycle, which to our knowledge is the Atlantic (Pacific) Multidecadal Oscillation (Mantua and Hare, 2002; Hurrel and van Loon, 1997)."
So, following their method, where is this uniformly rising trend data headed? Down! It rose 2.31°C by uniform steps; Ludecke-style foretelling is that it goes back to where it was, very quickly. And of course, there is absolutely no basis for that in the data.
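
The periodic continuation is easy to demonstrate with the same sketch (using keep and recon from the trend example above): evaluating the 6-harmonic model beyond year 231 just replays the fitted cycle, so the "forecast" heads straight back down.

  t_future <- N:(N + 49)                         # 50 years beyond the data
  pred <- sapply(t_future, function(t)
    Re(sum(keep * exp(2i * pi * (0:(N - 1)) * t / N))) / N)
  pred[1]                                        # equals recon[1]: back where the data started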

And if the analyst had happened to Fourier-transform 500 years of data, then the prediction would be a 5°C drop.

Conclusion

The analysis is spurious, based on an elementary misunderstanding of Fourier series. And I think it is very bad that Zorita could not see it, especially given Mudelsee's warnings.




34 comments:

  1. Lovely. I must steal it.

    I agree, it's very bad that Zorita couldn't see it. But also bad that none of the other referees saw it clearly. Even Tamino missed the key point about the padding (and I'm just going with your flow; I did FFTs as an undergraduate and what you say *sounds* entirely plausible but I can't claim to have checked).

  2. Nick,

    Open discussion comments shown on the CotP website can be posted by anyone on a voluntary basis, so it's not necessarily the case that Zorita chose them, or even knew who they were.

    AIUI there is another round of reviews, after this initial open discussion phase, in which Zorita would select reviewers in his capacity as editor, but those reviews aren't posted.

    Replies
    1. The discussion has contributions from people labelled Anonymous referee #1 and #2. I can't imagine that they just wrote in with those names. Zorita refers to the referees frequently in his message.

      AIUI the second round is basically where the editor and authors haggle, bringing in the referees as the editor wishes.

    2. Ah, yes, I see now they make a distinction between referee comments (RC) and short comments (SC). The former presumably selected, the latter open to all. That does mean Mudelsee was never an appointed referee, he was just posting a short comment in the open discussion as anyone is able to do. Anonymous referee #2 wasn't a replacement - he was probably invited to comment before Mudelsee posted anything, and indeed the first comment from each appeared on the same date.

      Looking at other dates, Zorita's comment was posted on 27 Nov 2012, the final paper says a revised paper was submitted on 20 Jan 2013 and was accepted 4 Feb 2013. That's not much time for a further round of reviews, particularly since the reviewers and Zorita himself seemed to be suggesting a complete do-over.

  3. Nick, do you have any plans to make this into a comment to the paper? I think some people need to be embarrassed (Zorita claimed the reviewers were experts in time series analysis; surely they should have noted the problems?).

    Marco

    Replies
    1. Marco,
      In their policy, the only mention of comments on papers is the interactive forum, which does seem to be the right place. But that's for a limited time. In their FAQ they explain why they don't have a category of "letters" or "short communications".

  4. Nick, in case you haven't noticed it, Eduardo commented on the background of the review process. Georg, Manfred and I commented as well: Klimazwiebel
    In fact, Manfred wasn't an official reviewer at any stage. In any case, thank you for this additional comment on the subject.

    Replies
    1. Thanks, I hadn't seen that. It is a very interesting thread. I had inferred from some of Mudelsee's comments that he had had some reviewer status, but I see from the editor's view that he had declined.

      I think, though, that it was a very poor defence by Prof Zorita. He dismissed a chorus of objections because they had been previously expressed at a blog:
      "There were a few more public comments on this manuscript, most of them critical. However, as editor, I could not consider all these comments as independent reviews, since all of them stemmed from commentators of Georg Hoffmann's blog. I have a strong opinion that reviews of a paper have to be independent."
      I see that you remarked on this too.

      So, the fairness of the process was maintained, and the truth died. It is the responsibility of the editor to find out whether the paper is actually right, not just to adjudicate a he said/she said process.

      Anyway, I see that he did invite a rebuttal.

  5. James also had a bit of discussion on this article on his blog.

    http://julesandjames.blogspot.com/2013/03/peer-review-problems-at-egu-journals.html

    Don't you think it's a bit of a stretch though to suggest the truth "died"? Essentially everybody including the WUWT crowd is aware the paper is bad.

    Anyway... to the more technical aspects of your post:

    The result you got here is a special case where the frequency of the signal is exactly centered in a bin, and it depends on the use of a rectangular window. For observational data, it's essentially never going to be centered in a bin. In that special case you're guaranteed to get just one peak, but in general you'll have nonzero values for the other bins.

    I redid your little experiment, but I moved the period to 250 years, had an observational period of 1000 years, and a sampling interval of 1-year. (It makes it a bit easier to interpret if you move the signal out of the first non-zero frequency bin.)

    Here's my version of the plot.

    When you zero-pad the data, the true window-response function is revealed, which for a rectangular window is just a sinc function of course.

    Since the location of the secondary maxima of the window response function depends on the window function chosen, one test for windowing artifacts is to use more than one window type...

    I recommend using rectangular, Welch, triangular, Hann and Blackman windows.

    Here's a figure. (I'm just showing rectangular, Welch and Hann windows.)

    Replies
    1. Carrick,
      "The truth died" - well, it's a play on "the operation was a success but the patient died". Focussing on process, while forgetting that the object of review is to find out whether the paper is sound.

      With the sinusoid example, I was trying to show that when padding shifts the peak location, it's due to overlapping sincs. Because of the regular spacing, the value at each harmonic doesn't change - the sincs from neighbouring peaks have a zero there. So the red lines continue to touch the envelope. But those sincs don't have zero slope there, so the apparent peak shifts.

      Yes, I think padding with windowing might help a little, by reducing overlap. But the issue isn't really with the spread of the peaks; it's the peaks themselves and their misinterpretation.

    2. Nick, do you agree the peaks are artifacts of windowing?

      Changing the window function shifts the location of the secondary maxima in addition to changing the width of the primary maximum, so if the peaks are due to an artifact of windowing, going to Hann windowing would nail this.

      Today is Mother's Day in the US and I'm trying not to draw wife aggro, so I have to stop here!


  6. Do you have a link to the data that you actually used?

    Replies
    1. Carrick,
      Do you mean for Hohenpeissenberg? I used the BEST data. I've been looking into it more, and I found that all but Paris are in their set, with these IDs:
      # HOH=14159; KRE=5226; MUN=14189; WIE=5238; PRA=13014; BER=14474

    2. Carrick,
      here is a csv file of the annual averages of the 6 stations # HOH=14159; KRE=5226; MUN=14189; WIE=5238; PRA=13014; BER=14474. It starts 1701.

  7. Thanks Nick.

    Regarding this statement:

    To anyone who understands Fourier analysis, it was trivial. They took a DFT (Discrete Fourier Transform) of the series, truncated the spectrum to the first six terms, inverted, and showed that the result was similar to a smoothed version of the original series. They identified the peaks with claimed natural periodicities.

    It appears from their figure they also subtracted the mean of the series... correct?

    I can't understand their rationale for not detrending the data though.

  8. Nick thanks again for the link to the data. When I compute the spectra for the Hohenpeissenberg data (231 years), I get a very similar figure to Ludecke's Figure 3, in terms of the apparent spectral peaks in the data.

    I went back and did the test I suggested, namely compared rectangular to Welch and Hann windowing. It's very clear (to me anyway) that most of these other peaks are artifacts. I still get peaks for periods near 250 years and 35 years.

    My suspicion is the 250 year peak is just a windowing artifact associated with the non-zero trend in the data (it disappears if you e.g. quadratically detrend the data).

    The existence of a 35-year period seems plausible though. It's robust across different window function choices.

    I think that the labeled peaks in Ludecke's figure 3 are just windowing artifacts.

    Anyway here's the figure.

    Definition of Welch window here.

    There's a more formalized approach for testing the robustness of spectral peaks called the "multitaper method". There even seems to be an R-implementation. If you care to pursue this, you might get a publishable result from that.

    Replies

    1. Carrick,
      That sounds right about the 35 yr; with >6 periods, there's a good chance something can be said. I think the difficulty of separating lower frequencies from artefacts just reflects that there are too few periods to tell.

      In effect, the DFT bins low frequencies. They have to go somewhere. I think your filters more efficiently push them into the lowest bin.

      I looked at the detrending used in Matlab - it's just OLS. I think this is not the right idea. If you think of sin(πx) from -1 to 1, it is very well handled by DFT. But it has an OLS trend, and if you subtract that, you get a mess.

      The idea of subtracting a trend is to make the residue more plausibly periodic, which really means continuous. I think the line to be subtracted needs to be chosen on that criterion. It should be a line joining some estimate of endpoint values, rather than OLS.
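
      In R, one way to do that (just an illustration; the 15-yr endpoint windows are an arbitrary choice) would be:

      n  <- length(temp); k <- 15
      y0 <- mean(temp[1:k]);       t0 <- mean(yr[1:k])
      y1 <- mean(temp[(n-k+1):n]); t1 <- mean(yr[(n-k+1):n])
      slope <- (y1 - y0) / (t1 - t0)
      detrended <- temp - (y0 + slope * (yr - t0))   # line joining endpoint estimates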

    2. "filters" I meant windows. I assume the red curve is Hann? Yes, agreement at 35 yr is good.

    3. Nick, regarding detrending... I don't think there's a lot more that you can do, unless you have a specific model for the origin of the secular component. It's pretty typical to detrend on a per-window basis (e.g. linear detrend) before computing the spectrum. Detrending the whole series with a polynomial is less optimal.

      In this case, you don't have a choice because there's so little data.

      Regarding this: I think the difficulty of separating lower frequencies from artefacts just reflects that there are too few periods to tell.

      That's why I suggested using the multitaper method... See this, particularly noting:

      [The problems with conventional Fourier analysis ] are often overcome by averaging over many realizations of the same event. However, this method is unreliable with small data sets and undesirable when one does not wish to attenuate signal components that vary across trials. Instead of ensemble averaging, the multitaper method reduces estimation bias by obtaining multiple independent estimates from the same sample. Each data taper is multiplied element-wise by the signal to provide a windowed trial from which one estimates the power at each component frequency.

      The technique I used above to vary the window (taper) function is similar in spirit, but Thomson went a step further by demonstrating that the discrete prolate spheroidal sequences of length "n" (window length) are independent of each other (as you know, by using orthogonal functions, the inversion of the OLS is both trivial and noiseless).

      It's a very elegant method, and, like the Welch periodogram, it provides a mechanism for estimating confidence intervals.

      I really don't have any question that most of the spectral peaks in Ludecke's Figure 3 (left panel) are artifacts, but a formal method like this nails it.

      Here is my attempt (I've normalized the PSD using Parseval's theorem, requiring that the integral of the PSD equal the mean-squared value of the original time series.)

      Figure

      Blow-up here.

      I've quartically detrended the data before computing the MTM PSD. Note that the spectral peak I surmised was real survives here... it shifts to a bit longer period though - 41 years versus 35.

    4. Regarding normalization... I should have stipulated that Matlab uses the same normalization. However, I checked to verify this.

      The code is very simple here:


      % read a two-column ascii file (year, quartically detrended temperature)
      [x yp count] = readdata('hohen.nick.detrend.n4.txt');
      % multitaper PSD estimate: time-bandwidth NW = 4, zero-pad to 16x the data
      % length, sample rate 1 per year, 95% confidence bounds
      [pxx0p,pxxcp,f] = pmtm(yp,4,16*length(yp),1.0, 0.95);
      % plot the PSD and its confidence bounds on a log frequency axis
      semilogx(f,pxx0p,f,pxxcp)

      readdata is just my code to read in a two column ascii data file, nothing special there.

      (source here)

      The data file is here.

      Documentation for PMTM can be found here.

    5. Carrick,
      Yes, the code is very simple. I'd worry a little about quartic detrending. The argument for linear detrending is that it probably doesn't take out periodic signals (though in the sin example above, it does take out some). But by quartic, there is substantial opportunity to remove a periodic signal. Or put another way, there is ambiguity about what is periodic and what is secular.

      I've been trying different detrending. Now it's simpler; I just use weighted regression, emphasising the ends - cosh(yr/10) currently. The idea is that the residuals are close to zero near the ends; with quadratic, the gradient of the residuals is near zero too. That makes a periodic fit more reasonable, and it's fairly continuous with zero padding. It works fairly well, and makes tapering less attractive. I'll post results later today (14th).
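
      In R it's just a weighted regression, something like (with the years centred on their mean):

      w   <- cosh((yr - mean(yr)) / 10)      # weights emphasising the ends
      fit <- lm(temp ~ yr, weights = w)
      res <- temp - fitted(fit)              # residuals that go into the DFT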

    6. Nick: Or put another way, there is ambiguity about what is periodic and what is secular.

      Yes I think this is always the case for very low frequency signals, but the ambiguity is intrinsic here.

      If the data really have a linear/quadratic/etc trend, the only "right way" to model them is as a linear/quadratic/etc trend + oscillatory terms.

      Higher order trends can mimic spectral peaks, via their window response function, as you showed here.

      It's harder (not impossible, but unlikely) for spectral peaks to mimic a trend, but it does require a special phase relationship among them to produce a trend from them. You can address the effect of detrending on estimation of spectral peaks, through a Monte Carlo, where you randomize the phase of the components and see how much effect the detrending has on them.

      Anyway, it'll be interesting to see what you come up with.


      Here's the results of a sensitivity test, where I've applied linear, quadratic, cubic and quartic detrending to Hohenpeissenberg:

      Figure: http://dl.dropboxusercontent.com/u/4520911/SignalProcessing/hohen.nick.detrend-effect.pdf


      Here's my interpretation:

      Linear detrending still leaves what appears to be a large window response function from an apparent secular component in the original data. Quadratic detrending actually suppresses the measured response for frequencies below 0.015 yr^-1. Cubic and quartic give very similar results.

      This suggests that the multitaper result for cubic and quartic gives the spectral peaks that are *unambiguously* associated with actual oscillatory components present in the actual signal over the measurement period.

    7. Carrick,
      I am getting quite good results with weighted regression (cosh((yr-meanyr)/10)) detrending. It leads to near continuity at the edges, and a non-zero mean. So there is a big zero-frequency peak which extends beyond the first harmonic, but other peaks are not very high. The 35 yr isn't robust.

      I've tried adding tapering, but it's a mixed blessing. It widens the central peak, and while I'm sure it is suppressing side lobes, they weren't so bad anyway.

      With weighted detrending, going to quadratic doesn't seem to help much.

      It's late here, so I won't try posting pictures, but probably in about eight hours I'll show something.

    8. Carrick,
      Results here. I've done each of 7 stations and their mean, for
      1. Ordinary detrend (OLS)
      2. wtd detrend
      3. OLS + Welch taper
      4. wtd + Welch

      Nothing is perfect, but there's no great pattern visible.

    9. Carrick,
      I expanded the results file to cover a weighted quadratic fit. I've lost faith in my weighting idea, though. It creates a big central (zero-frequency) value, because the mean is non-zero, and there's no real improvement to compensate.

    10. So reading through our various comments, here's my summary...

      I think the 35-year peak is probably real, but has a fairly low signal-to-noise. I wouldn't write a paper featuring it though.

      The side-lobes are a bit of an issue in this case because Luedecke et al erroneously try to assign meaning to them.

      Changing the taper or detrending helps to demonstrate that these peaks are associated with the window response to the secular component.

      The balance between the width of the central peak and the tail of the response function depends strongly on the class of measurements you're taking. It's pretty rare for rectangular windows to beat out other tapers, especially when you've detrended the data.

      Very low frequency signals and secular drifts are difficult to sort out. I would personally never write a paper that featured the first non-zero frequency bin of an FFT for this reason, unless I had a physical model that allowed me to "sort things out". Measuring the response of an amplifier to an external signal might be such a case (the model is well understood mathematically; here you're just trying to characterize its behavior).


      MTM formalizes a method for separating out window-response artifacts from spectral features that are actually present in the signal. Because of the ambiguity between low-frequency oscillations and secular terms, interpretation of the results in terms of origin (e.g. oscillatory versus drift) still remains open.

      The conservative approach would be to "allow" as much of the low frequency structure be "eaten" by detrending as is reasonable, and assign the rest as plausibly associated with oscillatory behavior.

      Trying to construct a statistical forecast model using the low-frequency structure where an ambiguity of this sort exists is completely insane.

  9. Got distracted.... to clarify this, other than the 35-year peak, I think the other, longer period, peaks in Figure 3 are not real.

  10. No idea what you peeps are going on about but thought it might be interesting to note that 35 years is the average recurrence time for large volcanic eruptions in the Gao 2008 stratospheric aerosol series. There's fairly substantial variance (+/-28 years, 1 sd) so it could be coincidence.

    Replies
    1. Paul S,
      Here's my summary of what we've been on about. I've criticised a flawed attempt to extract information about low freq periodicities where there are only a few periods observed. Artefacts were not being separated from meaningful analysis. The question was, can this be done better.

      The problem shows clearest with the simple DFT. The low freq information is binned, one bin per harmonic. That limits resolution. But there's junk in the bins too. A big source is from the DFT trying to fit secular processes into a periodic frame. So the common suggestion is, de-trend. I played with a non-standard method, but on balance, it was not better. But de-trending is good.

      Then there's zero padding, which lets you try many more frequencies, and has some potential to show a sharp peak. But not much, because each peak is convolved with a sinc function associated with the data length. This broadens the peak, and also produces side lobes, which push adjacent peaks off centre.

      Carrick suggests tapering. This suppresses side lobes, but broadens the centre. He also suggests multi-tapering, which gives varied views, which helps to decide what is real and what not.

      I've done a lot of playing, though not yet with systematic multi-tapering. I haven't found any low frequency that I think is robust, including 35 yr.

  11. IEHO the problem was the reviewers were a) polite and b) not really appropriate. Now some (hi Steve), not Eli to be sure, would say that the fix was in given Zorita's connections to the Alpine climate groups. He had to know better.

  12. Hi, where did you find the numerical data for the stations in BEST? esp the ones that are corrected.

    http://berkeleyearth.lbl.gov/stations/155038

    Replies
    1. Eli,
      I downloaded from BEST in December - metadata:
      File Generated: 30-Jan-2012 22:25:12
      Dataset Collection: Berkeley Earth Merged Dataset - version 2
      Type: TAVG - Monthly
      Version: LATEST - Non-seasonal / Quality Controlled
      Number of Records: 36853

      I've put here an R binary file of the annual averaged data in a GHCNv2 type structure (6.8Mb), with an inventory file here.
