moyhu: Trends, breakpoints and derivatives

Sunday, January 25, 2015

Trends, breakpoints and derivatives - part 2

In part 1, I discussed how trends worked as a derivative estimate for noisy data. They give the minimum variance estimator for prescribed number of data points, but leave quite a lot of high frequency noise, which can cause confusion. I also gave some of the Savitsky-style theory for calculating derivative operators, and introduced the Welch taper, which I'll use for better smoothing. I've chosen Welch (a parabola) because it is simple, about as good as any, and arises naturally when integrating (summing) the trend coefficient by parts.

I gave theory for the operators previously. The basic plan here is to apply them, particularly second derivative (acceleration) to see if it helps clarify break points, and the general pattern of temperatures. The better smoothing might seem contrary to detecting breakpoints, since it smooths them. But that actually helps to avoid spurious cases. I'll show here just the analysis of GISS Land/Ocean.

I'll start with the spectrum of acceleration below. As I said in Part 1, you can actually get much the same results by differencing the smooth (twice for accel), or smoothing the difference. But the combined operator shows best what is happening in the frequency domain.

Spectrum of acceleration

Here is a plot of the spectra for acceleration, as with trend in part 1.

Some points:

Each of the operators is now quadratic for low frequencies, as differentiation requires. As the frequency (1/width) = 10 /Cen is approached, the response again starts to taper. This is the effect of smoothing at higher frequencies.
Each operator then has pronounced band-pass character, slightly more so than with trend. This will show in their behaviour.
You can still see the increasing order of roll-off, though each is slower than the corresponding trend spectrum.

Gradient plots

The active plot below shows gradients with 10,20 and 30 year filters, on 13 different datasets. Each plot shows the three different tapers ("Regress" (red) is just OLS). You can use the buttons at the top to change data set or filter length.

Length yrs	Dataset

The plot you see first here is GISS Land/Ocean monthly, 30 year filters. The filter is centered, so you see an estimate of the derivative at the year marked on the axis. There is no padding, so the plot stops at 2000. Some notes:

The trend is mostly positive (warming).
As the smoothing increases, there is more pronounced amplification around the filter period (30 yrs). Inevitably, most of that is noise. But it happens even with the OLS trend.
There is no radical change as smoothing increases, but the blue curve strips away high frequency detail, which probably had little meaning.
What remains are the familiar features - warming 1910 to near 1940, then a hiatus, then warming from about 1975 on, with a max trend (not a pause) at about 2000. Some sign of deceleration there, although it could be just the amplification of the 30 yr band.

Acceleration plots

Now we are estimating second derivative, which should be mostly the derivative of the above. This will be clearer with the W² blue curve. The main thing to look for are spikes (+ or -) to indicate break points, where the derivative changes.

Length yrs	Dataset

The spikes aren't very pronounced. There is conflict between the want to remove HF noise, and preserving the spike. So the smoothest line shows smoothish spikes, but that is abtually the meaningful part. It isn't really better without smoothing. So here we see 1910 and 1940 as the most prominent features, with a reasonable peak around 1972 (it's really hard now to pin down a year, as it should be). At this resolution, no sign of a peak at 2000.
Going to shorter periods doesn't really reveal more. There is just more noise at about the periodicity of the filter length.

More about the datasets

HadCRUT - HADCRUT 4 Land/Ocean
GISSlo - GISS Land/Ocean
NOAAlo - NOAA Land/Ocean
UAH5.6 - UAH Lower Troposphere
RSS.MSU - RSS Lower Troposphere
TempLSgrid - Land/Ocean
BESTlo - Land/Ocean
C.Wkrig - Cowtan and Way kriging Land/Ocean
TempLSmesh - Land/Ocean
BESTla - Land Only
GISS.Ts - Met stations
CRUTEM - Land Only
NOAAla - Land Only
HADSST3 - Sea Surface
NOAAsst - Sea Surface

28 comments:

JCHJanuary 26, 2015 at 3:18 AM
Really interesting. Can you do this to the PDO and AMO indexes?
ReplyDelete
Replies
Greg GoodmanFebruary 21, 2015 at 7:29 PM
This pair of articles is very informative.

It is odd however that you have not pointed out one of the most significant features of all these filters: the negative lobes.

The first negative lobe appears to have a magnitude of about 50% of main peak. That is a huge problem. 50% leakage would be bad enough but actually inverting the signal is potentially disastrous if there is significant signal in that part of the spectrum.

This actually looks considerably worse than a straight running mean:
https://climategrog.files.wordpress.com/2013/05/gauss_rm_fft.png

( Note that is a magnitude plot and, as here, every other lobe is negative for RM )
ReplyDelete
Replies
Greg GoodmanFebruary 22, 2015 at 5:32 PM
Here is the response of the gaussian-derivative. Since the freq resp of a gaussian is itself a gaussian the result of diff and gauss is the linear ramp of diff x gaussian and gaussian wins at high freq.

http://climategrog.files.wordpress.com/2015/02/dgauss-5y_freq_resp.png?w=800

The example I plotted is for sigma=5y which is a suitable replacement for the M&F 15y sliding-trend .

It would be good to see this (with suitable frequency scaling) on the same plot as your graph in this post to allow direct comparison.

Roll-off is slow and even but most importantly does not have negative lobes. It looks like your blue line filter may have some merit of a much faster roll-off but the neg. lobe, though smaller, is still a bit ugly.

presumably you could take the process further and get a still improved version.

I would like to try a Lanczos-derivative, in the same way as GD. That should have minimum overshoot/ripple and be faster roll-off than the blue line.

ReplyDelete
Replies
CarrickFebruary 23, 2015 at 5:49 PM
One thing I should add is that Nick's transfer function is not dimensionless. This is the ratio of the trend amplitude to temperature amplitude.

If you want to look at the dimensionless version, you need to divide by 1/f (the value at f=0 is undefined as is appropriate).

Also, you have to look at the noise spectrum when looking at the relative magnitude of the signal that passes through the first lobe as compared to the pass band of the filter.

Since this is also 1/f to some power, in practice, the amount of energy in the first band is much less than the 50% that Greg is quoting.
ReplyDelete
Replies
Greg GoodmanFebruary 23, 2015 at 7:57 PM
You seem to assume that temperature is random: integrated white noise. If it was we would not all be arguing about "forcings".

There were two major "forcing" events: El Chichon and Mt Pinatubo that inserted a very significant circa 10y signal in the latter half of the 20th. c. Inverting part of that is definitely not desirable.

There are some errors in Nick's formula as give since it only works n=10. I would not start doing dimensional analysis until he's fixed. it.

The transfer should not be undefined at zero, the difference operation will take out any constant term. It is correctly zero and should tend to zero in a well defined manner.
ReplyDelete
Replies
@whutFebruary 24, 2015 at 11:27 PM
I likely will never use filters with that large a window. All I really need is a window of about 1 year to suppress the intra-annual noise. It is becoming more and more clear that a time-series such as the global GISS record is composed by ENSO, volcanic, CO2, etc factors and that one uses these signatures as a model-based compensation or filtering. And for the long-term variations, the temperature compensation is empirically established by the LOD anomaly.

So the approach is to apply the same small-window filter to both the SOI and the GISS time series and compose that way.

http://contextearth.com/2015/01/30/csalt-re-analysis/

Moreover, this approach points out how delusional the current hysteria is over manipulation of temperature data -- see Curry, WUWT, and the usual suspects. There is no possible way that manipulation of temperature and of a metric such as SOI can be aligned so perfectly as a result of a conspiratorial strategy.

ReplyDelete
Replies
Greg GoodmanFebruary 25, 2015 at 3:09 AM
How about you learn some manners PUKE-ITE. ?

Yes, I misread fine.

Now as usual you ignore the very valid points I make about your ill-conceived model and how you need to test it and why your regressions are wrong, and simply declare it "the winner in all this".

Very powerful argument that.

You're right, I'm really, really afraid of the power of your model and what it is going to do to the world. LOL.

ReplyDelete
Replies
@whutFebruary 25, 2015 at 3:39 AM
I don't know what your problem is Goodmann, other than you have problems spelling your own name.

So you want me to go through your "very valid points", eh?

First, I don't do AR1, I do model an Ornstein-Uhlenbeck random walk process. And that isn't going to cut it.

Next, what exactly is the problem with the well-known factors that contribute to natural variability? You call them exploratory variables. Yes, I have seen how you are thrashing and wailing about trying to make sense of the volcanic aerosols. My recommendation to you is to take a deep breath and just do it correctly. The result pops right out, just as the SOI factor pops out. Adding in the log(CO2), TSI, and LOD anomaly brings up the correlation coefficient to at least 0.98.

That is essentially the starting point for further analysis.

If you really want to ignore 98% of the jigsaw puzzle because you are beholden to some political agenda, that is your problem. It really is painful to watch you abnegators try to make things more difficult than they actually are, for the sole purpose of creating FUD.

ReplyDelete
Replies

Add comment

An interactive topic index for all Moyhu posts.
Latest Ice and Temperature data
Climate Data Portals
A gallery of Javascript-enhanced graphics
Temperature trend viewer
Google Maps and GHCN
WebGL map of past GHCN/SST station temperatures
WebGL map of GHCN/SST station temperature trends
HiRes NOAA OI SST with WebGL and Movie
Regional Hi-Res SST movies
WebGL Facility
TempLS Guide
More pages, and blog glossary

moyhu

Sunday, January 25, 2015

Trends, breakpoints and derivatives - part 2

Trends, breakpoints and derivatives - part 2

Spectrum of acceleration

Gradient plots

Acceleration plots

More about the datasets

28 comments:

Maintained Pages

Search This Blog

Recent Comments

Blogroll

Blog Archive

Translate

Resources

About Me

moyhu

Sunday, January 25, 2015

Trends, breakpoints and derivatives - part 2

Trends, breakpoints and derivatives - part 2

Spectrum of acceleration

Gradient plots

Acceleration plots

More about the datasets

28 comments:

Maintained Pages

Search This Blog

Recent Comments

Blogroll

Subscribe To

Blog Archive

Translate

Resources

About Me