moyhu: A weekend paradox

Sunday, April 27, 2014

A weekend paradox

There's a post by Willis Eschenbach at WUWT, titled Extreme Times. It notes that with an autocorrelated signal, for any fixed observation period, the max for the period is more likely to be at the ends than in the middle. That's not easily intuitive.

He goes on to argue that statements like "the millennium ended with its warmest decade" do not reinforce global warming. That's what millennia do.

The argument he's countering says that there's only about a 1 in 100 chance of that happening by chance. No, says Willis, it's more like 1 in 50. True, but that's still only 1 in 50. It doesn't change much.

Anyway, I thought of a more homely version of the paradox. Counting weeks as starting on Sundays, on what day are weekly temperature maxima most likely to occur?

My argument went thus. It's like with TOBS. Warm days often come in spells. A warm spell midweek will probably yield a max for one week. But a warm spell at the weekend may well make a weekly max on both Sat and Sun. So over a year, say, Sundays will show up more in the statistics. In fact, up to twice as often as mid-week.

Anyway, the argument at WUWT went on, so I checked. I have a file of daily max for Melbourne from this post. It's from May 1855 to Nov 2013. I counted. Results below, with an error corrected:

Update: I had made a mistake in transposing matrices, which had the result of somewhat exaggerating the effect. It is still there though. I have posted the Melbourne max data as a 7 col array here. It starts Sun 7 May 1855.

Day of Max	Number
Sunday	1273
Monday	1138
Tuesday	1100
Wednesday	1071
Thursday	1101
Friday	1057
Saturday	1534

This doesn't mean, alas, that Nature gives us specially warm weekends in Melbourne. You'd get the same result for minima. Or, if you start the week on Wednesday:

Day of Max	Number
Wednesday	1306
Thursday	1102
Friday	1067
Saturday	1126
Sunday	1038
Monday	1150
Tuesday	1485
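
The end-of-week effect does not need real weather data. Here is a minimal sketch in Python, assuming an AR(1) process with lag-one coefficient 0.7 as a stand-in for daily temperature (both the model and the coefficient are assumptions for illustration, not fitted to Melbourne). It counts which position in each 7-day block holds the block maximum, then repeats with the week start shifted by three days.

```python
import numpy as np

def ar1_series(n, phi, rng):
    """Generate n values of AR(1) noise: x[t] = phi * x[t-1] + e[t]."""
    e = rng.standard_normal(n)
    x = np.empty(n)
    x[0] = e[0]
    for t in range(1, n):
        x[t] = phi * x[t - 1] + e[t]
    return x

def weekly_max_counts(series, offset=0):
    """Count how often each position in a 7-day block holds the block maximum.

    offset shifts the start of the "week" by that many days.
    """
    usable = (len(series) - offset) // 7 * 7
    weeks = series[offset:offset + usable].reshape(-1, 7)
    return np.bincount(weeks.argmax(axis=1), minlength=7)

rng = np.random.default_rng(0)
x = ar1_series(7 * 8000, phi=0.7, rng=rng)   # hypothetical 8000-week record

counts_a = weekly_max_counts(x, offset=0)    # week starts on day 0
counts_b = weekly_max_counts(x, offset=3)    # week starts three days later
print(counts_a)
print(counts_b)
```

The first and last positions should collect noticeably more maxima than the middle ones for either choice of week start. Gaussian AR(1) noise is statistically the same run forwards or backwards, so the sketch gives roughly equal counts at the two ends; it does not reproduce the Saturday/Sunday asymmetry in the Melbourne table.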

TOBS

This logic lies behind the TOBS adjustment for changes in the resetting time of min/max thermometers. There you divide into 24 hour periods. The difference is that Nature does distinguish between hours of the day. So if you make the split at 5pm, it does increase the occurrences of both maxima and minima there, but maxima are far more likely to occur at that time of day, so they are the ones more often double counted. That's a warm bias. If you shift to 9am, minima will be favored. In the USHCN, there was a trend to move from 5pm reading to 9am reading of min/max thermometers (which sets the start of the "day"). That needs to be corrected. And yes, since the bias moved from warm to cold, correcting it increases trends.
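
A toy simulation can illustrate the direction of the bias. The sketch below is an assumption-laden illustration, not the USHCN procedure: it invents an hourly temperature series from an idealised daily cycle (coolest at 6am, warmest at 3pm, amplitude 5 degrees) plus persistent AR(1) "weather" noise, then reads a min/max thermometer that is reset at midnight, 9am or 5pm. All function names and parameter values are hypothetical.

```python
import numpy as np

def diurnal(hour, amp=5.0, t_min=6, t_max=15):
    """Idealised daily cycle: coolest at t_min, warmest at t_max (hours)."""
    hour = np.asarray(hour) % 24
    rise = t_max - t_min                     # hours spent warming
    fall = 24 - rise                         # hours spent cooling
    since_min = (hour - t_min) % 24
    warming = -amp * np.cos(np.pi * since_min / rise)
    cooling = amp * np.cos(np.pi * (since_min - rise) / fall)
    return np.where(since_min < rise, warming, cooling)

def synthetic_hourly(n_days, phi=0.98, sigma=4.0, seed=0):
    """Hourly temperatures: daily cycle plus persistent AR(1) weather noise."""
    rng = np.random.default_rng(seed)
    n = n_days * 24
    shocks = rng.standard_normal(n) * sigma * np.sqrt(1 - phi**2)
    weather = np.empty(n)
    weather[0] = shocks[0]
    for t in range(1, n):
        weather[t] = phi * weather[t - 1] + shocks[t]
    return diurnal(np.arange(n)) + weather

def minmax_means(temps, reset_hour):
    """Mean recorded max and min when the thermometer is reset at reset_hour."""
    n_windows = (len(temps) - reset_hour) // 24
    windows = temps[reset_hour:reset_hour + 24 * n_windows].reshape(-1, 24)
    return windows.max(axis=1).mean(), windows.min(axis=1).mean()

temps = synthetic_hourly(5000)
results = {h: minmax_means(temps, h) for h in (0, 9, 17)}
for h, (tmax, tmin) in results.items():
    print(f"reset {h:02d}:00  mean max {tmax:6.2f}  mean min {tmin:6.2f}  "
          f"mean {(tmax + tmin) / 2:6.2f}")
```

In this toy setup the 5pm reset should give the highest mean maximum and the 9am reset the lowest mean minimum, because a warm afternoon or a cold morning sitting next to the reset time can set the extreme in two consecutive 24 hour periods. The sizes of the biases depend entirely on the invented parameters.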

18 comments:

Anonymous, April 28, 2014 at 2:57 PM
I'm guessing next he will go and prove the non-existence of God, since there's autocorrelation between proven miracles and the amount of religious people present at the time of observation.

EFS_Junior, April 29, 2014 at 10:07 AM
Nick,

The double counting at the ends can be removed by doing what is called a moving max (or min). However, I've only done this for an odd integer window size, so that it is centered on the middle of the window. The trick is to match the moving max/min time series with the original time series and select all exact matches (for limited precision data, I add random noise at the end of each data point, that removes repeating numbers that might occur due to limited precision).
I did find the algorithm (on my own, but perhaps just a reinvention that someone else has already done before) completely removes adjacent maxima (or minima). I did this about 3 years ago when working on historic Mississippi River stage data during the flood of 2011. I know this algorithm works for selecting, on average, a max/min, per annum, when N = 365 (one data point/stage per day).
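
The moving-max selection described above can be sketched as follows. This is one reading of the comment, not EFS_Junior's actual code: the function name `moving_max_peaks`, the `jitter` argument and the padding of the ends with minus infinity are all assumptions made for illustration.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def moving_max_peaks(x, window, jitter=0.0, seed=0):
    """Return indices where x equals its own centred moving maximum.

    window must be odd so that it is centred on each point.
    jitter > 0 adds small uniform noise to break ties in limited-precision data.
    """
    if window % 2 == 0:
        raise ValueError("window must be odd so that it can be centred")
    x = np.asarray(x, dtype=float)
    if jitter > 0:
        rng = np.random.default_rng(seed)
        x = x + rng.uniform(0, jitter, size=x.shape)
    half = window // 2
    # Pad with -inf so that points near the ends still get a full window.
    padded = np.pad(x, half, constant_values=-np.inf)
    running_max = sliding_window_view(padded, window).max(axis=1)
    return np.flatnonzero(x == running_max)

peaks = moving_max_peaks([1, 3, 2, 5, 4, 0, 6, 1], window=3)
print(peaks)   # → [1 3 6]
```

When there are no ties, two selected points can never lie within half a window of each other, since each would have to be the larger of the two. That is why adjacent maxima drop out.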

And I've just applied it to the Newport, RI (NOAA 8452660) predicted hourly tide data circa 1930-2012 (again a small project for the USACE). NOAA in their predicted tide series uses a maximum frequency of one year, the data are definitely stationary (zero mean, zero slope, you know a bunch of tidal harmonics). I took the window size as N = 25, selected all matching pairs, and obtained an averaged period of 25.72 hours, not exactly 25 but then again there's the O1 harmonic sitting out there at 25.819 hr, so perhaps no real surprise there. Anyways, then I tried M = 24 hours (or rather 24 bins) since that seems rather obvious, but no go, distinct semi-diurnal distro. So then I tried M = 26 (or rather 26 bins) since that was very close to the average period of 25.72 hours, voilà, a uniform distro (Excel 2013 x64, not that it matters);

Mean = 1088
Median = 1092
Mode = 1095
Min = 1057
Max = 1113
Stdevp = 13.08551754
Skew = -0.580302101
Kurt = 0.051937353

I also have some thoughts on what at first glance looks like an elliptical distro that WE and you have generated, but perhaps later.

What would be really good is if WE would post his two million point time series (the RI tide time series I mentioned above has ~730,000 data points), I'm banned over there, so I don't want to do the asking.

As to your data set, N = 7, but I would need to get the data into two columns from the array format it is in now (rather rusty at both Excel (and Fortran) at the moment, so I don't remember how to convert tabular time series into a linear array).

Or I could send you the two spreadsheets, that I mentioned above. As usual YMMV.

@whut, April 30, 2014 at 4:18 AM
Nick,
First, do you find that Willis gets in way over his head on these matters? Or do you think he is intentionally deceptive?

On this matter, here is my take. I looked at the red noise time series he is using and it looks more like an unbounded random walk than the bounded, reversion-to-the-mean random walk that a red noise process should have as a characteristic.

One of the properties of a classical random walk is that it should act as a martingale (or gambler's ruin) process, which means that it will eventually walk to plus or minus infinity. This means that all states are equally populated and the AC has a spike at 0 only. The implication of this is that one would normally see the walker near an end-point as it makes its journey from its starting point to +/- infinity. In other words, it will be near an endpoint the longer it runs. That is the gambler's ruin outcome.

With that as a boundary condition, a red noise walker can be configured by the Ornstein-Uhlenbeck coefficients to assume a character of anything from a tightly bounded random walker that bounces between two states (like a random telegraph signal), to something that looks like a classic unbounded random walker. The issue is that if Willis chose weakly bound O-U coefficients, it will start to look like an unbounded walker, especially if he does not let it run long enough. That is the catch. He has a finite run on a weakly bound red noise walker, which means that it will not have visited all the states. If he did a histogram on a tightly-bound red noise walker, the profile would have been uniform. And that is what a temperature profile looks more like. Unless AGW is in effect, which means that there is a secular trend and of course it will be near an end-point.

I really do laugh at Willis for what he does in promoting FUD in climate science. He takes on this persona of an everyman working stiff who claims to have this great scientific intuition and then passes on his "discoveries" and the gullible fools at WUWT lap it up.

----

BTW, you also may be on to something with the day-of-the-week analysis. There is an interesting stat study on heavy weather vs workweek days and it looks pretty conclusive -- yet I wonder if they have considered your "wrap-around" effect?
http://www.agu.org/pubs/crossref/2008/2007JD008623.shtml

Nick Stokes, May 5, 2014 at 9:01 PM
Well, he does convey the impression that his methods are special, and they aren't. But they are sound enough.

@whut, May 5, 2014 at 10:39 PM
Nick,
The Fourier analysis ain't going to work on something as complex as decoding ENSO.
These are not stationary waveforms, nor are they composed of simple sines and cosines.

We are going to have to make an all-out effort to educate people on how we can unroll the physics behind ENSO and climate variability in general:
http://contextearth.com/2014/05/02/the-soim-substantiating-the-chandler-wobble-and-tidal-connection-to-enso/

Willis isn't the go-to guy on this, you are, Nick. The WUWT crowd is terrified of your knowledge and breadth, and that's why you get beaten down .. and then with no shame, the WUWTers turn around and use artifacts from your server. Hilarious watching them get spun around like that.

Well played.

Carrick, May 7, 2014 at 3:03 AM
You don't need stationary waveforms before you can use Fourier analysis. There is in general a one-to-one correspondence between linear time-domain manipulation and frequency domain ones, a statement that does not need stationarity before it is valid.

So what you said is just nonsense.

You can use Fourier analysis (in the form of spectral periodograms) to study ENSO without any problems. In fact, it is quite conventional and useful to do so in climate studies.

Here is a usage by the IPCC AR4 for example.

@whut, May 7, 2014 at 7:46 AM
Sure Carrick, go ahead and use Fourier transforms on every time-series problem. No skin off my nose to watch you struggle. :)

Probably as entertaining as watching wonderin willis make a fool of himself.

Carrick, May 7, 2014 at 3:40 PM
Actually, I have software that performs real-time linear (and nonlinear) filtering of signals. I happen to use the DFT to implement this because of its relative efficiency. Even things like log-frequency sweeps can be efficiently and accurately computed in the frequency domain using a DFT. I've tested and compared it against time-domain code and it works. [tm] It happens that the DFT code has (nearly) fixed computational costs, for a broad variety of filter designs. And since we can benchmark the DFT code, we know up front how much overhead it's going to use (for real-time data filtering this is important to know).

But that's neither here nor there. You claimed that "The Fourier analysis ain't going to work on something as complex as decoding ENSO." Not only is this false, Fourier analysis is commonly used by people in climate science to study ENSO.

What Willard is doing is correct, even if he doesn't know the right name for what he's doing. The formula for the DFT is after all conventionally derived using an OLS formulation.

So I'm not sure what you are actually finding entertaining here, but hubris is becoming of nobody.

Carrick, May 9, 2014 at 2:56 AM
This is a repeat of a comment I made that apparently went into the bit bucket. It is paraphrased because I didn't save the other one before publishing.

I am not arguing that you should only use Fourier analysis, just correcting your statement that it can not be used.

Fourier analysis is not an ideal method for the study of transient phenomena such as the known wintertime phase entrainment of the ENSO. Time-domain based methods are better for that IMO.

Willard ≠ Willis. But typos aren't a sign of losing it, so don't be afraid.

Anyway, I maintain that I haven't lost it because I never had it.

Nick Stokes, May 9, 2014 at 2:44 PM
"Willard? Who's Willard?"
More than you might expect.

@whut, May 10, 2014 at 4:02 PM
I know Willard -- the Climate Ball guy.

"Fourier analysis is not an ideal method for the study of transient phenomena such as the known wintertime phase entrainment of the ENSO. Time-domain based methods are better for that IMO."

The SOI of ENSO is an almost ideal dynamic sloshing mechanism nicely modeled as a periodic perturbation applied to the wave equation (the Mathieu equation). The wintertime phase entrainment is barely evident in contrast to the 6 to 6.5 year periodic forcing which creates the peaks and valleys.

@whut, May 10, 2014 at 10:17 PM
Nick says that with Willis that "The indignation is a problem."

I ventured over to WUWT to stake a claim on what Willis wrote recently:

http://wattsupwiththat.com/2014/05/08/cycling-in-central-england/#comment-1632760
"
"BTW… Peeking at the code, it looks like Willis is fitting a sine wave using linear regression. Kewl ! "

I was quite proud when I dreamed that one up. Before that I was optimizing a sine wave, a very slow process. Instead, I just created a sine wave and a cosine wave, and used linear regression to give the optimum results using the two waves as the independent variable and the data as the dependent variable. Then I could take the peak-to-peak amplitude of the resulting fitted sine wave.
"
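
The sine-fitting approach quoted above amounts to ordinary least squares with a sine and a cosine of known period as regressors. Here is a minimal sketch of that idea, not the code from the WUWT post: the function name `fit_sine`, the added constant term and the synthetic test data are all assumptions for illustration.

```python
import numpy as np

def fit_sine(t, y, period):
    """Least-squares fit of y ~ c + a*sin(w*t) + b*cos(w*t) for a known period."""
    t = np.asarray(t, dtype=float)
    w = 2 * np.pi / period
    design = np.column_stack([np.ones_like(t), np.sin(w * t), np.cos(w * t)])
    (c, a, b), *_ = np.linalg.lstsq(design, y, rcond=None)
    amplitude = np.hypot(a, b)       # half the peak-to-peak range
    phase = np.arctan2(b, a)         # y ~ c + amplitude * sin(w*t + phase)
    return c, amplitude, phase

# Synthetic data: offset 2, amplitude 1.5, phase 0.4, period 24, plus noise.
rng = np.random.default_rng(0)
t = np.arange(240.0)
y = 2 + 1.5 * np.sin(2 * np.pi * t / 24 + 0.4) + 0.2 * rng.standard_normal(t.size)

c, amplitude, phase = fit_sine(t, y, period=24)
print(f"offset {c:.2f}, peak-to-peak {2 * amplitude:.2f}, phase {phase:.2f}")
```

Because the model is linear in the two coefficients, no iterative optimisation is needed; the amplitude and phase follow from the fitted sine and cosine weights. Repeating the fit over a set of periods gives something very like a periodogram.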

So I responded with " Not too original, I am afraid. " and a description of how my CSALT model works.

And of course, Willis responded with:

"
Oh, piss off, you nasty little man. Your jealousy is overwhelming your good sense. I came up with the idea myself, and I was proud of it. So sue me. Was I the first man to come up with the idea? Of course not … but I did come up with it independently myself. You are great at trying to tear down something someone else has built, but you never seem to build anything yourself … funny how that works.

w.
"

Willis tends to do that. He latches on to an idea and claims it is his own while claiming that he is self-taught. The idea of applying the Quenouille significance measure is something that you have been doing Nick, and I am certain Willis picked it up from your discussions. Funny to watch that behavior in the WUWT thread.
