moyhu: Better adjusted global temperatures for ENSO, Solar and volcanoes

Wednesday, June 19, 2013

This is a follow-up to this earlier post; please see it for details. There I ran into difficulty using the R function nlm() to estimate both the regression parameters and a separate delay coefficient for each of the exogenous variables Vol, Sol and ENSO. The solar variable, which interacts most weakly, was apt to be assigned a zero or negative delay coefficient. That turns the lagged variable into a constant or exponentially rising secular process, which the fit then exploited.

I could avoid this by constraining that parameter. But I think it is better to do as others have done and use a common delay for all three. There is reasonable physical justification for that, and it reduces overfitting.
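To make the role of the delay coefficient concrete, here is a minimal Python sketch (not the R code in the zip file). It assumes monthly data and an exponential kernel exp(-d * lag); the function name `lagged` and the pulse forcing are hypothetical and chosen only for illustration. With d > 0 a one-month pulse dies away, with d = 0 it persists as a constant, and with d < 0 it grows exponentially, which is the kind of spurious secular term described above.

```python
import math

def lagged(forcing, d):
    """Exponentially lagged copy of `forcing` with delay coefficient `d`.

    Uses the recursion r[k] = exp(-d) * r[k-1] + f[k], i.e. a convolution
    of the forcing with the kernel exp(-d * lag). This kernel is an
    assumption for illustration, not taken from the post's R code.
    """
    decay = math.exp(-d)
    out, r = [], 0.0
    for f in forcing:
        r = decay * r + f
        out.append(r)
    return out

# A single one-month pulse followed by silence (hypothetical forcing).
pulse = [1.0] + [0.0] * 59

for d in (0.1, 0.0, -0.05):
    resp = lagged(pulse, d)
    print(f"d={d:+.2f}: response after 59 months = {resp[-1]:.4f}")
```

Using one common positive d for Vol, Sol and ENSO removes the freedom for a weak regressor to drift into the d <= 0 regime.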

The result is a much more stable trend pattern across the time intervals and data sets. The trends since 1997 are now mostly between 0.65 and 1.325 °C/century. This might still be seen as a slowdown, but surely a minor one. Oddly, the only exception is the case studied by SteveF: HADCRUT4 with a linear trend from 1950. The trend I got there was 0.117 °C/century, I think quite similar to his, as was the decay coefficient at 0.026 (cf. his 0.031).

I'll show below the revised table and images.

A zip file of R code and data is here.

Results

Here is the table. The results look better in several ways:

The coefficients are reasonably comparable across cases
Adding a quadratic term now always reduces the sum of squares
The post-1997 trends are fairly uniform

I have not normalised the units, so the actual magnitudes of the regression coefficients are not easy to interpret. I'd still discount the 1979 quadratic, although no problems are obvious.

Data	Start	Trend	1	Vol	Sol	ENSO	t	t^2	Delay	SS
HADCRUT4	1950	0.117	2e-04	-8.39135	-3e-05	0.42995	0.25141	NA	0.02584	10.079
HADCRUT4	1950	1.024	-0.07304	-2.27873	0.00048	0.14844	0.30742	0.21653	0.12729	8.254
HADCRUT4	1979	1.086	-0.01689	-2.72106	0.00069	0.13331	0.50485	NA	0.11706	3.909
HADCRUT4	1979	1.078	-0.00963	-2.82278	0.00069	0.13478	0.50155	-0.07812	0.11325	3.893
GISS	1950	0.787	-0.00116	-6.00112	5e-05	0.24419	0.34476	NA	0.04173	10.441
GISS	1950	1.246	-0.04836	-2.70426	0.00042	0.1288	0.37122	0.14273	0.11017	9.626
GISS	1979	1.325	-0.01623	-3.28644	0.00065	0.1334	0.49335	NA	0.09649	5.086
GISS	1979	1.315	-0.01142	-3.38601	0.00064	0.13513	0.48994	-0.05241	0.09342	5.079
NOAA	1950	0.65	-0.00121	-4.42051	0.00017	0.19036	0.33218	NA	0.06223	8.103
NOAA	1950	0.877	-0.04486	-2.39086	0.00042	0.13126	0.34807	0.13221	0.118	7.351
NOAA	1979	0.929	-0.01586	-2.81475	0.00059	0.12721	0.47137	NA	0.10492	3.581
NOAA	1979	0.913	-0.00629	-2.99252	0.00058	0.13034	0.46551	-0.10409	0.09849	3.554
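Two of the claims above can be checked directly from the table. The following Python sketch copies the Start, Trend and SS columns by hand (the flag for the quadratic term is inferred from whether the t^2 column is NA) and compares each linear fit with its quadratic partner.

```python
# Rows copied from the table: (data set, start year, trend, quadratic term fitted?, SS)
rows = [
    ("HADCRUT4", 1950, 0.117, False, 10.079),
    ("HADCRUT4", 1950, 1.024, True, 8.254),
    ("HADCRUT4", 1979, 1.086, False, 3.909),
    ("HADCRUT4", 1979, 1.078, True, 3.893),
    ("GISS", 1950, 0.787, False, 10.441),
    ("GISS", 1950, 1.246, True, 9.626),
    ("GISS", 1979, 1.325, False, 5.086),
    ("GISS", 1979, 1.315, True, 5.079),
    ("NOAA", 1950, 0.65, False, 8.103),
    ("NOAA", 1950, 0.877, True, 7.351),
    ("NOAA", 1979, 0.929, False, 3.581),
    ("NOAA", 1979, 0.913, True, 3.554),
]

# Pair each linear fit with the quadratic fit for the same data set and start year.
linear = {(d, s): ss for d, s, _, quad, ss in rows if not quad}
quadratic = {(d, s): ss for d, s, _, quad, ss in rows if quad}
reduction = {key: linear[key] - quadratic[key] for key in linear}
for (data, start), drop in reduction.items():
    print(f"{data} {start}: SS falls by {drop:.3f} with the t^2 term")

# Spread of the Trend column, with the single low outlier shown separately.
trends = sorted(trend for _, _, trend, _, _ in rows)
print("lowest trend:", trends[0], "| remaining range:", trends[1], "to", trends[-1])
```

The reductions for the 1979 fits are small, which is consistent with discounting the 1979 quadratic.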

Images

The images may be scanned in the viewer below. There are 24, and you can flip through them using the top buttons. You can also subselect using the selection boxes. For example, if you choose GISS, you will then cycle through just the 8 GISS plots. If you also ask for plot type components, you will cycle through the four component plots, and so on.

[Interactive image viewer with selection boxes for Data, Start Year, Regressor and Plot type.]

SteveF has had difficulty getting comments through the system - I'm trying to find out why. Anyway, he sent by email these comments, and I'll just follow with a few points I made in reply - hope we can get comments working for him soon:

SteveF says:

I think I may have identified why the influence of the solar cycle increases in the 1979 to present analyses compared to the 1950 to present analyses: there is a coincidental congruence between the 1983 and 1991 volcanoes and the peak (or near peak) of the solar cycle. That is, the downward part of the solar cycle is aliased with the volcanic influence. In the longer series, this influence is diluted in the regression, though it is certainly still influencing the results. The 1964 to 1970 eruptions, while weaker, do not coincide with the peak of the solar cycle. I have tried several things to straighten this out, so far without a lot of success. I will next try a regression from 1950 to 1975 to see if the solar cycle influence falls to almost nothing (or even negative!) as I suspect it will. Substituting a regression specified quadratic or cubic secular function.

I think a defensible argument is, absent a good physical rationale, that the best approach is to simply sum the two radiative terms (both with watts/M^2 units) and see what happens to the diagnosed "optimal lag". Any regression that reports a physically implausible lag constant for best fit seems to me very suspect. There have been multiple published reports with estimated lags for volcanoes; a tau value near 30-36 months for the decay of the response seems to be common, though the exact value depends weakly on assumed climate sensitivity value (I have independently verified this is correct). Perhaps the best thing is to just try a few lags in the credible range and see what the best fit regression constants turn out to be. I also note that very much higher solar than volcanic response (on a watt/M^2 basis) is exactly the opposite of what I would expect on physical grounds; the slower solar cycle response ought to be lower, on a degrees/watt/M^2 basis than the volcanic, because the 11 year solar cycle 'sees' slower responding (deeper) parts of the oceans more than the shorter volcanic forcing.
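The suggestion to sum the radiative terms and try a few lags can be sketched as follows. This is a Python illustration on synthetic, noise-free data, not SteveF's or Nick's actual procedure; the forcing shapes, the sensitivity of 0.3, the first-order lag in `one_box` and the candidate lags are all assumptions made for the example.

```python
import math

def one_box(forcing, tau):
    """Lag `forcing` with time constant `tau` (months): r += (f - r) / tau."""
    out, r = [], 0.0
    for f in forcing:
        r += (f - r) / tau
        out.append(r)
    return out

def ols(x, y):
    """Least-squares fit y ~ a + b*x; returns (a, b, sum of squared residuals)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    b = sxy / sxx
    a = my - b * mx
    ss = sum((yi - a - b * xi) ** 2 for xi, yi in zip(x, y))
    return a, b, ss

months = range(600)
# Hypothetical forcings in W/m^2: an 11-year solar cycle and two year-long eruptions.
solar = [0.1 * math.sin(2 * math.pi * k / 132) for k in months]
volcanic = [-3.0 if 200 <= k < 212 or 400 <= k < 412 else 0.0 for k in months]
total = [s + v for s, v in zip(solar, volcanic)]

# Synthetic "temperature": sensitivity 0.3 with a 30-month lag, no noise.
temp = [0.3 * r for r in one_box(total, 30)]

# Fix the lag at a few credible values and regress for each one.
results = {tau: ols(one_box(total, tau), temp) for tau in (24, 30, 36, 42)}
best_tau = min(results, key=lambda tau: results[tau][2])
for tau, (a, b, ss) in results.items():
    print(f"tau={tau:2d} months: coefficient={b:.3f}, SS={ss:.4f}")
print("best tau:", best_tau)
```

Scanning a short list of fixed lags keeps the regression linear at each step, so the optimiser has no chance to wander into an implausible lag.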

If one accepts a large discrepancy between the measured changes in solar intensity over the cycle and the size of the temperature response, then some kind of 'amplification' of the solar cycle must be responsible (eg. more low clouds at the minimum), and evidence for that seems lacking. Of course, even if one assumes the true solar cycle forcing (in watts/M^2) is far higher than the measured changes in solar intensity, that doesn't mean there should not be considerable lag in the temperature response.

Finally, I tried a linear secular trend starting in 1964 (to include the earlier volcanoes); the fit is improved compared to starting in 1950 and the discrepancies between model and Hadley global temperatures is much reduced. Perhaps you could try that as well.

Nick says:

I can well imagine that solar cycles might get tangled with volcanoes. My experience was that the solar cycle, being weak, could be pushed around a lot by the optimisation process - both the linear and the lag fitting. It had a strong tendency to drift into zero or negative lag, which nlm() took advantage of to create spurious fitting functions. For the same reasons, I don't take too much notice of the amplitude that emerges from regression. It could be just taking up a bit of the volcano signal.

My main concern at the moment is with the use of Nina34. I think it's a sensitive index, but also includes warming trend, and when you adjust with it, it takes out that part. I'm inclined now to switch to SOI, which may not be as good, but can't drift. A practical alternative is to detrend Nina34, but harder to justify, and the detrend would depend on the interval. I'd be happy to try a 1964 start. Another option is to use GHG forcing as one of the regressor functions.
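The point that a detrend depends on the interval can be illustrated with a small Python sketch. The index here is synthetic (a slow drift plus a 4-year oscillation), not real ENSO data, and the helper names `linear_fit` and `detrend` are hypothetical.

```python
import math

def linear_fit(y, start, stop):
    """Least-squares intercept and slope of y[start:stop] against the month index."""
    ks = range(start, stop)
    n = stop - start
    mk = sum(ks) / n
    my = sum(y[k] for k in ks) / n
    slope = (sum((k - mk) * (y[k] - my) for k in ks)
             / sum((k - mk) ** 2 for k in ks))
    return my - slope * mk, slope

def detrend(y, start, stop):
    """Subtract the linear fit over [start, stop) from that stretch of y."""
    a, b = linear_fit(y, start, stop)
    return [y[k] - (a + b * k) for k in range(start, stop)]

# Synthetic index: a small drift plus a 4-year oscillation (not real index data).
index = [0.002 * k + math.sin(2 * math.pi * k / 48) for k in range(600)]

_, slope_full = linear_fit(index, 0, 600)      # e.g. months counted from 1950
_, slope_late = linear_fit(index, 348, 600)    # e.g. months counted from 1979
print(f"fitted drift, full span: {slope_full:.5f} per month")
print(f"fitted drift, late span: {slope_late:.5f} per month")
```

The two fitted drifts differ because the oscillation contributes a different partial cycle to each window, so the amount removed by detrending changes with the start year.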

58 comments:

Greg Goodman, June 19, 2013 at 11:26 PM
Nick, looks more stable. Obviously having same delay makes physical sense if all these are simplistically viewed as equivalent radiative forcings.

I don't know if you're still watching Steve's thread on Lucia. I have been discussing the implications of the analytical solution to the linear model IDE with Paul_K and picked up a small but very important error.

The result confirms something that I have been pushing for a few years now, that this sort of fitting (at least for short-term series) should be fitted to dT/dt and NOT the time series T(t).

That is quite important obviously but I think it is rigorous in relation to the linear model and the solution to the differential equation.

Check it out and see if you think it makes sense.

Basically, it is dT/dt that is a power variable in terms of physical units. It is incorrect to try to fit T (which is energy not power) to a power forcing. The units just don't work.

If you look at the thread I explain why the short term response should be fit to dT/dt in terms of the equations too.

You're trying to fit apples to oranges.

Greg Goodman, June 19, 2013 at 11:28 PM
BTW, would you like to post some R code showing how you do the fit? I would not mind fiddling, but I can't find the two days it would need to decrypt the docs and arcane hieroglyphics needed to define a model to fit in R.

Anonymous, June 20, 2013 at 1:41 PM
Nick, I tried to load nlp.r and got two errors.

load ("nlp.r")
Error: bad restore file magic number (file may be corrupted) -- no data loaded
In addition: Warning message:
file 'nlp.r' has magic number '#Code'
Use of save versions prior to 2 is deprecated

I'm running: R version 2.15.1 (2012-06-22)

Also one of the files you zipped was not the data you imagined:
== hadcrut4.txt ==

An error occurred while processing your request
Reference...

Greg

Nick Stokes, June 21, 2013 at 4:47 AM
Greg,
It's finished the HADCRUT sequence, and then for some reason nlm() has failed to converge with GISS. It's right as far as it goes - the numbers printed are the SS. I've checked that it is the right data file. So I'm puzzled. Our R versions are the same.

Nick Stokes, June 21, 2013 at 6:03 AM
Greg,
I'm glad it worked - I'm still not clear why it failed, but anyway.

The line
y=yy[n,J]
takes the temp from the array. You can difference at this stage (add a line following):
y=diff(c(NA,y))
The initial NA is to keep the length of vector right; if it gives trouble, substitute 0.

Greg Goodman, June 22, 2013 at 7:57 PM
Where I'm hoping to go with all this is to combine the two, since the correct linear solution (as I'm discussing at Lucia's) includes both. A way to do the multivariate fit to both _may_ actually fit post 2000 without all the fudging.

To do it properly requires another method, since the ratio between the in-phase response that you are fitting already and the dT/dt response I'm trying to do is not fixed. The ratio is a function of frequency.

However, if we "filter" the combined response and simplify things we can say LF dominates the in-phase and HF dominates dT/dt

ie we need to fit c1*T(t)+c2*(dT/dt)

instead of the more correct 1/f weighting we approximate an average for each frequency band.
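One reading of this proposal can be sketched in Python. It assumes a one-box model, tau * dT/dt + T = lam * F, so that regressing the forcing on T and dT/dt gives c1 = 1/lam and c2 = tau/lam. The model, the parameter values and the synthetic forcing are assumptions for illustration and are not taken from the thread at Lucia's.

```python
import math

def fit_two(x1, x2, y):
    """Least-squares y ~ c1*x1 + c2*x2 (no intercept) via the 2x2 normal equations."""
    s11 = sum(a * a for a in x1)
    s22 = sum(b * b for b in x2)
    s12 = sum(a * b for a, b in zip(x1, x2))
    s1y = sum(a * c for a, c in zip(x1, y))
    s2y = sum(b * c for b, c in zip(x2, y))
    det = s11 * s22 - s12 * s12
    c1 = (s1y * s22 - s2y * s12) / det
    c2 = (s2y * s11 - s1y * s12) / det
    return c1, c2

# Hypothetical values: 3.5-year time constant (in months) and sensitivity in K per W/m^2.
tau_true, lam_true = 3.5 * 12, 0.5
n = 720
# Synthetic forcing with a 9-year and a 60-year component.
forcing = [math.sin(2 * math.pi * k / 108) + 0.5 * math.sin(2 * math.pi * k / 720)
           for k in range(n)]

# Forward-Euler integration of tau * dT/dt + T = lam * F with a one-month step.
temp = [0.0]
for k in range(n - 1):
    temp.append(temp[k] + (lam_true * forcing[k] - temp[k]) / tau_true)

# Forward difference, so the fit matches the integration scheme exactly.
dTdt = [temp[k + 1] - temp[k] for k in range(n - 1)]
c1, c2 = fit_two(temp[:-1], dTdt, forcing[:-1])
print(f"c1 = {c1:.4f} (1/lam), c2 = {c2:.4f} (tau/lam), tau = c2/c1 = {c2 / c1:.2f} months")
```

On real data the derivative term amplifies month-to-month noise, so the clean recovery seen here should not be expected without some smoothing.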

Now I think that should be possible with simple changes to what you have already done.

This should remove the need to "spread" the forcing terms and probably all the detrending as well. If there is a long term multi-century upward trend (and I think there is) that should be added as an explicit variable and not used as a pretext for removing variability from the time-series, much of which may be more properly attributable to the forcing variables we are trying to fit in the first place.

If I was capable of parsing R code I would have done this already. Unfortunately R is alien to me and I don't have the time to invest in that particularly steep learning curve.

I'm attacking this problem from a different angle using tools I already understand. Though I think adapting this method would be both simple to do and enlightening.

Greg Goodman, June 23, 2013 at 9:10 AM
Nick, I've tried to extract the essentials of this dT/dt argument. You're pretty good on the maths, I'd value your opinion. Especially about the value of the cross-over point between the two regimes.

http://climategrog.wordpress.com/?attachment_id=399

If there are time constants like 20 years in the temp response and my estimations are correct everything under 60 year periods will be dominated by dT/dt.

Nick Stokes, June 23, 2013 at 10:01 PM
"That implies a implicit assumption that the response to the forcing is in phase with the forcing. It is not."
It is not. The forcing is lagged.

Greg Goodman, June 24, 2013 at 5:33 PM
Having identified tau as being not just some arbitrary "lag" fitting parameter to "smooth" the input, but the time constant of the climate system, leads to an important implication of your results that you have missed.

Your regression fitted values of tau range between 0.25 and 0.5 whereas tau of climate is reckoned to be between 3 and 4.5 years.

That would seem to have two interpretations.

1. Current estimations of climate and model tau are off by an order of magnitude in which case you need to discuss the implications for CS which is inherently tied to tau.

2. The fit produced by this method is spurious. If tau is off by an order of magnitude what credibility can we give to the fit results in general and specifically the idea that it accounts for the recent 'plateau'.

In view of the fact that there are none of these inputs which could account for the 60y cycle and your de-trending does not attempt to remove it either I know where I'd put my money.

Perhaps you should comment on that now that you know what tau is.

Nick Stokes, June 25, 2013 at 3:04 PM
Greg,
I think you've mistaken me for another commenter there.

Greg Goodman, June 26, 2013 at 11:04 PM
"... to estimate both the regression parameters and the delay coefficients for each of the exogenous variables"

Nick, I can't find the 'delay' in the R code nor the convolution. Are you doing this by hand outside the R regression or what ?

If it's in the R code you posted could you point me to it?

thx

Greg Goodman, June 27, 2013 at 2:15 AM
OK, I've run this with some test data. A synthetic time series that looks "something" like global temps.

Using that as the input "forcing", I've calculated the response by both addition of Fourier components and the Laplace convolution.

They are identical.

http://climategrog.wordpress.com/?attachment_id=402

I used adjacent months for the diff so there's a 0.5m offset; I left this slight offset so we can see both. Using [-1,0,+1] for the diff would make it centred.
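The half-month offset can be seen in a small Python sketch. It uses a quadratic test series, whose true slope at month t is 2*t, and compares the adjacent-month difference with the centred [-1, 0, +1] stencil; the function names are hypothetical.

```python
def adjacent_diff(y):
    """Difference of adjacent samples; estimates the slope midway between them."""
    return [y[k + 1] - y[k] for k in range(len(y) - 1)]

def centred_diff(y):
    """Three-point stencil [-1, 0, +1] / 2; estimates the slope at sample k."""
    return [(y[k + 1] - y[k - 1]) / 2 for k in range(1, len(y) - 1)]

t = list(range(10))            # months
y = [tk ** 2 for tk in t]      # true derivative is 2*t

print(adjacent_diff(y)[:4])    # slopes belonging to t = 0.5, 1.5, 2.5, 3.5
print(centred_diff(y)[:4])     # slopes belonging to t = 1, 2, 3, 4
```

The adjacent difference returns 2*(t + 0.5), the slope half a month later than the sample it is stored against, while the centred stencil returns 2*t at the sample itself.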

Here you can see that the in-phase component is dominated by the 60y cycle, the orthogonal by the 9y cycle, in line with what I said originally.

If I have time later I may try to prove this formally by showing the kernels are equivalent.

This way of viewing it gives more insight and may (finally) help you see the importance of the derivative term. It is interesting to see it as the weighted sum of forcing and its derivative , both passed through a low-pass filter.

The exp term is just a spin up but gives a result for the early period that is not possible with the Laplace convolution.

The input must be stationary for this to work accurately: any linear trends or periods longer than the data need to be removed. The Laplace method is fully generalised for all inputs.

I'm not sure I can see a reason to prefer this method for fitting but having shown they are equivalent it gives a lot more insight into the relationship between input and output.

That should provide some insights into both temperature and CO2.

Greg Goodman, June 27, 2013 at 6:49 PM
Not so "muddled" after all, then.

Could you reply to the query I posted above: I can't find the 'delay' in the R code nor the convolution. Are you doing this by hand outside the R regression or what ?

If it's in the R code you posted could you point me to it?

thanks.

Greg Goodman, June 27, 2013 at 8:52 PM
I just lashed up the 9/22/60y model, using equal weight for each, to have some test data. It turns out to be quite close to reproducing AMO when fed through a tau=3.5 linear feedback:

http://climategrog.wordpress.com/?attachment_id=403

It matches the 1974 dip better than most of what you did here and shows the supposed strong volcanic correlations may be spurious.

greg.
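
For reference, the effect of a tau = 3.5 year response on 9, 22 and 60 year cycles can be sketched in Python. This assumes that "linear feedback" means a first-order (one-box) relaxation, for which a sinusoid of angular frequency w is scaled by 1/sqrt(1 + (w*tau)^2) and delayed by atan(w*tau)/w. The function name `response` is hypothetical, and the sketch says nothing about how well the result matches AMO.

```python
import math

def response(period, tau):
    """Gain and time lag (same units as tau) of a first-order lag for a sinusoid."""
    w = 2 * math.pi / period
    gain = 1 / math.sqrt(1 + (w * tau) ** 2)
    lag = math.atan(w * tau) / w
    return gain, lag

tau = 3.5  # years
for period in (9, 22, 60):
    gain, lag = response(period, tau)
    print(f"{period:2d}-year cycle: gain {gain:.3f}, lag {lag:.2f} years")
```

With equal input weights, the shorter cycles are attenuated more strongly than the 60-year one, so the output is weighted toward the long period.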