<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Blogs on random temperature reconstruction</title>
	<atom:link href="http://landshape.org/enm/blogs-on-random-temperature-reconstruction/feed/" rel="self" type="application/rss+xml" />
	<link>http://landshape.org/enm/blogs-on-random-temperature-reconstruction/</link>
	<description>The Power of Numeracy</description>
	<lastBuildDate>Tue, 07 Feb 2012 17:57:00 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: davids</title>
		<link>http://landshape.org/enm/blogs-on-random-temperature-reconstruction/#comment-5685</link>
		<dc:creator>davids</dc:creator>
		<pubDate>Wed, 15 Mar 2006 20:20:19 +0000</pubDate>
		<guid isPermaLink="false">http://landshape.org/enm/?p=23#comment-5685</guid>
		<description>Hi Marty, Glad you like the discover button. It runs a search on the post title to bring up relevant pages, so saves having to type anything in to the box.  I agree with your sentiments on climate discussion to a degree.  So often I meet people who think that AGW skepticism is just a right wing think tank plot that I think that a lot more discussion is needed.  In general, I assume people don&#039;t care about what I say, but if I can crank out some results and a figure or two we can all learn something.  I just posted these comments out of interest.  What would be good is a system that found them automatically and listed them under the post.  Most blogs don&#039;t seem to send out trackback pings.

Absolutely agree that there are many more interesting issues such as detection limits of the methodology than ME vs R2.  Most interesting are the basic assumptions, of which stationary fractional gaussian noise models are fascinating. Under these assumptions I have run verification protocols of MBH98, which indicated apparent skill for random recons by R2 but not by RE, and the protocol of just drawing random points for verification, where both RE and R2 indicated skill (wrongly).  One could think up other ones, and try to identify protocols that did work in this case.  I don&#039;t know if I want to do that but it would be helpful for someone.</description>
		<content:encoded><![CDATA[<p>Hi Marty, Glad you like the discover button. It runs a search on the post title to bring up relevant pages, so saves having to type anything in to the box.  I agree with your sentiments on climate discussion to a degree.  So often I meet people who think that AGW skepticism is just a right wing think tank plot that I think that a lot more discussion is needed.  In general, I assume people don&#8217;t care about what I say, but if I can crank out some results and a figure or two we can all learn something.  I just posted these comments out of interest.  What would be good is a system that found them automatically and listed them under the post.  Most blogs don&#8217;t seem to send out trackback pings.</p>
<p>Absolutely agree that there are many more interesting issues such as detection limits of the methodology than ME vs R2.  Most interesting are the basic assumptions, of which stationary fractional gaussian noise models are fascinating. Under these assumptions I have run verification protocols of MBH98, which indicated apparent skill for random recons by R2 but not by RE, and the protocol of just drawing random points for verification, where both RE and R2 indicated skill (wrongly).  One could think up other ones, and try to identify protocols that did work in this case.  I don&#8217;t know if I want to do that but it would be helpful for someone.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: davids</title>
		<link>http://landshape.org/enm/blogs-on-random-temperature-reconstruction/#comment-6081</link>
		<dc:creator>davids</dc:creator>
		<pubDate>Wed, 15 Mar 2006 20:20:00 +0000</pubDate>
		<guid isPermaLink="false">http://landshape.org/enm/?p=23#comment-6081</guid>
		<description>Hi Marty, Glad you like the discover button. It runs a search on the post title to bring up relevant pages, so saves having to type anything in to the box.  I agree with your sentiments on climate discussion to a degree.  So often I meet people who think that AGW skepticism is just a right wing think tank plot that I think that a lot more discussion is needed.  In general, I assume people don&#039;t care about what I say, but if I can crank out some results and a figure or two we can all learn something.  I just posted these comments out of interest.  What would be good is a system that found them automatically and listed them under the post.  Most blogs don&#039;t seem to send out trackback pings.

Absolutely agree that there are many more interesting issues such as detection limits of the methodology than ME vs R2.  Most interesting are the basic assumptions, of which stationary fractional gaussian noise models are fascinating. Under these assumptions I have run verification protocols of MBH98, which indicated apparent skill for random recons by R2 but not by RE, and the protocol of just drawing random points for verification, where both RE and R2 indicated skill (wrongly).  One could think up other ones, and try to identify protocols that did work in this case.  I don&#039;t know if I want to do that but it would be helpful for someone.</description>
		<content:encoded><![CDATA[<p>Hi Marty, Glad you like the discover button. It runs a search on the post title to bring up relevant pages, so saves having to type anything in to the box.  I agree with your sentiments on climate discussion to a degree.  So often I meet people who think that AGW skepticism is just a right wing think tank plot that I think that a lot more discussion is needed.  In general, I assume people don&#8217;t care about what I say, but if I can crank out some results and a figure or two we can all learn something.  I just posted these comments out of interest.  What would be good is a system that found them automatically and listed them under the post.  Most blogs don&#8217;t seem to send out trackback pings.</p>
<p>Absolutely agree that there are many more interesting issues such as detection limits of the methodology than ME vs R2.  Most interesting are the basic assumptions, of which stationary fractional gaussian noise models are fascinating. Under these assumptions I have run verification protocols of MBH98, which indicated apparent skill for random recons by R2 but not by RE, and the protocol of just drawing random points for verification, where both RE and R2 indicated skill (wrongly).  One could think up other ones, and try to identify protocols that did work in this case.  I don&#8217;t know if I want to do that but it would be helpful for someone.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Marty Ringo</title>
		<link>http://landshape.org/enm/blogs-on-random-temperature-reconstruction/#comment-5684</link>
		<dc:creator>Marty Ringo</dc:creator>
		<pubDate>Wed, 15 Mar 2006 20:17:44 +0000</pubDate>
		<guid isPermaLink="false">http://landshape.org/enm/?p=23#comment-5684</guid>
		<description>David,

Thank you for the &quot;Discover more ...&quot; button.  By-and-large I am not very interested in blogs on climate because most of the postings are of the form &quot;You [the previous person posting] haven&#039;t taken X, Y or Z [fill in according to your own beliefs] and therefore you are at least wrong and more likely stupid.&quot;  However, having finished work for the day, I trolled through a couple pages of links largely confirming my prejudices.  Sorry, but just because someone agrees with you doesn&#039;t mean the agreement is sound.  Anyway on the fifth page I found the a link to Roger Pielke&#039;s (Sr.) site (Climate Science) and clicked over to &quot;Reflections of a Climate Skeptic by Henk Tennekes&quot;
http://climatesci.atmos.colostate.edu/2006/01/06/guest-weblog-reflections-of-a-climate-skeptic-henk-tennekes/

Prof. Tennekes&#039;s essay is a nice statement of his skepticism which seems to be based on Popperian criteria and Tennekes&#039;s own specialty, turbulence.  But I comment here not on Tennekes rather on Pielkeâ€™s response (#18) to the query â€œPlease define skillful.â€?  Pielke gives the American Meteorological Societyâ€™s definition of skill:
â€œSkill: A statistical evaluation of the accuracy of forecasts or the effectiveness of detection techniques.â€?

That is bland enough for just about anyone.  Pielke follows the bland definition with specific, but after reading those specifics I find I have no idea as to whether he would prefer an R2 (correlation squared), RE (reduction in error), CE (coefficient of efficiency) or maybe one of Prof. Theilâ€™s statistics.  Is this just non-technical commentary?  Probably, but there is an implicit message here for all of us who investigate the empirical properties of various estimation and prediction schemes:  the accuracy of a prediction [both forecasts and backcasts as in reconstructions] lies in the nature of the forecast.  An R2 of 0.2 and an RE of -0.1 might be pretty good if we are trying to predict a 4 standard deviations outside our sample.  The recent National Academy of Sciences conference on climate reconstruction asked the presenters if reconstructions for a 1000 years past were accurate within 0.5 degrees C.  Let me ask: is this not a more interesting test than an R2 or RE?

Of course, this test does not come with its algorithm, which of course means a debate on the application, but why not add this test to the mix.  There is the complication in modeling of the 1000 years.  I believe that is addressed by dual verification testing on the verification period and the â€œunobservedâ€? period.  That is simulate a 1000 years which should be broken into three parts:  1 to N1 -- the unobserved, N1+1 to N2 -- the verification, and N2+1  to N [the 1000] -- the estimation.  Then ask what skill (be measured in R2, RE, mean absolute deviations, or whatever) is needed in the verification period to assure a, say, 95% confidence of being within 0.5 degrees for a, again say, 30 year average 1000 years past?

And yes, I have started, but Iâ€™m lazy and probably wonâ€™t finish, plus I donâ€™t code as fast as you â€œkidsâ€? do.  So while your reconstruction bones simulation are good, I am suggesting a -- there are presumably many more -- way to add some muscle the model.</description>
		<content:encoded><![CDATA[<p>David,</p>
<p>Thank you for the &#8220;Discover more &#8230;&#8221; button.  By-and-large I am not very interested in blogs on climate because most of the postings are of the form &#8220;You [the previous person posting] haven&#8217;t taken X, Y or Z [fill in according to your own beliefs] and therefore you are at least wrong and more likely stupid.&#8221;  However, having finished work for the day, I trolled through a couple pages of links largely confirming my prejudices.  Sorry, but just because someone agrees with you doesn&#8217;t mean the agreement is sound.  Anyway on the fifth page I found the a link to Roger Pielke&#8217;s (Sr.) site (Climate Science) and clicked over to &#8220;Reflections of a Climate Skeptic by Henk Tennekes&#8221;<br />
<a href="http://climatesci.atmos.colostate.edu/2006/01/06/guest-weblog-reflections-of-a-climate-skeptic-henk-tennekes/" rel="nofollow">http://climatesci.atmos.colostate.edu/2006/01/06/guest-weblog-reflections-of-a-climate-skeptic-henk-tennekes/</a></p>
<p>Prof. Tennekes&#8217;s essay is a nice statement of his skepticism which seems to be based on Popperian criteria and Tennekes&#8217;s own specialty, turbulence.  But I comment here not on Tennekes rather on Pielkeâ€™s response (#18) to the query â€œPlease define skillful.â€?  Pielke gives the American Meteorological Societyâ€™s definition of skill:<br />
â€œSkill: A statistical evaluation of the accuracy of forecasts or the effectiveness of detection techniques.â€?</p>
<p>That is bland enough for just about anyone.  Pielke follows the bland definition with specific, but after reading those specifics I find I have no idea as to whether he would prefer an R2 (correlation squared), RE (reduction in error), CE (coefficient of efficiency) or maybe one of Prof. Theilâ€™s statistics.  Is this just non-technical commentary?  Probably, but there is an implicit message here for all of us who investigate the empirical properties of various estimation and prediction schemes:  the accuracy of a prediction [both forecasts and backcasts as in reconstructions] lies in the nature of the forecast.  An R2 of 0.2 and an RE of -0.1 might be pretty good if we are trying to predict a 4 standard deviations outside our sample.  The recent National Academy of Sciences conference on climate reconstruction asked the presenters if reconstructions for a 1000 years past were accurate within 0.5 degrees C.  Let me ask: is this not a more interesting test than an R2 or RE?</p>
<p>Of course, this test does not come with its algorithm, which of course means a debate on the application, but why not add this test to the mix.  There is the complication in modeling of the 1000 years.  I believe that is addressed by dual verification testing on the verification period and the â€œunobservedâ€? period.  That is simulate a 1000 years which should be broken into three parts:  1 to N1 &#8212; the unobserved, N1+1 to N2 &#8212; the verification, and N2+1  to N [the 1000] &#8212; the estimation.  Then ask what skill (be measured in R2, RE, mean absolute deviations, or whatever) is needed in the verification period to assure a, say, 95% confidence of being within 0.5 degrees for a, again say, 30 year average 1000 years past?</p>
<p>And yes, I have started, but Iâ€™m lazy and probably wonâ€™t finish, plus I donâ€™t code as fast as you â€œkidsâ€? do.  So while your reconstruction bones simulation are good, I am suggesting a &#8212; there are presumably many more &#8212; way to add some muscle the model.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Marty Ringo</title>
		<link>http://landshape.org/enm/blogs-on-random-temperature-reconstruction/#comment-6080</link>
		<dc:creator>Marty Ringo</dc:creator>
		<pubDate>Wed, 15 Mar 2006 20:17:00 +0000</pubDate>
		<guid isPermaLink="false">http://landshape.org/enm/?p=23#comment-6080</guid>
		<description>David,

Thank you for the &quot;Discover more ...&quot; button.  By-and-large I am not very interested in blogs on climate because most of the postings are of the form &quot;You [the previous person posting] haven&#039;t taken X, Y or Z [fill in according to your own beliefs] and therefore you are at least wrong and more likely stupid.&quot;  However, having finished work for the day, I trolled through a couple pages of links largely confirming my prejudices.  Sorry, but just because someone agrees with you doesn&#039;t mean the agreement is sound.  Anyway on the fifth page I found the a link to Roger Pielke&#039;s (Sr.) site (Climate Science) and clicked over to &quot;Reflections of a Climate Skeptic by Henk Tennekes&quot; 
http://climatesci.atmos.colostate.edu/2006/01/06/guest-weblog-reflections-of-a-climate-skeptic-henk-tennekes/

Prof. Tennekes&#039;s essay is a nice statement of his skepticism which seems to be based on Popperian criteria and Tennekes&#039;s own specialty, turbulence.  But I comment here not on Tennekes rather on Pielke’s response (#18) to the query “Please define skillful.”  Pielke gives the American Meteorological Society’s definition of skill: 
“Skill: A statistical evaluation of the accuracy of forecasts or the effectiveness of detection techniques.”

That is bland enough for just about anyone.  Pielke follows the bland definition with specific, but after reading those specifics I find I have no idea as to whether he would prefer an R2 (correlation squared), RE (reduction in error), CE (coefficient of efficiency) or maybe one of Prof. Theil’s statistics.  Is this just non-technical commentary?  Probably, but there is an implicit message here for all of us who investigate the empirical properties of various estimation and prediction schemes:  the accuracy of a prediction [both forecasts and backcasts as in reconstructions] lies in the nature of the forecast.  An R2 of 0.2 and an RE of -0.1 might be pretty good if we are trying to predict a 4 standard deviations outside our sample.  The recent National Academy of Sciences conference on climate reconstruction asked the presenters if reconstructions for a 1000 years past were accurate within 0.5 degrees C.  Let me ask: is this not a more interesting test than an R2 or RE?  

Of course, this test does not come with its algorithm, which of course means a debate on the application, but why not add this test to the mix.  There is the complication in modeling of the 1000 years.  I believe that is addressed by dual verification testing on the verification period and the “unobserved” period.  That is simulate a 1000 years which should be broken into three parts:  1 to N1 -- the unobserved, N1+1 to N2 -- the verification, and N2+1  to N [the 1000] -- the estimation.  Then ask what skill (be measured in R2, RE, mean absolute deviations, or whatever) is needed in the verification period to assure a, say, 95% confidence of being within 0.5 degrees for a, again say, 30 year average 1000 years past?  

And yes, I have started, but I’m lazy and probably won’t finish, plus I don’t code as fast as you “kids” do.  So while your reconstruction bones simulation are good, I am suggesting a -- there are presumably many more -- way to add some muscle the model.</description>
		<content:encoded><![CDATA[<p>David,</p>
<p>Thank you for the &#8220;Discover more &#8230;&#8221; button.  By-and-large I am not very interested in blogs on climate because most of the postings are of the form &#8220;You [the previous person posting] haven&#8217;t taken X, Y or Z [fill in according to your own beliefs] and therefore you are at least wrong and more likely stupid.&#8221;  However, having finished work for the day, I trolled through a couple pages of links largely confirming my prejudices.  Sorry, but just because someone agrees with you doesn&#8217;t mean the agreement is sound.  Anyway on the fifth page I found the a link to Roger Pielke&#8217;s (Sr.) site (Climate Science) and clicked over to &#8220;Reflections of a Climate Skeptic by Henk Tennekes&#8221;<br />
<a href="http://climatesci.atmos.colostate.edu/2006/01/06/guest-weblog-reflections-of-a-climate-skeptic-henk-tennekes/" rel="nofollow">http://climatesci.atmos.colostate.edu/2006/01/06/guest-weblog-reflections-of-a-climate-skeptic-henk-tennekes/</a></p>
<p>Prof. Tennekes&#8217;s essay is a nice statement of his skepticism which seems to be based on Popperian criteria and Tennekes&#8217;s own specialty, turbulence.  But I comment here not on Tennekes rather on Pielke’s response (#18) to the query “Please define skillful.”  Pielke gives the American Meteorological Society’s definition of skill:<br />
“Skill: A statistical evaluation of the accuracy of forecasts or the effectiveness of detection techniques.”</p>
<p>That is bland enough for just about anyone.  Pielke follows the bland definition with specific, but after reading those specifics I find I have no idea as to whether he would prefer an R2 (correlation squared), RE (reduction in error), CE (coefficient of efficiency) or maybe one of Prof. Theil’s statistics.  Is this just non-technical commentary?  Probably, but there is an implicit message here for all of us who investigate the empirical properties of various estimation and prediction schemes:  the accuracy of a prediction [both forecasts and backcasts as in reconstructions] lies in the nature of the forecast.  An R2 of 0.2 and an RE of -0.1 might be pretty good if we are trying to predict a 4 standard deviations outside our sample.  The recent National Academy of Sciences conference on climate reconstruction asked the presenters if reconstructions for a 1000 years past were accurate within 0.5 degrees C.  Let me ask: is this not a more interesting test than an R2 or RE?  </p>
<p>Of course, this test does not come with its algorithm, which of course means a debate on the application, but why not add this test to the mix.  There is the complication in modeling of the 1000 years.  I believe that is addressed by dual verification testing on the verification period and the “unobserved” period.  That is simulate a 1000 years which should be broken into three parts:  1 to N1 &#8212; the unobserved, N1+1 to N2 &#8212; the verification, and N2+1  to N [the 1000] &#8212; the estimation.  Then ask what skill (be measured in R2, RE, mean absolute deviations, or whatever) is needed in the verification period to assure a, say, 95% confidence of being within 0.5 degrees for a, again say, 30 year average 1000 years past?  </p>
<p>And yes, I have started, but I’m lazy and probably won’t finish, plus I don’t code as fast as you “kids” do.  So while your reconstruction bones simulation are good, I am suggesting a &#8212; there are presumably many more &#8212; way to add some muscle the model.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

