[ExI] Statistical significance

Max More max at maxmore.com
Fri Apr 2 04:12:06 UTC 2010


spike: Have you read Taleb's Black Swan? He has interesting things to 
say about statistics and mathematical models.

Coincidentally, the following piece just appeared in my in-box from 
forecasting expert Scott Armstrong: "Does statistical significance 
help you make better forecasts?"

Research findings should help to improve your decision-making and to 
simplify your life: Abolish tests of statistical significance from 
your decision making. Armstrong and Green have not found any evidence 
that such tests improve decision making. Indeed, they seem to create 
confusion and harm decision making. As a result, they have recently 
changed one of their guidelines for forecasters to read:

13.29    Do not use measures of statistical significance to assess a 
forecasting method or model.

Description:  Even when correctly applied, significance tests are 
dangerous. Statistical significance tests calculate the probability, 
assuming the analyst's null hypothesis is true, that relationships 
apparent in a sample of data are the result of chance variations that 
arose in selecting the sample. The probability that is calculated is 
affected by the size of the sample and the choice of null hypothesis. 
With large samples, even small differences from what would be 
expected in the data if the null hypothesis were true will be 
"statistically significant." Choosing a different null hypothesis can 
change the conclusion. Statistical significance tests do not provide 
useful information on material significance or importance. Moreover, 
the tests are blind to common problems such as non-response error, 
and response error. The proper approach to analyzing and 
communicating findings from empirical studies is to (1) calculate and 
report effect sizes; (2) estimate the range within which the actual 
effect size is likely to lie by taking account of prior knowledge and 
all potential sources of error in measuring the effect; and (3) 
conduct replications, extensions, and meta-analyses.

Purpose:  To avoid the selection of invalid models or methods, and 
the rejection of valid ones.

Conditions:  There are no empirically demonstrated conditions on this 
principle. Statistical significance tests should not be used unless 
it can be shown that the measures provide a net benefit in the 
situation under consideration.

Strength of evidence: Strong logical support and non-experimental 
evidence. There are many examples showing how significance testing 
has harmed decision-making. Despite repeated appeals for evidence 
that statistical significance tests can improve decisions, none has 
been forthcoming. Tests of statistical significance run contrary to 
the proper purpose of statisticswhich is to help users make sense of 
data. Experimental studies are needed to identify the conditions, if 
any, under which tests of statistical significance can improve decision-making.

Source of evidence:
Armstrong, J. S. (2007). Significance tests harm progress in 
forecasting<http://marketing.wharton.upenn.edu/documents/research/StatSigIJF361.pdf>. 
International Journal of Forecasting, 23, 321-336, with commentary 
and a reply. Hauer, E. (2004). The harm done by tests of statistical 
significance<http://tinyurl.com/Hauer2004Harm>. Accident Analysis and 
Prevention, 36, 495-500. Hubbard, R. & Armstrong J. S. (2006). Why we 
don't really know what 'statistical significance' means: a major 
educational 
failure<http://marketing.wharton.upenn.edu/ideas/pdf/Armstrong/StatisticalSignificance.pdf>. 
Journal of Marketing Education, 28, 114-120 Hunter, J.E. & Schmidt, 
F. L. (1996). Cumulative research knowledge and social policy 
formulation: The critical role of 
meta-analysis<http://conium.org/~maccoun/PP279_Hunter.pdf>. 
Psychology, Public Policy, and Law, 2, 324-347. Ziliak, S. T. & 
McCloskey, D. N. (2008). The cult of statistical significance: How 
the standard error costs us jobs, justice, and 
lives<http://www.amazon.com/Cult-Statistical-Significance-Economics-Cognition/dp/0472050079>. 
Ann Arbor, MI: University of Michigan Press.

J. Scott Armstrong
Dept. of Marketing
The Wharton School
U. of Pennsylvania
Phila., PA 19104 
armstrong at wharton.upenn.edu<mailto:armstrong at wharton.upenn.edu>






More information about the extropy-chat mailing list