[ExI] Statistical significance
Max More
max at maxmore.com
Fri Apr 2 04:12:06 UTC 2010
spike: Have you read Taleb's Black Swan? He has interesting things to
say about statistics and mathematical models.
Coincidentally, the following piece just appeared in my in-box from
forecasting expert Scott Armstrong: "Does statistical significance
help you make better forecasts?"
Research findings should help to improve your decision-making and to
simplify your life: Abolish tests of statistical significance from
your decision making. Armstrong and Green have not found any evidence
that such tests improve decision making. Indeed, they seem to create
confusion and harm decision making. As a result, they have recently
changed one of their guidelines for forecasters to read:
13.29 Do not use measures of statistical significance to assess a
forecasting method or model.
Description: Even when correctly applied, significance tests are
dangerous. Statistical significance tests calculate the probability,
assuming the analyst's null hypothesis is true, that relationships
apparent in a sample of data are the result of chance variations that
arose in selecting the sample. The probability that is calculated is
affected by the size of the sample and the choice of null hypothesis.
With large samples, even small differences from what would be
expected in the data if the null hypothesis were true will be
"statistically significant." Choosing a different null hypothesis can
change the conclusion. Statistical significance tests do not provide
useful information on material significance or importance. Moreover,
the tests are blind to common problems such as non-response error,
and response error. The proper approach to analyzing and
communicating findings from empirical studies is to (1) calculate and
report effect sizes; (2) estimate the range within which the actual
effect size is likely to lie by taking account of prior knowledge and
all potential sources of error in measuring the effect; and (3)
conduct replications, extensions, and meta-analyses.
Purpose: To avoid the selection of invalid models or methods, and
the rejection of valid ones.
Conditions: There are no empirically demonstrated conditions on this
principle. Statistical significance tests should not be used unless
it can be shown that the measures provide a net benefit in the
situation under consideration.
Strength of evidence: Strong logical support and non-experimental
evidence. There are many examples showing how significance testing
has harmed decision-making. Despite repeated appeals for evidence
that statistical significance tests can improve decisions, none has
been forthcoming. Tests of statistical significance run contrary to
the proper purpose of statisticswhich is to help users make sense of
data. Experimental studies are needed to identify the conditions, if
any, under which tests of statistical significance can improve decision-making.
Source of evidence:
Armstrong, J. S. (2007). Significance tests harm progress in
forecasting<http://marketing.wharton.upenn.edu/documents/research/StatSigIJF361.pdf>.
International Journal of Forecasting, 23, 321-336, with commentary
and a reply. Hauer, E. (2004). The harm done by tests of statistical
significance<http://tinyurl.com/Hauer2004Harm>. Accident Analysis and
Prevention, 36, 495-500. Hubbard, R. & Armstrong J. S. (2006). Why we
don't really know what 'statistical significance' means: a major
educational
failure<http://marketing.wharton.upenn.edu/ideas/pdf/Armstrong/StatisticalSignificance.pdf>.
Journal of Marketing Education, 28, 114-120 Hunter, J.E. & Schmidt,
F. L. (1996). Cumulative research knowledge and social policy
formulation: The critical role of
meta-analysis<http://conium.org/~maccoun/PP279_Hunter.pdf>.
Psychology, Public Policy, and Law, 2, 324-347. Ziliak, S. T. &
McCloskey, D. N. (2008). The cult of statistical significance: How
the standard error costs us jobs, justice, and
lives<http://www.amazon.com/Cult-Statistical-Significance-Economics-Cognition/dp/0472050079>.
Ann Arbor, MI: University of Michigan Press.
J. Scott Armstrong
Dept. of Marketing
The Wharton School
U. of Pennsylvania
Phila., PA 19104
armstrong at wharton.upenn.edu<mailto:armstrong at wharton.upenn.edu>
More information about the extropy-chat
mailing list