Statistical significance and its critics: practicing damaging science, or damaging scientific practice?

Mayo, Deborah G.; Hand, David

Statistical significance and its critics: practicing damaging science, or damaging scientific practice?

dc.contributor.author	Mayo, Deborah G.	en
dc.contributor.author	Hand, David	en
dc.date.accessioned	2022-06-13T13:18:47Z	en
dc.date.available	2022-06-13T13:18:47Z	en
dc.date.issued	2022-05-12	en
dc.description.abstract	While the common procedure of statistical significance testing and its accompanying concept of p-values have long been surrounded by controversy, renewed concern has been triggered by the replication crisis in science. Many blame statistical significance tests themselves, and some regard them as sufficiently damaging to scientific practice as to warrant being abandoned. We take a contrary position, arguing that the central criticisms arise from misunderstanding and misusing the statistical tools, and that in fact the purported remedies themselves risk damaging science. We argue that banning the use of p-value thresholds in interpreting data does not diminish but rather exacerbates data-dredging and biasing selection effects. If an account cannot specify outcomes that will not be allowed to count as evidence for a claim-if all thresholds are abandoned-then there is no test of that claim. The contributions of this paper are: To explain the rival statistical philosophies underlying the ongoing controversy; To elucidate and reinterpret statistical significance tests, and explain how this reinterpretation ameliorates common misuses and misinterpretations; To argue why recent recommendations to replace, abandon, or retire statistical significance undermine a central function of statistics in science: to test whether observed patterns in the data are genuine or due to background variability.	en
dc.description.version	Published version	en
dc.format.mimetype	application/pdf	en
dc.identifier.doi	https://doi.org/10.1007/s11229-022-03692-0	en
dc.identifier.eissn	1573-0964	en
dc.identifier.issn	0039-7857	en
dc.identifier.issue	3	en
dc.identifier.other	220	en
dc.identifier.pmid	35578622	en
dc.identifier.uri	http://hdl.handle.net/10919/110755	en
dc.identifier.volume	200	en
dc.language.iso	en	en
dc.publisher	Springer	en
dc.rights	Creative Commons Attribution 4.0 International	en
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	en
dc.subject	Data-dredging	en
dc.subject	Error probabilities	en
dc.subject	Fisher	en
dc.subject	Neyman and Pearson	en
dc.subject	P-values	en
dc.subject	Statistical significance tests	en
dc.title	Statistical significance and its critics: practicing damaging science, or damaging scientific practice?	en
dc.title.serial	Synthese	en
dc.type	Article - Refereed	en
dc.type.dcmitype	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Mayo-Hand2022_Article_StatisticalSignificanceAndItsC.pdf
Size:: 482.91 KB
Format:: Adobe Portable Document Format
Description:: Published version

Download

Collections

Scholarly Works, Philosophy