
Statistical significance and its critics: practicing damaging science, or damaging scientific practice?

Field | Value | Language
dc.contributor.author | Mayo, Deborah G. | en
dc.contributor.author | Hand, David | en
dc.date.accessioned | 2022-06-13T13:18:47Z | en
dc.date.available | 2022-06-13T13:18:47Z | en
dc.date.issued | 2022-05-12 | en
dc.description.abstract | While the common procedure of statistical significance testing and its accompanying concept of p-values have long been surrounded by controversy, renewed concern has been triggered by the replication crisis in science. Many blame statistical significance tests themselves, and some regard them as sufficiently damaging to scientific practice as to warrant being abandoned. We take a contrary position, arguing that the central criticisms arise from misunderstanding and misusing the statistical tools, and that in fact the purported remedies themselves risk damaging science. We argue that banning the use of p-value thresholds in interpreting data does not diminish but rather exacerbates data-dredging and biasing selection effects. If an account cannot specify outcomes that will not be allowed to count as evidence for a claim (if all thresholds are abandoned), then there is no test of that claim. The contributions of this paper are: to explain the rival statistical philosophies underlying the ongoing controversy; to elucidate and reinterpret statistical significance tests, and explain how this reinterpretation ameliorates common misuses and misinterpretations; to argue why recent recommendations to replace, abandon, or retire statistical significance undermine a central function of statistics in science: to test whether observed patterns in the data are genuine or due to background variability. | en
dc.description.version | Published version | en
dc.format.mimetype | application/pdf | en
dc.identifier.doi | https://doi.org/10.1007/s11229-022-03692-0 | en
dc.identifier.eissn | 1573-0964 | en
dc.identifier.issn | 0039-7857 | en
dc.identifier.issue | 3 | en
dc.identifier.other | 220 | en
dc.identifier.pmid | 35578622 | en
dc.identifier.uri | http://hdl.handle.net/10919/110755 | en
dc.identifier.volume | 200 | en
dc.language.iso | en | en
dc.publisher | Springer | en
dc.rights | Creative Commons Attribution 4.0 International | en
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en
dc.subject | Data-dredging | en
dc.subject | Error probabilities | en
dc.subject | Fisher | en
dc.subject | Neyman and Pearson | en
dc.subject | P-values | en
dc.subject | Statistical significance tests | en
dc.title | Statistical significance and its critics: practicing damaging science, or damaging scientific practice? | en
dc.title.serial | Synthese | en
dc.type | Article - Refereed | en
dc.type.dcmitype | Text | en

Files

Original bundle
Name: Mayo-Hand2022_Article_StatisticalSignificanceAndItsC.pdf
Size: 482.91 KB
Format: Adobe Portable Document Format
Description: Published version