Revisiting the Large n (Sample Size) Problem: How to Avert Spurious Significance Results

Spanos, Aris

Revisiting the Large n (Sample Size) Problem: How to Avert Spurious Significance Results

dc.contributor.author	Spanos, Aris	en
dc.date.accessioned	2024-02-01T14:34:11Z	en
dc.date.available	2024-02-01T14:34:11Z	en
dc.date.issued	2023-12-05	en
dc.date.updated	2023-12-22T13:45:08Z	en
dc.description.abstract	Although large data sets are generally viewed as advantageous for their ability to provide more precise and reliable evidence, it is often overlooked that these benefits are contingent upon certain conditions being met. The primary condition is the approximate validity (statistical adequacy) of the probabilistic assumptions comprising the statistical model <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi mathvariant="script">M</mi><mi mathvariant="bold-italic">θ</mi></msub><mrow><mo>(</mo><mi mathvariant="bold">x</mi><mo>)</mo></mrow></mrow></semantics></math></inline-formula> applied to the data. In the case of a statistically adequate <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi mathvariant="script">M</mi><mi mathvariant="bold-italic">θ</mi></msub><mrow><mo>(</mo><mi mathvariant="bold">x</mi><mo>)</mo></mrow></mrow></semantics></math></inline-formula> and a given significance level <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi>α</mi></semantics></math></inline-formula>, as <i>n</i> increases, the power of a test increases, and the <i>p</i>-value decreases due to the inherent trade-off between type I and type II error probabilities in frequentist testing. This trade-off raises concerns about the reliability of declaring ‘statistical significance’ based on conventional significance levels when <i>n</i> is exceptionally large. To address this issue, the author proposes that a principled approach, in the form of post-data severity (SEV) evaluation, be employed. The SEV evaluation represents a post-data error probability that converts unduly data-specific ‘accept/reject <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>H</mi><mn>0</mn></msub></semantics></math></inline-formula> results’ into evidence either supporting or contradicting inferential claims regarding the parameters of interest. This approach offers a more nuanced and robust perspective in navigating the challenges posed by the large <i>n</i> problem.	en
dc.description.version	Published version	en
dc.format.mimetype	application/pdf	en
dc.identifier.citation	Spanos, A. Revisiting the Large n (Sample Size) Problem: How to Avert Spurious Significance Results. Stats 2023, 6, 1323-1338.	en
dc.identifier.doi	https://doi.org/10.3390/stats6040081	en
dc.identifier.uri	https://hdl.handle.net/10919/117810	en
dc.language.iso	en	en
dc.publisher	MDPI	en
dc.rights	Creative Commons Attribution 4.0 International	en
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	en
dc.subject	large n problem	en
dc.subject	Neyman–Pearson testing	en
dc.subject	p-value	en
dc.subject	post-data severity evaluation	en
dc.subject	spurious statistical significance	en
dc.title	Revisiting the Large n (Sample Size) Problem: How to Avert Spurious Significance Results	en
dc.title.serial	Statistics	en
dc.type	Article - Refereed	en
dc.type.dcmitype	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: stats-06-00081-v2.pdf
Size:: 402.44 KB
Format:: Adobe Portable Document Format
Description:: Published version

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.5 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Journal Articles, Multidisciplinary Digital Publishing Institute (MDPI)
Scholarly Works, Economics