Revisiting the Large n (Sample Size) Problem: How to Avert Spurious Significance Results

Spanos, Aris

Date

2023-12-05

Authors

Spanos, Aris

Publisher

MDPI

Abstract

Although large data sets are generally viewed as advantageous for their ability to provide more precise and reliable evidence, it is often overlooked that these benefits are contingent upon certain conditions being met. The primary condition is the approximate validity (statistical adequacy) of the probabilistic assumptions comprising the statistical model $\mathcal{M}_{\theta}(\mathbf{x})$ applied to the data. In the case of a statistically adequate $\mathcal{M}_{\theta}(\mathbf{x})$ and a given significance level $\alpha$, as $n$ increases, the power of a test increases, and the p-value decreases due to the inherent trade-off between type I and type II error probabilities in frequentist testing. This trade-off raises concerns about the reliability of declaring ‘statistical significance’ based on conventional significance levels when $n$ is exceptionally large. To address this issue, the author proposes that a principled approach, in the form of post-data severity (SEV) evaluation, be employed. The SEV evaluation represents a post-data error probability that converts unduly data-specific ‘accept/reject $H_0$ results’ into evidence either supporting or contradicting inferential claims regarding the parameters of interest. This approach offers a more nuanced and robust perspective in navigating the challenges posed by the large $n$ problem.
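
As a rough illustration of the mechanism the abstract describes, the sketch below assumes a simple Normal model with known $\sigma$ and the one-sided hypotheses $H_0: \mu \le \mu_0$ vs. $H_1: \mu > \mu_0$. This setting, the function names, and the numbers are illustrative assumptions, not taken from the paper. It shows the p-value of a fixed observed discrepancy shrinking and the power rising as $n$ grows, and it evaluates the post-data severity of a claim $\mu > \mu_1$ after rejection using the textbook form $\Phi(\sqrt{n}(\bar{x} - \mu_1)/\sigma)$.

```python
from math import sqrt
from statistics import NormalDist

Z = NormalDist()  # standard Normal distribution


def p_value(xbar, mu0, sigma, n):
    """One-sided p-value for H0: mu <= mu0 (known sigma, assumed setting)."""
    d = sqrt(n) * (xbar - mu0) / sigma  # observed test statistic d(x0)
    return 1.0 - Z.cdf(d)


def power(mu1, mu0, sigma, n, alpha=0.05):
    """Probability of rejecting H0 at level alpha when the true mean is mu1."""
    c_alpha = Z.inv_cdf(1.0 - alpha)  # rejection threshold for d(X)
    return 1.0 - Z.cdf(c_alpha - sqrt(n) * (mu1 - mu0) / sigma)


def severity_after_reject(xbar, mu1, sigma, n):
    """Severity of the claim mu > mu1 after H0 was rejected:
    P(d(X) <= d(x0); mu = mu1) = Phi(sqrt(n) * (xbar - mu1) / sigma)."""
    return Z.cdf(sqrt(n) * (xbar - mu1) / sigma)


if __name__ == "__main__":
    # Hypothetical numbers: a tiny observed discrepancy of 0.02 from mu0 = 0.
    xbar, mu0, sigma = 0.02, 0.0, 1.0
    for n in (100, 10_000, 1_000_000):
        print(n, p_value(xbar, mu0, sigma, n), power(xbar, mu0, sigma, n))

    # With n = 10,000 the test rejects at alpha = 0.05, yet only very small
    # discrepancies from mu0 are warranted with high severity.
    for mu1 in (0.0, 0.01, 0.02, 0.05):
        print(mu1, severity_after_reject(xbar, mu1, sigma, 10_000))
```

Under these assumptions, the same observed discrepancy moves from non-significant to highly significant purely through $n$, while the severity evaluation indicates which discrepancies from $\mu_0$ the data actually warrant.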

Keywords

large n problem, Neyman–Pearson testing, p-value, post-data severity evaluation, spurious statistical significance

Citation

Spanos, A. Revisiting the Large n (Sample Size) Problem: How to Avert Spurious Significance Results. Stats 2023, 6, 1323-1338.

Persistent link

https://hdl.handle.net/10919/117810

Collections

Journal Articles, Multidisciplinary Digital Publishing Institute (MDPI)
Scholarly Works, Economics
