Impact of Ignoring Nested Data Structures on Ability Estimation

Shropshire, Kevin O'Neil

Files

Shropshire_KO_D_2014.pdf (1.19 MB)

Downloads: 540

Supporting documents (884.17 KB)

Downloads: 118

Date

2014-06-03

Authors

Shropshire, Kevin O'Neil

Publisher

Virginia Tech

Abstract

The literature is clear that intentional or unintentional clustering of data elements typically results in the inflation of the estimated standard error of fixed parameter estimates. This study is unique in that it examines the impact of multilevel data structures on subject ability estimates, which are random-effect predictions known as empirical Bayes estimates in the one-parameter IRT / Rasch model. The literature on the impact of complex survey design on latent trait models is mixed, and there is no established "best practice" for handling this situation. A simulation study was conducted to address two questions related to ability estimation. First, what impact does design-based clustering have on desirable statistical properties when estimating subject ability with the one-parameter IRT / Rasch model? Second, since empirical Bayes estimators have shrinkage properties, what impact does clustering of first-stage sampling units have on measurement validity: does the first-stage sampling unit affect the ability estimate, and if so, is this desirable and equitable?

Two models were fit to data simulated under a factorial experimental design covering various conditions. The first, a Rasch model formulated as an HGLM, ignores the sample design (incorrect model), while the second incorporates a first-stage sampling unit (correct model). Study findings generally showed that the two models were comparable with respect to desirable statistical properties under a majority of the replicated conditions; more measurement error in ability estimation was found when the intra-class correlation was high and the item pool was small. In practice this is the exception rather than the norm. However, the empirical Bayes estimates were found to depend on the first-stage sampling unit, raising the issue of equity and fairness in educational decision making. A real-world complex survey design with binary outcome data was also fit with both models. Analysis of these data supported the simulation results, leading to the conclusion that modeling binary Rasch data may come down to a policy tradeoff between desirable statistical properties and measurement validity.
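As a rough illustration of the kind of data such a simulation works with, the sketch below generates binary Rasch responses for subjects nested in first-stage sampling units. It is not the dissertation's actual design: the function name `simulate_clustered_rasch`, the choice to fix total ability variance at 1 and split it by the intra-class correlation, the evenly spaced item difficulties, and all default sizes are assumptions made only for this example.

```python
import numpy as np


def simulate_clustered_rasch(n_clusters=50, n_per_cluster=20, n_items=10,
                             icc=0.2, seed=0):
    """Simulate binary Rasch responses with abilities nested in clusters.

    Assumption of this sketch: total ability variance is 1, of which a
    share `icc` lies between clusters and `1 - icc` within clusters.
    """
    rng = np.random.default_rng(seed)

    # One random effect per cluster (first-stage sampling unit).
    cluster_effect = rng.normal(0.0, np.sqrt(icc), size=n_clusters)

    # Each subject belongs to exactly one cluster.
    cluster_id = np.repeat(np.arange(n_clusters), n_per_cluster)

    # Subject-level deviation around the cluster effect.
    person_effect = rng.normal(0.0, np.sqrt(1.0 - icc), size=cluster_id.size)
    theta = cluster_effect[cluster_id] + person_effect

    # Hypothetical item difficulties, evenly spaced on the logit scale.
    difficulty = np.linspace(-2.0, 2.0, n_items)

    # Rasch model: P(correct) = logistic(ability - difficulty).
    prob = 1.0 / (1.0 + np.exp(-(theta[:, None] - difficulty[None, :])))
    responses = (rng.random(prob.shape) < prob).astype(int)
    return responses, theta, cluster_id, difficulty
```

A model that ignores the sample design would treat all rows of `responses` as exchangeable subjects, while a model that incorporates the first-stage sampling unit would also use `cluster_id`. Varying `icc` and `n_items` mirrors the two conditions the abstract singles out, high intra-class correlation and a small item pool.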

Keywords

Complex survey designs, clustering, PSU, nested data, multilevel data, hierarchical data, two-level HGLM, three-level HGLM, Rasch, ability estimation

Persistent link

http://hdl.handle.net/10919/64197

Collections

Doctoral Dissertations
