Automatic scoring of speeded interpersonal assessment center exercises via machine learning: Initial psychometric evidence and practical guidelines

dc.contributor.author: Hickman, Louis
dc.contributor.author: Herde, Christoph N.
dc.contributor.author: Lievens, Filip
dc.contributor.author: Tay, Louis
dc.date.accessioned: 2023-02-24T14:20:35Z
dc.date.available: 2023-02-24T14:20:35Z
dc.date.issued: 2023-02
dc.date.updated: 2023-02-24T01:58:47Z
dc.description.abstract: Assessment center (AC) exercises such as role-plays have established themselves as valuable approaches for obtaining insights into interpersonal behavior, but they are often considered the “Rolls Royce” of personnel assessment due to their high costs. The observation and rating process comprises a substantial part of these costs. In an exploratory case study, we capitalize on recent advances in natural language processing (NLP) by developing NLP-based machine learning (ML) models to investigate the possibility of automatically scoring AC exercises. First, we compared the convergent-related validity and contamination with word count of ML scores based on models that used different NLP methods to operationalize verbal behavior. Second, for the model that maximized convergence while minimizing contamination with word count (i.e., a model that used both n-grams and Universal Sentence Encoder embeddings as predictors), we investigated the criterion-related validity of its scores. Third, we examined how the interrater reliability of the AC role-play scores affects ML model convergence. To do so, we applied seven NLP methods to 96 assessees' transcriptions and trained 10 sets of ML models across 18 speeded AC role-plays to automatically score assessee performance. Results suggest that ML scores recovered most of the original variance in the overall assessment ratings, and replacing one or more human assessors with ML scores maintained criterion-related validity. Additionally, ML models seemed to exhibit higher convergence when assessors consistently detected and utilized observable behaviors to make ratings (i.e., when interrater reliability was higher). Finally, we provide a step-by-step guide for practitioners seeking to implement ML scoring in ACs.
dc.description.version: Published version
dc.format.mimetype: application/pdf
dc.identifier.doi: https://doi.org/10.1111/ijsa.12418
dc.identifier.eissn: 1468-2389
dc.identifier.issn: 0965-075X
dc.identifier.orcid: Hickman, Louis [0000-0002-2752-7705]
dc.identifier.uri: http://hdl.handle.net/10919/113932
dc.language.iso: en
dc.publisher: Wiley
dc.rights: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.title: Automatic scoring of speeded interpersonal assessment center exercises via machine learning: Initial psychometric evidence and practical guidelines
dc.title.serial: International Journal of Selection and Assessment
dc.type: Article - Refereed
dc.type.dcmitype: Text
dc.type.other: Journal Article
pubs.organisational-group: /Virginia Tech
pubs.organisational-group: /Virginia Tech/Science
pubs.organisational-group: /Virginia Tech/Science/Psychology
pubs.organisational-group: /Virginia Tech/All T&R Faculty
pubs.organisational-group: /Virginia Tech/Science/COS T&R Faculty
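
The abstract above outlines the scoring pipeline: NLP features (including n-grams and Universal Sentence Encoder embeddings) are extracted from assessee transcriptions, ML models are trained to reproduce assessor ratings, and the resulting ML scores are evaluated for convergence with human ratings and contamination with word count. The sketch below is a minimal, hypothetical illustration of that pipeline, not the authors' code: it assumes scikit-learn for TF-IDF n-gram features and ridge regression as the learner (the paper compared several NLP methods and model types), TensorFlow Hub's Universal Sentence Encoder v4 for embeddings, and synthetic placeholder data in place of the study's 96 transcriptions and ratings.

```python
import numpy as np
import tensorflow_hub as hub
from scipy.stats import pearsonr
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_predict

# Placeholder data: the study used 96 assessees' transcriptions and the
# corresponding overall assessment ratings; both are synthetic stand-ins here.
rng = np.random.default_rng(0)
transcripts = ["placeholder transcript " + "response word " * (5 + i % 7) for i in range(96)]
ratings = rng.normal(loc=3.5, scale=0.5, size=96)

# n-gram features: TF-IDF over unigrams and bigrams (one of the NLP methods compared).
ngrams = TfidfVectorizer(ngram_range=(1, 2), min_df=2).fit_transform(transcripts).toarray()

# Universal Sentence Encoder: one 512-dimensional embedding per transcript.
use_encoder = hub.load("https://tfhub.dev/google/universal-sentence-encoder/4")
embeddings = use_encoder(transcripts).numpy()

# The model the abstract highlights used both n-grams and USE embeddings as predictors.
X = np.hstack([ngrams, embeddings])

# Ridge regression stands in for the learner; cross-validation yields an
# out-of-sample ML score for every assessee.
ml_scores = cross_val_predict(Ridge(alpha=1.0), X, ratings, cv=5)

# Convergence: how well the ML scores recover the human assessor ratings.
convergence, _ = pearsonr(ml_scores, ratings)

# Contamination: correlation of the ML scores with transcript word count.
word_counts = np.array([len(t.split()) for t in transcripts])
contamination, _ = pearsonr(ml_scores, word_counts)

print(f"convergence r = {convergence:.2f}; word-count contamination r = {contamination:.2f}")
```

Under this framing, the paper's model-selection criterion amounts to preferring the feature set that maximizes the first correlation while minimizing the second; with real transcripts and ratings in place of the placeholders, the same two statistics would drive that comparison.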

Files

Original bundle
Name: Hickman Herde Lievens Tay-2023-Automatic scoring of speeded interpersonal assessment center exercises via machine learning.pdf
Size: 1.38 MB
Format: Adobe Portable Document Format
Description: Published version