Enhancing big data in the social sciences with crowdsourcing: Data augmentation practices, techniques, and opportunities

dc.contributor.authorPorter, Nathaniel D.en
dc.contributor.authorVerdery, Ashton M.en
dc.contributor.authorGaddis, S. Michaelen
dc.contributor.departmentUniversity Librariesen
dc.date.accessioned2020-08-05T13:49:19Zen
dc.date.available2020-08-05T13:49:19Zen
dc.date.issued2020-06-10en
dc.description.abstractProponents of big data claim it will fuel a social research revolution, but skeptics challenge its reliability and decontextualization. The largest subset of big data is not designed for social research. Data augmentation-systematic assessment of measurement against known quantities and expansion of extant data with new information-is an important tool to maximize such data's validity and research value. Using trained research assistants or specialized algorithms are common approaches to augmentation but may not scale to big data or appease skeptics. We consider a third alternative: data augmentation with online crowdsourcing. Three empirical cases illustrate strengths and limitations of crowdsourcing, using Amazon Mechanical Turk to verify automated coding, link online databases, and gather data on online resources. Using these, we develop best practice guidelines and a reporting template to enhance reproducibility. Carefully designed, correctly applied, and rigorously documented crowdsourcing help address concerns about big data's usefulness for social research.en
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1371/journal.pone.0233154en
dc.identifier.issn1932-6203en
dc.identifier.issue6en
dc.identifier.othere0233154en
dc.identifier.pmid32520948en
dc.identifier.urihttp://hdl.handle.net/10919/99484en
dc.identifier.volume15en
dc.language.isoenen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.titleEnhancing big data in the social sciences with crowdsourcing: Data augmentation practices, techniques, and opportunitiesen
dc.title.serialPLoS Oneen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten
dc.type.dcmitypeStillImageen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
journal.pone.0233154.pdf
Size:
1.48 MB
Format:
Adobe Portable Document Format
Description: