The Open Science of Deep Learning: Three Case Studies

dc.contributor.authorMiller, Chrestonen
dc.contributor.authorLahne, Jacoben
dc.contributor.authorHamilton, Leahen
dc.date.accessioned2023-05-17T19:36:25Zen
dc.date.available2023-05-17T19:36:25Zen
dc.date.issued2022-03en
dc.date.updated2023-05-17T15:01:03Zen
dc.description.abstractThe open science movement, which prioritizes the open availability of research data and methods for public scrutiny and replication, includes practices like providing code implementing described algorithms in openly available publications. An area of research in which open-science principles may have particularly high impact is in deep learning, where researchers have developed a plethora of algorithms to solve complex and challenging problems, but where others may have difficulty in replicating results and applying these algorithms to other problems. In response, some researchers have begun to open up deep-learning research by making their code and resources available (e.g., datasets and/or pre-trained models) to the current and future research community. This presentation describes three case studies in deep learning where openly available resources differed and investigates the impact on the project and the outcome. This provides a venue for discussion on successes, lessons learned, and recommendations for future researchers facing similar situations, especially as deep learning increasingly becomes an important tool across disciplines. In the first case study, we present a workflow for text summarization, based on thousands of news articles. The outcome, generalizable to many situations, is a tool that can concisely report key facts and events from the articles. In the second case study, we describe the development of an Optical Character Recognition tool for archival research of physical typed notecards, in this case documenting an important, curated collection of thousands of items of clothing. In the last case study, we describe the workflow for applying common Natural Language Processing tools to a novel task: identifying descriptive language for whiskies from thousands of free-form text reviews. These case studies resulted in working solutions addressing their respective, challenging problems because of researchers embracing the concept of open science.en
dc.format.mimetypeimage/pngen
dc.identifier.orcidMiller, Chreston [0000-0003-4276-0537]en
dc.identifier.orcidLahne, Jacob [0000-0002-2344-1816]en
dc.identifier.urihttp://hdl.handle.net/10919/115093en
dc.language.isoenen
dc.relation.urihttps://osf.io/76a3b/en
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.titleThe Open Science of Deep Learning: Three Case Studiesen
dc.typePosteren
dc.type.dcmitypeTexten
pubs.organisational-group/Virginia Techen
pubs.organisational-group/Virginia Tech/Libraryen
pubs.organisational-group/Virginia Tech/Library/Research, Learning, and Informaticsen
pubs.organisational-group/Virginia Tech/Library/Research, Learning, and Informatics/Data Servicesen
pubs.organisational-group/Virginia Tech/Library/Research, Learning, and Informatics/Data Services/Informatics Lab & Data Services Projectsen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
RDAP_Summit_2022_CMiller_poster_final.png
Size:
2.22 MB
Format:
Portable Network Graphics
Description:
Poster