The Open Science of Deep Learning: Three Case Studies

Miller, Chreston; Lahne, Jacob; Hamilton, Leah

The Open Science of Deep Learning: Three Case Studies

dc.contributor.author	Miller, Chreston	en
dc.contributor.author	Lahne, Jacob	en
dc.contributor.author	Hamilton, Leah	en
dc.date.accessioned	2023-05-17T19:36:25Z	en
dc.date.available	2023-05-17T19:36:25Z	en
dc.date.issued	2022-03	en
dc.date.updated	2023-05-17T15:01:03Z	en
dc.description.abstract	The open science movement, which prioritizes the open availability of research data and methods for public scrutiny and replication, includes practices like providing code implementing described algorithms in openly available publications. An area of research in which open-science principles may have particularly high impact is in deep learning, where researchers have developed a plethora of algorithms to solve complex and challenging problems, but where others may have difficulty in replicating results and applying these algorithms to other problems. In response, some researchers have begun to open up deep-learning research by making their code and resources available (e.g., datasets and/or pre-trained models) to the current and future research community. This presentation describes three case studies in deep learning where openly available resources differed and investigates the impact on the project and the outcome. This provides a venue for discussion on successes, lessons learned, and recommendations for future researchers facing similar situations, especially as deep learning increasingly becomes an important tool across disciplines. In the first case study, we present a workflow for text summarization, based on thousands of news articles. The outcome, generalizable to many situations, is a tool that can concisely report key facts and events from the articles. In the second case study, we describe the development of an Optical Character Recognition tool for archival research of physical typed notecards, in this case documenting an important, curated collection of thousands of items of clothing. In the last case study, we describe the workflow for applying common Natural Language Processing tools to a novel task: identifying descriptive language for whiskies from thousands of free-form text reviews. These case studies resulted in working solutions addressing their respective, challenging problems because of researchers embracing the concept of open science.	en
dc.format.mimetype	image/png	en
dc.identifier.orcid	Miller, Chreston [0000-0003-4276-0537]	en
dc.identifier.orcid	Lahne, Jacob [0000-0002-2344-1816]	en
dc.identifier.uri	http://hdl.handle.net/10919/115093	en
dc.language.iso	en	en
dc.relation.uri	https://osf.io/76a3b/	en
dc.rights	In Copyright	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.title	The Open Science of Deep Learning: Three Case Studies	en
dc.type	Poster	en
dc.type.dcmitype	Text	en
pubs.organisational-group	/Virginia Tech	en
pubs.organisational-group	/Virginia Tech/Library	en
pubs.organisational-group	/Virginia Tech/Library/Research, Learning, and Informatics	en
pubs.organisational-group	/Virginia Tech/Library/Research, Learning, and Informatics/Data Services	en
pubs.organisational-group	/Virginia Tech/Library/Research, Learning, and Informatics/Data Services/Informatics Lab & Data Services Projects	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: RDAP_Summit_2022_CMiller_poster_final.png
Size:: 2.22 MB
Format:: Portable Network Graphics
Description:: Poster

Download

Collections

All Faculty Deposits
Scholarly Works, Food Science and Technology
Scholarly Works, University Libraries