Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning

dc.contributor.authorMiller, Chrestonen
dc.contributor.authorHamilton, Leahen
dc.contributor.authorLahne, Jacoben
dc.contributor.departmentUniversity Librariesen
dc.contributor.departmentFood Science and Technologyen
dc.date.accessioned2021-07-23T17:27:58Zen
dc.date.available2021-07-23T17:27:58Zen
dc.date.issued2021-07-14en
dc.date.updated2021-07-23T13:27:09Zen
dc.description.abstractThis paper is concerned with extracting relevant terms from a text corpus on whisk(e)y. “Relevant” terms are usually contextually defined in their domain of use. Arguably, every domain has a specialized vocabulary used for describing things. For example, the field of Sensory Science, a sub-field of Food Science, investigates human responses to food products and differentiates “descriptive” terms for flavors from “ordinary”, non-descriptive language. Within the field, descriptors are generated through Descriptive Analysis, a method wherein a human panel of experts tastes multiple food products and defines descriptors. This process is both time-consuming and expensive. However, one could leverage existing data to identify and build a flavor language automatically. For example, there are thousands of professional and semi-professional reviews of whisk(e)y published on the internet, providing abundant descriptors interspersed with non-descriptive language. The aim, then, is to be able to automatically identify descriptive terms in unstructured reviews for later use in product flavor characterization. We created two systems to perform this task. The first is an interactive visual tool that can be used to tag examples of descriptive terms from thousands of whisky reviews. This creates a training dataset that we use to perform transfer learning using GloVe word embeddings and a Long Short-Term Memory deep learning model architecture. The result is a model that can accurately identify descriptors within a corpus of whisky review texts with a train/test accuracy of 99% and precision, recall, and F1-scores of 0.99. We tested for overfitting by comparing the training and validation loss for divergence. Our results show that the language structure for descriptive terms can be programmatically learned.en
dc.description.versionPublished versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationMiller, C.; Hamilton, L.; Lahne, J. Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning. Foods 2021, 10, 1633.en
dc.identifier.doihttps://doi.org/10.3390/foods10071633en
dc.identifier.orcidMiller, Chreston [0000-0003-4276-0537]en
dc.identifier.orcidLahne, Jacob [0000-0002-2344-1816]en
dc.identifier.urihttp://hdl.handle.net/10919/104375en
dc.language.isoenen
dc.publisherMDPIen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.subjectnatural language processingen
dc.subjectdeep learningen
dc.subjectsensory scienceen
dc.subjectflavor lexiconen
dc.subjectlong short-term memoryen
dc.titleSensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learningen
dc.title.serialFoodsen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
foods-10-01633.pdf
Size:
3.96 MB
Format:
Adobe Portable Document Format
Description:
Published version
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description: