Hierarchy Aligned Commonality Through Prototypical Networks: Discovering Evolutionary Traits over Tree-of-Life

dc.contributor.author | Manogaran, Harish Babu | en
dc.contributor.committeechair | Abbott, Amos L. | en
dc.contributor.committeechair | Karpatne, Anuj | en
dc.contributor.committeemember | Jones, Creed Farris | en
dc.contributor.department | Electrical and Computer Engineering | en
dc.date.accessioned | 2024-10-12T08:00:12Z | en
dc.date.available | 2024-10-12T08:00:12Z | en
dc.date.issued | 2024-10-11 | en
dc.description.abstract | A grand challenge in biology is to discover evolutionary traits, which are features of organisms common to a group of species with a shared ancestor in the Tree of Life (also referred to as the phylogenetic tree). With the recent availability of large-scale image repositories in biology and advances in explainable machine learning (ML) such as ProtoPNet and other prototype-based methods, there is a tremendous opportunity to discover evolutionary traits directly from images in the form of a hierarchy of prototypes learned at internal nodes of the phylogenetic tree. However, current prototype-based methods are mostly designed to operate over a flat structure of classes and face several challenges in discovering hierarchical prototypes on a tree, including the problem of learning over-specific features at internal nodes of the tree. To overcome these challenges, we introduce the framework of Hierarchy aligned Commonality through Prototypical Networks (HComP-Net), which learns common features shared by all descendant species of an internal node and avoids learning over-specific prototypes. We empirically show that HComP-Net learns prototypes that are accurate, semantically consistent, and generalizable to unseen species in comparison to baselines. While we focus on the biological problem of discovering evolutionary traits, our work can be applied to any domain involving a hierarchy of classes. | en
dc.description.abstractgeneral | A phylogenetic tree (also called the tree of life) shows how different species or groups of living things are related to each other through evolution. Scientists use phylogenetic trees to trace the evolutionary history of species, helping them understand how life on Earth is connected and how different species have changed over time. Each branch of the tree represents a group of species that share a common ancestor, and the point where branches split shows when they began to evolve into different species. Although the species have evolved separately, they continue to share some traits due to their common ancestry in the phylogeny. Such traits are referred to as synapomorphies. In our work, we focus on identifying such traits from images in the form of prototypes (representative image patches) by incorporating knowledge of the phylogenetic tree. We learn prototypes at each internal node of the phylogenetic tree, such that the prototypes learned at each node represent the common traits shared among all the species under that node. By learning such prototypes, we can identify and localize the regions (or image patches) of an image that contain such common traits. | en
dc.description.degree | Master of Science | en
dc.format.medium | ETD | en
dc.identifier.other | vt_gsexam:41589 | en
dc.identifier.uri | https://hdl.handle.net/10919/121329 | en
dc.language.iso | en | en
dc.publisher | Virginia Tech | en
dc.rights | In Copyright | en
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en
dc.subject | Deep learning | en
dc.subject | Interpretable Machine Learning | en
dc.subject | Explainable AI | en
dc.subject | Prototype-based Neural Networks | en
dc.subject | Phylogeny | en
dc.subject | Evolutionary Biology | en
dc.title | Hierarchy Aligned Commonality Through Prototypical Networks: Discovering Evolutionary Traits over Tree-of-Life | en
dc.type | Thesis | en
thesis.degree.discipline | Computer Engineering | en
thesis.degree.grantor | Virginia Polytechnic Institute and State University | en
thesis.degree.level | masters | en
thesis.degree.name | Master of Science | en

Files

Original bundle
Name: Manogaran_H_T_2024.pdf
Size: 44.13 MB
Format: Adobe Portable Document Format
