Finding Succinct Representations For Clusters

dc.contributor.authorGupta, Aparnaen
dc.contributor.committeechairMarathe, Madhav Vishnuen
dc.contributor.committeememberVullikanti, Anil Kumar S.en
dc.contributor.committeememberSwarup, Samarthen
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2019-07-10T08:01:33Zen
dc.date.available2019-07-10T08:01:33Zen
dc.date.issued2019-07-09en
dc.description.abstractImproving the explainability of results from machine learning methods has become an important research goal. In this thesis, we have studied the problem of making clusters more interpretable using a recent approach by Davidson et al., and Sambaturu et al., based on succinct representations of clusters. Given a set of objects S, a partition of S (into clusters), and a universe T of descriptors such that each element in S is associated with a subset of descriptors, the goal is to find a representative set of descriptors for each cluster such that those sets are pairwise-disjoint and the total size of all the representatives is at most a given budget. Since this problem is NP-hard in general, Sambaturu et al. have developed a suite of approximation algorithms for the problem. We also show applications to explain clusters of genomic sequences that represent different threat levelsen
dc.description.abstractgeneralImproving the explainability of results from machine learning methods has become an important research goal. Clustering is a commonly used Machine Learning technique which is performed on a variety of datasets. In this thesis, we have studied the problem of making clusters more interpretable; and have tried to answer whether it is possible to explain clusters using a set of attributes which were not used while generating these clusters.en
dc.description.degreeMaster of Scienceen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:21065en
dc.identifier.urihttp://hdl.handle.net/10919/91388en
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectclusteringen
dc.subjectinteger programmingen
dc.titleFinding Succinct Representations For Clustersen
dc.typeThesisen
thesis.degree.disciplineComputer Science and Applicationsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.levelmastersen
thesis.degree.nameMaster of Scienceen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Gupta_A_T_2019.pdf
Size:
384.54 KB
Format:
Adobe Portable Document Format

Collections