Algorithms for Storytelling

dc.contributor.authorKumar, Deepten
dc.contributor.authorRamakrishnan, Narenen
dc.contributor.authorHelm, Richard F.en
dc.contributor.authorPotts, Malcolmen
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2013-06-19T14:36:35Zen
dc.date.available2013-06-19T14:36:35Zen
dc.date.issued2006en
dc.description.abstractWe formulate a new data mining problem called "storytelling" as a generalization of redescription mining. In traditional redescription mining, we are given a set of objects and a collection of subsets defined over these objects. The goal is to view the set system as a vocabulary and identify two expressions in this vocabulary that induce the same set of objects. Storytelling, on the other hand, aims to explicitly relate object sets that are disjoint (and hence, maximally dissimilar) by finding a chain of (approximate) redescriptions between the sets. This problem finds applications in bioinformatics, for instance, where the biologist is trying to relate a set of genes expressed in one experiment to another set, implicated in a different pathway. We outline an efficient storytelling implementation that embeds the CARTwheels redescription mining algorithm in an A* search procedure, using the former to supply next move operators on search branches to the latter. This approach is practical and effective for mining large datasets and, at the same time, exploits the structure of partitions imposed by the given vocabulary. Three application case studies are presented: a study of word overlaps in large English dictionaries, exploring connections between genesets in a bioinformatics dataset, and relating publications in the PubMed index of abstracts.en
dc.format.mimetypeapplication/pdfen
dc.identifierhttp://eprints.cs.vt.edu/archive/00000747/en
dc.identifier.sourceurlhttp://eprints.cs.vt.edu/archive/00000747/01/story.pdfen
dc.identifier.trnumberTR-06-09en
dc.identifier.urihttp://hdl.handle.net/10919/20220en
dc.language.isoenen
dc.publisherDepartment of Computer Science, Virginia Polytechnic Institute & State Universityen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectArtificial intelligenceen
dc.subjectBioinformaticsen
dc.subjectAlgorithmsen
dc.subjectData structuresen
dc.titleAlgorithms for Storytellingen
dc.typeTechnical reporten
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
story.pdf
Size:
521.25 KB
Format:
Adobe Portable Document Format