Capturing Truthiness: Mining Truth Tables in Binary Datasets

dc.contributor.authorOwens, Clifford Conleyen
dc.contributor.authorMurali, T. M.en
dc.contributor.authorRamakrishnan, Narenen
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2013-06-19T14:36:18Zen
dc.date.available2013-06-19T14:36:18Zen
dc.date.issued2007-02-01en
dc.description.abstractWe introduce a new data mining problem: mining truth tables in binary datasets. Given a matrix of objects and the properties they satisfy, a truth table identifies a subset of properties that exhibit maximal variability (and hence, complete independence) in occurrence patterns over the underlying objects. This problem is relevant in many domains, e.g., bioinformatics where we seek to identify and model independent components of combinatorial regulatory pathways, and in social/economic demographics where we desire to determine independent behavioral attributes of populations. Besides intrinsic interest in such patterns, we show how the problem of mining truth tables is dual to the problem of mining redescriptions, in that a set of properties involved in a truth table cannot participate in any possible redescription. This allows us to adapt our algorithm to the problem of mining redescriptions as well, by first identifying regions where redescriptions cannot happen, and then pursuing a divide and conquer strategy around these regions. Furthermore, our work suggests dual mining strategies where both classes of algorithms can be brought to bear upon either data mining task. We outline a family of levelwise approaches adapted to mining truth tables, algorithmic optimizations, and applications to bioinformatics and political datasets.en
dc.format.mimetypeapplication/pdfen
dc.identifierhttp://eprints.cs.vt.edu/archive/00000948/en
dc.identifier.sourceurlhttp://eprints.cs.vt.edu/archive/00000948/01/paper.pdfen
dc.identifier.trnumberTR-07-10en
dc.identifier.urihttp://hdl.handle.net/10919/19669en
dc.language.isoenen
dc.publisherDepartment of Computer Science, Virginia Polytechnic Institute & State Universityen
dc.relation.ispartofComputer Science Technical Reportsen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectAlgorithmsen
dc.subjectData structuresen
dc.titleCapturing Truthiness: Mining Truth Tables in Binary Datasetsen
dc.typeTechnical reporten
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
paper.pdf
Size:
294.03 KB
Format:
Adobe Portable Document Format