Clustering constrained by dependencies

Tadepalli, Satish; Ramakrishnan, Naren; Watson, Layne T.

Clustering constrained by dependencies

Files

ACMSurv09.pdf (2.08 MB)

Downloads: 207

TR Number

TR-09-12

Date

2009

Authors

Tadepalli, Satish

Ramakrishnan, Naren

Watson, Layne T.

Publisher

Department of Computer Science, Virginia Polytechnic Institute & State University

Abstract

Clustering is the unsupervised method of grouping data samples to form a partition of a given dataset. Such grouping is typically done based on homogeneity assumptions of clusters over an attribute space and hence the precise definition of the similarity metric affects the clusters inferred. In recent years, new formulations of clustering have emerged that posit indirect constraints on clustering, typically in terms of preserving dependencies between data samples and auxiliary variables. These formulations ﬁnd applications in bioinformatics, web mining, social network analysis, and many other domains. The purpose of this survey is to provide a gentle introduction to these formulations, their mathematical assumptions, and the contexts under which they are applicable.

Keywords

Artificial intelligence

Persistent link

http://hdl.handle.net/10919/20151

Collections

Computer Science Technical Reports

Full item page