Stratified Feature Sampling for Semi-Supervised Ensemble Clustering

dc.contributor.authorTian, Jialinen
dc.contributor.authorRen, Yazhouen
dc.contributor.authorCheng, Xiangen
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2019-11-13T15:15:26Zen
dc.date.available2019-11-13T15:15:26Zen
dc.date.issued2019en
dc.description.abstractEnsemble Clustering (EC), which seeks to generate a consensus clustering by integrating multiple base clusterings, has attracted increasing attentions. However, traditional EC methods typically have three main limitations: (1) High dimensional data present a huge challenge to ensemble clustering methods. (2) Most EC algorithms can not use prior information, e.g., pairwise constraints, to enhance the clustering performance. (3) Even in existing semi-supervised ensemble clustering methods, prior information is not sufficiently used, e.g., only used in generating base clusterings. To alleviate these problems, we propose Stratified Feature Sampling for Semi-Supervised Ensemble Clustering ((SFSEC)-E-3). Firstly, we develop a novel stratified feature sampling method, which can cope with high dimensional data, guarantee the diversity of base clusterings, and reduce the risk that some features are not selected at the same time. Secondly, semi-supervised clustering, i.e., constraint propagation, is applied to obtain base clusterings. Finally, we propose to utilize prior information in both the base clustering generating process and the consensus process, which guarantees that prior information is sufficiently used. We conduct a series of experiments on ten real-world data sets to demonstrate the effectiveness of the proposed model.en
dc.description.notesThis work was supported in part by the National Natural Science Foundation of China under Grant 61806043, Grant 61832001, and Grant 61872062, and in part by the Project funded by China Postdoctoral Science Foundation under Grant 2016M602674.en
dc.description.sponsorshipNational Natural Science Foundation of ChinaNational Natural Science Foundation of China [61806043, 61832001, 61872062]; China Postdoctoral Science FoundationChina Postdoctoral Science Foundation [2016M602674]en
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1109/ACCESS.2019.2939581en
dc.identifier.eissn2169-3536en
dc.identifier.urihttp://hdl.handle.net/10919/95532en
dc.identifier.volume7en
dc.language.isoenen
dc.publisherIEEEen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.subjectConstraint propagationen
dc.subjectensemble clusteringen
dc.subjecthigh dimensional dataen
dc.subjectsemi-supervised learningen
dc.subjectstratified feature samplingen
dc.titleStratified Feature Sampling for Semi-Supervised Ensemble Clusteringen
dc.title.serialIEEE Accessen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten
dc.type.dcmitypeStillImageen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
08825848.pdf
Size:
6.21 MB
Format:
Adobe Portable Document Format
Description: