meth-SemiCancer: a cancer subtype classification framework via semi-supervised learning utilizing DNA methylation profiles

dc.contributor.authorChoi, Joung M.en
dc.contributor.authorPark, Chaelinen
dc.contributor.authorChae, Heejoonen
dc.date.accessioned2023-05-01T14:35:03Zen
dc.date.available2023-05-01T14:35:03Zen
dc.date.issued2023-04-26en
dc.date.updated2023-04-30T03:12:29Zen
dc.description.abstractBackground Identification of the cancer subtype plays a crucial role to provide an accurate diagnosis and proper treatment to improve the clinical outcomes of patients. Recent studies have shown that DNA methylation is one of the key factors for tumorigenesis and tumor growth, where the DNA methylation signatures have the potential to be utilized as cancer subtype-specific markers. However, due to the high dimensionality and the low number of DNA methylome cancer samples with the subtype information, still, to date, a cancer subtype classification method utilizing DNA methylome datasets has not been proposed. Results In this paper, we present meth-SemiCancer, a semi-supervised cancer subtype classification framework based on DNA methylation profiles. The proposed model was first pre-trained based on the methylation datasets with the cancer subtype labels. After that, meth-SemiCancer generated the pseudo-subtypes for the cancer datasets without subtype information based on the model’s prediction. Finally, fine-tuning was performed utilizing both the labeled and unlabeled datasets. Conclusions From the performance comparison with the standard machine learning-based classifiers, meth-SemiCancer achieved the highest average F1-score and Matthews correlation coefficient, outperforming other methods. Fine-tuning the model with the unlabeled patient samples by providing the proper pseudo-subtypes, encouraged meth-SemiCancer to generalize better than the supervised neural network-based subtype classification method. meth-SemiCancer is publicly available at https://github.com/cbi-bioinfo/meth-SemiCancer.en
dc.description.versionPublished versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationBMC Bioinformatics. 2023 Apr 26;24(1):168en
dc.identifier.doihttps://doi.org/10.1186/s12859-023-05272-6en
dc.identifier.urihttp://hdl.handle.net/10919/114858en
dc.language.isoenen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.holderThe Author(s)en
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.titlemeth-SemiCancer: a cancer subtype classification framework via semi-supervised learning utilizing DNA methylation profilesen
dc.title.serialBMC Bioinformaticsen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
12859_2023_Article_5272.pdf
Size:
1.23 MB
Format:
Adobe Portable Document Format
Description:
Published version
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description: