CancerSubtyper: A Web-Based Deep Learning Platform for Cancer Subtyping Through DNA Methylation Data

dc.contributor.authorCheung, Yat Feien
dc.contributor.committeechairZhang, Liqingen
dc.contributor.committeememberLu, Chang Tienen
dc.contributor.committeememberMeng, Naen
dc.contributor.departmentComputer Science and#38; Applicationsen
dc.date.accessioned2025-09-09T08:01:43Zen
dc.date.available2025-09-09T08:01:43Zen
dc.date.issued2025-09-08en
dc.description.abstractCancer subtyping plays a critical role in understanding tumor heterogeneity, predicting patient outcomes, and guiding personalized therapies. While DNA methylation data offers an informative molecular source for subtyping, leveraging its signals across cohorts remains challenging due to high dimensionality, batch effects, and lack of standardized tools. In this thesis, we present CancerSubtyper, a web-based deep learning platform that enables both supervised classification and semi-supervised discovery of cancer subtypes using methylation data. The platform incorporates two models: BCtypeFinder, designed for subtype prediction with domain adaptation, and CancerSubminer, a flexible model for subtype discovery or refinement with potential applicability across different cancer types. Users can upload labeled and unlabeled datasets, select models, and visualize results through various interactive plots including UMAP projections, boxplots, and Kaplan–Meier survival curves. We evaluate the platform using The Cancer Genome Atlas (TCGA) breast cancer cohort, demonstrating distinct cancer subtypes, effective batch correction, and clinical relevance via survival analysis. CancerSubtyper provides a reproducible, user-oriented platform that streamlines cancer subtype analysis using DNA methylation data. It bridges a methodological gap by enabling both classification and discovery tasks, supports batch correction across datasets, and facilitates result interpretation through interactive visualizations. Designed for accessibility, the platform requires no programming expertise, making it a practical tool for researchers and clinicians to explore cancer subtypes and contribute to personalized care.en
dc.description.abstractgeneralThis thesis presents CancerSubtyper, a web-based platform that uses deep learning to support cancer subtyping with DNA methylation data. The platform integrates BCtypeFinder for supervised subtype prediction and CancerSubminer for semi-supervised subtype discovery or refinement, all within an automated workflow that requires no programming expertise. Users can upload labeled or unlabeled datasets, after which the system performs preprocessing, batch effect correction, and subtype analysis, and then provides results through interactive visualizations including UMAP projections, heatmaps, boxplots, and Kaplan–Meier survival curves. The platform was evaluated on The Cancer Genome Atlas breast cancer cohort, where it successfully recovered distinct subtypes, corrected for batch effects, and revealed clinically relevant differences in patient survival. Overall, CancerSubtyper delivers accurate, interpretable, and reproducible subtyping analyses, lowering the barrier for researchers to conduct large-scale methylation-based cancer studies.en
dc.description.degreeMaster of Scienceen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:44631en
dc.identifier.urihttps://hdl.handle.net/10919/137647en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectBioinformaticsen
dc.subjectDeep Learningen
dc.subjectWeb Developmenten
dc.subjectCancer Subtypingen
dc.subjectDNA Methylationen
dc.subjectEpigeneticsen
dc.titleCancerSubtyper: A Web-Based Deep Learning Platform for Cancer Subtyping Through DNA Methylation Dataen
dc.typeThesisen
thesis.degree.disciplineComputer Science & Applicationsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.levelmastersen
thesis.degree.nameMaster of Scienceen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Cheung_Y_T_2025.pdf
Size:
2.65 MB
Format:
Adobe Portable Document Format

Collections