Browsing by Author "Cui, Yi"
Now showing 1 - 2 of 2
Results Per Page
Sort Options
- Incorporating prior biological knowledge for network-based differential gene expression analysis using differentially weighted graphical LASSOZuo, Yiming; Cui, Yi; Yu, Guoqiang; Li, Ruijiang; Ressom, Habtom W. (2017-02-10)Background Conventional differential gene expression analysis by methods such as student’s t-test, SAM, and Empirical Bayes often searches for statistically significant genes without considering the interactions among them. Network-based approaches provide a natural way to study these interactions and to investigate the rewiring interactions in disease versus control groups. In this paper, we apply weighted graphical LASSO (wgLASSO) algorithm to integrate a data-driven network model with prior biological knowledge (i.e., protein-protein interactions) for biological network inference. We propose a novel differentially weighted graphical LASSO (dwgLASSO) algorithm that builds group-specific networks and perform network-based differential gene expression analysis to select biomarker candidates by considering their topological differences between the groups. Results Through simulation, we showed that wgLASSO can achieve better performance in building biologically relevant networks than purely data-driven models (e.g., neighbor selection, graphical LASSO), even when only a moderate level of information is available as prior biological knowledge. We evaluated the performance of dwgLASSO for survival time prediction using two microarray breast cancer datasets previously reported by Bild et al. and van de Vijver et al. Compared with the top 10 significant genes selected by conventional differential gene expression analysis method, the top 10 significant genes selected by dwgLASSO in the dataset from Bild et al. led to a significantly improved survival time prediction in the independent dataset from van de Vijver et al. Among the 10 genes selected by dwgLASSO, UBE2S, SALL2, XBP1 and KIAA0922 have been confirmed by literature survey to be highly relevant in breast cancer biomarker discovery study. Additionally, we tested dwgLASSO on TCGA RNA-seq data acquired from patients with hepatocellular carcinoma (HCC) on tumors samples and their corresponding non-tumorous liver tissues. Improved sensitivity, specificity and area under curve (AUC) were observed when comparing dwgLASSO with conventional differential gene expression analysis method. Conclusions The proposed network-based differential gene expression analysis algorithm dwgLASSO can achieve better performance than conventional differential gene expression analysis methods by integrating information at both gene expression and network topology levels. The incorporation of prior biological knowledge can lead to the identification of biologically meaningful genes in cancer biomarker studies.
- Multiple Myeloma DREAM Challenge reveals epigenetic regulator PHF19 as marker of aggressive diseaseMason, Mike J.; Schinke, Carolina; Eng, Christine L. P.; Towfic, Fadi; Gruber, Fred; Dervan, Andrew; White, Brian S.; Pratapa, Aditya; Guan, Yuanfang; Chen, Hongjie; Cui, Yi; Li, Bailiang; Yu, Thomas; Neto, Elias Chaibub; Mavrommatis, Konstantinos; Ortiz, Maria; Lyzogubov, Valeriy; Bisht, Kamlesh; Dai, Hongyue Y.; Schmitz, Frank; Flynt, Erin; Rozelle, Dan; Danziger, Samuel A.; Ratushny, Alexander; Dalton, William S.; Goldschmidt, Hartmut; Avet-Loiseau, Herve; Samur, Mehmet; Hayete, Boris; Sonneveld, Pieter; Shain, Kenneth H.; Munshi, Nikhil; Auclair, Daniel; Hose, Dirk; Morgan, Gareth; Trotter, Matthew; Bassett, Douglas; Goke, Jonathan; Walker, Brian A.; Thakurta, Anjan; Guinney, Justin (2020-02-14)While the past decade has seen meaningful improvements in clinical outcomes for multiple myeloma patients, a subset of patients does not benefit from current therapeutics for unclear reasons. Many gene expression-based models of risk have been developed, but each model uses a different combination of genes and often involves assaying many genes making them difficult to implement. We organized the Multiple Myeloma DREAM Challenge, a crowdsourced effort to develop models of rapid progression in newly diagnosed myeloma patients and to benchmark these against previously published models. This effort lead to more robust predictors and found that incorporating specific demographic and clinical features improved gene expression-based models of high risk. Furthermore, post-challenge analysis identified a novel expression-based risk marker, PHF19, which has recently been found to have an important biological role in multiple myeloma. Lastly, we show that a simple four feature predictor composed of age, ISS, and expression of PHF19 and MMSET performs similarly to more complex models with many more gene expression features included.