ChIP-GSM: Inferring active transcription factor modules to predict functional regulatory elements
dc.contributor.author | Chen, Xi | en |
dc.contributor.author | Neuwald, Andrew F. | en |
dc.contributor.author | Hilakivi-Clarke, Leena | en |
dc.contributor.author | Clarke, Robert | en |
dc.contributor.author | Xuan, Jianhua | en |
dc.date.accessioned | 2022-02-22T17:29:59Z | en |
dc.date.available | 2022-02-22T17:29:59Z | en |
dc.date.issued | 2021-07-01 | en |
dc.date.updated | 2022-02-22T17:29:54Z | en |
dc.description.abstract | Transcription factors (TFs) often function as a module including both master factors and mediators binding at cis-regulatory regions to modulate nearby gene transcription. ChIPseq profiling of multiple TFs makes it feasible to infer functional TF modules. However, when inferring TF modules based on co-localization of ChIP-seq peaks, often many weak binding events are missed, especially for mediators, resulting in incomplete identification of modules. To address this problem, we develop a ChIP-seq data-driven Gibbs Sampler to infer Modules (ChIP-GSM) using a Bayesian framework that integrates ChIP-seq profiles of multiple TFs. ChIP-GSM samples read counts of module TFs iteratively to estimate the binding potential of a module to each region and, across all regions, estimates the module abundance. Using inferred module-region probabilistic bindings as feature units, ChIP-GSM then employs logistic regression to predict active regulatory elements. Validation of ChIPGSM predicted regulatory regions on multiple independent datasets sharing the same context confirms the advantage of using TF modules for predicting regulatory activity. In a case study of K562 cells, we demonstrate that the ChIP-GSM inferred modules form as groups, activate gene expression at different time points, and mediate diverse functional cellular processes. Hence, ChIP-GSM infers biologically meaningful TF modules and improves the prediction accuracy of regulatory region activities. | en |
dc.description.version | Published version | en |
dc.format.extent | 22 page(s) | en |
dc.format.mimetype | application/pdf | en |
dc.identifier | ARTN e1009203 (Article number) | en |
dc.identifier.doi | https://doi.org/10.1371/journal.pcbi.1009203 | en |
dc.identifier.eissn | 1553-7358 | en |
dc.identifier.issn | 1553-734X | en |
dc.identifier.issue | 7 | en |
dc.identifier.other | PCOMPBIOL-D-20-01578 (PII) | en |
dc.identifier.pmid | 34292930 | en |
dc.identifier.uri | http://hdl.handle.net/10919/108817 | en |
dc.identifier.volume | 17 | en |
dc.language.iso | en | en |
dc.publisher | PLoS | en |
dc.relation.uri | http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000677717200001&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=930d57c9ac61a043676db62af60056c1 | en |
dc.rights | Creative Commons Attribution 4.0 International | en |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en |
dc.subject | Biochemical Research Methods | en |
dc.subject | Mathematical & Computational Biology | en |
dc.subject | Biochemistry & Molecular Biology | en |
dc.subject | CHROMATIN-STATE DISCOVERY | en |
dc.subject | SEQ | en |
dc.subject | EXPRESSION | en |
dc.subject | ENHANCERS | en |
dc.subject | LINEAGE | en |
dc.subject | ROLES | en |
dc.subject | 01 Mathematical Sciences | en |
dc.subject | 06 Biological Sciences | en |
dc.subject | 08 Information and Computing Sciences | en |
dc.subject | Bioinformatics | en |
dc.subject.mesh | K562 Cells | en |
dc.subject.mesh | Chromatin | en |
dc.subject.mesh | Humans | en |
dc.subject.mesh | Transcription Factors | en |
dc.subject.mesh | Models, Statistical | en |
dc.subject.mesh | Bayes Theorem | en |
dc.subject.mesh | Computational Biology | en |
dc.subject.mesh | Gene Expression Regulation | en |
dc.subject.mesh | Epigenesis, Genetic | en |
dc.subject.mesh | Binding Sites | en |
dc.subject.mesh | Regulatory Sequences, Nucleic Acid | en |
dc.subject.mesh | Gene Regulatory Networks | en |
dc.subject.mesh | Enhancer Elements, Genetic | en |
dc.subject.mesh | Promoter Regions, Genetic | en |
dc.subject.mesh | MCF-7 Cells | en |
dc.subject.mesh | Chromatin Immunoprecipitation Sequencing | en |
dc.title | ChIP-GSM: Inferring active transcription factor modules to predict functional regulatory elements | en |
dc.title.serial | PLOS Computational Biology | en |
dc.type | Article - Refereed | en |
dc.type.dcmitype | Text | en |
dc.type.other | Article | en |
dc.type.other | Journal | en |
dcterms.dateAccepted | 2021-06-20 | en |
pubs.organisational-group | /Virginia Tech | en |
pubs.organisational-group | /Virginia Tech/Engineering | en |
pubs.organisational-group | /Virginia Tech/Engineering/Electrical and Computer Engineering | en |
pubs.organisational-group | /Virginia Tech/Faculty of Health Sciences | en |
pubs.organisational-group | /Virginia Tech/All T&R Faculty | en |
pubs.organisational-group | /Virginia Tech/Engineering/COE T&R Faculty | en |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- ChIP-GSM Inferring active transcription factor modules to predict functional regulatory elements.pdf
- Size:
- 2.66 MB
- Format:
- Adobe Portable Document Format
- Description:
- Published version