A Machine Learning Approach to Predict Gene Regulatory Networks in Seed Development in Arabidopsis

dc.contributor.authorGrene, Ruthen
dc.contributor.authorHeath, Lenwood S.en
dc.contributor.authorLi, Songen
dc.contributor.authorCollakova, Evaen
dc.contributor.authorElmarakeby, Haitham A.en
dc.contributor.authorNi, Yingen
dc.contributor.authorAghamirzaie, Delasaen
dc.contributor.departmentComputer Scienceen
dc.contributor.departmentSchool of Plant and Environmental Sciencesen
dc.date.accessioned2016-12-26T19:06:29Zen
dc.date.available2016-12-26T19:06:29Zen
dc.date.issued2016-12-23en
dc.description.abstractGene regulatory networks (GRNs) provide a representation of relationships between regulators and their target genes. Several methods for GRN inference, both unsupervised and supervised, have been developed to date Because regulatory relationships consistently reprogram in diverse tissues or under different conditions, GRNs inferred without specific biological contexts are of limited applicability. In this report, a machine learning approach is presented to predict GRNs specific to developing Arabidopsis thaliana embryos. We developed the Beacon GRN inference tool to predict GRNs occurring during seed development in Arabidopsis based on a support vector machine (SVM) model. We developed both global and local inference models and compared their performance, demonstrating that local models are generally superior for our application. Using both the expression levels of the genes expressed in developing embryos and prior known regulatory relationships, GRNs were predicted for specific embryonic developmental stages. The targets that are strongly positively correlated with their regulators are mostly expressed at the beginning of seed development. Potential direct targets were identified based on a match between the promoter regions of these inferred targets and the cis elements recognized by specific regulators. Our analysis also provides evidence for previously unknown inhibitory effects of three positive regulators of gene expression. The Beacon GRN inference tool provides a valuable model system for context-specific GRN inference and is freely available at https://github.com/BeaconProjectAtVirginiaTech/beacon_network_inference.git.en
dc.description.notesfalse (Extension publication?)en
dc.description.versionPublished versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.3389/fpls.2016.01936en
dc.identifier.issn1664-462Xen
dc.identifier.issueDecember 23, 2016en
dc.identifier.urihttp://hdl.handle.net/10919/73833en
dc.language.isoenen
dc.publisherFrontiersen
dc.relation.urihttp://home.frontiersin.org/en
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.subjectGene Regulatory Networksen
dc.subjectSeed developmenten
dc.subjectRNA-Seq dataen
dc.subjectArabidopsisen
dc.titleA Machine Learning Approach to Predict Gene Regulatory Networks in Seed Development in Arabidopsisen
dc.title.serialFrontiers in Plant Science Bioinformatics and Computational Biologyen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten
dcterms.dateAccepted2016-12-22en
pubs.organisational-group/Virginia Techen
pubs.organisational-group/Virginia Tech/Agriculture & Life Sciencesen
pubs.organisational-group/Virginia Tech/Agriculture & Life Sciences/CALS T&R Facultyen
pubs.organisational-group/Virginia Tech/Agriculture & Life Sciences/Plant Pathology, Physiology, & Weed Scienceen
pubs.organisational-group/Virginia Tech/All T&R Facultyen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Frontiers_120216.pdf
Size:
1.45 MB
Format:
Adobe Portable Document Format
Description:
Accepted Version
License bundle
Now showing 1 - 1 of 1
Name:
VTUL_Distribution_License_2016_05_09.pdf
Size:
18.09 KB
Format:
Adobe Portable Document Format
Description: