RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes

dc.contributor.authorBrettin, Thomasen
dc.contributor.authorDavis, James J.en
dc.contributor.authorDisz, Terryen
dc.contributor.authorEdwards, Robert A.en
dc.contributor.authorGerdes, Svetlanaen
dc.contributor.authorOlsen, Gary J.en
dc.contributor.authorOlson, Roberten
dc.contributor.authorOverbeek, Rossen
dc.contributor.authorParrello, Bruceen
dc.contributor.authorPusch, Gordon D.en
dc.contributor.authorShukla, Mauliken
dc.contributor.authorThomason, James A., IIIen
dc.contributor.authorStevens, Rick L.en
dc.contributor.authorVonstein, Veronikaen
dc.contributor.authorWattam, Alice R.en
dc.contributor.authorXia, Fangfangen
dc.date.accessioned2019-01-28T17:56:17Zen
dc.date.available2019-01-28T17:56:17Zen
dc.date.issued2015-02-10en
dc.description.abstractThe RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offers a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception.en
dc.description.notesWe thank Emily Dietrich for her helpful comments. This work was supported by the United States National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Service [Contract No. HHSN272201400027C]; the United States Department of Energy [DE-AC02-06CH11357], as part of the DOE Systems Biology Knowledgebase; R.A.E. was supported by United States National Science Foundation Grants grants II-EN: Computational Enhancement of Analytical Metagenomics Systems CNS-1305112, and Experimental and Computational Determination of Microbial Genotypes and Phenotypes MCB-1330800; and G.J.O. was supported by the National Aeronautics and Space Administration through the NASA Astrobiology Institute under Cooperative Agreement No. NNA13AA91A issued through the Science Mission Directorate. United States Department of Energy: National Institute of Allergy and Infectious Diseases.en
dc.description.sponsorshipUnited States National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Service [HHSN272201400027C]; United States Department of Energy, DOE Systems Biology Knowledgebase [DE-AC02-06CH11357]; United States National Science Foundation [CNS-1305112]; Experimental and Computational Determination of Microbial Genotypes and Phenotypes [MCB-1330800]; National Aeronautics and Space Administration through the NASA Astrobiology Institute [NNA13AA91A]en
dc.format.extent6en
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1038/srep08365en
dc.identifier.issn2045-2322en
dc.identifier.other8365en
dc.identifier.pmid25666585en
dc.identifier.urihttp://hdl.handle.net/10919/87057en
dc.identifier.volume5en
dc.language.isoen_USen
dc.publisherSpringer Natureen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.subjectmicrobial genomesen
dc.subjectrna genesen
dc.subjectresistanceen
dc.subjectdatabaseen
dc.subjectsystemen
dc.subjectidentificationen
dc.subjectgenerationen
dc.subjectsequencesen
dc.subjectresourceen
dc.subjectarchaeaen
dc.titleRASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomesen
dc.title.serialScientific Reportsen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
srep08365.pdf
Size:
700.06 KB
Format:
Adobe Portable Document Format
Description: