Identification of discriminatory antibiotic resistance genes among environmental resistomes using extremely randomized tree algorithm

dc.contributor.authorGupta, Surajen
dc.contributor.authorArango-Argoty, Gustavoen
dc.contributor.authorZhang, Liqingen
dc.contributor.authorPruden, Amyen
dc.contributor.authorVikesland, Peter J.en
dc.date.accessioned2019-09-03T12:38:49Zen
dc.date.available2019-09-03T12:38:49Zen
dc.date.issued2019-08-29en
dc.date.updated2019-09-01T05:45:00Zen
dc.description.abstractBackground The interconnectivities of built and natural environments can serve as conduits for the proliferation and dissemination of antibiotic resistance genes (ARGs). Several studies have compared the broad spectrum of ARGs (i.e., “resistomes”) in various environmental compartments, but there is a need to identify unique ARG occurrence patterns (i.e., “discriminatory ARGs”), characteristic of each environment. Such an approach will help to identify factors influencing ARG proliferation, facilitate development of relative comparisons of the ARGs distinguishing various environments, and help pave the way towards ranking environments based on their likelihood of contributing to the spread of clinically relevant antibiotic resistance. Here we formulate and demonstrate an approach using an extremely randomized tree (ERT) algorithm combined with a Bayesian optimization technique to capture ARG variability in environmental samples and identify the discriminatory ARGs. The potential of ERT for identifying discriminatory ARGs was first evaluated using in silico metagenomic datasets (simulated metagenomic Illumina sequencing data) with known variability. The application of ERT was then demonstrated through analyses using publicly available and in-house metagenomic datasets associated with (1) different aquatic habitats (e.g., river, wastewater influent, hospital effluent, and dairy farm effluent) to compare resistomes between distinct environments and (2) different river samples (i.e., Amazon, Kalamas, and Cam Rivers) to compare resistome characteristics of similar environments. Results The approach was found to readily identify discriminatory ARGs in the in silico datasets. Also, it was not found to be biased towards ARGs with high relative abundance, which is a common limitation of feature projection methods, and instead only captured those ARGs that elicited significant profiles. Analyses of publicly available metagenomic datasets further demonstrated that the ERT approach can effectively differentiate real-world environmental samples and identify discriminatory ARGs based on pre-defined categorizing schemes. Conclusions Here a new methodology was formulated to characterize and compare variances in ARG profiles between metagenomic data sets derived from similar/dissimilar environments. Specifically, identification of discriminatory ARGs among samples representing various environments can be identified based on factors of interest. The methodology could prove to be a particularly useful tool for ARG surveillance and the assessment of the effectiveness of strategies for mitigating the spread of antibiotic resistance. The python package is hosted in the Git repository: https://github.com/gaarangoa/ExtrARGen
dc.description.versionPublished versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationMicrobiome. 2019 Aug 29;7(1):123en
dc.identifier.doihttps://doi.org/10.1186/s40168-019-0735-1en
dc.identifier.urihttp://hdl.handle.net/10919/93333en
dc.language.isoenen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.holderThe Author(s)en
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.titleIdentification of discriminatory antibiotic resistance genes among environmental resistomes using extremely randomized tree algorithmen
dc.title.serialMicrobiomeen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
40168_2019_Article_735.pdf
Size:
1.42 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: