Investigating bisulfite short-read mapping failure with hairpin bisulfite sequencing data

dc.contributorVirginia Techen
dc.contributor.authorPorter, Jacoben
dc.contributor.authorSun, Ming-anen
dc.contributor.authorXie, Hehuang Daviden
dc.contributor.authorZhang, Liqingen
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2017-03-10T16:32:44Zen
dc.date.available2017-03-10T16:32:44Zen
dc.date.issued2015-11-10en
dc.description.abstractBackground: DNA methylation is an important epigenetic mark relevant to normal development and disease genesis. A common approach to characterizing genome-wide DNA methylation is using Next Generation Sequencing technology to sequence bisulfite treated DNA. The short sequence reads are mapped to the reference genome to determine the methylation statuses of Cs. However, despite intense effort, a much smaller proportion of the reads derived from bisulfite treated DNA (usually about 40-80%) can be mapped than regular short reads mapping (> 90%), and it is unclear what factors lead to this low mapping efficiency. Results: To address this issue, we used the hairpin bisulfite sequencing technology to determine sequences of both DNA double strands simultaneously. This enabled the recovery of the original non-bisulfite-converted sequences. We used Bismark for bisulfite read mapping and Bowtie2 for recovered read mapping. We found that recovering the reads improved unique mapping efficiency by 9-10% compared to the bisulfite reads. Such improvement in mapping efficiency is related to sequence entropy. Conclusions: The hairpin recovery technique improves mapping efficiency, and sequence entropy relates to mapping efficiency.en
dc.description.versionPublished versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1186/1471-2164-16-S11-S2en
dc.identifier.issue11en
dc.identifier.urihttp://hdl.handle.net/10919/76109en
dc.identifier.volume16en
dc.language.isoenen
dc.publisherBMCen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.titleInvestigating bisulfite short-read mapping failure with hairpin bisulfite sequencing dataen
dc.title.serialBMC Genomicsen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
XieInvestigating2015.pdf
Size:
1.58 MB
Format:
Adobe Portable Document Format
Description: