Systematic evaluation of parameters in RNA bisulfite sequencing data generation and analysis

Files

TR Number

Date

2022-03-31

Journal Title

Journal ISSN

Volume Title

Publisher

Oxford University Press

Abstract

The presence of 5-methylcytosine (m5C) in RNA molecules has been known for decades and its importance in regulating RNA metabolism has gradually become appreciated. Despite recent advances made in the functional and mechanistic understanding of RNA m5C modifications, the detection and quantification of methylated RNA remains a challenge. In this study, we compared four library construction procedures for RNA bisulfite sequencing and implemented an analytical pipeline to assess the key parameters in the process of m5C calling. We found that RNA fragmentation after bisulfite conversion increased the yield significantly, and an additional high temperature treatment improved bisulfite conversion efficiency especially for sequence reads mapped to the mitochondrial transcriptome. Using Unique Molecular Identifiers (UMIs), we observed that PCR favors the amplification of unmethylated templates. The low sequencing quality of bisulfite-converted bases is a major contributor to the methylation artifacts. In addition, we found that mitochondrial transcripts are frequently resistant to bisulfite conversion and no p-m5C sites with high confidence could be identified on mitochondrial mRNAs. Taken together, this study reveals the various sources of artifacts in RNA bisulfite sequencing data and provides an improved experimental procedure together with analytical methodology.

Description

Keywords

Citation