vi-HMM: a novel HMM-based method for sequence variant identification in short-read data

dc.contributor.authorTang, Manen
dc.contributor.authorHasan, Mohammad Shabbiren
dc.contributor.authorZhu, Hongxiaoen
dc.contributor.authorZhang, Liqingen
dc.contributor.authorWu, Xiaoweien
dc.contributor.departmentComputer Scienceen
dc.contributor.departmentStatisticsen
dc.date.accessioned2019-02-25T13:47:37Zen
dc.date.available2019-02-25T13:47:37Zen
dc.date.issued2019-02-13en
dc.date.updated2019-02-24T04:20:58Zen
dc.description.abstractBackground Accurate and reliable identification of sequence variants, including single nucleotide polymorphisms (SNPs) and insertion-deletion polymorphisms (INDELs), plays a fundamental role in next-generation sequencing (NGS) applications. Existing methods for calling these variants often make simplified assumptions of positional independence and fail to leverage the dependence between genotypes at nearby loci that is caused by linkage disequilibrium (LD). Results and conclusion We propose vi-HMM, a hidden Markov model (HMM)-based method for calling SNPs and INDELs in mapped short-read data. This method allows transitions between hidden states (defined as “SNP,” “Ins,” “Del,” and “Match”) of adjacent genomic bases and determines an optimal hidden state path by using the Viterbi algorithm. The inferred hidden state path provides a direct solution to the identification of SNPs and INDELs. Simulation studies show that, under various sequencing depths, vi-HMM outperforms commonly used variant calling methods in terms of sensitivity and F1 score. When applied to the real data, vi-HMM demonstrates higher accuracy in calling SNPs and INDELs.en
dc.description.versionPublished versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationHuman Genomics. 2019 Feb 13;13(1):9en
dc.identifier.doihttps://doi.org/10.1186/s40246-019-0194-6en
dc.identifier.urihttp://hdl.handle.net/10919/87765en
dc.language.isoenen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.holderThe Author(s)en
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.titlevi-HMM: a novel HMM-based method for sequence variant identification in short-read dataen
dc.title.serialHuman Genomicsen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
40246_2019_Article_194.pdf
Size:
1.21 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: