Show simple item record

dc.contributor.authorKinney, N.en
dc.contributor.authorTitus-Glover, K.en
dc.contributor.authorWren, J.D.en
dc.contributor.authorVarghese, Ronnieen
dc.contributor.authorMichalak, Pawelen
dc.contributor.authorLiao, H.en
dc.contributor.authorAnandakrishnan, Ramuen
dc.contributor.authorPulenthiran, A.en
dc.contributor.authorKang, L.en
dc.contributor.authorGarner, Harold R.en
dc.date.accessioned2019-04-05T14:31:04Zen
dc.date.available2019-04-05T14:31:04Zen
dc.date.issued2019-01-08en
dc.identifier.issn0305-1048en
dc.identifier.urihttp://hdl.handle.net/10919/88840en
dc.description.abstractThe human genome harbors an abundance of repetitive DNA; however, its function continues to be debated. Microsatellites-a class of short tandem repeat-are established as an important source of genetic variation. Array length variants are common among microsatellites and affect gene expression; but, efforts to understand the role and diversity of microsatellite variation has been hampered by several challenges. Without adequate depth, both long-read and short-read sequencing may not detect the variants present in a sample; additionally, large sample sizes are needed to reveal the degree of population-level polymorphism. To address these challenges we present the Comparative Analysis of Germline Microsatellites (CAGm): A database of germline microsatellites from 2529 individuals in the 1000 genomes project. A key novelty of CAGm is the ability to aggregate microsatellite variation by population, ethnicity (super population) and gender. The database provides advanced searching for microsatellites embedded in genes and functional elements. All data can be downloaded as Microsoft Excel spreadsheets. Two use-case scenarios are presented to demonstrate its utility: A mononucleotide (A) microsatellite at the BAT-26 locus and a dinucleotide (CA) microsatellite in the coding region of FGFRL1. CAGm is freely available at http://www.cagmdb.org/. © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research.en
dc.format.mimetypeapplication/pdfen
dc.language.isoen_USen
dc.publisherOxford University Pressen
dc.rightsCreative Commons Attribution-NonCommercial 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/en
dc.titleCAGm: A repository of germline microsatellite variations in the 1000 genomes projecten
dc.typeArticle - Refereeden
dc.contributor.departmentComputer Scienceen
dc.contributor.departmentVirginia-Maryland College of Veterinary Medicineen
dc.title.serialNucleic Acids Researchen
dc.identifier.doihttps://doi.org/10.1093/nar/gky969en
dc.identifier.volume47en
dc.identifier.issueD1en
dc.type.dcmitypeTexten
dc.identifier.pmid30329086en


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Creative Commons Attribution-NonCommercial 4.0 International
License: Creative Commons Attribution-NonCommercial 4.0 International