Somatic microsatellite variability as a measure of DNA stability in cancer and DNA  repair disorders

TR Number



Journal Title

Journal ISSN

Volume Title


Virginia Tech


Microsatellites (MSTs) are short tandem repeats of motifs, 1 — 6 nucleotide in length, and are considered mutational 'hot-spots' and show a greater degree of somatic variability and population polymorphisms than surrounding DNA sequences. MSTs provide for a unique computational alignment problem for many commonly used algorithms, and therefore required software tools to be developed to specifically address these issues. For this work we developed a novel approach to extract MSTs from next-gen sequencing data that can robustly detect signatures of MST mutation bias and somatic variation occurring in next-gen data including a high frequency of in-phase indels. Somatic variability, novel genomic polymorphisms that arise within a cell population not found in the progenitors, plays a critical role in cellular reprogramming leading to the development and progression of cancer. MST mutation rates are between 10 and 1000 time higher than that of surrounding DNA. MSTs are found ubiquitously throughout the genome including in nearly all transcribed regions and 10-20% of coding genomic regions. Currently the only established DNA repair defect that that has been directly linked to MST instability is replication coupled mismatch repair (MMR). An initial analysis of the utility of the software was conducted with DNA repair impaired cell lines. The results demonstrated the utility in identifying the consequences of DNA repair impairments on genomic stability. There were major objectives of the finding including 1) complimenting genomics of matched DNA samples with in-sample quantification of variation and 2) demonstrating that DNA repair proficient cells and those with different defects in DNA repair can have different somatic MST variability (SMV) profiles that may be potential markers for these defects.

DNA repair disorders are debilitating conditions that result in physical and neurological abnormalities robbing the individual of a normal quality of life and life span. The various conditions that fall into this class are known as progeroid disorders and they provide a very important glimpse into the aging process on a genomic level. The conditions for four cohorts analyzed here were; Cockayne's syndrome, caused by the loss of the ERCC8 gene, also known as CSA; xeroderma pigmentosum, caused by the loss of the XPA or XPB genes; Werner syndrome, caused by the loss of the RecQL2 gene; and Rothmond-Thomson syndrome, caused by the loss of the RecQL4 gene. The goal of this project was to determine if impaired excision repair genes CSA or global XPA and B or excision repair supporting helicases BLM or RecQL4 leads to MST destabilization. Comparing cohorts from excision repair disorders with a co-sequenced normal cohort we found that CSA both RecQ helicases had an effect on the exome somatic variability of MSTs. On the other hand the effects of XPA/B were inconclusive.

MST instability (MSI), defined as acquired/lost primary alleles in tumors for a small set of microsatellite loci, has been implicated and is a clinically relevant marker for colorectal cancer. Conversely, no clinically actionable genetic markers have been found for liver cancer, a cancer with a very high mortality rate. Here we explore the use SMV defined as the presence of minor alleles at MST loci, as a complementary measure of MSI as a genetic marker for colorectal and liver cancer by analyzing Illumina sequenced genomes from The Cancer Genome Atlas. Our data shows that SMV may distinguish a subpopulation of African American patients with colorectal cancer, ~33% of the population in this study. Further, for liver cancer, a higher rate of SMV may be indicative of earlier age of onset. In conclusion, the work presented here suggests that MSI should be expanded to include SMV, not only instability.



Cancer, DNA repair, Genomics, Somatic Variability, Somatic, Cell lines, Patients, Mismatch Repair, RecQL2, RecQL4, Liver cancer, Colorectal cancer