During this masters study, we focused on purine rich DNA oligonucleotide containing 5’-d(CAGAGG)-3’ repeat(s). Human genome contains segments with tandem repeats of the mentioned sequence that forms poorly known secondary structures, which during DNA replication causes replication fork stalling, resulting in their collapse. Within human genome there are at least three tandem repeats of the sequence in more than half of all chromosomes. Some are even located inside important protein sequences like ADAM metallopeptidase, transcription factors and other. We carried out study of the secondary structures adopted by oligonucleotide RS4, 5’-d(TTGCAGAGaAGAGGCAGAGGCAA)-3’, which contains triple repeat of the 5’-d(CAGAGG)-3’ flanked by modified 5’- and 3’-ends specifically designed to enhance thermodynamic stability. In parallel, we investigate structure adopted by a shorter oligonucleotide SLL1, 5’-d(GTGCAGAGGCAC)-3’, which contains only one 5’-d(CAGAGG)-3’ repeat. With the use of NMR, we have discovered that structures adopted by RS4 exhibit a short stem and three AGAG loops, which in existing literature have not yet been characterized at atomic level. We proved, that oligonucleotide RS4 in the presence of potassium ions folds in a symmetrical structure with two GC and two AT base pairs in Watson-Crick geometry. Furthermore, proposed symmetry of the structure is in agreement with the collected NMR data, especially the lower number of proton signals with respect to expected from the number of residues in the RS4 DNA sequence. Alternatively, missing peaks could also be explained by conformational changes with specific dynamics, which leads to 1H NMR spectral signals being broadened to the baseline - undetectable on 1H NMR time scale. We found that salt concentration has a great effect on the smaller oligonucleotide SLL1, with the higher potassium ion concentrations favouring dimeric structural fold, while such monomer-dimer transition is not observed for RS4. With the use of 13C- and 15N-isotopic labelling of the nucleotide residues in the SLL1 and with use of 2D NMR methods we assigned most of the 1H NMR signals. We further analysed NOE correlations and calculated the structural model of the monomer structure, which depicts structural characteristics of 5’-d(CAGAGG)-3’ tandem repeats.
|