Sample records for dna repeat elements

  1. Complex structure of knob DNA on maize chromosome 9. Retrotransposon invasion into heterochromatin.

    PubMed Central

    Ananiev, E V; Phillips, R L; Rines, H W

    1998-01-01

    The recovery of maize (Zea mays L.) chromosome addition lines of oat (Avena sativa L.) from oat x maize crosses enables us to analyze the structure and composition of specific regions, such as knobs, of individual maize chromosomes. A DNA hybridization blot panel of eight individual maize chromosome addition lines revealed that 180-bp repeats found in knobs are present in each of these maize chromosomes, but the copy number varies from approximately 100 to 25, 000. Cosmid clones with knob DNA segments were isolated from a genomic library of an oat-maize chromosome 9 addition line with the help of the 180-bp knob-associated repeated DNA sequence used as a probe. Cloned knob DNA segments revealed a complex organization in which blocks of tandemly arranged 180-bp repeating units are interrupted by insertions of other repeated DNA sequences, mostly represented by individual full size copies of retrotransposable elements. There is an obvious preference for the integration of retrotransposable elements into certain sites (hot spots) of the 180-bp repeat. Sequence microheterogeneity including point mutations and duplications was found in copies of 180-bp repeats. The 180-bp repeats within an array all had the same polarity. Restriction maps constructed for 23 cloned knob DNA fragments revealed the positions of polymorphic sites and sites of integration of insertion elements. Discovery of the interspersion of retrotransposable elements among blocks of tandem repeats in maize and some other organisms suggests that this pattern may be basic to heterochromatin organization for eukaryotes. PMID:9691055

  2. Primary analysis of repeat elements of the Asian seabass (Lates calcarifer) transcriptome and genome

    PubMed Central

    Kuznetsova, Inna S.; Thevasagayam, Natascha M.; Sridatta, Prakki S. R.; Komissarov, Aleksey S.; Saju, Jolly M.; Ngoh, Si Y.; Jiang, Junhui; Shen, Xueyan; Orbán, László

    2014-01-01

    As part of our Asian seabass genome project, we are generating an inventory of repeat elements in the genome and transcriptome. The karyotype showed a diploid number of 2n = 24 chromosomes with a variable number of B-chromosomes. The transcriptome and genome of Asian seabass were searched for repetitive elements with experimental and bioinformatics tools. Six different types of repeats constituting 8–14% of the genome were characterized. Repetitive elements were clustered in the pericentromeric heterochromatin of all chromosomes, but some of them were preferentially accumulated in pretelomeric and pericentromeric regions of several chromosomes pairs and have chromosomes specific arrangement. From the dispersed class of fish-specific non-LTR retrotransposon elements Rex1 and MAUI-like repeats were analyzed. They were wide-spread both in the genome and transcriptome, accumulated on the pericentromeric and peritelomeric areas of all chromosomes. Every analyzed repeat was represented in the Asian seabass transcriptome, some showed differential expression between the gonads. The other group of repeats analyzed belongs to the rRNA multigene family. FISH signal for 5S rDNA was located on a single pair of chromosomes, whereas that for 18S rDNA was found on two pairs. A BAC-derived contig containing rDNA was sequenced and assembled into a scaffold containing incomplete fragments of 18S rDNA. Their assembly and chromosomal position revealed that this part of Asian seabass genome is extremely rich in repeats containing evolutionarily conserved and novel sequences. In summary, transcriptome assemblies and cDNA data are suitable for the identification of repetitive DNA from unknown genomes and for comparative investigation of conserved elements between teleosts and other vertebrates. PMID:25120555

  3. A novel species-specific tandem repeat DNA family from Sinapis arvensis: detection of telomere-like sequences.

    PubMed

    Kapila, R; Das, S; Srivastava, P S; Lakshmikumaran, M

    1996-08-01

    DNA sequences representing a tandemly repeated DNA family of the Sinapis arvensis genome were cloned and characterized. The 700-bp tandem repeat family is represented by two clones, pSA35 and pSA52, which are 697 and 709 bp in length, respectively. Dot matrix analysis of the sequences indicates the presence of repeated elements within each monomeric unit. Sequence analysis of the repetitive region of clones pSA35 and pSA52 shows that there are several copies of a 7-bp repeat element organized in tandem. The consensus sequence of this repeat element is 5'-TTTAGGG-3'. These elements are highly mutated and the difference in length between the two clones is due to different copy numbers of these elements. The repetitive region of clone pSA35 has 26 copies of the element TTTAGGG, whereas clone pSA52 has 28 copies. The repetitive region in both clones is flanked on either side by inverted repeats that may be footprints of a transposition event. Sequence comparison indicates that the element TTTAGGG is identical to telomeric repeats present in Arabidopsis, maize, tomato, and other plants. However, Bal31 digestion kinetics indicates non-telomeric localization of the 700-bp tandem repeats. The clones represent a novel repeat family as (i) they contain telomere-like motifs as subrepeats within each unit; and (ii) they do not hybridize to related crucifers and are species-specific in nature.

  4. Sequence of retrovirus provirus resembles that of bacterial transposable elements

    NASA Astrophysics Data System (ADS)

    Shimotohno, Kunitada; Mizutani, Satoshi; Temin, Howard M.

    1980-06-01

    The nucleotide sequences of the terminal regions of an infectious integrated retrovirus cloned in the modified λ phage cloning vector Charon 4A have been elucidated. There is a 569-base pair direct repeat at both ends of the viral DNA. The cell-virus junctions at each end consist of a 5-base pair direct repeat of cell DNA next to a 3-base pair inverted repeat of viral DNA. This structure resembles that of a transposable element and is consistent with the protovirus hypothesis that retroviruses evolved from the cell genome.

  5. Rat L (long interspersed repeated DNA) elements contain guanine-rich homopurine sequences that induce unpairing of contiguous duplex DNA.

    PubMed Central

    Usdin, K; Furano, A V

    1988-01-01

    The L family (long interspersed repeated DNA) of mobile genetic elements is a persistent feature of the mammalian genome. In rats, this family contains approximately equal to 40,000 members and accounts for approximately equal to 10% of the haploid genome. We demonstrate here that the guanine-rich homopurine stretches located at the right end of L-DNA induce oligonucleotide uptake by contiguous duplex DNA. The uptake is dependent on negative supercoiling and the length of the homopurine stretch and occurs even when the L-DNA homopurine stretches are introduced into a different DNA environment. The bound oligomer primes DNA synthesis when DNA polymerase and deoxyribonucleoside triphosphates are added, resulting in a faithful copy of the template to which the oligonucleotide had bound. The implications of this property of the L-DNA guanine-rich homopurine stretches in the amplification, recombination, and dispersal of L elements is discussed. Images PMID:2837766

  6. Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus.

    PubMed

    Lakshmikumaran, M S; D'Ambrosio, E; Laimins, L A; Lin, D T; Furano, A V

    1985-09-01

    The insulin 1, but not the insulin 2, locus is polymorphic (i.e., exhibits allelic variation) in rats. Restriction enzyme analysis and hybridization studies showed that the polymorphic region is 2.2 kilobases upstream of the insulin 1 coding region and is due to the presence or absence of an approximately 2.7-kilobase repeated DNA element. DNA sequence determination showed that this DNA element is a member of a long interspersed repeated DNA family (LINE) that is highly repeated (greater than 50,000 copies) and highly transcribed in the rat. Although the presence or absence of LINE sequences at the insulin 1 locus occurs in both the homozygous and heterozygous states, LINE-containing insulin 1 alleles are more prevalent in the rat population than are alleles without LINEs. Restriction enzyme analysis of the LINE-containing alleles indicated that at least two versions of the LINE sequence may be present at the insulin 1 locus in different rats. Either repeated transposition of LINE sequences or gene conversion between the resident insulin 1 LINE and other sequences in the genome are possible explanations for this.

  7. Billions of basepairs of recently expanded, repetitive sequences are eliminated from the somatic genome during copepod development.

    PubMed

    Sun, Cheng; Wyngaard, Grace; Walton, D Brian; Wichman, Holly A; Mueller, Rachel Lockridge

    2014-03-11

    Chromatin diminution is the programmed deletion of DNA from presomatic cell or nuclear lineages during development, producing single organisms that contain two different nuclear genomes. Phylogenetically diverse taxa undergo chromatin diminution--some ciliates, nematodes, copepods, and vertebrates. In cyclopoid copepods, chromatin diminution occurs in taxa with massively expanded germline genomes; depending on species, germline genome sizes range from 15 - 75 Gb, 12-74 Gb of which are lost from pre-somatic cell lineages at germline--soma differentiation. This is more than an order of magnitude more sequence than is lost from other taxa. To date, the sequences excised from copepods have not been analyzed using large-scale genomic datasets, and the processes underlying germline genomic gigantism in this clade, as well as the functional significance of chromatin diminution, have remained unknown. Here, we used high-throughput genomic sequencing and qPCR to characterize the germline and somatic genomes of Mesocyclops edax, a freshwater cyclopoid copepod with a germline genome of ~15 Gb and a somatic genome of ~3 Gb. We show that most of the excised DNA consists of repetitive sequences that are either 1) verifiable transposable elements (TEs), or 2) non-simple repeats of likely TE origin. Repeat elements in both genomes are skewed towards younger (i.e. less divergent) elements. Excised DNA is a non-random sample of the germline repeat element landscape; younger elements, and high frequency DNA transposons and LINEs, are disproportionately eliminated from the somatic genome. Our results suggest that germline genome expansion in M. edax reflects explosive repeat element proliferation, and that billions of base pairs of such repeats are deleted from the somatic genome every generation. Thus, we hypothesize that chromatin diminution is a mechanism that controls repeat element load, and that this load can evolve to be divergent between tissue types within single organisms.

  8. Billions of basepairs of recently expanded, repetitive sequences are eliminated from the somatic genome during copepod development

    PubMed Central

    2014-01-01

    Background Chromatin diminution is the programmed deletion of DNA from presomatic cell or nuclear lineages during development, producing single organisms that contain two different nuclear genomes. Phylogenetically diverse taxa undergo chromatin diminution — some ciliates, nematodes, copepods, and vertebrates. In cyclopoid copepods, chromatin diminution occurs in taxa with massively expanded germline genomes; depending on species, germline genome sizes range from 15 – 75 Gb, 12–74 Gb of which are lost from pre-somatic cell lineages at germline – soma differentiation. This is more than an order of magnitude more sequence than is lost from other taxa. To date, the sequences excised from copepods have not been analyzed using large-scale genomic datasets, and the processes underlying germline genomic gigantism in this clade, as well as the functional significance of chromatin diminution, have remained unknown. Results Here, we used high-throughput genomic sequencing and qPCR to characterize the germline and somatic genomes of Mesocyclops edax, a freshwater cyclopoid copepod with a germline genome of ~15 Gb and a somatic genome of ~3 Gb. We show that most of the excised DNA consists of repetitive sequences that are either 1) verifiable transposable elements (TEs), or 2) non-simple repeats of likely TE origin. Repeat elements in both genomes are skewed towards younger (i.e. less divergent) elements. Excised DNA is a non-random sample of the germline repeat element landscape; younger elements, and high frequency DNA transposons and LINEs, are disproportionately eliminated from the somatic genome. Conclusions Our results suggest that germline genome expansion in M. edax reflects explosive repeat element proliferation, and that billions of base pairs of such repeats are deleted from the somatic genome every generation. Thus, we hypothesize that chromatin diminution is a mechanism that controls repeat element load, and that this load can evolve to be divergent between tissue types within single organisms. PMID:24618421

  9. The fission yeast CENP-B protein Abp1 prevents pervasive transcription of repetitive DNA elements.

    PubMed

    Daulny, Anne; Mejía-Ramírez, Eva; Reina, Oscar; Rosado-Lugo, Jesus; Aguilar-Arnal, Lorena; Auer, Herbert; Zaratiegui, Mikel; Azorin, Fernando

    2016-10-01

    It is well established that eukaryotic genomes are pervasively transcribed producing cryptic unstable transcripts (CUTs). However, the mechanisms regulating pervasive transcription are not well understood. Here, we report that the fission yeast CENP-B homolog Abp1 plays an important role in preventing pervasive transcription. We show that loss of abp1 results in the accumulation of CUTs, which are targeted for degradation by the exosome pathway. These CUTs originate from different types of genomic features, but the highest increase corresponds to Tf2 retrotransposons and rDNA repeats, where they map along the entire elements. In the absence of abp1, increased RNAPII-Ser5P occupancy is observed throughout the Tf2 coding region and, unexpectedly, RNAPII-Ser5P is enriched at rDNA repeats. Loss of abp1 also results in Tf2 derepression and increased nucleolus size. Altogether these results suggest that Abp1 prevents pervasive RNAPII transcription of repetitive DNA elements (i.e., Tf2 and rDNA repeats) from internal cryptic sites. Copyright © 2016 Elsevier B.V. All rights reserved.

  10. Satellite DNA Modulates Gene Expression in the Beetle Tribolium castaneum after Heat Stress

    PubMed Central

    Feliciello, Isidoro; Akrap, Ivana; Ugarković, Đurđica

    2015-01-01

    Non-coding repetitive DNAs have been proposed to perform a gene regulatory role, however for tandemly repeated satellite DNA no such role was defined until now. Here we provide the first evidence for a role of satellite DNA in the modulation of gene expression under specific environmental conditions. The major satellite DNA TCAST1 in the beetle Tribolium castaneum is preferentially located within pericentromeric heterochromatin but is also dispersed as single repeats or short arrays in the vicinity of protein-coding genes within euchromatin. Our results show enhanced suppression of activity of TCAST1-associated genes and slower recovery of their activity after long-term heat stress relative to the same genes without associated TCAST1 satellite DNA elements. The level of gene suppression is not influenced by the distance of TCAST1 elements from the associated genes up to 40 kb from the genes’ transcription start sites, but it does depend on the copy number of TCAST1 repeats within an element, being stronger for the higher number of copies. The enhanced gene suppression correlates with the enrichment of the repressive histone marks H3K9me2/3 at dispersed TCAST1 elements and their flanking regions as well as with increased expression of TCAST1 satellite DNA. The results reveal transient, RNAi based heterochromatin formation at dispersed TCAST1 repeats and their proximal regions as a mechanism responsible for enhanced silencing of TCAST1-associated genes. Differences in the pattern of distribution of TCAST1 elements contribute to gene expression diversity among T. castaneum strains after long-term heat stress and might have an impact on adaptation to different environmental conditions. PMID:26275223

  11. Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus.

    PubMed Central

    Lakshmikumaran, M S; D'Ambrosio, E; Laimins, L A; Lin, D T; Furano, A V

    1985-01-01

    The insulin 1, but not the insulin 2, locus is polymorphic (i.e., exhibits allelic variation) in rats. Restriction enzyme analysis and hybridization studies showed that the polymorphic region is 2.2 kilobases upstream of the insulin 1 coding region and is due to the presence or absence of an approximately 2.7-kilobase repeated DNA element. DNA sequence determination showed that this DNA element is a member of a long interspersed repeated DNA family (LINE) that is highly repeated (greater than 50,000 copies) and highly transcribed in the rat. Although the presence or absence of LINE sequences at the insulin 1 locus occurs in both the homozygous and heterozygous states, LINE-containing insulin 1 alleles are more prevalent in the rat population than are alleles without LINEs. Restriction enzyme analysis of the LINE-containing alleles indicated that at least two versions of the LINE sequence may be present at the insulin 1 locus in different rats. Either repeated transposition of LINE sequences or gene conversion between the resident insulin 1 LINE and other sequences in the genome are possible explanations for this. Images PMID:3016521

  12. Preferential Nucleosome Assembly at DNA Triplet Repeats from the Myotonic Dystrophy Gene

    NASA Astrophysics Data System (ADS)

    Wang, Yuh-Hwa; Amirhaeri, Sorour; Kang, Seongman; Wells, Robert D.; Griffith, Jack D.

    1994-07-01

    The expansion of CTG repeats in DNA occurs in or near genes involved in several human diseases, including myotonic dystrophy and Huntington's disease. Nucleosomes, the basic structural element of chromosomes, consist of 146 base pairs of DNA coiled about an octamer of histone proteins and mediate general transcriptional repression. Electron microscopy was used to examine in vitro the nucleosome assembly of DNA containing repeating CTG triplets. The efficiency of nucleosome formation increased with expanded triplet blocks, suggesting that such blocks may repress transcription through the creation of stable nucleosomes.

  13. Turnover of R1 (Type I) and R2 (Type Ii) Retrotransposable Elements in the Ribosomal DNA of Drosophila Melanogaster

    PubMed Central

    Jakubczak, J. L.; Zenni, M. K.; Woodruff, R. C.; Eickbush, T. H.

    1992-01-01

    R1 and R2 are distantly related non-long terminal repeat retrotransposable elements each of which inserts into a specific site in the 28S rRNA genes of most insects. We have analyzed aspects of R1 and R2 abundance and sequence variation in 27 geographical isolates of Drosophila melanogaster. The fraction of 28S rRNA genes containing these elements varied greatly between strains, 17-67% for R1 elements and 2-28% for R2 elements. The total percentage of the rDNA repeats inserted ranged from 32 to 77%. The fraction of the rDNA repeats that contained both of these elements suggested that R1 and R2 exhibit neither an inhibition of nor preference for insertion into a 28S gene already containing the other type of element. Based on the conservation of restriction sites in the elements of all strains, and sequence analysis of individual elements from three strains, nucleotide divergence is very low for R1 and R2 elements within or between strains (<0.6%). This sequence uniformity is the expected result of the forces of concerted evolution (unequal crossovers and gene conversion) which act on the rRNA genes themselves. Evidence for the role of retrotransposition in the turnover of R1 and R2 was obtained by using naturally occurring 5' length polymorphisms of the elements as markers for independent transposition events. The pattern of these different length 5' truncations of R1 and R2 was found to be diverse and unique to most strains analyzed. Because recombination can only, with time, amplify or eliminate those length variants already present, the diversity found in each strain suggests that retrotransposition has played a critical role in maintaining these elements in the rDNA repeats of D. melanogaster. PMID:1317313

  14. Genome-wide DNA methylation patterns in LSH mutant reveals de-repression of repeat elements and redundant epigenetic silencing pathways

    PubMed Central

    Yu, Weishi; McIntosh, Carl; Lister, Ryan; Zhu, Iris; Han, Yixing; Ren, Jianke; Landsman, David; Lee, Eunice; Briones, Victorino; Terashima, Minoru; Leighty, Robert; Ecker, Joseph R.

    2014-01-01

    Cytosine methylation is critical in mammalian development and plays a role in diverse biologic processes such as genomic imprinting, X chromosome inactivation, and silencing of repeat elements. Several factors regulate DNA methylation in early embryogenesis, but their precise role in the establishment of DNA methylation at a given site remains unclear. We have generated a comprehensive methylation map in fibroblasts derived from the murine DNA methylation mutant Hells−/− (helicase, lymphoid specific, also known as LSH). It has been previously shown that HELLS can influence de novo methylation of retroviral sequences and endogenous genes. Here, we describe that HELLS controls cytosine methylation in a nuclear compartment that is in part defined by lamin B1 attachment regions. Despite widespread loss of cytosine methylation at regulatory sequences, including promoter regions of protein-coding genes and noncoding RNA genes, overall relative transcript abundance levels in the absence of HELLS are similar to those in wild-type cells. A subset of promoter regions shows increases of the histone modification H3K27me3, suggesting redundancy of epigenetic silencing mechanisms. Furthermore, HELLS modulates CG methylation at all classes of repeat elements and is critical for repression of a subset of repeat elements. Overall, we provide a detailed analysis of gene expression changes in relation to DNA methylation alterations, which contributes to our understanding of the biological role of cytosine methylation. PMID:25170028

  15. In and out of the rRNA genes: characterization of Pokey elements in the sequenced Daphnia genome

    PubMed Central

    2013-01-01

    Background Only a few transposable elements are known to exhibit site-specific insertion patterns, including the well-studied R-element retrotransposons that insert into specific sites within the multigene rDNA. The only known rDNA-specific DNA transposon, Pokey (superfamily: piggyBac) is found in the freshwater microcrustacean, Daphnia pulex. Here, we present a genome-wide analysis of Pokey based on the recently completed whole genome sequencing project for D. pulex. Results Phylogenetic analysis of Pokey elements recovered from the genome sequence revealed the presence of four lineages corresponding to two divergent autonomous families and two related lineages of non-autonomous miniature inverted repeat transposable elements (MITEs). The MITEs are also found at the same 28S rRNA gene insertion site as the Pokey elements, and appear to have arisen as deletion derivatives of autonomous elements. Several copies of the full-length Pokey elements may be capable of producing an active transposase. Surprisingly, both families of Pokey possess a series of 200 bp repeats upstream of the transposase that is derived from the rDNA intergenic spacer (IGS). The IGS sequences within the Pokey elements appear to be evolving in concert with the rDNA units. Finally, analysis of the insertion sites of Pokey elements outside of rDNA showed a target preference for sites similar to the specific sequence that is targeted within rDNA. Conclusions Based on the target site preference of Pokey elements and the concerted evolution of a segment of the element with the rDNA unit, we propose an evolutionary path by which the ancestors of Pokey elements have invaded the rDNA niche. We discuss how specificity for the rDNA unit may have evolved and how this specificity has played a role in the long-term survival of these elements in the subgenus Daphnia. PMID:24059783

  16. Translocation and gross deletion breakpoints in human inherited disease and cancer II: Potential involvement of repetitive sequence elements in secondary structure formation between DNA ends.

    PubMed

    Chuzhanova, Nadia; Abeysinghe, Shaun S; Krawczak, Michael; Cooper, David N

    2003-09-01

    Translocations and gross deletions are responsible for a significant proportion of both cancer and inherited disease. Although such gene rearrangements are nonuniformly distributed in the human genome, the underlying mutational mechanisms remain unclear. We have studied the potential involvement of various types of repetitive sequence elements in the formation of secondary structure intermediates between the single-stranded DNA ends that recombine during rearrangements. Complexity analysis was used to assess the potential of these ends to form secondary structures, the maximum decrease in complexity consequent to a gross rearrangement being used as an indicator of the type of repeat and the specific DNA ends involved. A total of 175 pairs of deletion/translocation breakpoint junction sequences available from the Gross Rearrangement Breakpoint Database [GRaBD; www.uwcm.ac.uk/uwcm/mg/grabd/grabd.html] were analyzed. Potential secondary structure was noted between the 5' flanking sequence of the first breakpoint and the 3' flanking sequence of the second breakpoint in 49% of rearrangements and between the 5' flanking sequence of the second breakpoint and the 3' flanking sequence of the first breakpoint in 36% of rearrangements. Inverted repeats, inversions of inverted repeats, and symmetric elements were found in association with gross rearrangements at approximately the same frequency. However, inverted repeats and inversions of inverted repeats accounted for the vast majority (83%) of deletions plus small insertions, symmetric elements for one-half of all antigen receptor-mediated translocations, while direct repeats appear only to be involved in mediating simple deletions. These findings extend our understanding of illegitimate recombination by highlighting the importance of secondary structure formation between single-stranded DNA ends at breakpoint junctions. Copyright 2003 Wiley-Liss, Inc.

  17. The DL1 repeats in the genome of Diphyllobothrium latum.

    PubMed

    Usmanova, Nadezhda M; Kazakov, Vasiliy I

    2010-07-01

    Diphyllobothrium latum is a widespread intestinal parasite, which has a great clinical relevance, but there are no sequences of its nuclear genome. In this paper, a repetitive element in the D. latum genome is firstly described. The adult D. latum was obtained in the result of expulsion from intestinum of a patient suffering from diphyllobothriasis. Genomic DNA was isolated from several proglottids of this individual. PstI restriction products of D. latum genomic DNA were sequenced. Polymerase chain reaction (PCR) amplification of these products using genomic DNA and selected primers was carried out. Thereby a cluster of a repetitive element, called DL1, was discovered. For precise identification of a beginning and an end of the repeat, a product of PCR amplification of D. latum genomic DNA with one specific primer was sequenced. In discussion, several evidences that DL1 repeat is a member of the SINE family of retroposons were adduced.

  18. Structural analysis of the rDNA intergenic spacer of Brassica nigra: evolutionary divergence of the spacers of the three diploid Brassica species.

    PubMed

    Bhatia, S; Singh Negi, M; Lakshmikumaran, M

    1996-11-01

    EcoRI restriction of the B. nigra rDNA recombinants, isolated from a lambda genomic library, showed that the 3.9-kb fragment corresponded to the Intergenic Spacer (IGS), which was sequenced and found to be 3,928 bp in size. Sequence and dot-matrix analyses showed that the organization of the B. nigra rDNA IGS was typical of most rDNA spacers, consisting of a central repetitive region and flanking unique sequences on either side. The repetitive region was composed of two repeat families-RF 'A' and RF 'B.' The B. nigra RF 'A' consisted of a tandem array of three full-length copies of a 106-bp sequence element. RF 'B' was composed of 66 tandemly repeated elements. Each 'B' element was only 21-bp in size and this is the smallest repeat unit identified in plant rDNA to date. The putative transcription initiation site (TIS) was identified as nucleotide position 3,110. Based on the sequence analysis it was suggested that the present organization of the repeat families was generated by successive cycles of deletions and amplifications and was being maintained by homogenization processes such as gene conversion and crossing-over.A detailed comparison of the rDNA IGS sequences of the three diploid Brassica species-namely, B. nigra, B. campestris, and B. oleracea-was carried out. First, comparisons revealed that B. campestris and B. oleracea were close to each other as the repeat families in both showed high sequence homology between each other. Second, the repeat elements in both the species were organized in an interspersed manner. Third, a 52-bp sequence, present just downstream of the repeats in B. campestris, was found to be identical to the B. oleracea repeats, thereby suggesting a common progenitor. On the other hand, in B. nigra no interspersion pattern of organization of repeats was observed. Further, the B. nigra RF 'A' was identified as distinct from the repeat families of B. campestris and B. oleracea. Based on this analysis, it was suggested that during speciation B. campestris and B. oleracea evolved in one lineage whereas B. nigra diverged into a separate lineage. The comparative analysis of the IGS helped in identifying not only conserved ancestral sequence motifs of possible functional significance such as promoters and enhancers, but also sequences which showed variation between the three diploid species and were therefore identified as species-specific sequences.

  19. Detection and characterization of miniature inverted-repeat transposable elements in “Candidatus Liberibacter asiaticus”

    USDA-ARS?s Scientific Manuscript database

    Miniature inverted-repeat transposable elements (MITEs) are non-autonomous transposons (devoid a transposase gene, tps) involving insertion/deletion of genomic DNA in bacterial genomes influencing gene functions. No transposon has yet been reported in “Candidatus Liberibacter asiaticus”, an alpha-pr...

  20. The profile of repeat-associated histone lysine methylation states in the mouse epigenome

    PubMed Central

    Martens, Joost H A; O'Sullivan, Roderick J; Braunschweig, Ulrich; Opravil, Susanne; Radolf, Martin; Steinlein, Peter; Jenuwein, Thomas

    2005-01-01

    Histone lysine methylation has been shown to index silenced chromatin regions at, for example, pericentric heterochromatin or of the inactive X chromosome. Here, we examined the distribution of repressive histone lysine methylation states over the entire family of DNA repeats in the mouse genome. Using chromatin immunoprecipitation in a cluster analysis representing repetitive elements, our data demonstrate the selective enrichment of distinct H3-K9, H3-K27 and H4-K20 methylation marks across tandem repeats (e.g. major and minor satellites), DNA transposons, retrotransposons, long interspersed nucleotide elements and short interspersed nucleotide elements. Tandem repeats, but not the other repetitive elements, give rise to double-stranded (ds) RNAs that are further elevated in embryonic stem (ES) cells lacking the H3-K9-specific Suv39h histone methyltransferases. Importantly, although H3-K9 tri- and H4-K20 trimethylation appear stable at the satellite repeats, many of the other repeat-associated repressive marks vary in chromatin of differentiated ES cells or of embryonic trophoblasts and fibroblasts. Our data define a profile of repressive histone lysine methylation states for the repetitive complement of four distinct mouse epigenomes and suggest tandem repeats and dsRNA as primary triggers for more stable chromatin imprints. PMID:15678104

  1. Tetris Is a Foldback Transposon that Provided the Building Blocks for an Emerging Satellite DNA of Drosophila virilis

    PubMed Central

    Dias, Guilherme B.; Svartman, Marta; Delprat, Alejandra; Ruiz, Alfredo; Kuhn, Gustavo C.S.

    2014-01-01

    Transposable elements (TEs) and satellite DNAs (satDNAs) are abundant components of most eukaryotic genomes studied so far and their impact on evolution has been the focus of several studies. A number of studies linked TEs with satDNAs, but the nature of their evolutionary relationships remains unclear. During in silico analyses of the Drosophila virilis assembled genome, we found a novel DNA transposon we named Tetris based on its modular structure and diversity of rearranged forms. We aimed to characterize Tetris and investigate its role in generating satDNAs. Data mining and sequence analysis showed that Tetris is apparently nonautonomous, with a structure similar to foldback elements, and present in D. virilis and D. americana. Herein, we show that Tetris shares the final portions of its terminal inverted repeats (TIRs) with DAIBAM, a previously described miniature inverted transposable element implicated in the generation of chromosome inversions. Both elements are likely to be mobilized by the same autonomous TE. Tetris TIRs contain approximately 220-bp internal tandem repeats that we have named TIR-220. We also found TIR-220 repeats making up longer (kb-size) satDNA-like arrays. Using bioinformatic, phylogenetic and cytogenomic tools, we demonstrated that Tetris has contributed to shaping the genomes of D. virilis and D. americana, providing internal tandem repeats that served as building blocks for the amplification of satDNA arrays. The β-heterochromatic genomic environment seemed to have favored such amplification. Our results imply for the first time a role for foldback elements in generating satDNAs. PMID:24858539

  2. DNA-directed mutations. Leading and lagging strand specificity

    NASA Technical Reports Server (NTRS)

    Sinden, R. R.; Hashem, V. I.; Rosche, W. A.

    1999-01-01

    The fidelity of replication has evolved to reproduce B-form DNA accurately, while allowing a low frequency of mutation. The fidelity of replication can be compromised, however, by defined order sequence DNA (dosDNA) that can adopt unusual or non B-DNA conformations. These alternative DNA conformations, including hairpins, cruciforms, triplex DNAs, and slipped-strand structures, may affect enzyme-template interactions that potentially lead to mutations. To analyze the effect of dosDNA elements on spontaneous mutagenesis, various mutational inserts containing inverted repeats or direct repeats were cloned in a plasmid containing a unidirectional origin of replication and a selectable marker for the mutation. This system allows for analysis of mutational events that are specific for the leading or lagging strands during DNA replication in Escherichia coli. Deletions between direct repeats, involving misalignment stabilized by DNA secondary structure, occurred preferentially on the lagging strand. Intermolecular strand switch events, correcting quasipalindromes to perfect inverted repeats, occurred preferentially during replication of the leading strand.

  3. Transposon-like properties of the major, long repetitive sequence family in the genome of Physarum polycephalum

    PubMed Central

    Pearston, Douglas H.; Gordon, Mairi; Hardman, Norman

    1985-01-01

    A family of long, highly-repetitive sequences, referred to previously as `HpaII-repeats', dominates the genome of the eukaryotic slime mould Physarum polycephalum. These sequences are found exclusively in scrambled clusters. They account for about one-half of the total complement of repetitive DNA in Physarum, and represent the major sequence component found in hypermethylated, 20-50 kb segments of Physarum genomic DNA that fail to be cleaved using the restriction endonuclease HpaII. The structure of this abundant repetitive element was investigated by analysing cloned segments derived from the hypermethylated genomic DNA compartment. We show that the `HpaII-repeat' forms part of a larger repetitive DNA structure, ∼8.6 kb in length, with several structural features in common with recognised eukaryotic transposable genetic elements. Scrambled clusters of the sequence probably arise as a result of transposition-like events, during which the element preferentially recombines in either orientation with target sites located in other copies of the same repeated sequence. The target sites for transposition/recombination are not related in sequence but in all cases studied they are potentially capable of promoting the formation of small `cruciforms' or `Z-DNA' structures which might be recognised during the recombination process. ImagesFig. 3.Fig. 4. PMID:16453652

  4. A model for genesis of transcription systems.

    PubMed

    Burton, Zachary F; Opron, Kristopher; Wei, Guowei; Geiger, James H

    2016-01-01

    Repeating sequences generated from RNA gene fusions/ligations dominate ancient life, indicating central importance of building structural complexity in evolving biological systems. A simple and coherent story of life on earth is told from tracking repeating motifs that generate α/β proteins, 2-double-Ψ-β-barrel (DPBB) type RNA polymerases (RNAPs), general transcription factors (GTFs), and promoters. A general rule that emerges is that biological complexity that arises through generation of repeats is often bounded by solubility and closure (i.e., to form a pseudo-dimer or a barrel). Because the first DNA genomes were replicated by DNA template-dependent RNA synthesis followed by RNA template-dependent DNA synthesis via reverse transcriptase, the first DNA replication origins were initially 2-DPBB type RNAP promoters. A simplifying model for evolution of promoters/replication origins via repetition of core promoter elements is proposed. The model can explain why Pribnow boxes in bacterial transcription (i.e., (-12)TATAATG(-6)) so closely resemble TATA boxes (i.e., (-31)TATAAAAG(-24)) in archaeal/eukaryotic transcription. The evolution of anchor DNA sequences in bacterial (i.e., (-35)TTGACA(-30)) and archaeal (BRE(up); BRE for TFB recognition element) promoters is potentially explained. The evolution of BRE(down) elements of archaeal promoters is potentially explained.

  5. Telomere and ribosomal DNA repeats are chromosomal targets of the bloom syndrome DNA helicase

    PubMed Central

    Schawalder, James; Paric, Enesa; Neff, Norma F

    2003-01-01

    Background Bloom syndrome is one of the most cancer-predisposing disorders and is characterized by genomic instability and a high frequency of sister chromatid exchange. The disorder is caused by loss of function of a 3' to 5' RecQ DNA helicase, BLM. The exact role of BLM in maintaining genomic integrity is not known but the helicase has been found to associate with several DNA repair complexes and some DNA replication foci. Results Chromatin immunoprecipitation of BLM complexes recovered telomere and ribosomal DNA repeats. The N-terminus of BLM, required for NB localization, is the same as the telomere association domain of BLM. The C-terminus is required for ribosomal DNA localization. BLM localizes primarily to the non-transcribed spacer region of the ribosomal DNA repeat where replication forks initiate. Bloom syndrome cells expressing the deletion alleles lacking the ribosomal DNA and telomere association domains have altered cell cycle populations with increased S or G2/M cells relative to normal. Conclusion These results identify telomere and ribosomal DNA repeated sequence elements as chromosomal targets for the BLM DNA helicase during the S/G2 phase of the cell cycle. BLM is localized in nuclear bodies when it associates with telomeric repeats in both telomerase positive and negative cells. The BLM DNA helicase participates in genomic stability at ribosomal DNA repeats and telomeres. PMID:14577841

  6. Tetris is a foldback transposon that provided the building blocks for an emerging satellite DNA of Drosophila virilis.

    PubMed

    Dias, Guilherme B; Svartman, Marta; Delprat, Alejandra; Ruiz, Alfredo; Kuhn, Gustavo C S

    2014-05-24

    Transposable elements (TEs) and satellite DNAs (satDNAs) are abundant components of most eukaryotic genomes studied so far and their impact on evolution has been the focus of several studies. A number of studies linked TEs with satDNAs, but the nature of their evolutionary relationships remains unclear. During in silico analyses of the Drosophila virilis assembled genome, we found a novel DNA transposon we named Tetris based on its modular structure and diversity of rearranged forms. We aimed to characterize Tetris and investigate its role in generating satDNAs. Data mining and sequence analysis showed that Tetris is apparently nonautonomous, with a structure similar to foldback elements, and present in D. virilis and D. americana. Herein, we show that Tetris shares the final portions of its terminal inverted repeats (TIRs) with DAIBAM, a previously described miniature inverted transposable element implicated in the generation of chromosome inversions. Both elements are likely to be mobilized by the same autonomous TE. Tetris TIRs contain approximately 220-bp internal tandem repeats that we have named TIR-220. We also found TIR-220 repeats making up longer (kb-size) satDNA-like arrays. Using bioinformatic, phylogenetic and cytogenomic tools, we demonstrated that Tetris has contributed to shaping the genomes of D. virilis and D. americana, providing internal tandem repeats that served as building blocks for the amplification of satDNA arrays. The β-heterochromatic genomic environment seemed to have favored such amplification. Our results imply for the first time a role for foldback elements in generating satDNAs. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. Transcriptional activation of short interspersed elements by DNA-damaging agents.

    PubMed

    Rudin, C M; Thompson, C B

    2001-01-01

    Short interspersed elements (SINEs), typified by the human Alu repeat, are RNA polymerase III (pol III)-transcribed sequences that replicate within the genome through an RNA intermediate. Replication of SINEs has been extensive in mammalian evolution: an estimated 5% of the human genome consists of Alu repeats. The mechanisms regulating transcription, reverse transcription, and reinsertion of SINE elements in genomic DNA are poorly understood. Here we report that expression of murine SINE transcripts of both the B1 and B2 classes is strongly upregulated after prolonged exposure to cisplatin, etoposide, or gamma radiation. A similar induction of Alu transcripts in human cells occurs under these conditions. This induction is not due to a general upregulation of pol III activity in either species. Genotoxic treatment of murine cells containing an exogenous human Alu element induced Alu transcription. Concomitant with the increased expression of SINEs, an increase in cellular reverse transcriptase was observed after exposure to these same DNA-damaging agents. These findings suggest that genomic damage may be an important activator of SINEs, and that SINE mobility may contribute to secondary malignancy after exposure to DNA-damaging chemotherapy.

  8. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, H.U.G.; Gray, J.W.

    1995-06-27

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.

  9. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, Heinz-Ulrich G.; Gray, Joe W.

    1995-01-01

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers ard probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.

  10. Expanded CAG/CTG Repeat DNA Induces a Checkpoint Response That Impacts Cell Proliferation in Saccharomyces cerevisiae

    PubMed Central

    Sundararajan, Rangapriya; Freudenreich, Catherine H.

    2011-01-01

    Repetitive DNA elements are mutational hotspots in the genome, and their instability is linked to various neurological disorders and cancers. Although it is known that expanded trinucleotide repeats can interfere with DNA replication and repair, the cellular response to these events has not been characterized. Here, we demonstrate that an expanded CAG/CTG repeat elicits a DNA damage checkpoint response in budding yeast. Using microcolony and single cell pedigree analysis, we found that cells carrying an expanded CAG repeat frequently experience protracted cell division cycles, persistent arrests, and morphological abnormalities. These phenotypes were further exacerbated by mutations in DSB repair pathways, including homologous recombination and end joining, implicating a DNA damage response. Cell cycle analysis confirmed repeat-dependent S phase delays and G2/M arrests. Furthermore, we demonstrate that the above phenotypes are due to the activation of the DNA damage checkpoint, since expanded CAG repeats induced the phosphorylation of the Rad53 checkpoint kinase in a rad52Δ recombination deficient mutant. Interestingly, cells mutated for the MRX complex (Mre11-Rad50-Xrs2), a central component of DSB repair which is required to repair breaks at CAG repeats, failed to elicit repeat-specific arrests, morphological defects, or Rad53 phosphorylation. We therefore conclude that damage at expanded CAG/CTG repeats is likely sensed by the MRX complex, leading to a checkpoint response. Finally, we show that repeat expansions preferentially occur in cells experiencing growth delays. Activation of DNA damage checkpoints in repeat-containing cells could contribute to the tissue degeneration observed in trinucleotide repeat expansion diseases. PMID:21437275

  11. The site-specific ribosomal DNA insertion element R1Bm belongs to a class of non-long-terminal-repeat retrotransposons.

    PubMed Central

    Xiong, Y; Eickbush, T H

    1988-01-01

    Two types of insertion elements, R1 and R2 (previously called type I and type II), are known to interrupt the 28S ribosomal genes of several insect species. In the silkmoth, Bombyx mori, each element occupies approximately 10% of the estimated 240 ribosomal DNA units, while at most only a few copies are located outside the ribosomal DNA units. We present here the complete nucleotide sequence of an R1 insertion from B. mori (R1Bm). This 5.1-kilobase element contains two overlapping open reading frames (ORFs) which together occupy 88% of its length. ORF1 is 461 amino acids in length and exhibits characteristics of retroviral gag genes. ORF2 is 1,051 amino acids in length and contains homology to reverse transcriptase-like enzymes. The analysis of 3' and 5' ends of independent isolates from the ribosomal locus supports the suggestion that R1 is still functioning as a transposable element. The precise location of the element within the genome implies that its transposition must occur with remarkable insertion sequence specificity. Comparison of the deduced amino acid sequences from six retrotransposons, R1 and R2 of B. mori, I factor and F element of Drosophila melanogaster, L1 of Mus domesticus, and Ingi of Trypanosoma brucei, reveals a relatively high level of sequence homology in the reverse transcriptase region. Like R1, these elements lack long terminal repeats. We have therefore named this class of related elements the non-long-terminal-repeat (non-LTR) retrotransposons. Images PMID:2447482

  12. In vitro selection of DNA elements highly responsive to the human T-cell lymphotropic virus type I transcriptional activator, Tax.

    PubMed

    Paca-Uccaralertkun, S; Zhao, L J; Adya, N; Cross, J V; Cullen, B R; Boros, I M; Giam, C Z

    1994-01-01

    The human T-cell lymphotropic virus type I (HTLV-I) transactivator, Tax, the ubiquitous transcriptional factor cyclic AMP (cAMP) response element-binding protein (CREB protein), and the 21-bp repeats in the HTLV-I transcriptional enhancer form a ternary nucleoprotein complex (L. J. Zhao and C. Z. Giam, Proc. Natl. Acad. Sci. USA 89:7070-7074, 1992). Using an antibody directed against the COOH-terminal region of Tax along with purified Tax and CREB proteins, we selected DNA elements bound specifically by the Tax-CREB complex in vitro. Two distinct but related groups of sequences containing the cAMP response element (CRE) flanked by long runs of G and C residues in the 5' and 3' regions, respectively, were preferentially recognized by Tax-CREB. In contrast, CREB alone binds only to CRE motifs (GNTGACG[T/C]) without neighboring G- or C-rich sequences. The Tax-CREB-selected sequences bear a striking resemblance to the 5' or 3' two-thirds of the HTLV-I 21-bp repeats and are highly inducible by Tax. Gel electrophoretic mobility shift assays, DNA transfection, and DNase I footprinting analyses indicated that the G- and C-rich sequences flanking the CRE motif are crucial for Tax-CREB-DNA ternary complex assembly and Tax transactivation but are not in direct contact with the Tax-CREB complex. These data show that Tax recruits CREB to form a multiprotein complex that specifically recognizes the viral 21-bp repeats. The expanded DNA binding specificity of Tax-CREB and the obligatory role the ternary Tax-CREB-DNA complex plays in transactivation reveal a novel mechanism for regulating the transcriptional activity of leucine zipper proteins like CREB.

  13. Repeated administration of CGP 46381, a gamma-aminobutyric acidB antagonist, and ethosuximide suppresses seizure-associated cyclic adenosine 3'5' monophosphate response element- and activator protein-1 DNA-binding activities in lethargic (lh/lh) mice.

    PubMed

    Ishige, K; Endo, H; Saito, H; Ito, Y

    2001-01-19

    To characterize seizure-associated increases in cerebral cortical and thalamic cyclic AMP responsive element (CRE)- and activator protein 1 (AP-1) DNA-binding activities in lethargic (lh/lh) mice, a genetic model of absence seizures, we examined the effects of ethosuximide and CGP 46381 on these DNA-binding activities. Repeated administration (twice a day for 5 days) of ethosuximide (200 mg/kg) or CGP 46381 (60 mg/kg) attenuated both seizure behavior and the increased DNA-binding activities, and was more effective than a single administration of these drugs. These treatments did not affect either normal behavior or basal DNA-binding activities in non-epileptic control (+/+) mice. Gel supershift assays revealed that the increased CRE-binding activity was attributable to activation of the binding activity of CREB, and that the c-Fos-c-Jun complex was a component of the increased AP-1 DNA-binding activity.

  14. Germ line insertion of mtDNA at the breakpoint junction of a reciprocal constitutional translocation.

    PubMed

    Willett-Brozick, J E; Savul, S A; Richey, L E; Baysal, B E

    2001-08-01

    Constitutional chromosomal translocations are relatively common causes of human morbidity, yet the DNA double-strand break (DSB) repair mechanisms that generate them are incompletely understood. We cloned, sequenced and analyzed the breakpoint junctions of a familial constitutional reciprocal translocation t(9;11)(p24;q23). Within the 10-kb region flanking the breakpoints, chromosome 11 had 25% repeat elements, whereas chromosome 9 had 98% repeats, 95% of which were L1-type LINE elements. The breakpoints occurred within an L1-type repeat element at 9p24 and at the 3'-end of an Alu sequence at 11q23. At the breakpoint junction of derivative chromosome 9, we discovered an unusually large 41-bp insertion, which showed 100% identity to 12S mitochondrial DNA (mtDNA) between nucleotides 896 and 936 of the mtDNA sequence. Analysis of the human genome failed to show the preexistence of the inserted sequence at normal chromosomes 9 and 11 breakpoint junctions or elsewhere in the genome, strongly suggesting that the insertion was derived from human mtDNA and captured into the junction during the DSB repair process. To our knowledge, these findings represent the first observation of spontaneous germ line insertion of modern human mtDNA sequences and suggest that DSB repair may play a role in inter-organellar gene transfer in vivo. Our findings also provide evidence for a previously unrecognized insertional mechanism in human, by which non-mobile extra-chromosomal fragments can be inserted into the genome at DSB repair junctions.

  15. Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor.

    PubMed

    Kohany, Oleksiy; Gentles, Andrew J; Hankus, Lukasz; Jurka, Jerzy

    2006-10-25

    Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Updating and maintenance of the database requires specialized tools, which we have created and made available for use with Repbase, and which may be useful as a template for other curated databases. We describe the software tools RepbaseSubmitter and Censor, which are designed to facilitate updating and screening the content of Repbase. RepbaseSubmitter is a java-based interface for formatting and annotating Repbase entries. It eliminates many common formatting errors, and automates actions such as calculation of sequence lengths and composition, thus facilitating curation of Repbase sequences. In addition, it has several features for predicting protein coding regions in sequences; searching and including Pubmed references in Repbase entries; and searching the NCBI taxonomy database for correct inclusion of species information and taxonomic position. Censor is a tool to rapidly identify repetitive elements by comparison to known repeats. It uses WU-BLAST for speed and sensitivity, and can conduct DNA-DNA, DNA-protein, or translated DNA-translated DNA searches of genomic sequence. Defragmented output includes a map of repeats present in the query sequence, with the options to report masked query sequence(s), repeat sequences found in the query, and alignments. Censor and RepbaseSubmitter are available as both web-based services and downloadable versions. They can be found at http://www.girinst.org/repbase/submission.html (RepbaseSubmitter) and http://www.girinst.org/censor/index.php (Censor).

  16. Repetitive sequence analysis and karyotyping reveals centromere-associated DNA sequences in radish (Raphanus sativus L.).

    PubMed

    He, Qunyan; Cai, Zexi; Hu, Tianhua; Liu, Huijun; Bao, Chonglai; Mao, Weihai; Jin, Weiwei

    2015-04-18

    Radish (Raphanus sativus L., 2n = 2x = 18) is a major root vegetable crop especially in eastern Asia. Radish root contains various nutritions which play an important role in strengthening immunity. Repetitive elements are primary components of the genomic sequence and the most important factors in genome size variations in higher eukaryotes. To date, studies about repetitive elements of radish are still limited. To better understand genome structure of radish, we undertook a study to evaluate the proportion of repetitive elements and their distribution in radish. We conducted genome-wide characterization of repetitive elements in radish with low coverage genome sequencing followed by similarity-based cluster analysis. Results showed that about 31% of the genome was composed of repetitive sequences. Satellite repeats were the most dominating elements of the genome. The distribution pattern of three satellite repeat sequences (CL1, CL25, and CL43) on radish chromosomes was characterized using fluorescence in situ hybridization (FISH). CL1 was predominantly located at the centromeric region of all chromosomes, CL25 located at the subtelomeric region, and CL43 was a telomeric satellite. FISH signals of two satellite repeats, CL1 and CL25, together with 5S rDNA and 45S rDNA, provide useful cytogenetic markers to identify each individual somatic metaphase chromosome. The centromere-specific histone H3 (CENH3) has been used as a marker to identify centromere DNA sequences. One putative CENH3 (RsCENH3) was characterized and cloned from radish. Its deduced amino acid sequence shares high similarities to those of the CENH3s in Brassica species. An antibody against B. rapa CENH3, specifically stained radish centromeres. Immunostaining and chromatin immunoprecipitation (ChIP) tests with anti-BrCENH3 antibody demonstrated that both the centromere-specific retrotransposon (CR-Radish) and satellite repeat (CL1) are directly associated with RsCENH3 in radish. Proportions of repetitive elements in radish were estimated and satellite repeats were the most dominating elements. Fine karyotyping analysis was established which allow us to easily identify each individual somatic metaphase chromosome. Immunofluorescence- and ChIP-based assays demonstrated the functional significance of satellite and centromere-specific retrotransposon at centromeres. Our study provides a valuable basis for future genomic studies in radish.

  17. Dynamics and biological relevance of DNA demethylation in Arabidopsis antibacterial defense.

    PubMed

    Yu, Agnès; Lepère, Gersende; Jay, Florence; Wang, Jingyu; Bapaume, Laure; Wang, Yu; Abraham, Anne-Laure; Penterman, Jon; Fischer, Robert L; Voinnet, Olivier; Navarro, Lionel

    2013-02-05

    DNA methylation is an epigenetic mark that silences transposable elements (TEs) and repeats. Whereas the establishment and maintenance of DNA methylation are relatively well understood, little is known about their dynamics and biological relevance in plant and animal innate immunity. Here, we show that some TEs are demethylated and transcriptionally reactivated during antibacterial defense in Arabidopsis. This effect is correlated with the down-regulation of key transcriptional gene silencing factors and is partly dependent on an active demethylation process. DNA demethylation restricts multiplication and vascular propagation of the bacterial pathogen Pseudomonas syringae in leaves and, accordingly, some immune-response genes, containing repeats in their promoter regions, are negatively regulated by DNA methylation. This study provides evidence that DNA demethylation is part of a plant-induced immune response, potentially acting to prime transcriptional activation of some defense genes linked to TEs/repeats.

  18. DNA topoisomerase 1α promotes transcriptional silencing of transposable elements through DNA methylation and histone lysine 9 dimethylation in Arabidopsis.

    PubMed

    Dinh, Thanh Theresa; Gao, Lei; Liu, Xigang; Li, Dongming; Li, Shengben; Zhao, Yuanyuan; O'Leary, Michael; Le, Brandon; Schmitz, Robert J; Manavella, Pablo A; Manavella, Pablo; Li, Shaofang; Weigel, Detlef; Pontes, Olga; Ecker, Joseph R; Chen, Xuemei

    2014-07-01

    RNA-directed DNA methylation (RdDM) and histone H3 lysine 9 dimethylation (H3K9me2) are related transcriptional silencing mechanisms that target transposable elements (TEs) and repeats to maintain genome stability in plants. RdDM is mediated by small and long noncoding RNAs produced by the plant-specific RNA polymerases Pol IV and Pol V, respectively. Through a chemical genetics screen with a luciferase-based DNA methylation reporter, LUCL, we found that camptothecin, a compound with anti-cancer properties that targets DNA topoisomerase 1α (TOP1α) was able to de-repress LUCL by reducing its DNA methylation and H3K9me2 levels. Further studies with Arabidopsis top1α mutants showed that TOP1α silences endogenous RdDM loci by facilitating the production of Pol V-dependent long non-coding RNAs, AGONAUTE4 recruitment and H3K9me2 deposition at TEs and repeats. This study assigned a new role in epigenetic silencing to an enzyme that affects DNA topology.

  19. Analysis of Two Cosmid Clones from Chromosome 4 of Drosophila melanogaster Reveals Two New Genes Amid an Unusual Arrangement of Repeated Sequences

    PubMed Central

    Locke, John; Podemski, Lynn; Roy, Ken; Pilgrim, David; Hodgetts, Ross

    1999-01-01

    Chromosome 4 from Drosophila melanogaster has several unusual features that distinguish it from the other chromosomes. These include a diffuse appearance in salivary gland polytene chromosomes, an absence of recombination, and the variegated expression of P-element transgenes. As part of a larger project to understand these properties, we are assembling a physical map of this chromosome. Here we report the sequence of two cosmids representing ∼5% of the polytenized region. Both cosmid clones contain numerous repeated DNA sequences, as identified by cross hybridization with labeled genomic DNA, BLAST searches, and dot matrix analysis, which are positioned between and within the transcribed sequences. The repetitive sequences include three copies of the mobile element Hoppel, one copy of the mobile element HB, and 18 DINE repeats. DINE is a novel, short repeated sequence dispersed throughout both cosmid sequences. One cosmid includes the previously described cubitus interruptus (ci) gene and two new genes: that a gene with a predicted amino acid sequence similar to ribosomal protein S3a which is consistent with the Minute(4)101 locus thought to be in the region, and a novel member of the protein family that includes plexin and met–hepatocyte growth factor receptor. The other cosmid contains only the two short 5′-most exons from the zinc-finger-homolog-2 (zfh-2) gene. This is the first extensive sequence analysis of noncoding DNA from chromosome 4. The distribution of the various repeats suggests its organization is similar to the β-heterochromatic regions near the base of the major chromosome arms. Such a pattern may account for the diffuse banding of the polytene chromosome 4 and the variegation of many P-element transgenes on the chromosome. PMID:10022978

  20. The Ecological Genomics of Fungi: Repeated Elements in Filamentous Fungi with a Focus on Wood-Decay Fungi

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Murat, Claude; Payen, Thibaut; Petitpierre, Denis

    2013-01-01

    In the last decade, the genome of several dozen filamentous fungi have been sequenced. Interestingly, vast diversity in genome size was observed (Fig. 2.1) with 14-fold differences between the 9 Mb of the human pathogenic dandruff fungus (Malassezia globosa; Xu, Saunders, et al., 2007) and the 125 Mb of the ectomycorrhizal black truffle of P rigord (Tuber melanosporum; Martin, Kohler, et al., 2010). Recently, Raffaele and Kamoun (2012) highlighted that the genomes of several lineages of filamentous plant pathogens have been shaped by repeat-driven expansion. Indeed, repeated elements are ubiquitous in all prokaryote and eukaryote genomes; however, their frequencies canmore » vary from just a minor percentage of the genome to more that 60 percent of the genome. Repeated elements can be classified in two major types: satellites DNA and transposable elements. In this chapter, the different types of repeated elements and how these elements can impact genome and gene repertoire will be described. Also, an intriguing link between the transposable elements richness and diversity and the ecological niche will be highlighted.« less

  1. DNA motifs determining the accuracy of repeat duplication during CRISPR adaptation in Haloarcula hispanica

    PubMed Central

    Wang, Rui; Li, Ming; Gong, Luyao; Hu, Songnian; Xiang, Hua

    2016-01-01

    Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) acquire new spacers to generate adaptive immunity in prokaryotes. During spacer integration, the leader-preceded repeat is always accurately duplicated, leading to speculations of a repeat-length ruler. Here in Haloarcula hispanica, we demonstrate that the accurate duplication of its 30-bp repeat requires two conserved mid-repeat motifs, AACCC and GTGGG. The AACCC motif was essential and needed to be ∼10 bp downstream from the leader-repeat junction site, where duplication consistently started. Interestingly, repeat duplication terminated sequence-independently and usually with a specific distance from the GTGGG motif, which seemingly served as an anchor site for a molecular ruler. Accordingly, altering the spacing between the two motifs led to an aberrant duplication size (29, 31, 32 or 33 bp). We propose the adaptation complex may recognize these mid-repeat elements to enable measuring the repeat DNA for spacer integration. PMID:27085805

  2. Alu repeats: A source for the genesis of primate microsatellites

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Arcot, S.S.; Batzer, M.A.; Wang, Zhenyuan

    1995-09-01

    As a result of their abundance, relatively uniform distribution, and high degree of polymorphism, microsatellites and minisatellites have become valuable tools in genetic mapping, forensic identity testing, and population studies. In recent years, a number of microsatellite repeats have been found to be associated with Alu interspersed repeated DNA elements. The association of an Alu element with a microsatellite repeat could result from the integration of an Alu element within a preexisting microsatellite repeat. Alternatively, Alu elements could have a direct role in the origin of microsatellite repeats. Errors introduced during reverse transcription of the primary transcript derived from anmore » Alu {open_quotes}master{close_quote} gene or the accumulation of random mutations in the middle A-rich regions and oligo(dA)-rich tails of Alu elements after insertion and subsequent expansion and contraction of these sequences could result in the genesis of a microsatellite repeat. We have tested these hypotheses by a direct evolutionary comparison of the sequences of some recent Alu elements that are found only in humans and are absent from nonhuman primates, as well as some older Alu elements that are present at orthologous positions in a number of nonhuman primates. The origin of {open_quotes}young{close_quotes} Alu insertions, absence of sequences that resemble microsatellite repeats at the orthologous loci in chimpanzees, and the gradual expansion of microsatellite repeats in some old Alu repeats at orthologous positions within the genomes of a number of nonhuman primates suggest that Alu elements are a source for the genesis of primate microsatellite repeats. 48 refs., 5 figs., 3 tabs.« less

  3. Variation in the genomic locations and sequence conservation of STAR elements among staphylococcal species provides insight into DNA repeat evolution

    PubMed Central

    2012-01-01

    Background Staphylococcus aureus Repeat (STAR) elements are a type of interspersed intergenic direct repeat. In this study the conservation and variation in these elements was explored by bioinformatic analyses of published staphylococcal genome sequences and through sequencing of specific STAR element loci from a large set of S. aureus isolates. Results Using bioinformatic analyses, we found that the STAR elements were located in different genomic loci within each staphylococcal species. There was no correlation between the number of STAR elements in each genome and the evolutionary relatedness of staphylococcal species, however higher levels of repeats were observed in both S. aureus and S. lugdunensis compared to other staphylococcal species. Unexpectedly, sequencing of the internal spacer sequences of individual repeat elements from multiple isolates showed conservation at the sequence level within deep evolutionary lineages of S. aureus. Whilst individual STAR element loci were demonstrated to expand and contract, the sequences associated with each locus were stable and distinct from one another. Conclusions The high degree of lineage and locus-specific conservation of these intergenic repeat regions suggests that STAR elements are maintained due to selective or molecular forces with some of these elements having an important role in cell physiology. The high prevalence in two of the more virulent staphylococcal species is indicative of a potential role for STAR elements in pathogenesis. PMID:23020678

  4. Solution properties of the archaeal CRISPR DNA repeat-binding homeodomain protein Cbp2

    PubMed Central

    Kenchappa, Chandra S.; Heidarsson, Pétur O.; Kragelund, Birthe B.; Garrett, Roger A.; Poulsen, Flemming M.

    2013-01-01

    Clustered regularly interspaced short palindromic repeats (CRISPR) form the basis of diverse adaptive immune systems directed primarily against invading genetic elements of archaea and bacteria. Cbp1 of the crenarchaeal thermoacidophilic order Sulfolobales, carrying three imperfect repeats, binds specifically to CRISPR DNA repeats and has been implicated in facilitating production of long transcripts from CRISPR loci. Here, a second related class of CRISPR DNA repeat-binding protein, denoted Cbp2, is characterized that contains two imperfect repeats and is found amongst members of the crenarchaeal thermoneutrophilic order Desulfurococcales. DNA repeat-binding properties of the Hyperthermus butylicus protein Cbp2Hb were characterized and its three-dimensional structure was determined by NMR spectroscopy. The two repeats generate helix-turn-helix structures separated by a basic linker that is implicated in facilitating high affinity DNA binding of Cbp2 by tethering the two domains. Structural studies on mutant proteins provide support for Cys7 and Cys28 enhancing high thermal stability of Cbp2Hb through disulphide bridge formation. Consistent with their proposed CRISPR transcriptional regulatory role, Cbp2Hb and, by inference, other Cbp1 and Cbp2 proteins are closely related in structure to homeodomain proteins with linked helix-turn-helix (HTH) domains, in particular the paired domain Pax and Myb family proteins that are involved in eukaryal transcriptional regulation. PMID:23325851

  5. The SIDER2 elements, interspersed repeated sequences that populate the Leishmania genomes, constitute subfamilies showing chromosomal proximity relationship.

    PubMed

    Requena, Jose M; Folgueira, Cristina; López, Manuel C; Thomas, M Carmen

    2008-06-02

    Protozoan parasites of the genus Leishmania are causative agents of a diverse spectrum of human diseases collectively known as leishmaniasis. These eukaryotic pathogens that diverged early from the main eukaryotic lineage possess a number of unusual genomic, molecular and biochemical features. The completion of the genome projects for three Leishmania species has generated invaluable information enabling a direct analysis of genome structure and organization. By using DNA macroarrays, made with Leishmania infantum genomic clones and hybridized with total DNA from the parasite, we identified a clone containing a repeated sequence. An analysis of the recently completed genome sequence of L. infantum, using this repeated sequence as bait, led to the identification of a new class of repeated elements that are interspersed along the different L. infantum chromosomes. These elements turned out to be homologues of SIDER2 sequences, which were recently identified in the Leishmania major genome; thus, we adopted this nomenclature for the Leishmania elements described herein. Since SIDER2 elements are very heterogeneous in sequence, their precise identification is rather laborious. We have characterized 54 LiSIDER2 elements in chromosome 32 and 27 ones in chromosome 20. The mean size for these elements is 550 bp and their sequence is G+C rich (mean value of 66.5%). On the basis of sequence similarity, these elements can be grouped in subfamilies that show a remarkable relationship of proximity, i.e. SIDER2s of a given subfamily locate close in a chromosomal region without intercalating elements. For comparative purposes, we have identified the SIDER2 elements existing in L. major and Leishmania braziliensis chromosomes 32. While SIDER2 elements are highly conserved both in number and location between L. infantum and L. major, no such conservation exists when comparing with SIDER2s in L. braziliensis chromosome 32. SIDER2 elements constitute a relevant piece in the Leishmania genome organization. Sequence characteristics, genomic distribution and evolutionarily conservation of SIDER2s are suggestive of relevant functions for these elements in Leishmania. Apart from a proved involvement in post-transcriptional mechanisms of gene regulation, SIDER2 elements could be involved in DNA amplification processes and, perhaps, in chromosome segregation as centromeric sequences.

  6. DNA is structured as a linear "jigsaw puzzle" in the genomes of Arabidopsis, rice, and budding yeast.

    PubMed

    Liu, Yun-Hua; Zhang, Meiping; Wu, Chengcang; Huang, James J; Zhang, Hong-Bin

    2014-01-01

    Knowledge of how a genome is structured and organized from its constituent elements is crucial to understanding its biology and evolution. Here, we report the genome structuring and organization pattern as revealed by systems analysis of the sequences of three model species, Arabidopsis, rice and yeast, at the whole-genome and chromosome levels. We found that all fundamental function elements (FFE) constituting the genomes, including genes (GEN), DNA transposable elements (DTE), retrotransposable elements (RTE), simple sequence repeats (SSR), and (or) low complexity repeats (LCR), are structured in a nonrandom and correlative manner, thus leading to a hypothesis that the DNA of the species is structured as a linear "jigsaw puzzle". Furthermore, we showed that different FFE differ in their importance in the formation and evolution of the DNA jigsaw puzzle structure between species. DTE and RTE play more important roles than GEN, LCR, and SSR in Arabidopsis, whereas GEN and RTE play more important roles than LCR, SSR, and DTE in rice. The genes having multiple recognized functions play more important roles than those having single functions. These results provide useful knowledge necessary for better understanding genome biology and evolution of the species and for effective molecular breeding of rice.

  7. Selfish DNA in protein-coding genes of Rickettsia.

    PubMed

    Ogata, H; Audic, S; Barbe, V; Artiguenave, F; Fournier, P E; Raoult, D; Claverie, J M

    2000-10-13

    Rickettsia conorii, the aetiological agent of Mediterranean spotted fever, is an intracellular bacterium transmitted by ticks. Preliminary analyses of the nearly complete genome sequence of R. conorii have revealed 44 occurrences of a previously undescribed palindromic repeat (150 base pairs long) throughout the genome. Unexpectedly, this repeat was found inserted in-frame within 19 different R. conorii open reading frames likely to encode functional proteins. We found the same repeat in proteins of other Rickettsia species. The finding of a mobile element inserted in many unrelated genes suggests the potential role of selfish DNA in the creation of new protein sequences.

  8. Repetitive DNA in the pea (Pisum sativum L.) genome: comprehensive characterization using 454 sequencing and comparison to soybean and Medicago truncatula

    PubMed Central

    Macas, Jiří; Neumann, Pavel; Navrátilová, Alice

    2007-01-01

    Background Extraordinary size variation of higher plant nuclear genomes is in large part caused by differences in accumulation of repetitive DNA. This makes repetitive DNA of great interest for studying the molecular mechanisms shaping architecture and function of complex plant genomes. However, due to methodological constraints of conventional cloning and sequencing, a global description of repeat composition is available for only a very limited number of higher plants. In order to provide further data required for investigating evolutionary patterns of repeated DNA within and between species, we used a novel approach based on massive parallel sequencing which allowed a comprehensive repeat characterization in our model species, garden pea (Pisum sativum). Results Analysis of 33.3 Mb sequence data resulted in quantification and partial sequence reconstruction of major repeat families occurring in the pea genome with at least thousands of copies. Our results showed that the pea genome is dominated by LTR-retrotransposons, estimated at 140,000 copies/1C. Ty3/gypsy elements are less diverse and accumulated to higher copy numbers than Ty1/copia. This is in part due to a large population of Ogre-like retrotransposons which alone make up over 20% of the genome. In addition to numerous types of mobile elements, we have discovered a set of novel satellite repeats and two additional variants of telomeric sequences. Comparative genome analysis revealed that there are only a few repeat sequences conserved between pea and soybean genomes. On the other hand, all major families of pea mobile elements are well represented in M. truncatula. Conclusion We have demonstrated that even in a species with a relatively large genome like pea, where a single 454-sequencing run provided only 0.77% coverage, the generated sequences were sufficient to reconstruct and analyze major repeat families corresponding to a total of 35–48% of the genome. These data provide a starting point for further investigations of legume plant genomes based on their global comparative analysis and for the development of more sophisticated approaches for data mining. PMID:18031571

  9. Analysis of the giant genomes of Fritillaria (Liliaceae) indicates that a lack of DNA removal characterizes extreme expansions in genome size.

    PubMed

    Kelly, Laura J; Renny-Byfield, Simon; Pellicer, Jaume; Macas, Jiří; Novák, Petr; Neumann, Pavel; Lysak, Martin A; Day, Peter D; Berger, Madeleine; Fay, Michael F; Nichols, Richard A; Leitch, Andrew R; Leitch, Ilia J

    2015-10-01

    Plants exhibit an extraordinary range of genome sizes, varying by > 2000-fold between the smallest and largest recorded values. In the absence of polyploidy, changes in the amount of repetitive DNA (transposable elements and tandem repeats) are primarily responsible for genome size differences between species. However, there is ongoing debate regarding the relative importance of amplification of repetitive DNA versus its deletion in governing genome size. Using data from 454 sequencing, we analysed the most repetitive fraction of some of the largest known genomes for diploid plant species, from members of Fritillaria. We revealed that genomic expansion has not resulted from the recent massive amplification of just a handful of repeat families, as shown in species with smaller genomes. Instead, the bulk of these immense genomes is composed of highly heterogeneous, relatively low-abundance repeat-derived DNA, supporting a scenario where amplified repeats continually accumulate due to infrequent DNA removal. Our results indicate that a lack of deletion and low turnover of repetitive DNA are major contributors to the evolution of extremely large genomes and show that their size cannot simply be accounted for by the activity of a small number of high-abundance repeat families. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  10. Analysis of the platypus genome suggests a transposon origin for mammalian imprinting.

    PubMed

    Pask, Andrew J; Papenfuss, Anthony T; Ager, Eleanor I; McColl, Kaighin A; Speed, Terence P; Renfree, Marilyn B

    2009-01-01

    Genomic imprinting is an epigenetic phenomenon that results in monoallelic gene expression. Many hypotheses have been advanced to explain why genomic imprinting evolved in mammals, but few have examined how it arose. The host defence hypothesis suggests that imprinting evolved from existing mechanisms within the cell that act to silence foreign DNA elements that insert into the genome. However, the changes to the mammalian genome that accompanied the evolution of imprinting have been hard to define due to the absence of large scale genomic resources between all extant classes. The recent release of the platypus genome has provided the first opportunity to perform comparisons between prototherian (monotreme; which appear to lack imprinting) and therian (marsupial and eutherian; which have imprinting) mammals. We compared the distribution of repeat elements known to attract epigenetic silencing across the entire genome from monotremes and therian mammals, particularly focusing on the orthologous imprinted regions. There is a significant accumulation of certain repeat elements within imprinted regions of therian mammals compared to the platypus. Our analyses show that the platypus has significantly fewer repeats of certain classes in the regions of the genome that have become imprinted in therian mammals. The accumulation of repeats, especially long terminal repeats and DNA elements, in therian imprinted genes and gene clusters is coincident with, and may have been a potential driving force in, the development of mammalian genomic imprinting. These data provide strong support for the host defence hypothesis.

  11. Analysis of the platypus genome suggests a transposon origin for mammalian imprinting

    PubMed Central

    Pask, Andrew J; Papenfuss, Anthony T; Ager, Eleanor I; McColl, Kaighin A; Speed, Terence P; Renfree, Marilyn B

    2009-01-01

    Background Genomic imprinting is an epigenetic phenomenon that results in monoallelic gene expression. Many hypotheses have been advanced to explain why genomic imprinting evolved in mammals, but few have examined how it arose. The host defence hypothesis suggests that imprinting evolved from existing mechanisms within the cell that act to silence foreign DNA elements that insert into the genome. However, the changes to the mammalian genome that accompanied the evolution of imprinting have been hard to define due to the absence of large scale genomic resources between all extant classes. The recent release of the platypus genome has provided the first opportunity to perform comparisons between prototherian (monotreme; which appear to lack imprinting) and therian (marsupial and eutherian; which have imprinting) mammals. Results We compared the distribution of repeat elements known to attract epigenetic silencing across the entire genome from monotremes and therian mammals, particularly focusing on the orthologous imprinted regions. There is a significant accumulation of certain repeat elements within imprinted regions of therian mammals compared to the platypus. Conclusions Our analyses show that the platypus has significantly fewer repeats of certain classes in the regions of the genome that have become imprinted in therian mammals. The accumulation of repeats, especially long terminal repeats and DNA elements, in therian imprinted genes and gene clusters is coincident with, and may have been a potential driving force in, the development of mammalian genomic imprinting. These data provide strong support for the host defence hypothesis. PMID:19121219

  12. A SHORT SEQUENCE IMMEDIATELY UPSTREAM OF THE INTERNAL REPEAT ELEMENTS IS CRITICAL FOR KSHV LANA MEDIATED DNA REPLICATION AND IMPACTS EPISOME PERSISTENCE

    PubMed Central

    León Vázquez, Erika De; Juillard, Franceline; Rosner, Bernard; Kaye, Kenneth M.

    2013-01-01

    Kaposi’s sarcoma-associated herpesvirus LANA (1162 residues) mediates episomal persistence of viral genomes during latency. LANA mediates viral DNA replication and segregates episomes to daughter nuclei. A 59 residue deletion immediately upstream of the internal repeat elements rendered LANA highly deficient for DNA replication and modestly deficient for the ability to segregate episomes, while smaller deletions did not. The 59 amino acid deletion reduced LANA episome persistence by ~14-fold, while sequentially smaller deletions resulted in ~3-fold, or no deficiency. Three distinct LANA regions reorganized heterochromatin, one of which contains the deleted sequence, but the deletion did not abolish LANA’s ability to alter chromatin. Therefore, this work identifies a short internal LANA sequence that is critical for DNA replication, has modest effects on episome segregation, and substantially impacts episome persistence; this region may exert its effects through an interacting host cell protein(s). PMID:24314665

  13. High-throughput analysis of the satellitome illuminates satellite DNA evolution

    NASA Astrophysics Data System (ADS)

    Ruiz-Ruano, Francisco J.; López-León, María Dolores; Cabrero, Josefa; Camacho, Juan Pedro M.

    2016-07-01

    Satellite DNA (satDNA) is a major component yet the great unknown of eukaryote genomes and clearly underrepresented in genome sequencing projects. Here we show the high-throughput analysis of satellite DNA content in the migratory locust by means of the bioinformatic analysis of Illumina reads with the RepeatExplorer and RepeatMasker programs. This unveiled 62 satDNA families and we propose the term “satellitome” for the whole collection of different satDNA families in a genome. The finding that satDNAs were present in many contigs of the migratory locust draft genome indicates that they show many genomic locations invisible by fluorescent in situ hybridization (FISH). The cytological pattern of five satellites showing common descent (belonging to the SF3 superfamily) suggests that non-clustered satDNAs can become into clustered through local amplification at any of the many genomic loci resulting from previous dissemination of short satDNA arrays. The fact that all kinds of satDNA (micro- mini- and satellites) can show the non-clustered and clustered states suggests that all these elements are mostly similar, except for repeat length. Finally, the presence of VNTRs in bacteria, showing similar properties to non-clustered satDNAs in eukaryotes, suggests that this kind of tandem repeats show common properties in all living beings.

  14. Evidence of birth-and-death evolution of 5S rRNA gene in Channa species (Teleostei, Perciformes).

    PubMed

    Barman, Anindya Sundar; Singh, Mamta; Singh, Rajeev Kumar; Lal, Kuldeep Kumar

    2016-12-01

    In higher eukaryotes, minor rDNA family codes for 5S rRNA that is arranged in tandem arrays and comprises of a highly conserved 120 bp long coding sequence with a variable non-transcribed spacer (NTS). Initially the 5S rDNA repeats are considered to be evolved by the process of concerted evolution. But some recent reports, including teleost fishes suggested that evolution of 5S rDNA repeat does not fit into the concerted evolution model and evolution of 5S rDNA family may be explained by a birth-and-death evolution model. In order to study the mode of evolution of 5S rDNA repeats in Perciformes fish species, nucleotide sequence and molecular organization of five species of genus Channa were analyzed in the present study. Molecular analyses revealed several variants of 5S rDNA repeats (four types of NTS) and networks created by a neighbor net algorithm for each type of sequences (I, II, III and IV) did not show a clear clustering in species specific manner. The stable secondary structure is predicted and upstream and downstream conserved regulatory elements were characterized. Sequence analyses also shown the presence of two putative pseudogenes in Channa marulius. Present study supported that 5S rDNA repeats in genus Channa were evolved under the process of birth-and-death.

  15. Identification and chromosome mapping of repetitive elements in the Astyanax scabripinnis (Teleostei: Characidae) species complex.

    PubMed

    Barbosa, Patrícia; de Oliveira, Luiz Antonio; Pucci, Marcela Baer; Santos, Mateus Henrique; Moreira-Filho, Orlando; Vicari, Marcelo Ricardo; Nogaroto, Viviane; de Almeida, Mara Cristina; Artoni, Roberto Ferreira

    2015-02-01

    Most part of the eukaryotic genome is composed of repeated sequences or multiple copies of DNA, which were considered as "junk DNA", and may be associated to the heterochromatin. In this study, three populations of Astyanax aff. scabripinnis from Brazilian rivers of Guaratinguetá and Pindamonhangaba (São Paulo) and a population from Maringá (Paraná) were analyzed concerning the localization of the nucleolar organizer regions (Ag-NORs), the As51 satellite DNA, the 18S ribosomal DNA (rDNA), and the 5S rDNA. Repeated sequences were also isolated and identified by the Cot - 1 method, which indicated similarity (90%) with the LINE UnaL2 retrotransposon. The fluorescence in situ hybridization (FISH) showed the retrotransposon dispersed and more concentrated markers in centromeric and telomeric chromosomal regions. These sequences were co-localized and interspaced with 18S and 5S rDNA and As51, confirmed by fiber-FISH essay. The B chromosome found in these populations pointed to a conspicuous hybridization with LINE probe, which is also co-located in As51 sequences. The NORs were active at unique sites of a homologous pair in the three populations. There were no evidences that transposable elements and repetitive DNA had influence in the transcriptional regulation of ribosomal genes in our analyses.

  16. Heterochromatin and molecular characterization of DsmarMITE transposable element in the beetle Dichotomius schiffleri (Coleoptera: Scarabaeidae).

    PubMed

    Xavier, Crislaine; Cabral-de-Mello, Diogo Cavalcanti; de Moura, Rita Cássia

    2014-12-01

    Cytogenetic studies of the Neotropical beetle genus Dichotomius (Scarabaeinae, Coleoptera) have shown dynamism for centromeric constitutive heterochromatin sequences. In the present work we studied the chromosomes and isolated repetitive sequences of Dichotomius schiffleri aiming to contribute to the understanding of coleopteran genome/chromosomal organization. Dichotomius schiffleri presented a conserved karyotype and heterochromatin distribution in comparison to other species of the genus with 2n = 18, biarmed chromosomes, and pericentromeric C-positive blocks. Similarly to heterochromatin distributional patterns, the highly and moderately repetitive DNA fraction (C 0 t-1 DNA) was detected in pericentromeric areas, contrasting with the euchromatic mapping of an isolated TE (named DsmarMITE). After structural analyses, the DsmarMITE was classified as a non-autonomous element of the type miniature inverted-repeat transposable element (MITE) with terminal inverted repeats similar to Mariner elements of insects from different orders. The euchromatic distribution for DsmarMITE indicates that it does not play a part in the dynamics of constitutive heterochromatin sequences.

  17. Unrelated sequences at the 5' end of mouse LINE-1 repeated elements define two distinct subfamilies.

    PubMed Central

    Wincker, P; Jubier-Maurin, V; Roizès, G

    1987-01-01

    Some full length members of the mouse long interspersed repeated DNA family L1Md have been shown to be associated at their 5' end with a variable number of tandem repetitions, the A repeats, that have been suggested to be transcription controlling elements. We report that the other type of repeat, named F, found at the 5' end of a few L1 elements is also an integral part of full length L1 copies. Sequencing shows that the F repeats are GC rich, and organized in tandem. The L1 copies associated with either A or F repeats can be correlated with two different subsets of L1 sequences distinguished by a series of variant nucleotides specific to each and by unassociated but frequent restriction sites. These findings suggest that sequence replacement has occurred at least once in 5' of L1Md, and is related to the generation of specific subfamilies. Images PMID:3684566

  18. Altered Methylation in Tandem Repeat Element and Elemental Component Levels in Inhalable Air Particles

    PubMed Central

    Hou, Lifang; Zhang, Xiao; Zheng, Yinan; Wang, Sheng; Dou, Chang; Guo, Liqiong; Byun, Hyang-Min; Motta, Valeria; McCracken, John; Díaz, Anaité; Kang, Choong-Min; Koutrakis, Petros; Bertazzi, Pier Alberto; Li, Jingyun; Schwartz, Joel; Baccarelli, Andrea A.

    2014-01-01

    Exposure to particulate matter (PM) has been associated with lung cancer risk in epidemiology investigations. Elemental components of PM have been suggested to have critical roles in PM toxicity, but the molecular mechanisms underlying their association with cancer risks remain poorly understood. DNA methylation has emerged as a promising biomarker for environmental-related diseases, including lung cancer. In this study, we evaluated the effects of PM elemental components on methylation of three tandem repeats in a highly-exposed population in Beijing, China. The Beijing Truck Driver Air Pollution Study was conducted shortly before the 2008 Beijing Olympic Games (June 15-July 27, 2008) and included 60 truck drivers and 60 office workers. On two days separated by 1-2 weeks, we measured blood DNA methylation of SATα, NBL2, D4Z4, and personal exposure to eight elemental components in PM2.5, including aluminum (Al), silicon (Si), sulfur (S), potassium (K), calcium (Ca) titanium (Ti), iron (Fe), and zinc (Zn). We estimated the associations of individual elemental component with each tandem repeat methylation in generalized estimating equations (GEE) models adjusted for PM2.5 mass and other covariates. Out of the eight examined elements, NBL2 methylation was positively associated with concentrations of Si (0.121, 95%CI: 0.030; 0.212, FDR=0.047) and Ca (0.065, 95%CI: 0.014; 0.115, FDR=0.047) in truck drivers. In office workers, SATα methylation was positively associated with concentrations of S (0.115, 95%CI: 0.034; 0.196, FDR=0.042). PM-associated differences in blood tandem-repeat methylation may help detect biological effects of the exposure and identify individuals who may eventually experience higher lung cancer risk. PMID:24273195

  19. Repetitive DNA loci and their modulation by the non-canonical nucleic acid structures R-loops and G-quadruplexes

    PubMed Central

    Hall, Amanda C.; Ostrowski, Lauren A.; Mekhail, Karim

    2017-01-01

    ABSTRACT Cells have evolved intricate mechanisms to maintain genome stability despite allowing mutational changes to drive evolutionary adaptation. Repetitive DNA sequences, which represent the bulk of most genomes, are a major threat to genome stability often driving chromosome rearrangements and disease. The major source of repetitive DNA sequences and thus the most vulnerable constituents of the genome are the rDNA (rDNA) repeats, telomeres, and transposable elements. Maintaining the stability of these loci is critical to overall cellular fitness and lifespan. Therefore, cells have evolved mechanisms to regulate rDNA copy number, telomere length and transposon activity, as well as DNA repair at these loci. In addition, non-canonical structure-forming DNA motifs can also modulate the function of these repetitive DNA loci by impacting their transcription, replication, and stability. Here, we discuss key mechanisms that maintain rDNA repeats, telomeres, and transposons in yeast and human before highlighting emerging roles for non-canonical DNA structures at these repetitive loci. PMID:28406751

  20. Molecular structure and chromosome distribution of three repetitive DNA families in Anemone hortensis L. (Ranunculaceae).

    PubMed

    Mlinarec, Jelena; Chester, Mike; Siljak-Yakovlev, Sonja; Papes, Drazena; Leitch, Andrew R; Besendorfer, Visnja

    2009-01-01

    The structure, abundance and location of repetitive DNA sequences on chromosomes can characterize the nature of higher plant genomes. Here we report on three new repeat DNA families isolated from Anemone hortensis L.; (i) AhTR1, a family of satellite DNA (stDNA) composed of a 554-561 bp long EcoRV monomer; (ii) AhTR2, a stDNA family composed of a 743 bp long HindIII monomer and; (iii) AhDR, a repeat family composed of a 945 bp long HindIII fragment that exhibits some sequence similarity to Ty3/gypsy-like retroelements. Fluorescence in-situ hybridization (FISH) to metaphase chromosomes of A. hortensis (2n = 16) revealed that both AhTR1 and AhTR2 sequences co-localized with DAPI-positive AT-rich heterochromatic regions. AhTR1 sequences occur at intercalary DAPI bands while AhTR2 sequences occur at 8-10 terminally located heterochromatic blocks. In contrast AhDR sequences are dispersed over all chromosomes as expected of a Ty3/gypsy-like element. AhTR2 and AhTR1 repeat families include polyA- and polyT-tracks, AT/TA-motifs and a pentanucleotide sequence (CAAAA) that may have consequences for chromatin packing and sequence homogeneity. AhTR2 repeats also contain TTTAGGG motifs and degenerate variants. We suggest that they arose by interspersion of telomeric repeats with subtelomeric repeats, before hybrid unit(s) amplified through the heterochromatic domain. The three repetitive DNA families together occupy approximately 10% of the A. hortensis genome. Comparative analyses of eight Anemone species revealed that the divergence of the A. hortensis genome was accompanied by considerable modification and/or amplification of repeats.

  1. Comparative genomics and repetitive sequence divergence in the species of diploid Nicotiana section Alatae.

    PubMed

    Lim, K Yoong; Kovarik, Ales; Matyasek, Roman; Chase, Mark W; Knapp, Sandra; McCarthy, Elizabeth; Clarkson, James J; Leitch, Andrew R

    2006-12-01

    Combining phylogenetic reconstructions of species relationships with comparative genomic approaches is a powerful way to decipher evolutionary events associated with genome divergence. Here, we reconstruct the history of karyotype and tandem repeat evolution in species of diploid Nicotiana section Alatae. By analysis of plastid DNA, we resolved two clades with high bootstrap support, one containing N. alata, N. langsdorffii, N. forgetiana and N. bonariensis (called the n = 9 group) and another containing N. plumbaginifolia and N. longiflora (called the n = 10 group). Despite little plastid DNA sequence divergence, we observed, via fluorescent in situ hybridization, substantial chromosomal repatterning, including altered chromosome numbers, structure and distribution of repeats. Effort was focussed on 35S and 5S nuclear ribosomal DNA (rDNA) and the HRS60 satellite family of tandem repeats comprising the elements HRS60, NP3R and NP4R. We compared divergence of these repeats in diploids and polyploids of Nicotiana. There are dramatic shifts in the distribution of the satellite repeats and complete replacement of intergenic spacers (IGSs) of 35S rDNA associated with divergence of the species in section Alatae. We suggest that sequence homogenization has replaced HRS60 family repeats at sub-telomeric regions, but that this process may not occur, or occurs more slowly, when the repeats are found at intercalary locations. Sequence homogenization acts more rapidly (at least two orders of magnitude) on 35S rDNA than 5S rDNA and sub-telomeric satellite sequences. This rapid rate of divergence is analogous to that found in polyploid species, and is therefore, in plants, not only associated with polyploidy.

  2. Sunflower centromeres consist of a centromere-specific LINE and a chromosome-specific tandem repeat.

    PubMed

    Nagaki, Kiyotaka; Tanaka, Keisuke; Yamaji, Naoki; Kobayashi, Hisato; Murata, Minoru

    2015-01-01

    The kinetochore is a protein complex including kinetochore-specific proteins that plays a role in chromatid segregation during mitosis and meiosis. The complex associates with centromeric DNA sequences that are usually species-specific. In plant species, tandem repeats including satellite DNA sequences and retrotransposons have been reported as centromeric DNA sequences. In this study on sunflowers, a cDNA-encoding centromere-specific histone H3 (CENH3) was isolated from a cDNA pool from a seedling, and an antibody was raised against a peptide synthesized from the deduced cDNA. The antibody specifically recognized the sunflower CENH3 (HaCENH3) and showed centromeric signals by immunostaining and immunohistochemical staining analysis. The antibody was also applied in chromatin immunoprecipitation (ChIP)-Seq to isolate centromeric DNA sequences and two different types of repetitive DNA sequences were identified. One was a long interspersed nuclear element (LINE)-like sequence, which showed centromere-specific signals on almost all chromosomes in sunflowers. This is the first report of a centromeric LINE sequence, suggesting possible centromere targeting ability. Another type of identified repetitive DNA was a tandem repeat sequence with a 187-bp unit that was found only on a pair of chromosomes. The HaCENH3 content of the tandem repeats was estimated to be much higher than that of the LINE, which implies centromere evolution from LINE-based centromeres to more stable tandem-repeat-based centromeres. In addition, the epigenetic status of the sunflower centromeres was investigated by immunohistochemical staining and ChIP, and it was found that centromeres were heterochromatic.

  3. SINE sequences detect DNA fingerprints in salmonid fishes.

    PubMed

    Spruell, P; Thorgaard, G H

    1996-04-01

    DNA probes homologous to two previously described salmonid short interspersed nuclear elements (SINEs) detected DNA fingerprint patterns in 14 species of salmonid fishes. The probes showed more homology to some species than to others and little homology to three nonsalmonid fishes. The DNA fingerprint patterns derived from the SINE probes are individual-specific and inherited in a Mendelian manner. Probes derived from different regions of the same SINE detect only partially overlapping banding patterns, reflecting a more complex SINE structure than has been previously reported. Like the human Alu sequence, the SINEs found in salmonids could provide useful genetic markers and primer sites for PCR-based techniques. These elements may be more desirable for some applications than traditional DNA fingerprinting probes that detect tandemly repeated arrays.

  4. Grasshopper, a long terminal repeat (LTR) retroelement in the phytopathogenic fungus Magnaporthe grisea.

    PubMed

    Dobinson, K F; Harris, R E; Hamer, J E

    1993-01-01

    The fungal phytopathogen Magnaporthe grisea parasitizes a wide variety of gramineous hosts. In the course of investigating the genetic relationship between pathogen genotype and host specificity we identified a retroelement that is present in some strains of M. grisea that infect finger millet and goosegrass (members of the plant genus Eleusine). The element, designated grasshopper (grh), is present in multiple copies and dispersed throughout the genome. DNA sequence analysis showed that grasshopper contains 198 base pair direct, long terminal repeats (LTRs) with features characteristic of retroviral and retrotransposon LTRs. Within the element we identified an open reading frame with sequences homologous to the reverse transcriptase, RNaseH, and integrase domains of retroelement pol genes. Comparison of the open reading frame with sequences from other retroelements showed that grh is related to the gypsy family of retrotransposons. Comparisons of the distribution of the grasshopper element with other dispersed repeated DNA sequences in M. grisea indicated that grasshopper was present in a broadly dispersed subgroup of Eleusine pathogens, suggesting that the element was acquired subsequent to the evolution of this host-specific form. We present arguments that the amplification of different retroelements within populations of M. grisea is a consequence of the clonal organization of the fungal populations.

  5. The structure of the regulatory region of the rat L1 (L1Rn, long interspersed repeated) DNA family of transposable elements.

    PubMed Central

    Furano, A V; Robb, S M; Robb, F T

    1988-01-01

    Here we report the DNA structure of the left 1.5 kb of two newly isolated full length members of the rat L1 DNA family (L1Rn, long interspersed repeated DNA). In contrast to earlier isolated rat L1 members, both of these contain promoter-like regions that are most likely full length. In addition, the promoter-like region of both members has undergone a partial tandem duplication. A second internal region of the left end of one of the reported members is also tandemly duplicated. The propensity of the left end of rat L1 elements to undergo this form of genetic rearrangement, as well as other structural features revealed by the present work, is discussed in light of the fact that during evolution the otherwise conserved mammalian L1 DNA families have each acquired completely different promoter-like regions. In an accompanying paper [Nur, I., Pascale, E., and Furano, A. V. (1988) Nucleic Acids Res. 16, submitted], we report that one of the rat promoter-like regions can function as a promoter in rat cells when fused to the Escherichia coli chloramphenicol acyltransferase gene. PMID:2845369

  6. [Structural organization of 5S ribosomal DNA of Rosa rugosa].

    PubMed

    Tynkevych, Iu O; Volkov, R A

    2014-01-01

    In order to clarify molecular organization of the genomic region encoding 5S rRNA in diploid species Rosa rugosa several 5S rDNA repeated units were cloned and sequenced. Analysis of the obtained sequences revealed that only one length variant of 5S rDNA repeated units, which contains intact promoter elements in the intergenic spacer region (IGS) and appears to be transcriptionally active is present in the genome. Additionally, a limited number of 5S rDNA pseudogenes lacking a portion of coding sequence and the complete IGS was detected. A high level of sequence similarity (from 93.7 to 97.5%) between the IGS of major 5S rDNA variants of East Asian R. rugosa and North American R. nitida was found indicating comparatively recent divergence of these species.

  7. Interplay between DNA methylation, histone modification and chromatin remodeling in stem cells and during development.

    PubMed

    Ikegami, Kohta; Ohgane, Jun; Tanaka, Satoshi; Yagi, Shintaro; Shiota, Kunio

    2009-01-01

    Genes constitute only a small proportion of the mammalian genome, the majority of which is composed of non-genic repetitive elements including interspersed repeats and satellites. A unique feature of the mammalian genome is that there are numerous tissue-dependent, differentially methylated regions (T-DMRs) in the non-repetitive sequences, which include genes and their regulatory elements. The epigenetic status of T-DMRs varies from that of repetitive elements and constitutes the DNA methylation profile genome-wide. Since the DNA methylation profile is specific to each cell and tissue type, much like a fingerprint, it can be used as a means of identification. The formation of DNA methylation profiles is the basis for cell differentiation and development in mammals. The epigenetic status of each T-DMR is regulated by the interplay between DNA methyltransferases, histone modification enzymes, histone subtypes, non-histone nuclear proteins and non-coding RNAs. In this review, we will discuss how these epigenetic factors cooperate to establish cell- and tissue-specific DNA methylation profiles.

  8. Foldback intercoil DNA and the mechanism of DNA transposition.

    PubMed

    Kim, Byung-Dong

    2014-09-01

    Foldback intercoil (FBI) DNA is formed by the folding back at one point of a non-helical parallel track of double-stranded DNA at as sharp as 180° and the intertwining of two double helixes within each other's major groove to form an intercoil with a diameter of 2.2 nm. FBI DNA has been suggested to mediate intra-molecular homologous recombination of a deletion and inversion. Inter-molecular homologous recombination, known as site-specific insertion, on the other hand, is mediated by the direct perpendicular approach of the FBI DNA tip, as the attP site, onto the target DNA, as the attB site. Transposition of DNA transposons involves the pairing of terminal inverted repeats and 5-7-bp tandem target duplication. FBI DNA configuration effectively explains simple as well as replicative transposition, along with the involvement of an enhancer element. The majority of diverse retrotransposable elements that employ a target site duplication mechanism is also suggested to follow the FBI DNA-mediated perpendicular insertion of the paired intercoil ends by non-homologous end-joining, together with gap filling. A genome-wide perspective of transposable elements in light of FBI DNA is discussed.

  9. Uniformity of nucleosome preservation pattern in Mammalian sperm and its connection to repetitive DNA elements.

    PubMed

    Samans, Birgit; Yang, Yang; Krebs, Stefan; Sarode, Gaurav Vilas; Blum, Helmut; Reichenbach, Myriam; Wolf, Eckhard; Steger, Klaus; Dansranjavin, Temuujin; Schagdarsurengin, Undraga

    2014-07-14

    Nucleosome-to-protamine exchange during mammalian spermiogenesis is essential for compaction and protection of paternal DNA. It is interesting that, depending on the species, 1% to 15% of nucleosomes are retained, but the generalizability and biological function of this retention are unknown. Here, we show concordantly in human and bovine that nucleosomes remained in sperm chromatin predominantly within distal intergenic regions and introns and associated with centromere repeats and retrotransposons (LINE1 and SINEs). In contrast, nucleosome depletion concerned particularly exons, 5'-UTR, 3'-UTR, TSS, and TTS and was associated with simple and low-complexity repeats. Overlap of human and bovine genes exhibiting nucleosome preservation in the promoter and gene body revealed a significant enrichment of signal transduction and RNA- and protein-processing factors. Our study demonstrates the genome-wide uniformity of the nucleosome preservation pattern in mammalian sperm and its connection to repetitive DNA elements and suggests a function in preimplantation processes for paternally derived nucleosomes. Copyright © 2014 Elsevier Inc. All rights reserved.

  10. Effects of Particulate Matter on Genomic DNA Methylation Content and iNOS Promoter Methylation

    PubMed Central

    Tarantini, Letizia; Bonzini, Matteo; Apostoli, Pietro; Pegoraro, Valeria; Bollati, Valentina; Marinelli, Barbara; Cantone, Laura; Rizzo, Giovanna; Hou, Lifang; Schwartz, Joel; Bertazzi, Pier Alberto; Baccarelli, Andrea

    2009-01-01

    Background Altered patterns of gene expression mediate the effects of particulate matter (PM) on human health, but mechanisms through which PM modifies gene expression are largely undetermined. Objectives We aimed at identifying short- and long-term effects of PM exposure on DNA methylation, a major genomic mechanism of gene expression control, in workers in an electric furnace steel plant with well-characterized exposure to PM with aerodynamic diameters < 10 μm (PM10). Methods We measured global genomic DNA methylation content estimated in Alu and long interspersed nuclear element-1 (LINE-1) repeated elements, and promoter DNA methylation of iNOS (inducible nitric oxide synthase), a gene suppressed by DNA methylation and induced by PM exposure in blood leukocytes. Quantitative DNA methylation analysis was performed through bisulfite PCR pyrosequencing on blood DNA obtained from 63 workers on the first day of a work week (baseline, after 2 days off work) and after 3 days of work (postexposure). Individual PM10 exposure was between 73.4 and 1,220 μg/m3. Results Global methylation content estimated in Alu and LINE-1 repeated elements did not show changes in postexposure measures compared with baseline. PM10 exposure levels were negatively associated with methylation in both Alu [β = −0.19 %5-methylcytosine (%5mC); p = 0.04] and LINE-1 [β = −0.34 %5mC; p = 0.04], likely reflecting long-term PM10 effects. iNOS promoter DNA methylation was significantly lower in postexposure blood samples compared with baseline (difference = −0.61 %5mC; p = 0.02). Conclusions We observed changes in global and gene specific methylation that should be further characterized in future investigations on the effects of PM. PMID:19270791

  11. Visual ModuleOrganizer: a graphical interface for the detection and comparative analysis of repeat DNA modules

    PubMed Central

    2014-01-01

    Background DNA repeats, such as transposable elements, minisatellites and palindromic sequences, are abundant in sequences and have been shown to have significant and functional roles in the evolution of the host genomes. In a previous study, we introduced the concept of a repeat DNA module, a flexible motif present in at least two occurences in the sequences. This concept was embedded into ModuleOrganizer, a tool allowing the detection of repeat modules in a set of sequences. However, its implementation remains difficult for larger sequences. Results Here we present Visual ModuleOrganizer, a Java graphical interface that enables a new and optimized version of the ModuleOrganizer tool. To implement this version, it was recoded in C++ with compressed suffix tree data structures. This leads to less memory usage (at least 120-fold decrease in average) and decreases by at least four the computation time during the module detection process in large sequences. Visual ModuleOrganizer interface allows users to easily choose ModuleOrganizer parameters and to graphically display the results. Moreover, Visual ModuleOrganizer dynamically handles graphical results through four main parameters: gene annotations, overlapping modules with known annotations, location of the module in a minimal number of sequences, and the minimal length of the modules. As a case study, the analysis of FoldBack4 sequences clearly demonstrated that our tools can be extended to comparative and evolutionary analyses of any repeat sequence elements in a set of genomic sequences. With the increasing number of sequences available in public databases, it is now possible to perform comparative analyses of repeated DNA modules in a graphic and friendly manner within a reasonable time period. Availability Visual ModuleOrganizer interface and the new version of the ModuleOrganizer tool are freely available at: http://lcb.cnrs-mrs.fr/spip.php?rubrique313. PMID:24678954

  12. Evolutional dynamics of 45S and 5S ribosomal DNA in ancient allohexaploid Atropa belladonna.

    PubMed

    Volkov, Roman A; Panchuk, Irina I; Borisjuk, Nikolai V; Hosiawa-Baranska, Marta; Maluszynska, Jolanta; Hemleben, Vera

    2017-01-23

    Polyploid hybrids represent a rich natural resource to study molecular evolution of plant genes and genomes. Here, we applied a combination of karyological and molecular methods to investigate chromosomal structure, molecular organization and evolution of ribosomal DNA (rDNA) in nightshade, Atropa belladonna (fam. Solanaceae), one of the oldest known allohexaploids among flowering plants. Because of their abundance and specific molecular organization (evolutionarily conserved coding regions linked to variable intergenic spacers, IGS), 45S and 5S rDNA are widely used in plant taxonomic and evolutionary studies. Molecular cloning and nucleotide sequencing of A. belladonna 45S rDNA repeats revealed a general structure characteristic of other Solanaceae species, and a very high sequence similarity of two length variants, with the only difference in number of short IGS subrepeats. These results combined with the detection of three pairs of 45S rDNA loci on separate chromosomes, presumably inherited from both tetraploid and diploid ancestor species, example intensive sequence homogenization that led to substitution/elimination of rDNA repeats of one parent. Chromosome silver-staining revealed that only four out of six 45S rDNA sites are frequently transcriptionally active, demonstrating nucleolar dominance. For 5S rDNA, three size variants of repeats were detected, with the major class represented by repeats containing all functional IGS elements required for transcription, the intermediate size repeats containing partially deleted IGS sequences, and the short 5S repeats containing severe defects both in the IGS and coding sequences. While shorter variants demonstrate increased rate of based substitution, probably in their transition into pseudogenes, the functional 5S rDNA variants are nearly identical at the sequence level, pointing to their origin from a single parental species. Localization of the 5S rDNA genes on two chromosome pairs further supports uniparental inheritance from the tetraploid progenitor. The obtained molecular, cytogenetic and phylogenetic data demonstrate complex evolutionary dynamics of rDNA loci in allohexaploid species of Atropa belladonna. The high level of sequence unification revealed in 45S and 5S rDNA loci of this ancient hybrid species have been seemingly achieved by different molecular mechanisms.

  13. Novel Structure of Ty3 Reverse Transcriptase | Center for Cancer Research

    Cancer.gov

    Retrotransposons are mobile genetic elements that self amplify via a single-stranded RNA intermediate, which is converted to double-stranded DNA by an encoded reverse transcriptase (RT) with both DNA polymerase (pol) and ribonuclease H (RNase) activities. Categorized by whether they contain flanking long terminal repeat (LTR) sequences, retrotransposons play a critical role in

  14. Eggs, embryos and the evolution of imprinting: insights from the platypus genome.

    PubMed

    Renfree, Marilyn B; Papenfuss, Anthony T; Shaw, Geoff; Pask, Andrew J

    2009-01-01

    Genomic imprinting is widespread in eutherian and marsupial mammals. Although there have been many hypotheses to explain why genomic imprinting evolved in mammals, few have examined how it arose. The host defence hypothesis suggests that imprinting evolved from existing mechanisms within the cell that act to silence foreign DNA elements that insert into the genome. However, the changes to the mammalian genome that accompanied the evolution of imprinting have been hard to define due to the absence of large-scale genomic resources from all extant classes. The recent release of the platypus genome sequence has provided the first opportunity to make comparisons between prototherian (monotreme, which show no signs of imprinting) and therian (marsupial and eutherian, which have imprinting) mammals. We compared the distribution of repeat elements known to attract epigenetic silencing across the genome from monotremes and therian mammals, particularly focusing on the orthologous imprinted regions. Our analyses show that the platypus has significantly fewer repeats of certain classes in the regions of the genome that have become imprinted in therian mammals. The accumulation of repeats, especially long-terminal repeats and DNA elements, in therian imprinted genes and gene clusters therefore appears to be coincident with, and may have been a potential driving force in, the development of mammalian genomic imprinting. Comparative platypus genome analyses of orthologous imprinted regions have provided strong support for the host defence hypothesis to explain the origin of imprinting.

  15. Chromosome ends: different sequences may provide conserved functions.

    PubMed

    Louis, Edward J; Vershinin, Alexander V

    2005-07-01

    The structures of specific chromosome regions, centromeres and telomeres, present a number of puzzles. As functions performed by these regions are ubiquitous and essential, their DNA, proteins and chromatin structure are expected to be conserved. Recent studies of centromeric DNA from human, Drosophila and plant species have demonstrated that a hidden universal centromere-specific sequence is highly unlikely. The DNA of telomeres is more conserved consisting of a tandemly repeated 6-8 bp Arabidopsis-like sequence in a majority of organisms as diverse as protozoan, fungi, mammals and plants. However, there are alternatives to short DNA repeats at the ends of chromosomes and for telomere elongation by telomerase. Here we focus on the similarities and diversity that exist among the structural elements, DNA sequences and proteins, that make up terminal domains (telomeres and subtelomeres), and how organisms use these in different ways to fulfil the functions of end-replication and end-protection. Copyright (c) 2005 Wiley Periodicals, Inc.

  16. Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences

    PubMed Central

    Sheinman, Michael; Ramisch, Anna; Massip, Florian; Arndt, Peter F.

    2016-01-01

    Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as Zipf’s law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes. PMID:27488939

  17. Non-B DB: a database of predicted non-B DNA-forming motifs in mammalian genomes.

    PubMed

    Cer, Regina Z; Bruce, Kevin H; Mudunuri, Uma S; Yi, Ming; Volfovsky, Natalia; Luke, Brian T; Bacolla, Albino; Collins, Jack R; Stephens, Robert M

    2011-01-01

    Although the capability of DNA to form a variety of non-canonical (non-B) structures has long been recognized, the overall significance of these alternate conformations in biology has only recently become accepted en masse. In order to provide access to genome-wide locations of these classes of predicted structures, we have developed non-B DB, a database integrating annotations and analysis of non-B DNA-forming sequence motifs. The database provides the most complete list of alternative DNA structure predictions available, including Z-DNA motifs, quadruplex-forming motifs, inverted repeats, mirror repeats and direct repeats and their associated subsets of cruciforms, triplex and slipped structures, respectively. The database also contains motifs predicted to form static DNA bends, short tandem repeats and homo(purine•pyrimidine) tracts that have been associated with disease. The database has been built using the latest releases of the human, chimp, dog, macaque and mouse genomes, so that the results can be compared directly with other data sources. In order to make the data interpretable in a genomic context, features such as genes, single-nucleotide polymorphisms and repetitive elements (SINE, LINE, etc.) have also been incorporated. The database is accessed through query pages that produce results with links to the UCSC browser and a GBrowse-based genomic viewer. It is freely accessible at http://nonb.abcc.ncifcrf.gov.

  18. Improved PCR-Based Detection of Soil Transmitted Helminth Infections Using a Next-Generation Sequencing Approach to Assay Design.

    PubMed

    Pilotte, Nils; Papaiakovou, Marina; Grant, Jessica R; Bierwert, Lou Ann; Llewellyn, Stacey; McCarthy, James S; Williams, Steven A

    2016-03-01

    The soil transmitted helminths are a group of parasitic worms responsible for extensive morbidity in many of the world's most economically depressed locations. With growing emphasis on disease mapping and eradication, the availability of accurate and cost-effective diagnostic measures is of paramount importance to global control and elimination efforts. While real-time PCR-based molecular detection assays have shown great promise, to date, these assays have utilized sub-optimal targets. By performing next-generation sequencing-based repeat analyses, we have identified high copy-number, non-coding DNA sequences from a series of soil transmitted pathogens. We have used these repetitive DNA elements as targets in the development of novel, multi-parallel, PCR-based diagnostic assays. Utilizing next-generation sequencing and the Galaxy-based RepeatExplorer web server, we performed repeat DNA analysis on five species of soil transmitted helminths (Necator americanus, Ancylostoma duodenale, Trichuris trichiura, Ascaris lumbricoides, and Strongyloides stercoralis). Employing high copy-number, non-coding repeat DNA sequences as targets, novel real-time PCR assays were designed, and assays were tested against established molecular detection methods. Each assay provided consistent detection of genomic DNA at quantities of 2 fg or less, demonstrated species-specificity, and showed an improved limit of detection over the existing, proven PCR-based assay. The utilization of next-generation sequencing-based repeat DNA analysis methodologies for the identification of molecular diagnostic targets has the ability to improve assay species-specificity and limits of detection. By exploiting such high copy-number repeat sequences, the assays described here will facilitate soil transmitted helminth diagnostic efforts. We recommend similar analyses when designing PCR-based diagnostic tests for the detection of other eukaryotic pathogens.

  19. The site-specific ribosomal insertion element type II of Bombyx mori (R2Bm) contains the coding sequence for a reverse transcriptase-like enzyme.

    PubMed Central

    Burke, W D; Calalang, C C; Eickbush, T H

    1987-01-01

    Two classes of DNA elements interrupt a fraction of the rRNA repeats of Bombyx mori. We have analyzed by genomic blotting and sequence analysis one class of these elements which we have named R2. These elements occupy approximately 9% of the rDNA units of B. mori and appear to be homologous to the type II rDNA insertions detected in Drosophila melanogaster. Approximately 25 copies of R2 exist within the B. mori genome, of which at least 20 are located at a precise location within otherwise typical rDNA units. Nucleotide sequence analysis has revealed that the 4.2-kilobase-pair R2 element has a single large open reading frame, occupying over 82% of the total length of the element. The central region of this 1,151-amino-acid open reading frame shows homology to the reverse transcriptase enzymes found in retroviruses and certain transposable elements. Amino acid homology of this region is highest to the mobile line 1 elements of mammals, followed by the mitochondrial type II introns of fungi, and the pol gene of retroviruses. Less homology exists with transposable elements of D. melanogaster and Saccharomyces cerevisiae. Two additional regions of sequence homology between L1 and R2 elements were also found outside the reverse transcriptase region. We suggest that the R2 elements are retrotransposons that are site specific in their insertion into the genome. Such mobility would enable these elements to occupy a small fraction of the rDNA units of B. mori despite their continual elimination from the rDNA locus by sequence turnover. Images PMID:2439905

  20. Deep Investigation of Arabidopsis thaliana Junk DNA Reveals a Continuum between Repetitive Elements and Genomic Dark Matter

    PubMed Central

    Maumus, Florian; Quesneville, Hadi

    2014-01-01

    Eukaryotic genomes contain highly variable amounts of DNA with no apparent function. This so-called junk DNA is composed of two components: repeated and repeat-derived sequences (together referred to as the repeatome), and non-annotated sequences also known as genomic dark matter. Because of their high duplication rates as compared to other genomic features, transposable elements are predominant contributors to the repeatome and the products of their decay is thought to be a major source of genomic dark matter. Determining the origin and composition of junk DNA is thus important to help understanding genome evolution as well as host biology. In this study, we have used a combination of tools enabling to show that the repeatome from the small and reducing A. thaliana genome is significantly larger than previously thought. Furthermore, we present the concepts and results from a series of innovative approaches suggesting that a significant amount of the A. thaliana dark matter is of repetitive origin. As a tentative standard for the community, we propose a deep compendium annotation of the A. thaliana repeatome that may help addressing farther genome evolution as well as transcriptional and epigenetic regulation in this model plant. PMID:24709859

  1. Two new miniature inverted-repeat transposable elements in the genome of the clam Donax trunculus.

    PubMed

    Šatović, Eva; Plohl, Miroslav

    2017-10-01

    Repetitive sequences are important components of eukaryotic genomes that drive their evolution. Among them are different types of mobile elements that share the ability to spread throughout the genome and form interspersed repeats. To broaden the generally scarce knowledge on bivalves at the genome level, in the clam Donax trunculus we described two new non-autonomous DNA transposons, miniature inverted-repeat transposable elements (MITEs), named DTC M1 and DTC M2. Like other MITEs, they are characterized by their small size, their A + T richness, and the presence of terminal inverted repeats (TIRs). DTC M1 and DTC M2 are 261 and 286 bp long, respectively, and in addition to TIRs, both of them contain a long imperfect palindrome sequence in their central parts. These elements are present in complete and truncated versions within the genome of the clam D. trunculus. The two new MITEs share only structural similarity, but lack any nucleotide sequence similarity to each other. In a search for related elements in databases, blast search revealed within the Crassostrea gigas genome a larger element sharing sequence similarity only to DTC M1 in its TIR sequences. The lack of sequence similarity with any previously published mobile elements indicates that DTC M1 and DTC M2 elements may be unique to D. trunculus.

  2. DNA transposons have colonized the genome of the giant virus Pandoravirus salinus.

    PubMed

    Sun, Cheng; Feschotte, Cédric; Wu, Zhiqiang; Mueller, Rachel Lockridge

    2015-06-12

    Transposable elements are mobile DNA sequences that are widely distributed in prokaryotic and eukaryotic genomes, where they represent a major force in genome evolution. However, transposable elements have rarely been documented in viruses, and their contribution to viral genome evolution remains largely unexplored. Pandoraviruses are recently described DNA viruses with genome sizes that exceed those of some prokaryotes, rivaling parasitic eukaryotes. These large genomes appear to include substantial noncoding intergenic spaces, which provide potential locations for transposable element insertions. However, no mobile genetic elements have yet been reported in pandoravirus genomes. Here, we report a family of miniature inverted-repeat transposable elements (MITEs) in the Pandoravirus salinus genome, representing the first description of a virus populated with a canonical transposable element family that proliferated by transposition within the viral genome. The MITE family, which we name Submariner, includes 30 copies with all the hallmarks of MITEs: short length, terminal inverted repeats, TA target site duplication, and no coding capacity. Submariner elements show signs of transposition and are undetectable in the genome of Pandoravirus dulcis, the closest known relative Pandoravirus salinus. We identified a DNA transposon related to Submariner in the genome of Acanthamoeba castellanii, a species thought to host pandoraviruses, which contains remnants of coding sequence for a Tc1/mariner transposase. These observations suggest that the Submariner MITEs of P. salinus belong to the widespread Tc1/mariner superfamily and may have been mobilized by an amoebozoan host. Ten of the 30 MITEs in the P. salinus genome are located within coding regions of predicted genes, while others are close to genes, suggesting that these transposons may have contributed to viral genetic novelty. Our discovery highlights the remarkable ability of DNA transposons to colonize and shape genomes from all domains of life, as well as giant viruses. Our findings continue to blur the division between viral and cellular genomes, adhering to the emerging view that the content, dynamics, and evolution of the genomes of giant viruses do not substantially differ from those of cellular organisms.

  3. Role for a region of helically unstable DNA within the Epstein-Barr virus latent cycle origin of DNA replication oriP in origin function

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Polonskaya, Zhanna; Benham, Craig J.; Hearing, Janet

    The minimal replicator of the Epstein-Barr virus (EBV) latent cycle origin of DNA replication oriP is composed of two binding sites for the Epstein-Barr virus nuclear antigen-1 (EBNA-1) and flanking inverted repeats that bind the telomere repeat binding factor TRF2. Although not required for minimal replicator activity, additional binding sites for EBNA-1 and TRF2 and one or more auxiliary elements located to the right of the EBNA-1/TRF2 sites are required for the efficient replication of oriP plasmids. Another region of oriP that is predicted to be destabilized by DNA supercoiling is shown here to be an important functional component ofmore » oriP. The ability of DNA fragments of unrelated sequence and possessing supercoiled-induced DNA duplex destabilized (SIDD) structures, but not fragments characterized by helically stable DNA, to substitute for this component of oriP demonstrates a role for the SIDD region in the initiation of oriP-plasmid DNA replication.« less

  4. Eukaryotic gene regulation by targeted chromatin re-modeling at dispersed, middle-repetitive sequence elements.

    PubMed

    Hodgetts, Ross

    2004-12-01

    RNA interference might have evolved to minimize the deleterious impact of transposable elements and viruses on eukaryotic genomes, because mutations in genes within the RNAi pathway cause mobilization of transposons in nematodes and flies. Although the first examples of RNAi involved post-transcriptional gene silencing, recently the pathway has been shown to act at the transcriptional level. It does so by establishing a chromatin configuration on the target DNA that has many of the hallmarks of heterochromatin, thus preventing its transcription. Members of dispersed, repeated sequence families appear to have been utilized by the RNAi machinery to regulate nearby genes in yeast. The unusual genomic distribution of three repeated element families in the chicken, fruit-fly and nematode genomes prompts speculation that some of these repeats have been co-opted to control gene expression, either locally or over extended chromosomal domains.

  5. A Helitron-like Transposon Superfamily from Lepidoptera Disrupts (GAAA)n Microsatellites and is Responsible for Flanking Sequence Similarity within a Microsatellite Family

    USDA-ARS?s Scientific Manuscript database

    Transposable elements (TEs) are mobile DNA regions that alter host genome structure and gene expression. A novel 588 bp non-autonomous high copy number TE in the Ostrinia nubilalis genome has features in common with miniature inverted-repeat transposable elements (MITEs): high A+T content (62.3%),...

  6. Promoter selection in human mitochondria involves binding of a transcription factor to orientation-independent upstream regulatory elements.

    PubMed

    Fisher, R P; Topper, J N; Clayton, D A

    1987-07-17

    Selective transcription of human mitochondrial DNA requires a transcription factor (mtTF) in addition to an essentially nonselective RNA polymerase. Partially purified mtTF is able to sequester promoter-containing DNA in preinitiation complexes in the absence of mitochondrial RNA polymerase, suggesting a DNA-binding mechanism for factor activity. Functional domains, required for positive transcriptional regulation by mtTF, are identified within both major promoters of human mtDNA through transcription of mutant promoter templates in a reconstituted in vitro system. These domains are essentially coextensive with DNA sequences protected from nuclease digestion by mtTF-binding. Comparison of the sequences of the two mtTF-responsive elements reveals significant homology only when one sequence is inverted; the binding sites are in opposite orientations with respect to the predominant direction of transcription. Thus mtTF may function bidirectionally, requiring additional protein-DNA interactions to dictate transcriptional polarity. The mtTF-responsive elements are arrayed as direct repeats, separated by approximately 80 bp within the displacement-loop region of human mitochondrial DNA; this arrangement may reflect duplication of an ancestral bidirectional promoter, giving rise to separate, unidirectional promoters for each strand.

  7. The repetitive landscape of the chicken genome.

    PubMed

    Wicker, Thomas; Robertson, Jon S; Schulze, Stefan R; Feltus, F Alex; Magrini, Vincent; Morrison, Jason A; Mardis, Elaine R; Wilson, Richard K; Peterson, Daniel G; Paterson, Andrew H; Ivarie, Robert

    2005-01-01

    Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7 x coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available.

  8. The repetitive landscape of the chicken genome

    PubMed Central

    Wicker, Thomas; Robertson, Jon S.; Schulze, Stefan R.; Feltus, F. Alex; Magrini, Vincent; Morrison, Jason A.; Mardis, Elaine R.; Wilson, Richard K.; Peterson, Daniel G.; Paterson, Andrew H.; Ivarie, Robert

    2005-01-01

    Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7× coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available. PMID:15256510

  9. The repeat organizer, a specialized insulator element within the intergenic spacer of the Xenopus rRNA genes.

    PubMed Central

    Robinett, C C; O'Connor, A; Dunaway, M

    1997-01-01

    We have identified a novel activity for the region of the intergenic spacer of the Xenopus laevis rRNA genes that contains the 35- and 100-bp repeats. We devised a new assay for this region by constructing DNA plasmids containing a tandem repeat of rRNA reporter genes that were separated by the 35- and 100-bp repeat region and a rRNA gene enhancer. When the 35- and 100-bp repeat region is present in its normal position and orientation at the 3' end of the rRNA reporter genes, the enhancer activates the adjacent downstream promoter but not the upstream rRNA promoter on the same plasmid. Because this element can restrict the range of an enhancer's activity in the context of tandem genes, we have named it the repeat organizer (RO). The ability to restrict enhancer action is a feature of insulator elements, but unlike previously described insulator elements the RO does not block enhancer action in a simple enhancer-blocking assay. Instead, the activity of the RO requires that it be in its normal position and orientation with respect to the other sequence elements of the rRNA genes. The enhancer-binding transcription factor xUBF also binds to the repetitive sequences of the RO in vitro, but these sequences do not activate transcription in vivo. We propose that the RO is a specialized insulator element that organizes the tandem array of rRNA genes into single-gene expression units by promoting activation of a promoter by its proximal enhancers. PMID:9111359

  10. Localization of Action of the Is50-Encoded Transposase Protein

    PubMed Central

    Phadnis, Suhas H.; Sasakawa, Chihiro; Berg, Douglas E.

    1986-01-01

    The movement of the bacterial insertion sequence IS50 and of composite elements containing direct terminal repeats of IS50 involves the two ends of IS50, designated O (outside) and I (inside), which are weakly matched in DNA sequence, and an IS50 encoded protein, transposase, which recognizes the O and I ends and acts preferentially in cis. Previous data had suggested that, initially, transposase interacts preferentially with the O end sequence and then, in a second step, with either an O or an I end. To better understand the cis action of transposase and how IS50 ends are selected, we generated a series of composite transposons which contain direct repeats of IS50 elements. In each transposon, one IS50 element encoded transposase (tnp +), and the other contained a null (tnp-) allele. In each of the five sets of composite transposons studied, the transposon for which the tnp+ IS50 element contained its O end was more active than a complementary transposon for which the tnp - IS50 element contained its O end. This pattern of O end use suggests models in which the cis action of transposase and its choice of ends is determined by protein tracking along DNA molecules. PMID:3007274

  11. CRISPR recognition tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats.

    PubMed

    Bland, Charles; Ramsey, Teresa L; Sabree, Fareedah; Lowe, Micheal; Brown, Kyndall; Kyrpides, Nikos C; Hugenholtz, Philip

    2007-06-18

    Clustered Regularly Interspaced Palindromic Repeats (CRISPRs) are a novel type of direct repeat found in a wide range of bacteria and archaea. CRISPRs are beginning to attract attention because of their proposed mechanism; that is, defending their hosts against invading extrachromosomal elements such as viruses. Existing repeat detection tools do a poor job of identifying CRISPRs due to the presence of unique spacer sequences separating the repeats. In this study, a new tool, CRT, is introduced that rapidly and accurately identifies CRISPRs in large DNA strings, such as genomes and metagenomes. CRT was compared to CRISPR detection tools, Patscan and Pilercr. In terms of correctness, CRT was shown to be very reliable, demonstrating significant improvements over Patscan for measures precision, recall and quality. When compared to Pilercr, CRT showed improved performance for recall and quality. In terms of speed, CRT proved to be a huge improvement over Patscan. Both CRT and Pilercr were comparable in speed, however CRT was faster for genomes containing large numbers of repeats. In this paper a new tool was introduced for the automatic detection of CRISPR elements. This tool, CRT, showed some important improvements over current techniques for CRISPR identification. CRT's approach to detecting repetitive sequences is straightforward. It uses a simple sequential scan of a DNA sequence and detects repeats directly without any major conversion or preprocessing of the input. This leads to a program that is easy to describe and understand; yet it is very accurate, fast and memory efficient, being O(n) in space and O(nm/l) in time.

  12. CRISPR Recognition Tool (CRT): a tool for automatic detection ofclustered regularly interspaced palindromic repeats

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bland, Charles; Ramsey, Teresa L.; Sabree, Fareedah

    Clustered Regularly Interspaced Palindromic Repeats (CRISPRs) are a novel type of direct repeat found in a wide range of bacteria and archaea. CRISPRs are beginning to attract attention because of their proposed mechanism; that is, defending their hosts against invading extrachromosomal elements such as viruses. Existing repeat detection tools do a poor job of identifying CRISPRs due to the presence of unique spacer sequences separating the repeats. In this study, a new tool, CRT, is introduced that rapidly and accurately identifies CRISPRs in large DNA strings, such as genomes and metagenomes. CRT was compared to CRISPR detection tools, Patscan andmore » Pilercr. In terms of correctness, CRT was shown to be very reliable, demonstrating significant improvements over Patscan for measures precision, recall and quality. When compared to Pilercr, CRT showed improved performance for recall and quality. In terms of speed, CRT also demonstrated superior performance, especially for genomes containing large numbers of repeats. In this paper a new tool was introduced for the automatic detection of CRISPR elements. This tool, CRT, was shown to be a significant improvement over the current techniques for CRISPR identification. CRT's approach to detecting repetitive sequences is straightforward. It uses a simple sequential scan of a DNA sequence and detects repeats directly without any major conversion or preprocessing of the input. This leads to a program that is easy to describe and understand; yet it is very accurate, fast and memory efficient, being O(n) in space and O(nm/l) in time.« less

  13. New insights into replication origin characteristics in metazoans

    PubMed Central

    Puy, Aurore; Rialle, Stéphanie; Kaplan, Noam; Segal, Eran

    2012-01-01

    We recently reported the identification and characterization of DNA replication origins (Oris) in metazoan cell lines. Here, we describe additional bioinformatic analyses showing that the previously identified GC-rich sequence elements form origin G-rich repeated elements (OGREs) that are present in 67% to 90% of the DNA replication origins from Drosophila to human cells, respectively. Our analyses also show that initiation of DNA synthesis takes place precisely at 160 bp (Drosophila) and 280 bp (mouse) from the OGRE. We also found that in most CpG islands, an OGRE is positioned in opposite orientation on each of the two DNA strands and detected two sites of initiation of DNA synthesis upstream or downstream of each OGRE. Conversely, Oris not associated with CpG islands have a single initiation site. OGRE density along chromosomes correlated with previously published replication timing data. Ori sequences centered on the OGRE are also predicted to have high intrinsic nucleosome occupancy. Finally, OGREs predict G-quadruplex structures at Oris that might be structural elements controlling the choice or activation of replication origins. PMID:22373526

  14. Whole DNA methylome profiling in mice exposed to secondhand smoke.

    PubMed

    Tommasi, Stella; Zheng, Albert; Yoon, Jae-In; Li, Arthur Xuejun; Wu, Xiwei; Besaratinia, Ahmad

    2012-11-01

    Aberration of DNA methylation is a prime epigenetic mechanism of carcinogenesis. Aberrant DNA methylation occurs frequently in lung cancer, with exposure to secondhand smoke (SHS) being an established risk factor. The causal role of SHS in the genesis of lung cancer, however, remains elusive. To investigate whether SHS can cause aberrant DNA methylation in vivo, we have constructed the whole DNA methylome in mice exposed to SHS for a duration of 4 mo, both after the termination of exposure and at ensuing intervals post-exposure (up to 10 mo). Our genome-wide and gene-specific profiling of DNA methylation in the lung of SHS-exposed mice revealed that all groups of SHS-exposed mice and controls share a similar pattern of DNA methylation. Furthermore, the methylation status of major repetitive DNA elements, including long-interspersed nuclear elements (LINE L1), intracisternal A particle long-terminal repeat retrotransposons (IAP-LTR), and short-interspersed nuclear elements (SINE B1), in the lung of all groups of SHS-exposed mice and controls remains comparable. The absence of locus-specific gain of DNA methylation and global loss of DNA methylation in the lung of SHS-exposed mice within a timeframe that precedes neoplastic-lesion formation underscore the challenges of lung cancer biomarker development. Identifying the initiating events that cause aberrant DNA methylation in lung carcinogenesis may help improve future strategies for prevention, early detection and treatment of this highly lethal disease.

  15. Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts.

    PubMed

    Trofimova, Irina; Krasikova, Alla

    2016-12-01

    Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeats transcription in amphibian and avian model organisms is fragmentary despite their genomes being thoroughly characterized. Review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis when chromosomes acquire special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes with applied novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, dynamic observation in life-like conditions provide amazing opportunities for investigation mechanisms of the satellite DNA transcription.

  16. Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts

    PubMed Central

    Krasikova, Alla

    2016-01-01

    ABSTRACT Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeats transcription in amphibian and avian model organisms is fragmentary despite their genomes being thoroughly characterized. Review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis when chromosomes acquire special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes with applied novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, dynamic observation in life-like conditions provide amazing opportunities for investigation mechanisms of the satellite DNA transcription. PMID:27763817

  17. GREAM: A Web Server to Short-List Potentially Important Genomic Repeat Elements Based on Over-/Under-Representation in Specific Chromosomal Locations, Such as the Gene Neighborhoods, within or across 17 Mammalian Species

    PubMed Central

    Chandrashekar, Darshan Shimoga; Dey, Poulami; Acharya, Kshitish K.

    2015-01-01

    Background Genome-wide repeat sequences, such as LINEs, SINEs and LTRs share a considerable part of the mammalian nuclear genomes. These repeat elements seem to be important for multiple functions including the regulation of transcription initiation, alternative splicing and DNA methylation. But it is not possible to study all repeats and, hence, it would help to short-list before exploring their potential functional significance via experimental studies and/or detailed in silico analyses. Result We developed the ‘Genomic Repeat Element Analyzer for Mammals’ (GREAM) for analysis, screening and selection of potentially important mammalian genomic repeats. This web-server offers many novel utilities. For example, this is the only tool that can reveal a categorized list of specific types of transposons, retro-transposons and other genome-wide repetitive elements that are statistically over-/under-represented in regions around a set of genes, such as those expressed differentially in a disease condition. The output displays the position and frequency of identified elements within the specified regions. In addition, GREAM offers two other types of analyses of genomic repeat sequences: a) enrichment within chromosomal region(s) of interest, and b) comparative distribution across the neighborhood of orthologous genes. GREAM successfully short-listed a repeat element (MER20) known to contain functional motifs. In other case studies, we could use GREAM to short-list repetitive elements in the azoospermia factor a (AZFa) region of the human Y chromosome and those around the genes associated with rat liver injury. GREAM could also identify five over-represented repeats around some of the human and mouse transcription factor coding genes that had conserved expression patterns across the two species. Conclusion GREAM has been developed to provide an impetus to research on the role of repetitive sequences in mammalian genomes by offering easy selection of more interesting repeats in various contexts/regions. GREAM is freely available at http://resource.ibab.ac.in/GREAM/. PMID:26208093

  18. Genetic and DNA sequence analysis of the kanamycin resistance transposon Tn903.

    PubMed Central

    Grindley, N D; Joyce, C M

    1980-01-01

    The kanamycin resistance transposon Tn903 consists of a unique region of about 1000 base pairs bounded by a pair of 1050-base-pair inverted repeat sequences. Each repeat contains two Pvu II endonuclease cleavage sites separated by 520 base pairs. We have constructed derivatives of Tn903 in which this 520-base-pair fragment is deleted from one or both repeats. Those derivatives that lack both 520-base-pair fragments cannot transpose, whereas those that lack just one remain transposition proficient. One such transposable derivative, Tn903 delta I, has been selected for further study. We have determined the sequence of the intact inverted repeat. The 18 base pairs at each end are identical and inverted relative to one another, a structure characteristic of insertion sequences. Additional experiments indicate that a single inverted repeat from Tn903 can, in fact, transpose; we propose that this element be called IS903. To correlate the DNA sequence with genetic activities, we have created mutations by inserting a 10-base-pair DNA fragment at several sites within the intact repeat of Tn903 delta 1, and we have examined the effect of such insertions on transposability. The results suggest that IS903 encodes a 307-amino-acid polypeptide (a "transposase") that is absolutely required for transposition of IS903 or Tn903. Images PMID:6261245

  19. Structural analysis of two length variants of the rDNA intergenic spacer from Eruca sativa.

    PubMed

    Lakshmikumaran, M; Negi, M S

    1994-03-01

    Restriction enzyme analysis of the rRNA genes of Eruca sativa indicated the presence of many length variants within a single plant and also between different cultivars which is unusual for most crucifers studied so far. Two length variants of the rDNA intergenic spacer (IGS) from a single individual E. sativa (cv. Itsa) plant were cloned and characterized. The complete nucleotide sequences of both the variants (3 kb and 4 kb) were determined. The intergenic spacer contains three families of tandemly repeated DNA sequences denoted as A, B and C. However, the long (4 kb) variant shows the presence of an additional repeat, denoted as D, which is a duplication of a 224 bp sequence just upstream of the putative transcription initiation site. Repeat units belonging to the three different families (A, B and C) were in the size range of 22 to 30 bp. Such short repeat elements are present in the IGS of most of the crucifers analysed so far. Sequence analysis of the variants (3 kb and 4 kb) revealed that the length heterogeneity of the spacer is located at three different regions and is due to the varying copy numbers of repeat units belonging to families A and B. Length variation of the spacer is also due to the presence of a large duplication (D repeats) in the 4 kb variant which is absent in the 3 kb variant. The putative transcription initiation site was identified by comparisons with the rDNA sequences from other plant species.

  20. Sequences spanning the leader-repeat junction mediate CRISPR adaptation to phage in Streptococcus thermophilus

    PubMed Central

    Wei, Yunzhou; Chesne, Megan T.; Terns, Rebecca M.; Terns, Michael P.

    2015-01-01

    CRISPR-Cas systems are RNA-based immune systems that protect prokaryotes from invaders such as phages and plasmids. In adaptation, the initial phase of the immune response, short foreign DNA fragments are captured and integrated into host CRISPR loci to provide heritable defense against encountered foreign nucleic acids. Each CRISPR contains a ∼100–500 bp leader element that typically includes a transcription promoter, followed by an array of captured ∼35 bp sequences (spacers) sandwiched between copies of an identical ∼35 bp direct repeat sequence. New spacers are added immediately downstream of the leader. Here, we have analyzed adaptation to phage infection in Streptococcus thermophilus at the CRISPR1 locus to identify cis-acting elements essential for the process. We show that the leader and a single repeat of the CRISPR locus are sufficient for adaptation in this system. Moreover, we identified a leader sequence element capable of stimulating adaptation at a dormant repeat. We found that sequences within 10 bp of the site of integration, in both the leader and repeat of the CRISPR, are required for the process. Our results indicate that information at the CRISPR leader-repeat junction is critical for adaptation in this Type II-A system and likely other CRISPR-Cas systems. PMID:25589547

  1. Transposable element distribution, abundance and role in genome size variation in the genus Oryza.

    PubMed

    Zuccolo, Andrea; Sebastian, Aswathy; Talag, Jayson; Yu, Yeisoo; Kim, HyeRan; Collura, Kristi; Kudrna, Dave; Wing, Rod A

    2007-08-29

    The genus Oryza is composed of 10 distinct genome types, 6 diploid and 4 polyploid, and includes the world's most important food crop - rice (Oryza sativa [AA]). Genome size variation in the Oryza is more than 3-fold and ranges from 357 Mbp in Oryza glaberrima [AA] to 1283 Mbp in the polyploid Oryza ridleyi [HHJJ]. Because repetitive elements are known to play a significant role in genome size variation, we constructed random sheared small insert genomic libraries from 12 representative Oryza species and conducted a comprehensive study of the repetitive element composition, distribution and phylogeny in this genus. Particular attention was paid to the role played by the most important classes of transposable elements (Long Terminal Repeats Retrotransposons, Long interspersed Nuclear Elements, helitrons, DNA transposable elements) in shaping these genomes and in their contributing to genome size variation. We identified the elements primarily responsible for the most strikingly genome size variation in Oryza. We demonstrated how Long Terminal Repeat retrotransposons belonging to the same families have proliferated to very different extents in various species. We also showed that the pool of Long Terminal Repeat Retrotransposons is substantially conserved and ubiquitous throughout the Oryza and so its origin is ancient and its existence predates the speciation events that originated the genus. Finally we described the peculiar behavior of repeats in the species Oryza coarctata [HHKK] whose placement in the Oryza genus is controversial. Long Terminal Repeat retrotransposons are the major component of the Oryza genomes analyzed and, along with polyploidization, are the most important contributors to the genome size variation across the Oryza genus. Two families of Ty3-gypsy elements (RIRE2 and Atlantys) account for a significant portion of the genome size variations present in the Oryza genus.

  2. Repeated extragenic sequences in prokaryotic genomes: a proposal for the origin and dynamics of the RUP element in Streptococcus pneumoniae.

    PubMed

    Oggioni, M R; Claverys, J P

    1999-10-01

    A survey of all Streptococcus pneumoniae GenBank/EMBL DNA sequence entries and of the public domain sequence (representing more than 90% of the genome) of an S. pneumoniae type 4 strain allowed identification of 108 copies of a 107-bp-long highly repeated intergenic element called RUP (for repeat unit of pneumococcus). Several features of the element, revealed in this study, led to the proposal that RUP is an insertion sequence (IS)-derivative that could still be mobile. Among these features are: (1) a highly significant homology between the terminal inverted repeats (IRs) of RUPs and of IS630-Spn1, a new putative IS of S. pneumoniae; and (2) insertion at a TA dinucleotide, a characteristic target of several members of the IS630 family. Trans-mobilization of RUP is therefore proposed to be mediated by the transposase of IS630-Spn1. To account for the observation that RUPs are distributed among four subtypes which exhibit different degrees of sequence homogeneity, a scenario is invoked based on successive stages of RUP mobility and non-mobility, depending on whether an active transposase is present or absent. In the latter situation, an active transposase could be reintroduced into the species through natural transformation. Examination of sequences flanking RUP revealed a preferential association with ISs. It also provided evidence that RUPs promote sequence rearrangements, thereby contributing to genome flexibility. The possibility that RUP preferentially targets transforming DNA of foreign origin and subsequently favours disruption/rearrangement of exogenous sequences is discussed.

  3. Radiation-Induced Epigenetic Alterations after Low and High LET Irradiations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aypar, Umut; Morgan, William F.; Baulch, Janet E.

    Epigenetics, including DNA methylation and microRNA (miRNA) expression, could be the missing link in understanding the delayed, non-targeted effects of radiation including radiationinduced genomic instability (RIGI). This study tests the hypothesis that irradiation induces epigenetic aberrations, which could eventually lead to RIGI, and that the epigenetic aberrations induced by low linear energy transfer (LET) irradiation are different than those induced by high LET irradiations. GM10115 cells were irradiated with low LET x-rays and high LET iron (Fe) ions and evaluated for DNA damage, cell survival and chromosomal instability. The cells were also evaluated for specific locus methylation of nuclear factor-kappamore » B (NFκB), tumor suppressor in lung cancer 1 (TSLC1) and cadherin 1 (CDH1) gene promoter regions, long interspersed nuclear element 1 (LINE-1) and Alu repeat element methylation, CpG and non-CpG global methylation and miRNA expression levels. Irradiated cells showed increased micronucleus induction and cell killing immediately following exposure, but were chromosomally stable at delayed times post-irradiation. At this same delayed time, alterations in repeat element and global DNA methylation and miRNA expression were observed. Analyses of DNA methylation predominantly showed hypomethylation, however hypermethylation was also observed. MiRNA shown to be altered in expression level after x-ray irradiation are involved in chromatin remodeling and DNA methylation. Different and higher incidence of epigenetic changes were observed after exposure to low LET x-rays than high LET Fe ions even though Fe ions elicited more chromosomal damage and cell killing. This study also shows that the irradiated cells acquire epigenetic changes even though they are chromosomally stable suggesting that epigenetic aberrations may arise in the cell without initiating RIGI.« less

  4. Genomic patterns associated with paternal/maternal distribution of transposable elements

    NASA Astrophysics Data System (ADS)

    Jurka, Jerzy

    2003-03-01

    Transposable elements (TEs) are specialized DNA or RNA fragments capable of surviving in intragenomic niches. They are commonly, perhaps unjustifiably referred to as "selfish" or "parasitic" elements. TEs can be divided in two major classes: retroelements and DNA transposons. The former include non-LTR retrotransposons and retrovirus-like elements, using reverse transriptase for their reproduction prior to integration into host DNA. The latter depend mostly on host DNA replication, with possible exception of rolling-circle transposons recently discovered by our team. I will review basic information on TEs, with emphasis on human Alu and L1 retroelements discussed in the context of genomic organization. TEs are non-randomly distributed in chromosomal DNA. In particular, human Alu elements tend to prefer GC-rich regions, whereas L1 accumulate in AT-rich regions. Current explanations of this phenomenon focus on the so called "target effects" and post-insertional selection. However, the proposed models appear to be unsatisfactory and alternative explanations invoking "channeling" to different chromosomal regions will be a major focus of my presentation. Transposable elements (TEs) can be expressed and integrated into host DNA in the male or female germlines, or both. Different models of expression and integration imply different proportions of TEs on sex chromosomes and autosomes. The density of recently retroposed human Alu elements is around three times higher on chromosome Y than on chromosome X, and over two times higher than the average density for all human autosomes. This implies Alu activity in paternal germlines. Analogous inter-chromosomal proportions for other repeat families should determine their compatibility with one of the three basic models describing the inheritance of TEs. Published evidence indicates that maternally and paternally imprinted genes roughly correspond to GC-rich and AT-rich DNA. This may explain the observed chromosomal distribution of Alu and L1 elements. Finally, paternal models of inheritance predict rapid accumulation of active TEs on chromosome Y. I will discuss potential implications of this phenomenon for evolution of chromosome Y and transposable elements.

  5. Recombination, rearrangement, reshuffling, and divergence in a centromeric region of rice.

    PubMed

    Ma, Jianxin; Bennetzen, Jeffrey L

    2006-01-10

    Centromeres have many unusual biological properties, including kinetochore attachment and severe repression of local meiotic recombination. These properties are partly an outcome, partly a cause, of unusual DNA structure in the centromeric region. Although several plant and animal genomes have been sequenced, most centromere sequences have not been completed or analyzed in depth. To shed light on the unique organization, variability, and evolution of centromeric DNA, detailed analysis of a 1.97-Mb sequence that includes centromere 8 (CEN8) of japonica rice was undertaken. Thirty-three long-terminal repeat (LTR)-retrotransposon families (including 11 previously unknown) were identified in the CEN8 region, totaling 245 elements and fragments that account for 67% of the region. The ratio of solo LTRs to intact elements in the CEN8 region is approximately 0.9:1, compared with approximately 2.2:1 in noncentromeric regions of rice. However, the ratio of solo LTRs to intact elements in the core of the CEN8 region ( approximately 2.5:1) is higher than in any other region investigated in rice, suggesting a hotspot for unequal recombination. Comparison of the CEN8 region of japonica and its orthologous segments from indica rice indicated that approximately 15% of the intact retrotransposons and solo LTRs were inserted into CEN8 after the divergence of japonica and indica from a common ancestor, compared with approximately 50% for previously studied euchromatic regions. Frequent DNA rearrangements were observed in the CEN8 region, including a 212-kb subregion that was found to be composed of three rearranged tandem repeats. Phylogenetic analysis also revealed recent segmental duplication and extensive rearrangement and reshuffling of the CentO satellite repeats.

  6. The 2.1-kb inverted repeat DNA sequences flank the mat2,3 silent region in two species of Schizosaccharomyces and are involved in epigenetic silencing in Schizosaccharomyces pombe.

    PubMed Central

    Singh, Gurjeet; Klar, Amar J S

    2002-01-01

    The mat2,3 region of the fission yeast Schizosaccharomyces pombe exhibits a phenomenon of transcriptional silencing. This region is flanked by two identical DNA sequence elements, 2.1 kb in length, present in inverted orientation: IRL on the left and IRR on the right of the silent region. The repeats do not encode any ORF. The inverted repeat DNA region is also present in a newly identified related species, which we named S. kambucha. Interestingly, the left and right repeats share perfect identity within a species, but show approximately 2% bases interspecies variation. Deletion of IRL results in variegated expression of markers inserted in the silent region, while deletion of the IRR causes their derepression. When deletions of these repeats were genetically combined with mutations in different trans-acting genes previously shown to cause a partial defect in silencing, only mutations in clr1 and clr3 showed additive defects in silencing with the deletion of IRL. The rate of mat1 switching is also affected by deletion of repeats. The IRL or IRR deletion did not cause significant derepression of the mat2 or mat3 loci. These results implicate repeats for maintaining full repression of the mat2,3 region, for efficient mat1 switching, and further support the notion that multiple pathways cooperate to silence the mat2,3 domain. PMID:12399374

  7. Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing.

    PubMed

    Hribová, Eva; Neumann, Pavel; Matsumoto, Takashi; Roux, Nicolas; Macas, Jirí; Dolezel, Jaroslav

    2010-09-16

    Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic diversity studies and in marker-assisted selection.

  8. Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing

    PubMed Central

    2010-01-01

    Background Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. Results In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. Conclusion A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic diversity studies and in marker-assisted selection. PMID:20846365

  9. Novel Structure of Ty3 Reverse Transcriptase | Center for Cancer Research

    Cancer.gov

    Retrotransposons are mobile genetic elements that self amplify via a single-stranded RNA intermediate, which is converted to double-stranded DNA by an encoded reverse transcriptase (RT) with both DNA polymerase (pol) and ribonuclease H (RNase) activities. Categorized by whether they contain flanking long terminal repeat (LTR) sequences, retrotransposons play a critical role in the architecture of eukaryotic genomes and are the evolutionary origin of retroviruses, including human immunodeficiency virus (HIV).

  10. cDNA sequence and expression of a cold-responsive gene in Citrus unshiu.

    PubMed

    Hara, M; Wakasugi, Y; Ikoma, Y; Yano, M; Ogawa, K; Kuboi, T

    1999-02-01

    A cDNA clone encoding a protein (CuCOR19), the sequence of which is similar to Poncirus COR19, of the dehydrin family was isolated from the epicarp of Citrus unshiu. The molecular mass of the predicted protein was 18,980 daltons. CuCOR19 was highly hydrophilic and contained three repeating elements including Lys-rich motifs. The gene expression in leaves increased by cold stress.

  11. Loop-mediated isothermal amplification (LAMP): early detection of Toxoplasma gondii infection in mice.

    PubMed

    Kong, Qing-Ming; Lu, Shao-Hong; Tong, Qun-Bo; Lou, Di; Chen, Rui; Zheng, Bin; Kumagai, Takashi; Wen, Li-Yong; Ohta, Nobuo; Zhou, Xiao-Nong

    2012-01-03

    Toxoplasmosis is a widespread zoonotic parasitic disease that occurs in both animals and humans. Traditional molecular assays are often difficult to perform, especially for the early diagnosis of Toxoplasma gondii infections. Here, we established a novel loop-mediated isothermal amplification targeting the 529 bp repeat element (529 bp-LAMP) to detect T. gondii DNA in blood samples of experimental mice infected with tachyzoites of the RH strain. The assay was performed with Bst DNA polymerase at 65°C for 1 h. The detection limit of the 529 bp-LAMP assay was as low as 0.6 fg of T. gondii DNA. The sensitivity of this assay was 100 and 1000 fold higher than that of the LAMP targeting B1 gene (B1-LAMP) and nested PCR targeting 529 bp repeat element (529 bp-nested PCR), respectively. The specificity of the 529 bp-LAMP assay was determined using the DNA samples of Trypanosoma evansi, Plasmodium falciparum, Paragonimus westermani, Schistosoma japonicum, Fasciola hepatica and Angiostrongylus cantonensis. No cross-reactivity with the DNA of any parasites was found. The assay was able to detect T. gondii DNA in all mouse blood samples at one day post infection (dpi). We report the following findings: (i) The detection limit of the 529 bp-LAMP assay is 0.6 fg of T. gondii DNA; (ii) The assay does not involve any cross-reactivity with the DNA of other parasites; (iii) This is the first report on the application of the LAMP assay for early diagnosis of toxoplasmosis in blood samples from experimentally infected mice. Due to its simplicity, sensitivity and cost-effectiveness for common use, we suggest that this assay should be used as an early diagnostic tool for health control of toxoplasmosis.

  12. Clustered regularly interspaced short palindromic repeats (CRISPRs): the hallmark of an ingenious antiviral defense mechanism in prokaryotes.

    PubMed

    Al-Attar, Sinan; Westra, Edze R; van der Oost, John; Brouns, Stan J J

    2011-04-01

    Many prokaryotes contain the recently discovered defense system against mobile genetic elements. This defense system contains a unique type of repetitive DNA stretches, termed Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs). CRISPRs consist of identical repeated DNA sequences (repeats), interspaced by highly variable sequences referred to as spacers. The spacers originate from either phages or plasmids and comprise the prokaryotes' 'immunological memory'. CRISPR-associated (cas) genes encode conserved proteins that together with CRISPRs make-up the CRISPR/Cas system, responsible for defending the prokaryotic cell against invaders. CRISPR-mediated resistance has been proposed to involve three stages: (i) CRISPR-Adaptation, the invader DNA is encountered by the CRISPR/Cas machinery and an invader-derived short DNA fragment is incorporated in the CRISPR array. (ii) CRISPR-Expression, the CRISPR array is transcribed and the transcript is processed by Cas proteins. (iii) CRISPR-Interference, the invaders' nucleic acid is recognized by complementarity to the crRNA and neutralized. An application of the CRISPR/Cas system is the immunization of industry-relevant prokaryotes (or eukaryotes) against mobile-genetic invasion. In addition, the high variability of the CRISPR spacer content can be exploited for phylogenetic and evolutionary studies. Despite impressive progress during the last couple of years, the elucidation of several fundamental details will be a major challenge in future research.

  13. Whole DNA methylome profiling in mice exposed to secondhand smoke

    PubMed Central

    Tommasi, Stella; Zheng, Albert; Yoon, Jae-In; Li, Arthur Xuejun; Wu, Xiwei; Besaratinia, Ahmad

    2012-01-01

    Aberration of DNA methylation is a prime epigenetic mechanism of carcinogenesis. Aberrant DNA methylation occurs frequently in lung cancer, with exposure to secondhand smoke (SHS) being an established risk factor. The causal role of SHS in the genesis of lung cancer, however, remains elusive. To investigate whether SHS can cause aberrant DNA methylation in vivo, we have constructed the whole DNA methylome in mice exposed to SHS for a duration of 4 mo, both after the termination of exposure and at ensuing intervals post-exposure (up to 10 mo). Our genome-wide and gene-specific profiling of DNA methylation in the lung of SHS-exposed mice revealed that all groups of SHS-exposed mice and controls share a similar pattern of DNA methylation. Furthermore, the methylation status of major repetitive DNA elements, including long-interspersed nuclear elements (LINE L1), intracisternal A particle long-terminal repeat retrotransposons (IAP-LTR), and short-interspersed nuclear elements (SINE B1), in the lung of all groups of SHS-exposed mice and controls remains comparable. The absence of locus-specific gain of DNA methylation and global loss of DNA methylation in the lung of SHS-exposed mice within a timeframe that precedes neoplastic-lesion formation underscore the challenges of lung cancer biomarker development. Identifying the initiating events that cause aberrant DNA methylation in lung carcinogenesis may help improve future strategies for prevention, early detection and treatment of this highly lethal disease. PMID:23051858

  14. First Insights into the Large Genome of Epimedium sagittatum (Sieb. et Zucc) Maxim, a Chinese Traditional Medicinal Plant

    PubMed Central

    Liu, Di; Zeng, Shao-Hua; Chen, Jian-Jun; Zhang, Yan-Jun; Xiao, Gong; Zhu, Lin-Yao; Wang, Ying

    2013-01-01

    Epimedium sagittatum (Sieb. et Zucc) Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12). However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE) repeats identified (65.37% of all TE repeats), particularly LTR (Long Terminal Repeat) retrotransposons (52.27% of all TE repeats). Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant. PMID:23807511

  15. Active Site Sharing and Subterminal Hairpin Recognition in a New Class of DNA Transposases

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ronning, Donald R.; Guynet, Catherine; Ton-Hoang, Bao

    2010-07-20

    Many bacteria harbor simple transposable elements termed insertion sequences (IS). In Helicobacter pylori, the chimeric IS605 family elements are particularly interesting due to their proximity to genes encoding gastric epithelial invasion factors. Protein sequences of IS605 transposases do not bear the hallmarks of other well-characterized transposases. We have solved the crystal structure of full-length transposase (TnpA) of a representative member, ISHp608. Structurally, TnpA does not resemble any characterized transposase; rather, it is related to rolling circle replication (RCR) proteins. Consistent with RCR, Mg{sup 2+} and a conserved tyrosine, Tyr127, are essential for DNA nicking and the formation of a covalentmore » intermediate between TnpA and DNA. TnpA is dimeric, contains two shared active sites, and binds two DNA stem loops representing the conserved inverted repeats near each end of ISHp608. The cocrystal structure with stem-loop DNA illustrates how this family of transposases specifically recognizes and pairs ends, necessary steps during transposition.« less

  16. Mobility and generation of mosaic non-autonomous transposons by Tn3-derived inverted-repeat miniature elements (TIMEs).

    PubMed

    Szuplewska, Magdalena; Ludwiczak, Marta; Lyzwa, Katarzyna; Czarnecki, Jakub; Bartosik, Dariusz

    2014-01-01

    Functional transposable elements (TEs) of several Pseudomonas spp. strains isolated from black shale ore of Lubin mine and from post-flotation tailings of Zelazny Most in Poland, were identified using a positive selection trap plasmid strategy. This approach led to the capture and characterization of (i) 13 insertion sequences from 5 IS families (IS3, IS5, ISL3, IS30 and IS1380), (ii) isoforms of two Tn3-family transposons--Tn5563a and Tn4662a (the latter contains a toxin-antitoxin system), as well as (iii) non-autonomous TEs of diverse structure, ranging in size from 262 to 3892 bp. The non-autonomous elements transposed into AT-rich DNA regions and generated 5- or 6-bp sequence duplications at the target site of transposition. Although these TEs lack a transposase gene, they contain homologous 38-bp-long terminal inverted repeat sequences (IRs), highly conserved in Tn5563a and many other Tn3-family transposons. The simplest elements of this type, designated TIMEs (Tn3 family-derived Inverted-repeat Miniature Elements) (262 bp), were identified within two natural plasmids (pZM1P1 and pLM8P2) of Pseudomonas spp. It was demonstrated that TIMEs are able to mobilize segments of plasmid DNA for transposition, which results in the generation of more complex non-autonomous elements, resembling IS-driven composite transposons in structure. Such transposon-like elements may contain different functional genetic modules in their core regions, including plasmid replication systems. Another non-autonomous element "captured" with a trap plasmid was a TIME derivative containing a predicted resolvase gene and a res site typical for many Tn3-family transposons. The identification of a portable site-specific recombination system is another intriguing example confirming the important role of non-autonomous TEs of the TIME family in shuffling genetic information in bacterial genomes. Transposition of such mosaic elements may have a significant impact on diversity and evolution, not only of transposons and plasmids, but also of other types of mobile genetic elements.

  17. Use of a Drosophila Genome-Wide Conserved Sequence Database to Identify Functionally Related cis-Regulatory Enhancers

    PubMed Central

    Brody, Thomas; Yavatkar, Amarendra S; Kuzin, Alexander; Kundu, Mukta; Tyson, Leonard J; Ross, Jermaine; Lin, Tzu-Yang; Lee, Chi-Hon; Awasaki, Takeshi; Lee, Tzumin; Odenwald, Ward F

    2012-01-01

    Background: Phylogenetic footprinting has revealed that cis-regulatory enhancers consist of conserved DNA sequence clusters (CSCs). Currently, there is no systematic approach for enhancer discovery and analysis that takes full-advantage of the sequence information within enhancer CSCs. Results: We have generated a Drosophila genome-wide database of conserved DNA consisting of >100,000 CSCs derived from EvoPrints spanning over 90% of the genome. cis-Decoder database search and alignment algorithms enable the discovery of functionally related enhancers. The program first identifies conserved repeat elements within an input enhancer and then searches the database for CSCs that score highly against the input CSC. Scoring is based on shared repeats as well as uniquely shared matches, and includes measures of the balance of shared elements, a diagnostic that has proven to be useful in predicting cis-regulatory function. To demonstrate the utility of these tools, a temporally-restricted CNS neuroblast enhancer was used to identify other functionally related enhancers and analyze their structural organization. Conclusions: cis-Decoder reveals that co-regulating enhancers consist of combinations of overlapping shared sequence elements, providing insights into the mode of integration of multiple regulating transcription factors. The database and accompanying algorithms should prove useful in the discovery and analysis of enhancers involved in any developmental process. Developmental Dynamics 241:169–189, 2012. © 2011 Wiley Periodicals, Inc. Key findings A genome-wide catalog of Drosophila conserved DNA sequence clusters. cis-Decoder discovers functionally related enhancers. Functionally related enhancers share balanced sequence element copy numbers. Many enhancers function during multiple phases of development. PMID:22174086

  18. A novel hAT element in Bombyx mori and Rhodnius prolixus: its relationship with miniature inverted repeat transposable elements (MITEs) and horizontal transfer.

    PubMed

    Zhang, H-H; Shen, Y-H; Xu, H-E; Liang, H-Y; Han, M-J; Zhang, Z

    2013-10-01

    Comparative analysis of transposable elements (TEs) from different species can make it possible to reconstruct their history over evolutionary time. In this study, we identified a novel hAT element in Bombyx mori and Rhodnius prolixus with characteristic GGGCGGCA repeats in its subterminal region. Meanwhile, phylogenetic analysis demonstrated that the elements in these two species might represent a separate cluster of the hAT superfamily. Strikingly, a previously identified miniature inverted repeat transposable element (MITE) shared high identity with this autonomous element across the entire length, supporting the hypothesis that MITEs are derived from the internal deletion of DNA transposons. Interestingly, identity of the consensus sequences of this novel hAT element between B. mori and R. prolixus, which diverged about 370 million years ago, was as high as 96.5% over their full length (about 3.6 kb) at the nucleotide level. The patchy distribution amongst species, coupled with overall lack of intense purifying selection acting on this element, suggest that this novel hAT element might have experienced horizontal transfer between the ancestors of B. mori and R. prolixus. Our results highlight that this novel hAT element could be used as a potential tool for germline transformation of R. prolixus to control the transmission of Trypanosoma cruzi, which causes Chagas disease. © 2013 Royal Entomological Society.

  19. Genomic Heat Shock Element Sequences Drive Cooperative Human Heat Shock Factor 1 DNA Binding and Selectivity*

    PubMed Central

    Jaeger, Alex M.; Makley, Leah N.; Gestwicki, Jason E.; Thiele, Dennis J.

    2014-01-01

    The heat shock transcription factor 1 (HSF1) activates expression of a variety of genes involved in cell survival, including protein chaperones, the protein degradation machinery, anti-apoptotic proteins, and transcription factors. Although HSF1 activation has been linked to amelioration of neurodegenerative disease, cancer cells exhibit a dependence on HSF1 for survival. Indeed, HSF1 drives a program of gene expression in cancer cells that is distinct from that activated in response to proteotoxic stress, and HSF1 DNA binding activity is elevated in cycling cells as compared with arrested cells. Active HSF1 homotrimerizes and binds to a DNA sequence consisting of inverted repeats of the pentameric sequence nGAAn, known as heat shock elements (HSEs). Recent comprehensive ChIP-seq experiments demonstrated that the architecture of HSEs is very diverse in the human genome, with deviations from the consensus sequence in the spacing, orientation, and extent of HSE repeats that could influence HSF1 DNA binding efficacy and the kinetics and magnitude of target gene expression. To understand the mechanisms that dictate binding specificity, HSF1 was purified as either a monomer or trimer and used to evaluate DNA-binding site preferences in vitro using fluorescence polarization and thermal denaturation profiling. These results were compared with quantitative chromatin immunoprecipitation assays in vivo. We demonstrate a role for specific orientations of extended HSE sequences in driving preferential HSF1 DNA binding to target loci in vivo. These studies provide a biochemical basis for understanding differential HSF1 target gene recognition and transcription in neurodegenerative disease and in cancer. PMID:25204655

  20. A theory that may explain the Hayflick limit--a means to delete one copy of a repeating sequence during each cell cycle in certain human cells such as fibroblasts.

    PubMed

    Naveilhan, P; Baudet, C; Jabbour, W; Wion, D

    1994-09-01

    A model that may explain the limited division potential of certain cells such as human fibroblasts in culture is presented. The central postulate of this theory is that there exists, prior to certain key exons that code for materials needed for cell division, a unique sequence of specific repeating segments of DNA. One copy of such repeating segments is deleted during each cell cycle in cells that are not protected from such deletion through methylation of their cytosine residues. According to this theory, the means through which such repeated sequences are removed, one per cycle, is through the sequential action of enzymes that act much as bacterial restriction enzymes do--namely to produce scissions in both strands of DNA in areas that correspond to the DNA base sequence recognition specificities of such enzymes. After the first scission early in a replicative cycle, that enzyme becomes inhibited, but the cleavage of the first site exposes the closest site in the repetitive element to the action of a second restriction enzyme after which that enzyme also becomes inhibited. Then repair occurs, regenerating the original first site. Through this sequential activation and inhibition of two different restriction enzymes, only one copy of the repeating sequence is deleted during each cell cycle. In effect, the repeating sequence operates as a precise counter of the numbers of cell doubling that have occurred since the cells involved differentiated during development.

  1. Diversity, Evolution, and Functionality of Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) Regions in the Fire Blight Pathogen Erwinia amylovora▿†

    PubMed Central

    Rezzonico, Fabio; Smits, Theo H. M.; Duffy, Brion

    2011-01-01

    The clustered regularly interspaced short palindromic repeat (CRISPR)/Cas system confers acquired heritable immunity against mobile nucleic acid elements in prokaryotes, limiting phage infection and horizontal gene transfer of plasmids. In CRISPR arrays, characteristic repeats are interspersed with similarly sized nonrepetitive spacers derived from transmissible genetic elements and acquired when the cell is challenged with foreign DNA. New spacers are added sequentially and the number and type of CRISPR units can differ among strains, providing a record of phage/plasmid exposure within a species and giving a valuable typing tool. The aim of this work was to investigate CRISPR diversity in the highly homogeneous species Erwinia amylovora, the causal agent of fire blight. A total of 18 CRISPR genotypes were defined within a collection of 37 cosmopolitan strains. Strains from Spiraeoideae plants clustered in three major groups: groups II and III were composed exclusively of bacteria originating from the United States, whereas group I generally contained strains of more recent dissemination obtained in Europe, New Zealand, and the Middle East. Strains from Rosoideae and Indian hawthorn (Rhaphiolepis indica) clustered separately and displayed a higher intrinsic diversity than that of isolates from Spiraeoideae plants. Reciprocal exclusion was generally observed between plasmid content and cognate spacer sequences, supporting the role of the CRISPR/Cas system in protecting against foreign DNA elements. However, in several group III strains, retention of plasmid pEU30 is inconsistent with a functional CRISPR/Cas system. PMID:21460108

  2. Diversity, evolution, and functionality of clustered regularly interspaced short palindromic repeat (CRISPR) regions in the fire blight pathogen Erwinia amylovora.

    PubMed

    Rezzonico, Fabio; Smits, Theo H M; Duffy, Brion

    2011-06-01

    The clustered regularly interspaced short palindromic repeat (CRISPR)/Cas system confers acquired heritable immunity against mobile nucleic acid elements in prokaryotes, limiting phage infection and horizontal gene transfer of plasmids. In CRISPR arrays, characteristic repeats are interspersed with similarly sized nonrepetitive spacers derived from transmissible genetic elements and acquired when the cell is challenged with foreign DNA. New spacers are added sequentially and the number and type of CRISPR units can differ among strains, providing a record of phage/plasmid exposure within a species and giving a valuable typing tool. The aim of this work was to investigate CRISPR diversity in the highly homogeneous species Erwinia amylovora, the causal agent of fire blight. A total of 18 CRISPR genotypes were defined within a collection of 37 cosmopolitan strains. Strains from Spiraeoideae plants clustered in three major groups: groups II and III were composed exclusively of bacteria originating from the United States, whereas group I generally contained strains of more recent dissemination obtained in Europe, New Zealand, and the Middle East. Strains from Rosoideae and Indian hawthorn (Rhaphiolepis indica) clustered separately and displayed a higher intrinsic diversity than that of isolates from Spiraeoideae plants. Reciprocal exclusion was generally observed between plasmid content and cognate spacer sequences, supporting the role of the CRISPR/Cas system in protecting against foreign DNA elements. However, in several group III strains, retention of plasmid pEU30 is inconsistent with a functional CRISPR/Cas system.

  3. Identification of multiple binding sites for the THAP domain of the Galileo transposase in the long terminal inverted-repeats.

    PubMed

    Marzo, Mar; Liu, Danxu; Ruiz, Alfredo; Chalmers, Ronald

    2013-08-01

    Galileo is a DNA transposon responsible for the generation of several chromosomal inversions in Drosophila. In contrast to other members of the P-element superfamily, it has unusually long terminal inverted-repeats (TIRs) that resemble those of Foldback elements. To investigate the function of the long TIRs we derived consensus and ancestral sequences for the Galileo transposase in three species of Drosophilids. Following gene synthesis, we expressed and purified their constituent THAP domains and tested their binding activity towards the respective Galileo TIRs. DNase I footprinting located the most proximal DNA binding site about 70 bp from the transposon end. Using this sequence we identified further binding sites in the tandem repeats that are found within the long TIRs. This suggests that the synaptic complex between Galileo ends may be a complicated structure containing higher-order multimers of the transposase. We also attempted to reconstitute Galileo transposition in Drosophila embryos but no events were detected. Thus, although the limited numbers of Galileo copies in each genome were sufficient to provide functional consensus sequences for the THAP domains, they do not specify a fully active transposase. Since the THAP recognition sequence is short, and will occur many times in a large genome, it seems likely that the multiple binding sites within the long, internally repetitive, TIRs of Galileo and other Foldback-like elements may provide the transposase with its binding specificity. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.

  4. [Active miniature inverted-repeat transposable elements transposon in plants: a review].

    PubMed

    Hu, Bingjie; Zhou, Mingbing

    2018-02-25

    Miniature inverted-repeat transposable elements transposon is a special transposon that could transpose by "cut-paste" mechanism, which is one of characteristics of DNA transposons. Otherwise, the copy number of MITEs is very high, which is one of characteristics of RNA transposons. Many MITE families have been reported, but little about active MITEs. We summarize recent advances in studying active MITEs. Most the MITEs belong to the Tourist-like family, such as mPing, mGing, PhTourist1, Tmi1 and PhTst-3. Additionally, DTstu1 and MITE-39 belong to Stowaway-like family, and AhMITEs1 belongs to Mutator-like family. Moreover, we summarize the structure (terminal inverse repeats and target site duplications), copy number, evolution pattern and transposition characteristics of these active MITEs, to provide the foundation for the identification of other active MITEs and subsequent research on MITE transposition and amplification mechanism.

  5. The human immunodeficiency virus type 1 long terminal repeat specifies two different transcription complexes, only one of which is regulated by Tat.

    PubMed Central

    Lu, X; Welsh, T M; Peterlin, B M

    1993-01-01

    The human immunodeficiency virus type 1 long terminal repeat sets up two different transcription complexes, which have been called processive and nonprocessive complexes. By mutating and substituting cis-acting sequences, we mapped elements of the human immunodeficiency virus long terminal repeat that are responsible for creating each transcription complex. Whereas processive complexes are efficiently assembled by upstream promoter elements in the absence of the TATA box, nonprocessive complexes absolutely require the TATA box. Moreover, the TATA box alone can set up these nonprocessive complexes, and nonprocessive but not processive complexes are trans activated by Tat. Finally, a strong DNA-binding site between the TATA box and trans-activation-responsive region interferes with either the assembly or movement of these nonprocessive complexes and diminishes the effects of Tat. Thus, Tat affects a critical step in the formation of elongation-competent transcription complexes. Images PMID:8445708

  6. High copy number of highly similar mariner-like transposons in planarian (Platyhelminthe): evidence for a trans-phyla horizontal transfer.

    PubMed

    Garcia-Fernàndez, J; Bayascas-Ramírez, J R; Marfany, G; Muñoz-Mármol, A M; Casali, A; Baguñà, J; Saló, E

    1995-05-01

    Several DNA sequences similar to the mariner element were isolated and characterized in the platyhelminthe Dugesia (Girardia) tigrina. They were 1,288 bp long, flanked by two 32 bp-inverted repeats, and contained a single 339 amino acid open-reading frame (ORF) encoding the transposase. The number of copies of this element is approximately 8,000 per haploid genome, constituting a member of the middle-repetitive DNA of Dugesia tigrina. Sequence analysis of several elements showed a high percentage of conservation between the different copies. Most of them presented an intact ORF and the standard signals of actively expressed genes, which suggests that some of them are or have recently been functional transposons. The high degree of similarity shared with other mariner elements from some arthropods, together with the fact that this element is undetectable in other planarian species, strongly suggests a case of horizontal transfer between these two distant phyla.

  7. Palindromic repetitive DNA elements with coding potential in Methanocaldococcus jannaschii.

    PubMed

    Suyama, Mikita; Lathe, Warren C; Bork, Peer

    2005-10-10

    We have identified 141 novel palindromic repetitive elements in the genome of euryarchaeon Methanocaldococcus jannaschii. The total length of these elements is 14.3kb, which corresponds to 0.9% of the total genomic sequence and 6.3% of all extragenic regions. The elements can be divided into three groups (MJRE1-3) based on the sequence similarity. The low sequence identity within each of the groups suggests rather old origin of these elements in M. jannaschii. Three MJRE2 elements were located within the protein coding regions without disrupting the coding potential of the host genes, indicating that insertion of repeats might be a widespread mechanism to enhance sequence diversity in coding regions.

  8. Tyrosine Recombinase Retrotransposons and Transposons.

    PubMed

    Poulter, Russell T M; Butler, Margi I

    2015-04-01

    Retrotransposons carrying tyrosine recombinases (YR) are widespread in eukaryotes. The first described tyrosine recombinase mobile element, DIRS1, is a retroelement from the slime mold Dictyostelium discoideum. The YR elements are bordered by terminal repeats related to their replication via free circular dsDNA intermediates. Site-specific recombination is believed to integrate the circle without creating duplications of the target sites. Recently a large number of YR retrotransposons have been described, including elements from fungi (mucorales and basidiomycetes), plants (green algae) and a wide range of animals including nematodes, insects, sea urchins, fish, amphibia and reptiles. YR retrotransposons can be divided into three major groups: the DIRS elements, PAT-like and the Ngaro elements. The three groups form distinct clades on phylogenetic trees based on alignments of reverse transcriptase/ribonuclease H (RT/RH) and YR sequences, and also having some structural distinctions. A group of eukaryote DNA transposons, cryptons, also carry tyrosine recombinases. These DNA transposons do not encode a reverse transcriptase. They have been detected in several pathogenic fungi and oomycetes. Sequence comparisons suggest that the crypton YRs are related to those of the YR retrotransposons. We suggest that the YR retrotransposons arose from the combination of a crypton-like YR DNA transposon and the RT/RH encoding sequence of a retrotransposon. This acquisition must have occurred at a very early point in the evolution of eukaryotes.

  9. Evolutionary genomics of miniature inverted-repeat transposable elements (MITEs) in Brassica.

    PubMed

    Nouroz, Faisal; Noreen, Shumaila; Heslop-Harrison, J S

    2015-12-01

    Miniature inverted-repeat transposable elements (MITEs) are truncated derivatives of autonomous DNA transposons, and are dispersed abundantly in most eukaryotic genomes. We aimed to characterize various MITEs families in Brassica in terms of their presence, sequence characteristics and evolutionary activity. Dot plot analyses involving comparison of homoeologous bacterial artificial chromosome (BAC) sequences allowed identification of 15 novel families of mobile MITEs. Of which, 5 were Stowaway-like with TA Target Site Duplications (TSDs), 4 Tourist-like with TAA/TTA TSDs, 5 Mutator-like with 9-10 bp TSDs and 1 novel MITE (BoXMITE1) flanked by 3 bp TSDs. Our data suggested that there are about 30,000 MITE-related sequences in Brassica rapa and B. oleracea genomes. In situ hybridization showed one abundant family was dispersed in the A-genome, while another was located near 45S rDNA sites. PCR analysis using primers flanking sequences of MITE elements detected MITE insertion polymorphisms between and within the three Brassica (AA, BB, CC) genomes, with many insertions being specific to single genomes and others showing evidence of more recent evolutionary insertions. Our BAC sequence comparison strategy enables identification of evolutionarily active MITEs with no prior knowledge of MITE sequences. The details of MITE families reported in Brassica enable their identification, characterization and annotation. Insertion polymorphisms of MITEs and their transposition activity indicated important mechanism of genome evolution and diversification. MITE families derived from known Mariner, Harbinger and Mutator DNA transposons were discovered, as well as some novel structures. The identification of Brassica MITEs will have broad applications in Brassica genomics, breeding, hybridization and phylogeny through their use as DNA markers.

  10. Conserved DNA motifs in the type II-A CRISPR leader region.

    PubMed

    Van Orden, Mason J; Klein, Peter; Babu, Kesavan; Najar, Fares Z; Rajan, Rakhi

    2017-01-01

    The Clustered Regularly Interspaced Short Palindromic Repeats associated (CRISPR-Cas) systems consist of RNA-protein complexes that provide bacteria and archaea with sequence-specific immunity against bacteriophages, plasmids, and other mobile genetic elements. Bacteria and archaea become immune to phage or plasmid infections by inserting short pieces of the intruder DNA (spacer) site-specifically into the leader-repeat junction in a process called adaptation. Previous studies have shown that parts of the leader region, especially the 3' end of the leader, are indispensable for adaptation. However, a comprehensive analysis of leader ends remains absent. Here, we have analyzed the leader, repeat, and Cas proteins from 167 type II-A CRISPR loci. Our results indicate two distinct conserved DNA motifs at the 3' leader end: ATTTGAG (noted previously in the CRISPR1 locus of Streptococcus thermophilus DGCC7710) and a newly defined CTRCGAG, associated with the CRISPR3 locus of S. thermophilus DGCC7710. A third group with a very short CG DNA conservation at the 3' leader end is observed mostly in lactobacilli. Analysis of the repeats and Cas proteins revealed clustering of these CRISPR components that mirrors the leader motif clustering, in agreement with the coevolution of CRISPR-Cas components. Based on our analysis of the type II-A CRISPR loci, we implicate leader end sequences that could confer site-specificity for the adaptation-machinery in the different subsets of type II-A CRISPR loci.

  11. Target Site Recognition by a Diversity-Generating Retroelement

    PubMed Central

    Guo, Huatao; Tse, Longping V.; Nieh, Angela W.; Czornyj, Elizabeth; Williams, Steven; Oukil, Sabrina; Liu, Vincent B.; Miller, Jeff F.

    2011-01-01

    Diversity-generating retroelements (DGRs) are in vivo sequence diversification machines that are widely distributed in bacterial, phage, and plasmid genomes. They function to introduce vast amounts of targeted diversity into protein-encoding DNA sequences via mutagenic homing. Adenine residues are converted to random nucleotides in a retrotransposition process from a donor template repeat (TR) to a recipient variable repeat (VR). Using the Bordetella bacteriophage BPP-1 element as a prototype, we have characterized requirements for DGR target site function. Although sequences upstream of VR are dispensable, a 24 bp sequence immediately downstream of VR, which contains short inverted repeats, is required for efficient retrohoming. The inverted repeats form a hairpin or cruciform structure and mutational analysis demonstrated that, while the structure of the stem is important, its sequence can vary. In contrast, the loop has a sequence-dependent function. Structure-specific nuclease digestion confirmed the existence of a DNA hairpin/cruciform, and marker coconversion assays demonstrated that it influences the efficiency, but not the site of cDNA integration. Comparisons with other phage DGRs suggested that similar structures are a conserved feature of target sequences. Using a kanamycin resistance determinant as a reporter, we found that transplantation of the IMH and hairpin/cruciform-forming region was sufficient to target the DGR diversification machinery to a heterologous gene. In addition to furthering our understanding of DGR retrohoming, our results suggest that DGRs may provide unique tools for directed protein evolution via in vivo DNA diversification. PMID:22194701

  12. Conserved DNA motifs in the type II-A CRISPR leader region

    PubMed Central

    Babu, Kesavan; Najar, Fares Z.

    2017-01-01

    The Clustered Regularly Interspaced Short Palindromic Repeats associated (CRISPR-Cas) systems consist of RNA-protein complexes that provide bacteria and archaea with sequence-specific immunity against bacteriophages, plasmids, and other mobile genetic elements. Bacteria and archaea become immune to phage or plasmid infections by inserting short pieces of the intruder DNA (spacer) site-specifically into the leader-repeat junction in a process called adaptation. Previous studies have shown that parts of the leader region, especially the 3′ end of the leader, are indispensable for adaptation. However, a comprehensive analysis of leader ends remains absent. Here, we have analyzed the leader, repeat, and Cas proteins from 167 type II-A CRISPR loci. Our results indicate two distinct conserved DNA motifs at the 3′ leader end: ATTTGAG (noted previously in the CRISPR1 locus of Streptococcus thermophilus DGCC7710) and a newly defined CTRCGAG, associated with the CRISPR3 locus of S. thermophilus DGCC7710. A third group with a very short CG DNA conservation at the 3′ leader end is observed mostly in lactobacilli. Analysis of the repeats and Cas proteins revealed clustering of these CRISPR components that mirrors the leader motif clustering, in agreement with the coevolution of CRISPR-Cas components. Based on our analysis of the type II-A CRISPR loci, we implicate leader end sequences that could confer site-specificity for the adaptation-machinery in the different subsets of type II-A CRISPR loci. PMID:28392985

  13. Pstl repeat: a family of short interspersed nucleotide element (SINE)-like sequences in the genomes of cattle, goat, and buffalo.

    PubMed

    Sheikh, Faruk G; Mukhopadhyay, Sudit S; Gupta, Prabhakar

    2002-02-01

    The PstI family of elements are short, highly repetitive DNA sequences interspersed throughout the genome of the Bovidae. We have cloned and sequenced some members of the PstI family from cattle, goat, and buffalo. These elements are approximately 500 bp, have a copy number of 2 x 10(5) - 4 x 10(5), and comprise about 4% of the haploid genome. Studies of nucleotide sequence homology indicate that the buffalo and goat PstI repeats (type II) are similar types of short interspersed nucleotide element (SINE) sequences, but the cattle PstI repeat (type I) is considerably more divergent. Additionally, the goat PstI sequence showed significant sequence homology with bovine serine tRNA, and is therefore likely derived from serine tRNA. Interestingly, Southern hybridization suggests that both types of SINEs (I and II) are present in all the species of Bovidae. Dendrogram analysis indicates that cattle PstI SINE is similar to bovine Alu-like SINEs. Goat and buffalo SINEs formed a separate cluster, suggesting that these two types of SINEs evolved separately in the genome of the Bovidae.

  14. Diversity and structure of PIF/Harbinger-like elements in the genome of Medicago truncatula

    PubMed Central

    Grzebelus, Dariusz; Lasota, Slawomir; Gambin, Tomasz; Kucherov, Gregory; Gambin, Anna

    2007-01-01

    Background Transposable elements constitute a significant fraction of plant genomes. The PIF/Harbinger superfamily includes DNA transposons (class II elements) carrying terminal inverted repeats and producing a 3 bp target site duplication upon insertion. The presence of an ORF coding for the DDE/DDD transposase, required for transposition, is characteristic for the autonomous PIF/Harbinger-like elements. Based on the above features, PIF/Harbinger-like elements were identified in several plant genomes and divided into several evolutionary lineages. Availability of a significant portion of Medicago truncatula genomic sequence allowed for mining PIF/Harbinger-like elements, starting from a single previously described element MtMaster. Results Twenty two putative autonomous, i.e. carrying an ORF coding for TPase and complete terminal inverted repeats, and 67 non-autonomous PIF/Harbinger-like elements were found in the genome of M. truncatula. They were divided into five families, MtPH-A5, MtPH-A6, MtPH-D,MtPH-E, and MtPH-M, corresponding to three previously identified and two new lineages. The largest families, MtPH-A6 and MtPH-M were further divided into four and three subfamilies, respectively. Non-autonomous elements were usually direct deletion derivatives of the putative autonomous element, however other types of rearrangements, including inversions and nested insertions were also observed. An interesting structural characteristic – the presence of 60 bp tandem repeats – was observed in a group of elements of subfamily MtPH-A6-4. Some families could be related to miniature inverted repeat elements (MITEs). The presence of empty loci (RESites), paralogous to those flanking the identified transposable elements, both autonomous and non-autonomous, as well as the presence of transposon insertion related size polymorphisms, confirmed that some of the mined elements were capable for transposition. Conclusion The population of PIF/Harbinger-like elements in the genome of M. truncatula is diverse. A detailed intra-family comparison of the elements' structure proved that they proliferated in the genome generally following the model of abortive gap repair. However, the presence of tandem repeats facilitated more pronounced rearrangements of the element internal regions. The insertion polymorphism of the MtPH elements and related MITE families in different populations of M. truncatula, if further confirmed experimentally, could be used as a source of molecular markers complementary to other marker systems. PMID:17996080

  15. Damage-induced ectopic recombination in the yeast Saccharomyces cerevisiae.

    PubMed

    Kupiec, M; Steinlauf, R

    1997-06-09

    Mitotic recombination in the yeast Saccharomyces cerevisiae is induced when cells are irradiated with UV or X-rays, reflecting the efficient repair of damage by recombinational repair mechanisms. We have used multiply marked haploid strains that allow the simultaneous detection of several types of ectopic recombination events. We show that inter-chromosomal ectopic conversion of lys2 heteroalleles and, to a lesser extent, direct repeat recombination (DRR) between non-tandem repeats, are increased by DNA-damaging agents; in contrast, ectopic recombination of the naturally occurring Ty element is not induced. We have tested several hypotheses that could explain the preferential lack of induction of Ty recombination by DNA-damaging agents. We have found that the lack of induction cannot be explained by a cell cycle control or by an effect of the mating-type genes. We also found no role for the flanking long terminal repeats (LTRs) of the Ty in preventing the induction. Ectopic conversion, DRR, and forward mutation of artificial repeats show different kinetics of induction at various positions of the cell cycle, reflecting different mechanisms of recombination. We discuss the mechanistic and evolutionary aspects of these results.

  16. Maternal phthalate exposure during pregnancy is associated with DNA methylation of LINE-1 and Alu repetitive elements in Mexican-American children

    PubMed Central

    Huen, Karen; Calafat, Antonia M.; Bradman, Asa; Yousefi, Paul; Eskenazi, Brenda; Holland, Nina

    2016-01-01

    Phthalates are frequently used in personal care products and plasticizers and phthalate exposure is ubiquitous in the US population. Exposure to phthalates during critical periods in utero has been associated with a variety of adverse health outcomes but the biological mechanisms linking these exposures with disease are not well characterized. In this study, we examined the relationship of in utero phthalate exposure with repetitive element DNA methylation, an epigenetic marker of genome instability, in children from the longitudinal birth cohort CHAMACOS. Methylation of Alu and long interspersed nucleotide elements (LINE-1) was determined using pyrosequencing of bisulfite-treated DNA isolated from whole blood samples collected from newborns and 9 year old children (n=355). Concentrations of eleven phthalate metabolites were measured in urine collected from pregnant mothers at 13 and 26 weeks gestation. We found a consistent inverse association between prenatal concentrations of monoethyl phthalate, the most frequently detected urinary metabolite, with cord blood methylation of Alu repeats (β(95%CI):−0.14(−0.28,0.00) and −0.16(−0.31,−0.02)) for early and late pregnancy, respectively, and a similar but weaker association with LINE-1 methylation. Additionally, increases in urinary concentrations of di-(2-ethylhexyl) phthalate metabolites during late pregnancy were associated with lower levels of methylation of Alu repeats in 9 year old blood (significant p-values ranged from 0.003 to 0.03). Our findings suggest that prenatal exposure to some phthalates may influence differences in repetitive element methylation, highlighting epigenetics as a plausible biological mechanism through which phthalates may affect health. PMID:27019040

  17. ProGeRF: Proteome and Genome Repeat Finder Utilizing a Fast Parallel Hash Function

    PubMed Central

    Moraes, Walas Jhony Lopes; Rodrigues, Thiago de Souza; Bartholomeu, Daniella Castanheira

    2015-01-01

    Repetitive element sequences are adjacent, repeating patterns, also called motifs, and can be of different lengths; repetitions can involve their exact or approximate copies. They have been widely used as molecular markers in population biology. Given the sizes of sequenced genomes, various bioinformatics tools have been developed for the extraction of repetitive elements from DNA sequences. However, currently available tools do not provide options for identifying repetitive elements in the genome or proteome, displaying a user-friendly web interface, and performing-exhaustive searches. ProGeRF is a web site for extracting repetitive regions from genome and proteome sequences. It was designed to be efficient, fast, and accurate and primarily user-friendly web tool allowing many ways to view and analyse the results. ProGeRF (Proteome and Genome Repeat Finder) is freely available as a stand-alone program, from which the users can download the source code, and as a web tool. It was developed using the hash table approach to extract perfect and imperfect repetitive regions in a (multi)FASTA file, while allowing a linear time complexity. PMID:25811026

  18. Genome-Wide Stochastic Adaptive DNA Amplification at Direct and Inverted DNA Repeats in the Parasite Leishmania

    PubMed Central

    Plourde, Marie; Gingras, Hélène; Roy, Gaétan; Lapointe, Andréanne; Leprohon, Philippe; Papadopoulou, Barbara; Corbeil, Jacques; Ouellette, Marc

    2014-01-01

    Gene amplification of specific loci has been described in all kingdoms of life. In the protozoan parasite Leishmania, the product of amplification is usually part of extrachromosomal circular or linear amplicons that are formed at the level of direct or inverted repeated sequences. A bioinformatics screen revealed that repeated sequences are widely distributed in the Leishmania genome and the repeats are chromosome-specific, conserved among species, and generally present in low copy number. Using sensitive PCR assays, we provide evidence that the Leishmania genome is continuously being rearranged at the level of these repeated sequences, which serve as a functional platform for constitutive and stochastic amplification (and deletion) of genomic segments in the population. This process is adaptive as the copy number of advantageous extrachromosomal circular or linear elements increases upon selective pressure and is reversible when selection is removed. We also provide mechanistic insights on the formation of circular and linear amplicons through RAD51 recombinase-dependent and -independent mechanisms, respectively. The whole genome of Leishmania is thus stochastically rearranged at the level of repeated sequences, and the selection of parasite subpopulations with changes in the copy number of specific loci is used as a strategy to respond to a changing environment. PMID:24844805

  19. Genome size diversity in angiosperms and its influence on gene space.

    PubMed

    Dodsworth, Steven; Leitch, Andrew R; Leitch, Ilia J

    2015-12-01

    Genome size varies c. 2400-fold in angiosperms (flowering plants), although the range of genome size is skewed towards small genomes, with a mean genome size of 1C=5.7Gb. One of the most crucial factors governing genome size in angiosperms is the relative amount and activity of repetitive elements. Recently, there have been new insights into how these repeats, previously discarded as 'junk' DNA, can have a significant impact on gene space (i.e. the part of the genome comprising all the genes and gene-related DNA). Here we review these new findings and explore in what ways genome size itself plays a role in influencing how repeats impact genome dynamics and gene space, including gene expression. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.

  20. Complete, Programmable Decoding of Oxidized 5-Methylcytosine Nucleobases in DNA by Chemoselective Blockage of Universal Transcription-Activator-Like Effector Repeats.

    PubMed

    Gieß, Mario; Witte, Anna; Jasper, Julia; Koch, Oliver; Summerer, Daniel

    2018-05-09

    5-Methylcytosine (5mC) and its oxidized derivatives are regulatory elements of mammalian genomes involved in development and disease. These nucleobases do not selectively modulate Watson-Crick pairing, preventing their programmable targeting and analysis by traditional hybridization probes. Transcription-activator-like effectors (TALEs) can be engineered for use as programmable probes with epigenetic nucleobase selectivity. However, only partial selectivities for oxidized 5mC have been achieved so far, preventing unambiguous target binding. We overcome this limitation by destroying and re-inducing nucleobase selectivity in TALEs via protein engineering and chemoselective nucleobase blocking. We engineer cavities in TALE repeats and identify a cavity that accommodates all eight human DNA nucleobases. We then introduce substituents with varying size, flexibility, and branching degree at each oxidized 5mC. Depending on the nucleobase, substituents with distinct properties effectively block TALE-binding and induce full nucleobase selectivity in the universal repeat. Successful transfer to affinity enrichment in a human genome background indicates that this approach enables the fully selective detection of each oxidized 5mC in complex DNA by programmable probes.

  1. Genome-Wide Negative Feedback Drives Transgenerational DNA Methylation Dynamics in Arabidopsis

    PubMed Central

    Kassam, Mohamed; Duvernois-Berthet, Evelyne; Cortijo, Sandra; Takashima, Kazuya; Saze, Hidetoshi; Toyoda, Atsushi; Fujiyama, Asao; Colot, Vincent; Kakutani, Tetsuji

    2015-01-01

    Epigenetic variations of phenotypes, especially those associated with DNA methylation, are often inherited over multiple generations in plants. The active and inactive chromatin states are heritable and can be maintained or even be amplified by positive feedback in a transgenerational manner. However, mechanisms controlling the transgenerational DNA methylation dynamics are largely unknown. As an approach to understand the transgenerational dynamics, we examined long-term effect of impaired DNA methylation in Arabidopsis mutants of the chromatin remodeler gene DDM1 (Decrease in DNA Methylation 1) through whole genome DNA methylation sequencing. The ddm1 mutation induces a drastic decrease in DNA methylation of transposable elements (TEs) and repeats in the initial generation, while also inducing ectopic DNA methylation at hundreds of loci. Unexpectedly, this ectopic methylation can only be seen after repeated self-pollination. The ectopic cytosine methylation is found primarily in the non-CG context and starts from 3’ regions within transcription units and spreads upstream. Remarkably, when chromosomes with reduced DNA methylation were introduced from a ddm1 mutant into a DDM1 wild-type background, the ddm1-derived chromosomes also induced analogous de novo accumulation of DNA methylation in trans. These results lead us to propose a model to explain the transgenerational DNA methylation redistribution by genome-wide negative feedback. The global negative feedback, together with local positive feedback, would ensure robust and balanced differentiation of chromatin states within the genome. PMID:25902052

  2. CRISPR-Cas systems target a diverse collection of invasive mobile genetic elements in human microbiomes

    PubMed Central

    2013-01-01

    Background Bacteria and archaea develop immunity against invading genomes by incorporating pieces of the invaders' sequences, called spacers, into a clustered regularly interspaced short palindromic repeats (CRISPR) locus between repeats, forming arrays of repeat-spacer units. When spacers are expressed, they direct CRISPR-associated (Cas) proteins to silence complementary invading DNA. In order to characterize the invaders of human microbiomes, we use spacers from CRISPR arrays that we had previously assembled from shotgun metagenomic datasets, and identify contigs that contain these spacers' targets. Results We discover 95,000 contigs that are putative invasive mobile genetic elements, some targeted by hundreds of CRISPR spacers. We find that oral sites in healthy human populations have a much greater variety of mobile genetic elements than stool samples. Mobile genetic elements carry genes encoding diverse functions: only 7% of the mobile genetic elements are similar to known phages or plasmids, although a much greater proportion contain phage- or plasmid-related genes. A small number of contigs share similarity with known integrative and conjugative elements, providing the first examples of CRISPR defenses against this class of element. We provide detailed analyses of a few large mobile genetic elements of various types, and a relative abundance analysis of mobile genetic elements and putative hosts, exploring the dynamic activities of mobile genetic elements in human microbiomes. A joint analysis of mobile genetic elements and CRISPRs shows that protospacer-adjacent motifs drive their interaction network; however, some CRISPR-Cas systems target mobile genetic elements lacking motifs. Conclusions We identify a large collection of invasive mobile genetic elements in human microbiomes, an important resource for further study of the interaction between the CRISPR-Cas immune system and invaders. PMID:23628424

  3. Organisation of the plant genome in chromosomes.

    PubMed

    Heslop-Harrison, J S Pat; Schwarzacher, Trude

    2011-04-01

    The plant genome is organized into chromosomes that provide the structure for the genetic linkage groups and allow faithful replication, transcription and transmission of the hereditary information. Genome sizes in plants are remarkably diverse, with a 2350-fold range from 63 to 149,000 Mb, divided into n=2 to n= approximately 600 chromosomes. Despite this huge range, structural features of chromosomes like centromeres, telomeres and chromatin packaging are well-conserved. The smallest genomes consist of mostly coding and regulatory DNA sequences present in low copy, along with highly repeated rDNA (rRNA genes and intergenic spacers), centromeric and telomeric repetitive DNA and some transposable elements. The larger genomes have similar numbers of genes, with abundant tandemly repeated sequence motifs, and transposable elements alone represent more than half the DNA present. Chromosomes evolve by fission, fusion, duplication and insertion events, allowing evolution of chromosome size and chromosome number. A combination of sequence analysis, genetic mapping and molecular cytogenetic methods with comparative analysis, all only becoming widely available in the 21st century, is elucidating the exact nature of the chromosome evolution events at all timescales, from the base of the plant kingdom, to intraspecific or hybridization events associated with recent plant breeding. As well as being of fundamental interest, understanding and exploiting evolutionary mechanisms in plant genomes is likely to be a key to crop development for food production. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.

  4. Characterization of human glucocorticoid receptor complexes formed with DNA fragments containing or lacking glucocorticoid response elements

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tully, D.B.; Cidlowski, J.A.

    1989-03-07

    Sucrose density gradient shift assays were used to study the interactions of human glucocorticoid receptors (GR) with small DNA fragments either containing or lacking glucocorticoid response element (GRE) DNA consensus sequences. When crude cytoplasmic extracts containing ({sup 3}H)triamcinolone acetonide (({sup 3}H)TA) labeled GR were incubated with unlabeled DNA under conditions of DNA excess, a GRE-containing DNA fragment obtained from the 5' long terminal repeat of mouse mammary tumor virus (MMTV LTR) formed a stable 12-16S complex with activated, but not nonactivated, ({sup 3}H)TA receptor. By contrast, if the cytosols were treated with calf thymus DNA-cellulose to deplete non-GR-DNA-binding proteins priormore » to heat activation, a smaller 7-10S complex was formed with the MMTV LTR DNA fragment. Activated ({sup 3}H)TA receptor from DNA-cellulose pretreated cytosols also interacted with two similarly sized fragments from pBR322 DNA. Stability of the complexes formed between GR and these three DNA fragments was strongly affected by even moderate alterations in either the salt concentration or the pH of the gradient buffer. Under all conditions tested, the complex formed with the MMTV LTR DNA fragment was more stable than the complexes formed with either of the pBR322 DNA fragments. Together these observations indicate that the formation of stable complexes between activated GR and isolated DNA fragments requires the presence of GRE consensus sequences in the DNA.« less

  5. Characterization of three active transposable elements recently inserted in three independent DFR-A alleles and one high-copy DNA transposon isolated from the Pink allele of the ANS gene in onion (Allium cepa L.).

    PubMed

    Kim, Sunggil; Park, Jee Young; Yang, Tae-Jin

    2015-06-01

    Intact retrotransposon and DNA transposons inserted in a single gene were characterized in onions (Allium cepa) and their transcription and copy numbers were estimated in this study. While analyzing diverse onion germplasm, large insertions in the DFR-A gene encoding dihydroflavonol 4-reductase (DFR) involved in the anthocyanin biosynthesis pathway were found in two accessions. A 5,070-bp long terminal repeat (LTR) retrotransposon inserted in the active DFR-A (R4) allele was identified from one of the large insertions and designated AcCOPIA1. An intact ORF encoded typical domains of copia-like LTR retrotransposons. However, AcCOPIA1 contained atypical 'TG' and 'TA' dinucleotides at the ends of the LTRs. A 4,615-bp DNA transposon was identified in the other large insertion. This DNA transposon, designated AcCACTA1, contained an ORF coding for a transposase showing homology with the CACTA superfamily transposable elements (TEs). Another 5,073-bp DNA transposon was identified from the DFR-A (TRN) allele. This DNA transposon, designated AchAT1, belonged to the hAT superfamily with short 4-bp terminal inverted repeats (TIRs). Finally, a 6,258-bp non-autonomous DNA transposon, designated AcPINK, was identified in the ANS-p allele encoding anthocyanidin synthase, the next downstream enzyme to DFR in the anthocyanin biosynthesis pathway. AcPINK also possessed very short 3-bp TIRs. Active transcription of AcCOPIA1, AcCACTA1, and AchAT1 was observed through RNA-Seq analysis and RT-PCR. The copy numbers of AcPINK estimated by mapping the genomic DNA reads produced by NextSeq 500 were predominantly high compared with the other TEs. A series of evidence indicated that these TEs might have transposed in these onion genes very recently, providing a stepping stone for elucidation of enormously large-sized onion genome structure.

  6. Plasmodium falciparum Nucleosomes Exhibit Reduced Stability and Lost Sequence Dependent Nucleosome Positioning

    PubMed Central

    Silberhorn, Elisabeth; Schwartz, Uwe; Symelka, Anne; de Koning-Ward, Tania; Längst, Gernot

    2016-01-01

    The packaging and organization of genomic DNA into chromatin represents an additional regulatory layer of gene expression, with specific nucleosome positions that restrict the accessibility of regulatory DNA elements. The mechanisms that position nucleosomes in vivo are thought to depend on the biophysical properties of the histones, sequence patterns, like phased di-nucleotide repeats and the architecture of the histone octamer that folds DNA in 1.65 tight turns. Comparative studies of human and P. falciparum histones reveal that the latter have a strongly reduced ability to recognize internal sequence dependent nucleosome positioning signals. In contrast, the nucleosomes are positioned by AT-repeat sequences flanking nucleosomes in vivo and in vitro. Further, the strong sequence variations in the plasmodium histones, compared to other mammalian histones, do not present adaptations to its AT-rich genome. Human and parasite histones bind with higher affinity to GC-rich DNA and with lower affinity to AT-rich DNA. However, the plasmodium nucleosomes are overall less stable, with increased temperature induced mobility, decreased salt stability of the histones H2A and H2B and considerable reduced binding affinity to GC-rich DNA, as compared with the human nucleosomes. In addition, we show that plasmodium histone octamers form the shortest known nucleosome repeat length (155bp) in vitro and in vivo. Our data suggest that the biochemical properties of the parasite histones are distinct from the typical characteristics of other eukaryotic histones and these properties reflect the increased accessibility of the P. falciparum genome. PMID:28033404

  7. Characterization of three DNA transposons in the Dutch elm disease fungi and evidence of repeat-induced point (RIP) mutations.

    PubMed

    Bouvet, Guillaume F; Jacobi, Volker; Bernier, Louis

    2007-05-01

    Transposable elements (TEs) are fundamental components of eukaryotic genomes and can contribute in various ways to genome plasticity and evolution. We describe here the first three DNA transposons in the Dutch elm disease (DED) pathogens Ophiostoma ulmi and O. novo-ulmi, named OPHIO1, OPHIO2 and OPHIO3. We demonstrate that OPHIO transposons, which show high homology to Fot1/pogo TEs within the Tc1/mariner superfamily, have different distribution patterns and specificity in the DED fungi and that interspecific hybrids could act as genetic bridges for transmission of TEs between closely related fungal species. OPHIO3 was found to have undergone repeat-induced point mutations (RIP). We have also developed a complementary method to Margolin's ratios based on the computation of cumulative transition scores (CTS) in order to visualize rapidly RIP signatures on individual DNA strands of OPHIO transposons and TEs found in other ascomycete fungi.

  8. Structural and biochemical analysis of nuclease domain of clustered regularly interspaced short palindromic repeat (CRISPR)-associated protein 3 (Cas3).

    PubMed

    Mulepati, Sabin; Bailey, Scott

    2011-09-09

    RNA transcribed from clustered regularly interspaced short palindromic repeats (CRISPRs) protects many prokaryotes from invasion by foreign DNA such as viruses, conjugative plasmids, and transposable elements. Cas3 (CRISPR-associated protein 3) is essential for this CRISPR protection and is thought to mediate cleavage of the foreign DNA through its N-terminal histidine-aspartate (HD) domain. We report here the 1.8 Å crystal structure of the HD domain of Cas3 from Thermus thermophilus HB8. Structural and biochemical studies predict that this enzyme binds two metal ions at its active site. We also demonstrate that the single-stranded DNA endonuclease activity of this T. thermophilus domain is activated not by magnesium but by transition metal ions such as manganese and nickel. Structure-guided mutagenesis confirms the importance of the metal-binding residues for the nuclease activity and identifies other active site residues. Overall, these results provide a framework for understanding the role of Cas3 in the CRISPR system.

  9. Rex and a Suppressor of Rex Are Repeated Neomorphic Loci in the Drosophila Melanogaster Ribosomal DNA

    PubMed Central

    Rasooly, R. S.; Robbins, L. G.

    1991-01-01

    The Rex locus of Drosophila melanogaster induces a high frequency of mitotic exchange between two separated ribosomal DNA arrays on a single chromosome. The exchanges take place in the progeny of Rex mothers and occur very early, before the third mitotic division. A number of common laboratory stocks have also been found to carry dominant suppressors of Rex (Su(Rex)). Rex was mapped to the X centric heterochromatin, proximal to su(f), by genetic and molecular analysis of two spontaneous recombinants. Using deficiencies and duplications of the heterochromatin, both Rex and one Su(Rex) were shown to behave as neomorphs. Rex-induced exchange in a target chromosome bearing both Rex and Su(Rex) was then used to map these functions to the bb locus itself. Molecular analysis of the recombinants, using length variants of the ribosomal DNA intergenic spacer as genetic markers, mapped Su(Rex) and Rex within the bb locus and demonstrated that both are repeated elements. PMID:1936953

  10. Rotifer rDNA-specific R9 retrotransposable elements generate an exceptionally long target site duplication upon insertion.

    PubMed

    Gladyshev, Eugene A; Arkhipova, Irina R

    2009-12-15

    Ribosomal DNA genes in many eukaryotes contain insertions of non-LTR retrotransposable elements belonging to the R2 clade. These elements persist in the host genomes by inserting site-specifically into multicopy target sites, thereby avoiding random disruption of single-copy host genes. Here we describe R9 retrotransposons from the R2 clade in the 28S RNA genes of bdelloid rotifers, small freshwater invertebrate animals best known for their long-term asexuality and for their ability to survive repeated cycles of desiccation and rehydration. While the structural organization of R9 elements is highly similar to that of other members of the R2 clade, they are characterized by two distinct features: site-specific insertion into a previously unreported target sequence within the 28S gene, and an unusually long target site duplication of 126 bp. We discuss the implications of these findings in the context of bdelloid genome organization and the mechanisms of target-primed reverse transcription.

  11. The Dfam database of repetitive DNA families.

    PubMed

    Hubley, Robert; Finn, Robert D; Clements, Jody; Eddy, Sean R; Jones, Thomas A; Bao, Weidong; Smit, Arian F A; Wheeler, Travis J

    2016-01-04

    Repetitive DNA, especially that due to transposable elements (TEs), makes up a large fraction of many genomes. Dfam is an open access database of families of repetitive DNA elements, in which each family is represented by a multiple sequence alignment and a profile hidden Markov model (HMM). The initial release of Dfam, featured in the 2013 NAR Database Issue, contained 1143 families of repetitive elements found in humans, and was used to produce more than 100 Mb of additional annotation of TE-derived regions in the human genome, with improved speed. Here, we describe recent advances, most notably expansion to 4150 total families including a comprehensive set of known repeat families from four new organisms (mouse, zebrafish, fly and nematode). We describe improvements to coverage, and to our methods for identifying and reducing false annotation. We also describe updates to the website interface. The Dfam website has moved to http://dfam.org. Seed alignments, profile HMMs, hit lists and other underlying data are available for download. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae.

    PubMed

    Macas, Jiří; Novák, Petr; Pellicer, Jaume; Čížková, Jana; Koblížková, Andrea; Neumann, Pavel; Fuková, Iva; Doležel, Jaroslav; Kelly, Laura J; Leitch, Ilia J

    2015-01-01

    The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.

  13. Epigenetic regulation of transcription and possible functions of mammalian short interspersed elements, SINEs.

    PubMed

    Ichiyanagi, Kenji

    2013-01-01

    Short interspersed elements (SINEs) are a class of retrotransposons, which amplify their copy numbers in their host genomes by retrotransposition. More than a million copies of SINEs are present in a mammalian genome, constituting over 10% of the total genomic sequence. In contrast to the other two classes of retrotransposons, long interspersed elements (LINEs) and long terminal repeat (LTR) elements, SINEs are transcribed by RNA polymerase III. However, like LINEs and LTR elements, the SINE transcription is likely regulated by epigenetic mechanisms such as DNA methylation, at least for human Alu and mouse B1. Whereas SINEs and other transposable elements have long been thought as selfish or junk DNA, recent studies have revealed that they play functional roles at their genomic locations, for example, as distal enhancers, chromatin boundaries and binding sites of many transcription factors. These activities imply that SINE retrotransposition has shaped the regulatory network and chromatin landscape of their hosts. Whereas it is thought that the epigenetic mechanisms were originated as a host defense system against proliferation of parasitic elements, this review discusses a possibility that the same mechanisms are also used to regulate the SINE-derived functions.

  14. Global mapping of DNA conformational flexibility on Saccharomyces cerevisiae.

    PubMed

    Menconi, Giulia; Bedini, Andrea; Barale, Roberto; Sbrana, Isabella

    2015-04-01

    In this study we provide the first comprehensive map of DNA conformational flexibility in Saccharomyces cerevisiae complete genome. Flexibility plays a key role in DNA supercoiling and DNA/protein binding, regulating DNA transcription, replication or repair. Specific interest in flexibility analysis concerns its relationship with human genome instability. Enrichment in flexible sequences has been detected in unstable regions of human genome defined fragile sites, where genes map and carry frequent deletions and rearrangements in cancer. Flexible sequences have been suggested to be the determinants of fragile gene proneness to breakage; however, their actual role and properties remain elusive. Our in silico analysis carried out genome-wide via the StabFlex algorithm, shows the conserved presence of highly flexible regions in budding yeast genome as well as in genomes of other Saccharomyces sensu stricto species. Flexibile peaks in S. cerevisiae identify 175 ORFs mapping on their 3'UTR, a region affecting mRNA translation, localization and stability. (TA)n repeats of different extension shape the central structure of peaks and co-localize with polyadenylation efficiency element (EE) signals. ORFs with flexible peaks share common features. Transcripts are characterized by decreased half-life: this is considered peculiar of genes involved in regulatory systems with high turnover; consistently, their function affects biological processes such as cell cycle regulation or stress response. Our findings support the functional importance of flexibility peaks, suggesting that the flexible sequence may be derived by an expansion of canonical TAYRTA polyadenylation efficiency element. The flexible (TA)n repeat amplification could be the outcome of an evolutionary neofunctionalization leading to a differential 3'-end processing and expression regulation in genes with peculiar function. Our study provides a new support to the functional role of flexibility in genomes and a strategy for its characterization inside human fragile sites.

  15. Global Mapping of DNA Conformational Flexibility on Saccharomyces cerevisiae

    PubMed Central

    Menconi, Giulia; Bedini, Andrea; Barale, Roberto; Sbrana, Isabella

    2015-01-01

    In this study we provide the first comprehensive map of DNA conformational flexibility in Saccharomyces cerevisiae complete genome. Flexibility plays a key role in DNA supercoiling and DNA/protein binding, regulating DNA transcription, replication or repair. Specific interest in flexibility analysis concerns its relationship with human genome instability. Enrichment in flexible sequences has been detected in unstable regions of human genome defined fragile sites, where genes map and carry frequent deletions and rearrangements in cancer. Flexible sequences have been suggested to be the determinants of fragile gene proneness to breakage; however, their actual role and properties remain elusive. Our in silico analysis carried out genome-wide via the StabFlex algorithm, shows the conserved presence of highly flexible regions in budding yeast genome as well as in genomes of other Saccharomyces sensu stricto species. Flexibile peaks in S. cerevisiae identify 175 ORFs mapping on their 3’UTR, a region affecting mRNA translation, localization and stability. (TA)n repeats of different extension shape the central structure of peaks and co-localize with polyadenylation efficiency element (EE) signals. ORFs with flexible peaks share common features. Transcripts are characterized by decreased half-life: this is considered peculiar of genes involved in regulatory systems with high turnover; consistently, their function affects biological processes such as cell cycle regulation or stress response. Our findings support the functional importance of flexibility peaks, suggesting that the flexible sequence may be derived by an expansion of canonical TAYRTA polyadenylation efficiency element. The flexible (TA)n repeat amplification could be the outcome of an evolutionary neofunctionalization leading to a differential 3’-end processing and expression regulation in genes with peculiar function. Our study provides a new support to the functional role of flexibility in genomes and a strategy for its characterization inside human fragile sites. PMID:25860149

  16. Endonuclease-independent LINE-1 retrotransposition at mammalian telomeres.

    PubMed

    Morrish, Tammy A; Garcia-Perez, José Luis; Stamato, Thomas D; Taccioli, Guillermo E; Sekiguchi, JoAnn; Moran, John V

    2007-03-08

    Long interspersed element-1 (LINE-1 or L1) elements are abundant, non-long-terminal-repeat (non-LTR) retrotransposons that comprise approximately 17% of human DNA. The average human genome contains approximately 80-100 retrotransposition-competent L1s (ref. 2), and they mobilize by a process that uses both the L1 endonuclease and reverse transcriptase, termed target-site primed reverse transcription. We have previously reported an efficient, endonuclease-independent L1 retrotransposition pathway (EN(i)) in certain Chinese hamster ovary (CHO) cell lines that are defective in the non-homologous end-joining (NHEJ) pathway of DNA double-strand-break repair. Here we have characterized EN(i) retrotransposition events generated in V3 CHO cells, which are deficient in DNA-dependent protein kinase catalytic subunit (DNA-PKcs) activity and have both dysfunctional telomeres and an NHEJ defect. Notably, approximately 30% of EN(i) retrotransposition events insert in an orientation-specific manner adjacent to a perfect telomere repeat (5'-TTAGGG-3'). Similar insertions were not detected among EN(i) retrotransposition events generated in controls or in XR-1 CHO cells deficient for XRCC4, an NHEJ factor that is required for DNA ligation but has no known function in telomere maintenance. Furthermore, transient expression of a dominant-negative allele of human TRF2 (also called TERF2) in XRCC4-deficient XR-1 cells, which disrupts telomere capping, enables telomere-associated EN(i) retrotransposition events. These data indicate that L1s containing a disabled endonuclease can use dysfunctional telomeres as an integration substrate. The findings highlight similarities between the mechanism of EN(i) retrotransposition and the action of telomerase, because both processes can use a 3' OH for priming reverse transcription at either internal DNA lesions or chromosome ends. Thus, we propose that EN(i) retrotransposition is an ancestral mechanism of RNA-mediated DNA repair associated with non-LTR retrotransposons that may have been used before the acquisition of an endonuclease domain.

  17. Dfam: a database of repetitive DNA based on profile hidden Markov models.

    PubMed

    Wheeler, Travis J; Clements, Jody; Eddy, Sean R; Hubley, Robert; Jones, Thomas A; Jurka, Jerzy; Smit, Arian F A; Finn, Robert D

    2013-01-01

    We present a database of repetitive DNA elements, called Dfam (http://dfam.janelia.org). Many genomes contain a large fraction of repetitive DNA, much of which is made up of remnants of transposable elements (TEs). Accurate annotation of TEs enables research into their biology and can shed light on the evolutionary processes that shape genomes. Identification and masking of TEs can also greatly simplify many downstream genome annotation and sequence analysis tasks. The commonly used TE annotation tools RepeatMasker and Censor depend on sequence homology search tools such as cross_match and BLAST variants, as well as Repbase, a collection of known TE families each represented by a single consensus sequence. Dfam contains entries corresponding to all Repbase TE entries for which instances have been found in the human genome. Each Dfam entry is represented by a profile hidden Markov model, built from alignments generated using RepeatMasker and Repbase. When used in conjunction with the hidden Markov model search tool nhmmer, Dfam produces a 2.9% increase in coverage over consensus sequence search methods on a large human benchmark, while maintaining low false discovery rates, and coverage of the full human genome is 54.5%. The website provides a collection of tools and data views to support improved TE curation and annotation efforts. Dfam is also available for download in flat file format or in the form of MySQL table dumps.

  18. Genome-wide characterization of centromeric satellites from multiple mammalian genomes.

    PubMed

    Alkan, Can; Cardone, Maria Francesca; Catacchio, Claudia Rita; Antonacci, Francesca; O'Brien, Stephen J; Ryder, Oliver A; Purgato, Stefania; Zoli, Monica; Della Valle, Giuliano; Eichler, Evan E; Ventura, Mario

    2011-01-01

    Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.

  19. RNAi drives nonreciprocal translocations at eroding chromosome ends to establish telomere-free linear chromosomes.

    PubMed

    Begnis, Martina; Apte, Manasi S; Masuda, Hirohisa; Jain, Devanshi; Wheeler, David Lee; Cooper, Julia Promisel

    2018-04-01

    The identification of telomerase-negative HAATI (heterochromatin amplification-mediated and telomerase-independent) cells, in which telomeres are superseded by nontelomeric heterochromatin tracts, challenged the idea that canonical telomeres are essential for chromosome linearity and raised crucial questions as to how such tracts translocate to eroding chromosome ends and confer end protection. Here we show that HAATI arises when telomere loss triggers a newly recognized illegitimate translocation pathway that requires RNAi factors. While RNAi is necessary for the translocation events that mobilize ribosomal DNA (rDNA) tracts to all chromosome ends (forming "HAATI rDNA " chromosomes), it is dispensable for HAATI rDNA maintenance. Surprisingly, Dicer (Dcr1) plays a separate, RNAi-independent role in preventing formation of the rare HAATI subtype in which a different repetitive element (the subtelomeric element) replaces telomeres. Using genetics and fusions between shelterin components and rDNA-binding proteins, we mapped the mechanism by which rDNA loci engage crucial end protection factors-despite the absence of telomere repeats-and secure end protection. Sequence analysis of HAATI rDNA genomes allowed us to propose RNA and DNA polymerase template-switching models for the mechanism of RNAi-triggered rDNA translocations. Collectively, our results reveal unforeseen roles for noncoding RNAs (ncRNAs) in assembling a telomere-free chromosome end protection device. © 2018 Begnis et al.; Published by Cold Spring Harbor Laboratory Press.

  20. As solid as a rock-comparison of CE- and MPS-based analyses of the petrosal bone as a source of DNA for forensic identification of challenging cranial bones.

    PubMed

    Kulstein, Galina; Hadrys, Thorsten; Wiegand, Peter

    2018-01-01

    Short tandem repeat (STR) typing from skeletal remains can be a difficult task. Dependent on the environmental conditions of the provenance of the bones, DNA can be degraded and STR typing inhibited. Generally, dense and compact bones are known to preserve DNA better. Several studies already proved that femora and teeth have high DNA typing success rates. Unfortunately, these elements are not present in all cases involving skeletal remains. Processing partial or singular skeletal elements, it is favorable to select bone areas where DNA preservation is comparably higher. Especially, cranial bones are often accidentally discovered during criminal investigations. The cranial bone is composed of multiple parts. In this examination, we evaluated the potential of the petrous bone for human identification of skeletal remains in forensic case work. Material from different sections of eight unknown cranial bones and-where available-additionally other skeletal elements, collected at the DNA department of the Institute of Legal Medicine in Ulm, Germany, from 2010 to 2017, were processed with an optimized DNA extraction and STR typing strategy. The results highlight that STR typing from the petrous bones leads to reportable profiles in all individuals, even in cases where the analysis of the parietal bone failed. Moreover, the comparison of capillary electrophorese (CE) typing to massively parallel sequencing (MPS) analysis shows that MPS has the potential to analyze degraded human remains and is even capable to provide additional information about phenotype and ancestry of unknown individuals.

  1. Comparative analysis of complete orthologous centromeres from two subspecies of rice reveals rapid variation of centromere organization and structure.

    PubMed

    Wu, Jianzhong; Fujisawa, Masaki; Tian, Zhixi; Yamagata, Harumi; Kamiya, Kozue; Shibata, Michie; Hosokawa, Satomi; Ito, Yukiyo; Hamada, Masao; Katagiri, Satoshi; Kurita, Kanako; Yamamoto, Mayu; Kikuta, Ari; Machita, Kayo; Karasawa, Wataru; Kanamori, Hiroyuki; Namiki, Nobukazu; Mizuno, Hiroshi; Ma, Jianxin; Sasaki, Takuji; Matsumoto, Takashi

    2009-12-01

    Centromeres are sites for assembly of the chromosomal structures that mediate faithful segregation at mitosis and meiosis. This function is conserved across species, but the DNA components that are involved in kinetochore formation differ greatly, even between closely related species. To shed light on the nature, evolutionary timing and evolutionary dynamics of rice centromeres, we decoded a 2.25-Mb DNA sequence covering the centromeric region of chromosome 8 of an indica rice variety, 'Kasalath' (Kas-Cen8). Analysis of repetitive sequences in Kas-Cen8 led to the identification of 222 long terminal repeat (LTR)-retrotransposon elements and 584 CentO satellite monomers, which account for 59.2% of the region. A comparison of the Kas-Cen8 sequence with that of japonica rice 'Nipponbare' (Nip-Cen8) revealed that about 66.8% of the Kas-Cen8 sequence was collinear with that of Nip-Cen8. Although the 27 putative genes are conserved between the two subspecies, only 55.4% of the total LTR-retrotransposon elements in 'Kasalath' had orthologs in 'Nipponbare', thus reflecting recent proliferation of a considerable number of LTR-retrotransposons since the divergence of two rice subspecies of indica and japonica within Oryza sativa. Comparative analysis of the subfamilies, time of insertion, and organization patterns of inserted LTR-retrotransposons between the two Cen8 regions revealed variations between 'Kasalath' and 'Nipponbare' in the preferential accumulation of CRR elements, and the expansion of CentO satellite repeats within the core domain of Cen8. Together, the results provide insights into the recent proliferation of LTR-retrotransposons, and the rapid expansion of CentO satellite repeats, underlying the dynamic variation and plasticity of plant centromeres.

  2. Heart rate variability and DNA methylation levels are altered after short-term metal fume exposure among occupational welders: a repeated-measures panel study.

    PubMed

    Fan, Tianteng; Fang, Shona C; Cavallari, Jennifer M; Barnett, Ian J; Wang, Zhaoxi; Su, Li; Byun, Hyang-Min; Lin, Xihong; Baccarelli, Andrea A; Christiani, David C

    2014-12-16

    In occupational settings, boilermakers are exposed to high levels of metallic fine particulate matter (PM2.5) generated during the welding process. The effect of welding PM2.5 on heart rate variability (HRV) has been described, but the relationship between PM2.5, DNA methylation, and HRV is not known. In this repeated-measures panel study, we recorded resting HRV and measured DNA methylation levels in transposable elements Alu and long interspersed nuclear element-1 (LINE-1) in peripheral blood leukocytes under ambient conditions (pre-shift) and right after a welding task (post-shift) among 66 welders. We also monitored personal PM2.5 level in the ambient environment and during the welding procedure. The concentration of welding PM2.5 was significantly higher than background levels in the union hall (0.43 mg/m3 vs. 0.11 mg/m3, p < 0.0001). The natural log of transformed power in the high frequency range (ln HF) had a significantly negative association with PM2.5 exposure (β = -0.76, p = 0.035). pNN10 and pNN20 also had a negative association with PM2.5 exposure (β = -0.16%, p = 0.006 and β = -0.13%, p = 0.030, respectively). PM2.5 was positively associated with LINE-1 methylation [β = 0.79%, 5-methylcytosince (%mC), p = 0.013]; adjusted for covariates. LINE-1 methylation did not show an independent association with HRV. Acute decline of HRV was observed following exposure to welding PM2.5 and evidence for an epigenetic response of transposable elements to short-term exposure to high-level metal-rich particulates was reported.

  3. The VBP and a1/EBP leucine zipper factors bind overlapping subsets of avian retroviral long terminal repeat CCAAT/enhancer elements.

    PubMed

    Smith, C D; Baglia, L A; Curristin, S M; Ruddell, A

    1994-10-01

    Two long terminal repeat (LTR) enhancer-binding proteins which may regulate high rates of avian leukosis virus (ALV) LTR-enhanced c-myc transcription during bursal lymphomagenesis have been identified (A. Ruddell, M. Linial, and M. Groudine, Mol. Cell. Biol. 9:5660-5668, 1989). The genes encoding the a1/EBP and a3/EBP binding factors were cloned by expression screening of a lambda gt11 cDNA library from chicken bursal lymphoma cells. The a1/EBP cDNA encodes a novel leucine zipper transcription factor (W. Bowers and A. Ruddell, J. Virol. 66:6578-6586, 1992). The partial a3/EBP cDNA clone encodes amino acids 84 to 313 of vitellogenin gene-binding protein (VBP), a leucine zipper factor that binds the avian vitellogenin II gene promoter (S. Iyer, D. Davis, and J. Burch, Mol. Cell. Biol. 11:4863-4875, 1991). Multiple VBP mRNAs are expressed in B cells in a pattern identical to that previously observed for VBP in other cell types. The LTR-binding activities of VBP, a1/EBP, and B-cell nuclear extract protein were compared and mapped by gel shift, DNase I footprinting, and methylation interference assays. The purified VBP and a1/EBP bacterial fusion proteins bind overlapping but distinct subsets of CCAAT/enhancer elements in the closely related ALV and Rous sarcoma virus (RSV) LTR enhancers. Protein binding to these CCAAT/enhancer elements accounts for most of the labile LTR enhancer-binding activity observed in B-cell nuclear extracts. VBP and a1/EBP could mediate the high rates of ALV and RSV LTR-enhanced transcription in bursal lymphoma cells and many other cell types.

  4. A mammary cell-specific enhancer in mouse mammary tumor virus DNA is composed of multiple regulatory elements including binding sites for CTF/NFI and a novel transcription factor, mammary cell-activating factor.

    PubMed Central

    Mink, S; Härtig, E; Jennewein, P; Doppler, W; Cato, A C

    1992-01-01

    Mouse mammary tumor virus (MMTV) is a milk-transmitted retrovirus involved in the neoplastic transformation of mouse mammary gland cells. The expression of this virus is regulated by mammary cell type-specific factors, steroid hormones, and polypeptide growth factors. Sequences for mammary cell-specific expression are located in an enhancer element in the extreme 5' end of the long terminal repeat region of this virus. This enhancer, when cloned in front of the herpes simplex thymidine kinase promoter, endows the promoter with mammary cell-specific response. Using functional and DNA-protein-binding studies with constructs mutated in the MMTV long terminal repeat enhancer, we have identified two main regulatory elements necessary for the mammary cell-specific response. These elements consist of binding sites for a transcription factor in the family of CTF/NFI proteins and the transcription factor mammary cell-activating factor (MAF) that recognizes the sequence G Pu Pu G C/G A A G G/T. Combinations of CTF/NFI- and MAF-binding sites or multiple copies of either one of these binding sites but not solitary binding sites mediate mammary cell-specific expression. The functional activities of these two regulatory elements are enhanced by another factor that binds to the core sequence ACAAAG. Interdigitated binding sites for CTF/NFI, MAF, and/or the ACAAAG factor are also found in the 5' upstream regions of genes encoding whey milk proteins from different species. These findings suggest that mammary cell-specific regulation is achieved by a concerted action of factors binding to multiple regulatory sites. Images PMID:1328867

  5. Mavericks, a novel class of giant transposable elements widespread in eukaryotes and related to DNA viruses.

    PubMed

    Pritham, Ellen J; Putliwala, Tasneem; Feschotte, Cédric

    2007-04-01

    We previously identified a group of atypical mobile elements designated Mavericks from the nematodes Caenorhabditis elegans and C. briggsae and the zebrafish Danio rerio. Here we present the results of comprehensive database searches of the genome sequences available, which reveal that Mavericks are widespread in invertebrates and non-mammalian vertebrates but show a patchy distribution in non-animal species, being present in the fungi Glomus intraradices and Phakopsora pachyrhizi and in several single-celled eukaryotes such as the ciliate Tetrahymena thermophila, the stramenopile Phytophthora infestans and the trichomonad Trichomonas vaginalis, but not detectable in plants. This distribution, together with comparative and phylogenetic analyses of Maverick-encoded proteins, is suggestive of an ancient origin of these elements in eukaryotes followed by lineage-specific losses and/or recurrent episodes of horizontal transmission. In addition, we report that Maverick elements have amplified recently to high copy numbers in T. vaginalis where they now occupy as much as 30% of the genome. Sequence analysis confirms that most Mavericks encode a retroviral-like integrase, but lack other open reading frames typically found in retroelements. Nevertheless, the length and conservation of the target site duplication created upon Maverick insertion (5- or 6-bp) is consistent with a role of the integrase-like protein in the integration of a double-stranded DNA transposition intermediate. Mavericks also display long terminal-inverted repeats but do not contain ORFs similar to proteins encoded by DNA transposons. Instead, Mavericks encode a conserved set of 5 to 9 genes (in addition to the integrase) that are predicted to encode proteins with homology to replication and packaging proteins of some bacteriophages and diverse eukaryotic double-stranded DNA viruses, including a DNA polymerase B homolog and putative capsid proteins. Based on these and other structural similarities, we speculate that Mavericks represent an evolutionary missing link between seemingly disparate invasive DNA elements that include bacteriophages, adenoviruses and eukaryotic linear plasmids.

  6. Breaks in the 45S rDNA Lead to Recombination-Mediated Loss of Repeats.

    PubMed

    Warmerdam, Daniël O; van den Berg, Jeroen; Medema, René H

    2016-03-22

    rDNA repeats constitute the most heavily transcribed region in the human genome. Tumors frequently display elevated levels of recombination in rDNA, indicating that the repeats are a liability to the genomic integrity of a cell. However, little is known about how cells deal with DNA double-stranded breaks in rDNA. Using selective endonucleases, we show that human cells are highly sensitive to breaks in 45S but not the 5S rDNA repeats. We find that homologous recombination inhibits repair of breaks in 45S rDNA, and this results in repeat loss. We identify the structural maintenance of chromosomes protein 5 (SMC5) as contributing to recombination-mediated repair of rDNA breaks. Together, our data demonstrate that SMC5-mediated recombination can lead to error-prone repair of 45S rDNA repeats, resulting in their loss and thereby reducing cellular viability. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  7. Role of the human cytomegalovirus major immediate-early promoter's 19-base-pair-repeat cyclic AMP-response element in acutely infected cells.

    PubMed

    Keller, M J; Wheeler, D G; Cooper, E; Meier, J L

    2003-06-01

    Prior studies have suggested a role of the five copies of the 19-bp-repeat cyclic AMP (cAMP)-response element (CRE) in major immediate-early (MIE) promoter activation, the rate-limiting step in human cytomegalovirus (HCMV) replication. We used two different HCMV genome modification strategies to test this hypothesis in acutely infected cells. We report the following: (i) the CREs do not govern basal levels of MIE promoter activity at a high or low multiplicity of infection (MOI) in human foreskin fibroblast (HFF)- or NTera2-derived neuronal cells; (ii) serum and virion components markedly increase MIE promoter-dependent transcription at a low multiplicity of infection (MOI), but this increase is not mediated by the CREs; (iii) forskolin stimulation of the cAMP signaling pathway induces a two- to threefold increase in MIE RNA levels in a CRE-specific manner at a low MOI in both HFF- and NTera2-derived neuronal cells; and (iv) the CREs do not regulate basal levels of HCMV DNA replication at a high or low MOI in HFF. Their presence does impart a forskolin-induced increase in viral DNA replication at a low MOI but only when basal levels of MIE promoter activity are experimentally diminished. In conclusion, the 19-bp-repeat CREs add to the robust MIE promoter activity that occurs in the acutely infected stimulated cells, although the CREs' greater role may be in other settings.

  8. Maize centromere structure and evolution: sequence analysis of centromeres 2 and 5 reveals dynamic Loci shaped primarily by retrotransposons.

    PubMed

    Wolfgruber, Thomas K; Sharma, Anupma; Schneider, Kevin L; Albert, Patrice S; Koo, Dal-Hoe; Shi, Jinghua; Gao, Zhi; Han, Fangpu; Lee, Hyeran; Xu, Ronghui; Allison, Jamie; Birchler, James A; Jiang, Jiming; Dawe, R Kelly; Presting, Gernot G

    2009-11-01

    We describe a comprehensive and general approach for mapping centromeres and present a detailed characterization of two maize centromeres. Centromeres are difficult to map and analyze because they consist primarily of repetitive DNA sequences, which in maize are the tandem satellite repeat CentC and interspersed centromeric retrotransposons of maize (CRM). Centromeres are defined epigenetically by the centromeric histone H3 variant, CENH3. Using novel markers derived from centromere repeats, we have mapped all ten centromeres onto the physical and genetic maps of maize. We were able to completely traverse centromeres 2 and 5, confirm physical maps by fluorescence in situ hybridization (FISH), and delineate their functional regions by chromatin immunoprecipitation (ChIP) with anti-CENH3 antibody followed by pyrosequencing. These two centromeres differ substantially in size, apparent CENH3 density, and arrangement of centromeric repeats; and they are larger than the rice centromeres characterized to date. Furthermore, centromere 5 consists of two distinct CENH3 domains that are separated by several megabases. Succession of centromere repeat classes is evidenced by the fact that elements belonging to the recently active recombinant subgroups of CRM1 colonize the present day centromeres, while elements of the ancestral subgroups are also found in the flanking regions. Using abundant CRM and non-CRM retrotransposons that inserted in and near these two centromeres to create a historical record of centromere location, we show that maize centromeres are fluid genomic regions whose borders are heavily influenced by the interplay of retrotransposons and epigenetic marks. Furthermore, we propose that CRMs may be involved in removal of centromeric DNA (specifically CentC), invasion of centromeres by non-CRM retrotransposons, and local repositioning of the CENH3.

  9. Maize Centromere Structure and Evolution: Sequence Analysis of Centromeres 2 and 5 Reveals Dynamic Loci Shaped Primarily by Retrotransposons

    PubMed Central

    Albert, Patrice S.; Koo, Dal-Hoe; Shi, Jinghua; Gao, Zhi; Han, Fangpu; Lee, Hyeran; Xu, Ronghui; Allison, Jamie; Birchler, James A.; Jiang, Jiming; Dawe, R. Kelly; Presting, Gernot G.

    2009-01-01

    We describe a comprehensive and general approach for mapping centromeres and present a detailed characterization of two maize centromeres. Centromeres are difficult to map and analyze because they consist primarily of repetitive DNA sequences, which in maize are the tandem satellite repeat CentC and interspersed centromeric retrotransposons of maize (CRM). Centromeres are defined epigenetically by the centromeric histone H3 variant, CENH3. Using novel markers derived from centromere repeats, we have mapped all ten centromeres onto the physical and genetic maps of maize. We were able to completely traverse centromeres 2 and 5, confirm physical maps by fluorescence in situ hybridization (FISH), and delineate their functional regions by chromatin immunoprecipitation (ChIP) with anti-CENH3 antibody followed by pyrosequencing. These two centromeres differ substantially in size, apparent CENH3 density, and arrangement of centromeric repeats; and they are larger than the rice centromeres characterized to date. Furthermore, centromere 5 consists of two distinct CENH3 domains that are separated by several megabases. Succession of centromere repeat classes is evidenced by the fact that elements belonging to the recently active recombinant subgroups of CRM1 colonize the present day centromeres, while elements of the ancestral subgroups are also found in the flanking regions. Using abundant CRM and non-CRM retrotransposons that inserted in and near these two centromeres to create a historical record of centromere location, we show that maize centromeres are fluid genomic regions whose borders are heavily influenced by the interplay of retrotransposons and epigenetic marks. Furthermore, we propose that CRMs may be involved in removal of centromeric DNA (specifically CentC), invasion of centromeres by non-CRM retrotransposons, and local repositioning of the CENH3. PMID:19956743

  10. Identification and functional characterization of BTas transactivator as a DNA-binding protein.

    PubMed

    Tan, Juan; Hao, Peng; Jia, Rui; Yang, Wei; Liu, Ruichang; Wang, Jinzhong; Xi, Zhen; Geng, Yunqi; Qiao, Wentao

    2010-09-30

    The genome of bovine foamy virus (BFV) encodes a transcriptional transactivator, namely BTas, that remarkably enhances gene expression by binding to the viral long-terminal repeat promoter (LTR) and internal promoter (IP). In this report, we characterized the functional domains of BFV BTas. BTas contains two major functional domains: the N-terminal DNA-binding domain (residues 1-133) and the C-terminal activation domain (residues 198-249). The complete BTas responsive regions were mapped to the positions -380/-140 of LTR and 9205/9276 of IP. Four BTas responsive elements were identified at the positions -368/-346, -327/-307, -306/-285 and -186/-165 of the BFV LTR, and one element was identified at the position 9243/9264 of the BFV IP. Unlike other foamy viruses, the five BTas responsive elements in BFV shared obvious sequence homology. These data suggest that among the complex retroviruses, BFV appears to have a unique transactivation mechanism. Crown Copyright 2010. Published by Elsevier Inc. All rights reserved.

  11. Spy: a new group of eukaryotic DNA transposons without target site duplications.

    PubMed

    Han, Min-Jin; Xu, Hong-En; Zhang, Hua-Hao; Feschotte, Cédric; Zhang, Ze

    2014-06-24

    Class 2 or DNA transposons populate the genomes of most eukaryotes and like other mobile genetic elements have a profound impact on genome evolution. Most DNA transposons belong to the cut-and-paste types, which are relatively simple elements characterized by terminal-inverted repeats (TIRs) flanking a single gene encoding a transposase. All eukaryotic cut-and-paste transposons so far described are also characterized by target site duplications (TSDs) of host DNA generated upon chromosomal insertion. Here, we report a new group of evolutionarily related DNA transposons called Spy, which also include TIRs and DDE motif-containing transposase but surprisingly do not create TSDs upon insertion. Instead, Spy transposons appear to transpose precisely between 5'-AAA and TTT-3' host nucleotides, without duplication or modification of the AAATTT target sites. Spy transposons were identified in the genomes of diverse invertebrate species based on transposase homology searches and structure-based approaches. Phylogenetic analyses indicate that Spy transposases are distantly related to IS5, ISL2EU, and PIF/Harbinger transposases. However, Spy transposons are distinct from these and other DNA transposon superfamilies by their lack of TSD and their target site preference. Our findings expand the known diversity of DNA transposons and reveal a new group of eukaryotic DDE transposases with unusual catalytic properties. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  12. Characterization of two new plasmid DNAs found in mitochondria of wild-type Neurospora intermedia strains.

    PubMed Central

    Stohl, L L; Collins, R A; Cole, M D; Lambowitz, A M

    1982-01-01

    Mitochondria from two Neurospora intermedia strains (P4O5-Labelle and Fiji N6-6) were found to contain plasmid DNAs in addition to the standard mitochondrial DNA species. The plasmid DNAs consist of monomeric circles (4.1-4.3 kbp and 5.2-5.3 kbp for Labelle and Fiji, respectively) and oligomers in which monomers are organized as head-to-tail repeats. DNA-DNA hybridization experiments showed that the plasmids have no substantial sequence homology to mtDNA, to each other, or to a previously characterized mitochondrial plasmid from N. crassa strain Mauriceville-lc (Collins et al. Cell 24, 443-452, 1981). The intramitochondrial location of the plasmids was established by cell fractionation and nuclease protection experiments. In sexual crosses, the plasmids showed strict maternal inheritance, the same as Neurospora mitochondrial DNA. The plasmids may represent a novel class of mitochondrial genetic elements. Images PMID:6280144

  13. R-loops: targets for nuclease cleavage and repeat instability.

    PubMed

    Freudenreich, Catherine H

    2018-01-11

    R-loops form when transcribed RNA remains bound to its DNA template to form a stable RNA:DNA hybrid. Stable R-loops form when the RNA is purine-rich, and are further stabilized by DNA secondary structures on the non-template strand. Interestingly, many expandable and disease-causing repeat sequences form stable R-loops, and R-loops can contribute to repeat instability. Repeat expansions are responsible for multiple neurodegenerative diseases, including Huntington's disease, myotonic dystrophy, and several types of ataxias. Recently, it was found that R-loops at an expanded CAG/CTG repeat tract cause DNA breaks as well as repeat instability (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Two factors were identified as causing R-loop-dependent breaks at CAG/CTG tracts: deamination of cytosines and the MutLγ (Mlh1-Mlh3) endonuclease, defining two new mechanisms for how R-loops can generate DNA breaks (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Following R-loop-dependent nicking, base excision repair resulted in repeat instability. These results have implications for human repeat expansion diseases and provide a paradigm for how RNA:DNA hybrids can cause genome instability at structure-forming DNA sequences. This perspective summarizes mechanisms of R-loop-induced fragility at G-rich repeats and new links between DNA breaks and repeat instability.

  14. The 28S–18S rDNA intergenic spacer from Crithidia fasciculata: repeated sequences, length heterogeneity, putative processing sites and potential interactions between U3 small nucleolar RNA and the ribosomal RNA precursor

    PubMed Central

    Schnare, Murray N.; Collings, James C.; Spencer, David F.; Gray, Michael W.

    2000-01-01

    In Crithidia fasciculata, the ribosomal RNA (rRNA) gene repeats range in size from ∼11 to 12 kb. This length heterogeneity is localized to a region of the intergenic spacer (IGS) that contains tandemly repeated copies of a 19mer sequence. The IGS also contains four copies of an ∼55 nt repeat that has an internal inverted repeat and is also present in the IGS of Leishmania species. We have mapped the C.fasciculata transcription initiation site as well as two other reverse transcriptase stop sites that may be analogous to the A0 and A′ pre-rRNA processing sites within the 5′ external transcribed spacer (ETS) of other eukaryotes. Features that could influence processing at these sites include two stretches of conserved primary sequence and three secondary structure elements present in the 5′ ETS. We also characterized the C.fasciculata U3 snoRNA, which has the potential for base-pairing with pre-rRNA sequences. Finally, we demonstrate that biosynthesis of large subunit rRNA in both C.fasciculata and Trypanosoma brucei involves 3′-terminal addition of three A residues that are not present in the corresponding DNA sequences. PMID:10982863

  15. Cocaine dynamically regulates heterochromatin and repetitive element unsilencing in nucleus accumbens.

    PubMed

    Maze, Ian; Feng, Jian; Wilkinson, Matthew B; Sun, HaoSheng; Shen, Li; Nestler, Eric J

    2011-02-15

    Repeated cocaine exposure induces persistent alterations in genome-wide transcriptional regulatory networks, chromatin remodeling activity and, ultimately, gene expression profiles in the brain's reward circuitry. Virtually all previous investigations have centered on drug-mediated effects occurring throughout active euchromatic regions of the genome, with very little known concerning the impact of cocaine exposure on the regulation and maintenance of heterochromatin in adult brain. Here, we report that cocaine dramatically and dynamically alters heterochromatic histone H3 lysine 9 trimethylation (H3K9me3) in the nucleus accumbens (NAc), a key brain reward region. Furthermore, we demonstrate that repeated cocaine exposure causes persistent decreases in heterochromatization in this brain region, suggesting a potential role for heterochromatic regulation in the long-term actions of cocaine. To identify precise genomic loci affected by these alterations, chromatin immunoprecipitation followed by massively parallel DNA sequencing (ChIP-Seq) was performed on NAc. ChIP-Seq analyses confirmed the existence of the H3K9me3 mark mainly within intergenic regions of the genome and identified specific patterns of cocaine-induced H3K9me3 regulation at repetitive genomic sequences. Cocaine-mediated decreases in H3K9me3 enrichment at specific genomic repeats [e.g., long interspersed nuclear element (LINE)-1 repeats] were further confirmed by the increased expression of LINE-1 retrotransposon-associated repetitive elements in NAc. Such increases likely reflect global patterns of genomic destabilization in this brain region after repeated cocaine administration and open the door for future investigations into the epigenetic and genetic basis of drug addiction.

  16. The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats

    PubMed Central

    Grissa, Ibtissem; Vergnaud, Gilles; Pourcel, Christine

    2007-01-01

    Background In Archeae and Bacteria, the repeated elements called CRISPRs for "clustered regularly interspaced short palindromic repeats" are believed to participate in the defence against viruses. Short sequences called spacers are stored in-between repeated elements. In the current model, motifs comprising spacers and repeats may target an invading DNA and lead to its degradation through a proposed mechanism similar to RNA interference. Analysis of intra-species polymorphism shows that new motifs (one spacer and one repeated element) are added in a polarised fashion. Although their principal characteristics have been described, a lot remains to be discovered on the way CRISPRs are created and evolve. As new genome sequences become available it appears necessary to develop automated scanning tools to make available CRISPRs related information and to facilitate additional investigations. Description We have produced a program, CRISPRFinder, which identifies CRISPRs and extracts the repeated and unique sequences. Using this software, a database is constructed which is automatically updated monthly from newly released genome sequences. Additional tools were created to allow the alignment of flanking sequences in search for similarities between different loci and to build dictionaries of unique sequences. To date, almost six hundred CRISPRs have been identified in 475 published genomes. Two Archeae out of thirty-seven and about half of Bacteria do not possess a CRISPR. Fine analysis of repeated sequences strongly supports the current view that new motifs are added at one end of the CRISPR adjacent to the putative promoter. Conclusion It is hoped that availability of a public database, regularly updated and which can be queried on the web will help in further dissecting and understanding CRISPR structure and flanking sequences evolution. Subsequent analyses of the intra-species CRISPR polymorphism will be facilitated by CRISPRFinder and the dictionary creator. CRISPRdb is accessible at PMID:17521438

  17. Short Tandem Repeat DNA Internet Database

    National Institute of Standards and Technology Data Gateway

    SRD 130 Short Tandem Repeat DNA Internet Database (Web, free access)   Short Tandem Repeat DNA Internet Database is intended to benefit research and application of short tandem repeat DNA markers for human identity testing. Facts and sequence information on each STR system, population data, commonly used multiplex STR systems, PCR primers and conditions, and a review of various technologies for analysis of STR alleles have been included.

  18. Emergence of Sequence Type 779 Methicillin-Resistant Staphylococcus aureus Harboring a Novel Pseudo Staphylococcal Cassette Chromosome mec (SCCmec)-SCC-SCCCRISPR Composite Element in Irish Hospitals

    PubMed Central

    Kinnevey, Peter M.; Shore, Anna C.; Brennan, Grainne I.; Sullivan, Derek J.; Ehricht, Ralf; Monecke, Stefan; Slickers, Peter

    2013-01-01

    Methicillin-resistant Staphylococcus aureus (MRSA) has been a major cause of nosocomial infection in Irish hospitals for 4 decades, and replacement of predominant MRSA clones has occurred several times. An MRSA isolate recovered in 2006 as part of a larger study of sporadic MRSA exhibited a rare spa (t878) and multilocus sequence (ST779) type and was nontypeable by PCR- and DNA microarray-based staphylococcal cassette chromosome mec (SCCmec) element typing. Whole-genome sequencing revealed the presence of a novel 51-kb composite island (CI) element with three distinct domains, each flanked by direct repeat and inverted repeat sequences, including (i) a pseudo SCCmec element (16.3 kb) carrying mecA with a novel mec class region, a fusidic acid resistance gene (fusC), and two copper resistance genes (copB and copC) but lacking ccr genes; (ii) an SCC element (17.5 kb) carrying a novel ccrAB4 allele; and (iii) an SCC element (17.4 kb) carrying a novel ccrC allele and a clustered regularly interspaced short palindromic repeat (CRISPR) region. The novel CI was subsequently identified by PCR in an additional 13 t878/ST779 MRSA isolates, six from bloodstream infections, recovered between 2006 and 2011 in 11 hospitals. Analysis of open reading frames (ORFs) carried by the CI showed amino acid sequence similarity of 44 to 100% to ORFs from S. aureus and coagulase-negative staphylococci (CoNS). These findings provide further evidence of genetic transfer between S. aureus and CoNS and show how this contributes to the emergence of novel SCCmec elements and MRSA strains. Ongoing surveillance of this MRSA strain is warranted and will require updating of currently used SCCmec typing methods. PMID:23147725

  19. Genetic analysis and ethnic affinities from two Scytho-Siberian skeletons.

    PubMed

    Ricaut, François-Xavier; Keyser-Tracqui, Christine; Cammaert, Laurence; Crubézy, Eric; Ludes, Bertrand

    2004-04-01

    We extracted DNA from two skeletons belonging to the Sytho-Siberian population, which were excavated from the Sebÿstei site (dating back 2,500 years) in the Altai Republic (Central Asia). Ancient DNA was analyzed by autosomal short tandem repeats (STRs) and by the sequencing of the hypervariable region 1 (HV1) of the mitochondrial DNA (mtDNA) control region. The results showed that these two skeletons were not close relatives. Moreover, their haplogroups were characteristic of Asian populations. Comparison with the haplogroup of 3,523 Asian and American individuals linked one skeleton with a putative ancestral paleo-Asiatic population and the other with Chinese populations. It appears that the genetic study of ancient populations of Central Asia brings important elements to the understanding of human population movements in Asia. Copyright 2003 Wiley-Liss, Inc.

  20. Extensive length variation in the ribosomal DNA intergenic spacer of yellow perch (Perca flavescens).

    PubMed

    Kakou, Bidénam; Angers, Bernard; Glémet, Hélène

    2016-03-01

    The intergenic spacer (IGS) is located between ribosomal RNA (rRNA) gene copies. Within the IGS, regulatory elements for rRNA gene transcription are found, as well as a varying number of other repetitive elements that are at the root of IGS length heterogeneity. This heterogeneity has been shown to have a functional significance through its effect on growth rate. Here, we present the structural organization of yellow perch (Perca flavescens) IGS based on its entire sequence, as well as the IGS length variation within a natural population. Yellow perch IGS structure has four discrete regions containing tandem repeat elements. For three of these regions, no specific length class was detected as allele size was seemingly normally distributed. However, for one repeat region, PCR amplification uncovered the presence of two distinctive IGS variants representing a length difference of 1116 bp. This repeat region was also devoid of any CpG sites despite a high GC content. Balanced selection may be holding the alleles in the population and would account for the high diversity of length variants observed for adjacent regions. Our study is an important precursor for further work aiming to assess the role of IGS length variation in influencing growth rate in fish.

  1. Evolution in the block: common elements of 5S rDNA organization and evolutionary patterns in distant fish genera.

    PubMed

    Campo, Daniel; García-Vázquez, Eva

    2012-01-01

    The 5S rDNA is organized in the genome as tandemly repeated copies of a structural unit composed of a coding sequence plus a nontranscribed spacer (NTS). The coding region is highly conserved in the evolution, whereas the NTS vary in both length and sequence. It has been proposed that 5S rRNA genes are members of a gene family that have arisen through concerted evolution. In this study, we describe the molecular organization and evolution of the 5S rDNA in the genera Lepidorhombus and Scophthalmus (Scophthalmidae) and compared it with already known 5S rDNA of the very different genera Merluccius (Merluccidae) and Salmo (Salmoninae), to identify common structural elements or patterns for understanding 5S rDNA evolution in fish. High intra- and interspecific diversity within the 5S rDNA family in all the genera can be explained by a combination of duplications, deletions, and transposition events. Sequence blocks with high similarity in all the 5S rDNA members across species were identified for the four studied genera, with evidences of intense gene conversion within noncoding regions. We propose a model to explain the evolution of the 5S rDNA, in which the evolutionary units are blocks of nucleotides rather than the entire sequences or single nucleotides. This model implies a "two-speed" evolution: slow within blocks (homogenized by recombination) and fast within the gene family (diversified by duplications and deletions).

  2. Stabilization of perfect and imperfect tandem repeats by single-strand DNA exonucleases

    PubMed Central

    Feschenko, Vladimir V.; Rajman, Luis A.; Lovett, Susan T.

    2003-01-01

    Rearrangements between tandemly repeated DNA sequences are a common source of genetic instability. Such rearrangements underlie several human genetic diseases. In many organisms, the mismatch-repair (MMR) system functions to stabilize repeats when the repeat unit is short or when sequence imperfections are present between the repeats. We show here that the action of single-stranded DNA (ssDNA) exonucleases plays an additional, important role in stabilizing tandem repeats, independent of their role in MMR. For perfect repeats of ≈100 bp in Escherichia coli that are not susceptible to MMR, exonuclease (Exo)-I, ExoX, and RecJ exonuclease redundantly inhibit deletion. Our data suggest that >90% of potential deletion events are avoided by the combined action of these three exonucleases. Imperfect tandem repeats, less prone to rearrangements, are stabilized by both the MMR-pathway and ssDNA-specific exonucleases. For 100-bp repeats containing four mispairs, ExoI alone aborts most deletion events, even in the presence of a functional MMR system. By genetic analysis, we show that the inhibitory effect of ssDNA exonucleases on deletion formation is independent of the MutS and UvrD proteins. Exonuclease degradation of DNA displaced during the deletion process may abort slipped misalignment. Exonuclease action is therefore a significant force in genetic stabilization of many forms of repetitive DNA. PMID:12538867

  3. Identification of multiple binding sites for the THAP domain of the Galileo transposase in the long terminal inverted-repeats☆

    PubMed Central

    Marzo, Mar; Liu, Danxu; Ruiz, Alfredo; Chalmers, Ronald

    2013-01-01

    Galileo is a DNA transposon responsible for the generation of several chromosomal inversions in Drosophila. In contrast to other members of the P-element superfamily, it has unusually long terminal inverted-repeats (TIRs) that resemble those of Foldback elements. To investigate the function of the long TIRs we derived consensus and ancestral sequences for the Galileo transposase in three species of Drosophilids. Following gene synthesis, we expressed and purified their constituent THAP domains and tested their binding activity towards the respective Galileo TIRs. DNase I footprinting located the most proximal DNA binding site about 70 bp from the transposon end. Using this sequence we identified further binding sites in the tandem repeats that are found within the long TIRs. This suggests that the synaptic complex between Galileo ends may be a complicated structure containing higher-order multimers of the transposase. We also attempted to reconstitute Galileo transposition in Drosophila embryos but no events were detected. Thus, although the limited numbers of Galileo copies in each genome were sufficient to provide functional consensus sequences for the THAP domains, they do not specify a fully active transposase. Since the THAP recognition sequence is short, and will occur many times in a large genome, it seems likely that the multiple binding sites within the long, internally repetitive, TIRs of Galileo and other Foldback-like elements may provide the transposase with its binding specificity. PMID:23648487

  4. The Crystal Structure of TAL Effector PthXo1 Bound to Its DNA Target

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mak, Amanda Nga-Sze; Bradley, Philip; Cernadas, Raul A.

    2012-02-10

    DNA recognition by TAL effectors is mediated by tandem repeats, each 33 to 35 residues in length, that specify nucleotides via unique repeat-variable diresidues (RVDs). The crystal structure of PthXo1 bound to its DNA target was determined by high-throughput computational structure prediction and validated by heavy-atom derivatization. Each repeat forms a left-handed, two-helix bundle that presents an RVD-containing loop to the DNA. The repeats self-associate to form a right-handed superhelix wrapped around the DNA major groove. The first RVD residue forms a stabilizing contact with the protein backbone, while the second makes a base-specific contact to the DNA sense strand.more » Two degenerate amino-terminal repeats also interact with the DNA. Containing several RVDs and noncanonical associations, the structure illustrates the basis of TAL effector-DNA recognition.« less

  5. Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

    PubMed

    Cao, Yinhe; Tung, Wen-Wen; Gao, J B

    2004-01-01

    With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.

  6. The yeast DNA ligase gene CDC9 is controlled by six orientation specific upstream activating sequences that respond to cellular proliferation but which alone cannot mediate cell cycle regulation.

    PubMed Central

    White, J H; Johnson, A L; Lowndes, N F; Johnston, L H

    1991-01-01

    By fusing the CDC9 structural gene to the PGK upstream sequences and the CDC9 upstream to lacZ, we showed that the cell cycle expression of CDC9 is largely due to transcriptional regulation. To investigate the role of six ATGATT upstream repeats in CDC9 regulation, synthetic copies of the sequence were attached to a heterologous gene. The repeats stimulated transcription strongly and additively, but, unlike conventional yeast UAS elements, only when present in one orientation. Transcription driven by the repeats declines in cells held at START of the cell cycle or in stationary phase, as occurs with CDC9. However, the repeats by themselves cannot impart cell cycle regulation to a heterologous gene. CDC9 may therefore be controlled by an activating system operating through the repeats that is sensitive to cellular proliferation and a separate mechanism that governs the periodic expression in the cell cycle. Images PMID:1901644

  7. Stable CoT-1 repeat RNA is abundant and associated with euchromatic interphase chromosomes

    PubMed Central

    Hall, Lisa L.; Carone, Dawn M.; Gomez, Alvin; Kolpa, Heather J.; Byron, Meg; Mehta, Nitish; Fackelmayer, Frank O.; Lawrence, Jeanne B.

    2014-01-01

    SUMMARY Recent studies recognize a vast diversity of non-coding RNAs with largely unknown functions, but few have examined interspersed repeat sequences, which constitute almost half our genome. RNA hybridization in situ using CoT-1 (highly repeated) DNA probes detects surprisingly abundant euchromatin-associated RNA comprised predominantly of repeat sequences (“CoT-1 RNA”), including LINE-1. CoT-1-hybridizing RNA strictly localizes to the interphase chromosome territory in cis, and remains stably associated with the chromosome territory following prolonged transcriptional inhibition. The CoT-1 RNA territory resists mechanical disruption and fractionates with the non-chromatin scaffold, but can be experimentally released. Loss of repeat-rich, stable nuclear RNAs from euchromatin corresponds to aberrant chromatin distribution and condensation. CoT-1 RNA has several properties similar to XIST chromosomal RNA, but is excluded from chromatin condensed by XIST. These findings impact two “black boxes” of genome science: the poorly understood diversity of non-coding RNA and the unexplained abundance of repetitive elements. PMID:24581492

  8. CHARACTERIZATION AND NUCLEOTIDE SEQUENCE DETERMINATION OF A REPEAT ELEMENT ISOLATED FROM A 2,4,5,-T DEGRADING STRAIN OF PSEUDOMONAS CEPACIA

    EPA Science Inventory

    Pseudomonas cepacia strain AC1100, capable of growth on 2,4,5-trichlorophenoxyacetic acid (2,4,5-T), was mutated to the 2,4,5-T− strain PT88 by a ColE1 :: Tn5 chromosomal insertion. Using cloned DNA from the region flanking the insertion, a 1477-bp sequence (designated RS1100) wa...

  9. DDM1 represses noncoding RNA expression and RNA-directed DNA methylation in heterochromatin.

    PubMed

    Tan, Feng; Lu, Yue; Jiang, Wei; Zhao, Yu; Wu, Tian; Zhang, Ruoyu; Zhou, Dao-Xiu

    2018-05-24

    Cytosine methylation of DNA, which occurs at CG, CHG, and CHH (H=A, C, or T) sequences in plants, is a hallmark for epigenetic repression of repetitive sequences. The chromatin remodeling factor DECREASE IN DNA METHYLATION1 (DDM1) is essential for DNA methylation, especially at CG and CHG sequences. However, its potential role in RNA-directed DNA methylation (RdDM) and in chromatin function is not completely understood in rice (Oryza sativa). In this work, we used high-throughput approaches to study the function of rice DDM1 (OsDDM1) in RdDM and the expression of non-coding RNA (ncRNA). We show that loss of function of OsDDM1 results in ectopic CHH methylation of transposable elements and repeats. The ectopic CHH methylation was dependent on rice DOMAINS REARRANGED METHYLTRANSFERASE2 (OsDRM2), a DNA methyltransferase involved in RdDM. Mutations in OsDDM1 lead to decreases of histone H3K9me2 and increases in the levels of heterochromatic small RNA (sRNA) and long noncoding RNA (lncRNA). In particular, OsDDM1 was found to be essential to repress transcription of the two repetitive sequences, Centromeric Retrotransposons of Rice1 (CRR1) and the dominant centromeric CentO repeats. These results suggest that OsDDM1 antagonizes RdDM at heterochromatin and represses tissue-specific expression of ncRNA from repetitive sequences in the rice genome. {copyright, serif} 2018 American Society of Plant Biologists. All rights reserved.

  10. High-Resolution Whole-Genome Sequencing Reveals That Specific Chromatin Domains from Most Human Chromosomes Associate with Nucleoli

    PubMed Central

    van Koningsbruggen, Silvana; Gierliński, Marek; Schofield, Pietá; Martin, David; Barton, Geoffey J.; Ariyurek, Yavuz; den Dunnen, Johan T.

    2010-01-01

    The nuclear space is mostly occupied by chromosome territories and nuclear bodies. Although this organization of chromosomes affects gene function, relatively little is known about the role of nuclear bodies in the organization of chromosomal regions. The nucleolus is the best-studied subnuclear structure and forms around the rRNA repeat gene clusters on the acrocentric chromosomes. In addition to rDNA, other chromatin sequences also surround the nucleolar surface and may even loop into the nucleolus. These additional nucleolar-associated domains (NADs) have not been well characterized. We present here a whole-genome, high-resolution analysis of chromatin endogenously associated with nucleoli. We have used a combination of three complementary approaches, namely fluorescence comparative genome hybridization, high-throughput deep DNA sequencing and photoactivation combined with time-lapse fluorescence microscopy. The data show that specific sequences from most human chromosomes, in addition to the rDNA repeat units, associate with nucleoli in a reproducible and heritable manner. NADs have in common a high density of AT-rich sequence elements, low gene density and a statistically significant enrichment in transcriptionally repressed genes. Unexpectedly, both the direct DNA sequencing and fluorescence photoactivation data show that certain chromatin loci can specifically associate with either the nucleolus, or the nuclear envelope. PMID:20826608

  11. High-resolution whole-genome sequencing reveals that specific chromatin domains from most human chromosomes associate with nucleoli.

    PubMed

    van Koningsbruggen, Silvana; Gierlinski, Marek; Schofield, Pietá; Martin, David; Barton, Geoffey J; Ariyurek, Yavuz; den Dunnen, Johan T; Lamond, Angus I

    2010-11-01

    The nuclear space is mostly occupied by chromosome territories and nuclear bodies. Although this organization of chromosomes affects gene function, relatively little is known about the role of nuclear bodies in the organization of chromosomal regions. The nucleolus is the best-studied subnuclear structure and forms around the rRNA repeat gene clusters on the acrocentric chromosomes. In addition to rDNA, other chromatin sequences also surround the nucleolar surface and may even loop into the nucleolus. These additional nucleolar-associated domains (NADs) have not been well characterized. We present here a whole-genome, high-resolution analysis of chromatin endogenously associated with nucleoli. We have used a combination of three complementary approaches, namely fluorescence comparative genome hybridization, high-throughput deep DNA sequencing and photoactivation combined with time-lapse fluorescence microscopy. The data show that specific sequences from most human chromosomes, in addition to the rDNA repeat units, associate with nucleoli in a reproducible and heritable manner. NADs have in common a high density of AT-rich sequence elements, low gene density and a statistically significant enrichment in transcriptionally repressed genes. Unexpectedly, both the direct DNA sequencing and fluorescence photoactivation data show that certain chromatin loci can specifically associate with either the nucleolus, or the nuclear envelope.

  12. The coactivator CBP stimulates human T-cell lymphotrophic virus type I Tax transactivation in vitro.

    PubMed

    Kashanchi, F; Duvall, J F; Kwok, R P; Lundblad, J R; Goodman, R H; Brady, J N

    1998-12-18

    Tax interacts with the cellular cyclic AMP-responsive element binding protein (CREB) and facilitates the binding of the coactivator CREB binding protein (CBP), forming a multimeric complex on the cyclic AMP-responsive element (CRE)-like sites in the human T-cell lymphotrophic virus type I (HTLV-I) promoter. The trimeric complex is believed to recruit additional regulatory proteins to the HTLV-I long terminal repeat, but there has been no direct evidence that CBP is required for Tax-mediated transactivation. We present evidence that Tax and CBP activate transcription from the HTLV-I 21 base pair repeats on naked DNA templates. Transcriptional activation of the HTLV-I sequences required both Tax and CBP and could be mediated by either the N-terminal activation domain of CBP or the full-length protein. Fluorescence polarization binding assays indicated that CBP does not markedly enhance the affinity of Tax for the trimeric complex. Transcription analyses suggest that CBP activates Tax-dependent transcription by promoting transcriptional initiation and reinitiation. The ability of CBP to activate the HTLV-I promoter does not involve the stabilization of Tax binding, but rather depends upon gene activation properties of the co-activator that function in the context of a naked DNA template.

  13. Analysis of Duck Hepatitis B Virus Reverse Transcription Indicates a Common Mechanism for the Two Template Switches during Plus-Strand DNA Synthesis

    PubMed Central

    Havert, Michael B.; Ji, Lin; Loeb, Daniel D.

    2002-01-01

    The synthesis of the hepadnavirus relaxed circular DNA genome requires two template switches, primer translocation and circularization, during plus-strand DNA synthesis. Repeated sequences serve as donor and acceptor templates for these template switches, with direct repeat 1 (DR1) and DR2 for primer translocation and 5′r and 3′r for circularization. These donor and acceptor sequences are at, or near, the ends of the minus-strand DNA. Analysis of plus-strand DNA synthesis of duck hepatitis B virus (DHBV) has indicated that there are at least three other cis-acting sequences that make contributions during the synthesis of relaxed circular DNA. These sequences, 5E, M, and 3E, are located near the 5′ end, the middle, and the 3′ end of minus-strand DNA, respectively. The mechanism by which these sequences contribute to the synthesis of plus-strand DNA was unclear. Our aim was to better understand the mechanism by which 5E and M act. We localized the DHBV 5E element to a short sequence of approximately 30 nucleotides that is 100 nucleotides 3′ of DR2 on minus-strand DNA. We found that the new 5E mutants were partially defective for primer translocation/utilization at DR2. They were also invariably defective for circularization. In addition, examination of several new DHBV M variants indicated that they too were defective for primer translocation/utilization and circularization. Thus, this analysis indicated that 5E and M play roles in both primer translocation/utilization and circularization. In conjunction with earlier findings that 3E functions in both template switches, our findings indicate that the processes of primer translocation and circularization share a common underlying mechanism. PMID:11861843

  14. CRISPR-Cas systems: prokaryotes upgrade to adaptive immunity

    PubMed Central

    Barrangou, Rodolphe; Marraffini, Luciano A.

    2014-01-01

    Summary Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR), and associated proteins (Cas) comprise the CRISPR-Cas system, which confers adaptive immunity against exogenic elements in many bacteria and most archaea. CRISPR-mediated immunization occurs through the uptake of DNA from invasive genetic elements such as plasmids and viruses, followed by its integration into CRISPR loci. These loci are subsequently transcribed and processed into small interfering RNAs that guide nucleases for specific cleavage of complementary sequences. Conceptually, CRISPR-Cas shares functional features with the mammalian adaptive immune system, while also exhibiting characteristics of Lamarckian evolution. Because immune markers spliced from exogenous agents are integrated iteratively in CRISPR loci, they constitute a genetic record of vaccination events and reflect environmental conditions and changes over time. Cas endonucleases, which can be reprogrammed by small guide RNAs have shown unprecedented potential and flexibility for genome editing, and can be repurposed for numerous DNA targeting applications including transcriptional control. PMID:24766887

  15. Characterization of the Fb-Nof Transposable Element of Drosophila Melanogaster

    PubMed Central

    Harden, N.; Ashburner, M.

    1990-01-01

    FB-NOF is a composite transposable element of Drosophila melanogaster. It is composed of foldback sequences, of variable length, which flank a 4-kb NOF sequence with 308-bp inverted repeat termini. The NOF sequence could potentially code for a 120-kD polypeptide. The FB-NOF element is responsible for unstable mutations of the white gene (w(c) and w(DZL)) and is associated with the large TEs of G. Ising. Although most strains of D. melanogaster have 20-30 sites of FB insertion, FB-NOF elements are usually rare, many strains lack this composite element or have only one copy of it. A few strains, including w(DZL) and Basc have many (8-21) copies of FB-NOF, and these show a tendency to insert at ``hot-spots.'' These strains also have an increased number of FB elements. The DNA sequence of the NOF region associated with TE146(Z) has been determined. PMID:2174013

  16. The Influence of Primary and Secondary DNA Structure in Deletion and Duplication between Direct Repeats in Escherichia Coli

    PubMed Central

    Trinh, T. Q.; Sinden, R. R.

    1993-01-01

    We describe a system to measure the frequency of both deletions and duplications between direct repeats. Short 17- and 18-bp palindromic and nonpalindromic DNA sequences were cloned into the EcoRI site within the chloramphenicol acetyltransferase gene of plasmids pBR325 and pJT7. This creates an insert between direct repeated EcoRI sites and results in a chloramphenicol-sensitive phenotype. Selection for chloramphenicol resistance was utilized to select chloramphenicol resistant revertants that included those with precise deletion of the insert from plasmid pBR325 and duplication of the insert in plasmid pJT7. The frequency of deletion or duplication varied more than 500-fold depending on the sequence of the short sequence inserted into the EcoRI site. For the nonpalindromic inserts, multiple internal direct repeats and the length of the direct repeats appear to influence the frequency of deletion. Certain palindromic DNA sequences with the potential to form DNA hairpin structures that might stabilize the misalignment of direct repeats had a high frequency of deletion. Other DNA sequences with the potential to form structures that might destabilize misalignment of direct repeats had a very low frequency of deletion. Duplication mutations occurred at the highest frequency when the DNA between the direct repeats contained no direct or inverted repeats. The presence of inverted repeats dramatically reduced the frequency of duplications. The results support the slippage-misalignment model, suggesting that misalignment occurring during DNA replication leads to deletion and duplication mutations. The results also support the idea that the formation of DNA secondary structures during DNA replication can facilitate and direct specific mutagenic events. PMID:8325478

  17. Disruption of Higher Order DNA Structures in Friedreich’s Ataxia (GAA)n Repeats by PNA or LNA Targeting

    PubMed Central

    Bergquist, Helen; Rocha, Cristina S. J.; Álvarez-Asencio, Rubén; Nguyen, Chi-Hung; Rutland, Mark. W.; Smith, C. I. Edvard; Good, Liam; Nielsen, Peter E.; Zain, Rula

    2016-01-01

    Expansion of (GAA)n repeats in the first intron of the Frataxin gene is associated with reduced mRNA and protein levels and the development of Friedreich’s ataxia. (GAA)n expansions form non-canonical structures, including intramolecular triplex (H-DNA), and R-loops and are associated with epigenetic modifications. With the aim of interfering with higher order H-DNA (like) DNA structures within pathological (GAA)n expansions, we examined sequence-specific interaction of peptide nucleic acid (PNA) with (GAA)n repeats of different lengths (short: n=9, medium: n=75 or long: n=115) by chemical probing of triple helical and single stranded regions. We found that a triplex structure (H-DNA) forms at GAA repeats of different lengths; however, single stranded regions were not detected within the medium size pathological repeat, suggesting the presence of a more complex structure. Furthermore, (GAA)4-PNA binding of the repeat abolished all detectable triplex DNA structures, whereas (CTT)5-PNA did not. We present evidence that (GAA)4-PNA can invade the DNA at the repeat region by binding the DNA CTT strand, thereby preventing non-canonical-DNA formation, and that triplex invasion complexes by (CTT)5-PNA form at the GAA repeats. Locked nucleic acid (LNA) oligonucleotides also inhibited triplex formation at GAA repeat expansions, and atomic force microscopy analysis showed significant relaxation of plasmid morphology in the presence of GAA-LNA. Thus, by inhibiting disease related higher order DNA structures in the Frataxin gene, such PNA and LNA oligomers may have potential for discovery of drugs aiming at recovering Frataxin expression. PMID:27846236

  18. Molecular Dynamics Simulations of DNA-Free and DNA-Bound TAL Effectors

    PubMed Central

    Wan, Hua; Hu, Jian-ping; Li, Kang-shun; Tian, Xu-hong; Chang, Shan

    2013-01-01

    TAL (transcriptional activator-like) effectors (TALEs) are DNA-binding proteins, containing a modular central domain that recognizes specific DNA sequences. Recently, the crystallographic studies of TALEs revealed the structure of DNA-recognition domain. In this article, molecular dynamics (MD) simulations are employed to study two crystal structures of an 11.5-repeat TALE, in the presence and absence of DNA, respectively. The simulated results indicate that the specific binding of RVDs (repeat-variable diresidues) with DNA leads to the markedly reduced fluctuations of tandem repeats, especially at the two ends. In the DNA-bound TALE system, the base-specific interaction is formed mainly by the residue at position 13 within a TAL repeat. Tandem repeats with weak RVDs are unfavorable for the TALE-DNA binding. These observations are consistent with experimental studies. By using principal component analysis (PCA), the dominant motions are open-close movements between the two ends of the superhelical structure in both DNA-free and DNA-bound TALE systems. The open-close movements are found to be critical for the recognition and binding of TALE-DNA based on the analysis of free energy landscape (FEL). The conformational analysis of DNA indicates that the 5′ end of DNA target sequence has more remarkable structural deformability than the other sites. Meanwhile, the conformational change of DNA is likely associated with the specific interaction of TALE-DNA. We further suggest that the arrangement of N-terminal repeats with strong RVDs may help in the design of efficient TALEs. This study provides some new insights into the understanding of the TALE-DNA recognition mechanism. PMID:24130757

  19. Long interspersed nuclear element-1 retroelements are expressed in patients with systemic autoimmune disease and induce type I interferon

    PubMed Central

    Mavragani, Clio P.; Sagalovskiy, Irina; Guo, Qiu; Nezos, Adrianos; Kapsogeorgou, Efstathia K.; Lu, Pin; Zhou, Jun Liang; Kirou, Kyriakos A.; Seshan, Surya V.; Moutsopoulos, Haralampos M.; Crow, Mary K.

    2016-01-01

    Objective Increased type I interferon (IFN-I) and a broad signature of IFN-I-induced gene transcripts are observed in patients with SLE and other systemic autoimmune diseases. To identify disease-relevant triggers of the IFN-I pathway we investigated whether endogenous virus-like genomic repeat elements, normally silent, might be expressed in patients with systemic autoimmune disease, activate an innate immune response and induce IFN-I. Methods Expression of IFN-I and long interspersed nuclear element-1 (LINE-1; L1) was studied in kidney tissue from lupus patients and minor salivary gland (MSG) tissue from patients with primary Sjogren’s syndrome (SS) by PCR, western blot and immunohistochemistry. Induction of IFN-I by L1 was investigated by transfection of plasmacytoid dendritic cells (pDCs) or monocytes with an L1-encoding plasmid or L1 RNA. Involvement of innate immune pathways and altered L1 methylation were assessed. Results L1 mRNA transcripts were increased in lupus nephritis kidneys and in MSG from SS patients and correlated with IFN-I expression and L1 DNA demethylation. L1 open reading frame 1/p40 protein and IFNβ were expressed in MSG ductal epithelial cells and in lupus kidneys, and IFNα was detected in infiltrating pDCs. Transfection of pDCs or monocytes with L1-encoding DNA or RNA induced IFN-I. Inhibition of TLR7/8 reduced L1 induction of IFNα in pDCs and an inhibitor of IKKε/TBK1 abrogated induction of IFN-I by L1 RNA in monocytes. Conclusion L1 genomic repeat elements represent endogenous nucleic acid triggers of the IFN-I pathway in SLE and SS and may contribute to initiation or amplification of autoimmune disease. PMID:27338297

  20. Mobilization of a plant transposon by expression of the transposon-encoded anti-silencing factor.

    PubMed

    Fu, Yu; Kawabe, Akira; Etcheverry, Mathilde; Ito, Tasuku; Toyoda, Atsushi; Fujiyama, Asao; Colot, Vincent; Tarutani, Yoshiaki; Kakutani, Tetsuji

    2013-08-28

    Transposable elements (TEs) have a major impact on genome evolution, but they are potentially deleterious, and most of them are silenced by epigenetic mechanisms, such as DNA methylation. Here, we report the characterization of a TE encoding an activity to counteract epigenetic silencing by the host. In Arabidopsis thaliana, we identified a mobile copy of the Mutator-like element (MULE) with degenerated terminal inverted repeats (TIRs). This TE, named Hiun (Hi), is silent in wild-type plants, but it transposes when DNA methylation is abolished. When a Hi transgene was introduced into the wild-type background, it induced excision of the endogenous Hi copy, suggesting that Hi is the autonomously mobile copy. In addition, the transgene induced loss of DNA methylation and transcriptional activation of the endogenous Hi. Most importantly, the trans-activation of Hi depends on a Hi-encoded protein different from the conserved transposase. Proteins related to this anti-silencing factor, which we named VANC, are widespread in the non-TIR MULEs and may have contributed to the recent success of these TEs in natural Arabidopsis populations.

  1. Intramolecular transposition by a synthetic IS50 (Tn5) derivative

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tomcsanyi, T.; Phadnis, S.H.; Berg, D.E.

    1990-11-01

    We report the formation of deletions and inversions by intramolecular transposition of Tn5-derived mobile elements. The synthetic transposons used contained the IS50 O and I end segments and the transposase gene, a contraselectable gene encoding sucrose sensitivity (sacB), antibiotic resistance genes, and a plasmid replication origin. Both deletions and inversions were associated with loss of a 300-bp segment that is designated the vector because it is outside of the transposon. Deletions were severalfold more frequent than inversions, perhaps reflecting constraints on DNA twisting or abortive transposition. Restriction and DNA sequence analyses showed that both types of rearrangements extended from onemore » transposon end to many different sites in target DNA. In the case of inversions, transposition generated 9-bp direct repeats of target sequences.« less

  2. [Molecular variability in the commom shrew Sorex araneus L. from European Russia and Siberia inferred from the length polymorphism of DNA regions flanked by short interspersed elements (Inter-SINE PCR) and the relationships between the Moscow and Seliger chromosome races].

    PubMed

    Bannikova, A A; Bulatova, N Sh; Kramerov, D A

    2006-06-01

    Genetic exchange among chromosomal races of the common shrew Sorex araneus and the problem of reproductive barriers have been extensively studied by means of such molecular markers as mtDNA, microsatellites, and allozymes. In the present study, the interpopulation and interracial polymorphism in the common shrew was derived, using fingerprints generated by amplified DNA regions flanked by short interspersed repeats (SINEs)-interSINE PCR (IS-PCR). We used primers, complementary to consensus sequences of two short retroposons: mammalian element MIR and the SOR element from the genome of Sorex araneus. Genetic differentiation among eleven populations of the common shrew from eight chromosome races was estimated. The NP and MJ analyses, as well as multidimensional scaling showed that all samples examined grouped into two main clusters, corresponding to European Russia and Siberia. The bootstrap support of the European Russia cluster in the NJ and MP analyses was respectively 76 and 61%. The bootstrap index for the Siberian cluster was 100% in both analyses; the Tomsk race, included into this cluster, was separated with the bootstrap support of NJ/MP 92/95%.

  3. A novel cold-inducible zinc finger protein from soybean, SCOF-1, enhances cold tolerance in transgenic plants.

    PubMed

    Kim, J C; Lee, S H; Cheong, Y H; Yoo, C M; Lee, S I; Chun, H J; Yun, D J; Hong, J C; Lee, S Y; Lim, C O; Cho, M J

    2001-02-01

    Cold stress on plants induces changes in the transcription of cold response genes. A cDNA clone encoding C2H2-type zinc finger protein, SCOF-1, was isolated from soybean. The transcription of SCOF-1 is specifically induced by low temperature and abscisic acid (ABA) but not by dehydration or high salinity. Constitutive overexpression of SCOF-1 induced cold-regulated (COR) gene expression and enhanced cold tolerance of non-acclimated transgenic Arabidopsis and tobacco plants. SCOF-1 localized to the nucleus but did not bind directly to either C-repeat/dehydration (CRT/DRE) or ABA responsive element (ABRE), cis-acting DNA regulatory elements present in COR gene promoters. However, SCOF-1 greatly enhanced the DNA binding activity of SGBF-1, a soybean G-box binding bZIP transcription factor, to ABRE in vitro. SCOF-1 also interacted with SGBF-1 in a yeast two-hybrid system. The SGBF-1 transactivated the beta-glucuronidase reporter gene driven by the ABRE element in Arabidopsis leaf protoplasts. Furthermore, the SCOF-1 enhanced ABRE-dependent gene expression mediated by SGBF-1. These results suggest that SCOF-1 may function as a positive regulator of COR gene expression mediated by ABRE via protein-protein interaction, which in turn enhances cold tolerance of plants.

  4. Methylator phenotype of malignant germ cell tumours in children identifies strong candidates for chemotherapy resistance

    PubMed Central

    Jeyapalan, J N; Noor, D A Mohamed; Lee, S-H; Tan, C L; Appleby, V A; Kilday, J P; Palmer, R D; Schwalbe, E C; Clifford, S C; Walker, D A; Murray, M J; Coleman, N; Nicholson, J C; Scotting, P J

    2011-01-01

    Background: Yolk sac tumours (YSTs) and germinomas are the two major pure histological subtypes of germ cell tumours. To date, the role of DNA methylation in the aetiology of this class of tumour has only been analysed in adult testicular forms and with respect to only a few genes. Methods: A bank of paediatric tumours was analysed for global methylation of LINE-1 repeat elements and global methylation of regulatory elements using GoldenGate methylation arrays. Results: Both germinomas and YSTs exhibited significant global hypomethylation of LINE-1 elements. However, in germinomas, methylation of gene regulatory regions differed little from control samples, whereas YSTs exhibited increased methylation at a large proportion of the loci tested, showing a ‘methylator' phenotype, including silencing of genes associated with Caspase-8-dependent apoptosis. Furthermore, we found that the methylator phenotype of YSTs was coincident with higher levels of expression of the DNA methyltransferase, DNA (cytosine-5)-methyltransferase 3B, suggesting a mechanism underlying the phenotype. Conclusion: Epigenetic silencing of a large number of potential tumour suppressor genes in YSTs might explain why they exhibit a more aggressive natural history than germinomas and silencing of genes associated with Caspase-8-dependent cell death might explain the relative resistance of YSTs to conventional therapy. PMID:21712824

  5. Methylator phenotype of malignant germ cell tumours in children identifies strong candidates for chemotherapy resistance.

    PubMed

    Jeyapalan, J N; Noor, D A Mohamed; Lee, S-H; Tan, C L; Appleby, V A; Kilday, J P; Palmer, R D; Schwalbe, E C; Clifford, S C; Walker, D A; Murray, M J; Coleman, N; Nicholson, J C; Scotting, P J

    2011-08-09

    Yolk sac tumours (YSTs) and germinomas are the two major pure histological subtypes of germ cell tumours. To date, the role of DNA methylation in the aetiology of this class of tumour has only been analysed in adult testicular forms and with respect to only a few genes. A bank of paediatric tumours was analysed for global methylation of LINE-1 repeat elements and global methylation of regulatory elements using GoldenGate methylation arrays. Both germinomas and YSTs exhibited significant global hypomethylation of LINE-1 elements. However, in germinomas, methylation of gene regulatory regions differed little from control samples, whereas YSTs exhibited increased methylation at a large proportion of the loci tested, showing a 'methylator' phenotype, including silencing of genes associated with Caspase-8-dependent apoptosis. Furthermore, we found that the methylator phenotype of YSTs was coincident with higher levels of expression of the DNA methyltransferase, DNA (cytosine-5)-methyltransferase 3B, suggesting a mechanism underlying the phenotype. Epigenetic silencing of a large number of potential tumour suppressor genes in YSTs might explain why they exhibit a more aggressive natural history than germinomas and silencing of genes associated with Caspase-8-dependent cell death might explain the relative resistance of YSTs to conventional therapy.

  6. Restless 5S: the re-arrangement(s) and evolution of the nuclear ribosomal DNA in land plants.

    PubMed

    Wicke, Susann; Costa, Andrea; Muñoz, Jesùs; Quandt, Dietmar

    2011-11-01

    Among eukaryotes two types of nuclear ribosomal DNA (nrDNA) organization have been observed. Either all components, i.e. the small ribosomal subunit, 5.8S, large ribosomal subunit, and 5S occur tandemly arranged or the 5S rDNA forms a separate cluster of its own. Generalizations based on data derived from just a few model organisms have led to a superimposition of structural and evolutionary traits to the entire plant kingdom asserting that plants generally possess separate arrays. This study reveals that plant nrDNA organization into separate arrays is not a distinctive feature, but rather assignable almost solely to seed plants. We show that early diverging land plants and presumably streptophyte algae share a co-localization of all rRNA genes within one repeat unit. This raises the possibility that the state of rDNA gene co-localization had occurred in their common ancestor. Separate rDNA arrays were identified for all basal seed plants and water ferns, implying at least two independent 5S rDNA transposition events during land plant evolution. Screening for 5S derived Cassandra transposable elements which might have played a role during the transposition events, indicated that this retrotransposon is absent in early diverging vascular plants including early fern lineages. Thus, Cassandra can be rejected as a primary mechanism for 5S rDNA transposition in water ferns. However, the evolution of Cassandra and other eukaryotic 5S derived elements might have been a side effect of the 5S rDNA cluster formation. Structural analysis of the intergenic spacers of the ribosomal clusters revealed that transposition events partially affect spacer regions and suggests a slightly different transcription regulation of 5S rDNA in early land plants. 5S rDNA upstream regulatory elements are highly divergent or absent from the LSU-5S spacers of most early divergent land plant lineages. Several putative scenarios and mechanisms involved in the concerted relocation of hundreds of 5S rRNA gene copies are discussed. Copyright © 2011 Elsevier Inc. All rights reserved.

  7. Nucleotide sequences of Dictyostelium discoideum developmentally regulated cDNAs rich in (AAC) imply proteins that contain clusters of asparagine, glutamine, or threonine.

    PubMed

    Shaw, D R; Richter, H; Giorda, R; Ohmachi, T; Ennis, H L

    1989-09-01

    A Dictyostelium discoideum repetitive element composed of long repeats of the codon (AAC) is found in developmentally regulated transcripts. The concentration of (AAC) sequences is low in mRNA from dormant spores and growing cells and increases markedly during spore germination and multicellular development. The sequence hybridizes to many different sized Dictyostelium DNA restriction fragments indicating that it is scattered throughout the genome. Four cDNA clones isolated contain (AAC) sequences in the deduced coding region. Interestingly, the (AAC)-rich sequences are present in all three reading frames in the deduced proteins, i.e., AAC (asparagine), ACA (threonine) and CAA (glutamine). Three of the clones contain only one of these in-frame so that the individual proteins carry either asparagine, threonine, or glutamine clusters, not mixtures. However, one clone is both glutamine- and asparagine-rich. The (AAC) portion of the transcripts are reiterated 300 times in the haploid genome while the other portions of the cDNAs represent single copy genes, whose sequences show no similarity other than the (AAC) repeats. The repeated sequence is similar to the opa or M sequence found in Drosophila melanogaster notch and homeo box genes and in fly developmentally regulated transcripts. The transcripts are present on polysomes suggesting that they are translated. Although the function of these repeats is unknown, long amino acid repeats are a characteristic feature of extracellular proteins of lower eukaryotes.

  8. A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

    PubMed Central

    2010-01-01

    Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT). Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the repeat may be disseminated by HGT and intra-genomic shuffling. Conclusions We describe novel features of PARCELs (Palindromic Amphipathic Repeat Coding ELements), a set of widely distributed repeat protein domains and coding sequences that were likely acquired through HGT by diverse unicellular microbes, further mobilized and diversified within genomes, and co-opted for expression in the membrane proteome of some taxa. Disseminated by multiple gene-centric vehicles, ORFs harboring these elements enhance accessory gene pools as part of the "mobilome" connecting genomes of various clades, in taxa sharing common niches. PMID:20626840

  9. Development and validation of a real-time PCR assay for the detection of Toxoplasma gondii DNA in animal and meat samples.

    PubMed

    Marino, Anna Maria Fausta; Percipalle, Maurizio; Giunta, Renato Paolo; Salvaggio, Antonio; Caracappa, Giulia; Alfonzetti, Tiziana; Aparo, Alessandra; Reale, Stefano

    2017-03-01

    We report a rapid and reliable method for the detection of Toxoplasma gondii in meat and animal tissues based on real-time polymerase chain reaction (PCR). Samples were collected from cattle, small ruminants, horses, and pigs raised or imported into Sicily, Italy. All DNA preparations were assayed by real-time PCR tests targeted to a 98-bp long fragment in the AF 529-bp repeat element and to the B1 gene using specific primers. Diagnostic sensitivity (100%), diagnostic specificity (100%), limit of detection (0.01 pg), efficiency (92-109%), and precision (mean coefficient of variation = 0.60%), repeatability (100%), reproducibility (100%), and robustness were evaluated using 240 DNA extracted samples (120 positives and 120 negative as per the OIE nested PCR method) from different matrices. Positive results were confirmed by the repetition of both real-time and nested PCR assays. Our study demonstrates the viability of a reliable, rapid, and specific real-time PCR on a large scale to monitor contamination with Toxoplasma cysts in meat and animal specimens. This validated method can be used for postmortem detection in domestic and wild animals and for food safety purposes.

  10. Zaba: a novel miniature transposable element present in genomes of legume plants.

    PubMed

    Macas, J; Neumann, P; Pozárková, D

    2003-08-01

    A novel family of miniature transposable elements, named Zaba, was identified in pea (Pisum sativum) and subsequently also in other legume species using computer analysis of their DNA sequences. Zaba elements are 141-190 bp long, generate 10-bp target site duplications, and their terminal inverted repeats make up most of the sequence. Zaba elements thus resemble class 3 foldback transposons. The elements are only moderately repetitive in pea (tens to hundreds copies per haploid genome), but they are present in up to thousands of copies in the genomes of several Medicago and Vicia species. More detailed analysis of the elements from pea, including isolation of new sequences from a genomic library, revealed that a fraction of these elements are truncated, and that their last transposition probably did not occur recently. A search for Zaba sequences in EST databases showed that at least some elements are transcribed, most probably due to their association with genic regions.

  11. Problem-solving test: Southwestern blotting.

    PubMed

    Szeberényi, József

    2014-01-01

    Terms to be familiar with before you start to solve the test: Southern blotting, Western blotting, restriction endonucleases, agarose gel electrophoresis, nitrocellulose filter, molecular hybridization, polyacrylamide gel electrophoresis, proto-oncogene, c-abl, Src-homology domains, tyrosine protein kinase, nuclear localization signal, cDNA, deletion mutants, expression plasmid, transfection, RNA polymerase II, promoter, Shine-Dalgarno sequence, polyadenylation element, affinity chromatography, Northern blotting, immunoprecipitation, sodium dodecylsulfate, autoradiography, tandem repeats. Copyright © 2014 The International Union of Biochemistry and Molecular Biology.

  12. Regulation of DNA methylation patterns by CK2-mediated phosphorylation of Dnmt3a.

    PubMed

    Deplus, Rachel; Blanchon, Loïc; Rajavelu, Arumugam; Boukaba, Abdelhalim; Defrance, Matthieu; Luciani, Judith; Rothé, Françoise; Dedeurwaerder, Sarah; Denis, Hélène; Brinkman, Arie B; Simmer, Femke; Müller, Fabian; Bertin, Benjamin; Berdasco, Maria; Putmans, Pascale; Calonne, Emilie; Litchfield, David W; de Launoit, Yvan; Jurkowski, Tomasz P; Stunnenberg, Hendrik G; Bock, Christoph; Sotiriou, Christos; Fraga, Mario F; Esteller, Manel; Jeltsch, Albert; Fuks, François

    2014-08-07

    DNA methylation is a central epigenetic modification that is established by de novo DNA methyltransferases. The mechanisms underlying the generation of genomic methylation patterns are still poorly understood. Using mass spectrometry and a phosphospecific Dnmt3a antibody, we demonstrate that CK2 phosphorylates endogenous Dnmt3a at two key residues located near its PWWP domain, thereby downregulating the ability of Dnmt3a to methylate DNA. Genome-wide DNA methylation analysis shows that CK2 primarily modulates CpG methylation of several repeats, most notably of Alu SINEs. This modulation can be directly attributed to CK2-mediated phosphorylation of Dnmt3a. We also find that CK2-mediated phosphorylation is required for localization of Dnmt3a to heterochromatin. By revealing phosphorylation as a mode of regulation of de novo DNA methyltransferase function and by uncovering a mechanism for the regulation of methylation at repetitive elements, our results shed light on the origin of DNA methylation patterns. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  13. The phytochemical 3,3'-diindolylmethane decreases expression of AR-controlled DNA damage repair genes through repressive chromatin modifications and is associated with DNA damage in prostate cancer cells.

    PubMed

    Palomera-Sanchez, Zoraya; Watson, Gregory W; Wong, Carmen P; Beaver, Laura M; Williams, David E; Dashwood, Roderick H; Ho, Emily

    2017-09-01

    Androgen receptor (AR) is a transcription factor involved in normal prostate physiology and prostate cancer (PCa) development. 3,3'-Diindolylmethane (DIM) is a promising phytochemical agent against PCa that affects AR activity and epigenetic regulators in PCa cells. However, whether DIM suppresses PCa via epigenetic regulation of AR target genes is unknown. We assessed epigenetic regulation of AR target genes in LNCaP PCa cells and showed that DIM treatment led to epigenetic suppression of AR target genes involved in DNA repair (PARP1, MRE11, DNA-PK). Decreased expression of these genes was accompanied by an increase in repressive chromatin marks, loss of AR occupancy and EZH2 recruitment to their regulatory regions. Decreased DNA repair gene expression was associated with an increase in DNA damage (γH2Ax) and up-regulation of genomic repeat elements LINE1 and α-satellite. Our results suggest that DIM suppresses AR-dependent gene transcription through epigenetic modulation, leading to DNA damage and genome instability in PCa cells. Published by Elsevier Inc.

  14. A team of heterochromatin factors collaborates with small RNA pathways to combat repetitive elements and germline stress

    PubMed Central

    McMurchy, Alicia N; Stempor, Przemyslaw; Gaarenstroom, Tessa; Wysolmerski, Brian; Dong, Yan; Aussianikava, Darya; Appert, Alex; Huang, Ni; Kolasinska-Zwierz, Paulina; Sapetschnig, Alexandra; Miska, Eric A; Ahringer, Julie

    2017-01-01

    Repetitive sequences derived from transposons make up a large fraction of eukaryotic genomes and must be silenced to protect genome integrity. Repetitive elements are often found in heterochromatin; however, the roles and interactions of heterochromatin proteins in repeat regulation are poorly understood. Here we show that a diverse set of C. elegans heterochromatin proteins act together with the piRNA and nuclear RNAi pathways to silence repetitive elements and prevent genotoxic stress in the germ line. Mutants in genes encoding HPL-2/HP1, LIN-13, LIN-61, LET-418/Mi-2, and H3K9me2 histone methyltransferase MET-2/SETDB1 also show functionally redundant sterility, increased germline apoptosis, DNA repair defects, and interactions with small RNA pathways. Remarkably, fertility of heterochromatin mutants could be partially restored by inhibiting cep-1/p53, endogenous meiotic double strand breaks, or the expression of MIRAGE1 DNA transposons. Functional redundancy among factors and pathways underlies the importance of safeguarding the genome through multiple means. DOI: http://dx.doi.org/10.7554/eLife.21666.001 PMID:28294943

  15. Twisting Right to Left: A…A Mismatch in a CAG Trinucleotide Repeat Overexpansion Provokes Left-Handed Z-DNA Conformation

    PubMed Central

    2015-01-01

    Conformational polymorphism of DNA is a major causative factor behind several incurable trinucleotide repeat expansion disorders that arise from overexpansion of trinucleotide repeats located in coding/non-coding regions of specific genes. Hairpin DNA structures that are formed due to overexpansion of CAG repeat lead to Huntington’s disorder and spinocerebellar ataxias. Nonetheless, DNA hairpin stem structure that generally embraces B-form with canonical base pairs is poorly understood in the context of periodic noncanonical A…A mismatch as found in CAG repeat overexpansion. Molecular dynamics simulations on DNA hairpin stems containing A…A mismatches in a CAG repeat overexpansion show that A…A dictates local Z-form irrespective of starting glycosyl conformation, in sharp contrast to canonical DNA duplex. Transition from B-to-Z is due to the mechanistic effect that originates from its pronounced nonisostericity with flanking canonical base pairs facilitated by base extrusion, backbone and/or base flipping. Based on these structural insights we envisage that such an unusual DNA structure of the CAG hairpin stem may have a role in disease pathogenesis. As this is the first study that delineates the influence of a single A…A mismatch in reversing DNA helicity, it would further have an impact on understanding DNA mismatch repair. PMID:25876062

  16. DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats

    PubMed Central

    de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

    2015-01-01

    Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. PMID:26481363

  17. Spectroscopic insights into quadruplexes of five-repeat telomere DNA sequences upon G-block damage.

    PubMed

    Dvořáková, Zuzana; Vorlíčková, Michaela; Renčiuk, Daniel

    2017-11-01

    The DNA lesions, resulting from oxidative damage, were shown to destabilize human telomere four-repeat quadruplex and to alter its structure. Long telomere DNA, as a repetitive sequence, offers, however, other mechanisms of dealing with the lesion: extrusion of the damaged repeat into loop or shifting the quadruplex position by one repeat. Using circular dichroism and UV absorption spectroscopy and polyacrylamide electrophoresis, we studied consequences of lesions at different positions of the model five-repeat human telomere DNA sequences on the structure and stability of their quadruplexes in sodium and in potassium. The repeats affected by lesion are preferentially positioned as terminal overhangs of the core quadruplex structurally similar to the four-repeat one. Forced affecting of the inner repeats leads to presence of variety of more parallel folds in potassium. In sodium the designed models form mixture of two dominant antiparallel quadruplexes whose population varies with the position of the affected repeat. The shapes of quadruplex CD spectra, namely the height of dominant peaks, significantly correlate with melting temperatures. Lesion in one guanine tract of a more than four repeats long human telomere DNA sequence may cause re-positioning of its quadruplex arrangement associated with a shift of the structure to less common quadruplex conformations. The type of the quadruplex depends on the loop position and external conditions. The telomere DNA quadruplexes are quite resistant to the effect of point mutations due to the telomere DNA repetitive nature, although their structure and, consequently, function might be altered. Copyright © 2017. Published by Elsevier B.V.

  18. DNA Replication Dynamics of the GGGGCC Repeat of the C9orf72 Gene.

    PubMed

    Thys, Ryan Griffin; Wang, Yuh-Hwa

    2015-11-27

    DNA has the ability to form a variety of secondary structures in addition to the normal B-form DNA, including hairpins and quadruplexes. These structures are implicated in a number of neurological diseases and cancer. Expansion of a GGGGCC repeat located at C9orf72 is associated with familial amyotrophic lateral sclerosis and frontotemporal dementia. This repeat expands from two to 24 copies in normal individuals to several hundreds or thousands of repeats in individuals with the disease. Biochemical studies have demonstrated that as little as four repeats have the ability to form a stable DNA secondary structure known as a G-quadruplex. Quadruplex structures have the ability to disrupt normal DNA processes such as DNA replication and transcription. Here we examine the role of GGGGCC repeat length and orientation on DNA replication using an SV40 replication system in human cells. Replication through GGGGCC repeats leads to a decrease in overall replication efficiency and an increase in instability in a length-dependent manner. Both repeat expansions and contractions are observed, and replication orientation is found to influence the propensity for expansions or contractions. The presence of replication stress, such as low-dose aphidicolin, diminishes replication efficiency but has no effect on instability. Two-dimensional gel electrophoresis analysis demonstrates a replication stall with as few as 20 GGGGCC repeats. These results suggest that replication of the GGGGCC repeat at C9orf72 is perturbed by the presence of expanded repeats, which has the potential to result in further expansion, leading to disease. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  19. Fragile X-related element 2 methylation analysis may provide a suitable option for inclusion of fragile X syndrome and/or sex chromosome aneuploidy into newborn screening: a technical validation study.

    PubMed

    Inaba, Yoshimi; Herlihy, Amy S; Schwartz, Charles E; Skinner, Cindy; Bui, Quang M; Cobb, Joanna; Shi, Elva Z; Francis, David; Arvaj, Alison; Amor, David J; Pope, Kate; Wotton, Tiffany; Cohen, Jonathan; Hewitt, Jacqueline K; Hagerman, Randi J; Metcalfe, Sylvia A; Hopper, John L; Loesch, Danuta Z; Slater, Howard R; Godler, David E

    2013-04-01

    We show that a novel fragile X-related epigenetic element 2 FMR1 methylation test can be used along with a test for sex-determining region Y (SRY) to provide the option of combined fragile X syndrome and sex chromosome aneuploidy newborn screening. Fragile X-related epigenetic element 2, SRY, and FMR1 CGG repeat analyses were performed on blood and saliva DNA, and in adult and newborn blood spots. The cohort consisted of 159 controls (CGG <40), 187 premutation (CGG 56-170), and 242 full-mutation (CGG ~200-2,000) males and females, 106 sex chromosome aneuploidy individuals, and 151 cytogenetically normal controls. At the 0.435 threshold, fragile X-related epigenetic element 2 analysis in males was robust on both blood DNA and newborn blood spots, with specificity and sensitivity of ~100% for full-mutation genotype. In females, the specificity was 99%, whereas half of full-mutation females were above the 0.435 threshold in both blood DNA and newborn blood spots. Furthermore, at this threshold, the test could not differentiate individuals with Klinefelter syndrome from female controls without using the SRY marker. When combined with SRY analysis, the test was consistent with most results for sex chromosome aneuploidies from karyotyping. Setting specific thresholds for fragile X-related epigenetic element 2 analysis and including the SRY marker provides the option to either include or exclude detection of sex chromosome aneuploidies as part of fragile X syndrome newborn screening.

  20. Comparative sequence analysis of the X-inactivation center region in mouse, human, and bovine.

    PubMed

    Chureau, Corinne; Prissette, Marine; Bourdet, Agnès; Barbe, Valérie; Cattolico, Laurence; Jones, Louis; Eggen, André; Avner, Philip; Duret, Laurent

    2002-06-01

    We have sequenced to high levels of accuracy 714-kb and 233-kb regions of the mouse and bovine X-inactivation centers (Xic), respectively, centered on the Xist gene. This has provided the basis for a fully annotated comparative analysis of the mouse Xic with the 2.3-Mb orthologous region in human and has allowed a three-way species comparison of the core central region, including the Xist gene. These comparisons have revealed conserved genes, both coding and noncoding, conserved CpG islands and, more surprisingly, conserved pseudogenes. The distribution of repeated elements, especially LINE repeats, in the mouse Xic region when compared to the rest of the genome does not support the hypothesis of a role for these repeat elements in the spreading of X inactivation. Interestingly, an asymmetric distribution of LINE elements on the two DNA strands was observed in the three species, not only within introns but also in intergenic regions. This feature is suggestive of important transcriptional activity within these intergenic regions. In silico prediction followed by experimental analysis has allowed four new genes, Cnbp2, Ftx, Jpx, and Ppnx, to be identified and novel, widespread, complex, and apparently noncoding transcriptional activity to be characterized in a region 5' of Xist that was recently shown to attract histone modification early after the onset of X inactivation.

  1. Methods for sequencing GC-rich and CCT repeat DNA templates

    DOEpatents

    Robinson, Donna L.

    2007-02-20

    The present invention is directed to a PCR-based method of cycle sequencing DNA and other polynucleotide sequences having high CG content and regions of high GC content, and includes for example DNA strands with a high Cytosine and/or Guanosine content and repeated motifs such as CCT repeats.

  2. Crystal structure of clustered regularly interspaced short palindromic repeats (CRISPR)-associated Csn2 protein revealed Ca2+-dependent double-stranded DNA binding activity.

    PubMed

    Nam, Ki Hyun; Kurinov, Igor; Ke, Ailong

    2011-09-02

    Clustered regularly interspaced short palindromic repeats (CRISPR) and their associated protein genes (cas genes) are widespread in bacteria and archaea. They form a line of RNA-based immunity to eradicate invading bacteriophages and malicious plasmids. A key molecular event during this process is the acquisition of new spacers into the CRISPR loci to guide the selective degradation of the matching foreign genetic elements. Csn2 is a Nmeni subtype-specific cas gene required for new spacer acquisition. Here we characterize the Enterococcus faecalis Csn2 protein as a double-stranded (ds-) DNA-binding protein and report its 2.7 Å tetrameric ring structure. The inner circle of the Csn2 tetrameric ring is ∼26 Å wide and populated with conserved lysine residues poised for nonspecific interactions with ds-DNA. Each Csn2 protomer contains an α/β domain and an α-helical domain; significant hinge motion was observed between these two domains. Ca(2+) was located at strategic positions in the oligomerization interface. We further showed that removal of Ca(2+) ions altered the oligomerization state of Csn2, which in turn severely decreased its affinity for ds-DNA. In summary, our results provided the first insight into the function of the Csn2 protein in CRISPR adaptation by revealing that it is a ds-DNA-binding protein functioning at the quaternary structure level and regulated by Ca(2+) ions.

  3. Crystal Structure of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-associated Csn2 Protein Revealed Ca[superscript 2+]-dependent Double-stranded DNA Binding Activity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nam, Ki Hyun; Kurinov, Igor; Ke, Ailong

    Clustered regularly interspaced short palindromic repeats (CRISPR) and their associated protein genes (cas genes) are widespread in bacteria and archaea. They form a line of RNA-based immunity to eradicate invading bacteriophages and malicious plasmids. A key molecular event during this process is the acquisition of new spacers into the CRISPR loci to guide the selective degradation of the matching foreign genetic elements. Csn2 is a Nmeni subtype-specific cas gene required for new spacer acquisition. Here we characterize the Enterococcus faecalis Csn2 protein as a double-stranded (ds-) DNA-binding protein and report its 2.7 {angstrom} tetrameric ring structure. The inner circle ofmore » the Csn2 tetrameric ring is {approx}26 {angstrom} wide and populated with conserved lysine residues poised for nonspecific interactions with ds-DNA. Each Csn2 protomer contains an {alpha}/{beta} domain and an {alpha}-helical domain; significant hinge motion was observed between these two domains. Ca{sup 2+} was located at strategic positions in the oligomerization interface. We further showed that removal of Ca{sup 2+} ions altered the oligomerization state of Csn2, which in turn severely decreased its affinity for ds-DNA. In summary, our results provided the first insight into the function of the Csn2 protein in CRISPR adaptation by revealing that it is a ds-DNA-binding protein functioning at the quaternary structure level and regulated by Ca{sup 2+} ions.« less

  4. DNA Data Visualization (DDV): Software for Generating Web-Based Interfaces Supporting Navigation and Analysis of DNA Sequence Data of Entire Genomes.

    PubMed

    Neugebauer, Tomasz; Bordeleau, Eric; Burrus, Vincent; Brzezinski, Ryszard

    2015-01-01

    Data visualization methods are necessary during the exploration and analysis activities of an increasingly data-intensive scientific process. There are few existing visualization methods for raw nucleotide sequences of a whole genome or chromosome. Software for data visualization should allow the researchers to create accessible data visualization interfaces that can be exported and shared with others on the web. Herein, novel software developed for generating DNA data visualization interfaces is described. The software converts DNA data sets into images that are further processed as multi-scale images to be accessed through a web-based interface that supports zooming, panning and sequence fragment selection. Nucleotide composition frequencies and GC skew of a selected sequence segment can be obtained through the interface. The software was used to generate DNA data visualization of human and bacterial chromosomes. Examples of visually detectable features such as short and long direct repeats, long terminal repeats, mobile genetic elements, heterochromatic segments in microbial and human chromosomes, are presented. The software and its source code are available for download and further development. The visualization interfaces generated with the software allow for the immediate identification and observation of several types of sequence patterns in genomes of various sizes and origins. The visualization interfaces generated with the software are readily accessible through a web browser. This software is a useful research and teaching tool for genetics and structural genomics.

  5. The Foldback-like element Galileo belongs to the P superfamily of DNA transposons and is widespread within the Drosophila genus.

    PubMed

    Marzo, Mar; Puig, Marta; Ruiz, Alfredo

    2008-02-26

    Galileo is the only transposable element (TE) known to have generated natural chromosomal inversions in the genus Drosophila. It was discovered in Drosophila buzzatii and classified as a Foldback-like element because of its long, internally repetitive, terminal inverted repeats (TIRs) and lack of coding capacity. Here, we characterized a seemingly complete copy of Galileo from the D. buzzatii genome. It is 5,406 bp long, possesses 1,229-bp TIRs, and encodes a 912-aa transposase similar to those of the Drosophila melanogaster 1360 (Hoppel) and P elements. We also searched the recently available genome sequences of 12 Drosophila species for elements similar to Dbuz\\Galileo by using bioinformatic tools. Galileo was found in six species (ananassae, willistoni, peudoobscura, persimilis, virilis, and mojavensis) from the two main lineages within the Drosophila genus. Our observations place Galileo within the P superfamily of cut-and-paste transposons and extend considerably its phylogenetic distribution. The interspecific distribution of Galileo indicates an ancient presence in the genus, but the phylogenetic tree built with the transposase amino acid sequences contrasts significantly with that of the species, indicating lineage sorting and/or horizontal transfer events. Our results also suggest that Foldback-like elements such as Galileo may evolve from DNA-based transposon ancestors by loss of the transposase gene and disproportionate elongation of TIRs.

  6. The Foldback-like element Galileo belongs to the P superfamily of DNA transposons and is widespread within the Drosophila genus

    PubMed Central

    Marzo, Mar; Puig, Marta; Ruiz, Alfredo

    2008-01-01

    Galileo is the only transposable element (TE) known to have generated natural chromosomal inversions in the genus Drosophila. It was discovered in Drosophila buzzatii and classified as a Foldback-like element because of its long, internally repetitive, terminal inverted repeats (TIRs) and lack of coding capacity. Here, we characterized a seemingly complete copy of Galileo from the D. buzzatii genome. It is 5,406 bp long, possesses 1,229-bp TIRs, and encodes a 912-aa transposase similar to those of the Drosophila melanogaster 1360 (Hoppel) and P elements. We also searched the recently available genome sequences of 12 Drosophila species for elements similar to Dbuz\\Galileo by using bioinformatic tools. Galileo was found in six species (ananassae, willistoni, peudoobscura, persimilis, virilis, and mojavensis) from the two main lineages within the Drosophila genus. Our observations place Galileo within the P superfamily of cut-and-paste transposons and extend considerably its phylogenetic distribution. The interspecific distribution of Galileo indicates an ancient presence in the genus, but the phylogenetic tree built with the transposase amino acid sequences contrasts significantly with that of the species, indicating lineage sorting and/or horizontal transfer events. Our results also suggest that Foldback-like elements such as Galileo may evolve from DNA-based transposon ancestors by loss of the transposase gene and disproportionate elongation of TIRs. PMID:18287066

  7. First Staphylococcal Cassette Chromosome mec Containing a mecB-Carrying Gene Complex Independent of Transposon Tn6045 in a Macrococcus caseolyticus Isolate from a Canine Infection

    PubMed Central

    Gómez-Sanz, Elena; Schwendener, Sybille; Thomann, Andreas; Gobeli Brawand, Stefanie

    2015-01-01

    A methicillin-resistant mecB-positive Macrococcus caseolyticus (strain KM45013) was isolated from the nares of a dog with rhinitis. It contained a novel 39-kb transposon-defective complete mecB-carrying staphylococcal cassette chromosome mec element (SCCmecKM45013). SCCmecKM45013 contained 49 coding sequences (CDSs), was integrated at the 3′ end of the chromosomal orfX gene, and was delimited at both ends by imperfect direct repeats functioning as integration site sequences (ISSs). SCCmecKM45013 presented two discontinuous regions of homology (SCCmec coverage of 35%) to the chromosomal and transposon Tn6045-associated SCCmec-like element of M. caseolyticus JCSC7096: (i) the mec gene complex (98.8% identity) and (ii) the ccr-carrying segment (91.8% identity). The mec gene complex, located at the right junction of the cassette, also carried the β-lactamase gene blaZm (mecRm-mecIm-mecB-blaZm). SCCmecKM45013 contained two cassette chromosome recombinase genes, ccrAm2 and ccrBm2, which shared 94.3% and 96.6% DNA identity with those of the SCCmec-like element of JCSC7096 but shared less than 52% DNA identity with the staphylococcal ccrAB and ccrC genes. Three distinct extrachromosomal circularized elements (the entire SCCmecKM45013, ΨSCCmecKM45013 lacking the ccr genes, and SCCKM45013 lacking mecB) flanked by one ISS copy, as well as the chromosomal regions remaining after excision, were detected. An unconventional circularized structure carrying the mecB gene complex was associated with two extensive direct repeat regions, which enclosed two open reading frames (ORFs) (ORF46 and ORF51) flanking the chromosomal mecB-carrying gene complex. This study revealed M. caseolyticus as a potential disease-associated bacterium in dogs and also unveiled an SCCmec element carrying mecB not associated with Tn6045 in the genus Macrococcus. PMID:25987634

  8. Repetitive element transcripts are elevated in the brain of C9orf72 ALS/FTLD patients.

    PubMed

    Prudencio, Mercedes; Gonzales, Patrick K; Cook, Casey N; Gendron, Tania F; Daughrity, Lillian M; Song, Yuping; Ebbert, Mark T W; van Blitterswijk, Marka; Zhang, Yong-Jie; Jansen-West, Karen; Baker, Matthew C; DeTure, Michael; Rademakers, Rosa; Boylan, Kevin B; Dickson, Dennis W; Petrucelli, Leonard; Link, Christopher D

    2017-09-01

    Significant transcriptome alterations are detected in the brain of patients with amyotrophic lateral sclerosis (ALS), including carriers of the C9orf72 repeat expansion and C9orf72-negative sporadic cases. Recently, the expression of repetitive element transcripts has been associated with toxicity and, while increased repetitive element expression has been observed in several neurodegenerative diseases, little is known about their contribution to ALS. To assess whether aberrant expression of repetitive element sequences are observed in ALS, we analysed RNA sequencing data from C9orf72-positive and sporadic ALS cases, as well as healthy controls. Transcripts from multiple classes and subclasses of repetitive elements (LINEs, endogenous retroviruses, DNA transposons, simple repeats, etc.) were significantly increased in the frontal cortex of C9orf72 ALS patients. A large collection of patient samples, representing both C9orf72 positive and negative ALS, ALS/FTLD, and FTLD cases, was used to validate the levels of several repetitive element transcripts. These analyses confirmed that repetitive element expression was significantly increased in C9orf72-positive compared to C9orf72-negative or control cases. While previous studies suggest an important link between TDP-43 and repetitive element biology, our data indicate that TDP-43 pathology alone is insufficient to account for the observed changes in repetitive elements in ALS/FTLD. Instead, we found that repetitive element expression positively correlated with RNA polymerase II activity in postmortem brain, and pharmacologic modulation of RNA polymerase II activity altered repetitive element expression in vitro. We conclude that increased RNA polymerase II activity in ALS/FTLD may lead to increased repetitive element transcript expression, a novel pathological feature of ALS/FTLD. © The Author 2017. Published by Oxford University Press.

  9. Comparative whole genome DNA methylation profiling of cattle sperm and somatic tissues reveals striking hypomethylated patterns in sperm

    PubMed Central

    Zhou, Yang; Connor, Erin E; Bickhart, Derek M; Li, Congjun; Baldwin, Ransom L; Schroeder, Steven G; Rosen, Benjamin D; Yang, Liguo; Van Tassell, Curtis P

    2018-01-01

    Abstract Background Although sperm DNA methylation has been studied in humans and other species, its status in cattle is largely unknown. Results Using whole-genome bisulfite sequencing (WGBS), we profiled the DNA methylome of cattle sperm through comparison with three somatic tissues (mammary gland, brain, and blood). Large differences between cattle sperm and somatic cells were observed in the methylation patterns of global CpGs, pericentromeric satellites, partially methylated domains (PMDs), hypomethylated regions (HMRs), and common repeats. As expected, we observed low methylation in the promoter regions and high methylation in the bodies of active genes. We detected selective hypomethylation of megabase domains of centromeric satellite clusters, which may be related to chromosome segregation during meiosis and their rapid transcriptional activation upon fertilization. We found more PMDs in sperm cells than in somatic cells and identified meiosis-related genes such asKIF2B and REPIN1, which are hypomethylated in sperm but hypermethylated in somatic cells. In addition to the common HMRs around gene promoters, which showed substantial differences between sperm and somatic cells, the sperm-specific HMRs also targeted to distinct spermatogenesis-related genes, including BOLL, MAEL, ASZ1, SYCP3, CTCFL, MND1, SPATA22, PLD6, DDX4, RBBP8, FKBP6, and SYCE1. Although common repeats were heavily methylated in both sperm and somatic cells, some young Bov-A2 repeats, which belong to the SINE family, were hypomethylated in sperm and could affect the promoter structures by introducing new regulatory elements. Conclusions Our study provides a comprehensive resource for bovine sperm epigenomic research and enables new discoveries about DNA methylation and its role in male fertility. PMID:29635292

  10. Comparative whole genome DNA methylation profiling of cattle sperm and somatic tissues reveals striking hypomethylated patterns in sperm.

    PubMed

    Zhou, Yang; Connor, Erin E; Bickhart, Derek M; Li, Congjun; Baldwin, Ransom L; Schroeder, Steven G; Rosen, Benjamin D; Yang, Liguo; Van Tassell, Curtis P; Liu, George E

    2018-05-01

    Although sperm DNA methylation has been studied in humans and other species, its status in cattle is largely unknown. Using whole-genome bisulfite sequencing (WGBS), we profiled the DNA methylome of cattle sperm through comparison with three somatic tissues (mammary gland, brain, and blood). Large differences between cattle sperm and somatic cells were observed in the methylation patterns of global CpGs, pericentromeric satellites, partially methylated domains (PMDs), hypomethylated regions (HMRs), and common repeats. As expected, we observed low methylation in the promoter regions and high methylation in the bodies of active genes. We detected selective hypomethylation of megabase domains of centromeric satellite clusters, which may be related to chromosome segregation during meiosis and their rapid transcriptional activation upon fertilization. We found more PMDs in sperm cells than in somatic cells and identified meiosis-related genes such asKIF2B and REPIN1, which are hypomethylated in sperm but hypermethylated in somatic cells. In addition to the common HMRs around gene promoters, which showed substantial differences between sperm and somatic cells, the sperm-specific HMRs also targeted to distinct spermatogenesis-related genes, including BOLL, MAEL, ASZ1, SYCP3, CTCFL, MND1, SPATA22, PLD6, DDX4, RBBP8, FKBP6, and SYCE1. Although common repeats were heavily methylated in both sperm and somatic cells, some young Bov-A2 repeats, which belong to the SINE family, were hypomethylated in sperm and could affect the promoter structures by introducing new regulatory elements. Our study provides a comprehensive resource for bovine sperm epigenomic research and enables new discoveries about DNA methylation and its role in male fertility.

  11. Transposable elements in fish chromosomes: a study in the marine cobia species.

    PubMed

    Costa, G W W F; Cioffi, M B; Bertollo, L A C; Molina, W F

    2013-01-01

    Rachycentron canadum, a unique representative of the Rachycentridae family, has been the subject of considerable biotechnological interest due to its potential use in marine fish farming. This species has undergone extensive research concerning the location of genes and multigene families on its chromosomes. Although most of the genome of some organisms is composed of repeated DNA sequences, aspects of the origin and dispersion of these elements are still largely unknown. The physical mapping of repetitive sequences on the chromosomes of R. canadum proved to be relevant for evolutionary and applied purposes. Therefore, here, we present the mapping by fluorescence in situ hybridization of the transposable element (TE) Tol2, the non-LTR retrotransposons Rex1 and Rex3, together with the 18S and 5S rRNA genes in the chromosome of this species. The Tol2 TE, belonging to the family of hAT transposons, is homogeneously distributed in the euchromatic regions of the chromosomes but with huge colocalization with the 18S rDNA sites. The hybridization signals for Rex1 and Rex3 revealed a semi-arbitrary distribution pattern, presenting differentiated dispersion in euchromatic and heterochromatic regions. Rex1 elements are associated preferentially in heterochromatic regions, while Rex3 shows a scarce distribution in the euchromatic regions of the chromosomes. The colocalization of TEs with 18S and 5S rDNA revealed complex chromosomal regions of repetitive sequences. In addition, the nonpreferential distribution of Rex1 and Rex3 in all heterochromatic regions, as well as the preferential distribution of the Tol2 transposon associated with 18S rDNA sequences, reveals a distinct pattern of organization of TEs in the genome of this species. A heterogeneous chromosomal colonization of TEs may confer different evolutionary rates to the heterochromatic regions of this species.

  12. Structural rearrangements in the mitochondrial genome of Drosophila melanogaster induced by elevated levels of the replicative DNA helicase

    PubMed Central

    Ciesielski, Grzegorz L; Nadalutti, Cristina A; Oliveira, Marcos T; Griffith, Jack D; Kaguni, Laurie S

    2018-01-01

    Abstract Pathological conditions impairing functions of mitochondria often lead to compensatory upregulation of the mitochondrial DNA (mtDNA) replisome machinery, and the replicative DNA helicase appears to be a key factor in regulating mtDNA copy number. Moreover, mtDNA helicase mutations have been associated with structural rearrangements of the mitochondrial genome. To evaluate the effects of elevated levels of the mtDNA helicase on the integrity and replication of the mitochondrial genome, we overexpressed the helicase in Drosophila melanogaster Schneider cells and analyzed the mtDNA by two-dimensional neutral agarose gel electrophoresis and electron microscopy. We found that elevation of mtDNA helicase levels increases the quantity of replication intermediates and alleviates pausing at the replication slow zones. Though we did not observe a concomitant alteration in mtDNA copy number, we observed deletions specific to the segment of repeated elements in the immediate vicinity of the origin of replication, and an accumulation of species characteristic of replication fork stalling. We also found elevated levels of RNA that are retained in the replication intermediates. Together, our results suggest that upregulation of mtDNA helicase promotes the process of mtDNA replication but also results in genome destabilization. PMID:29432582

  13. Molecular architecture of classical cytological landmarks: Centromeres and telomeres

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Meyne, J.

    1994-11-01

    Both the human telomere repeat and the pericentromeric repeat sequence (GGAAT)n were isolated based on evolutionary conservation. Their isolation was based on the premise that chromosomal features as structurally and functionally important as telomeres and centromeres should be highly conserved. Both sequences were isolated by high stringency screening of a human repetitive DNA library with rodent repetitive DNA. The pHuR library (plasmid Human Repeat) used for this project was enriched for repetitive DNA by using a modification of the standard DNA library preparation method. Usually DNA for a library is cut with restriction enzymes, packaged, infected, and the library ismore » screened. A problem with this approach is that many tandem repeats don`t have any (or many) common restriction sites. Therefore, many of the repeat sequences will not be represented in the library because they are not restricted to a viable length for the vector used. To prepare the pHuR library, human DNA was mechanically sheared to a small size. These relatively short DNA fragments were denatured and then renatured to C{sub o}t 50. Theoretically only repetitive DNA sequences should renature under C{sub o}t 50 conditions. The single-stranded regions were digested using S1 nuclease, leaving the double-stranded, renatured repeat sequences.« less

  14. Analysis and Dynamics of the Chromosomal Complements of Wild Sparkling-Wine Yeast Strains

    PubMed Central

    Nadal, Dolors; Carro, David; Fernández-Larrea, Juan; Piña, Benjamin

    1999-01-01

    We isolated Saccharomyces cerevisiae yeast strains that are able to carry out the second fermentation of sparkling wine from spontaneously fermenting musts in El Penedès (Spain) by specifically designed selection protocols. All of them (26 strains) showed one of two very similar mitochondrial DNA (mtDNA) restriction patterns, whereas their karyotypes differed. These strains showed high rates of karyotype instability, which were dependent on both the medium and the strain, during vegetative growth. In all cases, the mtDNA restriction pattern was conserved in strains kept under the same conditions. Analysis of different repetitive sequences in their genomes suggested that ribosomal DNA repeats play an important role in the changes in size observed in chromosome XII, whereas SUC genes or Ty elements did not show amplification or transposition processes that could be related to rearrangements of the chromosomes showing these sequences. Karyotype changes also occurred in monosporidic diploid derivatives. We propose that these changes originated mainly from ectopic recombination between repeated sequences interspersed in the genome. None of the rearranged karyotypes provided a selective advantage strong enough to allow the strains to displace the parental strains. The nature and frequency of these changes suggest that they may play an important role in the establishment and maintenance of the genetic diversity observed in S. cerevisiae wild populations. PMID:10103269

  15. Target sites for the transposition of rat long interspersed repeated DNA elements (LINEs) are not random.

    PubMed Central

    Furano, A V; Somerville, C C; Tsichlis, P N; D'Ambrosio, E

    1986-01-01

    The long interspersed repeated DNA family of rats (LINE or L1Rn family) contains about 40,000 6.7-kilobase (kb) long members (1). LINE members may be currently mobile since their presence or absence causes allelic variation at three single copy loci (2, 3): insulin 1, Moloney leukemia virus integration 2 (Mlvi-2) (4), and immunoglobulin heavy chain (Igh). To characterize target sites for LINE insertion, we compared the DNA sequences of the unoccupied Mlvi-2 target site, its LINE-containing allele, and several other LINE-containing sites. Although not homologous overall, the target sites share three characteristics: First, depending on the site, they are from 68% to 86% (A+T) compared to 58% (A+T) for total rat DNA (5). Depending on the site, a 7- to 15-bp target site sequence becomes duplicated and flanks the inserted LINE member. The second is a version (0 or 1 mismatch) of the hexanucleotide, TACTCA, which is also present in the LINE member, in a highly conserved region located just before the A-rich right end of the LINE member. The third is a stretch of alternating purine/pyrimidine (PQ). The A-rich right ends of different LINE members vary in length and composition, and the sequence of a particularly long one suggests that it contains the A-rich target site from a previous transposition. PMID:3012480

  16. Genomic Repeat Abundances Contain Phylogenetic Signal

    PubMed Central

    Dodsworth, Steven; Chase, Mark W.; Kelly, Laura J.; Leitch, Ilia J.; Macas, Jiří; Novák, Petr; Piednoël, Mathieu; Weiss-Schneeweiss, Hanna; Leitch, Andrew R.

    2015-01-01

    A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution. PMID:25261464

  17. Instability of expanded CAG/CAA repeats in spinocerebellar ataxia type 17.

    PubMed

    Gao, Rui; Matsuura, Tohru; Coolbaugh, Mary; Zühlke, Christine; Nakamura, Koichiro; Rasmussen, Astrid; Siciliano, Michael J; Ashizawa, Tetsuo; Lin, Xi

    2008-02-01

    Trinucleotide repeat expansions are dynamic mutations causing many neurological disorders, and their instability is influenced by multiple factors. Repeat configuration seems particularly important, and pure repeats are thought to be more unstable than interrupted repeats. But direct evidence is still lacking. Here, we presented strong support for this hypothesis from our studies on spinocerebellar ataxia type 17 (SCA17). SCA17 is a typical polyglutamine disease caused by CAG repeat expansion in TBP (TATA binding protein), and is unique in that the pure expanded polyglutamine tract is coded by either a simple configuration with long stretches of pure CAGs or a complex configuration containing CAA interruptions. By small pool PCR (SP-PCR) analysis of blood DNA from SCA17 patients of distinct racial backgrounds, we quantitatively assessed the instability of these two types of expanded alleles coding similar length of polyglutamine expansion. Mutation frequency in patients harboring pure CAG repeats is 2-3 folds of those with CAA interruptions. Interestingly, the pure CAG repeats showed both expansion and deletion while the interrupted repeats exhibited mostly deletion at a significantly lower frequency. These data strongly suggest that repeat configuration is a critical determinant for instability, and CAA interruptions might serve as a limiting element for further expansion of CAG repeats in SCA17 locus, suggesting a molecular basis for lack of anticipation in SCA17 families with interrupted CAG expansion.

  18. Determining the Specificity of Cascade Binding, Interference, and Primed Adaptation In Vivo in the Escherichia coli Type I-E CRISPR-Cas System.

    PubMed

    Cooper, Lauren A; Stringer, Anne M; Wade, Joseph T

    2018-04-17

    In clustered regularly interspaced short palindromic repeat (CRISPR)-Cas (CRISPR-associated) immunity systems, short CRISPR RNAs (crRNAs) are bound by Cas proteins, and these complexes target invading nucleic acid molecules for degradation in a process known as interference. In type I CRISPR-Cas systems, the Cas protein complex that binds DNA is known as Cascade. Association of Cascade with target DNA can also lead to acquisition of new immunity elements in a process known as primed adaptation. Here, we assess the specificity determinants for Cascade-DNA interaction, interference, and primed adaptation in vivo , for the type I-E system of Escherichia coli Remarkably, as few as 5 bp of crRNA-DNA are sufficient for association of Cascade with a DNA target. Consequently, a single crRNA promotes Cascade association with numerous off-target sites, and the endogenous E. coli crRNAs direct Cascade binding to >100 chromosomal sites. In contrast to the low specificity of Cascade-DNA interactions, >18 bp are required for both interference and primed adaptation. Hence, Cascade binding to suboptimal, off-target sites is inert. Our data support a model in which the initial Cascade association with DNA targets requires only limited sequence complementarity at the crRNA 5' end whereas recruitment and/or activation of the Cas3 nuclease, a prerequisite for interference and primed adaptation, requires extensive base pairing. IMPORTANCE Many bacterial and archaeal species encode CRISPR-Cas immunity systems that protect against invasion by foreign DNA. In the Escherichia coli CRISPR-Cas system, a protein complex, Cascade, binds 61-nucleotide (nt) CRISPR RNAs (crRNAs). The Cascade complex is directed to invading DNA molecules through base pairing between the crRNA and target DNA. This leads to recruitment of the Cas3 nuclease, which destroys the invading DNA molecule and promotes acquisition of new immunity elements. We made the first in vivo measurements of Cascade binding to DNA targets. Thus, we show that Cascade binding to DNA is highly promiscuous; endogenous E. coli crRNAs can direct Cascade binding to >100 chromosomal locations. In contrast, we show that targeted degradation and acquisition of new immunity elements require highly specific association of Cascade with DNA, limiting CRISPR-Cas function to the appropriate targets. Copyright © 2018 Cooper et al.

  19. DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

    PubMed

    de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

    2015-11-16

    Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. 1.688 g/cm(3) satellite-related repeats: a missing link to dosage compensation and speciation.

    PubMed

    Gallach, Miguel

    2015-09-01

    Despite the important progress that has been made on dosage compensation (DC), a critical link in our understanding of the X chromosome recognition mechanisms is still missing. Recent studies in Drosophila indicate that the missing link could be a family of DNA repeats populating the euchromatin of the X chromosome. In this opinion article, I discuss how these findings add a new fresh twist on the DC problem. In the following sections, I first summarize our understanding of DC in Drosophila and integrate these recent discoveries into our knowledge of the X chromosome recognition problem. Next, I introduce a model according to which, 1.688 g/cm(3) satellite-related (SR) repeats would be the primary recognition elements for the dosage compensation complex. Contrary to the current belief, I suggest that the DC system in Drosophila is not conserved and static, but it is continuously co-evolving with the target SR repeats. The potential role of the SR repeats in hybrid incompatibilities and speciation is also discussed. © 2015 John Wiley & Sons Ltd.

  1. Microbial Degradation of Forensic Samples of Biological Origin: Potential Threat to Human DNA Typing.

    PubMed

    Dash, Hirak Ranjan; Das, Surajit

    2018-02-01

    Forensic biology is a sub-discipline of biological science with an amalgam of other branches of science used in the criminal justice system. Any nucleated cell/tissue harbouring DNA, either live or dead, can be used as forensic exhibits, a source of investigation through DNA typing. These biological materials of human origin are rich source of proteins, carbohydrates, lipids, trace elements as well as water and, thus, provide a virtuous milieu for the growth of microbes. The obstinate microbial growth augments the degradation process and is amplified with the passage of time and improper storage of the biological materials. Degradation of these biological materials carriages a huge challenge in the downstream processes of forensic DNA typing technique, such as short tandem repeats (STR) DNA typing. Microbial degradation yields improper or no PCR amplification, heterozygous peak imbalance, DNA contamination from non-human sources, degradation of DNA by microbial by-products, etc. Consequently, the most precise STR DNA typing technique is nullified and definite opinion can be hardly given with degraded forensic exhibits. Thus, suitable precautionary measures should be taken for proper storage and processing of the biological exhibits to minimize their decaying process by micro-organisms.

  2. Organization and evolution of highly repeated satellite DNA sequences in plant chromosomes.

    PubMed

    Sharma, S; Raina, S N

    2005-01-01

    A major component of the plant nuclear genome is constituted by different classes of repetitive DNA sequences. The structural, functional and evolutionary aspects of the satellite repetitive DNA families, and their organization in the chromosomes is reviewed. The tandem satellite DNA sequences exhibit characteristic chromosomal locations, usually at subtelomeric and centromeric regions. The repetitive DNA family(ies) may be widely distributed in a taxonomic family or a genus, or may be specific for a species, genome or even a chromosome. They may acquire large-scale variations in their sequence and copy number over an evolutionary time-scale. These features have formed the basis of extensive utilization of repetitive sequences for taxonomic and phylogenetic studies. Hybrid polyploids have especially proven to be excellent models for studying the evolution of repetitive DNA sequences. Recent studies explicitly show that some repetitive DNA families localized at the telomeres and centromeres have acquired important structural and functional significance. The repetitive elements are under different evolutionary constraints as compared to the genes. Satellite DNA families are thought to arise de novo as a consequence of molecular mechanisms such as unequal crossing over, rolling circle amplification, replication slippage and mutation that constitute "molecular drive". Copyright 2005 S. Karger AG, Basel.

  3. Whole-genome expression analysis of mammalian-wide interspersed repeat elements in human cell lines.

    PubMed

    Carnevali, Davide; Conti, Anastasia; Pellegrini, Matteo; Dieci, Giorgio

    2017-02-01

    With more than 500,000 copies, mammalian-wide interspersed repeats (MIRs), a sub-group of SINEs, represent ∼2.5% of the human genome and one of the most numerous family of potential targets for the RNA polymerase (Pol) III transcription machinery. Since MIR elements ceased to amplify ∼130 myr ago, previous studies primarily focused on their genomic impact, while the issue of their expression has not been extensively addressed. We applied a dedicated bioinformatic pipeline to ENCODE RNA-Seq datasets of seven human cell lines and, for the first time, we were able to define the Pol III-driven MIR transcriptome at single-locus resolution. While the majority of Pol III-transcribed MIR elements are cell-specific, we discovered a small set of ubiquitously transcribed MIRs mapping within Pol II-transcribed genes in antisense orientation that could influence the expression of the overlapping gene. We also identified novel Pol III-transcribed ncRNAs, deriving from transcription of annotated MIR fragments flanked by unique MIR-unrelated sequences, and confirmed the role of Pol III-specific internal promoter elements in MIR transcription. Besides demonstrating widespread transcription at these retrotranspositionally inactive elements in human cells, the ability to profile MIR expression at single-locus resolution will facilitate their study in different cell types and states including pathological alterations. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  4. Dicer-like 3 produces transposable element-associated 24-nt siRNAs that control agricultural traits in rice

    PubMed Central

    Wei, Liya; Gu, Lianfeng; Song, Xianwei; Cui, Xiekui; Lu, Zhike; Zhou, Ming; Wang, Lulu; Hu, Fengyi; Zhai, Jixian; Meyers, Blake C.; Cao, Xiaofeng

    2014-01-01

    Transposable elements (TEs) and repetitive sequences make up over 35% of the rice (Oryza sativa) genome. The host regulates the activity of different TEs by different epigenetic mechanisms, including DNA methylation, histone H3K9 methylation, and histone H3K4 demethylation. TEs can also affect the expression of host genes. For example, miniature inverted repeat TEs (MITEs), dispersed high copy-number DNA TEs, can influence the expression of nearby genes. In plants, 24-nt small interfering RNAs (siRNAs) are mainly derived from repeats and TEs. However, the extent to which TEs, particularly MITEs associated with 24-nt siRNAs, affect gene expression remains elusive. Here, we show that the rice Dicer-like 3 homolog OsDCL3a is primarily responsible for 24-nt siRNA processing. Impairing OsDCL3a expression by RNA interference caused phenotypes affecting important agricultural traits; these phenotypes include dwarfism, larger flag leaf angle, and fewer secondary branches. We used small RNA deep sequencing to identify 535,054 24-nt siRNA clusters. Of these clusters, ∼82% were OsDCL3a-dependent and showed significant enrichment of MITEs. Reduction of OsDCL3a function reduced the 24-nt siRNAs predominantly from MITEs and elevated expression of nearby genes. OsDCL3a directly targets genes involved in gibberellin and brassinosteroid homeostasis; OsDCL3a deficiency may affect these genes, thus causing the phenotypes of dwarfism and enlarged flag leaf angle. Our work identifies OsDCL3a-dependent 24-nt siRNAs derived from MITEs as broadly functioning regulators for fine-tuning gene expression, which may reflect a conserved epigenetic mechanism in higher plants with genomes rich in dispersed repeats or TEs. PMID:24554078

  5. UV-induced DNA damage is an intermediate step in UV-induced expression of human immunodeficiency virus type 1, collagenase, c-fos, and metallothionein.

    PubMed Central

    Stein, B; Rahmsdorf, H J; Steffen, A; Litfin, M; Herrlich, P

    1989-01-01

    UV irradiation of human and murine cells enhances the transcription of several genes. Here we report on the primary target of relevant UV absorption, on pathways leading to gene activation, and on the elements receiving the UV-induced signal in the human immunodeficiency virus type 1 (HIV-1) long terminal repeat, in the gene coding for collagenase, and in the cellular oncogene fos. In order to induce the expression of genes. UV radiation needs to be absorbed by DNA and to cause DNA damage of the kind that cannot be repaired by cells from patients with xeroderma pigmentosum group A. UV-induced activation of the three genes is mediated by the major enhancer elements (located between nucleotide positions -105 and -79 of HIV-1, between positions -72 and -65 of the collagenase gene, and between positions -320 and -299 of fos). These elements share no apparent sequence motif and bind different trans-acting proteins; a member of the NF kappa B family binds to the HIV-1 enhancer, the heterodimer of Jun and Fos (AP-1) binds to the collagenase enhancer, and the serum response factors p67 and p62 bind to fos. DNA-binding activities of the factors recognizing the HIV-1 and collagenase enhancers are augmented in extracts from UV-treated cells. The increase in activity is due to posttranslational modification. While AP-1 resides in the nucleus and must be modulated there, NF kappa B is activated in the cytoplasm, indicating the existence of a cytoplasmic signal transduction pathway triggered by UV-induced DNA damage. In addition to activation, new synthesis of AP-1 is induced by UV radiation. Images PMID:2557547

  6. Contrasting Patterns of rDNA Homogenization within the Zygosaccharomyces rouxii Species Complex

    PubMed Central

    Chand Dakal, Tikam; Giudici, Paolo; Solieri, Lisa

    2016-01-01

    Arrays of repetitive ribosomal DNA (rDNA) sequences are generally expected to evolve as a coherent family, where repeats within such a family are more similar to each other than to orthologs in related species. The continuous homogenization of repeats within individual genomes is a recombination process termed concerted evolution. Here, we investigated the extent and the direction of concerted evolution in 43 yeast strains of the Zygosaccharomyces rouxii species complex (Z. rouxii, Z. sapae, Z. mellis), by analyzing two portions of the 35S rDNA cistron, namely the D1/D2 domains at the 5’ end of the 26S rRNA gene and the segment including the internal transcribed spacers (ITS) 1 and 2 (ITS regions). We demonstrate that intra-genomic rDNA sequence variation is unusually frequent in this clade and that rDNA arrays in single genomes consist of an intermixing of Z. rouxii, Z. sapae and Z. mellis-like sequences, putatively evolved by reticulate evolutionary events that involved repeated hybridization between lineages. The levels and distribution of sequence polymorphisms vary across rDNA repeats in different individuals, reflecting four patterns of rDNA evolution: I) rDNA repeats that are homogeneous within a genome but are chimeras derived from two parental lineages via recombination: Z. rouxii in the ITS region and Z. sapae in the D1/D2 region; II) intra-genomic rDNA repeats that retain polymorphisms only in ITS regions; III) rDNA repeats that vary only in their D1/D2 domains; IV) heterogeneous rDNA arrays that have both polymorphic ITS and D1/D2 regions. We argue that an ongoing process of homogenization following allodiplodization or incomplete lineage sorting gave rise to divergent evolutionary trajectories in different strains, depending upon temporal, structural and functional constraints. We discuss the consequences of these findings for Zygosaccharomyces species delineation and, more in general, for yeast barcoding. PMID:27501051

  7. Revisiting the TALE repeat.

    PubMed

    Deng, Dong; Yan, Chuangye; Wu, Jianping; Pan, Xiaojing; Yan, Nieng

    2014-04-01

    Transcription activator-like (TAL) effectors specifically bind to double stranded (ds) DNA through a central domain of tandem repeats. Each TAL effector (TALE) repeat comprises 33-35 amino acids and recognizes one specific DNA base through a highly variable residue at a fixed position in the repeat. Structural studies have revealed the molecular basis of DNA recognition by TALE repeats. Examination of the overall structure reveals that the basic building block of TALE protein, namely a helical hairpin, is one-helix shifted from the previously defined TALE motif. Here we wish to suggest a structure-based re-demarcation of the TALE repeat which starts with the residues that bind to the DNA backbone phosphate and concludes with the base-recognition hyper-variable residue. This new numbering system is consistent with the α-solenoid superfamily to which TALE belongs, and reflects the structural integrity of TAL effectors. In addition, it confers integral number of TALE repeats that matches the number of bound DNA bases. We then present fifteen crystal structures of engineered dHax3 variants in complex with target DNA molecules, which elucidate the structural basis for the recognition of bases adenine (A) and guanine (G) by reported or uncharacterized TALE codes. Finally, we analyzed the sequence-structure correlation of the amino acid residues within a TALE repeat. The structural analyses reported here may advance the mechanistic understanding of TALE proteins and facilitate the design of TALEN with improved affinity and specificity.

  8. High variability of mitochondrial gene order among fungi.

    PubMed

    Aguileta, Gabriela; de Vienne, Damien M; Ross, Oliver N; Hood, Michael E; Giraud, Tatiana; Petit, Elsa; Gabaldón, Toni

    2014-02-01

    From their origin as an early alpha proteobacterial endosymbiont to their current state as cellular organelles, large-scale genomic reorganization has taken place in the mitochondria of all main eukaryotic lineages. So far, most studies have focused on plant and animal mitochondrial (mt) genomes (mtDNA), but fungi provide new opportunities to study highly differentiated mtDNAs. Here, we analyzed 38 complete fungal mt genomes to investigate the evolution of mtDNA gene order among fungi. In particular, we looked for evidence of nonhomologous intrachromosomal recombination and investigated the dynamics of gene rearrangements. We investigated the effect that introns, intronic open reading frames (ORFs), and repeats may have on gene order. Additionally, we asked whether the distribution of transfer RNAs (tRNAs) evolves independently to that of mt protein-coding genes. We found that fungal mt genomes display remarkable variation between and within the major fungal phyla in terms of gene order, genome size, composition of intergenic regions, and presence of repeats, introns, and associated ORFs. Our results support previous evidence for the presence of mt recombination in all fungal phyla, a process conspicuously lacking in most Metazoa. Overall, the patterns of rearrangements may be explained by the combined influences of recombination (i.e., most likely nonhomologous and intrachromosomal), accumulated repeats, especially at intergenic regions, and to a lesser extent, mobile element dynamics.

  9. Genome-wide analysis of LTR-retrotransposon diversity and its impact on the evolution of the genus Helianthus (L.).

    PubMed

    Mascagni, Flavia; Giordani, Tommaso; Ceccarelli, Marilena; Cavallini, Andrea; Natali, Lucia

    2017-08-18

    Genome divergence by mobile elements activity and recombination is a continuous process that plays a key role in the evolution of species. Nevertheless, knowledge on retrotransposon-related variability among species belonging to the same genus is still limited. Considering the importance of the genus Helianthus, a model system for studying the ecological genetics of speciation and adaptation, we performed a comparative analysis of the repetitive genome fraction across ten species and one subspecies of sunflower, focusing on long terminal repeat retrotransposons at superfamily, lineage and sublineage levels. After determining the relative genome size of each species, genomic DNA was isolated and subjected to Illumina sequencing. Then, different assembling and clustering approaches allowed exploring the repetitive component of all genomes. On average, repetitive DNA in Helianthus species represented more than 75% of the genome, being composed mostly by long terminal repeat retrotransposons. Also, the prevalence of Gypsy over Copia superfamily was observed and, among lineages, Chromovirus was by far the most represented. Although nearly all the same sublineages are present in all species, we found considerable variability in the abundance of diverse retrotransposon lineages and sublineages, especially between annual and perennial species. This large variability should indicate that different events of amplification or loss related to these elements occurred following species separation and should have been involved in species differentiation. Our data allowed us inferring on the extent of interspecific repetitive DNA variation related to LTR-RE abundance, investigating the relationship between changes of LTR-RE abundance and the evolution of the genus, and determining the degree of coevolution of different LTR-RE lineages or sublineages between and within species. Moreover, the data suggested that LTR-RE abundance in a species was affected by the annual or perennial habit of that species.

  10. Recent Amplification of the Kangaroo Endogenous Retrovirus, KERV, Limited to the Centromere▿

    PubMed Central

    Ferreri, Gianni C.; Brown, Judith D.; Obergfell, Craig; Jue, Nathaniel; Finn, Caitlin E.; O'Neill, Michael J.; O'Neill, Rachel J.

    2011-01-01

    Mammalian retrotransposons, transposable elements that are processed through an RNA intermediate, are categorized as short interspersed elements (SINEs), long interspersed elements (LINEs), and long terminal repeat (LTR) retroelements, which include endogenous retroviruses. The ability of transposable elements to autonomously amplify led to their initial characterization as selfish or junk DNA; however, it is now known that they may acquire specific cellular functions in a genome and are implicated in host defense mechanisms as well as in genome evolution. Interactions between classes of transposable elements may exert a markedly different and potentially more significant effect on a genome than interactions between members of a single class of transposable elements. We examined the genomic structure and evolution of the kangaroo endogenous retrovirus (KERV) in the marsupial genus Macropus. The complete proviral structure of the kangaroo endogenous retrovirus, phylogenetic relationship among relative retroviruses, and expression of this virus in both Macropus rufogriseus and M. eugenii are presented for the first time. In addition, we show the relative copy number and distribution of the kangaroo endogenous retrovirus in the Macropus genus. Our data indicate that amplification of the kangaroo endogenous retrovirus occurred in a lineage-specific fashion, is restricted to the centromeres, and is not correlated with LINE depletion. Finally, analysis of KERV long terminal repeat sequences using massively parallel sequencing indicates that the recent amplification in M. rufogriseus is likely due to duplications and concerted evolution rather than a high number of independent insertion events. PMID:21389136

  11. CRISPR-Cas systems: Prokaryotes upgrade to adaptive immunity.

    PubMed

    Barrangou, Rodolphe; Marraffini, Luciano A

    2014-04-24

    Clustered regularly interspaced short palindromic repeats (CRISPR), and associated proteins (Cas) comprise the CRISPR-Cas system, which confers adaptive immunity against exogenic elements in many bacteria and most archaea. CRISPR-mediated immunization occurs through the uptake of DNA from invasive genetic elements such as plasmids and viruses, followed by its integration into CRISPR loci. These loci are subsequently transcribed and processed into small interfering RNAs that guide nucleases for specific cleavage of complementary sequences. Conceptually, CRISPR-Cas shares functional features with the mammalian adaptive immune system, while also exhibiting characteristics of Lamarckian evolution. Because immune markers spliced from exogenous agents are integrated iteratively in CRISPR loci, they constitute a genetic record of vaccination events and reflect environmental conditions and changes over time. Cas endonucleases, which can be reprogrammed by small guide RNAs have shown unprecedented potential and flexibility for genome editing and can be repurposed for numerous DNA targeting applications including transcriptional control. Copyright © 2014 Elsevier Inc. All rights reserved.

  12. Mitochondrial genome of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa): A linear DNA molecule encoding a putative DNA-dependent DNA polymerase.

    PubMed

    Shao, Zhiyong; Graf, Shannon; Chaga, Oleg Y; Lavrov, Dennis V

    2006-10-15

    The 16,937-nuceotide sequence of the linear mitochondrial DNA (mt-DNA) molecule of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa) - the first mtDNA sequence from the class Scypozoa and the first sequence of a linear mtDNA from Metazoa - has been determined. This sequence contains genes for 13 energy pathway proteins, small and large subunit rRNAs, and methionine and tryptophan tRNAs. In addition, two open reading frames of 324 and 969 base pairs in length have been found. The deduced amino-acid sequence of one of them, ORF969, displays extensive sequence similarity with the polymerase [but not the exonuclease] domain of family B DNA polymerases, and this ORF has been tentatively identified as dnab. This is the first report of dnab in animal mtDNA. The genes in A. aurita mtDNA are arranged in two clusters with opposite transcriptional polarities; transcription proceeding toward the ends of the molecule. The determined sequences at the ends of the molecule are nearly identical but inverted and lack any obvious potential secondary structures or telomere-like repeat elements. The acquisition of mitochondrial genomic data for the second class of Cnidaria allows us to reconstruct characteristic features of mitochondrial evolution in this animal phylum.

  13. Transposable element evolution in Heliconius suggests genome diversity within Lepidoptera

    PubMed Central

    2013-01-01

    Background Transposable elements (TEs) have the potential to impact genome structure, function and evolution in profound ways. In order to understand the contribution of transposable elements (TEs) to Heliconius melpomene, we queried the H. melpomene draft sequence to identify repetitive sequences. Results We determined that TEs comprise ~25% of the genome. The predominant class of TEs (~12% of the genome) was the non-long terminal repeat (non-LTR) retrotransposons, including a novel SINE family. However, this was only slightly higher than content derived from DNA transposons, which are diverse, with several families having mobilized in the recent past. Compared to the only other well-studied lepidopteran genome, Bombyx mori, H. melpomene exhibits a higher DNA transposon content and a distinct repertoire of retrotransposons. We also found that H. melpomene exhibits a high rate of TE turnover with few older elements accumulating in the genome. Conclusions Our analysis represents the first complete, de novo characterization of TE content in a butterfly genome and suggests that, while TEs are able to invade and multiply, TEs have an overall deleterious effect and/or that maintaining a small genome is advantageous. Our results also hint that analysis of additional lepidopteran genomes will reveal substantial TE diversity within the group. PMID:24088337

  14. Linkage map of the fragments of herpesvirus papio DNA.

    PubMed Central

    Lee, Y S; Tanaka, A; Lau, R Y; Nonoyama, M; Rabin, H

    1981-01-01

    Herpesvirus papio (HVP), an Epstein-Barr-like virus, causes lymphoblastoid disease in baboons. The physical map of HVP DNA was constructed for the fragments produced by cleavage of HVP DNA with restriction endonucleases EcoRI, HindIII, SalI, and PvuI, which produced 12, 12, 10, and 4 fragments, respectively. The total molecular size of HVP DNA was calculated as close to 110 megadaltons. The following methods were used for construction of the map; (i) fragments near the ends of HVP DNA were identified by treating viral DNA with lambda exonuclease before restriction enzyme digestion; (ii) fragments containing nucleotide sequences in common with fragments from the second enzyme digest of HVP DNA were examined by Southern blot hybridization; and (iii) the location of some fragments was determined by isolating individual fragments from agarose gels and redigesting the isolated fragments with a second restriction enzyme. Terminal heterogeneity and internal repeats were found to be unique features of HVP DNA molecule. One to five repeats of 0.8 megadaltons were found at both terminal ends. Although the repeats of both ends shared a certain degree of homology, it was not determined whether they were identical repeats. The internal repeat sequence of HVP DNA was found in the EcoRI-C region, which extended from 8.4 to 23 megadaltons from the left end of the molecule. The average number of the repeats was calculated to be seven, and the molecular size was determined to be 1.8 megadaltons. Similar unique features have been reported in EBV DNA (D. Given and E. Kieff, J. Virol. 28:524-542, 1978). Images PMID:6261015

  15. Local chromatin structure of heterochromatin regulates repeated DNA stability, nucleolus structure, and genome integrity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Peng, Jamy C.

    Heterochromatin constitutes a significant portion of the genome in higher eukaryotes; approximately 30% in Drosophila and human. Heterochromatin contains a high repeat DNA content and a low density of protein-encoding genes. In contrast, euchromatin is composed mostly of unique sequences and contains the majority of single-copy genes. Genetic and cytological studies demonstrated that heterochromatin exhibits regulatory roles in chromosome organization, centromere function and telomere protection. As an epigenetically regulated structure, heterochromatin formation is not defined by any DNA sequence consensus. Heterochromatin is characterized by its association with nucleosomes containing methylated-lysine 9 of histone H3 (H3K9me), heterochromatin protein 1 (HP1) thatmore » binds H3K9me, and Su(var)3-9, which methylates H3K9 and binds HP1. Heterochromatin formation and functions are influenced by HP1, Su(var)3-9, and the RNA interference (RNAi) pathway. My thesis project investigates how heterochromatin formation and function impact nuclear architecture, repeated DNA organization, and genome stability in Drosophila melanogaster. H3K9me-based chromatin reduces extrachromosomal DNA formation; most likely by restricting the access of repair machineries to repeated DNAs. Reducing extrachromosomal ribosomal DNA stabilizes rDNA repeats and the nucleolus structure. H3K9me-based chromatin also inhibits DNA damage in heterochromatin. Cells with compromised heterochromatin structure, due to Su(var)3-9 or dcr-2 (a component of the RNAi pathway) mutations, display severe DNA damage in heterochromatin compared to wild type. In these mutant cells, accumulated DNA damage leads to chromosomal defects such as translocations, defective DNA repair response, and activation of the G2-M DNA repair and mitotic checkpoints that ensure cellular and animal viability. My thesis research suggests that DNA replication, repair, and recombination mechanisms in heterochromatin differ from those in euchromatin. Remarkably, human euchromatin and fly heterochromatin share similar features; such as repeated DNA content, intron lengths and open reading frame sizes. Human cells likely stabilize their DNA content via mechanisms and factors similar to those in Drosophila heterochromatin. Furthermore, my thesis work raises implications for H3K9me and chromatin functions in complex-DNA genome stability, repeated DNA homogenization by molecular drive, and in genome reorganization through evolution.« less

  16. Mitochondrial genome rearrangements in glomus species triggered by homologous recombination between distinct mtDNA haplotypes.

    PubMed

    Beaudet, Denis; Terrat, Yves; Halary, Sébastien; de la Providencia, Ivan Enrique; Hijri, Mohamed

    2013-01-01

    Comparative mitochondrial genomics of arbuscular mycorrhizal fungi (AMF) provide new avenues to overcome long-lasting obstacles that have hampered studies aimed at understanding the community structure, diversity, and evolution of these multinucleated and genetically polymorphic organisms.AMF mitochondrial (mt) genomes are homogeneous within isolates, and their intergenic regions harbor numerous mobile elements that have rapidly diverged, including homing endonuclease genes, small inverted repeats, and plasmid-related DNA polymerase genes (dpo), making them suitable targets for the development of reliable strain-specific markers. However, these elements may also lead to genome rearrangements through homologous recombination, although this has never previously been reported in this group of obligate symbiotic fungi. To investigate whether such rearrangements are present and caused by mobile elements in AMF, the mitochondrial genomes from two Glomeraceae members (i.e., Glomus cerebriforme and Glomus sp.) with substantial mtDNA synteny divergence,were sequenced and compared with available glomeromycotan mitochondrial genomes. We used an extensive nucleotide/protein similarity network-based approach to investigated podiversity in AMF as well as in other organisms for which sequences are publicly available. We provide strong evidence of dpo-induced inter-haplotype recombination, leading to a reshuffled mitochondrial genome in Glomus sp. These findings raise questions as to whether AMF single spore cultivations artificially underestimate mtDNA genetic diversity.We assessed potential dpo dispersal mechanisms in AMF and inferred a robust phylogenetic relationship with plant mitochondrial plasmids. Along with other indirect evidence, our analyses indicate that members of the Glomeromycota phylum are potential donors of mitochondrial plasmids to plants.

  17. Mitochondrial Genome Rearrangements in Glomus Species Triggered by Homologous Recombination between Distinct mtDNA Haplotypes

    PubMed Central

    Beaudet, Denis; Terrat, Yves; Halary, Sébastien; de la Providencia, Ivan Enrique; Hijri, Mohamed

    2013-01-01

    Comparative mitochondrial genomics of arbuscular mycorrhizal fungi (AMF) provide new avenues to overcome long-lasting obstacles that have hampered studies aimed at understanding the community structure, diversity, and evolution of these multinucleated and genetically polymorphic organisms. AMF mitochondrial (mt) genomes are homogeneous within isolates, and their intergenic regions harbor numerous mobile elements that have rapidly diverged, including homing endonuclease genes, small inverted repeats, and plasmid-related DNA polymerase genes (dpo), making them suitable targets for the development of reliable strain-specific markers. However, these elements may also lead to genome rearrangements through homologous recombination, although this has never previously been reported in this group of obligate symbiotic fungi. To investigate whether such rearrangements are present and caused by mobile elements in AMF, the mitochondrial genomes from two Glomeraceae members (i.e., Glomus cerebriforme and Glomus sp.) with substantial mtDNA synteny divergence, were sequenced and compared with available glomeromycotan mitochondrial genomes. We used an extensive nucleotide/protein similarity network-based approach to investigate dpo diversity in AMF as well as in other organisms for which sequences are publicly available. We provide strong evidence of dpo-induced inter-haplotype recombination, leading to a reshuffled mitochondrial genome in Glomus sp. These findings raise questions as to whether AMF single spore cultivations artificially underestimate mtDNA genetic diversity. We assessed potential dpo dispersal mechanisms in AMF and inferred a robust phylogenetic relationship with plant mitochondrial plasmids. Along with other indirect evidence, our analyses indicate that members of the Glomeromycota phylum are potential donors of mitochondrial plasmids to plants. PMID:23925788

  18. A super-family of transcriptional activators regulates bacteriophage packaging and lysis in Gram-positive bacteria

    PubMed Central

    Quiles-Puchalt, Nuria; Tormo-Más, María Ángeles; Campoy, Susana; Toledo-Arana, Alejandro; Monedero, Vicente; Lasa, Íñigo; Novick, Richard P.; Christie, Gail E.; Penadés, José R.

    2013-01-01

    The propagation of bacteriophages and other mobile genetic elements requires exploitation of the phage mechanisms involved in virion assembly and DNA packaging. Here, we identified and characterized four different families of phage-encoded proteins that function as activators required for transcription of the late operons (morphogenetic and lysis genes) in a large group of phages infecting Gram-positive bacteria. These regulators constitute a super-family of proteins, here named late transcriptional regulators (Ltr), which share common structural, biochemical and functional characteristics and are unique to this group of phages. They are all small basic proteins, encoded by genes present at the end of the early gene cluster in their respective phage genomes and expressed under cI repressor control. To control expression of the late operon, the Ltr proteins bind to a DNA repeat region situated upstream of the terS gene, activating its transcription. This involves the C-terminal part of the Ltr proteins, which control specificity for the DNA repeat region. Finally, we show that the Ltr proteins are the only phage-encoded proteins required for the activation of the packaging and lysis modules. In summary, we provide evidence that phage packaging and lysis is a conserved mechanism in Siphoviridae infecting a wide variety of Gram-positive bacteria. PMID:23771138

  19. Aberrant methylation and associated transcriptional mobilization of Alu elements contributes to genomic instability in hypoxia.

    PubMed

    Pal, Arnab; Srivastava, Tapasya; Sharma, Manish K; Mehndiratta, Mohit; Das, Prerna; Sinha, Subrata; Chattopadhyay, Parthaprasad

    2010-11-01

    Hypoxia is an integral part of tumorigenesis and contributes extensively to the neoplastic phenotype including drug resistance and genomic instability. It has also been reported that hypoxia results in global demethylation. Because a majority of the cytosine-phosphate-guanine (CpG) islands are found within the repeat elements of DNA, and are usually methylated under normoxic conditions, we suggested that retrotransposable Alu or short interspersed nuclear elements (SINEs) which show altered methylation and associated changes of gene expression during hypoxia, could be associated with genomic instability. U87MG glioblastoma cells were cultured in 0.1% O₂ for 6 weeks and compared with cells cultured in 21% O₂ for the same duration. Real-time PCR analysis showed a significant increase in SINE and reverse transcriptase coding long interspersed nuclear element (LINE) transcripts during hypoxia. Sequencing of bisulphite treated DNA as well as the Combined Bisulfite Restriction Analysis (COBRA) assay showed that the SINE loci studied underwent significant hypomethylation though there was patchy hypermethylation at a few sites. The inter-alu PCR profile of DNA from cells cultured under 6-week hypoxia, its 4-week revert back to normoxia and 6-week normoxia showed several changes in the band pattern indicating increased alu mediated genomic alteration. Our results show that aberrant methylation leading to increased transcription of SINE and reverse transcriptase associated LINE elements could lead to increased genomic instability in hypoxia. This might be a cause of genetic heterogeneity in tumours especially in variegated hypoxic environment and lead to a development of foci of more aggressive tumour cells. © 2009 The Authors Journal compilation © 2010 Foundation for Cellular and Molecular Medicine/Blackwell Publishing Ltd.

  20. DNA methylation by DNMT1 and DNMT3b methyltransferases is driven by the MUC1-C oncoprotein in human carcinoma cells.

    PubMed

    Rajabi, H; Tagde, A; Alam, M; Bouillez, A; Pitroda, S; Suzuki, Y; Kufe, D

    2016-12-15

    Aberrant expression of the DNA methyltransferases (DNMTs) and disruption of DNA methylation patterns are associated with carcinogenesis and cancer cell survival. The oncogenic MUC1-C protein is aberrantly overexpressed in diverse carcinomas; however, there is no known link between MUC1-C and DNA methylation. Our results demonstrate that MUC1-C induces the expression of DNMT1 and DNMT3b, but not DNMT3a, in breast and other carcinoma cell types. We show that MUC1-C occupies the DNMT1 and DNMT3b promoters in complexes with NF-κB p65 and drives DNMT1 and DNMT3b transcription. In this way, MUC1-C controls global DNA methylation as determined by analysis of LINE-1 repeat elements. The results further demonstrate that targeting MUC1-C downregulates DNA methylation of the CDH1 tumor suppressor gene in association with induction of E-cadherin expression. These findings provide compelling evidence that MUC1-C is of functional importance to induction of DNMT1 and DNMT3b and, in turn, changes in DNA methylation patterns in cancer cells.

  1. A periodic pattern of SNPs in the human genome

    PubMed Central

    Madsen, Bo Eskerod; Villesen, Palle; Wiuf, Carsten

    2007-01-01

    By surveying a filtered, high-quality set of SNPs in the human genome, we have found that SNPs positioned 1, 2, 4, 6, or 8 bp apart are more frequent than SNPs positioned 3, 5, 7, or 9 bp apart. The observed pattern is not restricted to genomic regions that are known to cause sequencing or alignment errors, for example, transposable elements (SINE, LINE, and LTR), tandem repeats, and large duplicated regions. However, we found that the pattern is almost entirely confined to what we define as “periodic DNA.” Periodic DNA is a genomic region with a high degree of periodicity in nucleotide usage. It turned out that periodic DNA is mainly small regions (average length 16.9 bp), widely distributed in the genome. Furthermore, periodic DNA has a 1.8 times higher SNP density than the rest of the genome and SNPs inside periodic DNA have a significantly higher genotyping error rate than SNPs outside periodic DNA. Our results suggest that not all SNPs in the human genome are created by independent single nucleotide mutations, and that care should be taken in analysis of SNPs from periodic DNA. The latter may have important consequences for SNP and association studies. PMID:17673700

  2. Co-evolution of plant LTR-retrotransposons and their host genomes.

    PubMed

    Zhao, Meixia; Ma, Jianxin

    2013-07-01

    Transposable elements (TEs), particularly, long terminal repeat retrotransposons (LTR-RTs), are the most abundant DNA components in all plant species that have been investigated, and are largely responsible for plant genome size variation. Although plant genomes have experienced periodic proliferation and/or recent burst of LTR-retrotransposons, the majority of LTR-RTs are inactivated by DNA methylation and small RNA-mediated silencing mechanisms, and/or were deleted/truncated by unequal homologous recombination and illegitimate recombination, as suppression mechanisms that counteract genome expansion caused by LTR-RT amplification. LTR-RT DNA is generally enriched in pericentromeric regions of the host genomes, which appears to be the outcomes of preferential insertions of LTR-RTs in these regions and low effectiveness of selection that purges LTR-RT DNA from these regions relative to chromosomal arms. Potential functions of various TEs in their host genomes remain blurry; nevertheless, LTR-RTs have been recognized to play important roles in maintaining chromatin structures and centromere functions and regulation of gene expressions in their host genomes.

  3. Molecular Organization of the 25S–18S rDNA IGS of Fagus sylvatica and Quercus suber: A Comparative Analysis

    PubMed Central

    Inácio, Vera; Rocheta, Margarida; Morais-Cecílio, Leonor

    2014-01-01

    The 35S ribosomal DNA (rDNA) units, repeated in tandem at one or more chromosomal loci, are separated by an intergenic spacer (IGS) containing functional elements involved in the regulation of transcription of downstream rRNA genes. In the present work, we have compared the IGS molecular organizations in two divergent species of Fagaceae, Fagus sylvatica and Quercus suber, aiming to comprehend the evolution of the IGS sequences within the family. Self- and cross-hybridization FISH was done on representative species of the Fagaceae. The IGS length variability and the methylation level of 18 and 25S rRNA genes were assessed in representatives of three genera of this family: Fagus, Quercus and Castanea. The intergenic spacers in Beech and Cork Oak showed similar overall organizations comprising putative functional elements needed for rRNA gene activity and containing a non-transcribed spacer (NTS), a promoter region, and a 5′-external transcribed spacer. In the NTS: the sub-repeats structure in Beech is more organized than in Cork Oak, sharing some short motifs which results in the lowest sequence similarity of the entire IGS; the AT-rich region differed in both spacers by a GC-rich block inserted in Cork Oak. The 5′-ETS is the region with the higher similarity, having nonetheless different lengths. FISH with the NTS-5′-ETS revealed fainter signals in cross-hybridization in agreement with the divergence between genera. The diversity of IGS lengths revealed variants from ∼2 kb in Fagus, and Quercus up to 5.3 kb in Castanea, and a lack of correlation between the number of variants and the number of rDNA loci in several species. Methylation of 25S Bam HI site was confirmed in all species and detected for the first time in the 18S of Q. suber and Q. faginea. These results provide important clues for the evolutionary trends of the rDNA 25S-18S IGS in the Fagaceae family. PMID:24893289

  4. The Targeted Sequencing of Alpha Satellite DNA in Cercopithecus pogonias Provides New Insight into the Diversity and Dynamics of Centromeric Repeats in Old World monkeys.

    PubMed

    Cacheux, Lauriane; Ponger, Loïc; Gerbault-Seureau, Michèle; Loll, François; Gey, Delphine; Richard, Florence Anne; Escudé, Christophe

    2018-06-01

    Alpha satellite is the major repeated DNA element of primate centromeres. Specific evolutionary mechanisms have led to a great diversity of sequence families with peculiar genomic organization and distribution, which have till now been studied mostly in great apes. Using high throughput sequencing of alpha satellite monomers obtained by enzymatic digestion followed by computational and cytogenetic analysis, we compare here the diversity and genomic distribution of alpha satellite DNA in two related Old World monkey species, Cercopithecus pogonias and Cercopithecus solatus, which are known to have diverged about seven million years ago. Two main families of monomers, called C1 and C2, are found in both species. A detailed analysis of our datasets revealed the existence of numerous subfamilies within the centromeric C1 family. Although the most abundant subfamily is conserved between both species, our FISH experiments clearly show that some subfamilies are specific for each species and that their distribution is restricted to a subset of chromosomes, thereby pointing to the existence of recurrent amplification/homogenization events. The pericentromeric C2 family is very abundant on the short arm of all acrocentric chromosomes in both species, pointing to specific mechanisms that lead to this distribution. Results obtained using two different restriction enzymes are fully consistent with a predominant monomeric organization of alpha satellite DNA which coexists with higher order organization patterns in the Cercopithecus pogonias genome. Our study suggests a high dynamics of alpha satellite DNA in Cercopithecini, with recurrent apparition of new sequence variants and interchromosomal sequence transfer.

  5. A variant Tc4 transposable element in the nematode C. elegans could encode a novel protein.

    PubMed Central

    Li, W; Shaw, J E

    1993-01-01

    A variant C. elegans Tc4 transposable element, Tc4-rh1030, has been sequenced and is 3483 bp long. The Tc4 element that had been analyzed previously is 1605 bp long, consists of two 774-bp nearly perfect inverted terminal repeats connected by a 57-bp loop, and lacks significant open reading frames. In Tc4-rh1030, by comparison, a 2343-bp novel sequence is present in place of a 477-bp segment in one of the inverted repeats. The novel sequence of Tc4-rh1030 is present about five times per haploid genome and is invariably associated with Tc4 elements; we have used the designation Tc4v to denote this variant subfamily of Tc4 elements. Sequence analysis of three cDNA clones suggests that a Tc4v element contains at least five exons that could encode a novel basic protein of 537 amino acid residues. On northern blots, a 1.6-kb Tc4v-specific transcript was detected in the mutator strain TR679 but not in the wild-type strain N2; Tc4 elements are known to transpose in TR679 but appear to be quiescent in N2. We have analyzed transcripts produced by an unc-33 gene that has the Tc4-rh1030 insertional mutation in its transcribed region; all or almost all of the Tc4v sequence is frequently spliced out of the mutant unc-33 transcripts, sometimes by means of non-consensus splice acceptor sites. Images PMID:8382791

  6. Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite gene clusters.

    PubMed

    Dallery, Jean-Félix; Lapalu, Nicolas; Zampounis, Antonios; Pigné, Sandrine; Luyten, Isabelle; Amselem, Joëlle; Wittenberg, Alexander H J; Zhou, Shiguo; de Queiroz, Marisa V; Robin, Guillaume P; Auger, Annie; Hainaut, Matthieu; Henrissat, Bernard; Kim, Ki-Tae; Lee, Yong-Hwan; Lespinet, Olivier; Schwartz, David C; Thon, Michael R; O'Connell, Richard J

    2017-08-29

    The ascomycete fungus Colletotrichum higginsianum causes anthracnose disease of brassica crops and the model plant Arabidopsis thaliana. Previous versions of the genome sequence were highly fragmented, causing errors in the prediction of protein-coding genes and preventing the analysis of repetitive sequences and genome architecture. Here, we re-sequenced the genome using single-molecule real-time (SMRT) sequencing technology and, in combination with optical map data, this provided a gapless assembly of all twelve chromosomes except for the ribosomal DNA repeat cluster on chromosome 7. The more accurate gene annotation made possible by this new assembly revealed a large repertoire of secondary metabolism (SM) key genes (89) and putative biosynthetic pathways (77 SM gene clusters). The two mini-chromosomes differed from the ten core chromosomes in being repeat- and AT-rich and gene-poor but were significantly enriched with genes encoding putative secreted effector proteins. Transposable elements (TEs) were found to occupy 7% of the genome by length. Certain TE families showed a statistically significant association with effector genes and SM cluster genes and were transcriptionally active at particular stages of fungal development. All 24 subtelomeres were found to contain one of three highly-conserved repeat elements which, by providing sites for homologous recombination, were probably instrumental in four segmental duplications. The gapless genome of C. higginsianum provides access to repeat-rich regions that were previously poorly assembled, notably the mini-chromosomes and subtelomeres, and allowed prediction of the complete SM gene repertoire. It also provides insights into the potential role of TEs in gene and genome evolution and host adaptation in this asexual pathogen.

  7. Stress-induced rearrangement of Fusarium retrotransposon sequences.

    PubMed

    Anaya, N; Roncero, M I

    1996-11-27

    Rearrangement of fusarium oxysporum retrotransposon skippy was induced by growth in the presence of potassium chlorate. Three fungal strains, one sensitive to chlorate (Co60) and two resistant to chlorate and deficient for nitrate reductase (Co65 and Co94), were studied by Southern analysis of their genomic DNA. Polymorphism was detected in their hybridization banding pattern, relative to the wild type grown in the absence of chlorate, using various enzymes with or without restriction sites within the retrotransposon. Results were consistent with the assumption that three different events had occurred in strain Co60: genomic amplification of skippy yielding tandem arrays of the element, generation of new skippy sequences, and deletion of skippy sequences. Amplification of Co60 genomic DNA using the polymerase chain reaction and divergent primers derived from the retrotransposon generated a new band, corresponding to one long terminal repeat plus flanking sequences, that was not present in the wild-type strain. Molecular analysis of nitrate reductase-deficient mutants showed that generation and deletion of skippy sequences, but not genomic amplification in tandem repeats, had occurred in their genomes.

  8. Molecular characterization and physical localization of highly repetitive DNA sequences from Brazilian Alstroemeria species.

    PubMed

    Kuipers, A G J; Kamstra, S A; de Jeu, M J; Visser, R G F

    2002-01-01

    Highly repetitive DNA sequences were isolated from genomic DNA libraries of Alstroemeria psittacina and A. inodora. Among the repetitive sequences that were isolated, tandem repeats as well as dispersed repeats could be discerned. The tandem repeats belonged to a family of interlinked Sau3A subfragments with sizes varying from 68-127 bp, and constituted a larger HinfI repeat of approximately 400 bp. Southern hybridization showed a similar molecular organization of the tandem repeats in each of the Brazilian Alstroemeria species tested. None of the repeats hybridized with DNA from Chilean Alstroemeria species, which indicates that they are specific for the Brazilian species. In-situ localization studies revealed the tandem repeats to be localized in clusters on the chromosomes of A. inodora and A. psittacina: distal hybridization sites were found on chromosome arms 2PS, 6PL, 7PS, 7PL and 8PL, interstitial sites on chromosome arms 2PL, 3PL, 4PL and 5PL. The applicability of the tandem repeats for cytogenetic analysis of interspecific hybrids and their role in heterochromatin organization are discussed.

  9. Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.

    PubMed Central

    Benslimane, A A; Dron, M; Hartmann, C; Rode, A

    1986-01-01

    Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology between one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (respectively 64% and 60%). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammalians. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. Images PMID:3774553

  10. Concerted copy number variation balances ribosomal DNA dosage in human and mouse genomes

    PubMed Central

    Gibbons, John G.; Branco, Alan T.; Godinho, Susana A.; Yu, Shoukai; Lemos, Bernardo

    2015-01-01

    Tandemly repeated ribosomal DNA (rDNA) arrays are among the most evolutionary dynamic loci of eukaryotic genomes. The loci code for essential cellular components, yet exhibit extensive copy number (CN) variation within and between species. CN might be partly determined by the requirement of dosage balance between the 5S and 45S rDNA arrays. The arrays are nonhomologous, physically unlinked in mammals, and encode functionally interdependent RNA components of the ribosome. Here we show that the 5S and 45S rDNA arrays exhibit concerted CN variation (cCNV). Despite 5S and 45S rDNA elements residing on different chromosomes and lacking sequence similarity, cCNV between these loci is strong, evolutionarily conserved in humans and mice, and manifested across individual genotypes in natural populations and pedigrees. Finally, we observe that bisphenol A induces rapid and parallel modulation of 5S and 45S rDNA CN. Our observations reveal a novel mode of genome variation, indicate that natural selection contributed to the evolution and conservation of cCNV, and support the hypothesis that 5S CN is partly determined by the requirement of dosage balance with the 45S rDNA array. We suggest that human disease variation might be traced to disrupted rDNA dosage balance in the genome. PMID:25583482

  11. Fission yeast retrotransposon Tf1 integration is targeted to 5' ends of open reading frames.

    PubMed

    Behrens, R; Hayles, J; Nurse, P

    2000-12-01

    Target site selection of transposable elements is usually not random but involves some specificity for a DNA sequence or a DNA binding host factor. We have investigated the target site selection of the long terminal repeat-containing retrotransposon Tf1 from the fission yeast Schizosaccharomyces pombe. By monitoring induced transposition events we found that Tf1 integration sites were distributed throughout the genome. Mapping these insertions revealed that Tf1 did not integrate into open reading frames, but occurred preferentially in longer intergenic regions with integration biased towards a region 100-420 bp upstream of the translation start site. Northern blot analysis showed that transcription of genes adjacent to Tf1 insertions was not significantly changed.

  12. Fission yeast retrotransposon Tf1 integration is targeted to 5′ ends of open reading frames

    PubMed Central

    Behrens, Ralf; Hayles, Jacky; Nurse, Paul

    2000-01-01

    Target site selection of transposable elements is usually not random but involves some specificity for a DNA sequence or a DNA binding host factor. We have investigated the target site selection of the long terminal repeat-containing retrotransposon Tf1 from the fission yeast Schizosaccharomyces pombe. By monitoring induced transposition events we found that Tf1 integration sites were distributed throughout the genome. Mapping these insertions revealed that Tf1 did not integrate into open reading frames, but occurred preferentially in longer intergenic regions with integration biased towards a region 100–420 bp upstream of the translation start site. Northern blot analysis showed that transcription of genes adjacent to Tf1 insertions was not significantly changed. PMID:11095681

  13. Effects of age, sex, and persistent organic pollutants on DNA methylation in children

    PubMed Central

    Huen, Karen; Yousefi, Paul; Bradman, Asa; Yan, Liying; Harley, Kim G.; Kogut, Katherine; Eskenazi, Brenda; Holland, Nina

    2015-01-01

    Epigenetic changes such as DNA methylation may be a molecular mechanism through which environmental exposures affect health. Methylation of Alu and long interspersed nucleotide elements (LINE-1) is a well-established measure of DNA methylation often used in epidemiologic studies. Yet, few studies have examined the effects of host factors on LINE-1 and Alu methylation in children. We characterized the relationship of age, sex, and prenatal exposure to persistent organic pollutants (POPs), dichlorodiphenyl trichloroethane (DDT), dichlorodiphenyldichloroethylene (DDE), and polybrominated diphenyl ethers (PBDEs), with DNA methylation in a birth cohort of Mexican-American children participating in the CHAMACOS study. We measured Alu and LINE-1 methylation by pyrosequencing bisulfite-treated DNA isolated from whole blood samples collected from newborns and 9-year old children (n=358). POPs were measured in maternal serum during late pregnancy. Levels of DNA methylation were lower in 9-year olds compared to newborns and were higher in boys compared to girls. Higher prenatal DDT/E exposure was associated with lower Alu methylation at birth, particularly after adjusting for cell type composition (p=0.02 for o,p′ -DDT). Associations of POPs with LINE-1 methylation were only identified after examining the co-exposure of DDT/E with PBDEs simultaneously. Our data suggest that repeat element methylation can be an informative marker of epigenetic differences by age and sex and that prenatal exposure to POPs may be linked to hypomethylation in fetal blood. Accounting for co-exposure to different types of chemicals and adjusting for blood cell types may increase sensitivity of epigenetic analyses for epidemiological studies. PMID:24375655

  14. Adeno-associated virus inverted terminal repeats stimulate gene editing.

    PubMed

    Hirsch, M L

    2015-02-01

    Advancements in genome editing have relied on technologies to specifically damage DNA which, in turn, stimulates DNA repair including homologous recombination (HR). As off-target concerns complicate the therapeutic translation of site-specific DNA endonucleases, an alternative strategy to stimulate gene editing based on fragile DNA was investigated. To do this, an episomal gene-editing reporter was generated by a disruptive insertion of the adeno-associated virus (AAV) inverted terminal repeat (ITR) into the egfp gene. Compared with a non-structured DNA control sequence, the ITR induced DNA damage as evidenced by increased gamma-H2AX and Mre11 foci formation. As local DNA damage stimulates HR, ITR-mediated gene editing was investigated using DNA oligonucleotides as repair substrates. The AAV ITR stimulated gene editing >1000-fold in a replication-independent manner and was not biased by the polarity of the repair oligonucleotide. Analysis of additional human DNA sequences demonstrated stimulation of gene editing to varying degrees. In particular, inverted yet not direct, Alu repeats induced gene editing, suggesting a role for DNA structure in the repair event. Collectively, the results demonstrate that inverted DNA repeats stimulate gene editing via double-strand break repair in an episomal context and allude to efficient gene editing of the human chromosome using fragile DNA sequences.

  15. Superior ab initio identification, annotation and characterisation of TEs and segmental duplications from genome assemblies.

    PubMed

    Zeng, Lu; Kortschak, R Daniel; Raison, Joy M; Bertozzi, Terry; Adelson, David L

    2018-01-01

    Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package.

  16. Superior ab initio identification, annotation and characterisation of TEs and segmental duplications from genome assemblies

    PubMed Central

    Zeng, Lu; Kortschak, R. Daniel; Raison, Joy M.

    2018-01-01

    Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package. PMID:29538441

  17. Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome.

    PubMed

    González, Leonardo Galindo; Deyholos, Michael K

    2012-11-21

    Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the Copia LTR superfamily are inserted close to coding regions, we suggest Copia elements have a greater influence on recent flax genome evolution while Gypsy elements have become residual and highly mutated.

  18. Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome

    PubMed Central

    2012-01-01

    Background Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. Results Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. Conclusions The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the Copia LTR superfamily are inserted close to coding regions, we suggest Copia elements have a greater influence on recent flax genome evolution while Gypsy elements have become residual and highly mutated. PMID:23171245

  19. Duplication in DNA Sequences

    NASA Astrophysics Data System (ADS)

    Ito, Masami; Kari, Lila; Kincaid, Zachary; Seki, Shinnosuke

    The duplication and repeat-deletion operations are the basis of a formal language theoretic model of errors that can occur during DNA replication. During DNA replication, subsequences of a strand of DNA may be copied several times (resulting in duplications) or skipped (resulting in repeat-deletions). As formal language operations, iterated duplication and repeat-deletion of words and languages have been well studied in the literature. However, little is known about single-step duplications and repeat-deletions. In this paper, we investigate several properties of these operations, including closure properties of language families in the Chomsky hierarchy and equations involving these operations. We also make progress toward a characterization of regular languages that are generated by duplicating a regular language.

  20. TALE-Like Effectors Are an Ancestral Feature of the Ralstonia solanacearum Species Complex and Converge in DNA Targeting Specificity.

    PubMed

    Schandry, Niklas; de Lange, Orlando; Prior, Philippe; Lahaye, Thomas

    2016-01-01

    Ralstonia solanacearum, a species complex of bacterial plant pathogens divided into four monophyletic phylotypes, causes plant diseases in tropical climates around the world. Some strains exhibit a broad host range on solanaceous hosts, while others are highly host-specific as for example some banana-pathogenic strains. Previous studies showed that transcription activator-like (TAL) effectors from Ralstonia, termed RipTALs, are capable of activating reporter genes in planta, if these are preceded by a matching effector binding element (EBE). RipTALs target DNA via their central repeat domain (CRD), where one repeat pairs with one DNA-base of the given EBE. The repeat variable diresidue dictates base repeat specificity in a predictable fashion, known as the TALE code. In this work, we analyze RipTALs across all phylotypes of the Ralstonia solanacearum species complex. We find that RipTALs are prevalent in phylotypes I and IV but absent from most phylotype III and II strains (10/12, 8/14, 1/24, and 1/5 strains contained a RipTAL, respectively). RipTALs originating from strains of the same phylotype show high levels of sequence similarity (>98%) in the N-terminal and C-terminal regions, while RipTALs isolated from different phylotypes show 47-91% sequence similarity in those regions, giving rise to four RipTAL classes. We show that, despite sequence divergence, the base preference for guanine, mediated by the N-terminal region, is conserved across RipTALs of all classes. Using the number and order of repeats found in the CRD, we functionally sub-classify RipTALs, introduce a new simple nomenclature, and predict matching EBEs for all seven distinct RipTALs identified. We experimentally study RipTAL EBEs and uncover that some RipTALs are able to target the EBEs of other RipTALs, referred to as cross-reactivity. In particular, RipTALs from strains with a broad host range on solanaceous hosts cross-react on each other's EBEs. Investigation of sequence divergence between RipTAL repeats allows for a reconstruction of repeat array biogenesis, for example through slipped strand mispairing or gene conversion. Using these studies we show how RipTALs of broad host range strains evolved convergently toward a shared target sequence. Finally, we discuss the differences between TALE-likes of plant pathogens in the context of disease ecology.

  1. TAL effector-DNA specificity.

    PubMed

    Scholze, Heidi; Boch, Jens

    2010-01-01

    TAL effectors are important virulence factors of bacterial plant pathogenic Xanthomonas, which infect a wide variety of plants including valuable crops like pepper, rice, and citrus. TAL proteins are translocated via the bacterial type III secretion system into host cells and induce transcription of plant genes by binding to target gene promoters. Members of the TAL effector family differ mainly in their central domain of tandemly arranged repeats of typically 34 amino acids each with hypervariable di-amino acids at positions 12 and 13. We recently showed that target DNA-recognition specificity of TAL effectors is encoded in a modular and clearly predictable mode. The repeats of TAL effectors feature a surprising one repeat-to-one-bp correlation with different repeat types exhibiting a different DNA base pair specificity. Accordingly, we predicted DNA specificities of TAL effectors and generated artificial TAL proteins with novel DNA recognition specificities. We describe here novel artificial TALs and discuss implications for the DNA recognition specificity. The unique TAL-DNA binding domain allows design of proteins with potentially any given DNA recognition specificity enabling many uses for biotechnology.

  2. A family of long intergenic non-coding RNA genes in human chromosomal region 22q11.2 carry a DNA translocation breakpoint/AT-rich sequence

    PubMed Central

    2018-01-01

    FAM230C, a long intergenic non-coding RNA (lincRNA) gene in human chromosome 13 (chr13) is a member of lincRNA genes termed family with sequence similarity 230. An analysis using bioinformatics search tools and alignment programs was undertaken to determine properties of FAM230C and its related genes. Results reveal that the DNA translocation element, the Translocation Breakpoint Type A (TBTA) sequence, which consists of satellite DNA, Alu elements, and AT-rich sequences is embedded in the FAM230C gene. Eight lincRNA genes related to FAM230C also carry the TBTA sequences. These genes were formed from a large segment of the 3’ half of the FAM230C sequence duplicated in chr22, and are specifically in regions of low copy repeats (LCR22)s, in or close to the 22q.11.2 region. 22q11.2 is a chromosomal segment that undergoes a high rate of DNA translocation and is prone to genetic deletions. FAM230C-related genes present in other chromosomes do not carry the TBTA motif and were formed from the 5’ half region of the FAM230C sequence. These findings identify a high specificity in lincRNA gene formation by gene sequence duplication in different chromosomes. PMID:29668722

  3. In silico modeling of epigenetic-induced changes in photoreceptor cis-regulatory elements.

    PubMed

    Hossain, Reafa A; Dunham, Nicholas R; Enke, Raymond A; Berndsen, Christopher E

    2018-01-01

    DNA methylation is a well-characterized epigenetic repressor of mRNA transcription in many plant and vertebrate systems. However, the mechanism of this repression is not fully understood. The process of transcription is controlled by proteins that regulate recruitment and activity of RNA polymerase by binding to specific cis-regulatory sequences. Cone-rod homeobox (CRX) is a well-characterized mammalian transcription factor that controls photoreceptor cell-specific gene expression. Although much is known about the functions and DNA binding specificity of CRX, little is known about how DNA methylation modulates CRX binding affinity to genomic cis-regulatory elements. We used bisulfite pyrosequencing of human ocular tissues to measure DNA methylation levels of the regulatory regions of RHO , PDE6B, PAX6 , and LINE1 retrotransposon repeats. To describe the molecular mechanism of repression, we used molecular modeling to illustrate the effect of DNA methylation on human RHO regulatory sequences. In this study, we demonstrate an inverse correlation between DNA methylation in regulatory regions adjacent to the human RHO and PDE6B genes and their subsequent transcription in human ocular tissues. Docking of CRX to the DNA models shows that CRX interacts with the grooves of these sequences, suggesting changes in groove structure could regulate binding. Molecular dynamics simulations of the RHO promoter and enhancer regions show changes in the flexibility and groove width upon epigenetic modification. Models also demonstrate changes in the local dynamics of CRX binding sites within RHO regulatory sequences which may account for the repression of CRX-dependent transcription. Collectively, these data demonstrate epigenetic regulation of CRX binding sites in human retinal tissue and provide insight into the mechanism of this mode of epigenetic regulation to be tested in future experiments.

  4. The CRISPR conundrum: evolve and maybe die, or survive and risk stagnation

    PubMed Central

    García-Martínez, Jesús; Maldonado, Rafael D.; Guzmán, Noemí M.; Mojica, Francisco J. M.

    2018-01-01

    CRISPR-Cas represents a prokaryotic defense mechanism against invading genetic elements. Although there is a diversity of CRISPR-Cas systems, they all share similar, essential traits. In general, a CRISPR-Cas system consists of one or more groups of DNA repeats named CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats), regularly separated by unique sequences referred to as spacers, and a set of functionally associated cas (CRISPR associated) genes typically located next to one of the repeat arrays. The origin of spacers is in many cases unknown but, when ascertained, they usually match foreign genetic molecules. The proteins encoded by some of the cas genes are in charge of the incorporation of new spacers upon entry of a genetic element. Other Cas proteins participate in generating CRISPR-spacer RNAs and perform the task of destroying nucleic acid molecules carrying sequences similar to the spacer. In this way, CRISPR-Cas provides protection against genetic intruders that could substantially affect the cell viability, thus acting as an adaptive immune system. However, this defensive action also hampers the acquisition of potentially beneficial, horizontally transferred genes, undermining evolution. Here we cover how the model bacterium Escherichia coli deals with CRISPR-Cas to tackle this major dilemma, evolution versus survival. PMID:29850463

  5. Bursts of retrotransposition reproduced in Arabidopsis.

    PubMed

    Tsukahara, Sayuri; Kobayashi, Akie; Kawabe, Akira; Mathieu, Olivier; Miura, Asuka; Kakutani, Tetsuji

    2009-09-17

    Retrotransposons, which proliferate by reverse transcription of RNA intermediates, comprise a major portion of plant genomes. Plants often change the genome size and organization during evolution by rapid proliferation and deletion of long terminal repeat (LTR) retrotransposons. Precise transposon sequences throughout the Arabidopsis thaliana genome and the trans-acting mutations affecting epigenetic states make it an ideal model organism with which to study transposon dynamics. Here we report the mobilization of various families of endogenous A. thaliana LTR retrotransposons identified through genetic and genomic approaches with high-resolution genomic tiling arrays and mutants in the chromatin-remodelling gene DDM1 (DECREASE IN DNA METHYLATION 1). Using multiple lines of self-pollinated ddm1 mutant, we detected an increase in copy number, and verified this for various retrotransposons in a gypsy family (ATGP3) and copia families (ATCOPIA13, ATCOPIA21, ATCOPIA93), and also for a DNA transposon of a Mutator family, VANDAL21. A burst of retrotransposition occurred stochastically and independently for each element, suggesting an additional autocatalytic process. Furthermore, comparison of the identified LTR retrotransposons in related Arabidopsis species revealed that a lineage-specific burst of retrotransposition of these elements did indeed occur in natural Arabidopsis populations. The recent burst of retrotransposition in natural population is targeted to centromeric repeats, which is presumably less harmful than insertion into genes. The ddm1-induced retrotransposon proliferations and genome rearrangements mimic the transposon-mediated genome dynamics during evolution and provide experimental systems with which to investigate the controlling molecular factors directly.

  6. Medium-sized tandem repeats represent an abundant component of the Drosophila virilis genome.

    PubMed

    Abdurashitov, Murat A; Gonchar, Danila A; Chernukhin, Valery A; Tomilov, Victor N; Tomilova, Julia E; Schostak, Natalia G; Zatsepina, Olga G; Zelentsova, Elena S; Evgen'ev, Michael B; Degtyarev, Sergey K H

    2013-11-09

    Previously, we developed a simple method for carrying out a restriction enzyme analysis of eukaryotic DNA in silico, based on the known DNA sequences of the genomes. This method allows the user to calculate lengths of all DNA fragments that are formed after a whole genome is digested at the theoretical recognition sites of a given restriction enzyme. A comparison of the observed peaks in distribution diagrams with the results from DNA cleavage using several restriction enzymes performed in vitro have shown good correspondence between the theoretical and experimental data in several cases. Here, we applied this approach to the annotated genome of Drosophila virilis which is extremely rich in various repeats. Here we explored the combined approach to perform the restriction analysis of D. virilis DNA. This approach enabled to reveal three abundant medium-sized tandem repeats within the D. virilis genome. While the 225 bp repeats were revealed previously in intergenic non-transcribed spacers between ribosomal genes of D. virilis, two other families comprised of 154 bp and 172 bp repeats were not described. Tandem Repeats Finder search demonstrated that 154 bp and 172 bp units are organized in multiple clusters in the genome of D. virilis. Characteristically, only 154 bp repeats derived from Helitron transposon are transcribed. Using in silico digestion in combination with conventional restriction analysis and sequencing of repeated DNA fragments enabled us to isolate and characterize three highly abundant families of medium-sized repeats present in the D. virilis genome. These repeats comprise a significant portion of the genome and may have important roles in genome function and structural integrity. Therefore, we demonstrated an approach which makes possible to investigate in detail the gross arrangement and expression of medium-sized repeats basing on sequencing data even in the case of incompletely assembled and/or annotated genomes.

  7. Structure of chromatin and the linking number of DNA.

    PubMed Central

    Worcel, A; Strogatz, S; Riley, D

    1981-01-01

    Recent observations suggest that the basic supranucleosomal structure of chromatin is a zigzag helical ribbon with a repeat unit made of two nucleosomes connected by a relaxed spacer DNA. A remarkable feature of one particular ribbon is that it solves the apparent paradox between the number of DNA turns per nucleosome and the total linking number of a nucleosome-containing closed circular DNA molecule. We show here that the repeat unit of the proposed structure, which contains two nucleosomes with -1 3/4 DNA turns per nucleosome and one spacer crossover per repeat, contributes -2 to the linking number of closed circular DNA. Space-filling models show that the cylindrical 250-A chromatin fiber can be generated by twisting the ribbon. Images PMID:6940168

  8. Disease-associated repeat instability and mismatch repair.

    PubMed

    Schmidt, Monika H M; Pearson, Christopher E

    2016-02-01

    Expanded tandem repeat sequences in DNA are associated with at least 40 human genetic neurological, neurodegenerative, and neuromuscular diseases. Repeat expansion can occur during parent-to-offspring transmission, and arise at variable rates in specific tissues throughout the life of an affected individual. Since the ongoing somatic repeat expansions can affect disease age-of-onset, severity, and progression, targeting somatic expansion holds potential as a therapeutic target. Thus, understanding the factors that regulate this mutation is crucial. DNA repair, in particular mismatch repair (MMR), is the major driving force of disease-associated repeat expansions. In contrast to its anti-mutagenic roles, mammalian MMR curiously drives the expansion mutations of disease-associated (CAG)·(CTG) repeats. Recent advances have broadened our knowledge of both the MMR proteins involved in disease repeat expansions, including: MSH2, MSH3, MSH6, MLH1, PMS2, and MLH3, as well as the types of repeats affected by MMR, now including: (CAG)·(CTG), (CGG)·(CCG), and (GAA)·(TTC) repeats. Mutagenic slipped-DNA structures have been detected in patient tissues, and the size of the slip-out and their junction conformation can determine the involvement of MMR. Furthermore, the formation of other unusual DNA and R-loop structures is proposed to play a key role in MMR-mediated instability. A complex correlation is emerging between tissues showing varying amounts of repeat instability and MMR expression levels. Notably, naturally occurring polymorphic variants of DNA repair genes can have dramatic effects upon the levels of repeat instability, which may explain the variation in disease age-of-onset, progression and severity. An increasing grasp of these factors holds prognostic and therapeutic potential. Copyright © 2015 Elsevier B.V. All rights reserved.

  9. Cytogenetic Analysis of Populus trichocarpa - Ribosomal DNA, Telomere Repeat Sequence, and Marker-selected BACs

    Treesearch

    M.N. lslam-Faridi; C.D. Nelson; S.P. DiFazio; L.E. Gunter; G.A. Tuskan

    2009-01-01

    The 185-285 rDNA and 55 rDNA loci in Populus trichocarpa were localized using fluorescent in situ hybridization (FISH). Two 185-285 rDNA sites and one 55 rDNA site were identified and located at the ends of 3 different chromosomes. FISH signals from the Arabidopsis-type telomere repeat sequence were observed at the distal ends of each chromosome. Six BAC clones...

  10. Identification of Genetic Elements Associated with EPSPS Gene Amplification

    PubMed Central

    Gaines, Todd A.; Wright, Alice A.; Molin, William T.; Lorentz, Lothar; Riggins, Chance W.; Tranel, Patrick J.; Beffa, Roland; Westra, Philip; Powles, Stephen B.

    2013-01-01

    Weed populations can have high genetic plasticity and rapid responses to environmental selection pressures. For example, 100-fold amplification of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene evolved in the weed species Amaranthus palmeri to confer resistance to glyphosate, the world’s most important herbicide. However, the gene amplification mechanism is unknown. We sequenced the EPSPS gene and genomic regions flanking EPSPS loci in A. palmeri, and searched for mobile genetic elements or repetitive sequences. The EPSPS gene was 10,229 bp, containing 8 exons and 7 introns. The gene amplification likely proceeded through a DNA-mediated mechanism, as introns exist in the amplified gene copies and the entire amplified sequence is at least 30 kb in length. Our data support the presence of two EPSPS loci in susceptible (S) A. palmeri, and that only one of these was amplified in glyphosate-resistant (R) A. palmeri. The EPSPS gene amplification event likely occurred recently, as no sequence polymorphisms were found within introns of amplified EPSPS copies from R individuals. Sequences with homology to miniature inverted-repeat transposable elements (MITEs) were identified next to EPSPS gene copies only in R individuals. Additionally, a putative Activator (Ac) transposase and a repetitive sequence region were associated with amplified EPSPS genes. The mechanism controlling this DNA-mediated amplification remains unknown. Further investigation is necessary to determine if the gene amplification may have proceeded via DNA transposon-mediated replication, and/or unequal recombination between different genomic regions resulting in replication of the EPSPS gene. PMID:23762434

  11. Global analysis of DNA methylation in young (J1) and senescent (J2) Gossypium hirsutum L. cotyledons by MeDIP-Seq

    PubMed Central

    Dou, Lingling; Jia, Xiaoyun; Wei, Hengling; Fan, Shuli; Wang, Hantao; Guo, Yaning; Duan, Shan; Pang, Chaoyou; Yu, Shuxun

    2017-01-01

    DNA methylation is an important epigenetic modification regulating gene expression, genomic imprinting, transposon silencing and chromatin structure in plants and plays an important role in leaf senescence. However, the DNA methylation pattern during Gossypium hirsutum L. cotyledon senescence is poorly understood. In this study, global DNA methylation patterns were compared between two cotyledon development stages, young (J1) and senescence (J2), using methylated DNA immunoprecipitation (MeDIP-Seq). Methylated cytosine occurred mostly in repeat elements, especially LTR/Gypsy in both J1 and J2. When comparing J1 against J2, there were 1222 down-methylated genes and 623 up-methylated genes. Methylated genes were significantly enriched in carbohydrate metabolism, biosynthesis of other secondary metabolites and amino acid metabolism pathways. The global DNA methylation level decreased from J1 to J2, especially in gene promoters, transcriptional termination regions and regions around CpG islands. We further investigated the expression patterns of 9 DNA methyltransferase-associated genes and 2 DNA demethyltransferase-associated genes from young to senescent cotyledons, which were down-regulated during cotyledon development. In this paper, we first reported that senescent cotton cotyledons exhibited lower DNA methylation levels, primarily due to decreased DNA methyltransferase activity and which also play important role in regulating secondary metabolite process. PMID:28715427

  12. DNA replication stress restricts ribosomal DNA copy number.

    PubMed

    Salim, Devika; Bradford, William D; Freeland, Amy; Cady, Gillian; Wang, Jianmin; Pruitt, Steven C; Gerton, Jennifer L

    2017-09-01

    Ribosomal RNAs (rRNAs) in budding yeast are encoded by ~100-200 repeats of a 9.1kb sequence arranged in tandem on chromosome XII, the ribosomal DNA (rDNA) locus. Copy number of rDNA repeat units in eukaryotic cells is maintained far in excess of the requirement for ribosome biogenesis. Despite the importance of the repeats for both ribosomal and non-ribosomal functions, it is currently not known how "normal" copy number is determined or maintained. To identify essential genes involved in the maintenance of rDNA copy number, we developed a droplet digital PCR based assay to measure rDNA copy number in yeast and used it to screen a yeast conditional temperature-sensitive mutant collection of essential genes. Our screen revealed that low rDNA copy number is associated with compromised DNA replication. Further, subculturing yeast under two separate conditions of DNA replication stress selected for a contraction of the rDNA array independent of the replication fork blocking protein, Fob1. Interestingly, cells with a contracted array grew better than their counterparts with normal copy number under conditions of DNA replication stress. Our data indicate that DNA replication stresses select for a smaller rDNA array. We speculate that this liberates scarce replication factors for use by the rest of the genome, which in turn helps cells complete DNA replication and continue to propagate. Interestingly, tumors from mini chromosome maintenance 2 (MCM2)-deficient mice also show a loss of rDNA repeats. Our data suggest that a reduction in rDNA copy number may indicate a history of DNA replication stress, and that rDNA array size could serve as a diagnostic marker for replication stress. Taken together, these data begin to suggest the selective pressures that combine to yield a "normal" rDNA copy number.

  13. DNA sequence homology induces cytosine-to-thymine mutation by a heterochromatin-related pathway in Neurospora

    PubMed Central

    Gladyshev, Eugene; Kleckner, Nancy

    2017-01-01

    Eukaryotic genomes contain substantial amounts of repetitive DNA organized in the form of constitutive heterochromatin and associated with repressive epigenetic modifications, such as H3K9me3 and C5-cytosine methylation (5mC). In the fungus Neurospora crassa, H3K9me3 and 5mC are catalyzed, respectively, by a conserved SUV39 histone methyltransferase DIM-5 and a DNMT1-like cytosine methyltransferase DIM-2. Here we show that DIM-2 can also mediate Repeat-Induced Point mutation (RIP) of repetitive DNA in N. crassa. We further show that DIM-2-dependent RIP requires DIM-5, HP1, and other known heterochromatin factors, implying the role of a repeat-induced heterochromatin-related process. Our previous findings suggest that the mechanism of repeat recognition for RIP involves direct interactions between homologous double-stranded (ds) DNA segments. We thus now propose that, in somatic cells, homologous dsDNA/dsDNA interactions between a small number of repeat copies can nucleate a transient heterochromatic state, which, on longer repeat arrays, may lead to the formation of constitutive heterochromatin. PMID:28459455

  14. Insights on genome size evolution from a miniature inverted repeat transposon driving a satellite DNA.

    PubMed

    Scalvenzi, Thibault; Pollet, Nicolas

    2014-12-01

    The genome size in eukaryotes does not correlate well with the number of genes they contain. We can observe this so-called C-value paradox in amphibian species. By analyzing an amphibian genome we asked how repetitive DNA can impact genome size and architecture. We describe here our discovery of a Tc1/mariner miniature inverted-repeat transposon family present in Xenopus frogs. These transposons named miDNA4 are unique since they contain a satellite DNA motif. We found that miDNA4 measured 331 bp, contained 25 bp long inverted terminal repeat sequences and a sequence motif of 119 bp present as a unique copy or as an array of 2-47 copies. We characterized the structure, dynamics, impact and evolution of the miDNA4 family and its satellite DNA in Xenopus frog genomes. This led us to propose a model for the evolution of these two repeated sequences and how they can synergize to increase genome size. Copyright © 2014 Elsevier Inc. All rights reserved.

  15. Protection of CpG islands from DNA methylation is DNA-encoded and evolutionarily conserved

    PubMed Central

    Long, Hannah K.; King, Hamish W.; Patient, Roger K.; Odom, Duncan T.; Klose, Robert J.

    2016-01-01

    DNA methylation is a repressive epigenetic modification that covers vertebrate genomes. Regions known as CpG islands (CGIs), which are refractory to DNA methylation, are often associated with gene promoters and play central roles in gene regulation. Yet how CGIs in their normal genomic context evade the DNA methylation machinery and whether these mechanisms are evolutionarily conserved remains enigmatic. To address these fundamental questions we exploited a transchromosomic animal model and genomic approaches to understand how the hypomethylated state is formed in vivo and to discover whether mechanisms governing CGI formation are evolutionarily conserved. Strikingly, insertion of a human chromosome into mouse revealed that promoter-associated CGIs are refractory to DNA methylation regardless of host species, demonstrating that DNA sequence plays a central role in specifying the hypomethylated state through evolutionarily conserved mechanisms. In contrast, elements distal to gene promoters exhibited more variable methylation between host species, uncovering a widespread dependence on nucleotide frequency and occupancy of DNA-binding transcription factors in shaping the DNA methylation landscape away from gene promoters. This was exemplified by young CpG rich lineage-restricted repeat sequences that evaded DNA methylation in the absence of co-evolved mechanisms targeting methylation to these sequences, and species specific DNA binding events that protected against DNA methylation in CpG poor regions. Finally, transplantation of mouse chromosomal fragments into the evolutionarily distant zebrafish uncovered the existence of a mechanistically conserved and DNA-encoded logic which shapes CGI formation across vertebrate species. PMID:27084945

  16. Assessing Diversity of DNA Structure-Related Sequence Features in Prokaryotic Genomes

    PubMed Central

    Huang, Yongjie; Mrázek, Jan

    2014-01-01

    Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877

  17. Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights

    PubMed Central

    Kumar, Amitha Sampath; Sowpati, Divya Tej; Mishra, Rakesh K.

    2016-01-01

    Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acids repeats (SAARs) in the proteome. While SSRs in the coding region always result in amino acid repeats, a majority of SAARs arise due to a combination of various codons representing the same amino acid and not as a consequence of SSR events. Certain amino acids are abundant in repeat regions indicating a positive selection pressure behind the accumulation of SAARs. By analysing 22 proteomes including the human proteome, we explored the functional and structural relationship of amino acid repeats in an evolutionary context. Only ~15% of repeats are present in any known functional domain, while ~74% of repeats are present in the disordered regions, suggesting that SAARs add to the functionality of proteins by providing flexibility, stability and act as linker elements between domains. Comparison of SAAR containing proteins across species reveals that while shorter repeats are conserved among orthologs, proteins with longer repeats, >15 amino acids, are unique to the respective organism. Lysine repeats are well conserved among orthologs with respect to their length and number of occurrences in a protein. Other amino acids such as glutamic acid, proline, serine and alanine repeats are generally conserved among the orthologs with varying repeat lengths. These findings suggest that SAARs have accumulated in the proteome under positive selection pressure and that they provide flexibility for optimal folding of functional/structural domains of proteins. The insights gained from our observations can help in effective designing and engineering of proteins with novel features. PMID:27893794

  18. Long Terminal Repeat Retrotransposon Content in Eight Diploid Sunflower Species Inferred from Next-Generation Sequence Data

    PubMed Central

    Tetreault, Hannah M.; Ungerer, Mark C.

    2016-01-01

    The most abundant transposable elements (TEs) in plant genomes are Class I long terminal repeat (LTR) retrotransposons represented by superfamilies gypsy and copia. Amplification of these superfamilies directly impacts genome structure and contributes to differential patterns of genome size evolution among plant lineages. Utilizing short-read Illumina data and sequence information from a panel of Helianthus annuus (sunflower) full-length gypsy and copia elements, we explore the contribution of these sequences to genome size variation among eight diploid Helianthus species and an outgroup taxon, Phoebanthus tenuifolius. We also explore transcriptional dynamics of these elements in both leaf and bud tissue via RT-PCR. We demonstrate that most LTR retrotransposon sublineages (i.e., families) display patterns of similar genomic abundance across species. A small number of LTR retrotransposon sublineages exhibit lineage-specific amplification, particularly in the genomes of species with larger estimated nuclear DNA content. RT-PCR assays reveal that some LTR retrotransposon sublineages are transcriptionally active across all species and tissue types, whereas others display species-specific and tissue-specific expression. The species with the largest estimated genome size, H. agrestis, has experienced amplification of LTR retrotransposon sublineages, some of which have proliferated independently in other lineages in the Helianthus phylogeny. PMID:27233667

  19. A Dynamic Tandem Repeat in Monocotyledons Inferred from a Comparative Analysis of Chloroplast Genomes in Melanthiaceae.

    PubMed

    Do, Hoang Dang Khoa; Kim, Joo-Hwan

    2017-01-01

    Chloroplast genomes (cpDNA) are highly valuable resources for evolutionary studies of angiosperms, since they are highly conserved, are small in size, and play critical roles in plants. Slipped-strand mispairing (SSM) was assumed to be a mechanism for generating repeat units in cpDNA. However, research on the employment of different small repeated sequences through SSM events, which may induce the accumulation of distinct types of repeats within the same region in cpDNA, has not been documented. Here, we sequenced two chloroplast genomes from the endemic species Heloniopsis tubiflora (Korea) and Xerophyllum tenax (USA) to cover the gap between molecular data and explore "hot spots" for genomic events in Melanthiaceae. Comparative analysis of 23 complete cpDNA sequences revealed that there were different stages of deletion in the rps16 region across the Melanthiaceae. Based on the partial or complete loss of rps16 gene in cpDNA, we have firstly reported potential molecular markers for recognizing two sections ( Veratrum and Fuscoveratrum ) of Veratrum . Melathiaceae exhibits a significant change in the junction between large single copy and inverted repeat regions, ranging from trnH_GUG to a part of rps3 . Our results show an accumulation of tandem repeats in the rpl23-ycf2 regions of cpDNAs. Small conserved sequences exist and flank tandem repeats in further observation of this region across most of the examined taxa of Liliales. Therefore, we propose three scenarios in which different small repeated sequences were used during SSM events to generate newly distinct types of repeats. Occasionally, prior to the SSM process, point mutation event and double strand break repair occurred and induced the formation of initial repeat units which are indispensable in the SSM process. SSM may have likely occurred more frequently for short repeats than for long repeat sequences in tribe Parideae (Melanthiaceae, Liliales). Collectively, these findings add new evidence of dynamic results from SSM in chloroplast genomes which can be useful for further evolutionary studies in angiosperms. Additionally, genomics events in cpDNA are potential resources for mining molecular markers in Liliales.

  20. Transcriptional activity of the homopurine-homopyrimidine repeat of the c-Ki-ras promoter is independent of its H-forming potential.

    PubMed Central

    Raghu, G; Tevosian, S; Anant, S; Subramanian, K N; George, D L; Mirkin, S M

    1994-01-01

    The mouse c-Ki-ras protooncogene promoter contains an unusual DNA element consisting of a 27 bp-long homopurine-homopyrimidine mirror repeat (H-motif) adjacent to a d(C-G)5 repeat. We have previously shown that in vitro these repeats may adopt H and Z conformations, respectively, causing nuclease and chemical hypersensitivity. Here we have studied the functional role of these DNA stretches using fine deletion analysis of the promoter and a transient transcription assay in vivo. We found that while the H-motif is responsible for approximately half of the promoter activity in both mouse and human cell lines, the Z-forming sequence exhibits little, if any, such activity. Mutational changes introduced within the homopurine-homopyrimidine stretch showed that its sequence integrity, rather than its H-forming potential, is responsible for its effect on transcription. Electrophoretic mobility shift assays revealed that the putative H-motif tightly binds several nuclear proteins, one of which is likely to be transcription factor Sp1, as determined by competition experiments. Southwestern hybridization studies detected two major proteins specifically binding to the H-motif: a 97 kD protein which presumably corresponds to Sp1 and another protein of 60 kD in human and 64 kD in mouse cells. We conclude that the homopurine-homopyrimidine stretch is required for full transcriptional activity of the c-Ki-ras promoter and at least two distinct factors, Sp1 and an unidentified protein, potentially contribute to the positive effect on transcription. Images PMID:8078760

  1. Survival and Evolution of CRISPR–Cas System in Prokaryotes and Its Applications

    PubMed Central

    Shabbir, Muhammad Abu Bakr; Hao, Haihong; Shabbir, Muhammad Zubair; Hussain, Hafiz Iftikhar; Iqbal, Zahid; Ahmed, Saeed; Sattar, Adeel; Iqbal, Mujahid; Li, Jun; Yuan, Zonghui

    2016-01-01

    Prokaryotes have developed numerous innate immune mechanisms in order to fend off bacteriophage or plasmid attack. One of these immune systems is clustered regularly interspaced short palindromic repeats (CRISPR). CRISPR-associated proteins play a key role in survival of prokaryotes against invaders, as these systems cleave DNA of foreign genetic elements. Beyond providing immunity, these systems have significant impact in altering the bacterial physiology in term of its virulence and pathogenicity, as well as evolution. Also, due to their diverse nature of functionality, cas9 endoribonuclease can be easily reprogrammed with the help of guide RNAs, showing unprecedented potential and significance for gene editing in treating genetic diseases. Here, we also discuss the use of NgAgo–gDNA system in genome editing of human cells. PMID:27725818

  2. G-quadruplex-interacting compounds alter latent DNA replication and episomal persistence of KSHV

    PubMed Central

    Madireddy, Advaitha; Purushothaman, Pravinkumar; Loosbroock, Christopher P.; Robertson, Erle S.; Schildkraut, Carl L.; Verma, Subhash C.

    2016-01-01

    Kaposi's sarcoma associated herpesvirus (KSHV) establishes life-long latent infection by persisting as an extra-chromosomal episome in the infected cells and by maintaining its genome in dividing cells. KSHV achieves this by tethering its epigenome to the host chromosome by latency associated nuclear antigen (LANA), which binds in the terminal repeat (TR) region of the viral genome. Sequence analysis of the TR, a GC-rich DNA element, identified several potential Quadruplex G-Rich Sequences (QGRS). Since quadruplexes have the tendency to obstruct DNA replication, we used G-quadruplex stabilizing compounds to examine their effect on latent DNA replication and the persistence of viral episomes. Our results showed that these G-quadruplex stabilizing compounds led to the activation of dormant origins of DNA replication, with preferential bi-directional pausing of replications forks moving out of the TR region, implicating the role of the G-rich TR in the perturbation of episomal DNA replication. Over time, treatment with PhenDC3 showed a loss of viral episomes in the infected cells. Overall, these data show that G-quadruplex stabilizing compounds retard the progression of replication forks leading to a reduction in DNA replication and episomal maintenance. These results suggest a potential role for G-quadruplex stabilizers in the treatment of KSHV-associated diseases. PMID:26837574

  3. The Molecular Structure of Te146 and Its Derivatives in Drosophila Melanogaster

    PubMed Central

    Lovering, R.; Harden, N.; Ashburner, M.

    1991-01-01

    TE146 is a giant transposon of Drosophila melanogaster. It carries two copies of the white and roughest genes, normally found on the X chromosome. The structure of this transposon has been studied at the molecular level. TE146 may transpose to new chromosome positions, excise and be lost from the genome or undergo internal rearrangements. The termini of TE146 are foldback DNA elements (FB); the transposon also carries two internal FB elements. Loss or internal rearrangement of TE146 involves recombination between different FB elements. These events have been mapped molecularly, by taking advantage of the fact that the FB sequences are composed largely of a regular 155-bp repeat sequence that is cut by the restriction enzyme TaqI, and are shown to be nonrandom. We suggest that these FB-FB exchange events occur by mitotic sister-chromatid exchange in the premeiotic germ line. PMID:1649070

  4. Complete genome sequence of a novel extrachromosomal virus-like element identified in planarian Girardia tigrina

    PubMed Central

    Rebrikov, Denis V; Bulina, Maria E; Bogdanova, Ekaterina A; Vagner, Loura L; Lukyanov, Sergey A

    2002-01-01

    Background Freshwater planarians are widely used as models for investigation of pattern formation and studies on genetic variation in populations. Despite extensive information on the biology and genetics of planaria, the occurrence and distribution of viruses in these animals remains an unexplored area of research. Results Using a combination of Suppression Subtractive Hybridization (SSH) and Mirror Orientation Selection (MOS), we compared the genomes of two strains of freshwater planarian, Girardia tigrina. The novel extrachromosomal DNA-containing virus-like element denoted PEVE (Planarian Extrachromosomal Virus-like Element) was identified in one planarian strain. The PEVE genome (about 7.5 kb) consists of two unique regions (Ul and Us) flanked by inverted repeats. Sequence analyses reveal that PEVE comprises two helicase-like sequences in the genome, of which the first is a homolog of a circoviral replication initiator protein (Rep), and the second is similar to the papillomavirus E1 helicase domain. PEVE genome exists in at least two variant forms with different arrangements of single-stranded and double-stranded DNA stretches that correspond to the Us and Ul regions. Using PCR analysis and whole-mount in situ hybridization, we characterized PEVE distribution and expression in the planarian body. Conclusions PEVE is the first viral element identified in free-living flatworms. This element differs from all known viruses and viral elements, and comprises two potential helicases that are homologous to proteins from distant viral phyla. PEVE is unevenly distributed in the worm body, and is detected in specific parenchyma cells. PMID:12065025

  5. The mitochondrial genome of the gymnosperm Cycas taitungensis contains a novel family of short interspersed elements, Bpu sequences, and abundant RNA editing sites.

    PubMed

    Chaw, Shu-Miaw; Shih, Arthur Chun-Chieh; Wang, Daryi; Wu, Yu-Wei; Liu, Shu-Mei; Chou, The-Yuan

    2008-03-01

    The mtDNA of Cycas taitungensis is a circular molecule of 414,903 bp, making it 2- to 6-fold larger than the known mtDNAs of charophytes and bryophytes, but similar to the average of 7 elucidated angiosperm mtDNAs. It is characterized by abundant RNA editing sites (1,084), more than twice the number found in the angiosperm mtDNAs. The A + T content of Cycas mtDNA is 53.1%, the lowest among known land plants. About 5% of the Cycas mtDNA is composed of a novel family of mobile elements, which we designated as "Bpu sequences." They share a consensus sequence of 36 bp with 2 terminal direct repeats (AAGG) and a recognition site for the Bpu 10I restriction endonuclease (CCTGAAGC). Comparison of the Cycas mtDNA with other plant mtDNAs revealed many new insights into the biology and evolution of land plant mtDNAs. For example, the noncoding sequences in mtDNAs have drastically expanded as land plants have evolved, with abrupt increases appearing in the bryophytes, and then in the seed plants. As a result, the genomic organizations of seed plant mtDNAs are much less compact than in other plants. Also, the Cycas mtDNA appears to have been exempted from the frequent gene loss observed in angiosperm mtDNAs. Similar to the angiosperms, the 3 Cycas genes nad1, nad2, and nad5 are disrupted by 5 group II intron squences, which have brought the genes into trans-splicing arrangements. The evolutionary origin and invasion/duplication mechanism of the Bpu sequences in Cycas mtDNA are hypothesized and discussed.

  6. Two different factors act separately or together to specify functionally distinct activities at a single transcriptional enhancer.

    PubMed Central

    DeFranco, D; Yamamoto, K R

    1986-01-01

    The expression of genes fused downstream of the Moloney murine sarcoma virus (MoMSV) long terminal repeat is stimulated by glucocorticoids. We mapped the glucocorticoid response element that conferred this hormonal regulation and found that it is a hormone-dependent transcriptional enhancer, designated Sg; it resides within DNA fragments that also carry a previously described enhancer element (B. Levinson, G. Khoury, G. Vande Woude, and P. Gruss, Nature [London] 295:568-572, 1982), here termed Sa, whose activity is independent of the hormone. Nuclease footprinting revealed that purified glucocorticoid receptor bound at multiple discrete sites within and at the borders of the tandemly repeated sequence motif that defines Sa. The Sa and Sg activities stimulated the apparent efficiency of cognate or heterologous promoter utilization, individually providing modest enhancement and in concert yielding higher levels of activity. A deletion mutant lacking most of the tandem repeat but retaining a single receptor footprint sequence lost Sa activity but still conferred Sg activity. The two enhancer components could also be distinguished physiologically: both were operative within cultured rat fibroblasts, but only Sg activity was detectable in rat exocrine pancreas cells. Therefore, the sequence determinants of Sa and Sg activity may be interdigitated, and when both components are active, the receptor and a putative Sa factor can apparently bind and act simultaneously. We concluded that MoMSV enhancer activity is effected by at least two distinct binding factors, suggesting that combinatorial regulation of promoter function can be mediated even from a single genetic element. Images PMID:3023887

  7. Laser mass spectrometry for DNA fingerprinting for forensic applications

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, C.H.; Tang, K.; Taranenko, N.I.

    The application of DNA fingerprinting has become very broad in forensic analysis, patient identification, diagnostic medicine, and wildlife poaching, since every individual`s DNA structure is identical within all tissues of their body. DNA fingerprinting was initiated by the use of restriction fragment length polymorphisms (RFLP). In 1987, Nakamura et al. found that a variable number of tandem repeats (VNTR) often occurred in the alleles. The probability of different individuals having the same number of tandem repeats in several different alleles is very low. Thus, the identification of VNTR from genomic DNA became a very reliable method for identification of individuals.more » DNA fingerprinting is a reliable tool for forensic analysis. In DNA fingerprinting, knowledge of the sequence of tandem repeats and restriction endonuclease sites can provide the basis for identification. The major steps for conventional DNA fingerprinting include (1) specimen processing (2) amplification of selected DNA segments by PCR, and (3) gel electrophoresis to do the final DNA analysis. In this work we propose to use laser desorption mass spectrometry for fast DNA fingerprinting. The process and advantages are discussed.« less

  8. Unusual Structure of the attB Site of the Site-Specific Recombination System of Lactobacillus delbrueckii Bacteriophage mv4

    PubMed Central

    Auvray, Frédéric; Coddeville, Michèle; Ordonez, Romy Catoira; Ritzenthaler, Paul

    1999-01-01

    The temperate phage mv4 integrates its genome into the chromosome of Lactobacillus delbrueckii subsp. bulgaricus by site-specific recombination within the 3′ end of a tRNASer gene. Recombination is catalyzed by the phage-encoded integrase and occurs between the phage attP site and the bacterial attB site. In this study, we show that the mv4 integrase functions in vivo in Escherichia coli and we characterize the bacterial attB site with a site-specific recombination test involving compatible plasmids carrying the recombination sites. The importance of particular nucleotides within the attB sequence was determined by site-directed mutagenesis. The structure of the attB site was found to be simple but rather unusual. A 16-bp DNA fragment was sufficient for function. Unlike most genetic elements that integrate their DNA into tRNA genes, none of the dyad symmetry elements of the tRNASer gene were present within the minimal attB site. No inverted repeats were detected within this site either, in contrast to the lambda site-specific recombination model. PMID:10572145

  9. Schizosaccharomyces pombe Retrotransposon Tf2 Mobilizes Primarily through Homologous cDNA Recombination

    PubMed Central

    Hoff, Eleanor F.; Levin, Henry L.; Boeke, Jef D.

    1998-01-01

    The Tf2 retrotransposon, found in the fission yeast Schizosaccharomyces pombe, is nearly identical to its sister element, Tf1, in its reverse transcriptase-RNase H and integrase domains but is very divergent in the gag domain, the protease, the 5′ untranslated region, and the U3 domain of the long terminal repeats. It has now been demonstrated that a neo-marked copy of Tf2 overexpressed from a heterologous promoter can mobilize into the S. pombe genome and produce true transposition events. However, the Tf2-neo mobilization frequency is 10- to 20-fold lower than that of Tf1-neo, and 70% of the Tf2-neo events are homologous recombination events generated independently of a functional Tf2 integrase. Thus, the Tf2 element is primarily dependent on homologous recombination with preexisting copies of Tf2 for its propagation. Finally, production of Tf2-neo proteins and cDNA was also analyzed; surprisingly, Tf2 was found to produce its reverse transcriptase as a single species in which it is fused to protease, unlike all other retroviruses and retrotransposons. PMID:9774697

  10. Structural and biophysical properties of h-FANCI ARM repeat protein.

    PubMed

    Siddiqui, Mohd Quadir; Choudhary, Rajan Kumar; Thapa, Pankaj; Kulkarni, Neha; Rajpurohit, Yogendra S; Misra, Hari S; Gadewal, Nikhil; Kumar, Satish; Hasan, Syed K; Varma, Ashok K

    2017-11-01

    Fanconi anemia complementation groups - I (FANCI) protein facilitates DNA ICL (Inter-Cross-link) repair and plays a crucial role in genomic integrity. FANCI is a 1328 amino acids protein which contains armadillo (ARM) repeats and EDGE motif at the C-terminus. ARM repeats are functionally diverse and evolutionarily conserved domain that plays a pivotal role in protein-protein and protein-DNA interactions. Considering the importance of ARM repeats, we have explored comprehensive in silico and in vitro approach to examine folding pattern. Size exclusion chromatography, dynamic light scattering (DLS) and glutaraldehyde crosslinking studies suggest that FANCI ARM repeat exist as monomer as well as in oligomeric forms. Circular dichroism (CD) and fluorescence spectroscopy results demonstrate that protein has predominantly α- helices and well-folded tertiary structure. DNA binding was analysed using electrophoretic mobility shift assay by autoradiography. Temperature-dependent CD, Fluorescence spectroscopy and DLS studies concluded that protein unfolds and start forming oligomer from 30°C. The existence of stable portion within FANCI ARM repeat was examined using limited proteolysis and mass spectrometry. The normal mode analysis, molecular dynamics and principal component analysis demonstrated that helix-turn-helix (HTH) motif present in ARM repeat is highly dynamic and has anti-correlated motion. Furthermore, FANCI ARM repeat has HTH structural motif which binds to double-stranded DNA.

  11. Environmental stress induces trinucleotide repeat mutagenesis in human cells

    PubMed Central

    Chatterjee, Nimrat; Lin, Yunfu; Santillan, Beatriz A.; Yotnda, Patricia; Wilson, John H.

    2015-01-01

    The dynamic mutability of microsatellite repeats is implicated in the modification of gene function and disease phenotype. Studies of the enhanced instability of long trinucleotide repeats (TNRs)—the cause of multiple human diseases—have revealed a remarkable complexity of mutagenic mechanisms. Here, we show that cold, heat, hypoxic, and oxidative stresses induce mutagenesis of a long CAG repeat tract in human cells. We show that stress-response factors mediate the stress-induced mutagenesis (SIM) of CAG repeats. We show further that SIM of CAG repeats does not involve mismatch repair, nucleotide excision repair, or transcription, processes that are known to promote TNR mutagenesis in other pathways of instability. Instead, we find that these stresses stimulate DNA rereplication, increasing the proportion of cells with >4 C-value (C) DNA content. Knockdown of the replication origin-licensing factor CDT1 eliminates both stress-induced rereplication and CAG repeat mutagenesis. In addition, direct induction of rereplication in the absence of stress also increases the proportion of cells with >4C DNA content and promotes repeat mutagenesis. Thus, environmental stress triggers a unique pathway for TNR mutagenesis that likely is mediated by DNA rereplication. This pathway may impact normal cells as they encounter stresses in their environment or during development or abnormal cells as they evolve metastatic potential. PMID:25775519

  12. Environmental stress induces trinucleotide repeat mutagenesis in human cells.

    PubMed

    Chatterjee, Nimrat; Lin, Yunfu; Santillan, Beatriz A; Yotnda, Patricia; Wilson, John H

    2015-03-24

    The dynamic mutability of microsatellite repeats is implicated in the modification of gene function and disease phenotype. Studies of the enhanced instability of long trinucleotide repeats (TNRs)-the cause of multiple human diseases-have revealed a remarkable complexity of mutagenic mechanisms. Here, we show that cold, heat, hypoxic, and oxidative stresses induce mutagenesis of a long CAG repeat tract in human cells. We show that stress-response factors mediate the stress-induced mutagenesis (SIM) of CAG repeats. We show further that SIM of CAG repeats does not involve mismatch repair, nucleotide excision repair, or transcription, processes that are known to promote TNR mutagenesis in other pathways of instability. Instead, we find that these stresses stimulate DNA rereplication, increasing the proportion of cells with >4 C-value (C) DNA content. Knockdown of the replication origin-licensing factor CDT1 eliminates both stress-induced rereplication and CAG repeat mutagenesis. In addition, direct induction of rereplication in the absence of stress also increases the proportion of cells with >4C DNA content and promotes repeat mutagenesis. Thus, environmental stress triggers a unique pathway for TNR mutagenesis that likely is mediated by DNA rereplication. This pathway may impact normal cells as they encounter stresses in their environment or during development or abnormal cells as they evolve metastatic potential.

  13. DNA methylation differences in exposed workers and nearby residents of the Ma Ta Phut industrial estate, Rayong, Thailand

    PubMed Central

    Peluso, Marco; Bollati, Valentina; Munnia, Armelle; Srivatanakul, Petcharin; Jedpiyawongse, Adisorn; Sangrajrang, Suleeporn; Piro, Sara; Ceppi, Marcello; Bertazzi, Pier Alberto; Boffetta, Paolo; Baccarelli, Andrea A

    2012-01-01

    Background Adverse biological effects from airborne pollutants are a primary environmental concern in highly industrialized areas. Recent studies linked air pollution exposures with altered blood Deoxyribo-nucleic acid (DNA) methylation, but effects from industrial sources and underlying biological mechanisms are still largely unexplored. Methods The Ma Ta Phut industrial estate (MIE) in Rayong, Thailand hosts one of the largest steel, oil refinery and petrochemical complexes in south-eastern Asia. We measured a panel of blood DNA methylation markers previously associated with air pollution exposures, including repeated elements [long interspersed nuclear element-1 (LINE-1) and Alu] and genes [p53, hypermethylated-in-cancer-1 (HIC1), p16 and interleukin-6 (IL-6)], in 67 MIE workers, 65 Ma Ta Phut residents and 45 rural controls. To evaluate the role of DNA damage and oxidation, we correlated DNA methylation measures with bulky DNA and 3-(2-deoxy-β-D-erythro-pentafuranosyl)pyrimido[1,2-α]purin-10(3H)-one deoxyguanosine (M1dG) adducts. Results In covariate-adjusted models, MIE workers, compared with rural residents, showed lower LINE-1 (74.8% vs 78.0%; P < 0.001), p53 (8.0% vs 15.7%; P < 0.001) and IL-6 methylation (39.2% vs 45.0%; P = 0.027) and higher HIC1 methylation (22.2% vs 15.3%, P < 0.001). For all four markers, Ma Ta Phut residents exhibited methylation levels intermediate between MIE workers and rural controls (LINE-1, 75.7%, P < 0.001; p53, 9.0%, P < 0.001; IL-6, 39.8%, P = 0.041; HIC1, 17.8%, P = 0.05; all P-values vs rural controls). Bulky DNA adducts showed negative correlation with p53 methylation (P = 0.01). M1dG showed negative correlations with LINE-1 (P = 0.003) and IL-6 methylation (P = 0.05). Conclusions Our findings indicate that industrial exposures may induce alterations of DNA methylation patterns detectable in blood leucocyte DNA. Correlation of DNA adducts with DNA hypomethylation suggests potential mediation by DNA damage. PMID:23064502

  14. Characterization of non-CG genomic hypomethylation associated with gamma-ray-induced suppression of CMT3 transcription in Arabidopsis thaliana.

    PubMed

    Kim, Ji Eun; Lee, Min Hee; Cho, Eun Ju; Kim, Ji Hong; Chung, Byung Yeoup; Kim, Jin-Hong

    2013-12-01

    Ionizing radiation causes various epigenetic changes, as well as a variety of DNA lesions such as strand breaks, cross-links, oxidative damages, etc., in genomes. However, radiation-induced epigenetic changes have rarely been substantiated in plant genomes. The current study investigates whether DNA methylation of Arabidopsis thaliana genome is altered by gamma rays. We found that genomic DNA methylation decreased in wild-type plants with increasing doses of gamma rays (5, 50 and 200 Gy). Irradiation with 200 Gy significantly increased the expression of transcriptionally inactive centromeric 180-bp (CEN) and transcriptionally silent information (TSI) repeats. This increase suggested that there was a substantial release of transcriptional gene silencing by gamma rays, probably by induction of DNA hypomethylation. High expression of the DNA demethylase ROS1 and low expression of the DNA methyltransferase CMT3 supported this hypothesis. Moreover, Southern blot analysis following digestion of genomic DNA with methylation-sensitive enzymes revealed that the DNA hypomethylation occured preferentially at CHG or CHH sites rather than CG sites, depending on the radiation dose. Unlike CEN and TSI repeats, the number of Ta3, AtSN1 and FWA repeats decreased in transcription but increased in non-CG methylation. In addition, the cmt3-11 mutant showed neither DNA hypomethylation nor transcriptional activation of silenced repeats upon gamma irradiation. Furthermore, profiles of genome-wide transcriptomes in response to gamma rays differed between the wild-type and cmt3-11 mutant. These results suggest that gamma irradiation induced DNA hypomethylation preferentially at non-CG sites of transcriptionally inactive repeats in a locus-specific manner, which depends on CMT3 activity.

  15. FA-SAT Is an Old Satellite DNA Frozen in Several Bilateria Genomes

    PubMed Central

    Chaves, Raquel; Ferreira, Daniela; Mendes-da-Silva, Ana; Meles, Susana; Adega, Filomena

    2017-01-01

    Abstract In recent years, a growing body of evidence has recognized the tandem repeat sequences, and specifically satellite DNA, as a functional class of sequences in the genomic “dark matter.” Using an original, complementary, and thus an eclectic experimental design, we show that the cat archetypal satellite DNA sequence, FA-SAT, is “frozen” conservatively in several Bilateria genomes. We found different genomic FA-SAT architectures, and the interspersion pattern was conserved. In Carnivora genomes, the FA-SAT-related sequences are also amplified, with the predominance of a specific FA-SAT variant, at the heterochromatic regions. We inspected the cat genome project to locate FA-SAT array flanking regions and revealed an intensive intermingling with transposable elements. Our results also show that FA-SAT-related sequences are transcribed and that the most abundant FA-SAT variant is not always the most transcribed. We thus conclude that the DNA sequences of FA-SAT and their transcripts are “frozen” in these genomes. Future work is needed to disclose any putative function that these sequences may play in these genomes. PMID:29608678

  16. The role of Cas8 in type I CRISPR interference.

    PubMed

    Cass, Simon D B; Haas, Karina A; Stoll, Britta; Alkhnbashi, Omer S; Sharma, Kundan; Urlaub, Henning; Backofen, Rolf; Marchfelder, Anita; Bolt, Edward L

    2015-05-05

    CRISPR (clustered regularly interspaced short palindromic repeat) systems provide bacteria and archaea with adaptive immunity to repel invasive genetic elements. Type I systems use 'cascade' [CRISPR-associated (Cas) complex for antiviral defence] ribonucleoprotein complexes to target invader DNA, by base pairing CRISPR RNA (crRNA) to protospacers. Cascade identifies PAMs (protospacer adjacent motifs) on invader DNA, triggering R-loop formation and subsequent DNA degradation by Cas3. Cas8 is a candidate PAM recognition factor in some cascades. We analysed Cas8 homologues from type IB CRISPR systems in archaea Haloferax volcanii (Hvo) and Methanothermobacter thermautotrophicus (Mth). Cas8 was essential for CRISPR interference in Hvo and purified Mth Cas8 protein responded to PAM sequence when binding to nucleic acids. Cas8 interacted physically with Cas5-Cas7-crRNA complex, stimulating binding to PAM containing substrates. Mutation of conserved Cas8 amino acid residues abolished interference in vivo and altered catalytic activity of Cas8 protein in vitro. This is experimental evidence that Cas8 is important for targeting Cascade to invader DNA. © 2015 Authors.

  17. Transcription of tandemly repetitive DNA: functional roles.

    PubMed

    Biscotti, Maria Assunta; Canapa, Adriana; Forconi, Mariko; Olmo, Ettore; Barucca, Marco

    2015-09-01

    A considerable fraction of the eukaryotic genome is made up of satellite DNA constituted of tandemly repeated sequences. These elements are mainly located at centromeres, pericentromeres, and telomeres and are major components of constitutive heterochromatin. Although originally satellite DNA was thought silent and inert, an increasing number of studies are providing evidence on its transcriptional activity supporting, on the contrary, an unexpected dynamicity. This review summarizes the multiple structural roles of satellite noncoding RNAs at chromosome level. Indeed, satellite noncoding RNAs play a role in the establishment of a heterochromatic state at centromere and telomere. These highly condensed structures are indispensable to preserve chromosome integrity and genome stability, preventing recombination events, and ensuring the correct chromosome pairing and segregation. Moreover, these RNA molecules seem to be involved also in maintaining centromere identity and in elongation, capping, and replication of telomere. Finally, the abnormal variation of centromeric and pericentromeric DNA transcription across major eukaryotic lineages in stress condition and disease has evidenced the critical role that these transcripts may play and the potentially dire consequences for the organism.

  18. Short interspersed element (SINE) depletion and long interspersed element (LINE) abundance are not features universally required for imprinting.

    PubMed

    Cowley, Michael; de Burca, Anna; McCole, Ruth B; Chahal, Mandeep; Saadat, Ghazal; Oakey, Rebecca J; Schulz, Reiner

    2011-04-20

    Genomic imprinting is a form of gene dosage regulation in which a gene is expressed from only one of the alleles, in a manner dependent on the parent of origin. The mechanisms governing imprinted gene expression have been investigated in detail and have greatly contributed to our understanding of genome regulation in general. Both DNA sequence features, such as CpG islands, and epigenetic features, such as DNA methylation and non-coding RNAs, play important roles in achieving imprinted expression. However, the relative importance of these factors varies depending on the locus in question. Defining the minimal features that are absolutely required for imprinting would help us to understand how imprinting has evolved mechanistically. Imprinted retrogenes are a subset of imprinted loci that are relatively simple in their genomic organisation, being distinct from large imprinting clusters, and have the potential to be used as tools to address this question. Here, we compare the repeat element content of imprinted retrogene loci with non-imprinted controls that have a similar locus organisation. We observe no significant differences that are conserved between mouse and human, suggesting that the paucity of SINEs and relative abundance of LINEs at imprinted loci reported by others is not a sequence feature universally required for imprinting.

  19. Accurate quantification of chromosomal lesions via short tandem repeat analysis using minimal amounts of DNA

    PubMed Central

    Jann, Johann-Christoph; Nowak, Daniel; Nolte, Florian; Fey, Stephanie; Nowak, Verena; Obländer, Julia; Pressler, Jovita; Palme, Iris; Xanthopoulos, Christina; Fabarius, Alice; Platzbecker, Uwe; Giagounidis, Aristoteles; Götze, Katharina; Letsch, Anne; Haase, Detlef; Schlenk, Richard; Bug, Gesine; Lübbert, Michael; Ganser, Arnold; Germing, Ulrich; Haferlach, Claudia; Hofmann, Wolf-Karsten; Mossner, Maximilian

    2017-01-01

    Background Cytogenetic aberrations such as deletion of chromosome 5q (del(5q)) represent key elements in routine clinical diagnostics of haematological malignancies. Currently established methods such as metaphase cytogenetics, FISH or array-based approaches have limitations due to their dependency on viable cells, high costs or semi-quantitative nature. Importantly, they cannot be used on low abundance DNA. We therefore aimed to establish a robust and quantitative technique that overcomes these shortcomings. Methods For precise determination of del(5q) cell fractions, we developed an inexpensive multiplex-PCR assay requiring only nanograms of DNA that simultaneously measures allelic imbalances of 12 independent short tandem repeat markers. Results Application of this method to n=1142 samples from n=260 individuals revealed strong intermarker concordance (R²=0.77–0.97) and reproducibility (mean SD: 1.7%). Notably, the assay showed accurate quantification via standard curve assessment (R²>0.99) and high concordance with paired FISH measurements (R²=0.92) even with subnanogram amounts of DNA. Moreover, cytogenetic response was reliably confirmed in del(5q) patients with myelodysplastic syndromes treated with lenalidomide. While the assay demonstrated good diagnostic accuracy in receiver operating characteristic analysis (area under the curve: 0.97), we further observed robust correlation between bone marrow and peripheral blood samples (R²=0.79), suggesting its potential suitability for less-invasive clonal monitoring. Conclusions In conclusion, we present an adaptable tool for quantification of chromosomal aberrations, particularly in problematic samples, which should be easily applicable to further tumour entities. PMID:28600436

  20. The Repeat Expansion Diseases: the dark side of DNA repair?

    PubMed Central

    Zhao, Xiao-Nan; Usdin, Karen

    2015-01-01

    DNA repair normally protects the genome against mutations that threaten genome integrity and thus cell viability. However, growing evidence suggests that in the case of the Repeat Expansion Diseases, disorders that result from an increase in the size of a disease-specific microsatellite, the disease-causing mutation is actually the result of aberrant DNA repair. A variety of proteins from different DNA repair pathways have thus far been implicated in this process. This review will summarize recent findings from patients and from mouse models of these diseases that shed light on how these pathways may interact to cause repeat expansion. PMID:26002199

  1. Identification and nucleotide sequence analysis of the repetitive DNA element in the genome of fish lymphocystis disease virus.

    PubMed

    Schnitzler, P; Delius, H; Scholz, J; Touray, M; Orth, E; Darai, G

    1987-12-01

    The genome of the fish lymphocystis disease virus (FLDV) was screened for the existence of repetitive DNA sequences using a defined and complete gene library of the viral genome (98 kbp) by DNA-DNA hybridization, heteroduplex analysis, and restriction fine mapping. A repetitive DNA sequence was detected at the coordinates 0.034 to 0.057 and 0.718 to 0.736 map units (m.u.) of the FLDV genome. The first region (0.034 to 0.057 m.u.) corresponds to the 5' terminus of the EcoRI FLDV DNA fragment B (0.034 to 0.165 m.u.) and the second region (0.718 to 0.736 m.u.) is identical to the EcoRI DNA fragment M of the viral genome. The DNA nucleotide sequence of the EcoRI FLDV DNA fragment M was determined. This analysis revealed the presence of many short direct and inverted repetitions, e.g., a 18-mer direct repetition (TTTAAAATTTAATTAA) that started at nucleotide positions 812 and 942 and a 14-mer inverted repeat (TTAAATTTAAATTT) at nucleotide positions 820 and 959. Only short open reading frames were detected within this region. The DNA repetitions are discussed as sequences that play a possible regulatory role for virus replication. Furthermore, hybridization experiments revealed that the repetitive DNA sequences are conserved in the genome of different strains of fish lymphocystis disease virus isolated from two species of Pleuronectidae (flounder and dab).

  2. Next-generation sequencing detects repetitive elements expansion in giant genomes of annual killifish genus Austrolebias (Cyprinodontiformes, Rivulidae).

    PubMed

    García, G; Ríos, N; Gutiérrez, V

    2015-06-01

    Among Neotropical fish fauna, the South American killifish genus Austrolebias (Cyprinodontiformes: Rivulidae) constitutes an excellent model to study the genomic evolutionary processes underlying speciation events. Recently, unusually large genome size has been described in 16 species of this genus, with an average DNA content of about 5.95 ± 0.45 pg per diploid cell (mean C-value of about 2.98 pg). In the present paper we explore the possible origin of this unparallel genomic increase by means of comparative analysis of the repetitive components using NGS (454-Roche) technology in the lowest and highest Rivulidae genomes. Here, we provide the first annotated Rivulidae-repeated sequences composition and their relative repetitive fraction in both genomes. Remarkably, the genomic proportion of the moderately repetitive DNA in Austrolebias charrua genome represents approximately twice (45%) of the repetitive components of the highly related rivulinae taxon Cynopoecilus melanotaenia (25%). Present work provides evidence about the impact of the repeat families that could be distinctly proliferated among sublineages within Rivulidae fish group, explaining the great genome size differences encompassing the differentiation and speciation events in this family.

  3. Visualization and quantitative analysis of extrachromosomal telomere-repeat DNA in individual human cells by Halo-FISH

    PubMed Central

    Komosa, Martin; Root, Heather; Meyn, M. Stephen

    2015-01-01

    Current methods for characterizing extrachromosomal nuclear DNA in mammalian cells do not permit single-cell analysis, are often semi-quantitative and frequently biased toward the detection of circular species. To overcome these limitations, we developed Halo-FISH to visualize and quantitatively analyze extrachromosomal DNA in single cells. We demonstrate Halo-FISH by using it to analyze extrachromosomal telomere-repeat (ECTR) in human cells that use the Alternative Lengthening of Telomeres (ALT) pathway(s) to maintain telomere lengths. We find that GM847 and VA13 ALT cells average ∼80 detectable G/C-strand ECTR DNA molecules/nucleus, while U2OS ALT cells average ∼18 molecules/nucleus. In comparison, human primary and telomerase-positive cells contain <5 ECTR DNA molecules/nucleus. ECTR DNA in ALT cells exhibit striking cell-to-cell variations in number (<20 to >300), range widely in length (<1 to >200 kb) and are composed of primarily G- or C-strand telomere-repeat DNA. Halo-FISH enables, for the first time, the simultaneous analysis of ECTR DNA and chromosomal telomeres in a single cell. We find that ECTR DNA comprises ∼15% of telomere-repeat DNA in GM847 and VA13 cells, but <4% in U2OS cells. In addition to its use in ALT cell analysis, Halo-FISH can facilitate the study of a wide variety of extrachromosomal DNA in mammalian cells. PMID:25662602

  4. The organization of repeating units in mitochondrial DNA from yeast petite mutants.

    PubMed

    Bos, J L; Heyting, C; Van der Horst, G; Borst, P

    1980-04-01

    We have reinvestigated the linkage orientation of repeating units in mtDNAs of yeast ρ(-) petite mutants containing an inverted duplication. All five petite mtDNAs studied contain a continuous segment of wild-type mtDNA, part of which is duplicated and present in inverted form in the repeat. We show by restriction enzyme analysis that the non-duplicated segments between the inverted duplications are present in random orientation in all five petite mtDNAs. There is no segregation of sub-types with unique orientation. We attribute this to the high rate of intramolecular recombination between the inverted duplications. The results provide additional evidence for the high rate of recombination of yeast mtDNA even in haploid ρ(-) petite cells.We conclude that only two types of stable sequence organization exist in petite mtDNA: petites without an inverted duplication have repeats linked in straight head-to-tail arrangement (abcabc); petites with an inverted duplication have repeats in which the non-duplicated segments are present in random orientation.

  5. Non-B DB v2.0: a database of predicted non-B DNA-forming motifs and its associated tools.

    PubMed

    Cer, Regina Z; Donohue, Duncan E; Mudunuri, Uma S; Temiz, Nuri A; Loss, Michael A; Starner, Nathan J; Halusa, Goran N; Volfovsky, Natalia; Yi, Ming; Luke, Brian T; Bacolla, Albino; Collins, Jack R; Stephens, Robert M

    2013-01-01

    The non-B DB, available at http://nonb.abcc.ncifcrf.gov, catalogs predicted non-B DNA-forming sequence motifs, including Z-DNA, G-quadruplex, A-phased repeats, inverted repeats, mirror repeats, direct repeats and their corresponding subsets: cruciforms, triplexes and slipped structures, in several genomes. Version 2.0 of the database revises and re-implements the motif discovery algorithms to better align with accepted definitions and thresholds for motifs, expands the non-B DNA-forming motifs coverage by including short tandem repeats and adds key visualization tools to compare motif locations relative to other genomic annotations. Non-B DB v2.0 extends the ability for comparative genomics by including re-annotation of the five organisms reported in non-B DB v1.0, human, chimpanzee, dog, macaque and mouse, and adds seven additional organisms: orangutan, rat, cow, pig, horse, platypus and Arabidopsis thaliana. Additionally, the non-B DB v2.0 provides an overall improved graphical user interface and faster query performance.

  6. Mapping Simple Repeated DNA Sequences in Heterochromatin of Drosophila Melanogaster

    PubMed Central

    Lohe, A. R.; Hilliker, A. J.; Roberts, P. A.

    1993-01-01

    Heterochromatin in Drosophila has unusual genetic, cytological and molecular properties. Highly repeated DNA sequences (satellites) are the principal component of heterochromatin. Using probes from cloned satellites, we have constructed a chromosome map of 10 highly repeated, simple DNA sequences in heterochromatin of mitotic chromosomes of Drosophila melanogaster. Despite extensive sequence homology among some satellites, chromosomal locations could be distinguished by stringent in situ hybridizations for each satellite. Only two of the localizations previously determined using gradient-purified bulk satellite probes are correct. Eight new satellite localizations are presented, providing a megabase-level chromosome map of one-quarter of the genome. Five major satellites each exhibit a multichromosome distribution, and five minor satellites hybridize to single sites on the Y chromosome. Satellites closely related in sequence are often located near one another on the same chromosome. About 80% of Y chromosome DNA is composed of nine simple repeated sequences, in particular (AAGAC)(n) (8 Mb), (AAGAG)(n) (7 Mb) and (AATAT)(n) (6 Mb). Similarly, more than 70% of the DNA in chromosome 2 heterochromatin is composed of five simple repeated sequences. We have also generated a high resolution map of satellites in chromosome 2 heterochromatin, using a series of translocation chromosomes whose breakpoints in heterochromatin were ordered by N-banding. Finally, staining and banding patterns of heterochromatic regions are correlated with the locations of specific repeated DNA sequences. The basis for the cytochemical heterogeneity in banding appears to depend exclusively on the different satellite DNAs present in heterochromatin. PMID:8375654

  7. [Polymorphic loci and polymorphism analysis of short tandem repeats within XNP gene].

    PubMed

    Liu, Qi-Ji; Gong, Yao-Qin; Guo, Chen-Hong; Chen, Bing-Xi; Li, Jiang-Xia; Guo, Yi-Shou

    2002-01-01

    To select polymorphic short tandem repeat markers within X-linked nuclear protein (XNP) gene, genomic clones which contain XNP gene were recognized by homologous analysis with XNP cDNA. By comparing the cDNA with genomic DNA, non-exonic sequences were identified, and short tandem repeats were selected from non-exonic sequences by using BCM search Launcher. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five short tandem repeats were identified from XNP gene, two of which were polymorphic. Four and 11 alleles were observed in Chinese population for XNPSTR1 and XNPSTR4, respectively. Heterozygosities were 47% for XNPSTR1 and 70% for XNPSTR4. XNPSTR1 and XNPSTR4 localized within 3' end and intron 10, respectively. Two polymorphic short tandem repeats have been identified within XNP gene and will be useful for linkage analysis and gene diagnosis of XNP gene.

  8. Selective recognition of N4-methylcytosine in DNA by engineered transcription-activator-like effectors.

    PubMed

    Rathi, Preeti; Maurer, Sara; Summerer, Daniel

    2018-06-05

    The epigenetic DNA nucleobases 5-methylcytosine (5mC) and N 4-methylcytosine (4mC) coexist in bacterial genomes and have important functions in host defence and transcription regulation. To better understand the individual biological roles of both methylated nucleobases, analytical strategies for distinguishing unmodified cytosine (C) from 4mC and 5mC are required. Transcription-activator-like effectors (TALEs) are programmable DNA-binding repeat proteins, which can be re-engineered for the direct detection of epigenetic nucleobases in user-defined DNA sequences. We here report the natural, cytosine-binding TALE repeat to not strongly differentiate between 5mC and 4mC. To engineer repeats with selectivity in the context of C, 5mC and 4mC, we developed a homogeneous fluorescence assay and screened a library of size-reduced TALE repeats for binding to all three nucleobases. This provided insights into the requirements of size-reduced TALE repeats for 4mC binding and revealed a single mutant repeat as a selective binder of 4mC. Employment of a TALE with this repeat in affinity enrichment enabled the isolation of a user-defined DNA sequence containing a single 4mC but not C or 5mC from the background of a bacterial genome. Comparative enrichments with TALEs bearing this or the natural C-binding repeat provides an approach for the complete, programmable decoding of all cytosine nucleobases found in bacterial genomes.This article is part of a discussion meeting issue 'Frontiers in epigenetic chemical biology'. © 2018 The Author(s).

  9. Developmental rearrangement of cyanobacterial nif genes: nucleotide sequence, open reading frames, and cytochrome P-450 homology of the Anabaena sp. strain PCC 7120 nifD element.

    PubMed Central

    Lammers, P J; McLaughlin, S; Papin, S; Trujillo-Provencio, C; Ryncarz, A J

    1990-01-01

    An 11-kbp DNA element of unknown function interrupts the nifD gene in vegetative cells of Anabaena sp. strain PCC 7120. In developing heterocysts the nifD element excises from the chromosome via site-specific recombination between short repeat sequences that flank the element. The nucleotide sequence of the nifH-proximal half of the element was determined to elucidate the genetic potential of the element. Four open reading frames with the same relative orientation as the nifD element-encoded xisA gene were identified in the sequenced region. Each of the open reading frames was preceded by a reasonable ribosome-binding site and had biased codon utilization preferences consistent with low levels of expression. Open reading frame 3 was highly homologous with three cytochrome P-450 omega-hydroxylase proteins and showed regional homology to functionally significant domains common to the cytochrome P-450 superfamily. The sequence encoding open reading frame 2 was the most highly conserved portion of the sequenced region based on heterologous hybridization experiments with three genera of heterocystous cyanobacteria. Images PMID:2123860

  10. Instability of plasmid DNA sequences: macro and micro evolution of the antibiotic resistance plasmid R6-5.

    PubMed

    Timmis, K N; Cabello, F; Andrés, I; Nordheim, A; Burkhardt, H J; Cohen, S N

    1978-11-16

    Detailed examination of the structure of cloned DNA fragments of the R6-5 antibiotic resistance plasmid has revealed a substantial degree of polynucleotide sequence heterogeneity and indicates that sequence rearrangements in plasmids and possible other replicons occur more frequently than has hitherto been appreciated. The sequences changes in cloned R6-5 fragments were shown in some instances to have occurred prior to cloning, i.e. existing in the original population of R6-5 molecules that was obtained from a single bacterial clone and by several different criteria judged to be homogeneous, and in others to have occurred either during the cloning procedure or during subsequent propagation of hybrid molecules. The molecular changes that are described involved insertion/deletion of the previously characterized IS2 insertion element, formation of a new inverted repeat structure probably by duplication of a preexisting R6-5 DNA sequence, sequence inversion, and loss and gain of restriction endonuclease cleavage sites.

  11. Generation of a conditional analog-sensitive kinase in human cells using CRISPR/Cas9-mediated genome engineering.

    PubMed

    Moyer, Tyler C; Holland, Andrew J

    2015-01-01

    The ability to rapidly and specifically modify the genome of mammalian cells has been a long-term goal of biomedical researchers. Recently, the clustered, regularly interspaced, short palindromic repeats (CRISPR)/Cas9 system from bacteria has been exploited for genome engineering in human cells. The CRISPR system directs the RNA-guided Cas9 nuclease to a specific genomic locus to induce a DNA double-strand break that may be subsequently repaired by homology-directed repair using an exogenous DNA repair template. Here we describe a protocol using CRISPR/Cas9 to achieve bi-allelic insertion of a point mutation in human cells. Using this method, homozygous clonal cell lines can be constructed in 5-6 weeks. This method can also be adapted to insert larger DNA elements, such as fluorescent proteins and degrons, at defined genomic locations. CRISPR/Cas9 genome engineering offers exciting applications in both basic science and translational research. Copyright © 2015 Elsevier Inc. All rights reserved.

  12. CRISPR/Cas9 for genome editing: progress, implications and challenges.

    PubMed

    Zhang, Feng; Wen, Yan; Guo, Xiong

    2014-09-15

    Clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) protein 9 system provides a robust and multiplexable genome editing tool, enabling researchers to precisely manipulate specific genomic elements, and facilitating the elucidation of target gene function in biology and diseases. CRISPR/Cas9 comprises of a nonspecific Cas9 nuclease and a set of programmable sequence-specific CRISPR RNA (crRNA), which can guide Cas9 to cleave DNA and generate double-strand breaks at target sites. Subsequent cellular DNA repair process leads to desired insertions, deletions or substitutions at target sites. The specificity of CRISPR/Cas9-mediated DNA cleavage requires target sequences matching crRNA and a protospacer adjacent motif locating at downstream of target sequences. Here, we review the molecular mechanism, applications and challenges of CRISPR/Cas9-mediated genome editing and clinical therapeutic potential of CRISPR/Cas9 in future. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  13. DNA replication stress restricts ribosomal DNA copy number

    PubMed Central

    Salim, Devika; Bradford, William D.; Freeland, Amy; Cady, Gillian; Wang, Jianmin

    2017-01-01

    Ribosomal RNAs (rRNAs) in budding yeast are encoded by ~100–200 repeats of a 9.1kb sequence arranged in tandem on chromosome XII, the ribosomal DNA (rDNA) locus. Copy number of rDNA repeat units in eukaryotic cells is maintained far in excess of the requirement for ribosome biogenesis. Despite the importance of the repeats for both ribosomal and non-ribosomal functions, it is currently not known how “normal” copy number is determined or maintained. To identify essential genes involved in the maintenance of rDNA copy number, we developed a droplet digital PCR based assay to measure rDNA copy number in yeast and used it to screen a yeast conditional temperature-sensitive mutant collection of essential genes. Our screen revealed that low rDNA copy number is associated with compromised DNA replication. Further, subculturing yeast under two separate conditions of DNA replication stress selected for a contraction of the rDNA array independent of the replication fork blocking protein, Fob1. Interestingly, cells with a contracted array grew better than their counterparts with normal copy number under conditions of DNA replication stress. Our data indicate that DNA replication stresses select for a smaller rDNA array. We speculate that this liberates scarce replication factors for use by the rest of the genome, which in turn helps cells complete DNA replication and continue to propagate. Interestingly, tumors from mini chromosome maintenance 2 (MCM2)-deficient mice also show a loss of rDNA repeats. Our data suggest that a reduction in rDNA copy number may indicate a history of DNA replication stress, and that rDNA array size could serve as a diagnostic marker for replication stress. Taken together, these data begin to suggest the selective pressures that combine to yield a “normal” rDNA copy number. PMID:28915237

  14. Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome

    PubMed Central

    2009-01-01

    Background Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. Results We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. Conclusion We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The results of the present work provide important new information about the structure and content of conifer genomic DNA that will guide future efforts to sequence and assemble conifer genomes. PMID:19656416

  15. Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome.

    PubMed

    Hamberger, Björn; Hall, Dawn; Yuen, Mack; Oddy, Claire; Hamberger, Britta; Keeling, Christopher I; Ritland, Carol; Ritland, Kermit; Bohlmann, Jörg

    2009-08-06

    Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The results of the present work provide important new information about the structure and content of conifer genomic DNA that will guide future efforts to sequence and assemble conifer genomes.

  16. Structure of genes and an insertion element in the methane producing archaebacterium Methanobrevibacter smithii.

    PubMed

    Hamilton, P T; Reeve, J N

    1985-01-01

    DNA fragments cloned from the methanogenic archaebacterium Methanobrevibacter smithii which complement mutations in the purE and proC genes of E. coli have been sequenced. Sequence analyses, transposon mutagenesis and expression in E. coli minicells indicate that purE and proC complementations result from the synthesis of M. smithii polypeptides with molecular weights of 36,697 and 27,836 respectively. The encoding genes appear to be located in operons. The M. smithii genome contains 69% A/T basepairs (bp) which is reflected in unusual codon usages and intergenic regions containing approximately 85% A/T bp. An insertion element, designated ISM1, was found within the cloned M. smithii DNA located adjacent to the proC complementing region. ISM1 is 1381 bp in length, has 29 bp terminal inverted repeat sequences and contains one major ORF encoded in 87% of the ISM1 sequence. ISM1 is mobile, present in approximately 10 copies per genome and integration duplicates 8 bp at the site of insertion. The duplicated sequences show homology with sequences within the 29 bp terminal repeat sequence of ISM1. Comparison of our data with sequences from halophilic archaebacteria suggests that 5'GAANTTTCA and 5'TTTTAATATAAA may be consensus promoter sequences for archaebacteria. These sequences closely resemble the consensus sequences which precede Drosophila heat-shock genes (Pelham 1982; Davidson et al. 1983). Methanogens appear to employ the eubacterial system of mRNA: 16SrRNA hybridization to ensure initiation of translation; the consensus ribosome binding sequence is 5'AGGTGA.

  17. Identification of a recently active Prunus-specific non-autonomous Mutator element with considerable genome shaping force.

    PubMed

    Halász, Júlia; Kodad, Ossama; Hegedűs, Attila

    2014-07-01

    Miniature inverted-repeat transposable elements (MITEs) are known to contribute to the evolution of plants, but only limited information is available for MITEs in the Prunus genome. We identified a MITE that has been named Falling Stones, FaSt. All structural features (349-bp size, 82-bp terminal inverted repeats and 9-bp target site duplications) are consistent with this MITE being a putative member of the Mutator transposase superfamily. FaSt showed a preferential accumulation in the short AT-rich segments of the euchromatin region of the peach genome. DNA sequencing and pollination experiments have been performed to confirm that the nested insertion of FaSt into the S-haplotype-specific F-box gene of apricot resulted in the breakdown of self-incompatibility (SI). A bioinformatics-based survey of the known Rosaceae and other genomes and a newly designed polymerase chain reaction (PCR) assay verified the Prunoideae-specific occurrence of FaSt elements. Phylogenetic analysis suggested a recent activity of FaSt in the Prunus genome. The occurrence of a nested insertion in the apricot genome further supports the recent activity of FaSt in response to abiotic stress conditions. This study reports on a presumably active non-autonomous Mutator element in Prunus that exhibits a major indirect genome shaping force through inducing loss-of-function mutation in the SI locus. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.

  18. CACTA-superfamily transposable element is inserted in MYB transcription factor gene of soybean line producing variegated seeds.

    PubMed

    Yan, Fan; Di, Shaokang; Takahashi, Ryoji

    2015-08-01

    The R gene of soybean, presumably encoding a MYB transcription factor, controls seed coat color. The gene consists of multiple alleles, R (black), r-m (black spots and (or) concentric streaks on brown seed), and r (brown seed). This study was conducted to determine the structure of the MYB transcription factor gene in a near-isogenic line (NIL) having r-m allele. PCR amplification of a fragment of the candidate gene Glyma.09G235100 generated a fragment of about 1 kb in the soybean cultivar Clark, whereas a fragment of about 14 kb in addition to fragments of 1 and 1.4 kb were produced in L72-2040, a Clark 63 NIL with the r-m allele. Clark 63 is a NIL of Clark with the rxp and Rps1 alleles. A DNA fragment of 13 060 bp was inserted in the intron of Glyma.09G235100 in L72-2040. The fragment had the CACTA motif at both ends, imperfect terminal inverted repeats (TIR), inverse repetition of short sequence motifs close to the 5' and 3' ends, and a duplication of three nucleotides at the site of integration, indicating that it belongs to a CACTA-superfamily transposable element. We designated the element as Tgm11. Overall nucleotide sequence, motifs of TIR, and subterminal repeats were similar to those of Tgm1 and Tgs1, suggesting that these elements comprise a family.

  19. Characterisation of IS153, an IS3-family insertion sequence isolated from Lactobacillus sanfranciscensis and its use for strain differentiation.

    PubMed

    Ehrmann, M A; Vogel, R E

    2001-11-01

    An insertion sequence has been identified in the genome of Lactobacillus sanfranciscensis DSM 20451T as segment of 1351 nucleotides containing 37-bp imperfect terminal inverted repeats. The sequence of this element encodes two out of phase, overlapping open reading frames, orfA and orfB, from which three putative proteins are produced. OrfAB is a transframe protein produced by -1 translational frame shifting between orf A and orf B that is presumed to be the transposase. The large orfAB of this element encodes a 342 amino acid protein that displays similarities with transposases encoded by bacterial insertion sequences belonging to the IS3 family. In L. sanfranciscensis type strain DSM 20451T multiple truncated IS elements were identified. Inverse PCR was used to analyze target sites of four of these elements, but except of their highly AT rich character not any sequence specificity was identified so far. Moreover, no flanking direct repeats were identified. Multiple copies of IS153 were detected by hybridization in other strains of L. sanfranciscensis. Resulting hybridization patterns were shown to differentiate between organisms at strain level rather than a probe targeted against the 16S rDNA. With a PCR based approach IS153 or highly similar sequences were detected in L. acidophilus, L. casei, L. malefermentans, L. plantarum, L. hilgardii, L. collinoides L. farciminis L. sakei and L. salivarius, L. reuteri as well as in Enterococcus faecium, Pediococcus acidilactici and P. pentosaceus.

  20. Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain

    PubMed Central

    de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

    2014-01-01

    The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. PMID:24792163

  1. The genome biology of phytoplasma: modulators of plants and insects.

    PubMed

    Sugio, Akiko; Hogenhout, Saskia A

    2012-06-01

    Phytoplasmas are bacterial pathogens of plants that are transmitted by insects. These bacteria uniquely multiply intracellularly in both plants (Plantae) and insects (Animalia). Similarly to bacterial endosymbionts, phytoplasmas have reduced genomes with limited metabolic capabilities. Nonetheless, the chromosomes of many phytoplasmas are rich in repeated DNA consisting of mobile elements. Phytoplasmas produce an arsenal of effectors most of which are encoded on these mobile elements and on plasmids. These effectors target conserved plant transcription factors resulting in witches' broom and leafy flower symptoms and suppression of plant defense to insect vectors that transmit the phytoplasmas. Future studies of these fascinating microbes will generate a wealth of new knowledge about forces that shape genomes and microbial interactions with multicellular hosts. Copyright © 2012 Elsevier Ltd. All rights reserved.

  2. Inhibition of hepatitis B virus replication with linear DNA sequences expressing antiviral micro-RNA shuttles

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chattopadhyay, Saket; Ely, Abdullah; Bloom, Kristie

    2009-11-20

    RNA interference (RNAi) may be harnessed to inhibit viral gene expression and this approach is being developed to counter chronic infection with hepatitis B virus (HBV). Compared to synthetic RNAi activators, DNA expression cassettes that generate silencing sequences have advantages of sustained efficacy and ease of propagation in plasmid DNA (pDNA). However, the large size of pDNAs and inclusion of sequences conferring antibiotic resistance and immunostimulation limit delivery efficiency and safety. To develop use of alternative DNA templates that may be applied for therapeutic gene silencing, we assessed the usefulness of PCR-generated linear expression cassettes that produce anti-HBV micro-RNA (miR)more » shuttles. We found that silencing of HBV markers of replication was efficient (>75%) in cell culture and in vivo. miR shuttles were processed to form anti-HBV guide strands and there was no evidence of induction of the interferon response. Modification of terminal sequences to include flanking human adenoviral type-5 inverted terminal repeats was easily achieved and did not compromise silencing efficacy. These linear DNA sequences should have utility in the development of gene silencing applications where modifications of terminal elements with elimination of potentially harmful and non-essential sequences are required.« less

  3. Heterochromatic siRNAs and DDM1 Independently Silence Aberrant 5S rDNA Transcripts in Arabidopsis

    PubMed Central

    Blevins, Todd; Pontes, Olga; Pikaard, Craig S.; Meins, Frederick

    2009-01-01

    5S ribosomal RNA gene repeats are arranged in heterochromatic arrays (5S rDNA) situated near the centromeres of Arabidopsis chromosomes. The chromatin remodeling factor DDM1 is known to maintain 5S rDNA methylation patterns while silencing transcription through 5S rDNA intergenic spacers (IGS). We mapped small-interfering RNAs (siRNA) to a composite 5S rDNA repeat, revealing a high density of siRNAs matching silenced IGS transcripts. IGS transcript repression requires proteins of the heterochromatic siRNA pathway, including RNA polymerase IV (Pol IV), RNA-DEPENDENT RNA POLYMERASE 2 (RDR2) and DICER-LIKE 3 (DCL3). Using molecular and cytogenetic approaches, we show that the DDM1 and siRNA-dependent silencing effects are genetically independent. DDM1 suppresses production of the siRNAs, however, thereby limiting RNA-directed DNA methylation at 5S rDNA repeats. We conclude that DDM1 and siRNA-dependent silencing are overlapping processes that both repress aberrant 5S rDNA transcription and contribute to the heterochromatic state of 5S rDNA arrays. PMID:19529764

  4. Protection of CpG islands from DNA methylation is DNA-encoded and evolutionarily conserved.

    PubMed

    Long, Hannah K; King, Hamish W; Patient, Roger K; Odom, Duncan T; Klose, Robert J

    2016-08-19

    DNA methylation is a repressive epigenetic modification that covers vertebrate genomes. Regions known as CpG islands (CGIs), which are refractory to DNA methylation, are often associated with gene promoters and play central roles in gene regulation. Yet how CGIs in their normal genomic context evade the DNA methylation machinery and whether these mechanisms are evolutionarily conserved remains enigmatic. To address these fundamental questions we exploited a transchromosomic animal model and genomic approaches to understand how the hypomethylated state is formed in vivo and to discover whether mechanisms governing CGI formation are evolutionarily conserved. Strikingly, insertion of a human chromosome into mouse revealed that promoter-associated CGIs are refractory to DNA methylation regardless of host species, demonstrating that DNA sequence plays a central role in specifying the hypomethylated state through evolutionarily conserved mechanisms. In contrast, elements distal to gene promoters exhibited more variable methylation between host species, uncovering a widespread dependence on nucleotide frequency and occupancy of DNA-binding transcription factors in shaping the DNA methylation landscape away from gene promoters. This was exemplified by young CpG rich lineage-restricted repeat sequences that evaded DNA methylation in the absence of co-evolved mechanisms targeting methylation to these sequences, and species specific DNA binding events that protected against DNA methylation in CpG poor regions. Finally, transplantation of mouse chromosomal fragments into the evolutionarily distant zebrafish uncovered the existence of a mechanistically conserved and DNA-encoded logic which shapes CGI formation across vertebrate species. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. Tandemly repeated sequences in mtDNA control region of whitefish, Coregonus lavaretus.

    PubMed

    Brzuzan, P

    2000-06-01

    Length variation of the mitochondrial DNA control region was observed with PCR amplification of a sample of 138 whitefish (Coregonus lavaretus). Nucleotide sequences of representative PCR products showed that the variation was due to the presence of an approximately 100-bp motif tandemly repeated two, three, or five times in the region between the conserved sequence block-3 (CSB-3) and the gene for phenylalanine tRNA. This is the first report on the tandem array composed of long repeat units in mitochondrial DNA of salmonids.

  6. Is the Fungus Magnaporthe Losing DNA Methylation?

    PubMed Central

    Ikeda, Ken-ichi; Van Vu, Ba; Kadotani, Naoki; Tanaka, Masaki; Murata, Toshiki; Shiina, Kohta; Chuma, Izumi; Tosa, Yukio; Nakayashiki, Hitoshi

    2013-01-01

    The long terminal repeat retrotransposon, Magnaporthe gypsy-like element (MAGGY), has been shown to be targeted for cytosine methylation in a subset of Magnaporthe oryzae field isolates. Analysis of the F1 progeny from a genetic cross between methylation-proficient (Br48) and methylation-deficient (GFSI1-7-2) isolates revealed that methylation of the MAGGY element was governed by a single dominant gene. Positional cloning followed by gene disruption and complementation experiments revealed that the responsible gene was the DNA methyltransferase, MoDMT1, an ortholog of Neurospora crassa Dim-2. A survey of MAGGY methylation in 60 Magnaporthe field isolates revealed that 42 isolates from rice, common millet, wheat, finger millet, and buffelgrass were methylation proficient while 18 isolates from foxtail millet, green bristlegrass, Japanese panicgrass, torpedo grass, Guinea grass, and crabgrass were methylation deficient. Phenotypic analyses showed that MoDMT1 plays no major role in development and pathogenicity of the fungus. Quantitative polymerase chain reaction analysis showed that the average copy number of genomic MAGGY elements was not significantly different between methylation-deficient and -proficient field isolates even though the levels of MAGGY transcript were generally higher in the former group. MoDMT1 gene sequences in the methylation-deficient isolates suggested that at least three independent mutations were responsible for the loss of MoDMT1 function. Overall, our data suggest that MoDMT1 is not essential for the natural life cycle of the fungus and raise the possibility that the genus Magnaporthe may be losing the mechanism of DNA methylation on the evolutionary time scale. PMID:23979580

  7. Processing of double-R-loops in (CAG)·(CTG) and C9orf72 (GGGGCC)·(GGCCCC) repeats causes instability

    PubMed Central

    Reddy, Kaalak; Schmidt, Monika H.M.; Geist, Jaimie M.; Thakkar, Neha P.; Panigrahi, Gagan B.; Wang, Yuh-Hwa; Pearson, Christopher E.

    2014-01-01

    R-loops, transcriptionally-induced RNA:DNA hybrids, occurring at repeat tracts (CTG)n, (CAG)n, (CGG)n, (CCG)n and (GAA)n, are associated with diseases including myotonic dystrophy, Huntington's disease, fragile X and Friedreich's ataxia. Many of these repeats are bidirectionally transcribed, allowing for single- and double-R-loop configurations, where either or both DNA strands may be RNA-bound. R-loops can trigger repeat instability at (CTG)·(CAG) repeats, but the mechanism of this is unclear. We demonstrate R-loop-mediated instability through processing of R-loops by HeLa and human neuron-like cell extracts. Double-R-loops induced greater instability than single-R-loops. Pre-treatment with RNase H only partially suppressed instability, supporting a model in which R-loops directly generate instability by aberrant processing, or via slipped-DNA formation upon RNA removal and its subsequent aberrant processing. Slipped-DNAs were observed to form following removal of the RNA from R-loops. Since transcriptionally-induced R-loops can occur in the absence of DNA replication, R-loop processing may be a source of repeat instability in the brain. Double-R-loop formation and processing to instability was extended to the expanded C9orf72 (GGGGCC)·(GGCCCC) repeats, known to cause amyotrophic lateral sclerosis and frontotemporal dementia, providing the first suggestion through which these repeats may become unstable. These findings provide a mechanistic basis for R-loop-mediated instability at disease-associated repeats. PMID:25147206

  8. Fragile DNA Motifs Trigger Mutagenesis at Distant Chromosomal Loci in Saccharomyces cerevisiae

    PubMed Central

    Saini, Natalie; Zhang, Yu; Nishida, Yuri; Sheng, Ziwei; Choudhury, Shilpa; Mieczkowski, Piotr; Lobachev, Kirill S.

    2013-01-01

    DNA sequences capable of adopting non-canonical secondary structures have been associated with gross-chromosomal rearrangements in humans and model organisms. Previously, we have shown that long inverted repeats that form hairpin and cruciform structures and triplex-forming GAA/TTC repeats induce the formation of double-strand breaks which trigger genome instability in yeast. In this study, we demonstrate that breakage at both inverted repeats and GAA/TTC repeats is augmented by defects in DNA replication. Increased fragility is associated with increased mutation levels in the reporter genes located as far as 8 kb from both sides of the repeats. The increase in mutations was dependent on the presence of inverted or GAA/TTC repeats and activity of the translesion polymerase Polζ. Mutagenesis induced by inverted repeats also required Sae2 which opens hairpin-capped breaks and initiates end resection. The amount of breakage at the repeats is an important determinant of mutations as a perfect palindromic sequence with inherently increased fragility was also found to elevate mutation rates even in replication-proficient strains. We hypothesize that the underlying mechanism for mutagenesis induced by fragile motifs involves the formation of long single-stranded regions in the broken chromosome, invasion of the undamaged sister chromatid for repair, and faulty DNA synthesis employing Polζ. These data demonstrate that repeat-mediated breaks pose a dual threat to eukaryotic genome integrity by inducing chromosomal aberrations as well as mutations in flanking genes. PMID:23785298

  9. Alu repeated DNAs are differentially methylated in primate germ cells.

    PubMed Central

    Rubin, C M; VandeVoort, C A; Teplitz, R L; Schmid, C W

    1994-01-01

    A significant fraction of Alu repeats in human sperm DNA, previously found to be unmethylated, is nearly completely methylated in DNA from many somatic tissues. A similar fraction of unmethylated Alus is observed here in sperm DNA from rhesus monkey. However, Alus are almost completely methylated at the restriction sites tested in monkey follicular oocyte DNA. The Alu methylation patterns in mature male and female monkey germ cells are consistent with Alu methylation in human germ cell tumors. Alu sequences are hypomethylated in seminoma DNAs and more methylated in a human ovarian dysgerminoma. These results contrast with methylation patterns reported for germ cell single-copy, CpG island, satellite, and L1 sequences. The function of Alu repeats is not known, but differential methylation of Alu repeats in the male and female germ lines suggests that they may serve as markers for genomic imprinting or in maintaining differences in male and female meiosis. Images PMID:7800508

  10. Length and sequence heterogeneity in 5S rDNA of Populus deltoides.

    PubMed

    Negi, Madan S; Rajagopal, Jyothi; Chauhan, Neeti; Cronn, Richard; Lakshmikumaran, Malathi

    2002-12-01

    The 5S rRNA genes and their associated non-transcribed spacer (NTS) regions are present as repeat units arranged in tandem arrays in plant genomes. Length heterogeneity in 5S rDNA repeats was previously identified in Populus deltoides and was also observed in the present study. Primers were designed to amplify the 5S rDNA NTS variants from the P. deltoides genome. The PCR-amplified products from the two accessions of P. deltoides (G3 and G48) suggested the presence of length heterogeneity of 5S rDNA units within and among accessions, and the size of the spacers ranged from 385 to 434 bp. Sequence analysis of the non-transcribed spacer (NTS) revealed two distinct classes of 5S rDNA within both accessions: class 1, which contained GAA trinucleotide microsatellite repeats, and class 2, which lacked the repeats. The class 1 spacer shows length variation owing to the microsatellite, with two clones exhibiting 10 GAA repeat units and one clone exhibiting 16 such repeat units. However, distance analysis shows that class 1 spacer sequences are highly similar inter se, yielding nucleotide diversity (pi) estimates that are less than 0.15% of those obtained for class 2 spacers (pi = 0.0183 vs. 0.1433, respectively). The presence of microsatellite in the NTS region leading to variation in spacer length is reported and discussed for the first time in P. deltoides.

  11. Electronic Transport in Single-Stranded DNA Molecule Related to Huntington's Disease

    NASA Astrophysics Data System (ADS)

    Sarmento, R. G.; Silva, R. N. O.; Madeira, M. P.; Frazão, N. F.; Sousa, J. O.; Macedo-Filho, A.

    2018-04-01

    We report a numerical analysis of the electronic transport in single chain DNA molecule consisting of 182 nucleotides. The DNA chains studied were extracted from a segment of the human chromosome 4p16.3, which were modified by expansion of CAG (cytosine-adenine-guanine) triplet repeats to mimics Huntington's disease. The mutated DNA chains were connected between two platinum electrodes to analyze the relationship between charge propagation in the molecule and Huntington's disease. The computations were performed within a tight-binding model, together with a transfer matrix technique, to investigate the current-voltage (I-V) of 23 types of DNA sequence and compare them with the distributions of the related CAG repeat numbers with the disease. All DNA sequences studied have a characteristic behavior of a semiconductor. In addition, the results showed a direct correlation between the current-voltage curves and the distributions of the CAG repeat numbers, suggesting possible applications in the development of DNA-based biosensors for molecular diagnostics.

  12. History of CRISPR-Cas from Encounter with a Mysterious Repeated Sequence to Genome Editing Technology.

    PubMed

    Ishino, Yoshizumi; Krupovic, Mart; Forterre, Patrick

    2018-04-01

    Clustered regularly interspaced short palindromic repeat (CRISPR)-Cas systems are well-known acquired immunity systems that are widespread in archaea and bacteria. The RNA-guided nucleases from CRISPR-Cas systems are currently regarded as the most reliable tools for genome editing and engineering. The first hint of their existence came in 1987, when an unusual repetitive DNA sequence, which subsequently was defined as a CRISPR, was discovered in the Escherichia coli genome during an analysis of genes involved in phosphate metabolism. Similar sequence patterns were then reported in a range of other bacteria as well as in halophilic archaea, suggesting an important role for such evolutionarily conserved clusters of repeated sequences. A critical step toward functional characterization of the CRISPR-Cas systems was the recognition of a link between CRISPRs and the associated Cas proteins, which were initially hypothesized to be involved in DNA repair in hyperthermophilic archaea. Comparative genomics, structural biology, and advanced biochemistry could then work hand in hand, not only culminating in the explosion of genome editing tools based on CRISPR-Cas9 and other class II CRISPR-Cas systems but also providing insights into the origin and evolution of this system from mobile genetic elements denoted casposons. To celebrate the 30th anniversary of the discovery of CRISPR, this minireview briefly discusses the fascinating history of CRISPR-Cas systems, from the original observation of an enigmatic sequence in E. coli to genome editing in humans. Copyright © 2018 American Society for Microbiology.

  13. Evolutionary force of AT-rich repeats to trap genomic and episomal DNAs into the rice genome: lessons from endogenous pararetrovirus.

    PubMed

    Liu, Ruifang; Koyanagi, Kanako O; Chen, Sunlu; Kishima, Yuji

    2012-12-01

    In plant genomes, the incorporation of DNA segments is not a common method of artificial gene transfer. Nevertheless, various segments of pararetroviruses have been found in plant genomes in recent decades. The rice genome contains a number of segments of endogenous rice tungro bacilliform virus-like sequences (ERTBVs), many of which are present between AT dinucleotide repeats (ATrs). Comparison of genomic sequences between two closely related rice subspecies, japonica and indica, allowed us to verify the preferential insertion of ERTBVs into ATrs. In addition to ERTBVs, the comparative analyses showed that ATrs occasionally incorporate repeat sequences including transposable elements, and a wide range of other sequences. Besides the known genomic sequences, the insertion sequences also represented DNAs of unclear origins together with ERTBVs, suggesting that ATrs have integrated episomal DNAs that would have been suspended in the nucleus. Such insertion DNAs might be trapped by ATrs in the genome in a host-dependent manner. Conversely, other simple mono- and dinucleotide sequence repeats (SSR) were less frequently involved in insertion events relative to ATrs. Therefore, ATrs could be regarded as hot spots of double-strand breaks that induce non-homologous end joining. The insertions within ATrs occasionally generated new gene-related sequences or involved structural modifications of existing genes. Likewise, in a comparison between Arabidopsis thaliana and Arabidopsis lyrata, the insertions preferred ATrs to other SSRs. Therefore ATrs in plant genomes could be considered as genomic dumping sites that have trapped various DNA molecules and may have exerted a powerful evolutionary force. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.

  14. Genome-wide analyses of LINE–LINE-mediated nonallelic homologous recombination

    PubMed Central

    Startek, Michał; Szafranski, Przemyslaw; Gambin, Tomasz; Campbell, Ian M.; Hixson, Patricia; Shaw, Chad A.; Stankiewicz, Paweł; Gambin, Anna

    2015-01-01

    Nonallelic homologous recombination (NAHR), occurring between low-copy repeats (LCRs) >10 kb in size and sharing >97% DNA sequence identity, is responsible for the majority of recurrent genomic rearrangements in the human genome. Recent studies have shown that transposable elements (TEs) can also mediate recurrent deletions and translocations, indicating the features of substrates that mediate NAHR may be significantly less stringent than previously believed. Using >4 kb length and >95% sequence identity criteria, we analyzed of the genome-wide distribution of long interspersed element (LINE) retrotransposon and their potential to mediate NAHR. We identified 17 005 directly oriented LINE pairs located <10 Mbp from each other as potential NAHR substrates, placing 82.8% of the human genome at risk of LINE–LINE-mediated instability. Cross-referencing these regions with CNVs in the Baylor College of Medicine clinical chromosomal microarray database of 36 285 patients, we identified 516 CNVs potentially mediated by LINEs. Using long-range PCR of five different genomic regions in a total of 44 patients, we confirmed that the CNV breakpoints in each patient map within the LINE elements. To additionally assess the scale of LINE–LINE/NAHR phenomenon in the human genome, we tested DNA samples from six healthy individuals on a custom aCGH microarray targeting LINE elements predicted to mediate CNVs and identified 25 LINE–LINE rearrangements. Our data indicate that LINE–LINE-mediated NAHR is widespread and under-recognized, and is an important mechanism of structural rearrangement contributing to human genomic variability. PMID:25613453

  15. Cell cycle-dependent transcription factors control the expression of yeast telomerase RNA.

    PubMed

    Dionne, Isabelle; Larose, Stéphanie; Dandjinou, Alain T; Abou Elela, Sherif; Wellinger, Raymund J

    2013-07-01

    Telomerase is a specialized ribonucleoprotein that adds repeated DNA sequences to the ends of eukaryotic chromosomes to preserve genome integrity. Some secondary structure features of the telomerase RNA are very well conserved, and it serves as a central scaffold for the binding of associated proteins. The Saccharomyces cerevisiae telomerase RNA, TLC1, is found in very low copy number in the cell and is the limiting component of the known telomerase holoenzyme constituents. The reasons for this low abundance are unclear, but given that the RNA is very stable, transcriptional control mechanisms must be extremely important. Here we define the sequences forming the TLC1 promoter and identify the elements required for its low expression level, including enhancer and repressor elements. Within an enhancer element, we found consensus sites for Mbp1/Swi4 association, and chromatin immunoprecipitation (ChIP) assays confirmed the binding of Mbp1 and Swi4 to these sites of the TLC1 promoter. Furthermore, the enhancer element conferred cell cycle-dependent regulation to a reporter gene, and mutations in the Mbp1/Swi4 binding sites affected the levels of telomerase RNA and telomere length. Finally, ChIP experiments using a TLC1 RNA-binding protein as target showed cell cycle-dependent transcription of the TLC1 gene. These results indicate that the budding yeast TLC1 RNA is transcribed in a cell cycle-dependent fashion late in G1 and may be part of the S phase-regulated group of genes involved in DNA replication.

  16. Characterization of the repetitive DNA elements in the genome of fish lymphocystis disease viruses.

    PubMed

    Schnitzler, P; Darai, G

    1989-09-01

    The complete DNA nucleotide sequence of the repetitive DNA elements in the genome of fish lymphocystis disease virus (FLDV) isolated from two different species (flounder and dab) was determined. The size of these repetitive DNA elements was found to be 1413 bp which corresponds to the DNA sequences of the 5' terminus of the EcoRI DNA fragment B (0.034 to 0.052 m.u.) and to the EcoRI DNA fragment M (0.718 to 0.736 m.u.) of the FLDV genome causing lymphocystis disease in flounder and plaice. The degree of DNA nucleotide homology between both regions was found to be 99%. The repetitive DNA element in the genome of FLDV isolated from other fish species (dab) was identified and is located within the EcoRI DNA fragment B and J of the viral genome. The DNA nucleotide sequence of one duplicate of this repetition (EcoRI DNA fragment J) was determined (1410 bp) and compared to the DNA nucleotide sequences of the repetitive DNA elements of the genome of FLDV isolated from flounder. It was found that the repetitive DNA elements of the genome of FLDV derived from two different fish species are highly conserved and possess a degree of DNA sequence homology of 94%. The DNA sequences of each strand of the individual repetitive element possess one open reading frame.

  17. DNA Fingerprint Analysis of Three Short Tandem Repeat (STR) Loci for Biochemistry and Forensic Science Laboratory Courses

    ERIC Educational Resources Information Center

    McNamara-Schroeder, Kathleen; Olonan, Cheryl; Chu, Simon; Montoya, Maria C.; Alviri, Mahta; Ginty, Shannon; Love, John J.

    2006-01-01

    We have devised and implemented a DNA fingerprinting module for an upper division undergraduate laboratory based on the amplification and analysis of three of the 13 short tandem repeat loci that are required by the Federal Bureau of Investigation Combined DNA Index System (FBI CODIS) data base. Students first collect human epithelial (cheek)…

  18. Genomic sequences of murine gamma B- and gamma C-crystallin-encoding genes: promoter analysis and complete evolutionary pattern of mouse, rat and human gamma-crystallins.

    PubMed

    Graw, J; Liebstein, A; Pietrowski, D; Schmitt-John, T; Werner, T

    1993-12-22

    The murine genes, gamma B-cry and gamma C-cry, encoding the gamma B- and gamma C-crystallins, were isolated from a genomic DNA library. The complete nucleotide (nt) sequences of both genes were determined from 661 and 711 bp, respectively, upstream from the first exon to the corresponding polyadenylation sites, comprising more than 2650 and 2890 bp, respectively. The new sequences were compared to the partial cDNA sequences available for the murine gamma B-cry and gamma C-cry, as well as to the corresponding genomic sequences from rat and man, at both the nt and predicted amino acid (aa) sequence levels. In the gamma B-cry promoter region, a canonical CCAAT-box, a TATA-box, putative NF-I and C/EBP sites were detected. An R-repeat is inserted 366 bp upstream from the transcription start point. In contrast, the gamma C-cry promoter does not contain a CCAAT-box, but some other putative binding sites for transcription factors (AP-2, UBP-1, LBP-1) were located by computer analysis. The promoter regions of all six gamma-cry from mouse, rat and human, except human psi gamma F-cry, were analyzed for common sequence elements. A complex sequence element of about 70-80 bp was found in the proximal promoter, which contains a gamma-cry-specific and almost invariant sequence (crygpel) of 14 nt, and ends with the also invariant TATA-box. Within the complex sequence element, a minimum of three further features specific for the gamma A-, gamma B- and gamma D/E/F-cry genes can be defined, at least two of which were recently shown to be functional. In addition to these four sequence elements, a subtype-specific structure of inverted repeats with different-sized spacers can be deduced from the multiple sequence alignment. A phylogenetic analysis based on the promoter region, as well as the complete exon 3 of all gamma-cry from mouse, rat and man, suggests separation of only five gamma-cry subtypes (gamma A-, gamma B-, gamma C-, gamma D- and gamma E/F-cry) prior to species separation.

  19. G-quadruplex-interacting compounds alter latent DNA replication and episomal persistence of KSHV.

    PubMed

    Madireddy, Advaitha; Purushothaman, Pravinkumar; Loosbroock, Christopher P; Robertson, Erle S; Schildkraut, Carl L; Verma, Subhash C

    2016-05-05

    Kaposi's sarcoma associated herpesvirus (KSHV) establishes life-long latent infection by persisting as an extra-chromosomal episome in the infected cells and by maintaining its genome in dividing cells. KSHV achieves this by tethering its epigenome to the host chromosome by latency associated nuclear antigen (LANA), which binds in the terminal repeat (TR) region of the viral genome. Sequence analysis of the TR, a GC-rich DNA element, identified several potential Quadruplex G-Rich Sequences (QGRS). Since quadruplexes have the tendency to obstruct DNA replication, we used G-quadruplex stabilizing compounds to examine their effect on latent DNA replication and the persistence of viral episomes. Our results showed that these G-quadruplex stabilizing compounds led to the activation of dormant origins of DNA replication, with preferential bi-directional pausing of replications forks moving out of the TR region, implicating the role of the G-rich TR in the perturbation of episomal DNA replication. Over time, treatment with PhenDC3 showed a loss of viral episomes in the infected cells. Overall, these data show that G-quadruplex stabilizing compounds retard the progression of replication forks leading to a reduction in DNA replication and episomal maintenance. These results suggest a potential role for G-quadruplex stabilizers in the treatment of KSHV-associated diseases. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. Distinct roles of DNMT1-dependent and DNMT1-independent methylation patterns in the genome of mouse embryonic stem cells.

    PubMed

    Li, Zhiguang; Dai, Hongzheng; Martos, Suzanne N; Xu, Beisi; Gao, Yang; Li, Teng; Zhu, Guangjing; Schones, Dustin E; Wang, Zhibin

    2015-06-02

    DNA methylation patterns are initiated by de novo DNA methyltransferases DNMT3a/3b adding methyl groups to CG dinucleotides in the hypomethylated genome of early embryos. These patterns are faithfully maintained by DNMT1 during DNA replication to ensure epigenetic inheritance across generations. However, this two-step model is based on limited data. We generated base-resolution DNA methylomes for a series of DNMT knockout embryonic stem cells, with deep coverage at highly repetitive elements. We show that DNMT1 and DNMT3a/3b activities work complementarily and simultaneously to establish symmetric CG methylation and CHH (H = A, T or C) methylation. DNMT3a/3b can add methyl groups to daughter strands after each cycle of DNA replication. We also observe an unexpected division of labor between DNMT1 and DNMT3a/3b in suppressing retrotransposon long terminal repeats and long interspersed elements, respectively. Our data suggest that mammalian cells use a specific CG density threshold to predetermine methylation levels in wild-type cells and the magnitude of methylation reduction in DNMT knockout cells. Only genes with low CG density can be induced or, surprisingly, suppressed in the hypomethylated genome. Lastly, we do not find any association between gene body methylation and transcriptional activity. We show the concerted actions of DNMT enzymes in the establishment and maintenance of methylation patterns. The finding of distinct roles of DNMT1-dependent and -independent methylation patterns in genome stability and regulation of transcription provides new insights for understanding germ cell development, neuronal diversity, and transgenerational epigenetic inheritance and will help to develop next-generation DNMT inhibitors.

  1. Prenatal Arsenic Exposure and DNA Methylation in Maternal and Umbilical Cord Blood Leukocytes

    PubMed Central

    Baccarelli, Andrea; Hoffman, Elaine; Tarantini, Letizia; Quamruzzaman, Quazi; Rahman, Mahmuder; Mahiuddin, Golam; Mostofa, Golam; Hsueh, Yu-Mei; Wright, Robert O.; Christiani, David C.

    2012-01-01

    Background: Arsenic is an epigenetic toxicant and could influence fetal developmental programming. Objectives: We evaluated the association between arsenic exposure and DNA methylation in maternal and umbilical cord leukocytes. Methods: Drinking-water and urine samples were collected when women were at ≤ 28 weeks gestation; the samples were analyzed for arsenic using inductively coupled plasma mass spectrometry. DNA methylation at CpG sites in p16 (n = 7) and p53 (n = 4), and in LINE-1 and Alu repetitive elements (3 CpG sites in each), was quantified using pyrosequencing in 113 pairs of maternal and umbilical blood samples. We used general linear models to evaluate the relationship between DNA methylation and tertiles of arsenic exposure. Results: Mean (± SD) drinking-water arsenic concentration was 14.8 ± 36.2 μg/L (range: < 1–230 μg/L). Methylation in LINE-1 increased by 1.36% [95% confidence interval (CI): 0.52, 2.21%] and 1.08% (95% CI: 0.07, 2.10%) in umbilical cord and maternal leukocytes, respectively, in association with the highest versus lowest tertile of total urinary arsenic per gram creatinine. Arsenic exposure was also associated with higher methylation of some of the tested CpG sites in the promoter region of p16 in umbilical cord and maternal leukocytes. No associations were observed for Alu or p53 methylation. Conclusions: Exposure to higher levels of arsenic was positively associated with DNA methylation in LINE-1 repeated elements, and to a lesser degree at CpG sites within the promoter region of the tumor suppressor gene p16. Associations were observed in both maternal and fetal leukocytes. Future research is needed to confirm these results and determine if these small increases in methylation are associated with any health effects. PMID:22466225

  2. Recombination Creates Novel L1 (Line-1) Elements in Rattus Norvegicus

    PubMed Central

    Hayward, B. E.; Zavanelli, M.; Furano, A. V.

    1997-01-01

    Mammalian L1 (long interspersed repeated DNA, LINE-1) retrotransposons consist of a 5' untranslated region (UTR) with regulatory properties, two protein encoding regions (ORF I, ORF II, which encodes a reverse transcriptase) and a 3' UTR. L1 elements have been evolving in mammals for >100 million years and this process continues to generate novel L1 subfamilies in modern species. Here we characterized the youngest known subfamily in Rattus norvegicus, L1(mlvi2), and unexpectedly found that this element has a dual ancestry. While its 3' UTR shares the same lineage as its nearest chronologically antecedent subfamilies, L1(3) and L1(4), its ORF I sequence does not. The L1(mlvi2) ORF I was derived from an ancestral ORF I sequence that was the evolutionary precursor of the L1(3) and L1(4) ORF I. We suggest that an ancestral ORF I sequence was recruited into the modern L1(mlvi2) subfamily by recombination that possibly could have resulted from template strand switching by the reverse transcriptase during L1 replication. This mechanism could also account for some of the structural features of rodent L1 5' UTR and ORF I sequences including one of the more dramatic features of L1 evolution in mammals, namely the repeated acquisition of novel 5' UTRs. PMID:9178013

  3. Comparative Sequence Analysis of the X-Inactivation Center Region in Mouse, Human, and Bovine

    PubMed Central

    Chureau, Corinne; Prissette, Marine; Bourdet, Agnès; Barbe, Valérie; Cattolico, Laurence; Jones, Louis; Eggen, André; Avner, Philip; Duret, Laurent

    2002-01-01

    We have sequenced to high levels of accuracy 714-kb and 233-kb regions of the mouse and bovine X-inactivation centers (Xic), respectively, centered on the Xist gene. This has provided the basis for a fully annotated comparative analysis of the mouse Xic with the 2.3-Mb orthologous region in human and has allowed a three-way species comparison of the core central region, including the Xist gene. These comparisons have revealed conserved genes, both coding and noncoding, conserved CpG islands and, more surprisingly, conserved pseudogenes. The distribution of repeated elements, especially LINE repeats, in the mouse Xic region when compared to the rest of the genome does not support the hypothesis of a role for these repeat elements in the spreading of X inactivation. Interestingly, an asymmetric distribution of LINE elements on the two DNA strands was observed in the three species, not only within introns but also in intergenic regions. This feature is suggestive of important transcriptional activity within these intergenic regions. In silico prediction followed by experimental analysis has allowed four new genes, Cnbp2, Ftx, Jpx, and Ppnx, to be identified and novel, widespread, complex, and apparently noncoding transcriptional activity to be characterized in a region 5′ of Xist that was recently shown to attract histone modification early after the onset of X inactivation. [The sequence data described in this paper have been submitted to the EMBL data library under accession nos. AJ421478, AJ421479, AJ421480, and AJ421481. Online supplemental data are available at http://pbil.univ-lyon1.fr/datasets/Xic2002/data.html and www.genome.org.] PMID:12045143

  4. Ribosomal RNA Genes Contribute to the Formation of Pseudogenes and Junk DNA in the Human Genome.

    PubMed

    Robicheau, Brent M; Susko, Edward; Harrigan, Amye M; Snyder, Marlene

    2017-02-01

    Approximately 35% of the human genome can be identified as sequence devoid of a selected-effect function, and not derived from transposable elements or repeated sequences. We provide evidence supporting a known origin for a fraction of this sequence. We show that: 1) highly degraded, but near full length, ribosomal DNA (rDNA) units, including both 45S and Intergenic Spacer (IGS), can be found at multiple sites in the human genome on chromosomes without rDNA arrays, 2) that these rDNA sequences have a propensity for being centromere proximal, and 3) that sequence at all human functional rDNA array ends is divergent from canonical rDNA to the point that it is pseudogenic. We also show that small sequence strings of rDNA (from 45S + IGS) can be found distributed throughout the genome and are identifiable as an "rDNA-like signal", representing 0.26% of the q-arm of HSA21 and ∼2% of the total sequence of other regions tested. The size of sequence strings found in the rDNA-like signal intergrade into the size of sequence strings that make up the full-length degrading rDNA units found scattered throughout the genome. We conclude that the displaced and degrading rDNA sequences are likely of a similar origin but represent different stages in their evolution towards random sequence. Collectively, our data suggests that over vast evolutionary time, rDNA arrays contribute to the production of junk DNA. The concept that the production of rDNA pseudogenes is a by-product of concerted evolution represents a previously under-appreciated process; we demonstrate here its importance. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. Isolation and analysis of a multifunctional triterpene synthase KcMS promoter region from mangrove plant kandelia candel

    NASA Astrophysics Data System (ADS)

    Basyuni, M.; Wati, R.; Sulistiyono, N.; Sumardi; Oku, H.; Baba, S.; Sagami, H.

    2018-03-01

    Molecular cloning of Kandelia candel KcMS gene has previously been cloned and encoded a multifunctional triterpene synthase. In this study, the KcMS gene promoter was cloned through Genome walking, sequenced, and analyzed. A 1,358 bp genomic DNA fragment of KcMS promoter was obtained. PLACE and PlantCARE analysis of the KcMS promoter revealed that there was some regulatory elements in response to environmental signals and involved in the regulation of gene expression. Results showed that four kinds of elements are regulated by hormone binding, namely 2 MeJA-responsiveness elements (CGTCA-motif and TGACG-motif), the ABRE (TACGTG) involved in abscisic acid responsiveness, gibberellin-related GARE-motif (AAACAGA), and the TGA-element (AACGAC) as an auxin-responsive element. Several elements in the KcMS have been shown in other plants to be responsive to abiotic stress. These motifs were MBS (CAACTG), TC-rich repeats, and eight light responsive elements. The KcMS promoter was also involved in the activation of defense genes in plants such as HSE (AAAAAATTC) and four circadian control elements (CAANNNNATC). The presence of multipotential regulatory motifs suggested that KcMS may be involved in regulation of plant tolerance to several types of stresses.

  6. Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain.

    PubMed

    de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

    2014-06-01

    The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Parasitism and the retrotransposon life cycle in plants: a hitchhiker's guide to the genome.

    PubMed

    Sabot, F; Schulman, A H

    2006-12-01

    LTR (long terminal repeat) retrotransposons are the main components of higher plant genomic DNA. They have shaped their host genomes through insertional mutagenesis and by effects on genome size, gene expression and recombination. These Class I transposable elements are closely related to retroviruses such as the HIV by their structure and presumptive life cycle. However, the retrotransposon life cycle has been closely investigated in few systems. For retroviruses and retrotransposons, individual defective copies can parasitize the activity of functional ones. However, some LTR retrotransposon groups as a whole, such as large retrotransposon derivatives and terminal repeats in miniature, are non-autonomous even though their genomic insertion patterns remain polymorphic between organismal accessions. Here, we examine what is known of the retrotransposon life cycle in plants, and in that context discuss the role of parasitism and complementation between and within retrotransposon groups.

  8. Understanding the recognition mechanisms of Zα domain of human editing enzyme ADAR1 (hZα(ADAR1)) and various Z-DNAs from molecular dynamics simulation.

    PubMed

    Wang, Qianqian; Li, Lanlan; Wang, Xiaoting; Liu, Huanxiang; Yao, Xiaojun

    2014-11-01

    The Z-DNA-binding domain of human double-stranded RNA adenosine deaminase I (hZαADAR1) can specifically recognize the left-handed Z-DNA which preferentially occurs at alternating purine-pyrimidine repeats, especially the CG-repeats. The interactions of hZαADAR1 and Z-DNAs in different sequence contexts can affect many important biological functions including gene regulation and chromatin remodeling. Therefore it is of great necessity to fully understand their recognition mechanisms. However, most existing studies are aimed at the standard CG-repeat Z-DNA rather than the non-CG-repeats, and whether the molecular basis of hZαADAR1 binding to various Z-DNAs are identical or not is still unclear on the atomic level. Here, based on the recently determined crystal structures of three representative non-CG-repeat Z-DNAs (d(CACGTG)2, d(CGTACG)2 and d(CGGCCG)2) in complex with hZαADAR1, 40 ns molecular dynamics simulation together with binding free energy calculation were performed for each system. For comparison, the standard CG-repeat Z-DNA (d(CGCGCG)2) complexed with hZαADAR1 was also simulated. The consistent results demonstrate that nonpolar interaction is the driving force during the protein-DNA binding process, and that polar interaction mainly from helix α3 also provides important contributions. Five common hot-spot residues were identified, namely Lys169, Lys170, Asn173, Arg174 and Tyr177. Hydrogen bond analysis coupled with surface charge distribution further reveal the interfacial information between hZαADAR1 and Z-DNA in detail. All of the analysis illustrate that four complexes share the common key features and the similar binding modes irrespective of Z-DNA sequences, suggesting that Z-DNA recognition by hZαADAR1 is conformation-specific rather than sequence-specific. Additionally, by analyzing the conformational changes of hZαADAR1, we found that the binding of Z-DNA could effectively stabilize hZαADAR1 protein. Our study can provide some valuable information for better understanding the binding mechanism between hZαADAR1 or even other Z-DNA-binding protein and Z-DNA.

  9. C9orf72 nucleotide repeat structures initiate molecular cascades of disease.

    PubMed

    Haeusler, Aaron R; Donnelly, Christopher J; Periz, Goran; Simko, Eric A J; Shaw, Patrick G; Kim, Min-Sik; Maragakis, Nicholas J; Troncoso, Juan C; Pandey, Akhilesh; Sattler, Rita; Rothstein, Jeffrey D; Wang, Jiou

    2014-03-13

    A hexanucleotide repeat expansion (HRE), (GGGGCC)n, in C9orf72 is the most common genetic cause of the neurodegenerative diseases amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). Here we identify a molecular mechanism by which structural polymorphism of the HRE leads to ALS/FTD pathology and defects. The HRE forms DNA and RNA G-quadruplexes with distinct structures and promotes RNA•DNA hybrids (R-loops). The structural polymorphism causes a repeat-length-dependent accumulation of transcripts aborted in the HRE region. These transcribed repeats bind to ribonucleoproteins in a conformation-dependent manner. Specifically, nucleolin, an essential nucleolar protein, preferentially binds the HRE G-quadruplex, and patient cells show evidence of nucleolar stress. Our results demonstrate that distinct C9orf72 HRE structural polymorphism at both DNA and RNA levels initiates molecular cascades leading to ALS/FTD pathologies, and provide the basis for a mechanistic model for repeat-associated neurodegenerative diseases.

  10. LPA gene: interaction between the apolipoprotein(a) size ('kringle IV' repeat) polymorphism and a pentanucleotide repeat polymorphism influences Lp(a) lipoprotein level.

    PubMed

    Røsby, O; Berg, K

    2000-01-01

    In order to search for factors influencing the Lp(a) lipoprotein level, we have examined the apolipoprotein(a) (apo(a)) size polymorphism as well as a pentanucleotide (TTTTA) repeat polymorphism in the 5' control region of the LPA gene. Lp(a) lipoprotein levels were compared between individuals with different genotypes as defined by pulsed field gel electrophoresis of DNA plugs, and PCR of DNA samples followed by polyacrylamide gel electrophoresis. DNA plugs and DNA were prepared from blood samples collected from blood donors. Twenty-seven different K IV repeat alleles were observed in the 71 women and 92 men from which apo(a) size polymorphism results were obtained. Alleles encoding 26-32 Kringle IV repeats were the most frequent. Alleles encoding seven to 11 TTTTA repeats were detected in the 84 women and 122 men included in the pentanucleotide polymorphism study, and homozygosity for eight TTTTA repeats was the most common genotype. The eight TTTTA repeat allele occurred with almost any apo(a) allele. An inverse relationship between number of K IV repeats and Lp(a) concentration was confirmed. The contributions of the apo(a) size polymorphism and the pentanucleotide repeat polymorphism to the interindividual variance of Lp(a) lipoprotein concentrations were 9.7 and 3.5%, respectively (type IV sum of squares). Nineteen per cent of the variance in Lp(a) lipoprotein level appeared to be the result of the multiplication product (interaction) between the apo(a) size polymorphism and the pentanucleotide repeat polymorphism. The contribution of the apo(a) size polymorphism alone to the variation in Lp(a) lipoprotein level was lower than previously reported. However, the multiplicative interaction effect between the K IV repeat polymorphism and the pentanucleotide repeat polymorphism may be an important factor explaining the variation in Lp(a) lipoprotein levels among the populations.

  11. Accurate quantification of chromosomal lesions via short tandem repeat analysis using minimal amounts of DNA.

    PubMed

    Jann, Johann-Christoph; Nowak, Daniel; Nolte, Florian; Fey, Stephanie; Nowak, Verena; Obländer, Julia; Pressler, Jovita; Palme, Iris; Xanthopoulos, Christina; Fabarius, Alice; Platzbecker, Uwe; Giagounidis, Aristoteles; Götze, Katharina; Letsch, Anne; Haase, Detlef; Schlenk, Richard; Bug, Gesine; Lübbert, Michael; Ganser, Arnold; Germing, Ulrich; Haferlach, Claudia; Hofmann, Wolf-Karsten; Mossner, Maximilian

    2017-09-01

    Cytogenetic aberrations such as deletion of chromosome 5q (del(5q)) represent key elements in routine clinical diagnostics of haematological malignancies. Currently established methods such as metaphase cytogenetics, FISH or array-based approaches have limitations due to their dependency on viable cells, high costs or semi-quantitative nature. Importantly, they cannot be used on low abundance DNA. We therefore aimed to establish a robust and quantitative technique that overcomes these shortcomings. For precise determination of del(5q) cell fractions, we developed an inexpensive multiplex-PCR assay requiring only nanograms of DNA that simultaneously measures allelic imbalances of 12 independent short tandem repeat markers. Application of this method to n=1142 samples from n=260 individuals revealed strong intermarker concordance (R²=0.77-0.97) and reproducibility (mean SD: 1.7%). Notably, the assay showed accurate quantification via standard curve assessment (R²>0.99) and high concordance with paired FISH measurements (R²=0.92) even with subnanogram amounts of DNA. Moreover, cytogenetic response was reliably confirmed in del(5q) patients with myelodysplastic syndromes treated with lenalidomide. While the assay demonstrated good diagnostic accuracy in receiver operating characteristic analysis (area under the curve: 0.97), we further observed robust correlation between bone marrow and peripheral blood samples (R²=0.79), suggesting its potential suitability for less-invasive clonal monitoring. In conclusion, we present an adaptable tool for quantification of chromosomal aberrations, particularly in problematic samples, which should be easily applicable to further tumour entities. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  12. DNA mismatch repair complex MutSβ promotes GAA·TTC repeat expansion in human cells.

    PubMed

    Halabi, Anasheh; Ditch, Scott; Wang, Jeffrey; Grabczyk, Ed

    2012-08-24

    While DNA repair has been implicated in CAG·CTG repeat expansion, its role in the GAA·TTC expansion of Friedreich ataxia (FRDA) is less clear. We have developed a human cellular model that recapitulates the DNA repeat expansion found in FRDA patient tissues. In this model, GAA·TTC repeats expand incrementally and continuously. We have previously shown that the expansion rate is linked to transcription within the repeats. Our working hypothesis is that structures formed within the GAA·TTC repeat during transcription attract DNA repair enzymes that then facilitate the expansion process. MutSβ, a heterodimer of MSH2 and MSH3, is known to have a role in CAG·CTG repeat expansion. We now show that shRNA knockdown of either MSH2 or MSH3 slowed GAA·TTC expansion in our system. We further characterized the role of MutSβ in GAA·TTC expansion using a functional assay in primary FRDA patient-derived fibroblasts. These fibroblasts have no known propensity for instability in their native state. Ectopic expression of MSH2 and MSH3 induced GAA·TTC repeat expansion in the native FXN gene. MSH2 is central to mismatch repair and its absence or reduction causes a predisposition to cancer. Thus, despite its essential role in GAA·TTC expansion, MSH2 is not an attractive therapeutic target. The absence or reduction of MSH3 is not strongly associated with cancer predisposition. Accordingly, MSH3 has been suggested as a therapeutic target for CAG·CTG repeat expansion disorders. Our results suggest that MSH3 may also serve as a therapeutic target to slow the expansion of GAA·TTC repeats in the future.

  13. DNA Mismatch Repair Complex MutSβ Promotes GAA·TTC Repeat Expansion in Human Cells*

    PubMed Central

    Halabi, Anasheh; Ditch, Scott; Wang, Jeffrey; Grabczyk, Ed

    2012-01-01

    While DNA repair has been implicated in CAG·CTG repeat expansion, its role in the GAA·TTC expansion of Friedreich ataxia (FRDA) is less clear. We have developed a human cellular model that recapitulates the DNA repeat expansion found in FRDA patient tissues. In this model, GAA·TTC repeats expand incrementally and continuously. We have previously shown that the expansion rate is linked to transcription within the repeats. Our working hypothesis is that structures formed within the GAA·TTC repeat during transcription attract DNA repair enzymes that then facilitate the expansion process. MutSβ, a heterodimer of MSH2 and MSH3, is known to have a role in CAG·CTG repeat expansion. We now show that shRNA knockdown of either MSH2 or MSH3 slowed GAA·TTC expansion in our system. We further characterized the role of MutSβ in GAA·TTC expansion using a functional assay in primary FRDA patient-derived fibroblasts. These fibroblasts have no known propensity for instability in their native state. Ectopic expression of MSH2 and MSH3 induced GAA·TTC repeat expansion in the native FXN gene. MSH2 is central to mismatch repair and its absence or reduction causes a predisposition to cancer. Thus, despite its essential role in GAA·TTC expansion, MSH2 is not an attractive therapeutic target. The absence or reduction of MSH3 is not strongly associated with cancer predisposition. Accordingly, MSH3 has been suggested as a therapeutic target for CAG·CTG repeat expansion disorders. Our results suggest that MSH3 may also serve as a therapeutic target to slow the expansion of GAA·TTC repeats in the future. PMID:22787155

  14. Concerted evolution of the tandem array encoding primate U2 snRNA occurs in situ, without changing the cytological context of the RNU2 locus.

    PubMed Central

    Pavelitz, T; Rusché, L; Matera, A G; Scharf, J M; Weiner, A M

    1995-01-01

    In primates, the tandemly repeated genes encoding U2 small nuclear RNA evolve concertedly, i.e. the sequence of the U2 repeat unit is essentially homogeneous within each species but differs somewhat between species. Using chromosome painting and the NGFR gene as an outside marker, we show that the U2 tandem array (RNU2) has remained at the same chromosomal locus (equivalent to human 17q21) through multiple speciation events over > 35 million years leading to the Old World monkey and hominoid lineages. The data suggest that the U2 tandem repeat, once established in the primate lineage, contained sequence elements favoring perpetuation and concerted evolution of the array in situ, despite a pericentric inversion in chimpanzee, a reciprocal translocation in gorilla and a paracentric inversion in orang utan. Comparison of the 11 kb U2 repeat unit found in baboon and other Old World monkeys with the 6 kb U2 repeat unit in humans and other hominids revealed that an ancestral U2 repeat unit was expanded by insertion of a 5 kb retrovirus bearing 1 kb long terminal repeats (LTRs). Subsequent excision of the provirus by homologous recombination between the LTRs generated a 6 kb U2 repeat unit containing a solo LTR. Remarkably, both junctions between the human U2 tandem array and flanking chromosomal DNA at 17q21 fall within the solo LTR sequence, suggesting a role for the LTR in the origin or maintenance of the primate U2 array. Images PMID:7828589

  15. Long repeating (TTAGGG)n single stranded DNA self-condenses into compact beaded filaments stabilized by G-quadruplex formation.

    PubMed

    Kar, Anirban; Jones, Nathan; Arat, N Özlem; Fishel, Richard; Griffith, Jack

    2018-04-19

    Conformations adopted by long stretches of single stranded DNA (ssDNA) are of central interest in understanding the architecture of replication forks, R loops, and other structures generated during DNA metabolism in vivo. This is particularly so if the ssDNA consists of short nucleotide repeats. Such studies have been hampered by the lack of defined substrates greater than ~150 nt, and the absence of high-resolution biophysical approaches. Here we describe the generation of very long ssDNA consisting of the mammalian telomeric repeat (5'-TTAGGG-3')n as well as the interrogation of its structure by electron microscopy (EM) and single molecule magnetic tweezers (smMT). This repeat is of particular interest as it contains a run of 3 contiguous guanine residues capable of forming G quartets as ssDNA. Fluorescent-dye exclusion assays confirmed that this G-strand ssDNA forms ubiquitous G-quadruplex folds. EM revealed thick bead-like filaments that condensed the DNA ~12 fold. The bead-like structures were 5 nm and 8 nm in diameter and linked by thin filaments. The G-strand ssDNA displayed initial stability to smMT force extension that ultimately released in steps that were multiples ~28 nm at forces between 6-12 pN; well below the >20 pN required to unravel G-quadruplexes. Most smMT steps were consistent with the disruption of the beads seen by EM. Binding by RAD51 distinctively altered the force extension properties of the G-strand ssDNA, suggesting a stochastic G-quadruplex-dependent condensation model that is discussed. Published under license by The American Society for Biochemistry and Molecular Biology, Inc.

  16. Multicenter Evaluation of Epidemiological Typing of Methicillin-Resistant Staphylococcus aureus Strains by Repetitive-Element PCR Analysis

    PubMed Central

    Deplano, Ariane; Schuermans, Annette; Van Eldere, Johan; Witte, Wolfgang; Meugnier, Hèléne; Etienne, Jerome; Grundmann, Hajo; Jonas, Daniel; Noordhoek, Gerda T.; Dijkstra, Jolanda; van Belkum, Alex; van Leeuwen, Willem; Tassios, Panayotis T.; Legakis, Nicholas J.; van der Zee, Anneke; Bergmans, Anneke; Blanc, Dominique S.; Tenover, Fred C.; Cookson, Barry C.; O'Neil, Gael; Struelens, Marc J.

    2000-01-01

    Rapid and efficient epidemiologic typing systems would be useful to monitor transmission of methicillin-resistant Staphylococcus aureus (MRSA) at both local and interregional levels. To evaluate the intralaboratory performance and interlaboratory reproducibility of three recently developed repeat-element PCR (rep-PCR) methods for the typing of MRSA, 50 MRSA strains characterized by pulsed-field gel electrophoresis (PFGE) (SmaI) analysis and epidemiological data were blindly typed by inter-IS256, 16S-23S ribosomal DNA (rDNA), and MP3 PCR in 12 laboratories in eight countries using standard reagents and protocols. Performance of typing was defined by reproducibility (R), discriminatory power (D), and agreement with PFGE analysis. Interlaboratory reproducibility of pattern and type classification was assessed visually and using gel analysis software. Each typing method showed a different performance level in each center. In the center performing best with each method, inter-IS256 PCR typing achieved R = 100% and D = 100%; 16S-23S rDNA PCR, R = 100% and D = 82%; and MP3 PCR, R = 80% and D = 83%. Concordance between rep-PCR type and PFGE type ranged by center: 70 to 90% for inter-IS256 PCR, 44 to 57% for 16S-23S rDNA PCR, and 53 to 54% for MP3 PCR analysis. In conclusion, the performance of inter-IS256 PCR typing was similar to that of PFGE analysis in some but not all centers, whereas other rep-PCR protocols showed lower discrimination and intralaboratory reproducibility. None of these assays, however, was sufficiently reproducible for interlaboratory exchange of data. PMID:11015358

  17. Molecular characterization of the canine mitochondrial DNA control region for forensic applications.

    PubMed

    Eichmann, Cordula; Parson, Walther

    2007-09-01

    The canine mitochondrial DNA (mtDNA) control region of 133 dogs living in the area around Innsbruck, Austria was sequenced. A total of 40 polymorphic sites were observed in the first hypervariable segment and 15 in the second, which resulted in the differentiation of 40 distinct haplotypes. We observed five nucleotide positions that were highly polymorphic within different haplogroups, and they represent good candidates for mtDNA screening. We found five point heteroplasmic positions; all located in HVS-I and a polythymine region in HVS-II, the latter often being associated with length heteroplasmy. In contrast to human mtDNA, the canine control region contains a hypervariable 10 nucleotide repeat region, which is located between the two hypervariable regions. In our population sample, we observed eight different repeat types, which we characterized by direct sequencing and fragment length analysis. The discrimination power of the canine mtDNA control region was 0.93, not taking the polymorphic repeat region into consideration.

  18. STRBase: a short tandem repeat DNA database for the human identity testing community

    PubMed Central

    Ruitberg, Christian M.; Reeder, Dennis J.; Butler, John M.

    2001-01-01

    The National Institute of Standards and Technology (NIST) has compiled and maintained a Short Tandem Repeat DNA Internet Database (http://www.cstl.nist.gov/biotech/strbase/) since 1997 commonly referred to as STRBase. This database is an information resource for the forensic DNA typing community with details on commonly used short tandem repeat (STR) DNA markers. STRBase consolidates and organizes the abundant literature on this subject to facilitate on-going efforts in DNA typing. Observed alleles and annotated sequence for each STR locus are described along with a review of STR analysis technologies. Additionally, commercially available STR multiplex kits are described, published polymerase chain reaction (PCR) primer sequences are reported, and validation studies conducted by a number of forensic laboratories are listed. To supplement the technical information, addresses for scientists and hyperlinks to organizations working in this area are available, along with the comprehensive reference list of over 1300 publications on STRs used for DNA typing purposes. PMID:11125125

  19. Comparative Chloroplast Genomes of Pinaceae: Insights into the Mechanism of Diversified Genomic Organizations

    PubMed Central

    Wu, Chung-Shien; Lin, Ching-Ping; Hsu, Chi-Yao; Wang, Rui-Jiang; Chaw, Shu-Miaw

    2011-01-01

    Abstract Pinaceae, the largest family of conifers, has diversified organizations of chloroplast genomes (cpDNAs) with the two typical inverted repeats (IRs) highly reduced. To unravel the mechanism of this genomic diversification, we examined the cpDNA organizations from 53 species of the ten Pinaceous genera, including those of Larix decidua (122,474 bp), Picea morrisonicola (124,168 bp), and Pseudotsuga wilsoniana (122,513 bp), which were firstly elucidated. The results uncovered four distinct cpDNA forms (A−C and P) that are due to rearrangements of two ∼20 and ∼21 kb specific fragments. The C form was documented for the first time and the A form might be the most ancestral one. In addition, only the individuals of Ps. macrocarpa and Ps. wilsoniana were detected to have isomeric cpDNA forms. Three types (types 1−3) of Pinaceae-specific repeats situated nearby the rearranged fragments were found to be syntenic. We hypothesize that type 1 (949 ± 343 bp) and type 3 (608 ± 73 bp) repeats are substrates for homologous recombination (HR), whereas type 2 repeats are likely inactive for HR because of their relatively short sizes (151 ± 30 bp). Conversions among the four distinct forms may be achieved by HR and mediated by type 1 or 3 repeats, thus resulting in increased diversity of cpDNA organizations. We propose that in the Pinaceae cpDNAs, the reduced IRs have lost HR activity, then decreasing the diversity of cpDNA organizations, but the specific repeats that the evolution endowed Pinaceae complement the reduced IRs and increase the diversity of cpDNA organizations. PMID:21402866

  20. Schizosaccharomyces pombe MutSα and MutLα Maintain Stability of Tetra-Nucleotide Repeats and Msh3 of Hepta-Nucleotide Repeats

    PubMed Central

    Villahermosa, Desirée; Christensen, Olaf; Knapp, Karen; Fleck, Oliver

    2017-01-01

    Defective mismatch repair (MMR) in humans is associated with colon cancer and instability of microsatellites, that is, DNA sequences with one or several nucleotides repeated. Key factors of eukaryotic MMR are the heterodimers MutSα (Msh2-Msh6), which recognizes base-base mismatches and unpaired nucleotides in DNA, and MutLα (Mlh1-Pms1), which facilitates downstream steps. In addition, MutSβ (Msh2-Msh3) recognizes DNA loops of various sizes, although our previous data and the data presented here suggest that Msh3 of Schizosaccharomyces pombe does not play a role in MMR. To test microsatellite stability in S. pombe and hence DNA loop repair, we have inserted tetra-, penta-, and hepta-nucleotide repeats in the ade6 gene and determined their Ade+ reversion rates and spectra in wild type and various mutants. Our data indicate that loops with four unpaired nucleotides in the nascent and the template strand are the upper limit of MutSα- and MutLα-mediated MMR in S. pombe. Stability of hepta-nucleotide repeats requires Msh3 and Exo1 in MMR-independent processes as well as the DNA repair proteins Rad50, Rad51, and Rad2FEN1. Most strikingly, mutation rates in the double mutants msh3 exo1 and msh3 rad51 were decreased when compared to respective single mutants, indicating that Msh3 prevents error prone processes carried out by Exo1 and Rad51. We conclude that Msh3 has no obvious function in MMR in S. pombe, but contributes to DNA repeat stability in MMR-independent processes. PMID:28341698

  1. Schizosaccharomyces pombe MutSα and MutLα Maintain Stability of Tetra-Nucleotide Repeats and Msh3 of Hepta-Nucleotide Repeats.

    PubMed

    Villahermosa, Desirée; Christensen, Olaf; Knapp, Karen; Fleck, Oliver

    2017-05-05

    Defective mismatch repair (MMR) in humans is associated with colon cancer and instability of microsatellites, that is, DNA sequences with one or several nucleotides repeated. Key factors of eukaryotic MMR are the heterodimers MutSα (Msh2-Msh6), which recognizes base-base mismatches and unpaired nucleotides in DNA, and MutLα (Mlh1-Pms1), which facilitates downstream steps. In addition, MutSβ (Msh2-Msh3) recognizes DNA loops of various sizes, although our previous data and the data presented here suggest that Msh3 of Schizosaccharomyces pombe does not play a role in MMR. To test microsatellite stability in S. pombe and hence DNA loop repair, we have inserted tetra-, penta-, and hepta-nucleotide repeats in the ade6 gene and determined their Ade + reversion rates and spectra in wild type and various mutants. Our data indicate that loops with four unpaired nucleotides in the nascent and the template strand are the upper limit of MutSα- and MutLα-mediated MMR in S. pombe Stability of hepta-nucleotide repeats requires Msh3 and Exo1 in MMR-independent processes as well as the DNA repair proteins Rad50, Rad51, and Rad2 FEN1 Most strikingly, mutation rates in the double mutants msh3 exo1 and msh3 rad51 were decreased when compared to respective single mutants, indicating that Msh3 prevents error prone processes carried out by Exo1 and Rad51. We conclude that Msh3 has no obvious function in MMR in S. pombe , but contributes to DNA repeat stability in MMR-independent processes. Copyright © 2017 Villahermosa et al.

  2. Giant Reverse Transcriptase-Encoding Transposable Elements at Telomeres.

    PubMed

    Arkhipova, Irina R; Yushenova, Irina A; Rodriguez, Fernando

    2017-09-01

    Transposable elements are omnipresent in eukaryotic genomes and have a profound impact on chromosome structure, function and evolution. Their structural and functional diversity is thought to be reasonably well-understood, especially in retroelements, which transpose via an RNA intermediate copied into cDNA by the element-encoded reverse transcriptase, and are characterized by a compact structure. Here, we report a novel type of expandable eukaryotic retroelements, which we call Terminons. These elements can attach to G-rich telomeric repeat overhangs at the chromosome ends, in a process apparently facilitated by complementary C-rich repeats at the 3'-end of the RNA template immediately adjacent to a hammerhead ribozyme motif. Terminon units, which can exceed 40 kb in length, display an unusually complex and diverse structure, and can form very long chains, with host genes often captured between units. As the principal polymerizing component, Terminons contain Athena reverse transcriptases previously described in bdelloid rotifers and belonging to the enigmatic group of Penelope-like elements, but can additionally accumulate multiple cooriented ORFs, including DEDDy 3'-exonucleases, GDSL esterases/lipases, GIY-YIG-like endonucleases, rolling-circle replication initiator (Rep) proteins, and putatively structural ORFs with coiled-coil motifs and transmembrane domains. The extraordinary length and complexity of Terminons and the high degree of interfamily variability in their ORF content challenge the current views on the structural organization of eukaryotic retroelements, and highlight their possible connections with the viral world and the implications for the elevated frequency of gene transfer. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  3. The Ty1 LTR-retrotransposon of budding yeast, Saccharomyces cerevisiae

    PubMed Central

    Curcio, M. Joan; Lutz, Sheila; Lesage, Pascale

    2015-01-01

    Summary Long-terminal repeat (LTR)-retrotransposons generate a copy of their DNA (cDNA) by reverse transcription of their RNA genome in cytoplasmic nucleocapsids. They are widespread in the eukaryotic kingdom and are the evolutionary progenitors of retroviruses [1]. The Ty1 element of the budding yeast Saccharomyces cerevisiae was the first LTR-retrotransposon demonstrated to mobilize through an RNA intermediate, and not surprisingly, is the best studied. The depth of our knowledge of Ty1 biology stems not only from the predominance of active Ty1 elements in the S. cerevisiae genome but also the ease and breadth of genomic, biochemical and cell biology approaches available to study cellular processes in yeast. This review describes the basic structure of Ty1 and its gene products, the replication cycle, the rapidly expanding compendium of host co-factors known to influence retrotransposition and the nature of Ty1's elaborate symbiosis with its host. Our goal is to illuminate the value of Ty1 as a paradigm to explore the biology of LTR-retrotransposons in multicellular organisms, where the low frequency of retrotransposition events presents a formidable barrier to investigations of retrotransposon biology. PMID:25893143

  4. Population-scale whole genome sequencing identifies 271 highly polymorphic short tandem repeats from Japanese population.

    PubMed

    Hirata, Satoshi; Kojima, Kaname; Misawa, Kazuharu; Gervais, Olivier; Kawai, Yosuke; Nagasaki, Masao

    2018-05-01

    Forensic DNA typing is widely used to identify missing persons and plays a central role in forensic profiling. DNA typing usually uses capillary electrophoresis fragment analysis of PCR amplification products to detect the length of short tandem repeat (STR) markers. Here, we analyzed whole genome data from 1,070 Japanese individuals generated using massively parallel short-read sequencing of 162 paired-end bases. We have analyzed 843,473 STR loci with two to six basepair repeat units and cataloged highly polymorphic STR loci in the Japanese population. To evaluate the performance of the cataloged STR loci, we compared 23 STR loci, widely used in forensic DNA typing, with capillary electrophoresis based STR genotyping results in the Japanese population. Seventeen loci had high correlations and high call rates. The other six loci had low call rates or low correlations due to either the limitations of short-read sequencing technology, the bioinformatics tool used, or the complexity of repeat patterns. With these analyses, we have also purified the suitable 218 STR loci with four basepair repeat units and 53 loci with five basepair repeat units both for short read sequencing and PCR based technologies, which would be candidates to the actual forensic DNA typing in Japanese population.

  5. Characterizing the strand-specific distribution of non-CpG methylation in human pluripotent cells.

    PubMed

    Guo, Weilong; Chung, Wen-Yu; Qian, Minping; Pellegrini, Matteo; Zhang, Michael Q

    2014-03-01

    DNA methylation is an important defense and regulatory mechanism. In mammals, most DNA methylation occurs at CpG sites, and asymmetric non-CpG methylation has only been detected at appreciable levels in a few cell types. We are the first to systematically study the strand-specific distribution of non-CpG methylation. With the divide-and-compare strategy, we show that CHG and CHH methylation are not intrinsically different in human embryonic stem cells (ESCs) and induced pluripotent stem cells (iPSCs). We also find that non-CpG methylation is skewed between the two strands in introns, especially at intron boundaries and in highly expressed genes. Controlling for the proximal sequences of non-CpG sites, we show that the skew of non-CpG methylation in introns is mainly guided by sequence skew. By studying subgroups of transposable elements, we also found that non-CpG methylation is distributed in a strand-specific manner in both short interspersed nuclear elements (SINE) and long interspersed nuclear elements (LINE), but not in long terminal repeats (LTR). Finally, we show that on the antisense strand of Alus, a non-CpG site just downstream of the A-box is highly methylated. Together, the divide-and-compare strategy leads us to identify regions with strand-specific distributions of non-CpG methylation in humans.

  6. Molecular organization and phylogenetic analysis of 5S rDNA in crustaceans of the genus Pollicipes reveal birth-and-death evolution and strong purifying selection.

    PubMed

    Perina, Alejandra; Seoane, David; González-Tizón, Ana M; Rodríguez-Fariña, Fernanda; Martínez-Lage, Andrés

    2011-10-17

    The 5S ribosomal DNA (5S rDNA) is organized in tandem arrays with repeat units that consist of a transcribing region (5S) and a variable nontranscribed spacer (NTS), in higher eukaryotes. Until recently the 5S rDNA was thought to be subject to concerted evolution, however, in several taxa, sequence divergence levels between the 5S and the NTS were found higher than expected under this model. So, many studies have shown that birth-and-death processes and selection can drive the evolution of 5S rDNA. In analyses of 5S rDNA evolution is found several 5S rDNA types in the genome, with low levels of nucleotide variation in the 5S and a spacer region highly divergent. Molecular organization and nucleotide sequence of the 5S ribosomal DNA multigene family (5S rDNA) were investigated in three Pollicipes species in an evolutionary context. The nucleotide sequence variation revealed that several 5S rDNA variants occur in Pollicipes genomes. They are clustered in up to seven different types based on differences in their nontranscribed spacers (NTS). Five different units of 5S rDNA were characterized in P. pollicipes and two different units in P. elegans and P. polymerus. Analysis of these sequences showed that identical types were shared among species and that two pseudogenes were present. We predicted the secondary structure and characterized the upstream and downstream conserved elements. Phylogenetic analysis showed an among-species clustering pattern of 5S rDNA types. These results suggest that the evolution of Pollicipes 5S rDNA is driven by birth-and-death processes with strong purifying selection.

  7. The linked units of 5S rDNA and U1 snDNA of razor shells (Mollusca: Bivalvia: Pharidae).

    PubMed

    Vierna, J; Jensen, K T; Martínez-Lage, A; González-Tizón, A M

    2011-08-01

    The linkage between 5S ribosomal DNA and other multigene families has been detected in many eukaryote lineages, but whether it provides any selective advantage remains unclear. In this work, we report the occurrence of linked units of 5S ribosomal DNA (5S rDNA) and U1 small nuclear DNA (U1 snDNA) in 10 razor shell species (Mollusca: Bivalvia: Pharidae) from four different genera. We obtained several clones containing partial or complete repeats of both multigene families in which both types of genes displayed the same orientation. We provide a comprehensive collection of razor shell 5S rDNA clones, both with linked and nonlinked organisation, and the first bivalve U1 snDNA sequences. We predicted the secondary structures and characterised the upstream and downstream conserved elements, including a region at -25 nucleotides from both 5S rDNA and U1 snDNA transcription start sites. The analysis of 5S rDNA showed that some nontranscribed spacers (NTSs) are more closely related to NTSs from other species (and genera) than to NTSs from the species they were retrieved from, suggesting birth-and-death evolution and ancestral polymorphism. Nucleotide conservation within the functional regions suggests the involvement of purifying selection, unequal crossing-overs and gene conversions. Taking into account this and other studies, we discuss the possible mechanisms by which both multigene families could have become linked in the Pharidae lineage. The reason why 5S rDNA is often found linked to other multigene families seems to be the result of stochastic processes within genomes in which its high copy number is determinant.

  8. The linked units of 5S rDNA and U1 snDNA of razor shells (Mollusca: Bivalvia: Pharidae)

    PubMed Central

    Vierna, J; Jensen, K T; Martínez-Lage, A; González-Tizón, A M

    2011-01-01

    The linkage between 5S ribosomal DNA and other multigene families has been detected in many eukaryote lineages, but whether it provides any selective advantage remains unclear. In this work, we report the occurrence of linked units of 5S ribosomal DNA (5S rDNA) and U1 small nuclear DNA (U1 snDNA) in 10 razor shell species (Mollusca: Bivalvia: Pharidae) from four different genera. We obtained several clones containing partial or complete repeats of both multigene families in which both types of genes displayed the same orientation. We provide a comprehensive collection of razor shell 5S rDNA clones, both with linked and nonlinked organisation, and the first bivalve U1 snDNA sequences. We predicted the secondary structures and characterised the upstream and downstream conserved elements, including a region at −25 nucleotides from both 5S rDNA and U1 snDNA transcription start sites. The analysis of 5S rDNA showed that some nontranscribed spacers (NTSs) are more closely related to NTSs from other species (and genera) than to NTSs from the species they were retrieved from, suggesting birth-and-death evolution and ancestral polymorphism. Nucleotide conservation within the functional regions suggests the involvement of purifying selection, unequal crossing-overs and gene conversions. Taking into account this and other studies, we discuss the possible mechanisms by which both multigene families could have become linked in the Pharidae lineage. The reason why 5S rDNA is often found linked to other multigene families seems to be the result of stochastic processes within genomes in which its high copy number is determinant. PMID:21364693

  9. Molecular organization and phylogenetic analysis of 5S rDNA in crustaceans of the genus Pollicipes reveal birth-and-death evolution and strong purifying selection

    PubMed Central

    2011-01-01

    Background The 5S ribosomal DNA (5S rDNA) is organized in tandem arrays with repeat units that consist of a transcribing region (5S) and a variable nontranscribed spacer (NTS), in higher eukaryotes. Until recently the 5S rDNA was thought to be subject to concerted evolution, however, in several taxa, sequence divergence levels between the 5S and the NTS were found higher than expected under this model. So, many studies have shown that birth-and-death processes and selection can drive the evolution of 5S rDNA. In analyses of 5S rDNA evolution is found several 5S rDNA types in the genome, with low levels of nucleotide variation in the 5S and a spacer region highly divergent. Molecular organization and nucleotide sequence of the 5S ribosomal DNA multigene family (5S rDNA) were investigated in three Pollicipes species in an evolutionary context. Results The nucleotide sequence variation revealed that several 5S rDNA variants occur in Pollicipes genomes. They are clustered in up to seven different types based on differences in their nontranscribed spacers (NTS). Five different units of 5S rDNA were characterized in P. pollicipes and two different units in P. elegans and P. polymerus. Analysis of these sequences showed that identical types were shared among species and that two pseudogenes were present. We predicted the secondary structure and characterized the upstream and downstream conserved elements. Phylogenetic analysis showed an among-species clustering pattern of 5S rDNA types. Conclusions These results suggest that the evolution of Pollicipes 5S rDNA is driven by birth-and-death processes with strong purifying selection. PMID:22004418

  10. Noncoding origins of anthropoid traits and a new null model of transposon functionalization

    PubMed Central

    del Rosario, Ricardo C.H.; Rayan, Nirmala Arul

    2014-01-01

    Little is known about novel genetic elements that drove the emergence of anthropoid primates. We exploited the sequencing of the marmoset genome to identify 23,849 anthropoid-specific constrained (ASC) regions and confirmed their robust functional signatures. Of the ASC base pairs, 99.7% were noncoding, suggesting that novel anthropoid functional elements were overwhelmingly cis-regulatory. ASCs were highly enriched in loci associated with fetal brain development, motor coordination, neurotransmission, and vision, thus providing a large set of candidate elements for exploring the molecular basis of hallmark primate traits. We validated ASC192 as a primate-specific enhancer in proliferative zones of the developing brain. Unexpectedly, transposable elements (TEs) contributed to >56% of ASCs, and almost all TE families showed functional potential similar to that of nonrepetitive DNA. Three L1PA repeat-derived ASCs displayed coherent eye-enhancer function, thus demonstrating that the “gene-battery” model of TE functionalization applies to enhancers in vivo. Our study provides fundamental insights into genome evolution and the origins of anthropoid phenotypes and supports an elegantly simple new null model of TE exaptation. PMID:25043600

  11. Qualitative and Quantitative Assays of Transposition and Homologous Recombination of the Retrotransposon Tf1 in Schizosaccharomyces pombe.

    PubMed

    Sangesland, Maya; Atwood-Moore, Angela; Rai, Sudhir K; Levin, Henry L

    2016-01-01

    Transposition and homologous recombination assays are valuable genetic tools to measure the production and integration of cDNA from the long terminal repeat (LTR) retrotransposon Tf1 in the fission yeast (Schizosaccharomyces pombe). Here we describe two genetic assays, one that measures the transposition activity of Tf1 by monitoring the mobility of a drug resistance marked Tf1 element expressed from a multi-copy plasmid and another assay that measures homologous recombination between Tf1 cDNA and the expression plasmid. While the transposition assay measures insertion of full-length Tf1 cDNA mediated by the transposon integrase, the homologous recombination assay measures levels of cDNA present in the nucleus and is independent of integrase activity. Combined, these assays can be used to systematically screen large collections of strains to identify mutations that specifically inhibit the integration step in the retroelement life cycle. Such mutations can be identified because they reduce transposition activity but nevertheless have wild-type frequencies of homologous recombination. Qualitative assays of yeast patches on agar plates detect large defects in integration and recombination, while the quantitative approach provides a precise method of determining integration and recombination frequencies.

  12. Activation of RNA polymerase III transcription of human Alu repetitive elements by adenovirus type 5: Requirement for the E1b 58-Kilodalton protein and the products of E4 open reading frames 3 and 6

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Panning, B.; Smiley, J.R.

    1993-06-01

    Alu elements are the single most abundant class of dispersed repeated sequences in the human genome, comprising 5-10% of the mass of human DNA. This report demonstrates that Ad5 infection strongly stimulates Pol III transcription of human Alu elements in HeLa and 293 cells. In contrast to the cases of Ad5-induced Pol III transcriptional activation, this process requires the E1b 58-kDa protein and the products of E4 open reading frames (ORFs) 3 and 6 in addition to the E1a 289-residue product. These findings suggest novel regulatory properties of the Ad5 E1b and E4 proteins and raise the possibility that analogousmore » cellular trans-acting factors serve to modulate Alu expression in vivo.« less

  13. Fine organization of genomic regions tagged to the 5S rDNA locus of the bread wheat 5B chromosome.

    PubMed

    Sergeeva, Ekaterina M; Shcherban, Andrey B; Adonina, Irina G; Nesterov, Michail A; Beletsky, Alexey V; Rakitin, Andrey L; Mardanov, Andrey V; Ravin, Nikolai V; Salina, Elena A

    2017-11-14

    The multigene family encoding the 5S rRNA, one of the most important structurally-functional part of the large ribosomal subunit, is an obligate component of all eukaryotic genomes. 5S rDNA has long been a favored target for cytological and phylogenetic studies due to the inherent peculiarities of its structural organization, such as the tandem arrays of repetitive units and their high interspecific divergence. The complex polyploid nature of the genome of bread wheat, Triticum aestivum, and the technically difficult task of sequencing clusters of tandem repeats mean that the detailed organization of extended genomic regions containing 5S rRNA genes remains unclear. This is despite the recent progress made in wheat genomic sequencing. Using pyrosequencing of BAC clones, in this work we studied the organization of two distinct 5S rDNA-tagged regions of the 5BS chromosome of bread wheat. Three BAC-clones containing 5S rDNA were identified in the 5BS chromosome-specific BAC-library of Triticum aestivum. Using the results of pyrosequencing and assembling, we obtained six 5S rDNA- containing contigs with a total length of 140,417 bp, and two sets (pools) of individual 5S rDNA sequences belonging to separate, but closely located genomic regions on the 5BS chromosome. Both regions are characterized by the presence of approximately 70-80 copies of 5S rDNA, however, they are completely different in their structural organization. The first region contained highly diverged short-type 5S rDNA units that were disrupted by multiple insertions of transposable elements. The second region contained the more conserved long-type 5S rDNA, organized as a single tandem array. FISH using probes specific to both 5S rDNA unit types showed differences in the distribution and intensity of signals on the chromosomes of polyploid wheat species and their diploid progenitors. A detailed structural organization of two closely located 5S rDNA-tagged genomic regions on the 5BS chromosome of bread wheat has been established. These two regions differ in the organization of both 5S rDNA and the neighboring sequences comprised of transposable elements, implying different modes of evolution for these regions.

  14. Are mutagenic non D-loop direct repeat motifs in mitochondrial DNA under a negative selection pressure?

    PubMed Central

    Lakshmanan, Lakshmi Narayanan; Gruber, Jan; Halliwell, Barry; Gunawan, Rudiyanto

    2015-01-01

    Non D-loop direct repeats (DRs) in mitochondrial DNA (mtDNA) have been commonly implicated in the mutagenesis of mtDNA deletions associated with neuromuscular disease and ageing. Further, these DRs have been hypothesized to put a constraint on the lifespan of mammals and are under a negative selection pressure. Using a compendium of 294 mammalian mtDNA, we re-examined the relationship between species lifespan and the mutagenicity of such DRs. Contradicting the prevailing hypotheses, we found no significant evidence that long-lived mammals possess fewer mutagenic DRs than short-lived mammals. By comparing DR counts in human mtDNA with those in selectively randomized sequences, we also showed that the number of DRs in human mtDNA is primarily determined by global mtDNA properties, such as the bias in synonymous codon usage (SCU) and nucleotide composition. We found that SCU bias in mtDNA positively correlates with DR counts, where repeated usage of a subset of codons leads to more frequent DR occurrences. While bias in SCU and nucleotide composition has been attributed to nucleotide mutational bias, mammalian mtDNA still exhibit higher SCU bias and DR counts than expected from such mutational bias, suggesting a lack of negative selection against non D-loop DRs. PMID:25855815

  15. Characterization of a species-specific repetitive DNA from a highly endangered wild animal, Rhinoceros unicornis, and assessment of genetic polymorphism by microsatellite associated sequence amplification (MASA).

    PubMed

    Ali, S; Azfer, M A; Bashamboo, A; Mathur, P K; Malik, P K; Mathur, V B; Raha, A K; Ansari, S

    1999-03-04

    We have cloned and sequenced a 906bp EcoRI repeat DNA fraction from Rhinoceros unicornis genome. The contig pSS(R)2 is AT rich with 340 A (37.53%), 187 C (20.64%), 173 G (19.09%) and 206 T (22.74%). The sequence contains MALT box, NF-E1, Poly-A signal, lariat consensus sequences, TATA box, translational initiation sequences and several stop codons. Translation of the contig showed seven different types of protein motifs, among which, EGF-like domain cysteine pattern signatures and Bowman-Birk serine protease inhibitor family signatures were prominent. The presence of eukaryotic transcriptional elements, protein signatures and analysis of subset sequences in the 5' region from 1 to 165nt indicating coding potential (test code value=0.97) suggest possible regulatory and/or functional role(s) of these sequences in the rhino genome. Translation of the complementary strand from 906 to 706nt and 190 to 2nt showed proteins of more than 7kDa rich in non-polar residues. This suggests that pSS(R)2 is either a part of, or adjacent to, a functional gene. The contig contains mostly non-consecutive simple repeat units from 2 to 17nt with varying frequencies, of which four base motifs were found to be predominant. Zoo-blot hybridization revealed that pSS(R)2 sequences are unique to R. unicornis genome because they do not cross-hybridize, even with the genomic DNA of South African black rhino Diceros bicornis. Southern blot analysis of R. unicornis genomic DNA with pSS(R)2 and other synthetic oligo probes revealed a high level of genetic homogeneity, which was also substantiated by microsatellite associated sequence amplification (MASA). Owing to its uniqueness, the pSS(R)2 probe has a potential application in the area of conservation biology for unequivocal identification of horn or other body tissues of R. unicornis. The evolutionary aspect of this repeat fraction in the context of comparative genome analysis is discussed.

  16. Structure and Genetic Content of the Megaplasmids of Neurotoxigenic Clostridium butyricum Type E Strains from Italy

    PubMed Central

    Iacobino, Angelo; Scalfaro, Concetta; Franciosa, Giovanna

    2013-01-01

    We determined the genetic maps of the megaplasmids of six neutoroxigenic Clostridium butyricum type E strains from Italy using molecular and bioinformatics techniques. The megaplasmids are circular, not linear as we had previously proposed. The differently-sized megaplasmids share a genetic region that includes structural, metabolic and regulatory genes. In addition, we found that a 168 kb genetic region is present only in the larger megaplasmids of two tested strains, whereas it is absent from the smaller megaplasmids of the four remaining strains. The genetic region unique to the larger megaplasmids contains, among other features, a locus for clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR associated (cas) genes, i.e. a bacterial adaptive immune system providing sequence-specific protection from invading genetic elements. Some CRISPR spacer sequences of the neurotoxigenic C. butyricum type E strains showed homology to prophage, phage and plasmid sequences from closely related clostridia species or from distant species, all sharing the intestinal habitat, suggesting that the CRISPR locus might be involved in the microorganism adaptation to the human or animal intestinal environment. Besides, we report here that each of four distinct CRISPR spacers partially matched DNA sequences of different prophages and phages, at identical nucleotide locations. This suggests that, at least in neurotoxigenic C. butyricum type E, the CRISPR locus is potentially able to recognize the same conserved DNA sequence of different invading genetic elements, besides targeting sequences unique to previously encountered invading DNA, as currently predicted for a CRISPR locus. Thus, the results of this study introduce the possibility that CRISPR loci can provide resistance to a wider range of invading DNA elements than previously appreciated. Whether it is more advantageous for the peculiar neurotoxigenic C. butyricum type E strains to maintain or to lose the CRISPR-cas system remains an open question. PMID:23967192

  17. Densely ionizing radiation affects DNA methylation of selective LINE-1 elements

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Prior, Sara; Miousse, Isabelle R.

    Long Interspersed Nucleotide Element 1 (LINE-1) retrotransposons are heavily methylated and are the most abundant transposable elements in mammalian genomes. Here, we investigated the differential DNA methylation within the LINE-1 under normal conditions and in response to environmentally relevant doses of sparsely and densely ionizing radiation. We demonstrate that DNA methylation of LINE-1 elements in the lungs of C57BL6 mice is dependent on their evolutionary age, where the elder age of the element is associated with the lower extent of DNA methylation. Exposure to 5-aza-2′-deoxycytidine and methionine-deficient diet affected DNA methylation of selective LINE-1 elements in an age- and promotermore » type-dependent manner. Exposure to densely IR, but not sparsely IR, resulted in DNA hypermethylation of older LINE-1 elements, while the DNA methylation of evolutionary younger elements remained mostly unchanged. We also demonstrate that exposure to densely IR increased mRNA and protein levels of LINE-1 via the loss of the histone H3K9 dimethylation and an increase in the H3K4 trimethylation at the LINE-1 5′-untranslated region, independently of DNA methylation. Our findings suggest that DNA methylation is important for regulation of LINE-1 expression under normal conditions, but histone modifications may dictate the transcriptional activity of LINE-1 in response to exposure to densely IR. - Highlights: • DNA methylation of LINE-1 elements is dependent on their evolutionary age. • Densely ionizing radiation affects DNA methylation of selective LINE-1 elements. • Radiation-induced reactivation of LINE-1 is DNA methylation-independent. • Histone modifications dictate the transcriptional activity of LINE-1.« less

  18. Transcription arrest by a G quadruplex forming-trinucleotide repeat sequence from the human c-myb gene.

    PubMed

    Broxson, Christopher; Beckett, Joshua; Tornaletti, Silvia

    2011-05-17

    Non canonical DNA structures correspond to genomic regions particularly susceptible to genetic instability. The transcription process facilitates formation of these structures and plays a major role in generating the instability associated with these genomic sites. However, little is known about how non canonical structures are processed when encountered by an elongating RNA polymerase. Here we have studied the behavior of T7 RNA polymerase (T7RNAP) when encountering a G quadruplex forming-(GGA)(4) repeat located in the human c-myb proto-oncogene. To make direct correlations between formation of the structure and effects on transcription, we have taken advantage of the ability of the T7 polymerase to transcribe single-stranded substrates and of G4 DNA to form in single-stranded G-rich sequences in the presence of potassium ions. Under physiological KCl concentrations, we found that T7 RNAP transcription was arrested at two sites that mapped to the c-myb (GGA)(4) repeat sequence. The extent of arrest did not change with time, indicating that the c-myb repeat represented an absolute block and not a transient pause to T7 RNAP. Consistent with G4 DNA formation, arrest was not observed in the absence of KCl or in the presence of LiCl. Furthermore, mutations in the c-myb (GGA)(4) repeat, expected to prevent transition to G4, also eliminated the transcription block. We show T7 RNAP arrest at the c-myb repeat in double-stranded DNA under conditions mimicking the cellular concentration of biomolecules and potassium ions, suggesting that the G4 structure formed in the c-myb repeat may represent a transcription roadblock in vivo. Our results support a mechanism of transcription-coupled DNA repair initiated by arrest of transcription at G4 structures.

  19. Recognition Imaging of Acetylated Chromatin Using a DNA Aptamer

    PubMed Central

    Lin, Liyun; Fu, Qiang; Williams, Berea A.R.; Azzaz, Abdelhamid M.; Shogren-Knaak, Michael A.; Chaput, John C.; Lindsay, Stuart

    2009-01-01

    Histone acetylation plays an important role in the regulation of gene expression. A DNA aptamer generated by in vitro selection to be highly specific for histone H4 protein acetylated at lysine 16 was used as a recognition element for atomic force microscopy-based recognition imaging of synthetic nucleosomal arrays with precisely controlled acetylation. The aptamer proved to be reasonably specific at recognizing acetylated histones, with recognition efficiencies of 60% on-target and 12% off-target. Though this selectivity is much poorer than the >2000:1 equilibrium specificity of the aptamer, it is a large improvement on the performance of a ChIP-quality antibody, which is not selective at all in this application, and it should permit high-fidelity recognition with repeated imaging. The ability to image the precise location of posttranslational modifications may permit nanometer-scale investigation of their effect on chromatin structure. PMID:19751687

  20. Direct CRISPR spacer acquisition from RNA by a natural reverse-transcriptase-Cas1 fusion protein

    PubMed Central

    Sidote, David J.; Markham, Laura M.; Sanchez-Amat, Antonio; Bhaya, Devaki; Lambowitz, Alan M.; Fire, Andrew Z.

    2016-01-01

    CRISPR (Clustered Regularly Interspaced Short Palindromic Repeat) systems mediate adaptive immunity in diverse prokaryotes. CRISPR-associated Cas1 and Cas2 proteins have been shown to enable adaptation to new threats in Type I and II CRISPR systems by the acquisition of short segments of DNA (“spacers”) from invasive elements. In several Type III CRISPR systems, Cas1 is naturally fused to a reverse transcriptase (RT). In the marine bacterium Marinomonas mediterranea (MMB-1), we show that an RT-Cas1 fusion enables the acquisition of RNA spacers in vivo in an RT-dependent manner. In vitro, the MMB-1 RT-Cas1 and Cas2 proteins catalyze ligation of RNA segments into the CRISPR array, followed by reverse transcription. These observations outline a host-mediated mechanism for reverse information flow from RNA to DNA. PMID:26917774

  1. Inhibition of adenovirus 5 replication in COS-1 cells by antisense RNAs against the viral E1a region.

    PubMed

    Miroshnichenko, O I; Ponomareva, T I; Tikchonenko, T I

    1989-12-07

    To study the effect of antisense E1a RNA (asRNA) on adenovirus development, two types of adenovirus 5 E1a antisense constructs have been engineered. One was complementary to the viral DNA region [nucleotide (nt) positions 500-720] regulated by the metallothionein-I promoter, and the other was complementary to the DNA regions (nt positions 630-1570) under control of the long terminal repeat Moloney mouse leukosis virus promoter. Both asRNA constructs were cloned into a plasmid containing the simian virus 40 origin of replication, the gene controlling geneticin (G418) resistance (G418R), and other regulatory elements. The COS-1 cells, which contained up to 100 copies of the engineered plasmids, synthesized antiviral asRNAs, which provided 71 to over 95% inhibition of adenoviral replication, in comparison to the control cells not synthesizing asRNAs.

  2. Molecular Cytogenetic Analysis of Deschampsia antarctica Desv. (Poaceae), Maritime Antarctic.

    PubMed

    Amosova, Alexandra V; Bolsheva, Nadezhda L; Samatadze, Tatiana E; Twardovska, Maryana O; Zoshchuk, Svyatoslav A; Andreev, Igor O; Badaeva, Ekaterina D; Kunakh, Viktor A; Muravenko, Olga V

    2015-01-01

    Deschampsia antarctica Desv. (Poaceae) (2n = 26) is one of the two vascular plants adapted to the harshest environment of the Antarctic. Although the species is a valuable model for study of environmental stress tolerance in plants, its karyotype is still poorly investigated. We firstly conducted a comprehensive molecular cytogenetic analysis of D. antarctica collected on four islands of the Maritime Antarctic. D. antarctica karyotypes were studied by Giemsa C- and DAPI/C-banding, Ag-NOR staining, multicolour fluorescence in situ hybridization with repeated DNA probes (pTa71, pTa794, telomere repeats, pSc119.2, pAs1) and the GAA simple sequence repeat probe. We also performed sequential rapid in situ hybridization with genomic DNA of D. caespitosa. Two chromosome pairs bearing transcriptionally active 45S rDNA loci and five pairs with 5S rDNA sites were detected. A weak intercalary site of telomere repeats was revealed on the largest chromosome in addition to telomere hybridization signals at terminal positions. This fact confirms indirectly the hypothesis that chromosome fusion might have been the cause of the unusual for cereals chromosome number in this species. Based on patterns of distribution of the examined molecular cytogenetic markers, all chromosomes in karyotypes were identified, and chromosome idiograms of D. antarctica were constructed. B chromosomes were found in most karyotypes of plants from Darboux Island. A mixoploid plant with mainly triploid cells bearing a Robertsonian rearrangement was detected among typical diploid specimens from Great Jalour Island. The karyotype variability found in D. antarctica is probably an expression of genome instability induced by environmental stress factors. The differences in C-banding patterns and in chromosome distribution of rDNA loci as well as homologous highly repeated DNA sequences detected between genomes of D. antarctica and its related species D. caespitosa indicate that genome reorganization involving coding and noncoding repeated DNA sequences had occurred during the divergence of these species.

  3. Molecular Cytogenetic Analysis of Deschampsia antarctica Desv. (Poaceae), Maritime Antarctic

    PubMed Central

    Amosova, Alexandra V.; Bolsheva, Nadezhda L.; Samatadze, Tatiana E.; Twardovska, Maryana O.; Zoshchuk, Svyatoslav A.; Andreev, Igor O.; Badaeva, Ekaterina D.; Kunakh, Viktor A.; Muravenko, Olga V.

    2015-01-01

    Deschampsia antarctica Desv. (Poaceae) (2n = 26) is one of the two vascular plants adapted to the harshest environment of the Antarctic. Although the species is a valuable model for study of environmental stress tolerance in plants, its karyotype is still poorly investigated. We firstly conducted a comprehensive molecular cytogenetic analysis of D. antarctica collected on four islands of the Maritime Antarctic. D. antarctica karyotypes were studied by Giemsa C- and DAPI/C-banding, Ag-NOR staining, multicolour fluorescence in situ hybridization with repeated DNA probes (pTa71, pTa794, telomere repeats, pSc119.2, pAs1) and the GAA simple sequence repeat probe. We also performed sequential rapid in situ hybridization with genomic DNA of D. caespitosa. Two chromosome pairs bearing transcriptionally active 45S rDNA loci and five pairs with 5S rDNA sites were detected. A weak intercalary site of telomere repeats was revealed on the largest chromosome in addition to telomere hybridization signals at terminal positions. This fact confirms indirectly the hypothesis that chromosome fusion might have been the cause of the unusual for cereals chromosome number in this species. Based on patterns of distribution of the examined molecular cytogenetic markers, all chromosomes in karyotypes were identified, and chromosome idiograms of D. antarctica were constructed. B chromosomes were found in most karyotypes of plants from Darboux Island. A mixoploid plant with mainly triploid cells bearing a Robertsonian rearrangement was detected among typical diploid specimens from Great Jalour Island. The karyotype variability found in D. antarctica is probably an expression of genome instability induced by environmental stress factors. The differences in C-banding patterns and in chromosome distribution of rDNA loci as well as homologous highly repeated DNA sequences detected between genomes of D. antarctica and its related species D. caespitosa indicate that genome reorganization involving coding and noncoding repeated DNA sequences had occurred during the divergence of these species. PMID:26394331

  4. Ankyrin Repeat Domain Protein 2 and Inhibitor of DNA Binding 3 Cooperatively Inhibit Myoblast Differentiation by Physical Interaction*

    PubMed Central

    Mohamed, Junaith S.; Lopez, Michael A.; Cox, Gregory A.; Boriek, Aladin M.

    2013-01-01

    Ankyrin repeat domain protein 2 (ANKRD2) translocates from the nucleus to the cytoplasm upon myogenic induction. Overexpression of ANKRD2 inhibits C2C12 myoblast differentiation. However, the mechanism by which ANKRD2 inhibits myoblast differentiation is unknown. We demonstrate that the primary myoblasts of mdm (muscular dystrophy with myositis) mice (pMBmdm) overexpress ANKRD2 and ID3 (inhibitor of DNA binding 3) proteins and are unable to differentiate into myotubes upon myogenic induction. Although suppression of either ANKRD2 or ID3 induces myoblast differentiation in mdm mice, overexpression of ANKRD2 and inhibition of ID3 or vice versa is insufficient to inhibit myoblast differentiation in WT mice. We identified that ANKRD2 and ID3 cooperatively inhibit myoblast differentiation by physical interaction. Interestingly, although MyoD activates the Ankrd2 promoter in the skeletal muscles of wild-type mice, SREBP-1 (sterol regulatory element binding protein-1) activates the same promoter in the skeletal muscles of mdm mice, suggesting the differential regulation of Ankrd2. Overall, we uncovered a novel pathway in which SREBP-1/ANKRD2/ID3 activation inhibits myoblast differentiation, and we propose that this pathway acts as a critical determinant of the skeletal muscle developmental program. PMID:23824195

  5. Two DNA-binding factors recognize specific sequences at silencers, upstream activating sequences, autonomously replicating sequences, and telomeres in Saccharomyces cerevisiae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Buchman, A.R.; Kimmerly, W.J.; Rine, J.

    1988-01-01

    Two DNA-binding factors from Saccharomyces cerevisiae have been characterized, GRFI (general regulatory factor I) and ABFI (ARS-binding factor I), that recognize specific sequences within diverse genetic elements. GRFI bound to sequences at the negative regulatory elements (silencers) of the silent mating type loci HML E and HMR E and to the upstream activating sequence (UAS) required for transcription of the MAT ..cap alpha.. genes. A putative conserved UAS located at genes involved in translation (RPG box) was also recognized by GRFI. In addition, GRFI bound with high affinity to sequences within the (C/sub 1-3/A)-repeat region at yeast telomeres. Binding sitesmore » for GRFI with the highest affinity appeared to be of the form 5'-(A/G)(A/C)ACCCAN NCA(T/C)(T/C)-3', where N is any nucleotide. ABFI-binding sites were located next to autonomously replicating sequences (ARSs) at controlling elements of the silent mating type loci HMR E, HMR I, and HML I and were associated with ARS1, ARS2, and the 2..mu..m plasmid ARS. Two tandem ABFI binding sites were found between the HIS3 and DED1 genes, several kilobase pairs from any ARS, indicating that ABFI-binding sites are not restricted to ARSs. The sequences recognized by AFBI showed partial dyad-symmetry and appeared to be variations of the consensus 5'-TATCATTNNNNACGA-3'. GRFI and ABFI were both abundant DNA-binding factors and did not appear to be encoded by the SIR genes, whose product are required for repression of the silent mating type loci. Together, these results indicate that both GRFI and ABFI play multiple roles within the cell.« less

  6. Comparative molecular cytogenetic analyses of a major tandemly repeated DNA family and retrotransposon sequences in cultivated jute Corchorus species (Malvaceae).

    PubMed

    Begum, Rabeya; Zakrzewski, Falk; Menzel, Gerhard; Weber, Beatrice; Alam, Sheikh Shamimul; Schmidt, Thomas

    2013-07-01

    The cultivated jute species Corchorus olitorius and Corchorus capsularis are important fibre crops. The analysis of repetitive DNA sequences, comprising a major part of plant genomes, has not been carried out in jute but is useful to investigate the long-range organization of chromosomes. The aim of this study was the identification of repetitive DNA sequences to facilitate comparative molecular and cytogenetic studies of two jute cultivars and to develop a fluorescent in situ hybridization (FISH) karyotype for chromosome identification. A plasmid library was generated from C. olitorius and C. capsularis with genomic restriction fragments of 100-500 bp, which was complemented by targeted cloning of satellite DNA by PCR. The diversity of the repetitive DNA families was analysed comparatively. The genomic abundance and chromosomal localization of different repeat classes were investigated by Southern analysis and FISH, respectively. The cytosine methylation of satellite arrays was studied by immunolabelling. Major satellite repeats and retrotransposons have been identified from C. olitorius and C. capsularis. The satellite family CoSat I forms two undermethylated species-specific subfamilies, while the long terminal repeat (LTR) retrotransposons CoRetro I and CoRetro II show similarity to the Metaviridea of plant retroelements. FISH karyotypes were developed by multicolour FISH using these repetitive DNA sequences in combination with 5S and 18S-5·8S-25S rRNA genes which enable the unequivocal chromosome discrimination in both jute species. The analysis of the structure and diversity of the repeated DNA is crucial for genome sequence annotation. The reference karyotypes will be useful for breeding of jute and provide the basis for karyotyping homeologous chromosomes of wild jute species to reveal the genetic and evolutionary relationship between cultivated and wild Corchorus species.

  7. Characterization of species-specific repeated DNA sequences from B. nigra.

    PubMed

    Gupta, V; Lakshmisita, G; Shaila, M S; Jagannathan, V; Lakshmikumaran, M S

    1992-07-01

    The construction and characterization of two genome-specific recombinant DNA clones from B. nigra are described. Southern analysis showed that the two clones belong to a dispersed repeat family. They differ from each other in their length, distribution and sequence, though the average GC content is nearly the same (45%). These B genome-specific repeats have been used to analyse the phylogenetic relationships between cultivated and wild species of the family Brassicaceae.

  8. Repetitive DNA and Plant Domestication: Variation in Copy Number and Proximity to Genes of LTR-Retrotransposons among Wild and Cultivated Sunflower (Helianthus annuus) Genotypes

    PubMed Central

    Mascagni, Flavia; Barghini, Elena; Giordani, Tommaso; Rieseberg, Loren H.; Cavallini, Andrea; Natali, Lucia

    2015-01-01

    The sunflower (Helianthus annuus) genome contains a very large proportion of transposable elements, especially long terminal repeat retrotransposons. However, knowledge on the retrotransposon-related variability within this species is still limited. We used next-generation sequencing (NGS) technologies to perform a quantitative and qualitative survey of intraspecific variation of the retrotransposon fraction of the genome across 15 genotypes—7 wild accessions and 8 cultivars—of H. annuus. By mapping the Illumina reads of the 15 genotypes onto a library of sunflower long terminal repeat retrotransposons, we observed considerable variability in redundancy among genotypes, at both superfamily and family levels. In another analysis, we mapped Illumina paired reads to two sets of sequences, that is, long terminal repeat retrotransposons and protein-encoding sequences, and evaluated the extent of retrotransposon proximity to genes in the sunflower genome by counting the number of paired reads in which one read mapped to a retrotransposon and the other to a gene. Large variability among genotypes was also ascertained for retrotransposon proximity to genes. Both long terminal repeat retrotransposon redundancy and proximity to genes varied among retrotransposon families and also between cultivated and wild genotypes. Such differences are discussed in relation to the possible role of long terminal repeat retrotransposons in the domestication of sunflower. PMID:26608057

  9. Retrotransposon accumulation and satellite amplification mediated by segmental duplication facilitate centromere expansion in rice.

    PubMed

    Ma, Jianxin; Jackson, Scott A

    2006-02-01

    The abundance of repetitive DNA varies greatly across centromeres within an individual or between different organisms. To shed light on the molecular mechanisms of centromere repeat proliferation, we performed structural analysis of LTR-retrotransposons, mostly centromere retrotransposons of rice (CRRs), and phylogenetic analysis of CentO satellite repeats harbored in the core region of the rice chromosome 4 centromere (CEN4). The data obtained demonstrate that the CRRs in the centromeric region we investigated have been enriched more significantly by recent rounds of segmental duplication than by original integration of active elements, suggesting that segmental duplication is an important process for CRR accumulation in the centromeric region. Our results also indicate that segmental duplication of large arrays of satellite repeats is primarily responsible for the amplification of satellite repeats, contributing to rapid reshuffling of CentO satellites. Intercentromere satellite homogenization was revealed by genome-wide comparison of CentO satellite monomers. However, a 10-bp duplication present in nearly half of the CEN4 monomers was found to be completely absent in rice centromere 8 (CEN8), suggesting that CEN4 and CEN8 may represent two different stages in the evolution of rice centromeres. These observations, obtained from the only complex eukaryotic centromeres to have been completely sequenced thus far, depict the evolutionary dynamics of rice centromeres with respect to the nature, timing, and process of centromeric repeat amplification.

  10. Two different size classes of 5S rDNA units coexisting in the same tandem array in the razor clam Ensis macha: is this region suitable for phylogeographic studies?

    PubMed

    Fernández-Tajes, Juan; Méndez, Josefina

    2009-12-01

    For a study of 5S ribosomal genes (rDNA) in the razor clam Ensis macha, the 5S rDNA region was amplified and sequenced. Two variants, so-called type I or short repeat (approximately 430 bp) and type II or long repeat (approximately 735 bp), appeared to be the main components of the 5S rDNA of this species. Their spacers differed markedly, both in length and nucleotide composition. The organization of the two variants was investigated by amplifying the genomic DNA with primers based on the sequence of the type I and type II spacers. PCR amplification products with primers EMLbF and EMSbR showed that the long and short repeats are associated within the same tandem array, suggesting an intermixed arrangement of both spacers. Nevertheless, amplifications carried out with inverse primers EMSinvF/R and EMLinvF/R revealed that some short and long repeats are contiguous in the same tandem array. This is the first report of the coexistence of two variable spacers in the same tandem array in bivalve mollusks.

  11. Characterization of (CA)n microsatellite repeats from large-insert clones.

    PubMed

    Litt, M; Browne, D

    2001-05-01

    The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit determination of sequences flanking the microsatellites. When cosmids or large-insert phage clones are used as primary sources of (CA)n repeat markers, they have traditionally been subcloned into plasmid vectors such as pUC18 or M13 mp 18/19 cloning vectors to obtain fragments of suitable size for DNA sequencing. This unit presents an alternative approach whereby a set of degenerate sequencing primers that anneal directly to (CA)n microsatellites can be used to determine sequences that are inaccessible with vector-derived primers. Because the primers anneal to the repeat and not to the vector, they can be used with subclones containing inserts of several kilobases and should, in theory, always give sequence in the regions directly flanking the repeat. Degeneracy at the 3 end of each of these primers prevents elongation of primers that have annealed out-of-register. The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit.

  12. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution.

    PubMed

    Melters, Daniël P; Bradnam, Keith R; Young, Hugh A; Telis, Natalie; May, Michael R; Ruby, J Graham; Sebra, Robert; Peluso, Paul; Eid, John; Rank, David; Garcia, José Fernando; DeRisi, Joseph L; Smith, Timothy; Tobias, Christian; Ross-Ibarra, Jeffrey; Korf, Ian; Chan, Simon W L

    2013-01-30

    Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.

  13. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution

    PubMed Central

    2013-01-01

    Background Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes. PMID:23363705

  14. Glider and Vision: two new families of miniature inverted-repeat transposable elements in Xenopus laevis genome.

    PubMed

    Lepetit, D; Pasquet, S; Olive, M; Thézé, N; Thiébaud, P

    2000-01-01

    We have characterised from Xenopus laevis two new short interspersed repetitive elements, we have named Glider and Vision, that belong to the family of miniature inverted-repeat transposable elements (MITEs). Glider was first characterised in an intronic region of the alpha-tropomyosin (alpha-TM) gene and database search has revealed the presence of this element in 10 other Xenopus laevis genes. Glider elements are about 150 bp long and for some of them, their terminal inverted repeats are flanked by potential target-site duplications. Evidence for the mobility of Glider element has been provided by the presence/absence of one element at corresponding location in duplicated alpha-TM genes. Vision element has been identified in the promoter region of the cyclin dependant kinase 2 gene (cdk2) where it is boxed in a Glider element. Vision is 284bp long and is framed by 14-bp terminal inverted repeats that are flanked by 7-bp direct repeats. We have estimated that there are about 20,000 and 300 copies of Glider and Vision respectively scattered throughout the Xenopus laevis genome. Every MITEs elements but two described in our study are found either in 5' or in 3' regulatory regions of genes suggesting a potential role in gene regulation.

  15. GENETIC DIVERSITY OF TYPHA LATIFOLIA (TYPHACEAE) AND THE IMPACT OF POLLUTANTS EXAMINED WITH TANDEM-REPETITIVE DNA PROBES

    EPA Science Inventory

    Genetic diversity at variable-number-tandem-repeat (VNTR) loci was examined in the common cattail, Typha latifolia (Typhaceae), using three synthetic DNA probes composed of tandemly repeated "core" sequences (GACA, GATA, and GCAC). The principal objectives of this investigation w...

  16. The left end of rat L1 (L1Rn, long interspersed repeated) DNA which is a CpG island can function as a promoter.

    PubMed Central

    Nur, I; Pascale, E; Furano, A V

    1988-01-01

    Here we report that the 600 bp promoter-like region at the left end of a newly isolated and characterized rat L1 DNA element can activate the prokaryotic chloramphenicol acyltransferase gene in a rat cell line. Activation only occurs when the promoter region is oriented to the transferase gene as it is to the L1 protein encoding sequences and is 75% inhibited by methylation of just 5 of the 22 CpGs present in the promoter. The G + C rich promoter contains enough CpGs to qualify it as a CpG island, but in contrast to other CpG islands, genomic L1 promoters are fully methylated in both somatic cell and sperm DNA as judged by restriction enzyme analysis. Partial demethylation of the genomic promoters by treatment with 5-azacytidine failed to produce discrete L1 transcripts. The relationship of methylation to the evolutionary history and fate of the rat L1 promoter is discussed. Images PMID:2459662

  17. Cas4 Facilitates PAM-Compatible Spacer Selection during CRISPR Adaptation.

    PubMed

    Kieper, Sebastian N; Almendros, Cristóbal; Behler, Juliane; McKenzie, Rebecca E; Nobrega, Franklin L; Haagsma, Anna C; Vink, Jochem N A; Hess, Wolfgang R; Brouns, Stan J J

    2018-03-27

    CRISPR-Cas systems adapt their immunological memory against their invaders by integrating short DNA fragments into clustered regularly interspaced short palindromic repeat (CRISPR) loci. While Cas1 and Cas2 make up the core machinery of the CRISPR integration process, various class I and II CRISPR-Cas systems encode Cas4 proteins for which the role is unknown. Here, we introduced the CRISPR adaptation genes cas1, cas2, and cas4 from the type I-D CRISPR-Cas system of Synechocystis sp. 6803 into Escherichia coli and observed that cas4 is strictly required for the selection of targets with protospacer adjacent motifs (PAMs) conferring I-D CRISPR interference in the native host Synechocystis. We propose a model in which Cas4 assists the CRISPR adaptation complex Cas1-2 by providing DNA substrates tailored for the correct PAM. Introducing functional spacers that target DNA sequences with the correct PAM is key to successful CRISPR interference, providing a better chance of surviving infection by mobile genetic elements. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  18. Activation of both acfA and acfD transcription by Vibrio cholerae ToxT requires binding to two centrally located DNA sites in an inverted repeat conformation.

    PubMed

    Withey, Jeffrey H; DiRita, Victor J

    2005-05-01

    The Gram-negative bacterium Vibrio cholerae is the infectious agent responsible for the disease Asiatic cholera. The genes required for V. cholerae virulence, such as those encoding the cholera toxin (CT) and toxin-coregulated pilus (TCP), are controlled by a cascade of transcriptional activators. Ultimately, the direct transcriptional activator of the majority of V. cholerae virulence genes is the AraC/XylS family member ToxT protein, the expression of which is activated by the ToxR and TcpP proteins. Previous studies have identified the DNA sites to which ToxT binds upstream of the ctx operon, encoding CT, and the tcpA operon, encoding, among other products, the major subunit of the TCP. These known ToxT binding sites are seemingly dissimilar in sequence other than being A/T rich. Further results suggested that ctx and tcpA each has a pair of ToxT binding sites arranged in a direct repeat orientation upstream of the core promoter elements. In this work, using both transcriptional lacZ fusions and in vitro copper-phenanthroline footprinting experiments, we have identified the ToxT binding sites between the divergently transcribed acfA and acfD genes, which encode components of the accessory colonization factor required for efficient intestinal colonization by V. cholerae. Our results indicate that ToxT binds to a pair of DNA sites between acfA and acfD in an inverted repeat orientation. Moreover, a mutational analysis of the ToxT binding sites indicates that both binding sites are required by ToxT for transcriptional activation of both acfA and acfD. Using copper-phenanthroline footprinting to assess the occupancy of ToxT on DNA having mutations in one of these binding sites, we found that protection by ToxT of the unaltered binding site was not affected, whereas protection by ToxT of the mutant binding site was significantly reduced in the region of the mutations. The results of further footprinting experiments using DNA templates having +5 bp and +10 bp insertions between the two ToxT binding sites indicate that both binding sites are occupied by ToxT regardless of their positions relative to each other. Based on these results, we propose that ToxT binds independently to two DNA sites between acfA and acfD to activate transcription of both genes.

  19. A New Protein Architecture for Processing Alkylation Damaged DNA: The Crystal Structure of DNA Glycosylase AlkD

    PubMed Central

    Rubinson, Emily H.; Metz, Audrey H.; O'Quin, Jami; Eichman, Brandt F.

    2013-01-01

    Summary DNA glycosylases safeguard the genome by locating and excising chemically modified bases from DNA. AlkD is a recently discovered bacterial DNA glycosylase that removes positively charged methylpurines from DNA, and was predicted to adopt a protein fold distinct from other DNA repair proteins. The crystal structure of Bacillus cereus AlkD presented here shows that the protein is composed exclusively of helical HEAT-like repeats, which form a solenoid perfectly shaped to accommodate a DNA duplex on the concave surface. Structural analysis of the variant HEAT repeats in AlkD provides a rationale for how this protein scaffolding motif has been modified to bind DNA. We report 7mG excision and DNA binding activities of AlkD mutants, along with a comparison of alkylpurine DNA glycosylase structures. Together, these data provide important insight into the requirements for alkylation repair within DNA and suggest that AlkD utilizes a novel strategy to manipulate DNA in its search for alkylpurine bases. PMID:18585735

  20. Rapid detection of Wuchereria bancrofti and Brugia malayi in mosquito vectors (Diptera: Culicidae) using a real-time fluorescence resonance energy transfer multiplex PCR and melting curve analysis.

    PubMed

    Intapan, Pewpan M; Thanchomnang, Tongjit; Lulitanond, Viraphong; Maleewong, Wanchai

    2009-01-01

    We developed a single-step real-time fluorescence resonance energy transfer (FRET) multiplex polymerase chain reaction (PCR) merged with melting curve analysis for the detection of Wuchereria bancrofti and Brugia malayi DNA in blood-fed mosquitoes. Real-time FRET multiplex PCR is based on fluorescence melting curve analysis of a hybrid of amplicons generated from two families of repeated DNA elements: the 188 bp SspI repeated sequence, specific to W. bancrofti, and the 153-bp HhaI repeated sequence, specific to the genus Brugia and two pairs of specific fluorophore-labeled probes. Both W. bancrofti and B. malayi can be differentially detected in infected vectors by this process through their different fluorescence channel and melting temperatures. The assay could distinguish both human filarial DNAs in infected vectors from the DNAs of Dirofilaria immitis- and Plasmodium falciparum-infected human red blood cells and noninfected mosquitoes and human leukocytes. The technique showed 100% sensitivity and specificity and offers a rapid and reliable procedure for differentially identifying lymphatic filariasis. The introduced real-time FRET multiplex PCR can reduce labor time and reagent costs and is not prone to carry over contamination. The test can be used to screen mosquito vectors in endemic areas and therefore should be a useful diagnostic tool for the evaluation of infection rate of the mosquito populations and for xenomonitoring in the community after eradication programs such as the Global Program to Eliminate Lymphatic Filariasis.

  1. DNA preservation in skeletal elements from the World Trade Center disaster: recommendations for mass fatality management.

    PubMed

    Mundorff, Amy Z; Bartelink, Eric J; Mar-Cash, Elaine

    2009-07-01

    The World Trade Center (WTC) victim identification effort highlights taphonomic influences on the degradation of DNA from victims of mass fatality incidents. This study uses a subset of the WTC-Human Remains Database to evaluate differential preservation of DNA by skeletal element. Recovery location, sex, and victim type (civilian, firefighter, or plane passenger) do not appear to influence DNA preservation. Results indicate that more intact elements, as well as elements encased in soft tissue, produced slightly higher identification rates than more fragmented remains. DNA identification rates by element type conform to previous findings, with higher rates generally found in denser, weight-bearing bones. However, smaller bones including patellae, metatarsals, and foot phalanges yielded rates comparable to both femora and tibiae. These elements can be easily sampled with a disposable scalpel, and thus reduce potential DNA contamination. These findings have implications for DNA sampling guidelines in future mass fatality incidents.

  2. The armadillo repeat region targets ARVCF to cadherin-based cellular junctions.

    PubMed

    Kaufmann, U; Zuppinger, C; Waibler, Z; Rudiger, M; Urbich, C; Martin, B; Jockusch, B M; Eppenberger, H; Starzinski-Powitz, A

    2000-11-01

    The cytoplasmic domain of the transmembrane protein M-cadherin is involved in anchoring cytoskeletal elements to the plasma membrane at cell-cell contact sites. Several members of the armadillo repeat protein family mediate this linkage. We show here that ARVCF, a member of the p120 (ctn) subfamily, is a ligand for the cytoplasmic domain of M-cadherin, and characterize the regions involved in this interaction in detail. Complex formation in an in vivo environment was demonstrated in (1) yeast two-hybrid screens, using a cDNA library from differentiating skeletal muscle and part of the cytoplasmic M-cadherin tail as a bait, and (2) mammalian cells, using a novel experimental system, the MOM recruitment assay. Immunoprecipitation and in vitro binding assays confirmed this interaction. Ectopically expressed EGFP-ARVCF-C11, an N-terminal truncated fragment, targets to junctional structures in epithelial MCF7 cells and cardiomyocytes, where it colocalizes with the respective cadherins, beta-catenin and p120 (ctn). Hence, the N terminus of ARVCF is not required for junctional localization. In contrast, deletion of the four N-terminal armadillo repeats abolishes this ability in cardiomyocytes. Detailed mutational analysis revealed the armadillo repeat region of ARVCF as sufficient and necessary for interaction with the 55 membrane-proximal amino acids of the M-cadherin tail.

  3. Repeated pulses of serotonin required for long-term facilitation activate mitogen-activated protein kinase in sensory neurons of Aplysia

    PubMed Central

    Michael, Dan; Martin, Kelsey C.; Seger, Rony; Ning, Ming-Ming; Baston, Rene; Kandel, Eric R.

    1998-01-01

    Long-term facilitation of the connections between the sensory and motor neurons of the gill-withdrawal reflex in Aplysia requires five repeated pulses of serotonin (5-HT). The repeated pulses of 5-HT initiate a cascade of gene activation that leads ultimately to the growth of new synaptic connections. Several genes in this process have been identified, including the transcriptional regulators apCREB-1, apCREB-2, apC/EBP, and the cell adhesion molecule apCAM, which is thought to be involved in the formation of new synaptic connections. Here we report that the transcriptional regulators apCREB-2 and apC/EBP, as well as a peptide derived from the cytoplasmic domain of apCAM, are phosphorylated in vitro by Aplysia mitogen-activated protein kinase (apMAPK). We have cloned the cDNA encoding apMAPK and show that apMAPK activity is increased in sensory neurons treated with repeated pulses of 5-HT and by the cAMP pathway. These results suggest that apMAPK may participate with cAMP-dependent protein kinase during long-term facilitation in sensory cells by modifying some of the key elements involved in the consolidation of short- to long-lasting changes in synaptic strength. PMID:9465108

  4. Slipped-strand mispairing at noncontiguous repeats in Poecilia reticulata: a model for minisatellite birth.

    PubMed Central

    Taylor, J S; Breden, F

    2000-01-01

    The standard slipped-strand mispairing (SSM) model for the formation of variable number tandem repeats (VNTRs) proposes that a few tandem repeats, produced by chance mutations, provide the "raw material" for VNTR expansion. However, this model is unlikely to explain the formation of VNTRs with long motifs (e.g., minisatellites), because the likelihood of a tandem repeat forming by chance decreases rapidly as the length of the repeat motif increases. Phylogenetic reconstruction of the birth of a mitochondrial (mt) DNA minisatellite in guppies suggests that VNTRs with long motifs can form as a consequence of SSM at noncontiguous repeats. VNTRs formed in this manner have motifs longer than the noncontiguous repeat originally formed by chance and are flanked by one unit of the original, noncontiguous repeat. SSM at noncontiguous repeats can therefore explain the birth of VNTRs with long motifs and the "imperfect" or "short direct" repeats frequently observed adjacent to both mtDNA and nuclear VNTRs. PMID:10880490

  5. C9orf72 Nucleotide Repeat Structures Initiate Molecular Cascades of Disease

    PubMed Central

    Haeusler, Aaron R.; Donnelly, Christopher J.; Periz, Goran; Simko, Eric A.J.; Shaw, Patrick G.; Kim, Min-Sik; Maragakis, Nicholas J.; Troncoso, Juan C.; Pandey, Akhilesh; Sattler, Rita; Rothstein, Jeffrey D.; Wang, Jiou

    2014-01-01

    Summary A hexanucleotide repeat expansion (HRE), (GGGGCC)n, in C9orf72 is the most common genetic cause of the neurodegenerative diseases amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). Here we identify a molecular mechanism by which structural polymorphism of the HRE leads to ALS/FTD pathology and defects. The HRE forms DNA and RNA G-quadruplexes with distinct structures and promotes RNA•DNA hybrids (R-loops). The structural polymorphism causes a repeat length-dependent accumulation of transcripts aborted in the HRE region. These transcribed repeats bind to ribonucleoproteins in a conformationdependent manner. Specifically, nucleolin (NCL), an essential nucleolar protein, preferentially binds the HRE G-quadruplex, and patient cells show evidence of nucleolar stress. Our results demonstrate that distinct C9orf72 HRE structural polymorphism at both DNA and RNA levels initiates molecular cascades leading to ALS/FTD pathologies, and provide the basis for a mechanistic model for repeat-associated neurodegenerative diseases. PMID:24598541

  6. Non-radioactive detection of trinucleotide repeat size variability.

    PubMed

    Tomé, Stéphanie; Nicole, Annie; Gomes-Pereira, Mario; Gourdon, Genevieve

    2014-03-06

    Many human diseases are associated with the abnormal expansion of unstable trinucleotide repeat sequences. The mechanisms of trinucleotide repeat size mutation have not been fully dissected, and their understanding must be grounded on the detailed analysis of repeat size distributions in human tissues and animal models. Small-pool PCR (SP-PCR) is a robust, highly sensitive and efficient PCR-based approach to assess the levels of repeat size variation, providing both quantitative and qualitative data. The method relies on the amplification of a very low number of DNA molecules, through sucessive dilution of a stock genomic DNA solution. Radioactive Southern blot hybridization is sensitive enough to detect SP-PCR products derived from single template molecules, separated by agarose gel electrophoresis and transferred onto DNA membranes. We describe a variation of the detection method that uses digoxigenin-labelled locked nucleic acid probes. This protocol keeps the sensitivity of the original method, while eliminating the health risks associated with the manipulation of radiolabelled probes, and the burden associated with their regulation, manipulation and waste disposal.

  7. iPBS: a universal method for DNA fingerprinting and retrotransposon isolation.

    PubMed

    Kalendar, Ruslan; Antonius, Kristiina; Smýkal, Petr; Schulman, Alan H

    2010-11-01

    Molecular markers are essential in plant and animal breeding and biodiversity applications, in human forensics, and for map-based cloning of genes. The long terminal repeat (LTR) retrotransposons are well suited as molecular markers. As dispersed and ubiquitous transposable elements, their "copy and paste" life cycle of replicative transposition leads to new genome insertions without excision of the original element. Both the overall structure of retrotransposons and the domains responsible for the various phases of their replication are highly conserved in all eukaryotes. Nevertheless, up to a year has been required to develop a retrotransposon marker system in a new species, involving cloning and sequencing steps as well as the development of custom primers. Here, we describe a novel PCR-based method useful both as a marker system in its own right and for the rapid isolation of retrotransposon termini and full-length elements, making it ideal for "orphan crops" and other species with underdeveloped marker systems. The method, iPBS amplification, is based on the virtually universal presence of a tRNA complement as a reverse transcriptase primer binding site (PBS) in LTR retrotransposons. The method differs from earlier retrotransposon isolation methods because it is applicable not only to endogenous retroviruses and retroviruses, but also to both Gypsy and Copia LTR retrotransposons, as well as to non-autonomous LARD and TRIM elements, throughout the plant kingdom and to animals. Furthermore, the inter-PBS amplification technique as such has proved to be a powerful DNA fingerprinting technology without the need for prior sequence knowledge.

  8. The mitochondrial genome of the pathogenic yeast Candida subhashii: GC-rich linear DNA with a protein covalently attached to the 5′ termini

    PubMed Central

    Fricova, Dominika; Valach, Matus; Farkas, Zoltan; Pfeiffer, Ilona; Kucsera, Judit; Tomaska, Lubomir; Nosek, Jozef

    2010-01-01

    As a part of our initiative aimed at a large-scale comparative analysis of fungal mitochondrial genomes, we determined the complete DNA sequence of the mitochondrial genome of the yeast Candida subhashii and found that it exhibits a number of peculiar features. First, the mitochondrial genome is represented by linear dsDNA molecules of uniform length (29 795 bp), with an unusually high content of guanine and cytosine residues (52.7 %). Second, the coding sequences lack introns; thus, the genome has a relatively compact organization. Third, the termini of the linear molecules consist of long inverted repeats and seem to contain a protein covalently bound to terminal nucleotides at the 5′ ends. This architecture resembles the telomeres in a number of linear viral and plasmid DNA genomes classified as invertrons, in which the terminal proteins serve as specific primers for the initiation of DNA synthesis. Finally, although the mitochondrial genome of C. subhashii contains essentially the same set of genes as other closely related pathogenic Candida species, we identified additional ORFs encoding two homologues of the family B protein-priming DNA polymerases and an unknown protein. The terminal structures and the genes for DNA polymerases are reminiscent of linear mitochondrial plasmids, indicating that this genome architecture might have emerged from fortuitous recombination between an ancestral, presumably circular, mitochondrial genome and an invertron-like element. PMID:20395267

  9. Determining the Specificity of Cascade Binding, Interference, and Primed Adaptation In Vivo in the Escherichia coli Type I-E CRISPR-Cas System

    PubMed Central

    Cooper, Lauren A.; Stringer, Anne M.

    2018-01-01

    ABSTRACT In clustered regularly interspaced short palindromic repeat (CRISPR)-Cas (CRISPR-associated) immunity systems, short CRISPR RNAs (crRNAs) are bound by Cas proteins, and these complexes target invading nucleic acid molecules for degradation in a process known as interference. In type I CRISPR-Cas systems, the Cas protein complex that binds DNA is known as Cascade. Association of Cascade with target DNA can also lead to acquisition of new immunity elements in a process known as primed adaptation. Here, we assess the specificity determinants for Cascade-DNA interaction, interference, and primed adaptation in vivo, for the type I-E system of Escherichia coli. Remarkably, as few as 5 bp of crRNA-DNA are sufficient for association of Cascade with a DNA target. Consequently, a single crRNA promotes Cascade association with numerous off-target sites, and the endogenous E. coli crRNAs direct Cascade binding to >100 chromosomal sites. In contrast to the low specificity of Cascade-DNA interactions, >18 bp are required for both interference and primed adaptation. Hence, Cascade binding to suboptimal, off-target sites is inert. Our data support a model in which the initial Cascade association with DNA targets requires only limited sequence complementarity at the crRNA 5′ end whereas recruitment and/or activation of the Cas3 nuclease, a prerequisite for interference and primed adaptation, requires extensive base pairing. PMID:29666291

  10. Distribution and sequence homogeneity of an abundant satellite DNA in the beetle, Tenebrio molitor.

    PubMed Central

    Davis, C A; Wyatt, G R

    1989-01-01

    The mealworm beetle, Tenebrio molitor, contains an unusually abundant and homogeneous satellite DNA which constitutes up to 60% of its genome. The satellite DNA is shown to be present in all of the chromosomes by in situ hybridization. 18 dimers of the repeat unit were cloned and sequenced. The consensus sequence is 142 nt long and lacks any internal repeat structure. Monomers of the sequence are very similar, showing on average a 2% divergence from the calculated consensus. Variant nucleotides are scattered randomly throughout the sequence although some variants are more common than others. Neighboring repeat units are no more alike than randomly chosen ones. The results suggest that some mechanism, perhaps gene conversion, is acting to maintain the homogeneity of the satellite DNA despite its abundance and distribution on all of the chromosomes. Images PMID:2762148

  11. The structures of non-CG-repeat Z-DNAs co-crystallized with the Z-DNA-binding domain, hZ alpha(ADAR1).

    PubMed

    Ha, Sung Chul; Choi, Jongkeun; Hwang, Hye-Yeon; Rich, Alexander; Kim, Yang-Gyun; Kim, Kyeong Kyu

    2009-02-01

    The Z-DNA conformation preferentially occurs at alternating purine-pyrimidine repeats, and is specifically recognized by Z alpha domains identified in several Z-DNA-binding proteins. The binding of Z alpha to foreign or chromosomal DNA in various sequence contexts is known to influence various biological functions, including the DNA-mediated innate immune response and transcriptional modulation of gene expression. For these reasons, understanding its binding mode and the conformational diversity of Z alpha bound Z-DNAs is of considerable importance. However, structural studies of Z alpha bound Z-DNA have been mostly limited to standard CG-repeat DNAs. Here, we have solved the crystal structures of three representative non-CG repeat DNAs, d(CACGTG)(2), d(CGTACG)(2) and d(CGGCCG)(2) complexed to hZ alpha(ADAR1) and compared those structures with that of hZ alpha(ADAR1)/d(CGCGCG)(2) and the Z alpha-free Z-DNAs. hZ alpha(ADAR1) bound to each of the three Z-DNAs showed a well conserved binding mode with very limited structural deviation irrespective of the DNA sequence, although varying numbers of residues were in contact with Z-DNA. Z-DNAs display less structural alterations in the Z alpha-bound state than in their free form, thereby suggesting that conformational diversities of Z-DNAs are restrained by the binding pocket of Z alpha. These data suggest that Z-DNAs are recognized by Z alpha through common conformational features regardless of the sequence and structural alterations.

  12. Control of gene expression by CRISPR-Cas systems

    PubMed Central

    2013-01-01

    Clustered regularly interspaced short palindromic repeats (CRISPR) loci and their associated cas (CRISPR-associated) genes provide adaptive immunity against viruses (phages) and other mobile genetic elements in bacteria and archaea. While most of the early work has largely been dominated by examples of CRISPR-Cas systems directing the cleavage of phage or plasmid DNA, recent studies have revealed a more complex landscape where CRISPR-Cas loci might be involved in gene regulation. In this review, we summarize the role of these loci in the regulation of gene expression as well as the recent development of synthetic gene regulation using engineered CRISPR-Cas systems. PMID:24273648

  13. Coordinated DNA dynamics during the human telomerase catalytic cycle

    NASA Astrophysics Data System (ADS)

    Parks, Joseph W.; Stone, Michael D.

    2014-06-01

    The human telomerase reverse transcriptase (hTERT) utilizes a template within the integral RNA subunit (hTR) to direct extension of telomeres. Telomerase exhibits repeat addition processivity (RAP) and must therefore translocate the nascent DNA product into a new RNA:DNA hybrid register to prime each round of telomere repeat synthesis. Here, we use single-molecule FRET and nuclease protection assays to monitor telomere DNA structure and dynamics during the telomerase catalytic cycle. DNA translocation during RAP proceeds through a previously uncharacterized kinetic substep during which the 3‧-end of the DNA substrate base pairs downstream within the hTR template. The rate constant for DNA primer realignment reveals this step is not rate limiting for RAP, suggesting a second slow conformational change repositions the RNA:DNA hybrid into the telomerase active site and drives the extrusion of the 5‧-end of the DNA primer out of the enzyme complex.

  14. Utility of next-generation RNA-sequencing in identifying chimeric transcription involving human endogenous retroviruses.

    PubMed

    Sokol, Martin; Jessen, Karen Margrethe; Pedersen, Finn Skou

    2016-01-01

    Several studies have shown that human endogenous retroviruses and endogenous retrovirus-like repeats (here collectively HERVs) impose direct regulation on human genes through enhancer and promoter motifs present in their long terminal repeats (LTRs). Although chimeric transcription in which novel gene isoforms containing retroviral and human sequence are transcribed from viral promoters are commonly associated with disease, regulation by HERVs is beneficial in other settings; for example, in human testis chimeric isoforms of TP63 induced by an ERV9 LTR protect the male germ line upon DNA damage by inducing apoptosis, whereas in the human globin locus the γ- and β-globin switch during normal hematopoiesis is mediated by complex interactions of an ERV9 LTR and surrounding human sequence. The advent of deep sequencing or next-generation sequencing (NGS) has revolutionized the way researchers solve important scientific questions and develop novel hypotheses in relation to human genome regulation. We recently applied next-generation paired-end RNA-sequencing (RNA-seq) together with chromatin immunoprecipitation with sequencing (ChIP-seq) to examine ERV9 chimeric transcription in human reference cell lines from Encyclopedia of DNA Elements (ENCODE). This led to the discovery of advanced regulation mechanisms by ERV9s and other HERVs across numerous human loci including transcription of large gene-unannotated genomic regions, as well as cooperative regulation by multiple HERVs and non-LTR repeats such as Alu elements. In this article, well-established examples of human gene regulation by HERVs are reviewed followed by a description of paired-end RNA-seq, and its application in identifying chimeric transcription genome-widely. Based on integrative analyses of RNA-seq and ChIP-seq, data we then present novel examples of regulation by ERV9s of tumor suppressor genes CADM2 and SEMA3A, as well as transcription of an unannotated region. Taken together, this article highlights the high suitability of contemporary sequencing methods in future analyses of human biology in relation to evolutionary acquired retroviruses in the human genome. © 2016 APMIS. Published by John Wiley & Sons Ltd.

  15. Sequences characterization of microsatellite DNA sequences in Pacific abalone ( Haliotis discus hannai)

    NASA Astrophysics Data System (ADS)

    Li, Qi; Akihiro, Kijima

    2007-01-01

    The microsatellite-enriched library was constructed using magnetic bead hybridization selection method, and the microsatellite DNA sequences were analyzed in Pacific abalone Haliotis discus hannai. Three hundred and fifty white colonies were screened using PCR-based technique, and 84 clones were identified to potentially contain microsatellite repeat motif. The 84 clones were sequenced, and 42 microsatellites and 4 minisatellites with a minimum of five repeats were found (13.1% of white colonies screened). Besides the motif of CA contained in the oligoprobe, we also found other 16 types of microsatellite repeats including a dinucleotide repeat, two tetranucleotide repeats, twelve pentanucleotide repeats and a hexanucleotide repeat. According to Weber (1990), the microsatellite sequences obtained could be categorized structurally into perfect repeats (73.3%), imperfect repeats (13.3%), and compound repeats (13.4%). Among the microsatellite repeats, relatively short arrays (<20 repeats) were most abundant, accounting for 75.0%. The largest length of microsatellites was 48 repeats, and the average number of repeats was 13.4. The data on the composition and length distribution of microsatellites obtained in the present study can be useful for choosing the repeat motifs for microsatellite isolation in other abalone species.

  16. APE1 incision activity at abasic sites in tandem repeat sequences.

    PubMed

    Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M

    2014-05-29

    Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.

  17. Gene conversion as a secondary mechanism of short interspersed element (SINE) evolution

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kass, D.H.; Batzer, M.A.; Deininger, P.L.

    The Alu repetitive family of short interspersed elements (SINEs) in primates can be subdivided into distinct subfamilies by specific diagnostic nucleotide changes. The older subfamilies are generally very abundant, while the younger subfamilies have fewer copies. Some of the youngest Alu elements are absent in the orthologous loci of nonhuman primates, indicative of recent retroposition events, the primary mode of SINE evolutions. PCR analysis of one young Alu subfamily (Sb2) member found in the low-density lipoprotein receptor gene apparently revealed the presence of this element in the green monkey, orangutan, gorilla, and chimpanzee genomes, as well as the human genome.more » However, sequence analysis of these genomes revealed a highly mutated, older, primate-specific Alu element was present at this position in the nonhuman primates. Comparison of the flanking DNA sequences upstream of this Alu insertion corresponded to evolution expected for standard primate phylogeny, but comparison of the Alu repeat sequences revealed that the human element departed from this phylogeny. The change in the human sequence apparently occurred by a gene conversion event only within the Alu element itself, converting it from one of the oldest to one of the youngest Alu subfamilies. Although gene conversions of Alu elements are clearly very rare, this finding shows that such events can occur and contribute to specific cases of SINE subfamily evolution.« less

  18. Recombinant SINEs are formed at high frequency during induced retrotransposition in vivo.

    PubMed

    Yadav, Vijay Pal; Mandal, Prabhat Kumar; Bhattacharya, Alok; Bhattacharya, Sudha

    2012-05-22

    Non-long terminal repeat Retrotransposons are referred to as long interspersed nuclear elements (LINEs) and their non-autonomous partners are short interspersed nuclear elements (SINEs). It is believed that an active SINE copy, upon retrotransposition, generates near identical copies of itself, which subsequently accumulate mutations resulting in sequence polymorphism. Here we show that when a retrotransposition-competent cell line of the parasitic protist Entamoeba histolytica, transfected with a marked SINE copy, is induced to retrotranspose, >20% of the newly retrotransposed copies are neither identical to the marked SINE nor to the mobilized resident SINEs. Rather they are recombinants of resident SINEs and the marked SINE. They are a consequence of retrotransposition and not DNA recombination, as they are absent in cells not expressing the retrotransposition functions. This high-frequency recombination provides a new explanation for the existence of mosaic SINEs, which may impact on genetic analysis of SINE lineages, and measurement of phylogenetic distances.

  19. Dynamic Alu Methylation during Normal Development, Aging, and Tumorigenesis

    PubMed Central

    Lu, Xuemei

    2014-01-01

    DNA methylation primarily occurs on CpG dinucleotides and plays an important role in transcriptional regulations during tissue development and cell differentiation. Over 25% of CpG dinucleotides in the human genome reside within Alu elements, the most abundant human repeats. The methylation of Alu elements is an important mechanism to suppress Alu transcription and subsequent retrotransposition. Decades of studies revealed that Alu methylation is highly dynamic during early development and aging. Recently, many environmental factors were shown to have a great impact on Alu methylation. In addition, aberrant Alu methylation has been documented to be an early event in many tumors and Alu methylation levels have been associated with tumor aggressiveness. The assessment of the Alu methylation has become an important approach for early diagnosis and/or prognosis of cancer. This review focuses on the dynamic Alu methylation during development, aging, and tumor genesis. The cause and consequence of Alu methylation changes will be discussed. PMID:25243180

  20. GENETIC VARIATION IN RED RASPBERRIES (RUBUS IDAEUS L.; ROSACEAE) FROM SITES DIFFERING IN ORGANIC POLLUTANTS COMPARED WITH SYNTHETIC TANDEM REPEAT DNA PROBES

    EPA Science Inventory

    Two synthetic tandem repetitive DNA probes were used to compare genetic variation at variable-number-tandem-repeat (VNTR) loci among Rubus idaeus L. var. strigosus (Michx.) Maxim. (Rosaceae) individuals sampled at eight sites contaminated by pollutants (N = 39) and eight adjacent...

  1. Densely ionizing radiation affects DNA methylation of selective LINE-1 elements1

    PubMed Central

    Prior, Sara; Miousse, Isabelle R.; Nzabarushimana, Etienne; Pathak, Rupak; Skinner, Charles; Kutanzi, Kristy R.; Allen, Antiño R.; Raber, Jacob; Tackett, Alan J.; Hauer-Jensen, Martin; Nelson, Gregory A.; Koturbash, Igor

    2016-01-01

    Long Interspersed Nucleotide Element 1 (LINE-1) retrotransposons are heavily methylated and are the most abundant transposable elements in mammalian genomes. Here, we investigated the differential DNA methylation within the LINE-1 under normal conditions and in response to environmentally relevant doses of sparsely and densely ionizing radiation. We demonstrate that DNA methylation of LINE-1 elements in the lungs of C57BL6 mice is dependent on their evolutionary age, where the elder age of the element is associated with the lower extent of DNA methylation. Exposure to 5-aza-2′-deoxycytidine and methionine-deficient diet affected DNA methylation of selective LINE-1 elements in an age- and promoter type-dependent manner. Exposure to densely IR, but not sparsely IR, resulted in DNA hypermethylation of older LINE-1 elements, while the DNA methylation of evolutionary younger elements remained mostly unchanged. We also demonstrate that exposure to densely IR increased mRNA and protein levels of LINE-1 via the loss of the histone H3K9 dimethylation and an increase in the H3K4 trimethylation at the LINE-1 5′-untranslated region, independently of DNA methylation. Our findings suggest that DNA methylation is important for regulation of LINE-1 expression under normal conditions, but histone modifications may dictate the transcriptional activity of LINE-1 in response to exposure to densely IR. PMID:27419368

  2. The abundant extrachromosomal DNA content of the Spiroplasma citri GII3-3X genome

    PubMed Central

    Saillard, Colette; Carle, Patricia; Duret-Nurbel, Sybille; Henri, Raphaël; Killiny, Nabil; Carrère, Sébastien; Gouzy, Jérome; Bové, Joseph-Marie; Renaudin, Joël; Foissac, Xavier

    2008-01-01

    Background Spiroplama citri, the causal agent of citrus stubborn disease, is a bacterium of the class Mollicutes and is transmitted by phloem-feeding leafhopper vectors. In order to characterize candidate genes potentially involved in spiroplasma transmission and pathogenicity, the genome of S. citri strain GII3-3X is currently being deciphered. Results Assembling 20,000 sequencing reads generated seven circular contigs, none of which fit the 1.8 Mb chromosome map or carried chromosomal markers. These contigs correspond to seven plasmids: pSci1 to pSci6, with sizes ranging from 12.9 to 35.3 kbp and pSciA of 7.8 kbp. Plasmids pSci were detected as multiple copies in strain GII3-3X. Plasmid copy numbers of pSci1-6, as deduced from sequencing coverage, were estimated at 10 to 14 copies per spiroplasma cell, representing 1.6 Mb of extrachromosomal DNA. Genes encoding proteins of the TrsE-TraE, Mob, TraD-TraG, and Soj-ParA protein families were predicted in most of the pSci sequences, in addition to members of 14 protein families of unknown function. Plasmid pSci6 encodes protein P32, a marker of insect transmissibility. Plasmids pSci1-5 code for eight different S. citri adhesion-related proteins (ScARPs) that are homologous to the previously described protein P89 and the S. kunkelii SkARP1. Conserved signal peptides and C-terminal transmembrane alpha helices were predicted in all ScARPs. The predicted surface-exposed N-terminal region possesses the following elements: (i) 6 to 8 repeats of 39 to 42 amino acids each (sarpin repeats), (ii) a central conserved region of 330 amino acids followed by (iii) a more variable domain of about 110 amino acids. The C-terminus, predicted to be cytoplasmic, consists of a 27 amino acid stretch enriched in arginine and lysine (KR) and an optional 23 amino acid stretch enriched in lysine, aspartate and glutamate (KDE). Plasmids pSci mainly present a linear increase of cumulative GC skew except in regions presenting conserved hairpin structures. Conclusion The genome of S. citri GII3-3X is characterized by abundant extrachromosomal elements. The pSci plasmids could not only be vertically inherited but also horizontally transmitted, as they encode proteins usually involved in DNA element partitioning and cell to cell DNA transfer. Because plasmids pSci1-5 encode surface proteins of the ScARP family and pSci6 was recently shown to confer insect transmissibility, diversity and abundance of S. citri plasmids may essentially aid the rapid adaptation of S. citri to more efficient transmission by different insect vectors and to various plant hosts. PMID:18442384

  3. Characterization of contiguous gene deletions in COL4A6 and COL4A5 in Alport syndrome-diffuse leiomyomatosis.

    PubMed

    Nozu, Kandai; Minamikawa, Shogo; Yamada, Shiro; Oka, Masafumi; Yanagita, Motoko; Morisada, Naoya; Fujinaga, Shuichiro; Nagano, China; Gotoh, Yoshimitsu; Takahashi, Eihiko; Morishita, Takahiro; Yamamura, Tomohiko; Ninchoji, Takeshi; Kaito, Hiroshi; Morioka, Ichiro; Nakanishi, Koichi; Vorechovsky, Igor; Iijima, Kazumoto

    2017-07-01

    Alport syndrome-diffuse leiomyomatosis (AS-DL, OMIM: 308940) is a rare variant of the X-linked Alport syndrome that shows overgrowth of visceral smooth muscles in the gastrointestinal, respiratory and female reproductive tracts in addition to renal symptoms. AS-DL results from deletions that encompass the 5' ends of the COL4A5 and COL4A6 genes, but deletion breakpoints between COL4A5 and COL4A6 have been determined in only four cases. Here, we characterize deletion breakpoints in five AS-DL patients and show a contiguous COL4A6/COL4A5 deletion in each case. We also demonstrate that eight out of nine deletion alleles involved sequences homologous between COL4A5 and COL4A6. Most breakpoints took place in recognizable transposed elements, including long and short interspersed repeats, DNA transposons and long-terminal repeat retrotransposons. Because deletions involved the bidirectional promoter region in each case, we suggest that the occurrence of leiomyomatosis in AS-DL requires inactivation of both genes. Altogether, our study highlights the importance of homologous recombination involving multiple transposed elements for the development of this continuous gene syndrome and other atypical loss-of-function phenotypes.

  4. Different mechanisms are involved in the transcriptional activation by yeast heat shock transcription factor through two different types of heat shock elements.

    PubMed

    Hashikawa, Naoya; Yamamoto, Noritaka; Sakurai, Hiroshi

    2007-04-06

    The hydrophobic repeat is a conserved structural motif of eukaryotic heat shock transcription factor (HSF) that enables HSF to form a homotrimer. Homotrimeric HSF binds to heat shock elements (HSEs) consisting of three inverted repeats of the sequence nGAAn. Sequences consisting of four or more nGAAn units are bound cooperatively by two HSF trimers. We show that in Saccharomyces cerevisiae cells oligomerization-defective Hsf1 is not able to bind HSEs with three units and is not extensively phosphorylated in response to stress; it is therefore unable to activate genes containing this type of HSE. Several lines of evidence indicate that oligomerization is a prerequisite for stress-induced hyperphosphorylation of Hsf1. In contrast, oligomerization and hyperphosphorylation are not necessary for gene activation via HSEs with four units. Intragenic suppressor screening of oligomerization-defective hsf1 showed that an interface between adjacent DNA-binding domains is important for the binding of Hsf1 to the HSE. We suggest that Saccharomyces cerevisiae HSEs with different structures are regulated differently; HSEs with three units require Hsf1 to be both oligomerized and hyperphosphorylated, whereas HSEs with four or more units do not require either.

  5. Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.

    PubMed

    VanBuren, Robert; Bryant, Doug; Edger, Patrick P; Tang, Haibao; Burgess, Diane; Challabathula, Dinakar; Spittle, Kristi; Hall, Richard; Gu, Jenny; Lyons, Eric; Freeling, Michael; Bartels, Dorothea; Ten Hallers, Boudewijn; Hastie, Alex; Michael, Todd P; Mockler, Todd C

    2015-11-26

    Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a 'near-complete' draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.

  6. The Diversity of Prokaryotic DDE Transposases of the Mutator Superfamily, Insertion Specificity, and Association with Conjugation Machineries

    PubMed Central

    Guérillot, Romain; Siguier, Patricia; Gourbeyre, Edith; Chandler, Michael; Glaser, Philippe

    2014-01-01

    Transposable elements (TEs) are major components of both prokaryotic and eukaryotic genomes and play a significant role in their evolution. In this study, we have identified new prokaryotic DDE transposase families related to the eukaryotic Mutator-like transposases. These genes were retrieved by cascade PSI-Blast using as initial query the transposase of the streptococcal integrative and conjugative element (ICE) TnGBS2. By combining secondary structure predictions and protein sequence alignments, we predicted the DDE catalytic triad and the DNA-binding domain recognizing the terminal inverted repeats. Furthermore, we systematically characterized the organization and the insertion specificity of the TEs relying on these prokaryotic Mutator-like transposases (p-MULT) for their mobility. Strikingly, two distant TE families target their integration upstream σA dependent promoters. This allowed us to identify a transposase sequence signature associated with this unique insertion specificity and to show that the dissymmetry between the two inverted repeats is responsible for the orientation of the insertion. Surprisingly, while DDE transposases are generally associated with small and simple transposons such as insertion sequences (ISs), p-MULT encoding TEs show an unprecedented diversity with several families of IS, transposons, and ICEs ranging in size from 1.1 to 52 kb. PMID:24418649

  7. Low abundance of microsatellite repeats in the genome of the Brown-headed Cowbird (Molothrus ater)

    USGS Publications Warehouse

    Longmire, Jonathan L.; Hahn, D.C.; Roach, J.L.

    1999-01-01

    A cosmid library made from brown-headed cowbird (Molothrus ater) DNA was examined for representation of 17 distinct microsatellite motifs including all possible mono-, di-, and trinucleotide microsatellites, and the tetranucleotide repeat (GATA)n. The overall density of microsatellites within cowbird DNA was found to be one repeat per 89 kb and the frequency of the most abundant motif, (AGC)n, was once every 382 kb. The abundance of microsatellites within the cowbird genome is estimated to be reduced approximately 15-fold compared to humans. The reduced frequency of microsatellites seen in this study is consistent with previous observations indicating reduced numbers of microsatellites and other interspersed repeats in avian DNA. In addition to providing new information concerning the abundance of microsatellites within an avian genome, these results provide useful insights for selecting cloning strategies that might be used in the development of locus-specific microsatellite markers for avian studies.

  8. Super-lncRNAs: identification of lncRNAs that target super-enhancers via RNA:DNA:DNA triplex formation.

    PubMed

    Soibam, Benjamin

    2017-11-01

    Super-enhancers are characterized by high levels of Mediator binding and are major contributors to the expression of their associated genes. They exhibit high levels of local chromatin interactions and a higher order of local chromatin organization. On the other hand, lncRNAs can localize to specific DNA sites by forming a RNA:DNA:DNA triplex, which in turn can contribute to local chromatin organization. In this paper, we characterize a new class of lncRNAs called super-lncRNAs that target super-enhancers and which can contribute to the local chromatin organization of the super-enhancers. Using a logistic regression model based on the number of RNA:DNA:DNA triplex sites a lncRNA forms within the super-enhancer, we identify 442 unique super-lncRNA transcripts in 27 different human cell and tissue types; 70% of these super-lncRNAs were tissue restricted. They primarily harbor a single triplex-forming repeat domain, which forms an RNA:DNA:DNA triplex with multiple anchor DNA sites (originating from transposable elements) within the super-enhancers. Super-lncRNAs can be grouped into 17 different clusters based on the tissue or cell lines they target. Super-lncRNAs in a particular cluster share common short structural motifs and their corresponding super-enhancer targets are associated with gene ontology terms pertaining to the tissue or cell line. Super-lncRNAs may use these structural motifs to recruit and transport necessary regulators (such as transcription factors and Mediator complexes) to super-enhancers, influence chromatin organization, and act as spatial amplifiers for key tissue-specific genes associated with super-enhancers. © 2017 Soibam; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  9. Cascade DNA nanomachine and exponential amplification biosensing.

    PubMed

    Xu, Jianguo; Wu, Zai-Sheng; Shen, Weiyu; Xu, Huo; Li, Hongling; Jia, Lee

    2015-11-15

    DNA is a versatile scaffold for the assembly of multifunctional nanostructures, and potential applications of various DNA nanodevices have been recently demonstrated for disease diagnosis and treatment. In the current study, a powerful cascade DNA nanomachine was developed that can execute the exponential amplification of p53 tumor suppressor gene. During the operation of the newly-proposed DNA nanomachine, dual-cyclical nucleic acid strand-displacement polymerization (dual-CNDP) was ingeniously introduced, where the target trigger is repeatedly used as the fuel molecule and the nicked fragments are dramatically accumulated. Moreover, each displaced nicked fragment is able to activate the another type of cyclical strand-displacement amplification, increasing exponentially the value of fluorescence intensity. Essentially, one target binding event can induce considerable number of subsequent reactions, and the nanodevice was called cascade DNA nanomachine. It can implement several functions, including recognition element, signaling probe, polymerization primer and template. Using the developed autonomous operation of DNA nanomachine, the p53 gene can be quantified in the wide concentration range from 0.05 to 150 nM with the detection limit of 50 pM. If taking into account the final volume of mixture, the detection limit is calculated as lower as 6.2 pM, achieving an desirable assay ability. More strikingly, the mutant gene can be easily distinguished from the wild-type one. The proof-of-concept demonstrations reported herein is expected to promote the development and application of DNA nanomachine, showing great potential value in basic biology and medical diagnosis. Copyright © 2015 Elsevier B.V. All rights reserved.

  10. Two synthetic Sp1-binding sites functionally substitute for the 21-base-pair repeat region to activate simian virus 40 growth in CV-1 cells.

    PubMed Central

    Lednicky, J; Folk, W R

    1992-01-01

    The 21-bp repeat region of simian virus 40 (SV40) activates viral transcription and DNA replication and contains binding sites for many cellular proteins, including Sp1, LSF, ETF, Ap2, Ap4, GT-1B, H16, and p53, and for the SV40 large tumor antigen. We have attempted to reduce the complexity of this region while maintaining its growth-promoting capacity. Deletion of the 21-bp repeat region from the SV40 genome delays the expression of viral early proteins and DNA replication and reduces virus production in CV-1 cells. Replacement of the 21-bp repeat region with two copies of DNA sequence motifs bound with high affinities by Sp1 promotes SV40 growth in CV-1 cells to nearly wild-type levels, but substitution by motifs bound less avidly by Sp1 or bound by other activator proteins does not restore growth. This indicates that Sp1 or a protein with similar sequence specificity is primarily responsible for the function of the 21-bp repeat region. We speculate about how Sp1 activates both SV40 transcription and DNA replication. Images PMID:1328672

  11. Quantitative analysis of TALE-DNA interactions suggests polarity effects.

    PubMed

    Meckler, Joshua F; Bhakta, Mital S; Kim, Moon-Soo; Ovadia, Robert; Habrian, Chris H; Zykovich, Artem; Yu, Abigail; Lockwood, Sarah H; Morbitzer, Robert; Elsäesser, Janett; Lahaye, Thomas; Segal, David J; Baldwin, Enoch P

    2013-04-01

    Transcription activator-like effectors (TALEs) have revolutionized the field of genome engineering. We present here a systematic assessment of TALE DNA recognition, using quantitative electrophoretic mobility shift assays and reporter gene activation assays. Within TALE proteins, tandem 34-amino acid repeats recognize one base pair each and direct sequence-specific DNA binding through repeat variable di-residues (RVDs). We found that RVD choice can affect affinity by four orders of magnitude, with the relative RVD contribution in the order NG > HD ≈ NN > NI > NK. The NN repeat preferred the base G over A, whereas the NK repeat bound G with 10(3)-fold lower affinity. We compared AvrBs3, a naturally occurring TALE that recognizes its target using some atypical RVD-base combinations, with a designed TALE that precisely matches 'standard' RVDs with the target bases. This comparison revealed unexpected differences in sensitivity to substitutions of the invariant 5'-T. Another surprising observation was that base mismatches at the 5' end of the target site had more disruptive effects on affinity than those at the 3' end, particularly in designed TALEs. These results provide evidence that TALE-DNA recognition exhibits a hitherto un-described polarity effect, in which the N-terminal repeats contribute more to affinity than C-terminal ones.

  12. Isolation of human simple repeat loci by hybridization selection.

    PubMed

    Armour, J A; Neumann, R; Gobert, S; Jeffreys, A J

    1994-04-01

    We have isolated short tandem repeat arrays from the human genome, using a rapid method involving filter hybridization to enrich for tri- or tetranucleotide tandem repeats. About 30% of clones from the enriched library cross-hybridize with probes containing trimeric or tetrameric tandem arrays, facilitating the rapid isolation of large numbers of clones. In an initial analysis of 54 clones, 46 different tandem arrays were identified. Analysis of these tandem repeat loci by PCR showed that 24 were polymorphic in length; substantially higher levels of polymorphism were displayed by the tetrameric repeat loci isolated than by the trimeric repeats. Primary mapping of these loci by linkage analysis showed that they derive from 17 chromosomes, including the X chromosome. We anticipate the use of this strategy for the efficient isolation of tandem repeats from other sources of genomic DNA, including DNA from flow-sorted chromosomes, and from other species.

  13. Chompy: an infestation of MITE-like repetitive elements in the crocodilian genome.

    PubMed

    Ray, David A; Hedges, Dale J; Herke, Scott W; Fowlkes, Justin D; Barnes, Erin W; LaVie, Daniel K; Goodwin, Lindsey M; Densmore, Llewellyn D; Batzer, Mark A

    2005-12-05

    Interspersed repeats are a major component of most eukaryotic genomes and have an impact on genome size and stability, but the repetitive element landscape of crocodilian genomes has not yet been fully investigated. In this report, we provide the first detailed characterization of an interspersed repeat element in any crocodilian genome. Chompy is a putative miniature inverted-repeat transposable element (MITE) family initially recovered from the genome of Alligator mississippiensis (American alligator) but also present in the genomes of Crocodylus moreletii (Morelet's crocodile) and Gavialis gangeticus (Indian gharial). The element has all of the hallmarks of MITEs including terminal inverted repeats, possible target site duplications, and a tendency to form secondary structures. We estimate the copy number in the alligator genome to be approximately 46,000 copies. As a result of their size and unique properties, Chompy elements may provide a useful source of genomic variation for crocodilian comparative genomics.

  14. Characterization of proviruses cloned from mink cell focus-forming virus-infected cellular DNA.

    PubMed Central

    Khan, A S; Repaske, R; Garon, C F; Chan, H W; Rowe, W P; Martin, M A

    1982-01-01

    Two proviruses were cloned from EcoRI-digested DNA extracted from mink cells chronically infected with AKR mink cell focus-forming (MCF) 247 murine leukemia virus (MuLV), using a lambda phage host vector system. One cloned MuLV DNA fragment (designated MCF 1) contained sequences extending 6.8 kilobases from an EcoRI restriction site in the 5' long terminal repeat (LTR) to an EcoRI site located in the envelope (env) region and was indistinguishable by restriction endonuclease mapping for 5.1 kilobases (except for the EcoRI site in the LTR) from the 5' end of AKR ecotropic proviral DNA. The DNA segment extending from 5.1 to 6.8 kilobases contained several restriction sites that were not present in the AKR ecotropic provirus. A 0.5-kilobase DNA segment located at the 3' end of MCF 1 DNA contained sequences which hybridized to a xenotropic env-specific DNA probe but not to labeled ecotropic env-specific DNA. This dual character of MCF 1 proviral DNA was also confirmed by analyzing heteroduplex molecules by electron microscopy. The second cloned proviral DNA (designated MCF 2) was a 6.9-kilobase EcoRI DNA fragment which contained LTR sequences at each end and a 2.0-kilobase deletion encompassing most of the env region. The MCF 2 proviral DNA proved to be a useful reagent for detecting LTRs electron microscopically due to the presence of nonoverlapping, terminally located LTR sequences which effected its circularization with DNAs containing homologous LTR sequences. Nucleotide sequence analysis demonstrated the presence of a 104-base-pair direct repeat in the LTR of MCF 2 DNA. In contrast, only a single copy of the reiterated component of the direct repeat was present in MCF 1 DNA. Images PMID:6281459

  15. [Variability of nuclear 18S-25S rDNA of Gentiana lutea L. in nature and in tissue culture in vitro].

    PubMed

    Mel'nyk, V M; Spiridonova, K V; Andrieiev, I O; Strashniuk, N M; Kunakh, V A

    2004-01-01

    18S-25S rDNA sequence in genomes of G. lutea plants from different natural populations and from tissue culture has been studied with blot-hybridization method. It was shown that ribosomal repeats are represented by the variants which differ for their size and for the presence of additional HindIII restriction site. Genome of individual plant usually possesses several variants of DNA repeats. Interpopulation variability according to their quantitative ratio and to the presence of some of them has been shown. Modifications of the range of rDNA repeats not exceeding intraspecific variability were observed in callus tissues in comparison with the plants of initial population. Non-randomness of genome modifications in the course of cell adaptation to in vitro conditions makes it possible to some extent to forecast these modifications in tissue culture.

  16. Engineering DNA Backbone Interactions Results in TALE Scaffolds with Enhanced 5-Methylcytosine Selectivity.

    PubMed

    Rathi, Preeti; Witte, Anna; Summerer, Daniel

    2017-11-08

    Transcription activator-like effectors (TALEs) are DNA major-groove binding proteins widely used for genome targeting. TALEs contain an N-terminal region (NTR) and a central repeat domain (CRD). Repeats of the CRD selectively recognize each one DNA nucleobase, offering programmability. Moreover, repeats with selectivity for 5-methylcytosine (5mC) and its oxidized derivatives can be designed for analytical applications. However, both TALE domains also nonspecifically interact with DNA phosphates via basic amino acids. To enhance the 5mC selectivity of TALEs, we aimed to decrease the nonselective binding energy of TALEs. We substituted basic amino acids with alanine in the NTR and identified TALE mutants with increased selectivity. We then analysed conserved, DNA phosphate-binding KQ diresidues in CRD repeats and identified further improved mutants. Combination of mutations in the NTR and CRD was highly synergetic and resulted in TALE scaffolds with up to 4.3-fold increased selectivity in genomic 5mC analysis via affinity enrichment. Moreover, transcriptional activation in HEK293T cells by a TALE-VP64 construct based on this scaffold design exhibited a 3.5-fold increased 5mC selectivity. This provides perspectives for improved 5mC analysis and for the 5mC-conditional control of TALE-based editing constructs in vivo.

  17. Centromeric and non-centromeric satellite DNA organisation differs in holocentric Rhynchospora species.

    PubMed

    Ribeiro, Tiago; Marques, André; Novák, Petr; Schubert, Veit; Vanzela, André L L; Macas, Jiri; Houben, Andreas; Pedrosa-Harand, Andrea

    2017-03-01

    Satellite DNA repeats (or satDNA) are fast-evolving sequences usually associated with condensed heterochromatin. To test whether the chromosomal organisation of centromeric and non-centromeric satDNA differs in species with holocentric chromosomes, we identified and characterised the major satDNA families in the holocentric Cyperaceae species Rhynchospora ciliata (2n = 10), R. globosa (2n = 50) and R. tenuis (2n = 2x = 4 and 2n = 4x = 8). While conserved centromeric repeats (present in R. ciliata and R. tenuis) revealed linear signals at both chromatids, non-centromeric, species-specific satDNAs formed distinct clusters along the chromosomes. Colocalisation of both repeat types resulted in a ladder-like hybridisation pattern at mitotic chromosomes. In interphase, the centromeric satDNA was dispersed while non-centromeric satDNA clustered and partly colocalised to chromocentres. Despite the banding-like hybridisation patterns of the clustered satDNA, the identification of chromosome pairs was impaired due to the irregular hybridisation patterns of the homologues in R. tenuis and R. ciliata. These differences are probably caused by restricted or impaired meiotic recombination as reported for R. tenuis, or alternatively by complex chromosome rearrangements or unequal condensation of homologous metaphase chromosomes. Thus, holocentricity influences the chromosomal organisation leading to differences in the distribution patterns and condensation dynamics of centromeric and non-centromeric satDNA.

  18. Drosophila muller f elements maintain a distinct set of genomic properties over 40 million years of evolution.

    PubMed

    Leung, Wilson; Shaffer, Christopher D; Reed, Laura K; Smith, Sheryl T; Barshop, William; Dirkes, William; Dothager, Matthew; Lee, Paul; Wong, Jeannette; Xiong, David; Yuan, Han; Bedard, James E J; Machone, Joshua F; Patterson, Seantay D; Price, Amber L; Turner, Bryce A; Robic, Srebrenka; Luippold, Erin K; McCartha, Shannon R; Walji, Tezin A; Walker, Chelsea A; Saville, Kenneth; Abrams, Marita K; Armstrong, Andrew R; Armstrong, William; Bailey, Robert J; Barberi, Chelsea R; Beck, Lauren R; Blaker, Amanda L; Blunden, Christopher E; Brand, Jordan P; Brock, Ethan J; Brooks, Dana W; Brown, Marie; Butzler, Sarah C; Clark, Eric M; Clark, Nicole B; Collins, Ashley A; Cotteleer, Rebecca J; Cullimore, Peterson R; Dawson, Seth G; Docking, Carter T; Dorsett, Sasha L; Dougherty, Grace A; Downey, Kaitlyn A; Drake, Andrew P; Earl, Erica K; Floyd, Trevor G; Forsyth, Joshua D; Foust, Jonathan D; Franchi, Spencer L; Geary, James F; Hanson, Cynthia K; Harding, Taylor S; Harris, Cameron B; Heckman, Jonathan M; Holderness, Heather L; Howey, Nicole A; Jacobs, Dontae A; Jewell, Elizabeth S; Kaisler, Maria; Karaska, Elizabeth A; Kehoe, James L; Koaches, Hannah C; Koehler, Jessica; Koenig, Dana; Kujawski, Alexander J; Kus, Jordan E; Lammers, Jennifer A; Leads, Rachel R; Leatherman, Emily C; Lippert, Rachel N; Messenger, Gregory S; Morrow, Adam T; Newcomb, Victoria; Plasman, Haley J; Potocny, Stephanie J; Powers, Michelle K; Reem, Rachel M; Rennhack, Jonathan P; Reynolds, Katherine R; Reynolds, Lyndsey A; Rhee, Dong K; Rivard, Allyson B; Ronk, Adam J; Rooney, Meghan B; Rubin, Lainey S; Salbert, Luke R; Saluja, Rasleen K; Schauder, Taylor; Schneiter, Allison R; Schulz, Robert W; Smith, Karl E; Spencer, Sarah; Swanson, Bryant R; Tache, Melissa A; Tewilliager, Ashley A; Tilot, Amanda K; VanEck, Eve; Villerot, Matthew M; Vylonis, Megan B; Watson, David T; Wurzler, Juliana A; Wysocki, Lauren M; Yalamanchili, Monica; Zaborowicz, Matthew A; Emerson, Julia A; Ortiz, Carlos; Deuschle, Frederic J; DiLorenzo, Lauren A; Goeller, Katie L; Macchi, Christopher R; Muller, Sarah E; Pasierb, Brittany D; Sable, Joseph E; Tucci, Jessica M; Tynon, Marykathryn; Dunbar, David A; Beken, Levent H; Conturso, Alaina C; Danner, Benjamin L; DeMichele, Gabriella A; Gonzales, Justin A; Hammond, Maureen S; Kelley, Colleen V; Kelly, Elisabeth A; Kulich, Danielle; Mageeney, Catherine M; McCabe, Nikie L; Newman, Alyssa M; Spaeder, Lindsay A; Tumminello, Richard A; Revie, Dennis; Benson, Jonathon M; Cristostomo, Michael C; DaSilva, Paolo A; Harker, Katherine S; Jarrell, Jenifer N; Jimenez, Luis A; Katz, Brandon M; Kennedy, William R; Kolibas, Kimberly S; LeBlanc, Mark T; Nguyen, Trung T; Nicolas, Daniel S; Patao, Melissa D; Patao, Shane M; Rupley, Bryan J; Sessions, Bridget J; Weaver, Jennifer A; Goodman, Anya L; Alvendia, Erica L; Baldassari, Shana M; Brown, Ashley S; Chase, Ian O; Chen, Maida; Chiang, Scott; Cromwell, Avery B; Custer, Ashley F; DiTommaso, Tia M; El-Adaimi, Jad; Goscinski, Nora C; Grove, Ryan A; Gutierrez, Nestor; Harnoto, Raechel S; Hedeen, Heather; Hong, Emily L; Hopkins, Barbara L; Huerta, Vilma F; Khoshabian, Colin; LaForge, Kristin M; Lee, Cassidy T; Lewis, Benjamin M; Lydon, Anniken M; Maniaci, Brian J; Mitchell, Ryan D; Morlock, Elaine V; Morris, William M; Naik, Priyanka; Olson, Nicole C; Osterloh, Jeannette M; Perez, Marcos A; Presley, Jonathan D; Randazzo, Matt J; Regan, Melanie K; Rossi, Franca G; Smith, Melanie A; Soliterman, Eugenia A; Sparks, Ciani J; Tran, Danny L; Wan, Tiffany; Welker, Anne A; Wong, Jeremy N; Sreenivasan, Aparna; Youngblom, Jim; Adams, Andrew; Alldredge, Justin; Bryant, Ashley; Carranza, David; Cifelli, Alyssa; Coulson, Kevin; Debow, Calise; Delacruz, Noelle; Emerson, Charlene; Farrar, Cassandra; Foret, Don; Garibay, Edgar; Gooch, John; Heslop, Michelle; Kaur, Sukhjit; Khan, Ambreen; Kim, Van; Lamb, Travis; Lindbeck, Peter; Lucas, Gabi; Macias, Elizabeth; Martiniuc, Daniela; Mayorga, Lissett; Medina, Joseph; Membreno, Nelson; Messiah, Shady; Neufeld, Lacey; Nguyen, San Francisco; Nichols, Zachary; Odisho, George; Peterson, Daymon; Rodela, Laura; Rodriguez, Priscilla; Rodriguez, Vanessa; Ruiz, Jorge; Sherrill, Will; Silva, Valeria; Sparks, Jeri; Statton, Geeta; Townsend, Ashley; Valdez, Isabel; Waters, Mary; Westphal, Kyle; Winkler, Stacey; Zumkehr, Joannee; DeJong, Randall J; Hoogewerf, Arlene J; Ackerman, Cheri M; Armistead, Isaac O; Baatenburg, Lara; Borr, Matthew J; Brouwer, Lindsay K; Burkhart, Brandon J; Bushhouse, Kelsey T; Cesko, Lejla; Choi, Tiffany Y Y; Cohen, Heather; Damsteegt, Amanda M; Darusz, Jess M; Dauphin, Cory M; Davis, Yelena P; Diekema, Emily J; Drewry, Melissa; Eisen, Michelle E M; Faber, Hayley M; Faber, Katherine J; Feenstra, Elizabeth; Felzer-Kim, Isabella T; Hammond, Brandy L; Hendriksma, Jesse; Herrold, Milton R; Hilbrands, Julia A; Howell, Emily J; Jelgerhuis, Sarah A; Jelsema, Timothy R; Johnson, Benjamin K; Jones, Kelly K; Kim, Anna; Kooienga, Ross D; Menyes, Erika E; Nollet, Eric A; Plescher, Brittany E; Rios, Lindsay; Rose, Jenny L; Schepers, Allison J; Scott, Geoff; Smith, Joshua R; Sterling, Allison M; Tenney, Jenna C; Uitvlugt, Chris; VanDyken, Rachel E; VanderVennen, Marielle; Vue, Samantha; Kokan, Nighat P; Agbley, Kwabea; Boham, Sampson K; Broomfield, Daniel; Chapman, Kayla; Dobbe, Ali; Dobbe, Ian; Harrington, William; Ibrahem, Marwan; Kennedy, Andre; Koplinsky, Chad A; Kubricky, Cassandra; Ladzekpo, Danielle; Pattison, Claire; Ramirez, Roman E; Wande, Lucia; Woehlke, Sarah; Wawersik, Matthew; Kiernan, Elizabeth; Thompson, Jeffrey S; Banker, Roxanne; Bartling, Justina R; Bhatiya, Chinmoy I; Boudoures, Anna L; Christiansen, Lena; Fosselman, Daniel S; French, Kristin M; Gill, Ishwar S; Havill, Jessen T; Johnson, Jaelyn L; Keny, Lauren J; Kerber, John M; Klett, Bethany M; Kufel, Christina N; May, Francis J; Mecoli, Jonathan P; Merry, Callie R; Meyer, Lauren R; Miller, Emily G; Mullen, Gregory J; Palozola, Katherine C; Pfeil, Jacob J; Thomas, Jessica G; Verbofsky, Evan M; Spana, Eric P; Agarwalla, Anant; Chapman, Julia; Chlebina, Ben; Chong, Insun; Falk, I N; Fitzgibbons, John D; Friedman, Harrison; Ighile, Osagie; Kim, Andrew J; Knouse, Kristin A; Kung, Faith; Mammo, Danny; Ng, Chun Leung; Nikam, Vinayak S; Norton, Diana; Pham, Philip; Polk, Jessica W; Prasad, Shreya; Rankin, Helen; Ratliff, Camille D; Scala, Victoria; Schwartz, Nicholas U; Shuen, Jessica A; Xu, Amy; Xu, Thomas Q; Zhang, Yi; Rosenwald, Anne G; Burg, Martin G; Adams, Stephanie J; Baker, Morgan; Botsford, Bobbi; Brinkley, Briana; Brown, Carter; Emiah, Shadie; Enoch, Erica; Gier, Chad; Greenwell, Alyson; Hoogenboom, Lindsay; Matthews, Jordan E; McDonald, Mitchell; Mercer, Amanda; Monsma, Nicholaus; Ostby, Kristine; Ramic, Alen; Shallman, Devon; Simon, Matthew; Spencer, Eric; Tomkins, Trisha; Wendland, Pete; Wylie, Anna; Wolyniak, Michael J; Robertson, Gregory M; Smith, Samuel I; DiAngelo, Justin R; Sassu, Eric D; Bhalla, Satish C; Sharif, Karim A; Choeying, Tenzin; Macias, Jason S; Sanusi, Fareed; Torchon, Karvyn; Bednarski, April E; Alvarez, Consuelo J; Davis, Kristen C; Dunham, Carrie A; Grantham, Alaina J; Hare, Amber N; Schottler, Jennifer; Scott, Zackary W; Kuleck, Gary A; Yu, Nicole S; Kaehler, Marian M; Jipp, Jacob; Overvoorde, Paul J; Shoop, Elizabeth; Cyrankowski, Olivia; Hoover, Betsy; Kusner, Matt; Lin, Devry; Martinov, Tijana; Misch, Jonathan; Salzman, Garrett; Schiedermayer, Holly; Snavely, Michael; Zarrasola, Stephanie; Parrish, Susan; Baker, Atlee; Beckett, Alissa; Belella, Carissa; Bryant, Julie; Conrad, Turner; Fearnow, Adam; Gomez, Carolina; Herbstsomer, Robert A; Hirsch, Sarah; Johnson, Christen; Jones, Melissa; Kabaso, Rita; Lemmon, Eric; Vieira, Carolina Marques Dos Santos; McFarland, Darryl; McLaughlin, Christopher; Morgan, Abbie; Musokotwane, Sepo; Neutzling, William; Nietmann, Jana; Paluskievicz, Christina; Penn, Jessica; Peoples, Emily; Pozmanter, Caitlin; Reed, Emily; Rigby, Nichole; Schmidt, Lasse; Shelton, Micah; Shuford, Rebecca; Tirasawasdichai, Tiara; Undem, Blair; Urick, Damian; Vondy, Kayla; Yarrington, Bryan; Eckdahl, Todd T; Poet, Jeffrey L; Allen, Alica B; Anderson, John E; Barnett, Jason M; Baumgardner, Jordan S; Brown, Adam D; Carney, Jordan E; Chavez, Ramiro A; Christgen, Shelbi L; Christie, Jordan S; Clary, Andrea N; Conn, Michel A; Cooper, Kristen M; Crowley, Matt J; Crowley, Samuel T; Doty, Jennifer S; Dow, Brian A; Edwards, Curtis R; Elder, Darcie D; Fanning, John P; Janssen, Bridget M; Lambright, Anthony K; Lane, Curtiss E; Limle, Austin B; Mazur, Tammy; McCracken, Marly R; McDonough, Alexa M; Melton, Amy D; Minnick, Phillip J; Musick, Adam E; Newhart, William H; Noynaert, Joseph W; Ogden, Bradley J; Sandusky, Michael W; Schmuecker, Samantha M; Shipman, Anna L; Smith, Anna L; Thomsen, Kristen M; Unzicker, Matthew R; Vernon, William B; Winn, Wesley W; Woyski, Dustin S; Zhu, Xiao; Du, Chunguang; Ament, Caitlin; Aso, Soham; Bisogno, Laura Simone; Caronna, Jason; Fefelova, Nadezhda; Lopez, Lenin; Malkowitz, Lorraine; Marra, Jonathan; Menillo, Daniella; Obiorah, Ifeanyi; Onsarigo, Eric Nyabeta; Primus, Shekerah; Soos, Mahdi; Tare, Archana; Zidan, Ameer; Jones, Christopher J; Aronhalt, Todd; Bellush, James M; Burke, Christa; DeFazio, Steve; Does, Benjamin R; Johnson, Todd D; Keysock, Nicholas; Knudsen, Nelson H; Messler, James; Myirski, Kevin; Rekai, Jade Lea; Rempe, Ryan Michael; Salgado, Michael S; Stagaard, Erica; Starcher, Justin R; Waggoner, Andrew W; Yemelyanova, Anastasia K; Hark, Amy T; Bertolet, Anne; Kuschner, Cyrus E; Parry, Kesley; Quach, Michael; Shantzer, Lindsey; Shaw, Mary E; Smith, Mary A; Glenn, Omolara; Mason, Portia; Williams, Charlotte; Key, S Catherine Silver; Henry, Tyneshia C P; Johnson, Ashlee G; White, Jackie X; Haberman, Adam; Asinof, Sam; Drumm, Kelly; Freeburg, Trip; Safa, Nadia; Schultz, Darrin; Shevin, Yakov; Svoronos, Petros; Vuong, Tam; Wellinghoff, Jules; Hoopes, Laura L M; Chau, Kim M; Ward, Alyssa; Regisford, E Gloria C; Augustine, LaJerald; Davis-Reyes, Brionna; Echendu, Vivienne; Hales, Jasmine; Ibarra, Sharon; Johnson, Lauriaun; Ovu, Steven; Braverman, John M; Bahr, Thomas J; Caesar, Nicole M; Campana, Christopher; Cassidy, Daniel W; Cognetti, Peter A; English, Johnathan D; Fadus, Matthew C; Fick, Cameron N; Freda, Philip J; Hennessy, Bryan M; Hockenberger, Kelsey; Jones, Jennifer K; King, Jessica E; Knob, Christopher R; Kraftmann, Karen J; Li, Linghui; Lupey, Lena N; Minniti, Carl J; Minton, Thomas F; Moran, Joseph V; Mudumbi, Krishna; Nordman, Elizabeth C; Puetz, William J; Robinson, Lauren M; Rose, Thomas J; Sweeney, Edward P; Timko, Ashley S; Paetkau, Don W; Eisler, Heather L; Aldrup, Megan E; Bodenberg, Jessica M; Cole, Mara G; Deranek, Kelly M; DeShetler, Megan; Dowd, Rose M; Eckardt, Alexandra K; Ehret, Sharon C; Fese, Jessica; Garrett, Amanda D; Kammrath, Anna; Kappes, Michelle L; Light, Morgan R; Meier, Anne C; O'Rouke, Allison; Perella, Mallory; Ramsey, Kimberley; Ramthun, Jennifer R; Reilly, Mary T; Robinett, Deirdre; Rossi, Nadine L; Schueler, Mary Grace; Shoemaker, Emma; Starkey, Kristin M; Vetor, Ashley; Vrable, Abby; Chandrasekaran, Vidya; Beck, Christopher; Hatfield, Kristen R; Herrick, Douglas A; Khoury, Christopher B; Lea, Charlotte; Louie, Christopher A; Lowell, Shannon M; Reynolds, Thomas J; Schibler, Jeanine; Scoma, Alexandra H; Smith-Gee, Maxwell T; Tuberty, Sarah; Smith, Christopher D; Lopilato, Jane E; Hauke, Jeanette; Roecklein-Canfield, Jennifer A; Corrielus, Maureen; Gilman, Hannah; Intriago, Stephanie; Maffa, Amanda; Rauf, Sabya A; Thistle, Katrina; Trieu, Melissa; Winters, Jenifer; Yang, Bib; Hauser, Charles R; Abusheikh, Tariq; Ashrawi, Yara; Benitez, Pedro; Boudreaux, Lauren R; Bourland, Megan; Chavez, Miranda; Cruz, Samantha; Elliott, GiNell; Farek, Jesse R; Flohr, Sarah; Flores, Amanda H; Friedrichs, Chelsey; Fusco, Zach; Goodwin, Zane; Helmreich, Eric; Kiley, John; Knepper, John Mark; Langner, Christine; Martinez, Megan; Mendoza, Carlos; Naik, Monal; Ochoa, Andrea; Ragland, Nicolas; Raimey, England; Rathore, Sunil; Reza, Evangelina; Sadovsky, Griffin; Seydoux, Marie-Isabelle B; Smith, Jonathan E; Unruh, Anna K; Velasquez, Vicente; Wolski, Matthew W; Gosser, Yuying; Govind, Shubha; Clarke-Medley, Nicole; Guadron, Leslie; Lau, Dawn; Lu, Alvin; Mazzeo, Cheryl; Meghdari, Mariam; Ng, Simon; Pamnani, Brad; Plante, Olivia; Shum, Yuki Kwan Wa; Song, Roy; Johnson, Diana E; Abdelnabi, Mai; Archambault, Alexi; Chamma, Norma; Gaur, Shailly; Hammett, Deborah; Kandahari, Adrese; Khayrullina, Guzal; Kumar, Sonali; Lawrence, Samantha; Madden, Nigel; Mandelbaum, Max; Milnthorp, Heather; Mohini, Shiv; Patel, Roshni; Peacock, Sarah J; Perling, Emily; Quintana, Amber; Rahimi, Michael; Ramirez, Kristen; Singhal, Rishi; Weeks, Corinne; Wong, Tiffany; Gillis, Aubree T; Moore, Zachary D; Savell, Christopher D; Watson, Reece; Mel, Stephanie F; Anilkumar, Arjun A; Bilinski, Paul; Castillo, Rostislav; Closser, Michael; Cruz, Nathalia M; Dai, Tiffany; Garbagnati, Giancarlo F; Horton, Lanor S; Kim, Dongyeon; Lau, Joyce H; Liu, James Z; Mach, Sandy D; Phan, Thu A; Ren, Yi; Stapleton, Kenneth E; Strelitz, Jean M; Sunjed, Ray; Stamm, Joyce; Anderson, Morgan C; Bonifield, Bethany Grace; Coomes, Daniel; Dillman, Adam; Durchholz, Elaine J; Fafara-Thompson, Antoinette E; Gross, Meleah J; Gygi, Amber M; Jackson, Lesley E; Johnson, Amy; Kocsisova, Zuzana; Manghelli, Joshua L; McNeil, Kylie; Murillo, Michael; Naylor, Kierstin L; Neely, Jessica; Ogawa, Emmy E; Rich, Ashley; Rogers, Anna; Spencer, J Devin; Stemler, Kristina M; Throm, Allison A; Van Camp, Matt; Weihbrecht, Katie; Wiles, T Aaron; Williams, Mallory A; Williams, Matthew; Zoll, Kyle; Bailey, Cheryl; Zhou, Leming; Balthaser, Darla M; Bashiri, Azita; Bower, Mindy E; Florian, Kayla A; Ghavam, Nazanin; Greiner-Sosanko, Elizabeth S; Karim, Helmet; Mullen, Victor W; Pelchen, Carly E; Yenerall, Paul M; Zhang, Jiayu; Rubin, Michael R; Arias-Mejias, Suzette M; Bermudez-Capo, Armando G; Bernal-Vega, Gabriela V; Colon-Vazquez, Mariela; Flores-Vazquez, Arelys; Gines-Rosario, Mariela; Llavona-Cartagena, Ivan G; Martinez-Rodriguez, Javier O; Ortiz-Fuentes, Lionel; Perez-Colomba, Eliezer O; Perez-Otero, Joseph; Rivera, Elisandra; Rodriguez-Giron, Luke J; Santiago-Sanabria, Arnaldo J; Senquiz-Gonzalez, Andrea M; delValle, Frank R Soto; Vargas-Franco, Dorianmarie; Velázquez-Soto, Karla I; Zambrana-Burgos, Joan D; Martinez-Cruzado, Juan Carlos; Asencio-Zayas, Lillyann; Babilonia-Figueroa, Kevin; Beauchamp-Pérez, Francis D; Belén-Rodríguez, Juliana; Bracero-Quiñones, Luciann; Burgos-Bula, Andrea P; Collado-Méndez, Xavier A; Colón-Cruz, Luis R; Correa-Muller, Ana I; Crooke-Rosado, Jonathan L; Cruz-García, José M; Defendini-Ávila, Marianna; Delgado-Peraza, Francheska M; Feliciano-Cancela, Alex J; Gónzalez-Pérez, Valerie M; Guiblet, Wilfried; Heredia-Negrón, Aldo; Hernández-Muñiz, Jennifer; Irizarry-González, Lourdes N; Laboy-Corales, Ángel L; Llaurador-Caraballo, Gabriela A; Marín-Maldonado, Frances; Marrero-Llerena, Ulises; Martell-Martínez, Héctor A; Martínez-Traverso, Idaliz M; Medina-Ortega, Kiara N; Méndez-Castellanos, Sonya G; Menéndez-Serrano, Krizia C; Morales-Caraballo, Carol I; Ortiz-DeChoudens, Saryleine; Ortiz-Ortiz, Patricia; Pagán-Torres, Hendrick; Pérez-Afanador, Diana; Quintana-Torres, Enid M; Ramírez-Aponte, Edwin G; Riascos-Cuero, Carolina; Rivera-Llovet, Michelle S; Rivera-Pagán, Ingrid T; Rivera-Vicéns, Ramón E; Robles-Juarbe, Fabiola; Rodríguez-Bonilla, Lorraine; Rodríguez-Echevarría, Brian O; Rodríguez-García, Priscila M; Rodríguez-Laboy, Abneris E; Rodríguez-Santiago, Susana; Rojas-Vargas, Michael L; Rubio-Marrero, Eva N; Santiago-Colón, Albeliz; Santiago-Ortiz, Jorge L; Santos-Ramos, Carlos E; Serrano-González, Joseline; Tamayo-Figueroa, Alina M; Tascón-Peñaranda, Edna P; Torres-Castillo, José L; Valentín-Feliciano, Nelson A; Valentín-Feliciano, Yashira M; Vargas-Barreto, Nadyan M; Vélez-Vázquez, Miguel; Vilanova-Vélez, Luis R; Zambrana-Echevarría, Cristina; MacKinnon, Christy; Chung, Hui-Min; Kay, Chris; Pinto, Anthony; Kopp, Olga R; Burkhardt, Joshua; Harward, Chris; Allen, Robert; Bhat, Pavan; Chang, Jimmy Hsiang-Chun; Chen, York; Chesley, Christopher; Cohn, Dara; DuPuis, David; Fasano, Michael; Fazzio, Nicholas; Gavinski, Katherine; Gebreyesus, Heran; Giarla, Thomas; Gostelow, Marcus; Greenstein, Rachel; Gunasinghe, Hashini; Hanson, Casey; Hay, Amanda; He, Tao Jian; Homa, Katie; Howe, Ruth; Howenstein, Jeff; Huang, Henry; Khatri, Aaditya; Kim, Young Lu; Knowles, Olivia; Kong, Sarah; Krock, Rebecca; Kroll, Matt; Kuhn, Julia; Kwong, Matthew; Lee, Brandon; Lee, Ryan; Levine, Kevin; Li, Yedda; Liu, Bo; Liu, Lucy; Liu, Max; Lousararian, Adam; Ma, Jimmy; Mallya, Allyson; Manchee, Charlie; Marcus, Joseph; McDaniel, Stephen; Miller, Michelle L; Molleston, Jerome M; Diez, Cristina Montero; Ng, Patrick; Ngai, Natalie; Nguyen, Hien; Nylander, Andrew; Pollack, Jason; Rastogi, Suchita; Reddy, Himabindu; Regenold, Nathaniel; Sarezky, Jon; Schultz, Michael; Shim, Jien; Skorupa, Tara; Smith, Kenneth; Spencer, Sarah J; Srikanth, Priya; Stancu, Gabriel; Stein, Andrew P; Strother, Marshall; Sudmeier, Lisa; Sun, Mengyang; Sundaram, Varun; Tazudeen, Noor; Tseng, Alan; Tzeng, Albert; Venkat, Rohit; Venkataram, Sandeep; Waldman, Leah; Wang, Tracy; Yang, Hao; Yu, Jack Y; Zheng, Yin; Preuss, Mary L; Garcia, Angelica; Juergens, Matt; Morris, Robert W; Nagengast, Alexis A; Azarewicz, Julie; Carr, Thomas J; Chichearo, Nicole; Colgan, Mike; Donegan, Megan; Gardner, Bob; Kolba, Nik; Krumm, Janice L; Lytle, Stacey; MacMillian, Laurell; Miller, Mary; Montgomery, Andrew; Moretti, Alysha; Offenbacker, Brittney; Polen, Mike; Toth, John; Woytanowski, John; Kadlec, Lisa; Crawford, Justin; Spratt, Mary L; Adams, Ashley L; Barnard, Brianna K; Cheramie, Martin N; Eime, Anne M; Golden, Kathryn L; Hawkins, Allyson P; Hill, Jessica E; Kampmeier, Jessica A; Kern, Cody D; Magnuson, Emily E; Miller, Ashley R; Morrow, Cody M; Peairs, Julia C; Pickett, Gentry L; Popelka, Sarah A; Scott, Alexis J; Teepe, Emily J; TerMeer, Katie A; Watchinski, Carmen A; Watson, Lucas A; Weber, Rachel E; Woodard, Kate A; Barnard, Daron C; Appiah, Isaac; Giddens, Michelle M; McNeil, Gerard P; Adebayo, Adeola; Bagaeva, Kate; Chinwong, Justina; Dol, Chrystel; George, Eunice; Haltaufderhyde, Kirk; Haye, Joanna; Kaur, Manpreet; Semon, Max; Serjanov, Dmitri; Toorie, Anika; Wilson, Christopher; Riddle, Nicole C; Buhler, Jeremy; Mardis, Elaine R; Elgin, Sarah C R

    2015-03-04

    The Muller F element (4.2 Mb, ~80 protein-coding genes) is an unusual autosome of Drosophila melanogaster; it is mostly heterochromatic with a low recombination rate. To investigate how these properties impact the evolution of repeats and genes, we manually improved the sequence and annotated the genes on the D. erecta, D. mojavensis, and D. grimshawi F elements and euchromatic domains from the Muller D element. We find that F elements have greater transposon density (25-50%) than euchromatic reference regions (3-11%). Among the F elements, D. grimshawi has the lowest transposon density (particularly DINE-1: 2% vs. 11-27%). F element genes have larger coding spans, more coding exons, larger introns, and lower codon bias. Comparison of the Effective Number of Codons with the Codon Adaptation Index shows that, in contrast to the other species, codon bias in D. grimshawi F element genes can be attributed primarily to selection instead of mutational biases, suggesting that density and types of transposons affect the degree of local heterochromatin formation. F element genes have lower estimated DNA melting temperatures than D element genes, potentially facilitating transcription through heterochromatin. Most F element genes (~90%) have remained on that element, but the F element has smaller syntenic blocks than genome averages (3.4-3.6 vs. 8.4-8.8 genes per block), indicating greater rates of inversion despite lower rates of recombination. Overall, the F element has maintained characteristics that are distinct from other autosomes in the Drosophila lineage, illuminating the constraints imposed by a heterochromatic milieu. Copyright © 2015 Leung et al.

  19. Identification of a Recently Active Mammalian SINE Derived from Ribosomal RNA

    PubMed Central

    Longo, Mark S.; Brown, Judy D.; Zhang, Chu; O’Neill, Michael J.; O’Neill, Rachel J.

    2015-01-01

    Complex eukaryotic genomes are riddled with repeated sequences whose derivation does not coincide with phylogenetic history and thus is often unknown. Among such sequences, the capacity for transcriptional activity coupled with the adaptive use of reverse transcription can lead to a diverse group of genomic elements across taxa, otherwise known as selfish elements or mobile elements. Short interspersed nuclear elements (SINEs) are nonautonomous mobile elements found in eukaryotic genomes, typically derived from cellular RNAs such as tRNAs, 7SL or 5S rRNA. Here, we identify and characterize a previously unknown SINE derived from the 3′-end of the large ribosomal subunit (LSU or 28S rDNA) and transcribed via RNA polymerase III. This new element, SINE28, is represented in low-copy numbers in the human reference genome assembly, wherein we have identified 27 discrete loci. Phylogenetic analysis indicates these elements have been transpositionally active within primate lineages as recently as 6 MYA while modern humans still carry transcriptionally active copies. Moreover, we have identified SINE28s in all currently available assembled mammalian genome sequences. Phylogenetic comparisons indicate that these elements are frequently rederived from the highly conserved LSU rRNA sequences in a lineage-specific manner. We propose that this element has not been previously recognized as a SINE given its high identity to the canonical LSU, and that SINE28 likely represents one of possibly many unidentified, active transposable elements within mammalian genomes. PMID:25637222

  20. Condensin loaded onto the replication fork barrier site in the rRNA gene repeats during S phase in a FOB1-dependent fashion to prevent contraction of a long repetitive array in Saccharomyces cerevisiae.

    PubMed

    Johzuka, Katsuki; Terasawa, Masahiro; Ogawa, Hideyuki; Ogawa, Tomoko; Horiuchi, Takashi

    2006-03-01

    An average of 200 copies of the rRNA gene (rDNA) is clustered in a long tandem array in Saccharomyces cerevisiae. FOB1 is known to be required for expansion/contraction of the repeats by stimulating recombination, thereby contributing to the maintenance of the average copy number. In Deltafob1 cells, the repeats are still maintained without any fluctuation in the copy number, suggesting that another, unknown system acts to prevent repeat contraction. Here, we show that condensin acts together with FOB1 in a functionally complemented fashion to maintain the long tandem repeats. Six condensin mutants possessing severely contracted rDNA repeats were isolated in Deltafob1 cells but not in FOB1+ cells. We also found that the condensin complex associated with the nontranscribed spacer region of rDNA with a major peak coincided with the replication fork barrier (RFB) site in a FOB1-dependent fashion. Surprisingly, condensin association with the RFB site was established during S phase and was maintained until anaphase. These results indicate that FOB1 plays a novel role in preventing repeat contraction by regulating condensin association and suggest a link between replication termination and chromosome condensation and segregation.

  1. Mapping Fifteen Trace Elements in Human Seminal Plasma and Sperm DNA.

    PubMed

    Ali, Sazan; Chaspoul, Florence; Anderson, Loundou; Bergé-Lefranc, David; Achard, Vincent; Perrin, Jeanne; Gallice, Philippe; Guichaoua, Marie

    2017-02-01

    Studies suggest a relationship between semen quality and the concentration of trace elements in serum or seminal plasma. However, trace elements may be linked to DNA and capable of altering the gene expression patterns. Thus, trace element interactions with DNA may contribute to the mechanisms for a trans-generational reproductive effect. We developed an analytical method to determine the amount of trace elements bound to the sperm DNA, and to estimate their affinity for the sperm DNA by the ratio: R = Log [metal concentration in the sperm DNA/metal concentration in seminal plasma]. We then analyzed the concentrations of 15 trace elements (Al, Cd, Cr, Cu, Hg, Mn, Mo, Ni, Pb, Ti, V, Zn, As, Sb, and Se) in the seminal plasma and the sperm DNA in 64 normal and 30 abnormal semen specimens with Inductively Coupled Plasma/Mass Spectrometry (ICP-MS). This study showed all trace elements were detected in the seminal plasma and only metals were detected in the sperm DNA. There was no correlation between the metals' concentrations in the seminal plasma and the sperm DNA. Al had the highest affinity for DNA followed by Pb and Cd. This strong affinity is consistent with the known mutagenic effects of these metals. The lowest affinity was observed for Zn and Ti. We observed a significant increase of Al linked to the sperm DNA of patients with oligozoospermia and teratozoospermia. Al's reproductive toxicity might be due to Al linked to DNA, by altering spermatogenesis and expression patterns of genes involved in the function of reproduction.

  2. A Portrait of Ribosomal DNA Contacts with Hi-C Reveals 5S and 45S rDNA Anchoring Points in the Folded Human Genome

    PubMed Central

    Yu, Shoukai; Lemos, Bernardo

    2016-01-01

    Ribosomal RNAs (rRNAs) account for >60% of all RNAs in eukaryotic cells and are encoded in the ribosomal DNA (rDNA) arrays. The rRNAs are produced from two sets of loci: the 5S rDNA array resides exclusively on human chromosome 1, whereas the 45S rDNA array resides on the short arm of five human acrocentric chromosomes. The 45S rDNA gives origin to the nucleolus, the nuclear organelle that is the site of ribosome biogenesis. Intriguingly, 5S and 45S rDNA arrays exhibit correlated copy number variation in lymphoblastoid cells (LCLs). Here we examined the genomic architecture and repeat content of the 5S and 45S rDNA arrays in multiple human genome assemblies (including PacBio MHAP assembly) and ascertained contacts between the rDNA arrays and the rest of the genome using Hi-C datasets from two human cell lines (erythroleukemia K562 and lymphoblastoid cells). Our analyses revealed that 5S and 45S arrays each have thousands of contacts in the folded genome, with rDNA-associated regions and genes dispersed across all chromosomes. The rDNA contact map displayed conserved and disparate features between two cell lines, and pointed to specific chromosomes, genomic regions, and genes with evidence of spatial proximity to the rDNA arrays; the data also showed a lack of direct physical interaction between the 5S and 45S rDNA arrays. Finally, the analysis identified an intriguing organization in the 5S array with Alu and 5S elements adjacent to one another and organized in opposite orientation along the array. Portraits of genome folding centered on the ribosomal DNA array could help understand the emergence of concerted variation, the control of 5S and 45S expression, as well as provide insights into an organelle that contributes to the spatial localization of human chromosomes during interphase. PMID:27797956

  3. Strand invasion structures in the inverted repeat of Candida albicans mitochondrial DNA reveal a role for homologous recombination in replication.

    PubMed

    Gerhold, Joachim M; Aun, Anu; Sedman, Tiina; Jõers, Priit; Sedman, Juhan

    2010-09-24

    Molecular recombination and transcription are proposed mechanisms to initiate mitochondrial DNA (mtDNA) replication in yeast. We conducted a comprehensive analysis of mtDNA from the yeast Candida albicans. Two-dimensional agarose gel electrophoresis of mtDNA intermediates reveals no bubble structures diagnostic of specific replication origins, but rather supports recombination-driven replication initiation of mtDNA in yeast. Specific species of Y structures together with DNA copy number analyses of a C. albicans mutant strain provide evidence that a region in a mainly noncoding inverted repeat is predominantly involved in replication initiation via homologous recombination. Our further findings show that the C. albicans mtDNA forms a complex branched network that does not contain detectable amounts of circular molecules. We provide topological evidence for recombination-driven mtDNA replication initiation and introduce C. albicans as a suitable model organism to study wild-type mtDNA maintenance in yeast. Copyright © 2010 Elsevier Inc. All rights reserved.

  4. High-throughput single-molecule telomere characterization.

    PubMed

    McCaffrey, Jennifer; Young, Eleanor; Lassahn, Katy; Sibert, Justin; Pastor, Steven; Riethman, Harold; Xiao, Ming

    2017-11-01

    We have developed a novel method that enables global subtelomere and haplotype-resolved analysis of telomere lengths at the single-molecule level. An in vitro CRISPR/Cas9 RNA-directed nickase system directs the specific labeling of human (TTAGGG)n DNA tracts in genomes that have also been barcoded using a separate nickase enzyme that recognizes a 7-bp motif genome-wide. High-throughput imaging and analysis of large DNA single molecules from genomes labeled in this fashion using a nanochannel array system permits mapping through subtelomere repeat element (SRE) regions to unique chromosomal DNA while simultaneously measuring the (TTAGGG)n tract length at the end of each large telomere-terminal DNA segment. The methodology also permits subtelomere and haplotype-resolved analyses of SRE organization and variation, providing a window into the population dynamics and potential functions of these complex and structurally variant telomere-adjacent DNA regions. At its current stage of development, the assay can be used to identify and characterize telomere length distributions of 30-35 discrete telomeres simultaneously and accurately. The assay's utility is demonstrated using early versus late passage and senescent human diploid fibroblasts, documenting the anticipated telomere attrition on a global telomere-by-telomere basis as well as identifying subtelomere-specific biases for critically short telomeres. Similarly, we present the first global single-telomere-resolved analyses of two cancer cell lines. © 2017 McCaffrey et al.; Published by Cold Spring Harbor Laboratory Press.

  5. Rapid construction of insulated genetic circuits via synthetic sequence-guided isothermal assembly

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Torella, JP; Boehm, CR; Lienert, F

    2013-12-28

    In vitro recombination methods have enabled one-step construction of large DNA sequences from multiple parts. Although synthetic biological circuits can in principle be assembled in the same fashion, they typically contain repeated sequence elements such as standard promoters and terminators that interfere with homologous recombination. Here we use a computational approach to design synthetic, biologically inactive unique nucleotide sequences (UNSes) that facilitate accurate ordered assembly. Importantly, our designed UNSes make it possible to assemble parts with repeated terminator and insulator sequences, and thereby create insulated functional genetic circuits in bacteria and mammalian cells. Using UNS-guided assembly to construct repeating promoter-gene-terminatormore » parts, we systematically varied gene expression to optimize production of a deoxychromoviridans biosynthetic pathway in Escherichia coli. We then used this system to construct complex eukaryotic AND-logic gates for genomic integration into embryonic stem cells. Construction was performed by using a standardized series of UNS-bearing BioBrick-compatible vectors, which enable modular assembly and facilitate reuse of individual parts. UNS-guided isothermal assembly is broadly applicable to the construction and optimization of genetic circuits and particularly those requiring tight insulation, such as complex biosynthetic pathways, sensors, counters and logic gates.« less

  6. Regulation of HFE expression by Poly(ADP-ribose) polymerase-1 (PARP1) through an inverted repeat DNA sequence in the distal promoter

    PubMed Central

    Rodova, Marianna; Rudolph, Angela; Chipps, Elizabeth; Islam, M. Rafiq

    2013-01-01

    Hereditary hemochromatosis (HH) is a common autosomal recessive disorder of iron overload among Caucasians of northern European descent. Over 85% of all cases with HH are due to mutations in the hemochromatosis protein (HFE) involved in iron metabolism. Although the importance in iron homeostasis is well recognized, the mechanism of sensing and regulating iron absorption by HFE, especially in the absence of iron response element in its gene, is not fully understood. In this report, we have identified an inverted repeat sequence (ATGGTcttACCTA) within 1700 bp (−1675/+35) of the HFE promoter capable to form cruciform structure that binds PARP1 and strongly represses HFE promoter. Knockdown of PARP1 increases HFE mRNA and protein. Similarly, hemin or FeCl3 treatments resulted in increase in HFE expression by reducing nuclear PARP1 pool via its apoptosis induced cleavage, leading to upregulation of the iron regulatory hormone hepcidin mRNA. Thus, PARP1 binding to the inverted repeat sequence on the HFE promoter may serve as a novel iron sensing mechanism as increased iron level can trigger PARP1 cleavage and relief of HFE transcriptional repression. PMID:24184271

  7. Regulation of HFE expression by poly(ADP-ribose) polymerase-1 (PARP1) through an inverted repeat DNA sequence in the distal promoter.

    PubMed

    Pelham, Christopher; Jimenez, Tamara; Rodova, Marianna; Rudolph, Angela; Chipps, Elizabeth; Islam, M Rafiq

    2013-12-01

    Hereditary hemochromatosis (HH) is a common autosomal recessive disorder of iron overload among Caucasians of northern European descent. Over 85% of all cases with HH are due to mutations in the hemochromatosis protein (HFE) involved in iron metabolism. Although the importance in iron homeostasis is well recognized, the mechanism of sensing and regulating iron absorption by HFE, especially in the absence of iron response element in its gene, is not fully understood. In this report, we have identified an inverted repeat sequence (ATGGTcttACCTA) within 1700bp (-1675/+35) of the HFE promoter capable to form cruciform structure that binds PARP1 and strongly represses HFE promoter. Knockdown of PARP1 increases HFE mRNA and protein. Similarly, hemin or FeCl3 treatments resulted in increase in HFE expression by reducing nuclear PARP1 pool via its apoptosis induced cleavage, leading to upregulation of the iron regulatory hormone hepcidin mRNA. Thus, PARP1 binding to the inverted repeat sequence on the HFE promoter may serve as a novel iron sensing mechanism as increased iron level can trigger PARP1 cleavage and relief of HFE transcriptional repression. © 2013.

  8. Ordered mapping of 3 alphoid DNA subsets on human chromosome 22

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Antonacci, R.; Baldini, A.; Archidiacono, N.

    1994-09-01

    Alpha satellite DNA consists of tandemly repeated monomers of 171 bp clustered in the centromeric region of primate chromosomes. Sequence divergence between subsets located in different human chromosomes is usually high enough to ensure chromosome-specific hybridization. Alphoid probes specific for almost every human chromosome have been reported. A single chromosome can carry different subsets of alphoid DNA and some alphoid subsets can be shared by different chromosomes. We report the physical order of three alphoid DNA subsets on human chromosome 22 determined by a combination of low and high resolution cytological mapping methods. Results visually demonstrate the presence of threemore » distinct alphoid DNA domains at the centromeric region of chromosome 22. We have measured the interphase distances between the three probes in three-color FISH experiments. Statistical analysis of the results indicated the order of the subsets. Two color experiments on prometaphase chromosomes established the order of the three domains relative to the arms of chromosome 22 and confirmed the results obtained using interphase mapping. This demonstrates the applicability of interphase mapping for alpha satellite DNA orderering. However, in our experiments, interphase mapping did not provide any information about the relationship between extremities of the repeat arrays. This information was gained from extended chromatin hybridization. The extremities of two of the repeat arrays were seen to be almost overlapping whereas the third repeat array was clearly separated from the other two. Our data show the value of extended chromatin hybridization as a complement of other cytological techniques for high resolution mapping of repetitive DNA sequences.« less

  9. An annotated genetic map of loblolly pine based on microsatellite and cDNA markers

    USDA-ARS?s Scientific Manuscript database

    Previous loblolly pine (Pinus taeda L.) genetic linkage maps have been based on a variety of DNA polymorphisms, such as AFLPs, RAPDs, RFLPs, and ESTPs, but only a few SSRs (simple sequence repeats), also known as simple tandem repeats or microsatellites, have been mapped in P. taeda. The objective o...

  10. Comparative molecular cytogenetic analyses of a major tandemly repeated DNA family and retrotransposon sequences in cultivated jute Corchorus species (Malvaceae)

    PubMed Central

    Begum, Rabeya; Zakrzewski, Falk; Menzel, Gerhard; Weber, Beatrice; Alam, Sheikh Shamimul; Schmidt, Thomas

    2013-01-01

    Background and Aims The cultivated jute species Corchorus olitorius and Corchorus capsularis are important fibre crops. The analysis of repetitive DNA sequences, comprising a major part of plant genomes, has not been carried out in jute but is useful to investigate the long-range organization of chromosomes. The aim of this study was the identification of repetitive DNA sequences to facilitate comparative molecular and cytogenetic studies of two jute cultivars and to develop a fluorescent in situ hybridization (FISH) karyotype for chromosome identification. Methods A plasmid library was generated from C. olitorius and C. capsularis with genomic restriction fragments of 100–500 bp, which was complemented by targeted cloning of satellite DNA by PCR. The diversity of the repetitive DNA families was analysed comparatively. The genomic abundance and chromosomal localization of different repeat classes were investigated by Southern analysis and FISH, respectively. The cytosine methylation of satellite arrays was studied by immunolabelling. Key Results Major satellite repeats and retrotransposons have been identified from C. olitorius and C. capsularis. The satellite family CoSat I forms two undermethylated species-specific subfamilies, while the long terminal repeat (LTR) retrotransposons CoRetro I and CoRetro II show similarity to the Metaviridea of plant retroelements. FISH karyotypes were developed by multicolour FISH using these repetitive DNA sequences in combination with 5S and 18S–5·8S–25S rRNA genes which enable the unequivocal chromosome discrimination in both jute species. Conclusions The analysis of the structure and diversity of the repeated DNA is crucial for genome sequence annotation. The reference karyotypes will be useful for breeding of jute and provide the basis for karyotyping homeologous chromosomes of wild jute species to reveal the genetic and evolutionary relationship between cultivated and wild Corchorus species. PMID:23666888

  11. The CRISPR RNA-guided surveillance complex in Escherichia coli accommodates extended RNA spacers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Luo, Michelle L.; Jackson, Ryan N.; Denny, Steven R.

    Bacteria and archaea acquire resistance to foreign genetic elements by integrating fragments of foreign DNA into CRISPR (clustered regularly interspaced short palindromic repeats) loci. In Escherichia coli, CRISPR-derived RNAs (crRNAs) assemble with Cas proteins into a multi-subunit surveillance complex called Cascade (CRISPR-associated complex for antiviral defense). Cascade recognizes DNA targets via protein-mediated recognition of a protospacer adjacent motif and complementary base pairing between the crRNA spacer and the DNA target. Previously determined structures of Cascade showed that the crRNA is stretched along an oligomeric protein assembly, leading us to ask how crRNA length impacts the assembly and function of thismore » complex. We found that extending the spacer portion of the crRNA resulted in larger Cascade complexes with altered stoichiometry and preserved in vitro binding affinity for target DNA. Longer spacers also preserved the in vivo ability of Cascade to repress target gene expression and to recruit the Cas3 endonuclease for target degradation. Lastly, longer spacers exhibited enhanced silencing at particular target locations and were sensitive to mismatches within the extended region. These findings demonstrate the flexibility of the Type I-E CRISPR machinery and suggest that spacer length can be modified to fine-tune Cascade activity.« less

  12. The CRISPR RNA-guided surveillance complex in Escherichia coli accommodates extended RNA spacers

    DOE PAGES

    Luo, Michelle L.; Jackson, Ryan N.; Denny, Steven R.; ...

    2016-05-12

    Bacteria and archaea acquire resistance to foreign genetic elements by integrating fragments of foreign DNA into CRISPR (clustered regularly interspaced short palindromic repeats) loci. In Escherichia coli, CRISPR-derived RNAs (crRNAs) assemble with Cas proteins into a multi-subunit surveillance complex called Cascade (CRISPR-associated complex for antiviral defense). Cascade recognizes DNA targets via protein-mediated recognition of a protospacer adjacent motif and complementary base pairing between the crRNA spacer and the DNA target. Previously determined structures of Cascade showed that the crRNA is stretched along an oligomeric protein assembly, leading us to ask how crRNA length impacts the assembly and function of thismore » complex. We found that extending the spacer portion of the crRNA resulted in larger Cascade complexes with altered stoichiometry and preserved in vitro binding affinity for target DNA. Longer spacers also preserved the in vivo ability of Cascade to repress target gene expression and to recruit the Cas3 endonuclease for target degradation. Lastly, longer spacers exhibited enhanced silencing at particular target locations and were sensitive to mismatches within the extended region. These findings demonstrate the flexibility of the Type I-E CRISPR machinery and suggest that spacer length can be modified to fine-tune Cascade activity.« less

  13. Mutations in Nonconserved Domains of Ty3 Integrase Affect Multiple Stages of the Ty3 Life Cycle

    PubMed Central

    Nymark-McMahon, M. Henrietta; Sandmeyer, Suzanne B.

    1999-01-01

    Ty3, a retroviruslike element of Saccharomyces cerevisiae, transposes into positions immediately upstream of RNA polymerase III-transcribed genes. The Ty3 integrase (IN) protein is required for integration of the replicated, extrachromosomal Ty3 DNA. In retroviral IN, a conserved core region is sufficient for strand transfer activity. In this study, charged-to-alanine scanning mutagenesis was used to investigate the roles of the nonconserved amino- and carboxyl-terminal regions of Ty3 IN. Each of the 20 IN mutants was defective for transposition, but no mutant was grossly defective for capsid maturation. All mutations affecting steady-state levels of mature IN protein resulted in reduced levels of replicated DNA, even when polymerase activity was not grossly defective as measured by exogenous reverse transcriptase activity assay. Thus, IN could contribute to nonpolymerase functions required for DNA production in vivo or to the stability of the DNA product. Several mutations in the carboxyl-terminal domain resulted in relatively low levels of processed 3′ ends of the replicated DNA, suggesting that this domain may be important for binding of IN to the long terminal repeat. Another class of mutants produced wild-type amounts of DNA with correctly processed 3′ ends. This class could include mutants affected in nuclear entry and target association. Collectively, these mutations demonstrate that in vivo, within the preintegration complex, IN performs a central role in coordinating multiple late stages of the retrotransposition life cycle. PMID:9847351

  14. Functional centromeres in Astragalus sinicus include a compact centromere-specific histone H3 and a 20-bp tandem repeat.

    PubMed

    Tek, Ahmet L; Kashihara, Kazunari; Murata, Minoru; Nagaki, Kiyotaka

    2011-11-01

    The centromere plays an essential role for proper chromosome segregation during cell division and usually harbors long arrays of tandem repeated satellite DNA sequences. Although this function is conserved among eukaryotes, the sequences of centromeric DNA repeats are variable. Most of our understanding of functional centromeres, which are defined by localization of a centromere-specific histone H3 (CENH3) protein, comes from model organisms. The components of the functional centromere in legumes are poorly known. The genus Astragalus is a member of the legumes and bears the largest numbers of species among angiosperms. Therefore, we studied the components of centromeres in Astragalus sinicus. We identified the CenH3 homolog of A. sinicus, AsCenH3 that is the most compact in size among higher eukaryotes. A CENH3-based assay revealed the functional centromeric DNA sequences from A. sinicus, called CentAs. The CentAs repeat is localized in A. sinicus centromeres, and comprises an AT-rich tandem repeat with a monomer size of 20 nucleotides.

  15. Insights into mutagenesis using Escherichia coli chromosomal lacZ strains that enable detection of a wide spectrum of mutational events.

    PubMed

    Seier, Tracey; Padgett, Dana R; Zilberberg, Gal; Sutera, Vincent A; Toha, Noor; Lovett, Susan T

    2011-06-01

    Strand misalignments at DNA repeats during replication are implicated in mutational hotspots. To study these events, we have generated strains carrying mutations in the Escherichia coli chromosomal lacZ gene that revert via deletion of a short duplicated sequence or by template switching within imperfect inverted repeat (quasipalindrome, QP) sequences. Using these strains, we demonstrate that mutation of the distal repeat of a quasipalindrome, with respect to replication fork movement, is about 10-fold higher than the proximal repeat, consistent with more common template switching on the leading strand. The leading strand bias was lost in the absence of exonucleases I and VII, suggesting that it results from more efficient suppression of template switching by 3' exonucleases targeted to the lagging strand. The loss of 3' exonucleases has no effect on strand misalignment at direct repeats to produce deletion. To compare these events to other mutations, we have reengineered reporters (designed by Cupples and Miller 1989) that detect specific base substitutions or frameshifts in lacZ with the reverting lacZ locus on the chromosome rather than an F' element. This set allows rapid screening of potential mutagens, environmental conditions, or genetic loci for effects on a broad set of mutational events. We found that hydroxyurea (HU), which depletes dNTP pools, slightly elevated templated mutations at inverted repeats but had no effect on deletions, simple frameshifts, or base substitutions. Mutations in nucleotide diphosphate kinase, ndk, significantly elevated simple mutations but had little effect on the templated class. Zebularine, a cytosine analog, elevated all classes.

  16. Retroelements (LINEs and SINEs) in vole genomes: differential distribution in the constitutive heterochromatin.

    PubMed

    Acosta, M J; Marchal, J A; Fernández-Espartero, C H; Bullejos, M; Sánchez, A

    2008-01-01

    The chromosomal distribution of mobile genetic elements is scarcely known in Arvicolinae species, but could be of relevance to understand the origin and complex evolution of the sex chromosome heterochromatin. In this work we cloned two retrotransposon sequences, L1 and SINE-B1, from the genome of Chionomys nivalis and investigated their chromosomal distribution on several arvicoline species. Our results demonstrate first that both retroelements are the most abundant repeated DNA sequences in the genome of these species. L1 elements, in most species, are highly accumulated in the sex chromosomes compared to the autosomes. This favoured L1 insertion could have played an important role in the origin of the enlarged heterochromatic blocks existing in the sex chromosomes of some Microtus species. Also, we propose that L1 accumulation on the X heterochromatin could have been the consequence of different, independent and rapid amplification processes acting in each species. SINE elements, however, were completely lacking from the constitutive heterochromatin, either in autosomes or in the heterochromatic blocks of sex chromosomes. These data could indicate that some SINE elements are incompatible with the formation of heterochromatic complexes and hence are necessarily missing from the constitutive heterochromatin.

  17. Noncoding origins of anthropoid traits and a new null model of transposon functionalization.

    PubMed

    del Rosario, Ricardo C H; Rayan, Nirmala Arul; Prabhakar, Shyam

    2014-09-01

    Little is known about novel genetic elements that drove the emergence of anthropoid primates. We exploited the sequencing of the marmoset genome to identify 23,849 anthropoid-specific constrained (ASC) regions and confirmed their robust functional signatures. Of the ASC base pairs, 99.7% were noncoding, suggesting that novel anthropoid functional elements were overwhelmingly cis-regulatory. ASCs were highly enriched in loci associated with fetal brain development, motor coordination, neurotransmission, and vision, thus providing a large set of candidate elements for exploring the molecular basis of hallmark primate traits. We validated ASC192 as a primate-specific enhancer in proliferative zones of the developing brain. Unexpectedly, transposable elements (TEs) contributed to >56% of ASCs, and almost all TE families showed functional potential similar to that of nonrepetitive DNA. Three L1PA repeat-derived ASCs displayed coherent eye-enhancer function, thus demonstrating that the "gene-battery" model of TE functionalization applies to enhancers in vivo. Our study provides fundamental insights into genome evolution and the origins of anthropoid phenotypes and supports an elegantly simple new null model of TE exaptation. © 2014 del Rosario et al.; Published by Cold Spring Harbor Laboratory Press.

  18. Optical mapping reveals a large genetic inversion between two methicillin-resistant Staphylococcus aureus strains.

    PubMed

    Shukla, Sanjay K; Kislow, Jennifer; Briska, Adam; Henkhaus, John; Dykes, Colin

    2009-09-01

    Staphylococcus aureus is a highly versatile and evolving bacterium of great clinical importance. S. aureus can evolve by acquiring single nucleotide polymorphisms and mobile genetic elements and by recombination events. Identification and location of novel genomic elements in a bacterial genome are not straightforward, unless the whole genome is sequenced. Optical mapping is a new tool that creates a high-resolution, in situ ordered restriction map of a bacterial genome. These maps can be used to determine genomic organization and perform comparative genomics to identify genomic rearrangements, such as insertions, deletions, duplications, and inversions, compared to an in silico (virtual) restriction map of a known genome sequence. Using this technology, we report here the identification, approximate location, and characterization of a genetic inversion of approximately 500 kb of a DNA element between the NRS387 (USA800) and FPR3757 (USA300) strains. The presence of the inversion and location of its junction sites were confirmed by site-specific PCR and sequencing. At both the left and right junction sites in NRS387, an IS1181 element and a 73-bp sequence were identified as inverted repeats, which could explain the possible mechanism of the inversion event.

  19. Identifying structural variation in haploid microbial genomes from short-read resequencing data using breseq.

    PubMed

    Barrick, Jeffrey E; Colburn, Geoffrey; Deatherage, Daniel E; Traverse, Charles C; Strand, Matthew D; Borges, Jordan J; Knoester, David B; Reba, Aaron; Meyer, Austin G

    2014-11-29

    Mutations that alter chromosomal structure play critical roles in evolution and disease, including in the origin of new lifestyles and pathogenic traits in microbes. Large-scale rearrangements in genomes are often mediated by recombination events involving new or existing copies of mobile genetic elements, recently duplicated genes, or other repetitive sequences. Most current software programs for predicting structural variation from short-read DNA resequencing data are intended primarily for use on human genomes. They typically disregard information in reads mapping to repeat sequences, and significant post-processing and manual examination of their output is often required to rule out false-positive predictions and precisely describe mutational events. We have implemented an algorithm for identifying structural variation from DNA resequencing data as part of the breseq computational pipeline for predicting mutations in haploid microbial genomes. Our method evaluates the support for new sequence junctions present in a clonal sample from split-read alignments to a reference genome, including matches to repeat sequences. Then, it uses a statistical model of read coverage evenness to accept or reject these predictions. Finally, breseq combines predictions of new junctions and deleted chromosomal regions to output biologically relevant descriptions of mutations and their effects on genes. We demonstrate the performance of breseq on simulated Escherichia coli genomes with deletions generating unique breakpoint sequences, new insertions of mobile genetic elements, and deletions mediated by mobile elements. Then, we reanalyze data from an E. coli K-12 mutation accumulation evolution experiment in which structural variation was not previously identified. Transposon insertions and large-scale chromosomal changes detected by breseq account for ~25% of spontaneous mutations in this strain. In all cases, we find that breseq is able to reliably predict structural variation with modest read-depth coverage of the reference genome (>40-fold). Using breseq to predict structural variation should be useful for studies of microbial epidemiology, experimental evolution, synthetic biology, and genetics when a reference genome for a closely related strain is available. In these cases, breseq can discover mutations that may be responsible for important or unintended changes in genomes that might otherwise go undetected.

  20. Repetitive DNA and Plant Domestication: Variation in Copy Number and Proximity to Genes of LTR-Retrotransposons among Wild and Cultivated Sunflower (Helianthus annuus) Genotypes.

    PubMed

    Mascagni, Flavia; Barghini, Elena; Giordani, Tommaso; Rieseberg, Loren H; Cavallini, Andrea; Natali, Lucia

    2015-11-24

    The sunflower (Helianthus annuus) genome contains a very large proportion of transposable elements, especially long terminal repeat retrotransposons. However, knowledge on the retrotransposon-related variability within this species is still limited. We used next-generation sequencing (NGS) technologies to perform a quantitative and qualitative survey of intraspecific variation of the retrotransposon fraction of the genome across 15 genotypes--7 wild accessions and 8 cultivars--of H. annuus. By mapping the Illumina reads of the 15 genotypes onto a library of sunflower long terminal repeat retrotransposons, we observed considerable variability in redundancy among genotypes, at both superfamily and family levels. In another analysis, we mapped Illumina paired reads to two sets of sequences, that is, long terminal repeat retrotransposons and protein-encoding sequences, and evaluated the extent of retrotransposon proximity to genes in the sunflower genome by counting the number of paired reads in which one read mapped to a retrotransposon and the other to a gene. Large variability among genotypes was also ascertained for retrotransposon proximity to genes. Both long terminal repeat retrotransposon redundancy and proximity to genes varied among retrotransposon families and also between cultivated and wild genotypes. Such differences are discussed in relation to the possible role of long terminal repeat retrotransposons in the domestication of sunflower. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  1. DNA cytosine methylation in the bovine leukemia virus promoter is associated with latency in a lymphoma-derived B-cell line: potential involvement of direct inhibition of cAMP-responsive element (CRE)-binding protein/CRE modulator/activation transcription factor binding.

    PubMed

    Pierard, Valérie; Guiguen, Allan; Colin, Laurence; Wijmeersch, Gaëlle; Vanhulle, Caroline; Van Driessche, Benoît; Dekoninck, Ann; Blazkova, Jana; Cardona, Christelle; Merimi, Makram; Vierendeel, Valérie; Calomme, Claire; Nguyên, Thi Liên-Anh; Nuttinck, Michèle; Twizere, Jean-Claude; Kettmann, Richard; Portetelle, Daniel; Burny, Arsène; Hirsch, Ivan; Rohr, Olivier; Van Lint, Carine

    2010-06-18

    Bovine leukemia virus (BLV) proviral latency represents a viral strategy to escape the host immune system and allow tumor development. Besides the previously demonstrated role of histone deacetylation in the epigenetic repression of BLV expression, we showed here that BLV promoter activity was induced by several DNA methylation inhibitors (such as 5-aza-2'-deoxycytidine) and that overexpressed DNMT1 and DNMT3A, but not DNMT3B, down-regulated BLV promoter activity. Importantly, cytosine hypermethylation in the 5'-long terminal repeat (LTR) U3 and R regions was associated with true latency in the lymphoma-derived B-cell line L267 but not with defective latency in YR2 cells. Moreover, the virus-encoded transactivator Tax(BLV) decreased DNA methyltransferase expression levels, which could explain the lower level of cytosine methylation observed in the L267(LTaxSN) 5'-LTR compared with the L267 5'-LTR. Interestingly, DNA methylation inhibitors and Tax(BLV) synergistically activated BLV promoter transcriptional activity in a cAMP-responsive element (CRE)-dependent manner. Mechanistically, methylation at the -154 or -129 CpG position (relative to the transcription start site) impaired in vitro binding of CRE-binding protein (CREB) transcription factors to their respective CRE sites. Methylation at -129 CpG alone was sufficient to decrease BLV promoter-driven reporter gene expression by 2-fold. We demonstrated in vivo the recruitment of CREB/CRE modulator (CREM) and to a lesser extent activating transcription factor-1 (ATF-1) to the hypomethylated CRE region of the YR2 5'-LTR, whereas we detected no CREB/CREM/ATF recruitment to the hypermethylated corresponding region in the L267 cells. Altogether, these findings suggest that site-specific DNA methylation of the BLV promoter represses viral transcription by directly inhibiting transcription factor binding, thereby contributing to true proviral latency.

  2. Ribosomal DNA copy loss and repeat instability in ATRX-mutated cancers

    PubMed Central

    Udugama, Maheshi; Sanij, Elaine; Voon, Hsiao P. J.; Son, Jinbae; Hii, Linda; Henson, Jeremy D.; Chan, F. Lyn; Chang, Fiona T. M.; Liu, Yumei; Pearson, Richard B.; Kalitsis, Paul; Mann, Jeffrey R.; Collas, Philippe; Hannan, Ross D.; Wong, Lee H.

    2018-01-01

    ATRX (alpha thalassemia/mental retardation X-linked) complexes with DAXX to deposit histone variant H3.3 into repetitive heterochromatin. Recent genome sequencing studies in cancers have revealed mutations in ATRX and their association with ALT (alternative lengthening of telomeres) activation. Here we report depletion of ATRX in mouse ES cells leads to selective loss in ribosomal RNA gene (rDNA) copy number. Supporting this, ATRX-mutated human ALT-positive tumors also show a substantially lower rDNA copy than ALT-negative tumors. Further investigation shows that the rDNA copy loss and repeat instability are caused by a disruption in H3.3 deposition and thus a failure in heterochromatin formation at rDNA repeats in the absence of ATRX. We also find that ATRX-depleted cells are reduced in ribosomal RNA transcription output and show increased sensitivity to RNA polymerase I (Pol I) transcription inhibitor CX5461. In addition, human ALT-positive cancer cell lines are also more sensitive to CX5461 treatment. Our study provides insights into the contribution of ATRX loss of function to tumorigenesis through the loss of rDNA stability and suggests the therapeutic potential of targeting Pol I transcription in ALT cancers. PMID:29669917

  3. Genome Comparison of Barley and Maize Smut Fungi Reveals Targeted Loss of RNA Silencing Components and Species-Specific Presence of Transposable Elements[W

    PubMed Central

    Laurie, John D.; Ali, Shawkat; Linning, Rob; Mannhaupt, Gertrud; Wong, Philip; Güldener, Ulrich; Münsterkötter, Martin; Moore, Richard; Kahmann, Regine; Bakkeren, Guus; Schirawski, Jan

    2012-01-01

    Ustilago hordei is a biotrophic parasite of barley (Hordeum vulgare). After seedling infection, the fungus persists in the plant until head emergence when fungal spores develop and are released from sori formed at kernel positions. The 26.1-Mb U. hordei genome contains 7113 protein encoding genes with high synteny to the smaller genomes of the related, maize-infecting smut fungi Ustilago maydis and Sporisorium reilianum but has a larger repeat content that affected genome evolution at important loci, including mating-type and effector loci. The U. hordei genome encodes components involved in RNA interference and heterochromatin formation, normally involved in genome defense, that are lacking in the U. maydis genome due to clean excision events. These excision events were possibly a result of former presence of repetitive DNA and of an efficient homologous recombination system in U. maydis. We found evidence of repeat-induced point mutations in the genome of U. hordei, indicating that smut fungi use different strategies to counteract the deleterious effects of repetitive DNA. The complement of U. hordei effector genes is comparable to the other two smuts but reveals differences in family expansion and clustering. The availability of the genome sequence will facilitate the identification of genes responsible for virulence and evolution of smut fungi on their respective hosts. PMID:22623492

  4. Genome comparison of barley and maize smut fungi reveals targeted loss of RNA silencing components and species-specific presence of transposable elements.

    PubMed

    Laurie, John D; Ali, Shawkat; Linning, Rob; Mannhaupt, Gertrud; Wong, Philip; Güldener, Ulrich; Münsterkötter, Martin; Moore, Richard; Kahmann, Regine; Bakkeren, Guus; Schirawski, Jan

    2012-05-01

    Ustilago hordei is a biotrophic parasite of barley (Hordeum vulgare). After seedling infection, the fungus persists in the plant until head emergence when fungal spores develop and are released from sori formed at kernel positions. The 26.1-Mb U. hordei genome contains 7113 protein encoding genes with high synteny to the smaller genomes of the related, maize-infecting smut fungi Ustilago maydis and Sporisorium reilianum but has a larger repeat content that affected genome evolution at important loci, including mating-type and effector loci. The U. hordei genome encodes components involved in RNA interference and heterochromatin formation, normally involved in genome defense, that are lacking in the U. maydis genome due to clean excision events. These excision events were possibly a result of former presence of repetitive DNA and of an efficient homologous recombination system in U. maydis. We found evidence of repeat-induced point mutations in the genome of U. hordei, indicating that smut fungi use different strategies to counteract the deleterious effects of repetitive DNA. The complement of U. hordei effector genes is comparable to the other two smuts but reveals differences in family expansion and clustering. The availability of the genome sequence will facilitate the identification of genes responsible for virulence and evolution of smut fungi on their respective hosts.

  5. Heteroplasmy and evidence for recombination in the mitochondrial control region of the flatfish Platichthys flesus.

    PubMed

    Hoarau, Galice; Holla, Suzanne; Lescasse, Rachel; Stam, Wytze T; Olsen, Jeanine L

    2002-12-01

    The general assumption that mitochondrial DNA (mtDNA) does not undergo recombination has been challenged recently in invertebrates. Here we present the first direct evidence for recombination in the mtDNA of a vertebrate, the flounder Platichthys flesus. The control region in the mtDNA of this flatfish is characterized by the presence of a variable number of tandem repeats and a high level of heteroplasmy. Two types of repeats were recognized, differing by two C-T point mutations. Most individuals carry a pure "C" or a pure "T" array, but one individual showed a compound "CT" array. Such a compound array is evidence for recombination in the mtDNA control region from the flounder.

  6. DNA methylation polymorphism in a set of elite rice cultivars and its possible contribution to inter-cultivar differential gene expression.

    PubMed

    Wang, Yongming; Lin, Xiuyun; Dong, Bo; Wang, Yingdian; Liu, Bao

    2004-01-01

    RAPD (randomly amplified polymorphic DNA) and ISSR (inter-simple sequence repeat) fingerprinting on HpaII/MspI-digested genomic DNA of nine elite japonica rice cultivars implies inter-cultivar DNA methylation polymorphism. Using both DNA fragments isolated from RAPD or ISSR gels and selected low-copy sequences as probes, methylation-sensitive Southern blot analysis confirms the existence of extensive DNA methylation polymorphism in both genes and DNA repeats among the rice cultivars. The cultivar-specific methylation patterns are stably maintained, and can be used as reliable molecular markers. Transcriptional analysis of four selected sequences (RdRP, AC9, HSP90 and MMR) on leaves and roots from normal and 5-azacytidine-treated seedlings of three representative cultivars shows an association between the transcriptional activity of one of the genes, the mismatch repair (MMR) gene, and its CG methylation patterns.

  7. Biological sequence compression algorithms.

    PubMed

    Matsumoto, T; Sadakane, K; Imai, H

    2000-01-01

    Today, more and more DNA sequences are becoming available. The information about DNA sequences are stored in molecular biology databases. The size and importance of these databases will be bigger and bigger in the future, therefore this information must be stored or communicated efficiently. Furthermore, sequence compression can be used to define similarities between biological sequences. The standard compression algorithms such as gzip or compress cannot compress DNA sequences, but only expand them in size. On the other hand, CTW (Context Tree Weighting Method) can compress DNA sequences less than two bits per symbol. These algorithms do not use special structures of biological sequences. Two characteristic structures of DNA sequences are known. One is called palindromes or reverse complements and the other structure is approximate repeats. Several specific algorithms for DNA sequences that use these structures can compress them less than two bits per symbol. In this paper, we improve the CTW so that characteristic structures of DNA sequences are available. Before encoding the next symbol, the algorithm searches an approximate repeat and palindrome using hash and dynamic programming. If there is a palindrome or an approximate repeat with enough length then our algorithm represents it with length and distance. By using this preprocessing, a new program achieves a little higher compression ratio than that of existing DNA-oriented compression algorithms. We also describe new compression algorithm for protein sequences.

  8. A PNPase Dependent CRISPR System in Listeria

    PubMed Central

    Sesto, Nina; Touchon, Marie; Andrade, José Marques; Kondo, Jiro; Rocha, Eduardo P. C.; Arraiano, Cecilia Maria; Archambaud, Cristel; Westhof, Éric; Romby, Pascale; Cossart, Pascale

    2014-01-01

    The human bacterial pathogen Listeria monocytogenes is emerging as a model organism to study RNA-mediated regulation in pathogenic bacteria. A class of non-coding RNAs called CRISPRs (clustered regularly interspaced short palindromic repeats) has been described to confer bacterial resistance against invading bacteriophages and conjugative plasmids. CRISPR function relies on the activity of CRISPR associated (cas) genes that encode a large family of proteins with nuclease or helicase activities and DNA and RNA binding domains. Here, we characterized a CRISPR element (RliB) that is expressed and processed in the L. monocytogenes strain EGD-e, which is completely devoid of cas genes. Structural probing revealed that RliB has an unexpected secondary structure comprising basepair interactions between the repeats and the adjacent spacers in place of canonical hairpins formed by the palindromic repeats. Moreover, in contrast to other CRISPR-Cas systems identified in Listeria, RliB-CRISPR is ubiquitously present among Listeria genomes at the same genomic locus and is never associated with the cas genes. We showed that RliB-CRISPR is a substrate for the endogenously encoded polynucleotide phosphorylase (PNPase) enzyme. The spacers of the different Listeria RliB-CRISPRs share many sequences with temperate and virulent phages. Furthermore, we show that a cas-less RliB-CRISPR lowers the acquisition frequency of a plasmid carrying the matching protospacer, provided that trans encoded cas genes of a second CRISPR-Cas system are present in the genome. Importantly, we show that PNPase is required for RliB-CRISPR mediated DNA interference. Altogether, our data reveal a yet undescribed CRISPR system whose both processing and activity depend on PNPase, highlighting a new and unexpected function for PNPase in “CRISPRology”. PMID:24415952

  9. DNA triplet repeats mediate heterochromatin-protein-1-sensitive variegated gene silencing.

    PubMed

    Saveliev, Alexander; Everett, Christopher; Sharpe, Tammy; Webster, Zoë; Festenstein, Richard

    2003-04-24

    Gene repression is crucial to the maintenance of differentiated cell types in multicellular organisms, whereas aberrant silencing can lead to disease. The organization of DNA into chromatin and heterochromatin is implicated in gene silencing. In chromatin, DNA wraps around histones, creating nucleosomes. Further condensation of chromatin, associated with large blocks of repetitive DNA sequences, is known as heterochromatin. Position effect variegation (PEV) occurs when a gene is located abnormally close to heterochromatin, silencing the affected gene in a proportion of cells. Here we show that the relatively short triplet-repeat expansions found in myotonic dystrophy and Friedreich's ataxia confer variegation of expression on a linked transgene in mice. Silencing was correlated with a decrease in promoter accessibility and was enhanced by the classical PEV modifier heterochromatin protein 1 (HP1). Notably, triplet-repeat-associated variegation was not restricted to classical heterochromatic regions but occurred irrespective of chromosomal location. Because the phenomenon described here shares important features with PEV, the mechanisms underlying heterochromatin-mediated silencing might have a role in gene regulation at many sites throughout the mammalian genome and modulate the extent of gene silencing and hence severity in several triplet-repeat diseases.

  10. Identification, variation and transcription of pneumococcal repeat sequences

    PubMed Central

    2011-01-01

    Background Small interspersed repeats are commonly found in many bacterial chromosomes. Two families of repeats (BOX and RUP) have previously been identified in the genome of Streptococcus pneumoniae, a nasopharyngeal commensal and respiratory pathogen of humans. However, little is known about the role they play in pneumococcal genetics. Results Analysis of the genome of S. pneumoniae ATCC 700669 revealed the presence of a third repeat family, which we have named SPRITE. All three repeats are present at a reduced density in the genome of the closely related species S. mitis. However, they are almost entirely absent from all other streptococci, although a set of elements related to the pneumococcal BOX repeat was identified in the zoonotic pathogen S. suis. In conjunction with information regarding their distribution within the pneumococcal chromosome, this suggests that it is unlikely that these repeats are specialised sequences performing a particular role for the host, but rather that they constitute parasitic elements. However, comparing insertion sites between pneumococcal sequences indicates that they appear to transpose at a much lower rate than IS elements. Some large BOX elements in S. pneumoniae were found to encode open reading frames on both strands of the genome, whilst another was found to form a composite RNA structure with two T box riboswitches. In multiple cases, such BOX elements were demonstrated as being expressed using directional RNA-seq and RT-PCR. Conclusions BOX, RUP and SPRITE repeats appear to have proliferated extensively throughout the pneumococcal chromosome during the species' past, but novel insertions are currently occurring at a relatively slow rate. Through their extensive secondary structures, they seem likely to affect the expression of genes with which they are co-transcribed. Software for annotation of these repeats is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/strep_repeats/. PMID:21333003

  11. [Tissue-specific Changes in the Polymorphism of Simple Repeats in DNA of the Offspring of Different Sex Born from Irradiated Male or Female Mice].

    PubMed

    Lomaeva, M G; Fomenko, L A; Vasil'eva, G V; Bezlepkin, V G

    2016-01-01

    Evidence is presented indicating the differences in the polymorphism of microsatellite (MCS) repeats in DNA of somatic tissues in the offspring of BALB/c mice of different sex born from preconceptionally irradiated males or females. Brother-sister groups of the offspring born by non-irradiated parental pairs were compared with the offspring obtained after the irradiation of one parent in the same pairs. The number of MCS repeats in DNA of somatic tissues of the offspring from irradiated males or females was compared by a polymerase chain reaction using an arbitrary primer. It was found that changes in the polymorphism of the number of MCS repeats in the offspring from the males irradiated at a dose of 2 Gy was insignificant as compared with the offspring from control animals. In the offspring born by the females irradiated at a dose of 2 Gy (which does not impair the reproductive capacity), a statistically significant increase in the polymorphism was observed. Changes in the polymorphism were different in the offspring of different sex. A higher level of polymorphism was revealed in the female offspring born from the females of the F0 generation after their irradiation at a dose of 2 Gy. The increase in the polymorphism of the number of MCS repeats in DNA was more pronounced in postmitotic tissues compared with proliferating tissues.

  12. DNA transposon-based gene vehicles - scenes from an evolutionary drive

    PubMed Central

    2013-01-01

    DNA transposons are primitive genetic elements which have colonized living organisms from plants to bacteria and mammals. Through evolution such parasitic elements have shaped their host genomes by replicating and relocating between chromosomal loci in processes catalyzed by the transposase proteins encoded by the elements themselves. DNA transposable elements are constantly adapting to life in the genome, and self-suppressive regulation as well as defensive host mechanisms may assist in buffering ‘cut-and-paste’ DNA mobilization until accumulating mutations will eventually restrict events of transposition. With the reconstructed Sleeping Beauty DNA transposon as a powerful engine, a growing list of transposable elements with activity in human cells have moved into biomedical experimentation and preclinical therapy as versatile vehicles for delivery and genomic insertion of transgenes. In this review, we aim to link the mechanisms that drive transposon evolution with the realities and potential challenges we are facing when adapting DNA transposons for gene transfer. We argue that DNA transposon-derived vectors may carry inherent, and potentially limiting, traits of their mother elements. By understanding in detail the evolutionary journey of transposons, from host colonization to element multiplication and inactivation, we may better exploit the potential of distinct transposable elements. Hence, parallel efforts to investigate and develop distinct, but potent, transposon-based vector systems will benefit the broad applications of gene transfer. Insight and clever optimization have shaped new DNA transposon vectors, which recently debuted in the first DNA transposon-based clinical trial. Learning from an evolutionary drive may help us create gene vehicles that are safer, more efficient, and less prone for suppression and inactivation. PMID:24320156

  13. Novel Role of 3’UTR-Embedded Alu Elements as Facilitators of Processed Pseudogene Genesis and Host Gene Capture by Viral Genomes

    PubMed Central

    Engel, Pablo; Angulo, Ana

    2016-01-01

    Since the discovery of the high abundance of Alu elements in the human genome, the interest for the functional significance of these retrotransposons has been increasing. Primate Alu and rodent Alu-like elements are retrotransposed by a mechanism driven by the LINE1 (L1) encoded proteins, the same machinery that generates the L1 repeats, the processed pseudogenes (PPs), and other retroelements. Apart from free Alu RNAs, Alus are also transcribed and retrotranscribed as part of cellular gene transcripts, generally embedded inside 3’ untranslated regions (UTRs). Despite different proposed hypotheses, the functional implication of the presence of Alus inside 3’UTRs remains elusive. In this study we hypothesized that Alu elements in 3’UTRs could be involved in the genesis of PPs. By analyzing human genome data we discovered that the existence of 3’UTR-embedded Alu elements is overrepresented in genes source of PPs. In contrast, the presence of other retrotransposable elements in 3’UTRs does not show this PP linked overrepresentation. This research was extended to mouse and rat genomes and the results accordingly reveal overrepresentation of 3’UTR-embedded B1 (Alu-like) elements in PP parent genes. Interestingly, we also demonstrated that the overrepresentation of 3’UTR-embedded Alus is particularly significant in PP parent genes with low germline gene expression level. Finally, we provide data that support the hypothesis that the L1 machinery is also the system that herpesviruses, and possibly other large DNA viruses, use to capture host genes expressed in germline or somatic cells. Altogether our results suggest a novel role for Alu or Alu-like elements inside 3’UTRs as facilitators of the genesis of PPs, particularly in lowly expressed genes. Moreover, we propose that this L1-driven mechanism, aided by the presence of 3’UTR-embedded Alus, may also be exploited by DNA viruses to incorporate host genes to their viral genomes. PMID:28033411

  14. The repeating nucleotide sequence in the repetitive mitochondrial DNA from a "low-density" petite mutant of yeast.

    PubMed Central

    Van Kreijl, C F; Bos, J L

    1977-01-01

    The repeating nucleotide sequence of 68 base pairs in the mtDNA from an ethidium-induced cytoplasmic petite mutant of yeast has been determined. For sequence analysis specifically primed and terminated RNA copies, obtained by in vitro transcription of the separated strands, were use. The sequence consists of 66 consecutive AT base pairs flanked by two GC pairs and comprises nearly all of the mutant mitochondrial genome. The sequence, moreover, also represents the first part of wild-type mtDNA sequence so far. Images PMID:198740

  15. Cis-acting regulatory sequences promote high-frequency gene conversion between repeated sequences in mammalian cells.

    PubMed

    Raynard, Steven J; Baker, Mark D

    2004-01-01

    In mammalian cells, little is known about the nature of recombination-prone regions of the genome. Previously, we reported that the immunoglobulin heavy chain (IgH) mu locus behaved as a hotspot for mitotic, intrachromosomal gene conversion (GC) between repeated mu constant (Cmu) regions in mouse hybridoma cells. To investigate whether elements within the mu gene regulatory region were required for hotspot activity, gene targeting was used to delete a 9.1 kb segment encompassing the mu gene promoter (Pmu), enhancer (Emu) and switch region (Smu) from the locus. In these cell lines, GC between the Cmu repeats was significantly reduced, indicating that this 'recombination-enhancing sequence' (RES) is necessary for GC hotspot activity at the IgH locus. Importantly, the RES fragment stimulated GC when appended to the same Cmu repeats integrated at ectopic genomic sites. We also show that deletion of Emu and flanking matrix attachment regions (MARs) from the RES abolishes GC hotspot activity at the IgH locus. However, no stimulation of ectopic GC was observed with the Emu/MARs fragment alone. Finally, we provide evidence that no correlation exists between the level of transcription and GC promoted by the RES. We suggest a model whereby Emu/MARS enhances mitotic GC at the endogenous IgH mu locus by effecting chromatin modifications in adjacent DNA.

  16. DNA CTG triplet repeats involved in dynamic mutations of neurologically related gene sequences form stable duplexes

    NASA Technical Reports Server (NTRS)

    Smith, G. K.; Jie, J.; Fox, G. E.; Gao, X.

    1995-01-01

    DNA triplet repeats, 5'-d(CTG)n and 5'-d(CAG)n, are present in genes which have been implicated in several neurodegenerative disorders. To investigate possible stable structures formed by these repeating sequences, we have examined d(CTG)n, d(CAG)n and d(CTG).d(CAG)n (n = 2 and 3) using NMR and UV optical spectroscopy. These studies reveal that single stranded (CTG)n (n > 2) forms stable, antiparallel helical duplexes, while the single stranded (CAG)n requires at least three repeating units to form a duplex. NMR and UV melting experiments show that the Tm increases in the order of [(CAG)3]2 < [(CTG)3]2 << (CAG)3.(CTG)3. The (CTG)3 duplex is stable and exhibits similar NMR spectra in solutions containing 0.1-4 M NaCl and at a pH range from 4.6 to 8.8. The (CTG)3 duplex, which contains multiple-T.T mismatches, displays many NMR spectral characteristics similar to those of B-form DNA. However, unique NOE and 1H-31P coupling patterns associated with the repetitive T.T mismatches in the CTG repeats are discerned. These results, in conjunction with recent in vitro studies suggest that longer CTG repeats may form hairpin structures, which can potentially cause interruption in replication, leading to dynamic expansion or deletion of triplet repeats.

  17. Construction of a small Mus musculus repetitive DNA library: identification of a new satellite sequence in Mus musculus.

    PubMed Central

    Pietras, D F; Bennett, K L; Siracusa, L D; Woodworth-Gutai, M; Chapman, V M; Gross, K W; Kane-Haas, C; Hastie, N D

    1983-01-01

    We report the construction of a small library of recombinant plasmids containing Mus musculus repetitive DNA inserts. The repetitive cloned fraction was derived from denatured genomic DNA by reassociation to a Cot value at which repetitive, but not unique, sequences have reannealed followed by exhaustive S1 nuclease treatment to degrade single stranded DNA. Initial characterizations of this library by colony filter hybridizations have led to the identification of a previously undetected M. musculus minor satellite as well as to clones containing M. musculus major satellite sequences. This new satellite is repeated 10-20 times less than the major satellite in the M. musculus genome. It has a repeat length of 130 nucleotides compared with the M. musculus major satellite with a repeat length of 234 nucleotides. Sequence analysis of the minor satellite has shown that it has a 29 base pair region with extensive homology to one of the major satellite repeating subunits. We also show by in situ hybridization that this minor satellite sequence is located at the centromeres and possibly the arms of at least half the M musculus chromosomes. Sequences related to the minor satellite have been found in the DNA of a related Mus species, Mus spretus, and may represent the major satellite of that species. Images PMID:6314268

  18. Universal strategies for the DNA-encoding of libraries of small molecules using the chemical ligation of oligonucleotide tags

    PubMed Central

    Litovchick, Alexander; Clark, Matthew A; Keefe, Anthony D

    2014-01-01

    The affinity-mediated selection of large libraries of DNA-encoded small molecules is increasingly being used to initiate drug discovery programs. We present universal methods for the encoding of such libraries using the chemical ligation of oligonucleotides. These methods may be used to record the chemical history of individual library members during combinatorial synthesis processes. We demonstrate three different chemical ligation methods as examples of information recording processes (writing) for such libraries and two different cDNA-generation methods as examples of information retrieval processes (reading) from such libraries. The example writing methods include uncatalyzed and Cu(I)-catalyzed alkyne-azide cycloadditions and a novel photochemical thymidine-psoralen cycloaddition. The first reading method “relay primer-dependent bypass” utilizes a relay primer that hybridizes across a chemical ligation junction embedded in a fixed-sequence and is extended at its 3′-terminus prior to ligation to adjacent oligonucleotides. The second reading method “repeat-dependent bypass” utilizes chemical ligation junctions that are flanked by repeated sequences. The upstream repeat is copied prior to a rearrangement event during which the 3′-terminus of the cDNA hybridizes to the downstream repeat and polymerization continues. In principle these reading methods may be used with any ligation chemistry and offer universal strategies for the encoding (writing) and interpretation (reading) of DNA-encoded chemical libraries. PMID:25483841

  19. 2009 Epigenetics Gordon Research Conference (August 9 - 14, 2009)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jeanie Lee

    Epigenetics refers to the study of heritable changes in genome function that occur without a change in primary DNA sequence. The 2009 Gordon Conference in Epigenetics will feature discussion of various epigenetic phenomena, emerging understanding of their underlying mechanisms, and the growing appreciation that human, animal, and plant health all depend on proper epigenetic control. Special emphasis will be placed on genome-environment interactions particularly as they relate to human disease. Towards improving knowledge of molecular mechanisms, the conference will feature international leaders studying the roles of higher order chromatin structure, noncoding RNA, repeat elements, nuclear organization, and morphogenic evolution. Traditionalmore » and new model organisms are selected from plants, fungi, and metazoans.« less

  20. Homology-dependent repair is involved in 45S rDNA loss in plant CAF-1 mutants

    PubMed Central

    Muchová, Veronika; Amiard, Simon; Mozgová, Iva; Dvořáčková, Martina; Gallego, Maria E; White, Charles; Fajkus, Jiří

    2015-01-01

    Arabidopsis thaliana mutants in FAS1 and FAS2 subunits of chromatin assembly factor 1 (CAF1) show progressive loss of 45S rDNA copies and telomeres. We hypothesized that homology-dependent DNA damage repair (HDR) may contribute to the loss of these repeats in fas mutants. To test this, we generated double mutants by crossing fas mutants with knock-out mutants in RAD51B, one of the Rad51 paralogs of A. thaliana. Our results show that the absence of RAD51B decreases the rate of rDNA loss, confirming the implication of RAD51B-dependent recombination in rDNA loss in the CAF1 mutants. Interestingly, this effect is not observed for telomeric repeat loss, which thus differs from that acting in rDNA loss. Involvement of DNA damage repair in rDNA dynamics in fas mutants is further supported by accumulation of double-stranded breaks (measured as γ-H2AX foci) in 45S rDNA. Occurrence of the foci is not specific for S-phase, and is ATM-independent. While the foci in fas mutants occur both in the transcribed (intranucleolar) and non-transcribed (nucleoplasmic) fraction of rDNA, double fas rad51b mutants show a specific increase in the number of the intranucleolar foci. These results suggest that the repair of double-stranded breaks present in the transcribed rDNA region is RAD51B dependent and that this contributes to rDNA repeat loss in fas mutants, presumably via the single-stranded annealing recombination pathway. Our results also highlight the importance of proper chromatin assembly in the maintenance of genome stability. PMID:25359579

Top