The organization of repeating units in mitochondrial DNA from yeast petite mutants.
Bos, J L; Heyting, C; Van der Horst, G; Borst, P
1980-04-01
We have reinvestigated the linkage orientation of repeating units in mtDNAs of yeast ρ(-) petite mutants containing an inverted duplication. All five petite mtDNAs studied contain a continuous segment of wild-type mtDNA, part of which is duplicated and present in inverted form in the repeat. We show by restriction enzyme analysis that the non-duplicated segments between the inverted duplications are present in random orientation in all five petite mtDNAs. There is no segregation of sub-types with unique orientation. We attribute this to the high rate of intramolecular recombination between the inverted duplications. The results provide additional evidence for the high rate of recombination of yeast mtDNA even in haploid ρ(-) petite cells.We conclude that only two types of stable sequence organization exist in petite mtDNA: petites without an inverted duplication have repeats linked in straight head-to-tail arrangement (abcabc); petites with an inverted duplication have repeats in which the non-duplicated segments are present in random orientation.
Adeno-associated virus inverted terminal repeats stimulate gene editing.
Hirsch, M L
2015-02-01
Advancements in genome editing have relied on technologies to specifically damage DNA which, in turn, stimulates DNA repair including homologous recombination (HR). As off-target concerns complicate the therapeutic translation of site-specific DNA endonucleases, an alternative strategy to stimulate gene editing based on fragile DNA was investigated. To do this, an episomal gene-editing reporter was generated by a disruptive insertion of the adeno-associated virus (AAV) inverted terminal repeat (ITR) into the egfp gene. Compared with a non-structured DNA control sequence, the ITR induced DNA damage as evidenced by increased gamma-H2AX and Mre11 foci formation. As local DNA damage stimulates HR, ITR-mediated gene editing was investigated using DNA oligonucleotides as repair substrates. The AAV ITR stimulated gene editing >1000-fold in a replication-independent manner and was not biased by the polarity of the repair oligonucleotide. Analysis of additional human DNA sequences demonstrated stimulation of gene editing to varying degrees. In particular, inverted yet not direct, Alu repeats induced gene editing, suggesting a role for DNA structure in the repair event. Collectively, the results demonstrate that inverted DNA repeats stimulate gene editing via double-strand break repair in an episomal context and allude to efficient gene editing of the human chromosome using fragile DNA sequences.
Scalvenzi, Thibault; Pollet, Nicolas
2014-12-01
The genome size in eukaryotes does not correlate well with the number of genes they contain. We can observe this so-called C-value paradox in amphibian species. By analyzing an amphibian genome we asked how repetitive DNA can impact genome size and architecture. We describe here our discovery of a Tc1/mariner miniature inverted-repeat transposon family present in Xenopus frogs. These transposons named miDNA4 are unique since they contain a satellite DNA motif. We found that miDNA4 measured 331 bp, contained 25 bp long inverted terminal repeat sequences and a sequence motif of 119 bp present as a unique copy or as an array of 2-47 copies. We characterized the structure, dynamics, impact and evolution of the miDNA4 family and its satellite DNA in Xenopus frog genomes. This led us to propose a model for the evolution of these two repeated sequences and how they can synergize to increase genome size. Copyright © 2014 Elsevier Inc. All rights reserved.
Fragile DNA Motifs Trigger Mutagenesis at Distant Chromosomal Loci in Saccharomyces cerevisiae
Saini, Natalie; Zhang, Yu; Nishida, Yuri; Sheng, Ziwei; Choudhury, Shilpa; Mieczkowski, Piotr; Lobachev, Kirill S.
2013-01-01
DNA sequences capable of adopting non-canonical secondary structures have been associated with gross-chromosomal rearrangements in humans and model organisms. Previously, we have shown that long inverted repeats that form hairpin and cruciform structures and triplex-forming GAA/TTC repeats induce the formation of double-strand breaks which trigger genome instability in yeast. In this study, we demonstrate that breakage at both inverted repeats and GAA/TTC repeats is augmented by defects in DNA replication. Increased fragility is associated with increased mutation levels in the reporter genes located as far as 8 kb from both sides of the repeats. The increase in mutations was dependent on the presence of inverted or GAA/TTC repeats and activity of the translesion polymerase Polζ. Mutagenesis induced by inverted repeats also required Sae2 which opens hairpin-capped breaks and initiates end resection. The amount of breakage at the repeats is an important determinant of mutations as a perfect palindromic sequence with inherently increased fragility was also found to elevate mutation rates even in replication-proficient strains. We hypothesize that the underlying mechanism for mutagenesis induced by fragile motifs involves the formation of long single-stranded regions in the broken chromosome, invasion of the undamaged sister chromatid for repair, and faulty DNA synthesis employing Polζ. These data demonstrate that repeat-mediated breaks pose a dual threat to eukaryotic genome integrity by inducing chromosomal aberrations as well as mutations in flanking genes. PMID:23785298
DNA-directed mutations. Leading and lagging strand specificity
NASA Technical Reports Server (NTRS)
Sinden, R. R.; Hashem, V. I.; Rosche, W. A.
1999-01-01
The fidelity of replication has evolved to reproduce B-form DNA accurately, while allowing a low frequency of mutation. The fidelity of replication can be compromised, however, by defined order sequence DNA (dosDNA) that can adopt unusual or non B-DNA conformations. These alternative DNA conformations, including hairpins, cruciforms, triplex DNAs, and slipped-strand structures, may affect enzyme-template interactions that potentially lead to mutations. To analyze the effect of dosDNA elements on spontaneous mutagenesis, various mutational inserts containing inverted repeats or direct repeats were cloned in a plasmid containing a unidirectional origin of replication and a selectable marker for the mutation. This system allows for analysis of mutational events that are specific for the leading or lagging strands during DNA replication in Escherichia coli. Deletions between direct repeats, involving misalignment stabilized by DNA secondary structure, occurred preferentially on the lagging strand. Intermolecular strand switch events, correcting quasipalindromes to perfect inverted repeats, occurred preferentially during replication of the leading strand.
Genetic and DNA sequence analysis of the kanamycin resistance transposon Tn903.
Grindley, N D; Joyce, C M
1980-01-01
The kanamycin resistance transposon Tn903 consists of a unique region of about 1000 base pairs bounded by a pair of 1050-base-pair inverted repeat sequences. Each repeat contains two Pvu II endonuclease cleavage sites separated by 520 base pairs. We have constructed derivatives of Tn903 in which this 520-base-pair fragment is deleted from one or both repeats. Those derivatives that lack both 520-base-pair fragments cannot transpose, whereas those that lack just one remain transposition proficient. One such transposable derivative, Tn903 delta I, has been selected for further study. We have determined the sequence of the intact inverted repeat. The 18 base pairs at each end are identical and inverted relative to one another, a structure characteristic of insertion sequences. Additional experiments indicate that a single inverted repeat from Tn903 can, in fact, transpose; we propose that this element be called IS903. To correlate the DNA sequence with genetic activities, we have created mutations by inserting a 10-base-pair DNA fragment at several sites within the intact repeat of Tn903 delta 1, and we have examined the effect of such insertions on transposability. The results suggest that IS903 encodes a 307-amino-acid polypeptide (a "transposase") that is absolutely required for transposition of IS903 or Tn903. Images PMID:6261245
USDA-ARS?s Scientific Manuscript database
Miniature inverted-repeat transposable elements (MITEs) are non-autonomous transposons (devoid a transposase gene, tps) involving insertion/deletion of genomic DNA in bacterial genomes influencing gene functions. No transposon has yet been reported in “Candidatus Liberibacter asiaticus”, an alpha-pr...
Zimmerman, Carl-Ulrich R; Rosengarten, Renate; Spergser, Joachim
2011-01-01
Phase variation of the major ureaplasma surface membrane protein, the multiple-banded antigen (MBA), with its counterpart, the UU376 protein, was recently discussed as a result of DNA inversion occurring at specific inverted repeats. Two similar inverted repeats to the ones within the mba locus were found in the genome of Ureaplasma parvum serovar 3; one within the MBA N-terminal paralogue UU172 and another in the adjacent intergenic spacer region. In this report, we demonstrate on both genomic and protein level that DNA inversion at these inverted repeats leads to alternating expression between UU172 and the neighbouring conserved hypothetical ORF UU171. Sequence analysis of this phase-variable ‘UU172 element’ from both U. parvum and U. urealyticum strains revealed that it is highly conserved among both species and that it also includes the orthologue of UU144. A third inverted repeat region in UU144 is proposed to serve as an additional potential inversion site from which chimeric genes can evolve. Our results indicate that site-specific recombination events in the genome of U. parvum serovar 3 are dynamic and frequent, leading to a broad spectrum of antigenic variation by which the organism may evade host immune responses. PMID:21255110
Chuzhanova, Nadia; Abeysinghe, Shaun S; Krawczak, Michael; Cooper, David N
2003-09-01
Translocations and gross deletions are responsible for a significant proportion of both cancer and inherited disease. Although such gene rearrangements are nonuniformly distributed in the human genome, the underlying mutational mechanisms remain unclear. We have studied the potential involvement of various types of repetitive sequence elements in the formation of secondary structure intermediates between the single-stranded DNA ends that recombine during rearrangements. Complexity analysis was used to assess the potential of these ends to form secondary structures, the maximum decrease in complexity consequent to a gross rearrangement being used as an indicator of the type of repeat and the specific DNA ends involved. A total of 175 pairs of deletion/translocation breakpoint junction sequences available from the Gross Rearrangement Breakpoint Database [GRaBD; www.uwcm.ac.uk/uwcm/mg/grabd/grabd.html] were analyzed. Potential secondary structure was noted between the 5' flanking sequence of the first breakpoint and the 3' flanking sequence of the second breakpoint in 49% of rearrangements and between the 5' flanking sequence of the second breakpoint and the 3' flanking sequence of the first breakpoint in 36% of rearrangements. Inverted repeats, inversions of inverted repeats, and symmetric elements were found in association with gross rearrangements at approximately the same frequency. However, inverted repeats and inversions of inverted repeats accounted for the vast majority (83%) of deletions plus small insertions, symmetric elements for one-half of all antigen receptor-mediated translocations, while direct repeats appear only to be involved in mediating simple deletions. These findings extend our understanding of illegitimate recombination by highlighting the importance of secondary structure formation between single-stranded DNA ends at breakpoint junctions. Copyright 2003 Wiley-Liss, Inc.
Ye, Congting; Ji, Guoli; Li, Lei; Liang, Chun
2014-01-01
Inverted repeats are present in abundance in both prokaryotic and eukaryotic genomes and can form DNA secondary structures--hairpins and cruciforms that are involved in many important biological processes. Bioinformatics tools for efficient and accurate detection of inverted repeats are desirable, because existing tools are often less accurate and time consuming, sometimes incapable of dealing with genome-scale input data. Here, we present a MATLAB-based program called detectIR for the perfect and imperfect inverted repeat detection that utilizes complex numbers and vector calculation and allows genome-scale data inputs. A novel algorithm is adopted in detectIR to convert the conventional sequence string comparison in inverted repeat detection into vector calculation of complex numbers, allowing non-complementary pairs (mismatches) in the pairing stem and a non-palindromic spacer (loop or gaps) in the middle of inverted repeats. Compared with existing popular tools, our program performs with significantly higher accuracy and efficiency. Using genome sequence data from HIV-1, Arabidopsis thaliana, Homo sapiens and Zea mays for comparison, detectIR can find lots of inverted repeats missed by existing tools whose outputs often contain many invalid cases. detectIR is open source and its source code is freely available at: https://sourceforge.net/projects/detectir.
Assessing Diversity of DNA Structure-Related Sequence Features in Prokaryotic Genomes
Huang, Yongjie; Mrázek, Jan
2014-01-01
Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877
Origin-Dependent Inverted-Repeat Amplification: Tests of a Model for Inverted DNA Amplification
Brewer, Bonita J.; Payen, Celia; Di Rienzi, Sara C.; Higgins, Megan M.; Ong, Giang; Dunham, Maitreya J.; Raghuraman, M. K.
2015-01-01
DNA replication errors are a major driver of evolution—from single nucleotide polymorphisms to large-scale copy number variations (CNVs). Here we test a specific replication-based model to explain the generation of interstitial, inverted triplications. While no genetic information is lost, the novel inversion junctions and increased copy number of the included sequences create the potential for adaptive phenotypes. The model—Origin-Dependent Inverted-Repeat Amplification (ODIRA)—proposes that a replication error at pre-existing short, interrupted, inverted repeats in genomic sequences generates an extrachromosomal, inverted dimeric, autonomously replicating intermediate; subsequent genomic integration of the dimer yields this class of CNV without loss of distal chromosomal sequences. We used a combination of in vitro and in vivo approaches to test the feasibility of the proposed replication error and its downstream consequences on chromosome structure in the yeast Saccharomyces cerevisiae. We show that the proposed replication error—the ligation of leading and lagging nascent strands to create “closed” forks—can occur in vitro at short, interrupted inverted repeats. The removal of molecules with two closed forks results in a hairpin-capped linear duplex that we show replicates in vivo to create an inverted, dimeric plasmid that subsequently integrates into the genome by homologous recombination, creating an inverted triplication. While other models have been proposed to explain inverted triplications and their derivatives, our model can also explain the generation of human, de novo, inverted amplicons that have a 2:1 mixture of sequences from both homologues of a single parent—a feature readily explained by a plasmid intermediate that arises from one homologue and integrates into the other homologue prior to meiosis. Our tests of key features of ODIRA lend support to this mechanism and suggest further avenues of enquiry to unravel the origins of interstitial, inverted CNVs pivotal in human health and evolution. PMID:26700858
Origin-Dependent Inverted-Repeat Amplification: Tests of a Model for Inverted DNA Amplification.
Brewer, Bonita J; Payen, Celia; Di Rienzi, Sara C; Higgins, Megan M; Ong, Giang; Dunham, Maitreya J; Raghuraman, M K
2015-12-01
DNA replication errors are a major driver of evolution--from single nucleotide polymorphisms to large-scale copy number variations (CNVs). Here we test a specific replication-based model to explain the generation of interstitial, inverted triplications. While no genetic information is lost, the novel inversion junctions and increased copy number of the included sequences create the potential for adaptive phenotypes. The model--Origin-Dependent Inverted-Repeat Amplification (ODIRA)-proposes that a replication error at pre-existing short, interrupted, inverted repeats in genomic sequences generates an extrachromosomal, inverted dimeric, autonomously replicating intermediate; subsequent genomic integration of the dimer yields this class of CNV without loss of distal chromosomal sequences. We used a combination of in vitro and in vivo approaches to test the feasibility of the proposed replication error and its downstream consequences on chromosome structure in the yeast Saccharomyces cerevisiae. We show that the proposed replication error-the ligation of leading and lagging nascent strands to create "closed" forks-can occur in vitro at short, interrupted inverted repeats. The removal of molecules with two closed forks results in a hairpin-capped linear duplex that we show replicates in vivo to create an inverted, dimeric plasmid that subsequently integrates into the genome by homologous recombination, creating an inverted triplication. While other models have been proposed to explain inverted triplications and their derivatives, our model can also explain the generation of human, de novo, inverted amplicons that have a 2:1 mixture of sequences from both homologues of a single parent--a feature readily explained by a plasmid intermediate that arises from one homologue and integrates into the other homologue prior to meiosis. Our tests of key features of ODIRA lend support to this mechanism and suggest further avenues of enquiry to unravel the origins of interstitial, inverted CNVs pivotal in human health and evolution.
Trinh, T. Q.; Sinden, R. R.
1993-01-01
We describe a system to measure the frequency of both deletions and duplications between direct repeats. Short 17- and 18-bp palindromic and nonpalindromic DNA sequences were cloned into the EcoRI site within the chloramphenicol acetyltransferase gene of plasmids pBR325 and pJT7. This creates an insert between direct repeated EcoRI sites and results in a chloramphenicol-sensitive phenotype. Selection for chloramphenicol resistance was utilized to select chloramphenicol resistant revertants that included those with precise deletion of the insert from plasmid pBR325 and duplication of the insert in plasmid pJT7. The frequency of deletion or duplication varied more than 500-fold depending on the sequence of the short sequence inserted into the EcoRI site. For the nonpalindromic inserts, multiple internal direct repeats and the length of the direct repeats appear to influence the frequency of deletion. Certain palindromic DNA sequences with the potential to form DNA hairpin structures that might stabilize the misalignment of direct repeats had a high frequency of deletion. Other DNA sequences with the potential to form structures that might destabilize misalignment of direct repeats had a very low frequency of deletion. Duplication mutations occurred at the highest frequency when the DNA between the direct repeats contained no direct or inverted repeats. The presence of inverted repeats dramatically reduced the frequency of duplications. The results support the slippage-misalignment model, suggesting that misalignment occurring during DNA replication leads to deletion and duplication mutations. The results also support the idea that the formation of DNA secondary structures during DNA replication can facilitate and direct specific mutagenic events. PMID:8325478
Singh, Gurjeet; Klar, Amar J S
2002-01-01
The mat2,3 region of the fission yeast Schizosaccharomyces pombe exhibits a phenomenon of transcriptional silencing. This region is flanked by two identical DNA sequence elements, 2.1 kb in length, present in inverted orientation: IRL on the left and IRR on the right of the silent region. The repeats do not encode any ORF. The inverted repeat DNA region is also present in a newly identified related species, which we named S. kambucha. Interestingly, the left and right repeats share perfect identity within a species, but show approximately 2% bases interspecies variation. Deletion of IRL results in variegated expression of markers inserted in the silent region, while deletion of the IRR causes their derepression. When deletions of these repeats were genetically combined with mutations in different trans-acting genes previously shown to cause a partial defect in silencing, only mutations in clr1 and clr3 showed additive defects in silencing with the deletion of IRL. The rate of mat1 switching is also affected by deletion of repeats. The IRL or IRR deletion did not cause significant derepression of the mat2 or mat3 loci. These results implicate repeats for maintaining full repression of the mat2,3 region, for efficient mat1 switching, and further support the notion that multiple pathways cooperate to silence the mat2,3 domain. PMID:12399374
Gerhold, Joachim M; Aun, Anu; Sedman, Tiina; Jõers, Priit; Sedman, Juhan
2010-09-24
Molecular recombination and transcription are proposed mechanisms to initiate mitochondrial DNA (mtDNA) replication in yeast. We conducted a comprehensive analysis of mtDNA from the yeast Candida albicans. Two-dimensional agarose gel electrophoresis of mtDNA intermediates reveals no bubble structures diagnostic of specific replication origins, but rather supports recombination-driven replication initiation of mtDNA in yeast. Specific species of Y structures together with DNA copy number analyses of a C. albicans mutant strain provide evidence that a region in a mainly noncoding inverted repeat is predominantly involved in replication initiation via homologous recombination. Our further findings show that the C. albicans mtDNA forms a complex branched network that does not contain detectable amounts of circular molecules. We provide topological evidence for recombination-driven mtDNA replication initiation and introduce C. albicans as a suitable model organism to study wild-type mtDNA maintenance in yeast. Copyright © 2010 Elsevier Inc. All rights reserved.
Formation of Linear Amplicons with Inverted Duplications in Leishmania Requires the MRE11 Nuclease
Laffitte, Marie-Claude N.; Genois, Marie-Michelle; Mukherjee, Angana; Légaré, Danielle; Masson, Jean-Yves; Ouellette, Marc
2014-01-01
Extrachromosomal DNA amplification is frequent in the protozoan parasite Leishmania selected for drug resistance. The extrachromosomal amplified DNA is either circular or linear, and is formed at the level of direct or inverted homologous repeated sequences that abound in the Leishmania genome. The RAD51 recombinase plays an important role in circular amplicons formation, but the mechanism by which linear amplicons are formed is unknown. We hypothesized that the Leishmania infantum DNA repair protein MRE11 is required for linear amplicons following rearrangements at the level of inverted repeats. The purified LiMRE11 protein showed both DNA binding and exonuclease activities. Inactivation of the LiMRE11 gene led to parasites with enhanced sensitivity to DNA damaging agents. The MRE11−/− parasites had a reduced capacity to form linear amplicons after drug selection, and the reintroduction of an MRE11 allele led to parasites regaining their capacity to generate linear amplicons, but only when MRE11 had an active nuclease activity. These results highlight a novel MRE11-dependent pathway used by Leishmania to amplify portions of its genome to respond to a changing environment. PMID:25474106
Sequence of retrovirus provirus resembles that of bacterial transposable elements
NASA Astrophysics Data System (ADS)
Shimotohno, Kunitada; Mizutani, Satoshi; Temin, Howard M.
1980-06-01
The nucleotide sequences of the terminal regions of an infectious integrated retrovirus cloned in the modified λ phage cloning vector Charon 4A have been elucidated. There is a 569-base pair direct repeat at both ends of the viral DNA. The cell-virus junctions at each end consist of a 5-base pair direct repeat of cell DNA next to a 3-base pair inverted repeat of viral DNA. This structure resembles that of a transposable element and is consistent with the protovirus hypothesis that retroviruses evolved from the cell genome.
Li, Jia; Gao, Lei; Chen, Shanshan; Tao, Ke; Su, Yingjuan; Wang, Ting
2016-02-11
Sciadopitys verticillata is an evergreen conifer and an economically valuable tree used in construction, which is the only member of the family Sciadopityaceae. Acquisition of the S. verticillata chloroplast (cp) genome will be useful for understanding the evolutionary mechanism of conifers and phylogenetic relationships among gymnosperm. In this study, we have first reported the complete chloroplast genome of S. verticillata. The total genome is 138,284 bp in length, consisting of 118 unique genes. The S. verticillata cp genome has lost one copy of the canonical inverted repeats and shown distinctive genomic structure comparing with other cupressophytes. Fifty-three simple sequence repeat loci and 18 forward tandem repeats were identified in the S. verticillata cp genome. According to the rearrangement of cupressophyte cp genome, we proposed one mechanism for the formation of inverted repeat: tandem repeat occured first, then rearrangement divided the tandem repeat into inverted repeats located at different regions. Phylogenetic estimates inferred from 59-gene sequences and cpDNA organizations have both shown that S. verticillata was sister to the clade consisting of Cupressaceae, Taxaceae, and Cephalotaxaceae. Moreover, accD gene was found to be lost in the S. verticillata cp genome, and a nucleus copy was identified from two transcriptome data.
The rolling-circle melting-pot model for porcine circovirus DNA replication
USDA-ARS?s Scientific Manuscript database
A stem-loop structure, formed by a pair of inverted repeats during DNA replication, is a conserved feature at the origin of DNA replication (Ori) among plant and animal viruses, bacteriophages and plasmids that replicate their genomes via the rolling-circle replication (RCR) mechanism. Porcine circo...
Flexible DNA binding of the BTB/POZ-domain protein FBI-1.
Pessler, Frank; Hernandez, Nouria
2003-08-01
POZ-domain transcription factors are characterized by the presence of a protein-protein interaction domain called the POZ or BTB domain at their N terminus and zinc fingers at their C terminus. Despite the large number of POZ-domain transcription factors that have been identified to date and the significant insights that have been gained into their cellular functions, relatively little is known about their DNA binding properties. FBI-1 is a BTB/POZ-domain protein that has been shown to modulate HIV-1 Tat trans-activation and to repress transcription of some cellular genes. We have used various viral and cellular FBI-1 binding sites to characterize the interaction of a POZ-domain protein with DNA in detail. We find that FBI-1 binds to inverted sequence repeats downstream of the HIV-1 transcription start site. Remarkably, it binds efficiently to probes carrying these repeats in various orientations and spacings with no particular rotational alignment, indicating that its interaction with DNA is highly flexible. Indeed, FBI-1 binding sites in the adenovirus 2 major late promoter, the c-fos gene, and the c-myc P1 and P2 promoters reveal variously spaced direct, inverted, and everted sequence repeats with the consensus sequence G(A/G)GGG(T/C)(C/T)(T/C)(C/T) for each repeat.
Fu, Changlin; Donovan, William P; Shikapwashya-Hasser, Olga; Ye, Xudong; Cole, Robert H
2014-01-01
Molecular cloning is utilized in nearly every facet of biological and medical research. We have developed a method, termed Hot Fusion, to efficiently clone one or multiple DNA fragments into plasmid vectors without the use of ligase. The method is directional, produces seamless junctions and is not dependent on the availability of restriction sites for inserts. Fragments are assembled based on shared homology regions of 17-30 bp at the junctions, which greatly simplifies the construct design. Hot Fusion is carried out in a one-step, single tube reaction at 50 °C for one hour followed by cooling to room temperature. In addition to its utility for multi-fragment assembly Hot Fusion provides a highly efficient method for cloning DNA fragments containing inverted repeats for applications such as RNAi. The overall cloning efficiency is in the order of 90-95%.
Fu, Changlin; Donovan, William P.; Shikapwashya-Hasser, Olga; Ye, Xudong; Cole, Robert H.
2014-01-01
Molecular cloning is utilized in nearly every facet of biological and medical research. We have developed a method, termed Hot Fusion, to efficiently clone one or multiple DNA fragments into plasmid vectors without the use of ligase. The method is directional, produces seamless junctions and is not dependent on the availability of restriction sites for inserts. Fragments are assembled based on shared homology regions of 17–30 bp at the junctions, which greatly simplifies the construct design. Hot Fusion is carried out in a one-step, single tube reaction at 50°C for one hour followed by cooling to room temperature. In addition to its utility for multi-fragment assembly Hot Fusion provides a highly efficient method for cloning DNA fragments containing inverted repeats for applications such as RNAi. The overall cloning efficiency is in the order of 90–95%. PMID:25551825
DNA looping by FokI: the impact of synapse geometry on loop topology at varied site orientations
Rusling, David A.; Laurens, Niels; Pernstich, Christian; Wuite, Gijs J. L.; Halford, Stephen E.
2012-01-01
Most restriction endonucleases, including FokI, interact with two copies of their recognition sequence before cutting DNA. On DNA with two sites they act in cis looping out the intervening DNA. While many restriction enzymes operate symmetrically at palindromic sites, FokI acts asymmetrically at a non-palindromic site. The directionality of its sequence means that two FokI sites can be bridged in either parallel or anti-parallel alignments. Here we show by biochemical and single-molecule biophysical methods that FokI aligns two recognition sites on separate DNA molecules in parallel and that the parallel arrangement holds for sites in the same DNA regardless of whether they are in inverted or repeated orientations. The parallel arrangement dictates the topology of the loop trapped between sites in cis: the loop from inverted sites has a simple 180° bend, while that with repeated sites has a convoluted 360° turn. The ability of FokI to act at asymmetric sites thus enabled us to identify the synapse geometry for sites in trans and in cis, which in turn revealed the relationship between synapse geometry and loop topology. PMID:22362745
Weihofen, Wilhelm Andreas; Cicek, Aslan; Pratto, Florencia; Alonso, Juan Carlos; Saenger, Wolfram
2006-01-01
Repressor ω regulates transcription of genes required for copy number control, accurate segregation and stable maintenance of inc18 plasmids hosted by Gram-positive bacteria. ω belongs to homodimeric ribbon-helix-helix (RHH2) repressors typified by a central, antiparallel β-sheet for DNA major groove binding. Homodimeric ω2 binds cooperatively to promotors with 7 to 10 consecutive non-palindromic DNA heptad repeats (5′-A/TATCACA/T-3′, symbolized by →) in palindromic inverted, converging (→←) or diverging (←→) orientation and also, unique to ω2 and contrasting other RHH2 repressors, to non-palindromic direct (→→) repeats. Here we investigate with crystal structures how ω2 binds specifically to heptads in minimal operators with (→→) and (→←) repeats. Since the pseudo-2-fold axis relating the monomers in ω2 passes the central C–G base pair of each heptad with ∼0.3 Å downstream offset, the separation between the pseudo-2-fold axes is exactly 7 bp in (→→), ∼0.6 Å shorter in (→←) but would be ∼0.6 Å longer in (←→). These variations grade interactions between adjacent ω2 and explain modulations in cooperative binding affinity of ω2 to operators with different heptad orientations. PMID:16528102
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bhattacharya, Monolekha; Das, Amit Kumar, E-mail: amitk@hijli.iitkgp.ernet.in
Highlights: Black-Right-Pointing-Pointer The regulatory sequences recognized by TcrX have been identified. Black-Right-Pointing-Pointer The regulatory region comprises of inverted repeats segregated by 30 bp region. Black-Right-Pointing-Pointer The mode of binding of TcrX with regulatory sequence is unique. Black-Right-Pointing-Pointer In silico TcrX-DNA docked model binds one of the inverted repeats. Black-Right-Pointing-Pointer Both phosphorylated and unphosphorylated TcrX binds regulatory sequence in vitro. -- Abstract: TcrY, a histidine kinase, and TcrX, a response regulator, constitute a two-component system in Mycobacterium tuberculosis. tcrX, which is expressed during iron scarcity, is instrumental in the survival of iron-dependent M. tuberculosis. However, the regulator of tcrX/Y has notmore » been fully characterized. Crosslinking studies of TcrX reveal that it can form oligomers in vitro. Electrophoretic mobility shift assays (EMSAs) show that TcrX recognizes two regions in the promoter that are comprised of inverted repeats separated by {approx}30 bp. The dimeric in silico model of TcrX predicts binding to one of these inverted repeat regions. Site-directed mutagenesis and radioactive phosphorylation indicate that D54 of TcrX is phosphorylated by H256 of TcrY. However, phosphorylated and unphosphorylated TcrX bind the regulatory sequence with equal efficiency, which was shown with an EMSA using the D54A TcrX mutant.« less
Plourde, Marie; Gingras, Hélène; Roy, Gaétan; Lapointe, Andréanne; Leprohon, Philippe; Papadopoulou, Barbara; Corbeil, Jacques; Ouellette, Marc
2014-01-01
Gene amplification of specific loci has been described in all kingdoms of life. In the protozoan parasite Leishmania, the product of amplification is usually part of extrachromosomal circular or linear amplicons that are formed at the level of direct or inverted repeated sequences. A bioinformatics screen revealed that repeated sequences are widely distributed in the Leishmania genome and the repeats are chromosome-specific, conserved among species, and generally present in low copy number. Using sensitive PCR assays, we provide evidence that the Leishmania genome is continuously being rearranged at the level of these repeated sequences, which serve as a functional platform for constitutive and stochastic amplification (and deletion) of genomic segments in the population. This process is adaptive as the copy number of advantageous extrachromosomal circular or linear elements increases upon selective pressure and is reversible when selection is removed. We also provide mechanistic insights on the formation of circular and linear amplicons through RAD51 recombinase-dependent and -independent mechanisms, respectively. The whole genome of Leishmania is thus stochastically rearranged at the level of repeated sequences, and the selection of parasite subpopulations with changes in the copy number of specific loci is used as a strategy to respond to a changing environment. PMID:24844805
Target Site Recognition by a Diversity-Generating Retroelement
Guo, Huatao; Tse, Longping V.; Nieh, Angela W.; Czornyj, Elizabeth; Williams, Steven; Oukil, Sabrina; Liu, Vincent B.; Miller, Jeff F.
2011-01-01
Diversity-generating retroelements (DGRs) are in vivo sequence diversification machines that are widely distributed in bacterial, phage, and plasmid genomes. They function to introduce vast amounts of targeted diversity into protein-encoding DNA sequences via mutagenic homing. Adenine residues are converted to random nucleotides in a retrotransposition process from a donor template repeat (TR) to a recipient variable repeat (VR). Using the Bordetella bacteriophage BPP-1 element as a prototype, we have characterized requirements for DGR target site function. Although sequences upstream of VR are dispensable, a 24 bp sequence immediately downstream of VR, which contains short inverted repeats, is required for efficient retrohoming. The inverted repeats form a hairpin or cruciform structure and mutational analysis demonstrated that, while the structure of the stem is important, its sequence can vary. In contrast, the loop has a sequence-dependent function. Structure-specific nuclease digestion confirmed the existence of a DNA hairpin/cruciform, and marker coconversion assays demonstrated that it influences the efficiency, but not the site of cDNA integration. Comparisons with other phage DGRs suggested that similar structures are a conserved feature of target sequences. Using a kanamycin resistance determinant as a reporter, we found that transplantation of the IMH and hairpin/cruciform-forming region was sufficient to target the DGR diversification machinery to a heterologous gene. In addition to furthering our understanding of DGR retrohoming, our results suggest that DGRs may provide unique tools for directed protein evolution via in vivo DNA diversification. PMID:22194701
Non-B DB v2.0: a database of predicted non-B DNA-forming motifs and its associated tools.
Cer, Regina Z; Donohue, Duncan E; Mudunuri, Uma S; Temiz, Nuri A; Loss, Michael A; Starner, Nathan J; Halusa, Goran N; Volfovsky, Natalia; Yi, Ming; Luke, Brian T; Bacolla, Albino; Collins, Jack R; Stephens, Robert M
2013-01-01
The non-B DB, available at http://nonb.abcc.ncifcrf.gov, catalogs predicted non-B DNA-forming sequence motifs, including Z-DNA, G-quadruplex, A-phased repeats, inverted repeats, mirror repeats, direct repeats and their corresponding subsets: cruciforms, triplexes and slipped structures, in several genomes. Version 2.0 of the database revises and re-implements the motif discovery algorithms to better align with accepted definitions and thresholds for motifs, expands the non-B DNA-forming motifs coverage by including short tandem repeats and adds key visualization tools to compare motif locations relative to other genomic annotations. Non-B DB v2.0 extends the ability for comparative genomics by including re-annotation of the five organisms reported in non-B DB v1.0, human, chimpanzee, dog, macaque and mouse, and adds seven additional organisms: orangutan, rat, cow, pig, horse, platypus and Arabidopsis thaliana. Additionally, the non-B DB v2.0 provides an overall improved graphical user interface and faster query performance.
Ribosomal protein S14 transcripts are edited in Oenothera mitochondria.
Schuster, W; Unseld, M; Wissinger, B; Brennicke, A
1990-01-01
The gene encoding ribosomal protein S14 (rps14) in Oenothera mitochondria is located upstream of the cytochrome b gene (cob). Sequence analysis of independently derived cDNA clones covering the entire rps14 coding region shows two nucleotides edited from the genomic DNA to the mRNA derived sequences by C to U modifications. A third editing event occurs four nucleotides upstream of the AUG initiation codon and improves a potential ribosome binding site. A CGG codon specifying arginine in a position conserved in evolution between chloroplasts and E. coli as a UGG tryptophan codon is not edited in any of the cDNAs analysed. An inverted repeat 3' of an unidentified open reading frame is located upstream of the rps14 gene. The inverted repeat sequence is highly conserved at analogous regions in other Oenothera mitochondrial loci. Images PMID:2326162
Zimmerman, Carl-Ulrich R; Rosengarten, Renate; Spergser, Joachim
2013-01-01
Phase variation of two loci (‘mba locus’ and ‘UU172 phase-variable element’) in Ureaplasma parvum serovar 3 has been suggested as result of site-specific DNA inversion occurring at short inverted repeats. Three potential tyrosine recombinases (RipX, XerC, and CodV encoded by the genes UU145, UU222, and UU529) have been annotated in the genome of U. parvum serovar 3, which could be mediators in the proposed recombination event. We document that only orthologs of the gene xerC are present in all strains that show phase variation in the two loci. We demonstrate in vitro binding of recombinant maltose-binding protein fusions of XerC to the inverted repeats of the phase-variable loci, of RipX to a direct repeat that flanks a 20-kbp region, which has been proposed as putative pathogenicity island, and of CodV to a putative dif site. Co-transformation of the model organism Mycoplasma pneumoniae M129 with both the ‘mba locus’ and the recombinase gene xerC behind an active promoter region resulted in DNA inversion in the ‘mba locus’. Results suggest that XerC of U. parvum serovar 3 is a mediator in the proposed DNA inversion event of the two phase-variable loci. PMID:23305333
Kazakoff, Stephen H.; Imelfort, Michael; Edwards, David; Koehorst, Jasper; Biswas, Bandana; Batley, Jacqueline; Scott, Paul T.; Gresshoff, Peter M.
2012-01-01
Pongamia pinnata (syn. Millettia pinnata) is a novel, fast-growing arboreal legume that bears prolific quantities of oil-rich seeds suitable for the production of biodiesel and aviation biofuel. Here, we have used Illumina® ‘Second Generation DNA Sequencing (2GS)’ and a new short-read de novo assembler, SaSSY, to assemble and annotate the Pongamia chloroplast (152,968 bp; cpDNA) and mitochondrial (425,718 bp; mtDNA) genomes. We also show that SaSSY can be used to accurately assemble 2GS data, by re-assembling the Lotus japonicus cpDNA and in the process assemble its mtDNA (380,861 bp). The Pongamia cpDNA contains 77 unique protein-coding genes and is almost 60% gene-dense. It contains a 50 kb inversion common to other legumes, as well as a novel 6.5 kb inversion that is responsible for the non-disruptive, re-orientation of five protein-coding genes. Additionally, two copies of an inverted repeat firmly place the species outside the subclade of the Fabaceae lacking the inverted repeat. The Pongamia and L. japonicus mtDNA contain just 33 and 31 unique protein-coding genes, respectively, and like other angiosperm mtDNA, have expanded intergenic and multiple repeat regions. Through comparative analysis with Vigna radiata we measured the average synonymous and non-synonymous divergence of all three legume mitochondrial (1.59% and 2.40%, respectively) and chloroplast (8.37% and 8.99%, respectively) protein-coding genes. Finally, we explored the relatedness of Pongamia within the Fabaceae and showed the utility of the organellar genome sequences by mapping transcriptomic data to identify up- and down-regulated stress-responsive gene candidates and confirm in silico predicted RNA editing sites. PMID:23272141
Kazakoff, Stephen H; Imelfort, Michael; Edwards, David; Koehorst, Jasper; Biswas, Bandana; Batley, Jacqueline; Scott, Paul T; Gresshoff, Peter M
2012-01-01
Pongamia pinnata (syn. Millettia pinnata) is a novel, fast-growing arboreal legume that bears prolific quantities of oil-rich seeds suitable for the production of biodiesel and aviation biofuel. Here, we have used Illumina® 'Second Generation DNA Sequencing (2GS)' and a new short-read de novo assembler, SaSSY, to assemble and annotate the Pongamia chloroplast (152,968 bp; cpDNA) and mitochondrial (425,718 bp; mtDNA) genomes. We also show that SaSSY can be used to accurately assemble 2GS data, by re-assembling the Lotus japonicus cpDNA and in the process assemble its mtDNA (380,861 bp). The Pongamia cpDNA contains 77 unique protein-coding genes and is almost 60% gene-dense. It contains a 50 kb inversion common to other legumes, as well as a novel 6.5 kb inversion that is responsible for the non-disruptive, re-orientation of five protein-coding genes. Additionally, two copies of an inverted repeat firmly place the species outside the subclade of the Fabaceae lacking the inverted repeat. The Pongamia and L. japonicus mtDNA contain just 33 and 31 unique protein-coding genes, respectively, and like other angiosperm mtDNA, have expanded intergenic and multiple repeat regions. Through comparative analysis with Vigna radiata we measured the average synonymous and non-synonymous divergence of all three legume mitochondrial (1.59% and 2.40%, respectively) and chloroplast (8.37% and 8.99%, respectively) protein-coding genes. Finally, we explored the relatedness of Pongamia within the Fabaceae and showed the utility of the organellar genome sequences by mapping transcriptomic data to identify up- and down-regulated stress-responsive gene candidates and confirm in silico predicted RNA editing sites.
Dias, Guilherme B.; Svartman, Marta; Delprat, Alejandra; Ruiz, Alfredo; Kuhn, Gustavo C.S.
2014-01-01
Transposable elements (TEs) and satellite DNAs (satDNAs) are abundant components of most eukaryotic genomes studied so far and their impact on evolution has been the focus of several studies. A number of studies linked TEs with satDNAs, but the nature of their evolutionary relationships remains unclear. During in silico analyses of the Drosophila virilis assembled genome, we found a novel DNA transposon we named Tetris based on its modular structure and diversity of rearranged forms. We aimed to characterize Tetris and investigate its role in generating satDNAs. Data mining and sequence analysis showed that Tetris is apparently nonautonomous, with a structure similar to foldback elements, and present in D. virilis and D. americana. Herein, we show that Tetris shares the final portions of its terminal inverted repeats (TIRs) with DAIBAM, a previously described miniature inverted transposable element implicated in the generation of chromosome inversions. Both elements are likely to be mobilized by the same autonomous TE. Tetris TIRs contain approximately 220-bp internal tandem repeats that we have named TIR-220. We also found TIR-220 repeats making up longer (kb-size) satDNA-like arrays. Using bioinformatic, phylogenetic and cytogenomic tools, we demonstrated that Tetris has contributed to shaping the genomes of D. virilis and D. americana, providing internal tandem repeats that served as building blocks for the amplification of satDNA arrays. The β-heterochromatic genomic environment seemed to have favored such amplification. Our results imply for the first time a role for foldback elements in generating satDNAs. PMID:24858539
Xavier, Crislaine; Cabral-de-Mello, Diogo Cavalcanti; de Moura, Rita Cássia
2014-12-01
Cytogenetic studies of the Neotropical beetle genus Dichotomius (Scarabaeinae, Coleoptera) have shown dynamism for centromeric constitutive heterochromatin sequences. In the present work we studied the chromosomes and isolated repetitive sequences of Dichotomius schiffleri aiming to contribute to the understanding of coleopteran genome/chromosomal organization. Dichotomius schiffleri presented a conserved karyotype and heterochromatin distribution in comparison to other species of the genus with 2n = 18, biarmed chromosomes, and pericentromeric C-positive blocks. Similarly to heterochromatin distributional patterns, the highly and moderately repetitive DNA fraction (C 0 t-1 DNA) was detected in pericentromeric areas, contrasting with the euchromatic mapping of an isolated TE (named DsmarMITE). After structural analyses, the DsmarMITE was classified as a non-autonomous element of the type miniature inverted-repeat transposable element (MITE) with terminal inverted repeats similar to Mariner elements of insects from different orders. The euchromatic distribution for DsmarMITE indicates that it does not play a part in the dynamics of constitutive heterochromatin sequences.
Dias, Guilherme B; Svartman, Marta; Delprat, Alejandra; Ruiz, Alfredo; Kuhn, Gustavo C S
2014-05-24
Transposable elements (TEs) and satellite DNAs (satDNAs) are abundant components of most eukaryotic genomes studied so far and their impact on evolution has been the focus of several studies. A number of studies linked TEs with satDNAs, but the nature of their evolutionary relationships remains unclear. During in silico analyses of the Drosophila virilis assembled genome, we found a novel DNA transposon we named Tetris based on its modular structure and diversity of rearranged forms. We aimed to characterize Tetris and investigate its role in generating satDNAs. Data mining and sequence analysis showed that Tetris is apparently nonautonomous, with a structure similar to foldback elements, and present in D. virilis and D. americana. Herein, we show that Tetris shares the final portions of its terminal inverted repeats (TIRs) with DAIBAM, a previously described miniature inverted transposable element implicated in the generation of chromosome inversions. Both elements are likely to be mobilized by the same autonomous TE. Tetris TIRs contain approximately 220-bp internal tandem repeats that we have named TIR-220. We also found TIR-220 repeats making up longer (kb-size) satDNA-like arrays. Using bioinformatic, phylogenetic and cytogenomic tools, we demonstrated that Tetris has contributed to shaping the genomes of D. virilis and D. americana, providing internal tandem repeats that served as building blocks for the amplification of satDNA arrays. The β-heterochromatic genomic environment seemed to have favored such amplification. Our results imply for the first time a role for foldback elements in generating satDNAs. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Kayal, Ehsan; Lavrov, Dennis V
2008-02-29
The 16,314-nuceotide sequence of the linear mitochondrial DNA (mtDNA) molecule of Hydra oligactis (Cnidaria, Hydrozoa)--the first from the class Hydrozoa--has been determined. This sequence contains genes for 13 energy pathway proteins, small and large subunit rRNAs, and methionine and tryptophan tRNAs, as is typical for cnidarians. All genes have the same transcriptional orientation and their arrangement in the genome is similar to that of the jellyfish Aurelia aurita. In addition, a partial copy of cox1 is present at one end of the molecule in a transcriptional orientation opposite to the rest of the genes, forming a part of inverted terminal repeat characteristic of linear mtDNA and linear mitochondrial plasmids. The sequence close to at least one end of the molecule contains several homonucleotide runs as well as small inverted repeats that are able to form strong secondary structures and may be involved in mtDNA maintenance and expression. Phylogenetic analysis of mitochondrial genes of H. oligactis and other cnidarians supports the Medusozoa hypothesis but also suggests that Anthozoa may be paraphyletic, with octocorallians more closely related to the Medusozoa than to the Hexacorallia. The latter inference implies that Anthozoa is paraphyletic and that the polyp (rather than a medusa) is the ancestral body type in Cnidaria.
[Active miniature inverted-repeat transposable elements transposon in plants: a review].
Hu, Bingjie; Zhou, Mingbing
2018-02-25
Miniature inverted-repeat transposable elements transposon is a special transposon that could transpose by "cut-paste" mechanism, which is one of characteristics of DNA transposons. Otherwise, the copy number of MITEs is very high, which is one of characteristics of RNA transposons. Many MITE families have been reported, but little about active MITEs. We summarize recent advances in studying active MITEs. Most the MITEs belong to the Tourist-like family, such as mPing, mGing, PhTourist1, Tmi1 and PhTst-3. Additionally, DTstu1 and MITE-39 belong to Stowaway-like family, and AhMITEs1 belongs to Mutator-like family. Moreover, we summarize the structure (terminal inverse repeats and target site duplications), copy number, evolution pattern and transposition characteristics of these active MITEs, to provide the foundation for the identification of other active MITEs and subsequent research on MITE transposition and amplification mechanism.
Gill, Pooria; Ranjbar, Bijan; Saber, Reza; Khajeh, Khosro; Mohammadian, Mehdi
2011-07-01
Cauliflower-like DNAs are stem-loop DNAs that are fabricated periodically in inverted repetitions from deoxyribonucleic acid phosphates (dNTPs) by loop-mediated isothermal amplification (LAMP). Cauliflower-like DNAs have ladder-shape behaviors on gel electrophoresis, and increasing the time of LAMP leads to multiplying the repetitions, stem-loops, and electrophoretic bands. Cauliflower-like DNAs were fabricated via LAMP using two loop primers, two bumper primers, dNTPs, a λ-phage DNA template, and a Bst DNA polymerase in 75- and 90-min periods. These times led to manufacturing two types of cauliflower-like DNAs with different contents of inverted repetitions and stem-loops, which were clearly indicated by two comparable electrophoresis patterns in agarose gel. LAMP-fabricated DNAs and natural dsB-DNA (salmon genomic DNA) were dialyzed in Gomori phosphate buffer (10 mM, pH 7.4) to be isolated from salts, nucleotides, and primers. Dialyzed DNAs were studied using UV spectroscopy, circular dichroism spectropolarimetry, and fluorescence spectrophotometry. Structural analyses indicated reduction of the molecular ellipticity and extinction coefficients in comparison with B-DNA. Also, cauliflower-like DNAs demonstrated less intrinsic and more extrinsic fluorescence in comparison with natural DNA. The overwinding and lengthening of the cauliflower-like configurations of LAMP DNAs led to changes in physical parameters of this type of DNA in comparison with natural DNA. The results obtained introduced new biomolecular characteristics of DNA macromolecules fabricated within a LAMP process and show the effects of more inverted repeats and stem-loops, which are manufactured by lengthening the process.
Vladimirov, N V; Likhoshvaĭ, V A; Matushkin, Iu G
2007-01-01
Gene expression is known to correlate with degree of codon bias in many unicellular organisms. However, such correlation is absent in some organisms. Recently we demonstrated that inverted complementary repeats within coding DNA sequence must be considered for proper estimation of translation efficiency, since they may form secondary structures that obstruct ribosome movement. We have developed a program for estimation of potential coding DNA sequence expression in defined unicellular organism using its genome sequence. The program computes elongation efficiency index. Computation is based on estimation of coding DNA sequence elongation efficiency, taking into account three key factors: codon bias, average number of inverted complementary repeats, and free energy of potential stem-loop structures formed by the repeats. The influence of these factors on translation is numerically estimated. An optimal proportion of these factors is computed for each organism individually. Quantitative translational characteristics of 384 unicellular organisms (351 bacteria, 28 archaea, 5 eukaryota) have been computed using their annotated genomes from NCBI GenBank. Five potential evolutionary strategies of translational optimization have been determined among studied organisms. A considerable difference of preferred translational strategies between Bacteria and Archaea has been revealed. Significant correlations between elongation efficiency index and gene expression levels have been shown for two organisms (S. cerevisiae and H. pylori) using available microarray data. The proposed method allows to estimate numerically the coding DNA sequence translation efficiency and to optimize nucleotide composition of heterologous genes in unicellular organisms. http://www.mgs.bionet.nsc.ru/mgs/programs/eei-calculator/.
Rodova, Marianna; Rudolph, Angela; Chipps, Elizabeth; Islam, M. Rafiq
2013-01-01
Hereditary hemochromatosis (HH) is a common autosomal recessive disorder of iron overload among Caucasians of northern European descent. Over 85% of all cases with HH are due to mutations in the hemochromatosis protein (HFE) involved in iron metabolism. Although the importance in iron homeostasis is well recognized, the mechanism of sensing and regulating iron absorption by HFE, especially in the absence of iron response element in its gene, is not fully understood. In this report, we have identified an inverted repeat sequence (ATGGTcttACCTA) within 1700 bp (−1675/+35) of the HFE promoter capable to form cruciform structure that binds PARP1 and strongly represses HFE promoter. Knockdown of PARP1 increases HFE mRNA and protein. Similarly, hemin or FeCl3 treatments resulted in increase in HFE expression by reducing nuclear PARP1 pool via its apoptosis induced cleavage, leading to upregulation of the iron regulatory hormone hepcidin mRNA. Thus, PARP1 binding to the inverted repeat sequence on the HFE promoter may serve as a novel iron sensing mechanism as increased iron level can trigger PARP1 cleavage and relief of HFE transcriptional repression. PMID:24184271
Pelham, Christopher; Jimenez, Tamara; Rodova, Marianna; Rudolph, Angela; Chipps, Elizabeth; Islam, M Rafiq
2013-12-01
Hereditary hemochromatosis (HH) is a common autosomal recessive disorder of iron overload among Caucasians of northern European descent. Over 85% of all cases with HH are due to mutations in the hemochromatosis protein (HFE) involved in iron metabolism. Although the importance in iron homeostasis is well recognized, the mechanism of sensing and regulating iron absorption by HFE, especially in the absence of iron response element in its gene, is not fully understood. In this report, we have identified an inverted repeat sequence (ATGGTcttACCTA) within 1700bp (-1675/+35) of the HFE promoter capable to form cruciform structure that binds PARP1 and strongly represses HFE promoter. Knockdown of PARP1 increases HFE mRNA and protein. Similarly, hemin or FeCl3 treatments resulted in increase in HFE expression by reducing nuclear PARP1 pool via its apoptosis induced cleavage, leading to upregulation of the iron regulatory hormone hepcidin mRNA. Thus, PARP1 binding to the inverted repeat sequence on the HFE promoter may serve as a novel iron sensing mechanism as increased iron level can trigger PARP1 cleavage and relief of HFE transcriptional repression. © 2013.
Chatterjee, Gautam; Sankaranarayanan, Sundar Ram; Guin, Krishnendu; Thattikota, Yogitha; Padmanabhan, Sreedevi; Siddharthan, Rahul; Sanyal, Kaustuv
2016-01-01
The centromere, on which kinetochore proteins assemble, ensures precise chromosome segregation. Centromeres are largely specified by the histone H3 variant CENP-A (also known as Cse4 in yeasts). Structurally, centromere DNA sequences are highly diverse in nature. However, the evolutionary consequence of these structural diversities on de novo CENP-A chromatin formation remains elusive. Here, we report the identification of centromeres, as the binding sites of four evolutionarily conserved kinetochore proteins, in the human pathogenic budding yeast Candida tropicalis. Each of the seven centromeres comprises a 2 to 5 kb non-repetitive mid core flanked by 2 to 5 kb inverted repeats. The repeat-associated centromeres of C. tropicalis all share a high degree of sequence conservation with each other and are strikingly diverged from the unique and mostly non-repetitive centromeres of related Candida species—Candida albicans, Candida dubliniensis, and Candida lusitaniae. Using a plasmid-based assay, we further demonstrate that pericentric inverted repeats and the underlying DNA sequence provide a structural determinant in CENP-A recruitment in C. tropicalis, as opposed to epigenetically regulated CENP-A loading at centromeres in C. albicans. Thus, the centromere structure and its influence on de novo CENP-A recruitment has been significantly rewired in closely related Candida species. Strikingly, the centromere structural properties along with role of pericentric repeats in de novo CENP-A loading in C. tropicalis are more reminiscent to those of the distantly related fission yeast Schizosaccharomyces pombe. Taken together, we demonstrate, for the first time, fission yeast-like repeat-associated centromeres in an ascomycetous budding yeast. PMID:26845548
2006-11-01
terminal repetition of adenvirus type 4 DNA. Gene 18:329-334. 20. Van der Veen , J., and J. H. Dijkman . 1962. Association of type 21 adenovirus with acute respiratory illness in military recruits. Am J Hyg 76:149-159.
Tan, Benedict G.; Vijgenboom, Erik; Worrall, Jonathan A. R.
2014-01-01
Metal ion homeostasis in bacteria relies on metalloregulatory proteins to upregulate metal resistance genes and enable the organism to preclude metal toxicity. The copper sensitive operon repressor (CsoR) family is widely distributed in bacteria and controls the expression of copper efflux systems. CsoR operator sites consist of G-tract containing pseudopalindromes of which the mechanism of operator binding is poorly understood. Here, we use a structurally characterized CsoR from Streptomyces lividans (CsoRSl) together with three specific operator targets to reveal the salient features pertaining to the mechanism of DNA binding. We reveal that CsoRSl binds to its operator site through a 2-fold axis of symmetry centred on a conserved 5′-TAC/GTA-3′ inverted repeat. Operator recognition is stringently dependent not only on electropositive residues but also on a conserved polar glutamine residue. Thermodynamic and circular dichroic signatures of the CsoRSl–DNA interaction suggest selectivity towards the A-DNA-like topology of the G-tracts at the operator site. Such properties are enhanced on protein binding thus enabling the symmetrical binding of two CsoRSl tetramers. Finally, differential binding modes may exist in operator sites having more than one 5′-TAC/GTA-3′ inverted repeat with implications in vivo for a mechanism of modular control. PMID:24121681
Mammalian DNA enriched for replication origins is enriched for snap-back sequences.
Zannis-Hadjopoulos, M; Kaufmann, G; Martin, R G
1984-11-15
Using the instability of replication loops as a method for the isolation of double-stranded nascent DNA, extruded DNA enriched for replication origins was obtained and denatured. Snap-back DNA, single-stranded DNA with inverted repeats (palindromic sequences), reassociates rapidly into stem-loop structures with zero-order kinetics when conditions are changed from denaturing to renaturing, and can be assayed by chromatography on hydroxyapatite. Origin-enriched nascent DNA strands from mouse, rat and monkey cells growing either synchronously or asynchronously were purified and assayed for the presence of snap-back sequences. The results show that origin-enriched DNA is also enriched for snap-back sequences, implying that some origins for mammalian DNA replication contain or lie near palindromic sequences.
Two new miniature inverted-repeat transposable elements in the genome of the clam Donax trunculus.
Šatović, Eva; Plohl, Miroslav
2017-10-01
Repetitive sequences are important components of eukaryotic genomes that drive their evolution. Among them are different types of mobile elements that share the ability to spread throughout the genome and form interspersed repeats. To broaden the generally scarce knowledge on bivalves at the genome level, in the clam Donax trunculus we described two new non-autonomous DNA transposons, miniature inverted-repeat transposable elements (MITEs), named DTC M1 and DTC M2. Like other MITEs, they are characterized by their small size, their A + T richness, and the presence of terminal inverted repeats (TIRs). DTC M1 and DTC M2 are 261 and 286 bp long, respectively, and in addition to TIRs, both of them contain a long imperfect palindrome sequence in their central parts. These elements are present in complete and truncated versions within the genome of the clam D. trunculus. The two new MITEs share only structural similarity, but lack any nucleotide sequence similarity to each other. In a search for related elements in databases, blast search revealed within the Crassostrea gigas genome a larger element sharing sequence similarity only to DTC M1 in its TIR sequences. The lack of sequence similarity with any previously published mobile elements indicates that DTC M1 and DTC M2 elements may be unique to D. trunculus.
Fitzpatrick, Terry; Huang, Sui
2012-01-01
Alu repeats within human genes may potentially alter gene expression. Here, we show that 3′-UTR-located inverted Alu repeats significantly reduce expression of an AcGFP reporter gene. Mutational analysis demonstrates that the secondary structure, but not the primary nucleotide sequence, of the inverted Alu repeats is critical for repression. The expression levels and nucleocytoplasmic distribution of reporter mRNAs with or without 3′-UTR inverted Alu repeats are similar; suggesting that reporter gene repression is not due to changes in mRNA levels or mRNA nuclear sequestration. Instead, reporter gene mRNAs harboring 3′-UTR inverted Alu repeats accumulate in cytoplasmic stress granules. These findings may suggest a novel mechanism whereby 3′-UTR-located inverted Alu repeats regulate human gene expression through sequestration of mRNAs within stress granules. PMID:22688648
DOE Office of Scientific and Technical Information (OSTI.GOV)
Polonskaya, Zhanna; Benham, Craig J.; Hearing, Janet
The minimal replicator of the Epstein-Barr virus (EBV) latent cycle origin of DNA replication oriP is composed of two binding sites for the Epstein-Barr virus nuclear antigen-1 (EBNA-1) and flanking inverted repeats that bind the telomere repeat binding factor TRF2. Although not required for minimal replicator activity, additional binding sites for EBNA-1 and TRF2 and one or more auxiliary elements located to the right of the EBNA-1/TRF2 sites are required for the efficient replication of oriP plasmids. Another region of oriP that is predicted to be destabilized by DNA supercoiling is shown here to be an important functional component ofmore » oriP. The ability of DNA fragments of unrelated sequence and possessing supercoiled-induced DNA duplex destabilized (SIDD) structures, but not fragments characterized by helically stable DNA, to substitute for this component of oriP demonstrates a role for the SIDD region in the initiation of oriP-plasmid DNA replication.« less
SV40 host-substituted variants: a new look at the monkey DNA inserts and recombinant junctions.
Singer, Maxine; Winocour, Ernest
2011-04-10
The available monkey genomic data banks were examined in order to determine the chromosomal locations of the host DNA inserts in 8 host-substituted SV40 variant DNAs. Five of the 8 variants contained more than one linked monkey DNA insert per tandem repeat unit and in all cases but one, the 19 monkey DNA inserts in the 8 variants mapped to different locations in the monkey genome. The 50 parental DNAs (32 monkey and 18 SV40 DNA segments) which spanned the crossover and flanking regions that participated in monkey/monkey and monkey/SV40 recombinations were characterized by substantial levels of microhomology of up to 8 nucleotides in length; the parental DNAs also exhibited direct and inverted repeats at or adjacent to the crossover sequences. We discuss how the host-substituted SV40 variants arose and the nature of the recombination mechanisms involved. Copyright © 2011 Elsevier Inc. All rights reserved.
Non-B DB: a database of predicted non-B DNA-forming motifs in mammalian genomes.
Cer, Regina Z; Bruce, Kevin H; Mudunuri, Uma S; Yi, Ming; Volfovsky, Natalia; Luke, Brian T; Bacolla, Albino; Collins, Jack R; Stephens, Robert M
2011-01-01
Although the capability of DNA to form a variety of non-canonical (non-B) structures has long been recognized, the overall significance of these alternate conformations in biology has only recently become accepted en masse. In order to provide access to genome-wide locations of these classes of predicted structures, we have developed non-B DB, a database integrating annotations and analysis of non-B DNA-forming sequence motifs. The database provides the most complete list of alternative DNA structure predictions available, including Z-DNA motifs, quadruplex-forming motifs, inverted repeats, mirror repeats and direct repeats and their associated subsets of cruciforms, triplex and slipped structures, respectively. The database also contains motifs predicted to form static DNA bends, short tandem repeats and homo(purine•pyrimidine) tracts that have been associated with disease. The database has been built using the latest releases of the human, chimp, dog, macaque and mouse genomes, so that the results can be compared directly with other data sources. In order to make the data interpretable in a genomic context, features such as genes, single-nucleotide polymorphisms and repetitive elements (SINE, LINE, etc.) have also been incorporated. The database is accessed through query pages that produce results with links to the UCSC browser and a GBrowse-based genomic viewer. It is freely accessible at http://nonb.abcc.ncifcrf.gov.
USDA-ARS?s Scientific Manuscript database
Transposable elements (TEs) are mobile DNA regions that alter host genome structure and gene expression. A novel 588 bp non-autonomous high copy number TE in the Ostrinia nubilalis genome has features in common with miniature inverted-repeat transposable elements (MITEs): high A+T content (62.3%),...
Kapila, R; Das, S; Srivastava, P S; Lakshmikumaran, M
1996-08-01
DNA sequences representing a tandemly repeated DNA family of the Sinapis arvensis genome were cloned and characterized. The 700-bp tandem repeat family is represented by two clones, pSA35 and pSA52, which are 697 and 709 bp in length, respectively. Dot matrix analysis of the sequences indicates the presence of repeated elements within each monomeric unit. Sequence analysis of the repetitive region of clones pSA35 and pSA52 shows that there are several copies of a 7-bp repeat element organized in tandem. The consensus sequence of this repeat element is 5'-TTTAGGG-3'. These elements are highly mutated and the difference in length between the two clones is due to different copy numbers of these elements. The repetitive region of clone pSA35 has 26 copies of the element TTTAGGG, whereas clone pSA52 has 28 copies. The repetitive region in both clones is flanked on either side by inverted repeats that may be footprints of a transposition event. Sequence comparison indicates that the element TTTAGGG is identical to telomeric repeats present in Arabidopsis, maize, tomato, and other plants. However, Bal31 digestion kinetics indicates non-telomeric localization of the 700-bp tandem repeats. The clones represent a novel repeat family as (i) they contain telomere-like motifs as subrepeats within each unit; and (ii) they do not hybridize to related crucifers and are species-specific in nature.
Newman, S. M.; Boynton, J. E.; Gillham, N. W.; Randolph-Anderson, B. L.; Johnson, A. M.; Harris, E. H.
1990-01-01
Transformation of chloroplast ribosomal RNA (rRNA) genes in Chlamydomonas has been achieved by the biolistic process using cloned chloroplast DNA fragments carrying mutations that confer antibiotic resistance. The sites of exchange employed during the integration of the donor DNA into the recipient genome have been localized using a combination of antibiotic resistance mutations in the 16S and 23S rRNA genes and restriction fragment length polymorphisms that flank these genes. Complete or nearly complete replacement of a region of the chloroplast genome in the recipient cell by the corresponding sequence from the donor plasmid was the most common integration event. Exchange events between the homologous donor and recipient sequences occurred preferentially near the vector:insert junctions. Insertion of the donor rRNA genes and flanking sequences into one inverted repeat of the recipient genome was followed by intramolecular copy correction so that both copies of the inverted repeat acquired identical sequences. Increased frequencies of rRNA gene transformants were achieved by reducing the copy number of the chloroplast genome in the recipient cells and by decreasing the heterology between donor and recipient DNA sequences flanking the selectable markers. In addition to producing bona fide chloroplast rRNA transformants, the biolistic process induced mutants resistant to low levels of streptomycin, typical of nuclear mutations in Chlamydomonas. PMID:1981764
Wu, Chung-Shien; Lin, Ching-Ping; Hsu, Chi-Yao; Wang, Rui-Jiang; Chaw, Shu-Miaw
2011-01-01
Abstract Pinaceae, the largest family of conifers, has diversified organizations of chloroplast genomes (cpDNAs) with the two typical inverted repeats (IRs) highly reduced. To unravel the mechanism of this genomic diversification, we examined the cpDNA organizations from 53 species of the ten Pinaceous genera, including those of Larix decidua (122,474 bp), Picea morrisonicola (124,168 bp), and Pseudotsuga wilsoniana (122,513 bp), which were firstly elucidated. The results uncovered four distinct cpDNA forms (A−C and P) that are due to rearrangements of two ∼20 and ∼21 kb specific fragments. The C form was documented for the first time and the A form might be the most ancestral one. In addition, only the individuals of Ps. macrocarpa and Ps. wilsoniana were detected to have isomeric cpDNA forms. Three types (types 1−3) of Pinaceae-specific repeats situated nearby the rearranged fragments were found to be syntenic. We hypothesize that type 1 (949 ± 343 bp) and type 3 (608 ± 73 bp) repeats are substrates for homologous recombination (HR), whereas type 2 repeats are likely inactive for HR because of their relatively short sizes (151 ± 30 bp). Conversions among the four distinct forms may be achieved by HR and mediated by type 1 or 3 repeats, thus resulting in increased diversity of cpDNA organizations. We propose that in the Pinaceae cpDNAs, the reduced IRs have lost HR activity, then decreasing the diversity of cpDNA organizations, but the specific repeats that the evolution endowed Pinaceae complement the reduced IRs and increase the diversity of cpDNA organizations. PMID:21402866
Do, Hoang Dang Khoa; Kim, Joo-Hwan
2017-01-01
Chloroplast genomes (cpDNA) are highly valuable resources for evolutionary studies of angiosperms, since they are highly conserved, are small in size, and play critical roles in plants. Slipped-strand mispairing (SSM) was assumed to be a mechanism for generating repeat units in cpDNA. However, research on the employment of different small repeated sequences through SSM events, which may induce the accumulation of distinct types of repeats within the same region in cpDNA, has not been documented. Here, we sequenced two chloroplast genomes from the endemic species Heloniopsis tubiflora (Korea) and Xerophyllum tenax (USA) to cover the gap between molecular data and explore "hot spots" for genomic events in Melanthiaceae. Comparative analysis of 23 complete cpDNA sequences revealed that there were different stages of deletion in the rps16 region across the Melanthiaceae. Based on the partial or complete loss of rps16 gene in cpDNA, we have firstly reported potential molecular markers for recognizing two sections ( Veratrum and Fuscoveratrum ) of Veratrum . Melathiaceae exhibits a significant change in the junction between large single copy and inverted repeat regions, ranging from trnH_GUG to a part of rps3 . Our results show an accumulation of tandem repeats in the rpl23-ycf2 regions of cpDNAs. Small conserved sequences exist and flank tandem repeats in further observation of this region across most of the examined taxa of Liliales. Therefore, we propose three scenarios in which different small repeated sequences were used during SSM events to generate newly distinct types of repeats. Occasionally, prior to the SSM process, point mutation event and double strand break repair occurred and induced the formation of initial repeat units which are indispensable in the SSM process. SSM may have likely occurred more frequently for short repeats than for long repeat sequences in tribe Parideae (Melanthiaceae, Liliales). Collectively, these findings add new evidence of dynamic results from SSM in chloroplast genomes which can be useful for further evolutionary studies in angiosperms. Additionally, genomics events in cpDNA are potential resources for mining molecular markers in Liliales.
Schnitzler, P; Delius, H; Scholz, J; Touray, M; Orth, E; Darai, G
1987-12-01
The genome of the fish lymphocystis disease virus (FLDV) was screened for the existence of repetitive DNA sequences using a defined and complete gene library of the viral genome (98 kbp) by DNA-DNA hybridization, heteroduplex analysis, and restriction fine mapping. A repetitive DNA sequence was detected at the coordinates 0.034 to 0.057 and 0.718 to 0.736 map units (m.u.) of the FLDV genome. The first region (0.034 to 0.057 m.u.) corresponds to the 5' terminus of the EcoRI FLDV DNA fragment B (0.034 to 0.165 m.u.) and the second region (0.718 to 0.736 m.u.) is identical to the EcoRI DNA fragment M of the viral genome. The DNA nucleotide sequence of the EcoRI FLDV DNA fragment M was determined. This analysis revealed the presence of many short direct and inverted repetitions, e.g., a 18-mer direct repetition (TTTAAAATTTAATTAA) that started at nucleotide positions 812 and 942 and a 14-mer inverted repeat (TTAAATTTAAATTT) at nucleotide positions 820 and 959. Only short open reading frames were detected within this region. The DNA repetitions are discussed as sequences that play a possible regulatory role for virus replication. Furthermore, hybridization experiments revealed that the repetitive DNA sequences are conserved in the genome of different strains of fish lymphocystis disease virus isolated from two species of Pleuronectidae (flounder and dab).
Wu, Chung-Shien; Huang, Ya-Yi; Chaw, Shu-Miaw
2012-01-01
We determined the complete chloroplast genome (cpDNA) of Ginkgo biloba (common name: ginkgo), the only relict of ginkgophytes from the Triassic Period. The cpDNA molecule of ginkgo is quadripartite and circular, with a length of 156,945 bp, which is 6,458 bp shorter than that of Cycas taitungensis. In ginkgo cpDNA, rpl23 becomes pseudo, only one copy of ycf2 is retained, and there are at least five editing sites. We propose that the retained ycf2 is a duplicate of the ancestral ycf2, and the ancestral one has been lost from the inverted repeat A (IRA). This loss event should have occurred and led to the contraction of IRs after ginkgos diverged from other gymnosperms. A novel cluster of three transfer RNA (tRNA) genes, trnY-AUA, trnC-ACA, and trnSeC-UCA, was predicted to be located between trnC-GCA and rpoB of the large single-copy region. Our phylogenetic analysis strongly suggests that the three predicted tRNA genes are duplicates of trnC-GCA. Interestingly, in ginkgo cpDNA, the loss of one ycf2 copy does not significantly elevate the synonymous rate (Ks) of the retained copy, which disagrees with the view of Perry and Wolfe (2002) that one of the two-copy genes is subjected to elevated Ks when its counterpart has been lost. We hypothesize that the loss of one ycf2 is likely recent, and therefore, the acquired Ks of the retained copy is low. Our data reveal that ginkgo possesses several unique features that contribute to our understanding of the cpDNA evolution in seed plants. PMID:22403032
DOE Office of Scientific and Technical Information (OSTI.GOV)
Petrillo-Peixoto, M.L.; Beverley, S.M.
1988-12-01
We describe the structure of amplified DNA that was discovered in two laboratory stocks of the protozoan parasite Leishmania tarentolae. Restriction mapping and molecular cloning revealed that a region of 42 kilobases was amplified 8- to 30-fold in these lines. Southern blot analyses of digested DNAs or chromosomes separated by pulsed-field electrophoresis showed that the amplified DNA corresponded to the H region, a locus defined originally by its amplification in methotrexate-resistant Leishmania major. Similarities between the amplified DNA of the two species included (i) extensive cross-hybridization; (ii) approximate conservation of sequence order; (iii) extrachromosomal localization; (iv) an overall inverted, head-to-headmore » configuration as a circular 140-kilobase tetrameric molecule; (v) two regions of DNA sequence rearrangement, each of which was closely associated with the two centers of the inverted repeats; (vi) association with methotrexate resistance; and (vii) phenotypically conservative amplification, in which the wild-type chromosomal arrangement was retained without apparent modification. Our data showed that amplified DNA mediating drug resistance arose in unselected L. tarentolae, although the pressures leading to apparently spontaneous amplification and maintenance of the H region are not known. The simple structure and limited extent of DNA amplified in these and other Leishmania lines suggests that the study of gene amplification in Leishmania spp. offers an attractive model system for the study of amplification in cultured mammalian cells and tumors. We also introduced a method for measuring the size of large circular DNAs, using gamma-irradiation to introduce limited double-strand breaks followed by sizing of the linear DNAs by pulsed-field electrophoresis.« less
Marzo, Mar; Liu, Danxu; Ruiz, Alfredo; Chalmers, Ronald
2013-08-01
Galileo is a DNA transposon responsible for the generation of several chromosomal inversions in Drosophila. In contrast to other members of the P-element superfamily, it has unusually long terminal inverted-repeats (TIRs) that resemble those of Foldback elements. To investigate the function of the long TIRs we derived consensus and ancestral sequences for the Galileo transposase in three species of Drosophilids. Following gene synthesis, we expressed and purified their constituent THAP domains and tested their binding activity towards the respective Galileo TIRs. DNase I footprinting located the most proximal DNA binding site about 70 bp from the transposon end. Using this sequence we identified further binding sites in the tandem repeats that are found within the long TIRs. This suggests that the synaptic complex between Galileo ends may be a complicated structure containing higher-order multimers of the transposase. We also attempted to reconstitute Galileo transposition in Drosophila embryos but no events were detected. Thus, although the limited numbers of Galileo copies in each genome were sufficient to provide functional consensus sequences for the THAP domains, they do not specify a fully active transposase. Since the THAP recognition sequence is short, and will occur many times in a large genome, it seems likely that the multiple binding sites within the long, internally repetitive, TIRs of Galileo and other Foldback-like elements may provide the transposase with its binding specificity. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.
Solov'ev, V V; Kel', A E; Kolchanov, N A
1989-01-01
The factors, determining the presence of inverted and symmetrical repeats in genes coding for globular proteins, have been analysed. An interesting property of genetical code has been revealed in the analysis of symmetrical repeats: the pairs of symmetrical codons corresponded to pairs of amino acids with mostly similar physical-chemical parameters. This property may explain the presence of symmetrical repeats and palindromes only in genes coding for beta-structural proteins-polypeptides, where amino acids with similar physical-chemical properties occupy symmetrical positions. A stochastic model of evolution of polynucleotide sequences has been used for analysis of inverted repeats. The modelling demonstrated that only limiting of sequences (uneven frequencies of used codons) is enough for arising of nonrandom inverted repeats in genes.
DeBoy, Robert T; Mongodin, Emmanuel F; Emerson, Joanne B; Nelson, Karen E
2006-04-01
In the present study, the chromosomes of two members of the Thermotogales were compared. A whole-genome alignment of Thermotoga maritima MSB8 and Thermotoga neapolitana NS-E has revealed numerous large-scale DNA rearrangements, most of which are associated with CRISPR DNA repeats and/or tRNA genes. These DNA rearrangements do not include the putative origin of DNA replication but move within the same replichore, i.e., the same replicating half of the chromosome (delimited by the replication origin and terminus). Based on cumulative GC skew analysis, both the T. maritima and T. neapolitana lineages contain one or two major inverted DNA segments. Also, based on PCR amplification and sequence analysis of the DNA joints that are associated with the major rearrangements, the overall chromosome architecture was found to be conserved at most DNA joints for other strains of T. neapolitana. Taken together, the results from this analysis suggest that the observed chromosomal rearrangements in the Thermotogales likely occurred by successive inversions after their divergence from a common ancestor and before strain diversification. Finally, sequence analysis shows that size polymorphisms in the DNA joints associated with CRISPRs can be explained by expansion and possibly contraction of the DNA repeat and spacer unit, providing a tool for discerning the relatedness of strains from different geographic locations.
El Kafsi, Hela; Loux, Valentin; Mariadassou, Mahendra; Blin, Camille; Chiapello, Hélène; Abraham, Anne-Laure; Maguin, Emmanuelle; van de Guchte, Maarten
2017-01-01
The first Lactobacillus delbrueckii ssp. bulgaricus genome sequence revealed the presence of a very large inverted repeat (IR), a DNA sequence arrangement which thus far seemed inconceivable in a non-manipulated circular bacterial chromosome, at the replication terminus. This intriguing observation prompted us to investigate if similar IRs could be found in other bacteria. IRs with sizes varying from 38 to 76 kbp were found at the replication terminus of all 5 L. delbrueckii ssp. bulgaricus chromosomes analysed, but in none of 1373 other chromosomes. They represent the first naturally occurring very large IRs detected in circular bacterial genomes. A comparison of the L. bulgaricus replication terminus regions and the corresponding regions without IR in 5 L. delbrueckii ssp. lactis genomes leads us to propose a model for the formation and evolution of the IRs. The DNA sequence data are consistent with a novel model of chromosome rescue after premature replication termination or irreversible chromosome damage near the replication terminus, involving mechanisms analogous to those proposed in the formation of very large IRs in human cancer cells. We postulate that the L. delbrueckii ssp. bulgaricus-specific IRs in different strains derive from a single ancestral IR of at least 93 kbp. PMID:28281695
Aguado, Cristina; Gayà-Vidal, Magdalena; Villatoro, Sergi; Oliva, Meritxell; Izquierdo, David; Giner-Delgado, Carla; Montalvo, Víctor; García-González, Judit; Martínez-Fundichely, Alexander; Capilla, Laia; Ruiz-Herrera, Aurora; Estivill, Xavier; Puig, Marta; Cáceres, Mario
2014-01-01
In recent years different types of structural variants (SVs) have been discovered in the human genome and their functional impact has become increasingly clear. Inversions, however, are poorly characterized and more difficult to study, especially those mediated by inverted repeats or segmental duplications. Here, we describe the results of a simple and fast inverse PCR (iPCR) protocol for high-throughput genotyping of a wide variety of inversions using a small amount of DNA. In particular, we analyzed 22 inversions predicted in humans ranging from 5.1 kb to 226 kb and mediated by inverted repeat sequences of 1.6–24 kb. First, we validated 17 of the 22 inversions in a panel of nine HapMap individuals from different populations, and we genotyped them in 68 additional individuals of European origin, with correct genetic transmission in ∼12 mother-father-child trios. Global inversion minor allele frequency varied between 1% and 49% and inversion genotypes were consistent with Hardy-Weinberg equilibrium. By analyzing the nucleotide variation and the haplotypes in these regions, we found that only four inversions have linked tag-SNPs and that in many cases there are multiple shared SNPs between standard and inverted chromosomes, suggesting an unexpected high degree of inversion recurrence during human evolution. iPCR was also used to check 16 of these inversions in four chimpanzees and two gorillas, and 10 showed both orientations either within or between species, providing additional support for their multiple origin. Finally, we have identified several inversions that include genes in the inverted or breakpoint regions, and at least one disrupts a potential coding gene. Thus, these results represent a significant advance in our understanding of inversion polymorphism in human populations and challenge the common view of a single origin of inversions, with important implications for inversion analysis in SNP-based studies. PMID:24651690
de Cambiaire, Jean-Charles; Otis, Christian; Turmel, Monique; Lemieux, Claude
2007-01-01
Background In the Chlorophyta – the green algal phylum comprising the classes Prasinophyceae, Ulvophyceae, Trebouxiophyceae and Chlorophyceae – the chloroplast genome displays a highly variable architecture. While chlorophycean chloroplast DNAs (cpDNAs) deviate considerably from the ancestral pattern described for the prasinophyte Nephroselmis olivacea, the degree of remodelling sustained by the two ulvophyte cpDNAs completely sequenced to date is intermediate relative to those observed for chlorophycean and trebouxiophyte cpDNAs. Chlorella vulgaris (Chlorellales) is currently the only photosynthetic trebouxiophyte whose complete cpDNA sequence has been reported. To gain insights into the evolutionary trends of the chloroplast genome in the Trebouxiophyceae, we sequenced cpDNA from the filamentous alga Leptosira terrestris (Ctenocladales). Results The 195,081-bp Leptosira chloroplast genome resembles the 150,613-bp Chlorella genome in lacking a large inverted repeat (IR) but differs greatly in gene order. Six of the conserved genes present in Chlorella cpDNA are missing from the Leptosira gene repertoire. The 106 conserved genes, four introns and 11 free standing open reading frames (ORFs) account for 48.3% of the genome sequence. This is the lowest gene density yet observed among chlorophyte cpDNAs. Contrary to the situation in Chlorella but similar to that in the chlorophycean Scenedesmus obliquus, the gene distribution is highly biased over the two DNA strands in Leptosira. Nine genes, compared to only three in Chlorella, have significantly expanded coding regions relative to their homologues in ancestral-type green algal cpDNAs. As observed in chlorophycean genomes, the rpoB gene is fragmented into two ORFs. Short repeats account for 5.1% of the Leptosira genome sequence and are present mainly in intergenic regions. Conclusion Our results highlight the great plasticity of the chloroplast genome in the Trebouxiophyceae and indicate that the IR was lost on at least two separate occasions. The intriguing similarities of the derived features exhibited by Leptosira cpDNA and its chlorophycean counterparts suggest that the same evolutionary forces shaped the IR-lacking chloroplast genomes in these two algal lineages. PMID:17610731
Conserved Sequences at the Origin of Adenovirus DNA Replication
Stillman, Bruce W.; Topp, William C.; Engler, Jeffrey A.
1982-01-01
The origin of adenovirus DNA replication lies within an inverted sequence repetition at either end of the linear, double-stranded viral DNA. Initiation of DNA replication is primed by a deoxynucleoside that is covalently linked to a protein, which remains bound to the newly synthesized DNA. We demonstrate that virion-derived DNA-protein complexes from five human adenovirus serological subgroups (A to E) can act as a template for both the initiation and the elongation of DNA replication in vitro, using nuclear extracts from adenovirus type 2 (Ad2)-infected HeLa cells. The heterologous template DNA-protein complexes were not as active as the homologous Ad2 DNA, most probably due to inefficient initiation by Ad2 replication factors. In an attempt to identify common features which may permit this replication, we have also sequenced the inverted terminal repeated DNA from human adenovirus serotypes Ad4 (group E), Ad9 and Ad10 (group D), and Ad31 (group A), and we have compared these to previously determined sequences from Ad2 and Ad5 (group C), Ad7 (group B), and Ad12 and Ad18 (group A) DNA. In all cases, the sequence around the origin of DNA replication can be divided into two structural domains: a proximal A · T-rich region which is partially conserved among these serotypes, and a distal G · C-rich region which is less well conserved. The G · C-rich region contains sequences similar to sequences present in papovavirus replication origins. The two domains may reflect a dual mechanism for initiation of DNA replication: adenovirus-specific protein priming of replication, and subsequent utilization of this primer by host replication factors for completion of DNA synthesis. Images PMID:7143575
Evolutionary genomics of miniature inverted-repeat transposable elements (MITEs) in Brassica.
Nouroz, Faisal; Noreen, Shumaila; Heslop-Harrison, J S
2015-12-01
Miniature inverted-repeat transposable elements (MITEs) are truncated derivatives of autonomous DNA transposons, and are dispersed abundantly in most eukaryotic genomes. We aimed to characterize various MITEs families in Brassica in terms of their presence, sequence characteristics and evolutionary activity. Dot plot analyses involving comparison of homoeologous bacterial artificial chromosome (BAC) sequences allowed identification of 15 novel families of mobile MITEs. Of which, 5 were Stowaway-like with TA Target Site Duplications (TSDs), 4 Tourist-like with TAA/TTA TSDs, 5 Mutator-like with 9-10 bp TSDs and 1 novel MITE (BoXMITE1) flanked by 3 bp TSDs. Our data suggested that there are about 30,000 MITE-related sequences in Brassica rapa and B. oleracea genomes. In situ hybridization showed one abundant family was dispersed in the A-genome, while another was located near 45S rDNA sites. PCR analysis using primers flanking sequences of MITE elements detected MITE insertion polymorphisms between and within the three Brassica (AA, BB, CC) genomes, with many insertions being specific to single genomes and others showing evidence of more recent evolutionary insertions. Our BAC sequence comparison strategy enables identification of evolutionarily active MITEs with no prior knowledge of MITE sequences. The details of MITE families reported in Brassica enable their identification, characterization and annotation. Insertion polymorphisms of MITEs and their transposition activity indicated important mechanism of genome evolution and diversification. MITE families derived from known Mariner, Harbinger and Mutator DNA transposons were discovered, as well as some novel structures. The identification of Brassica MITEs will have broad applications in Brassica genomics, breeding, hybridization and phylogeny through their use as DNA markers.
The complete chloroplast genome of Sinopodophyllum hexandrum Ying (Berberidaceae).
Meng, Lihua; Liu, Ruijuan; Chen, Jianbing; Ding, Chenxu
2017-05-01
The complete nucleotide sequence of the Sinopodophyllum hexandrum Ying chloroplast genome (cpDNA) was determined based on next-generation sequencing technologies in this study. The genome was 157 203 bp in length, containing a pair of inverted repeat (IRa and IRb) regions of 25 960 bp, which were separated by a large single-copy (LSC) region of 87 065 bp and a small single-copy (SSC) region of 18 218 bp, respectively. The cpDNA contained 148 genes, including 96 protein-coding genes, 8 ribosomal RNA genes, and 44 tRNA genes. In these genes, eight harbored a single intron, and two (ycf3 and clpP) contained a couple of introns. The cpDNA AT content of S. hexandrum cpDNA is 61.5%.
Häring, Monika; Peng, Xu; Brügger, Kim; Rachel, Reinhard; Stetter, Karl O; Garrett, Roger A; Prangishvili, David
2004-06-01
A novel virus, termed Pyrobaculum spherical virus (PSV), is described that infects anaerobic hyperthermophilic archaea of the genera Pyrobaculum and Thermoproteus. Spherical enveloped virions, about 100 nm in diameter, contain a major multimeric 33-kDa protein and host-derived lipids. A viral envelope encases a superhelical nucleoprotein core containing linear double-stranded DNA. The PSV infection cycle does not cause lysis of host cells. The viral genome was sequenced and contains 28337 bp. The genome is unique for known archaeal viruses in that none of the genes, including that encoding the major structural protein, show any significant sequence matches to genes in public sequence databases. Exceptionally for an archaeal double-stranded DNA virus, almost all the recognizable genes are located on one DNA strand. The ends of the genome consist of 190-bp inverted repeats that contain multiple copies of short direct repeats. The two DNA strands are probably covalently linked at their termini. On the basis of the unusual morphological and genomic properties of this DNA virus, we propose to assign PSV to a new viral family, the Globuloviridae.
The complete chloroplast genome sequence of Dianthus superbus var. longicalycinus.
Gurusamy, Raman; Lee, Do-Hyung; Park, SeonJoo
2016-05-01
The complete chloroplast genome (cpDNA) sequence of Dianthus superbus var. longicalycinus is an economically important traditional Chinese medicine was reported and characterized. The cpDNA of Dianthus superbus var. longicalycinus is 149,539 bp, with 36.3% GC content. A pair of inverted repeats (IRs) of 24,803 bp is separated by a large single-copy region (LSC, 82,805 bp) and a small single-copy region (SSC, 17,128 bp). It encodes 85 protein-coding genes, 36 tRNA genes and 8 rRNA genes. Of 129 individual genes, 13 genes encoded one intron and three genes have two introns.
The role of DNA repair in herpesvirus pathogenesis.
Brown, Jay C
2014-10-01
In cells latently infected with a herpesvirus, the viral DNA is present in the cell nucleus, but it is not extensively replicated or transcribed. In this suppressed state the virus DNA is vulnerable to mutagenic events that affect the host cell and have the potential to destroy the virus' genetic integrity. Despite the potential for genetic damage, however, herpesvirus sequences are well conserved after reactivation from latency. To account for this apparent paradox, I have tested the idea that host cell-encoded mechanisms of DNA repair are able to control genetic damage to latent herpesviruses. Studies were focused on homologous recombination-dependent DNA repair (HR). Methods of DNA sequence analysis were employed to scan herpesvirus genomes for DNA features able to activate HR. Analyses were carried out with a total of 39 herpesvirus DNA sequences, a group that included viruses from the alpha-, beta- and gamma-subfamilies. The results showed that all 39 genome sequences were enriched in two or more of the eight recombination-initiating features examined. The results were interpreted to indicate that HR can stabilize latent herpesvirus genomes. The results also showed, unexpectedly, that repair-initiating DNA features differed in alpha- compared to gamma-herpesviruses. Whereas inverted and tandem repeats predominated in alpha-herpesviruses, gamma-herpesviruses were enriched in short, GC-rich initiation sequences such as CCCAG and depleted in repeats. In alpha-herpesviruses, repair-initiating repeat sequences were found to be concentrated in a specific region (the S segment) of the genome while repair-initiating short sequences were distributed more uniformly in gamma-herpesviruses. The results suggest that repair pathways are activated differently in alpha- compared to gamma-herpesviruses. Copyright © 2014. Published by Elsevier Inc.
Seier, Tracey; Padgett, Dana R; Zilberberg, Gal; Sutera, Vincent A; Toha, Noor; Lovett, Susan T
2011-06-01
Strand misalignments at DNA repeats during replication are implicated in mutational hotspots. To study these events, we have generated strains carrying mutations in the Escherichia coli chromosomal lacZ gene that revert via deletion of a short duplicated sequence or by template switching within imperfect inverted repeat (quasipalindrome, QP) sequences. Using these strains, we demonstrate that mutation of the distal repeat of a quasipalindrome, with respect to replication fork movement, is about 10-fold higher than the proximal repeat, consistent with more common template switching on the leading strand. The leading strand bias was lost in the absence of exonucleases I and VII, suggesting that it results from more efficient suppression of template switching by 3' exonucleases targeted to the lagging strand. The loss of 3' exonucleases has no effect on strand misalignment at direct repeats to produce deletion. To compare these events to other mutations, we have reengineered reporters (designed by Cupples and Miller 1989) that detect specific base substitutions or frameshifts in lacZ with the reverting lacZ locus on the chromosome rather than an F' element. This set allows rapid screening of potential mutagens, environmental conditions, or genetic loci for effects on a broad set of mutational events. We found that hydroxyurea (HU), which depletes dNTP pools, slightly elevated templated mutations at inverted repeats but had no effect on deletions, simple frameshifts, or base substitutions. Mutations in nucleotide diphosphate kinase, ndk, significantly elevated simple mutations but had little effect on the templated class. Zebularine, a cytosine analog, elevated all classes.
Klobutcher, L A; Swanton, M T; Donini, P; Prescott, D M
1981-01-01
In hypotrichous ciliates, all of the macronuclear DNA is in the form of low molecular weight molecules with an average size of approximately 2200 base pairs. Total macronuclear DNA from four hypotrichs has been shown to have inverted terminal repeats by direct sequence analysis. In Oxytricha nova, Oxytricha sp., and Stylonychia pustulata, this terminal sequence may be written as 5'-C4A4C4A4C4 ... 3'-G4T4G4T4G4T4G4T4G4 ... In Euplotes aediculatus, the sequences is similar but differs in the lengths of the duplex region (28 base pairs) and of the putative 3' extension (14 base pairs). Also in Euplotes, a second common sequence of 5 base pairs (A-A-C-T-T-T-T-G-A-A) occurs internal to the terminal repeat and a 17-base-pair heterogeneous region: 5'-C4A4C4A4C4A4C4(X)17T-T-G-A-A ... 3'-G2T4G4T4G4T4G4T4G4T4G4(X)17A-A-C-T-T ... The length of the terminal repeat sequence for O. nova was confirmed in cloned macronuclear DNA molecules. Images PMID:6265931
Ayesh, Basim M
2017-01-01
Molecular markers are credible for the discrimination of genotypes and estimation of the extent of genetic diversity and relatedness in a set of genotypes. Inter-simple sequence repeat (ISSR) markers rapidly reveal high polymorphic fingerprints and have been used frequently to determine the genetic diversity among date palm cultivars. This chapter describes the application of ISSR markers for genotyping of date palm cultivars. The application involves extraction of genomic DNA from the target cultivars with reliable quality and quantity. Subsequently the extracted DNA serves as a template for amplification of genomic regions flanked by inverted simple sequence repeats using a single primer. The similarity of each pair of samples is measured by calculating the number of mono- and polymorphic bands revealed by gel electrophoresis. Matrices constructed for similarity and genetic distance are used to build a phylogenetic tree and cluster analysis, to determine the molecular relatedness of cultivars. The protocol describes 3 out of 9 tested primers consistently amplified 31 loci in 6 date palm cultivars, with 28 polymorphic loci.
Mre11-Sae2 and RPA Collaborate to Prevent Palindromic Gene Amplification.
Deng, Sarah K; Yin, Yi; Petes, Thomas D; Symington, Lorraine S
2015-11-05
Foldback priming at DNA double-stranded breaks is one mechanism proposed to initiate palindromic gene amplification, a common feature of cancer cells. Here, we show that small (5-9 bp) inverted repeats drive the formation of large palindromic duplications, the major class of chromosomal rearrangements recovered from yeast cells lacking Sae2 or the Mre11 nuclease. RPA dysfunction increased the frequency of palindromic duplications in Sae2 or Mre11 nuclease-deficient cells by ∼ 1,000-fold, consistent with intra-strand annealing to create a hairpin-capped chromosome that is subsequently replicated to form a dicentric isochromosome. The palindromic duplications were frequently associated with duplication of a second chromosome region bounded by a repeated sequence and a telomere, suggesting the dicentric chromosome breaks and repairs by recombination between dispersed repeats to acquire a telomere. We propose secondary structures within single-stranded DNA are potent instigators of genome instability, and RPA and Mre11-Sae2 play important roles in preventing their formation and propagation, respectively. Copyright © 2015 Elsevier Inc. All rights reserved.
Szuplewska, Magdalena; Ludwiczak, Marta; Lyzwa, Katarzyna; Czarnecki, Jakub; Bartosik, Dariusz
2014-01-01
Functional transposable elements (TEs) of several Pseudomonas spp. strains isolated from black shale ore of Lubin mine and from post-flotation tailings of Zelazny Most in Poland, were identified using a positive selection trap plasmid strategy. This approach led to the capture and characterization of (i) 13 insertion sequences from 5 IS families (IS3, IS5, ISL3, IS30 and IS1380), (ii) isoforms of two Tn3-family transposons--Tn5563a and Tn4662a (the latter contains a toxin-antitoxin system), as well as (iii) non-autonomous TEs of diverse structure, ranging in size from 262 to 3892 bp. The non-autonomous elements transposed into AT-rich DNA regions and generated 5- or 6-bp sequence duplications at the target site of transposition. Although these TEs lack a transposase gene, they contain homologous 38-bp-long terminal inverted repeat sequences (IRs), highly conserved in Tn5563a and many other Tn3-family transposons. The simplest elements of this type, designated TIMEs (Tn3 family-derived Inverted-repeat Miniature Elements) (262 bp), were identified within two natural plasmids (pZM1P1 and pLM8P2) of Pseudomonas spp. It was demonstrated that TIMEs are able to mobilize segments of plasmid DNA for transposition, which results in the generation of more complex non-autonomous elements, resembling IS-driven composite transposons in structure. Such transposon-like elements may contain different functional genetic modules in their core regions, including plasmid replication systems. Another non-autonomous element "captured" with a trap plasmid was a TIME derivative containing a predicted resolvase gene and a res site typical for many Tn3-family transposons. The identification of a portable site-specific recombination system is another intriguing example confirming the important role of non-autonomous TEs of the TIME family in shuffling genetic information in bacterial genomes. Transposition of such mosaic elements may have a significant impact on diversity and evolution, not only of transposons and plasmids, but also of other types of mobile genetic elements.
Galli, Alvaro; Cervelli, Tiziana; Schiestl, Robert H
2003-05-01
The DNA polymerase delta (Pol3p/Cdc2p) allele pol3-t of Saccharomyces cerevisiae has previously been shown to increase the frequency of deletions between short repeats (several base pairs), between homologous DNA sequences separated by long inverted repeats, and between distant short repeats, increasing the frequency of genomic deletions. We found that the pol3-t mutation increased intrachromosomal recombination events between direct DNA repeats up to 36-fold and interchromosomal recombination 14-fold. The hyperrecombination phenotype of pol3-t was partially dependent on the Rad52p function but much more so on Rad1p. However, in the double-mutant rad1 Delta rad52 Delta, the pol3-t mutation still increased spontaneous intrachromosomal recombination frequencies, suggesting that a Rad1p Rad52p-independent single-strand annealing pathway is involved. UV and gamma-rays were less potent inducers of recombination in the pol3-t mutant, indicating that Pol3p is partly involved in DNA-damage-induced recombination. In contrast, while UV- and gamma-ray-induced intrachromosomal recombination was almost completely abolished in the rad52 or the rad1 rad52 mutant, there was still good induction in those mutants in the pol3-t background, indicating channeling of lesions into the above-mentioned Rad1p Rad52p-independent pathway. Finally, a heterozygous pol3-t/POL3 mutant also showed an increased frequency of deletions and MMS sensitivity at the restrictive temperature, indicating that even a heterozygous polymerase delta mutation might increase the frequency of genetic instability.
Pombert, Jean-François; Lemieux, Claude; Turmel, Monique
2006-01-01
Background The phylum Chlorophyta contains the majority of the green algae and is divided into four classes. The basal position of the Prasinophyceae has been well documented, but the divergence order of the Ulvophyceae, Trebouxiophyceae and Chlorophyceae is currently debated. The four complete chloroplast DNA (cpDNA) sequences presently available for representatives of these classes have revealed extensive variability in overall structure, gene content, intron composition and gene order. The chloroplast genome of Pseudendoclonium (Ulvophyceae), in particular, is characterized by an atypical quadripartite architecture that deviates from the ancestral type by a large inverted repeat (IR) featuring an inverted rRNA operon and a small single-copy (SSC) region containing 14 genes normally found in the large single-copy (LSC) region. To gain insights into the nature of the events that led to the reorganization of the chloroplast genome in the Ulvophyceae, we have determined the complete cpDNA sequence of Oltmannsiellopsis viridis, a representative of a distinct, early diverging lineage. Results The 151,933 bp IR-containing genome of Oltmannsiellopsis differs considerably from Pseudendoclonium and other chlorophyte cpDNAs in intron content and gene order, but shares close similarities with its ulvophyte homologue at the levels of quadripartite architecture, gene content and gene density. Oltmannsiellopsis cpDNA encodes 105 genes, contains five group I introns, and features many short dispersed repeats. As in Pseudendoclonium cpDNA, the rRNA genes in the IR are transcribed toward the single copy region featuring the genes typically found in the ancestral LSC region, and the opposite single copy region harbours genes characteristic of both the ancestral SSC and LSC regions. The 52 genes that were transferred from the ancestral LSC to SSC region include 12 of those observed in Pseudendoclonium cpDNA. Surprisingly, the overall gene organization of Oltmannsiellopsis cpDNA more closely resembles that of Chlorella (Trebouxiophyceae) cpDNA. Conclusion The chloroplast genome of the last common ancestor of Oltmannsiellopsis and Pseudendoclonium contained a minimum of 108 genes, carried only a few group I introns, and featured a distinctive quadripartite architecture. Numerous changes were experienced by the chloroplast genome in the lineages leading to Oltmannsiellopsis and Pseudendoclonium. Our comparative analyses of chlorophyte cpDNAs support the notion that the Ulvophyceae is sister to the Chlorophyceae. PMID:16472375
Freeman, S.; Redman, R.S.; Grantham, G.; Rodriguez, R.J.
1997-01-01
A 7.4-kilobase (kb) DNA plasmid was isolated from Glomerella musae isolate 927 and designated pGML1. Exonuclease treatments indicated that pGML1 was a linear plasmid with blocked 5' termini. Cell-fractionation experiments combined with sequence-specific PCR amplification revealed that pGML1 resided in mitochondria. The pGML1 plasmid hybridized to cesium chloride-fractionated nuclear DNA but not to A + T-rich mitochondrial DNA. An internal 7.0-kb section of pGML1 was cloned and did not hybridize with either nuclear or mitochondrial DNA from G. musae. Sequence analysis revealed identical terminal inverted repeats (TIR) of 520 bp at the ends of the cloned 7.0-kb section of pGML1. The occurrence of pGML1 did not correspond with the pathogenicity of G. musae on banana fruit. Four additional isolates of G. musae possessed extrachromosomal DNA fragments similar in size and sequence to pGML1.
Target Capture during Mos1 Transposition*
Pflieger, Aude; Jaillet, Jerôme; Petit, Agnès; Augé-Gouillou, Corinne; Renault, Sylvaine
2014-01-01
DNA transposition contributes to genomic plasticity. Target capture is a key step in the transposition process, because it contributes to the selection of new insertion sites. Nothing or little is known about how eukaryotic mariner DNA transposons trigger this step. In the case of Mos1, biochemistry and crystallography have deciphered several inverted terminal repeat-transposase complexes that are intermediates during transposition. However, the target capture complex is still unknown. Here, we show that the preintegration complex (i.e., the excised transposon) is the only complex able to capture a target DNA. Mos1 transposase does not support target commitment, which has been proposed to explain Mos1 random genomic integrations within host genomes. We demonstrate that the TA dinucleotide used as the target is crucial both to target recognition and in the chemistry of the strand transfer reaction. Bent DNA molecules are better targets for the capture when the target DNA is nicked two nucleotides apart from the TA. They improve strand transfer when the target DNA contains a mismatch near the TA dinucleotide. PMID:24269942
Target capture during Mos1 transposition.
Pflieger, Aude; Jaillet, Jerôme; Petit, Agnès; Augé-Gouillou, Corinne; Renault, Sylvaine
2014-01-03
DNA transposition contributes to genomic plasticity. Target capture is a key step in the transposition process, because it contributes to the selection of new insertion sites. Nothing or little is known about how eukaryotic mariner DNA transposons trigger this step. In the case of Mos1, biochemistry and crystallography have deciphered several inverted terminal repeat-transposase complexes that are intermediates during transposition. However, the target capture complex is still unknown. Here, we show that the preintegration complex (i.e., the excised transposon) is the only complex able to capture a target DNA. Mos1 transposase does not support target commitment, which has been proposed to explain Mos1 random genomic integrations within host genomes. We demonstrate that the TA dinucleotide used as the target is crucial both to target recognition and in the chemistry of the strand transfer reaction. Bent DNA molecules are better targets for the capture when the target DNA is nicked two nucleotides apart from the TA. They improve strand transfer when the target DNA contains a mismatch near the TA dinucleotide.
Marzo, Mar; Liu, Danxu; Ruiz, Alfredo; Chalmers, Ronald
2013-01-01
Galileo is a DNA transposon responsible for the generation of several chromosomal inversions in Drosophila. In contrast to other members of the P-element superfamily, it has unusually long terminal inverted-repeats (TIRs) that resemble those of Foldback elements. To investigate the function of the long TIRs we derived consensus and ancestral sequences for the Galileo transposase in three species of Drosophilids. Following gene synthesis, we expressed and purified their constituent THAP domains and tested their binding activity towards the respective Galileo TIRs. DNase I footprinting located the most proximal DNA binding site about 70 bp from the transposon end. Using this sequence we identified further binding sites in the tandem repeats that are found within the long TIRs. This suggests that the synaptic complex between Galileo ends may be a complicated structure containing higher-order multimers of the transposase. We also attempted to reconstitute Galileo transposition in Drosophila embryos but no events were detected. Thus, although the limited numbers of Galileo copies in each genome were sufficient to provide functional consensus sequences for the THAP domains, they do not specify a fully active transposase. Since the THAP recognition sequence is short, and will occur many times in a large genome, it seems likely that the multiple binding sites within the long, internally repetitive, TIRs of Galileo and other Foldback-like elements may provide the transposase with its binding specificity. PMID:23648487
Withey, Jeffrey H; DiRita, Victor J
2005-05-01
The Gram-negative bacterium Vibrio cholerae is the infectious agent responsible for the disease Asiatic cholera. The genes required for V. cholerae virulence, such as those encoding the cholera toxin (CT) and toxin-coregulated pilus (TCP), are controlled by a cascade of transcriptional activators. Ultimately, the direct transcriptional activator of the majority of V. cholerae virulence genes is the AraC/XylS family member ToxT protein, the expression of which is activated by the ToxR and TcpP proteins. Previous studies have identified the DNA sites to which ToxT binds upstream of the ctx operon, encoding CT, and the tcpA operon, encoding, among other products, the major subunit of the TCP. These known ToxT binding sites are seemingly dissimilar in sequence other than being A/T rich. Further results suggested that ctx and tcpA each has a pair of ToxT binding sites arranged in a direct repeat orientation upstream of the core promoter elements. In this work, using both transcriptional lacZ fusions and in vitro copper-phenanthroline footprinting experiments, we have identified the ToxT binding sites between the divergently transcribed acfA and acfD genes, which encode components of the accessory colonization factor required for efficient intestinal colonization by V. cholerae. Our results indicate that ToxT binds to a pair of DNA sites between acfA and acfD in an inverted repeat orientation. Moreover, a mutational analysis of the ToxT binding sites indicates that both binding sites are required by ToxT for transcriptional activation of both acfA and acfD. Using copper-phenanthroline footprinting to assess the occupancy of ToxT on DNA having mutations in one of these binding sites, we found that protection by ToxT of the unaltered binding site was not affected, whereas protection by ToxT of the mutant binding site was significantly reduced in the region of the mutations. The results of further footprinting experiments using DNA templates having +5 bp and +10 bp insertions between the two ToxT binding sites indicate that both binding sites are occupied by ToxT regardless of their positions relative to each other. Based on these results, we propose that ToxT binds independently to two DNA sites between acfA and acfD to activate transcription of both genes.
Schnare, Murray N.; Collings, James C.; Spencer, David F.; Gray, Michael W.
2000-01-01
In Crithidia fasciculata, the ribosomal RNA (rRNA) gene repeats range in size from ∼11 to 12 kb. This length heterogeneity is localized to a region of the intergenic spacer (IGS) that contains tandemly repeated copies of a 19mer sequence. The IGS also contains four copies of an ∼55 nt repeat that has an internal inverted repeat and is also present in the IGS of Leishmania species. We have mapped the C.fasciculata transcription initiation site as well as two other reverse transcriptase stop sites that may be analogous to the A0 and A′ pre-rRNA processing sites within the 5′ external transcribed spacer (ETS) of other eukaryotes. Features that could influence processing at these sites include two stretches of conserved primary sequence and three secondary structure elements present in the 5′ ETS. We also characterized the C.fasciculata U3 snoRNA, which has the potential for base-pairing with pre-rRNA sequences. Finally, we demonstrate that biosynthesis of large subunit rRNA in both C.fasciculata and Trypanosoma brucei involves 3′-terminal addition of three A residues that are not present in the corresponding DNA sequences. PMID:10982863
Active Site Sharing and Subterminal Hairpin Recognition in a New Class of DNA Transposases
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ronning, Donald R.; Guynet, Catherine; Ton-Hoang, Bao
2010-07-20
Many bacteria harbor simple transposable elements termed insertion sequences (IS). In Helicobacter pylori, the chimeric IS605 family elements are particularly interesting due to their proximity to genes encoding gastric epithelial invasion factors. Protein sequences of IS605 transposases do not bear the hallmarks of other well-characterized transposases. We have solved the crystal structure of full-length transposase (TnpA) of a representative member, ISHp608. Structurally, TnpA does not resemble any characterized transposase; rather, it is related to rolling circle replication (RCR) proteins. Consistent with RCR, Mg{sup 2+} and a conserved tyrosine, Tyr127, are essential for DNA nicking and the formation of a covalentmore » intermediate between TnpA and DNA. TnpA is dimeric, contains two shared active sites, and binds two DNA stem loops representing the conserved inverted repeats near each end of ISHp608. The cocrystal structure with stem-loop DNA illustrates how this family of transposases specifically recognizes and pairs ends, necessary steps during transposition.« less
The complete chloroplast genome sequence of Dendrobium officinale.
Yang, Pei; Zhou, Hong; Qian, Jun; Xu, Haibin; Shao, Qingsong; Li, Yonghua; Yao, Hui
2016-01-01
The complete chloroplast sequence of Dendrobium officinale, an endangered and economically important traditional Chinese medicine, was reported and characterized. The genome size is 152,018 bp, with 37.5% GC content. A pair of inverted repeats (IRs) of 26,284 bp are separated by a large single-copy region (LSC, 84,944 bp) and a small single-copy region (SSC, 14,506 bp). The complete cp DNA contains 83 protein-coding genes, 39 tRNA genes and 8 rRNA genes. Fourteen genes contained one or two introns.
Foldback intercoil DNA and the mechanism of DNA transposition.
Kim, Byung-Dong
2014-09-01
Foldback intercoil (FBI) DNA is formed by the folding back at one point of a non-helical parallel track of double-stranded DNA at as sharp as 180° and the intertwining of two double helixes within each other's major groove to form an intercoil with a diameter of 2.2 nm. FBI DNA has been suggested to mediate intra-molecular homologous recombination of a deletion and inversion. Inter-molecular homologous recombination, known as site-specific insertion, on the other hand, is mediated by the direct perpendicular approach of the FBI DNA tip, as the attP site, onto the target DNA, as the attB site. Transposition of DNA transposons involves the pairing of terminal inverted repeats and 5-7-bp tandem target duplication. FBI DNA configuration effectively explains simple as well as replicative transposition, along with the involvement of an enhancer element. The majority of diverse retrotransposable elements that employ a target site duplication mechanism is also suggested to follow the FBI DNA-mediated perpendicular insertion of the paired intercoil ends by non-homologous end-joining, together with gap filling. A genome-wide perspective of transposable elements in light of FBI DNA is discussed.
Spielmann, A; Stutz, E
1983-10-25
The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2.
DNA transposons have colonized the genome of the giant virus Pandoravirus salinus.
Sun, Cheng; Feschotte, Cédric; Wu, Zhiqiang; Mueller, Rachel Lockridge
2015-06-12
Transposable elements are mobile DNA sequences that are widely distributed in prokaryotic and eukaryotic genomes, where they represent a major force in genome evolution. However, transposable elements have rarely been documented in viruses, and their contribution to viral genome evolution remains largely unexplored. Pandoraviruses are recently described DNA viruses with genome sizes that exceed those of some prokaryotes, rivaling parasitic eukaryotes. These large genomes appear to include substantial noncoding intergenic spaces, which provide potential locations for transposable element insertions. However, no mobile genetic elements have yet been reported in pandoravirus genomes. Here, we report a family of miniature inverted-repeat transposable elements (MITEs) in the Pandoravirus salinus genome, representing the first description of a virus populated with a canonical transposable element family that proliferated by transposition within the viral genome. The MITE family, which we name Submariner, includes 30 copies with all the hallmarks of MITEs: short length, terminal inverted repeats, TA target site duplication, and no coding capacity. Submariner elements show signs of transposition and are undetectable in the genome of Pandoravirus dulcis, the closest known relative Pandoravirus salinus. We identified a DNA transposon related to Submariner in the genome of Acanthamoeba castellanii, a species thought to host pandoraviruses, which contains remnants of coding sequence for a Tc1/mariner transposase. These observations suggest that the Submariner MITEs of P. salinus belong to the widespread Tc1/mariner superfamily and may have been mobilized by an amoebozoan host. Ten of the 30 MITEs in the P. salinus genome are located within coding regions of predicted genes, while others are close to genes, suggesting that these transposons may have contributed to viral genetic novelty. Our discovery highlights the remarkable ability of DNA transposons to colonize and shape genomes from all domains of life, as well as giant viruses. Our findings continue to blur the division between viral and cellular genomes, adhering to the emerging view that the content, dynamics, and evolution of the genomes of giant viruses do not substantially differ from those of cellular organisms.
USDA-ARS?s Scientific Manuscript database
Small RNAs regulate the genome by guiding transcriptional and post-transcriptional silencing machinery to specific target sequences, including genes and transposable elements (TEs). Although miniature inverted-repeat transposable elements (MITEs) are closely associated with euchromatic genes, the br...
E622, a miniature, virulence-associated mobile element.
Stavrinides, John; Kirzinger, Morgan W B; Beasley, Federico C; Guttman, David S
2012-01-01
Miniature inverted terminal repeat elements (MITEs) are nonautonomous mobile elements that have a significant impact on bacterial evolution. Here we characterize E622, a 611-bp virulence-associated MITE from Pseudomonas syringae, which contains no coding region but has almost perfect 168-bp inverted repeats. Using an antibiotic coupling assay, we show that E622 is transposable and can mobilize an antibiotic resistance gene contained between its borders. Its predicted parent element, designated TnE622, has a typical transposon structure with a three-gene operon, consisting of resolvase, integrase, and exeA-like genes, which is bounded by the same terminal inverted repeats as E622. A broader genome level survey of the E622/TnE622 inverted repeats identified homologs in Pseudomonas, Salmonella, Shewanella, Erwinia, Pantoea, and the cyanobacteria Nostoc and Cyanothece, many of which appear to encompass known virulence genes, including genes encoding toxins, enzymes, and type III secreted effectors. Its association with niche-specific genetic determinants, along with its persistence and evolutionary diversification, indicates that this mobile element family has played a prominent role in the evolution of many agriculturally and clinically relevant pathogenic bacteria.
Mutational Dynamics of Aroid Chloroplast Genomes
Ahmed, Ibrar; Biggs, Patrick J.; Matthews, Peter J.; Collins, Lesley J.; Hendy, Michael D.; Lockhart, Peter J.
2012-01-01
A characteristic feature of eukaryote and prokaryote genomes is the co-occurrence of nucleotide substitution and insertion/deletion (indel) mutations. Although similar observations have also been made for chloroplast DNA, genome-wide associations have not been reported. We determined the chloroplast genome sequences for two morphotypes of taro (Colocasia esculenta; family Araceae) and compared these with four publicly available aroid chloroplast genomes. Here, we report the extent of genome-wide association between direct and inverted repeats, indels, and substitutions in these aroid chloroplast genomes. We suggest that alternative but not mutually exclusive hypotheses explain the mutational dynamics of chloroplast genome evolution. PMID:23204304
Zhang, H-H; Shen, Y-H; Xu, H-E; Liang, H-Y; Han, M-J; Zhang, Z
2013-10-01
Comparative analysis of transposable elements (TEs) from different species can make it possible to reconstruct their history over evolutionary time. In this study, we identified a novel hAT element in Bombyx mori and Rhodnius prolixus with characteristic GGGCGGCA repeats in its subterminal region. Meanwhile, phylogenetic analysis demonstrated that the elements in these two species might represent a separate cluster of the hAT superfamily. Strikingly, a previously identified miniature inverted repeat transposable element (MITE) shared high identity with this autonomous element across the entire length, supporting the hypothesis that MITEs are derived from the internal deletion of DNA transposons. Interestingly, identity of the consensus sequences of this novel hAT element between B. mori and R. prolixus, which diverged about 370 million years ago, was as high as 96.5% over their full length (about 3.6 kb) at the nucleotide level. The patchy distribution amongst species, coupled with overall lack of intense purifying selection acting on this element, suggest that this novel hAT element might have experienced horizontal transfer between the ancestors of B. mori and R. prolixus. Our results highlight that this novel hAT element could be used as a potential tool for germline transformation of R. prolixus to control the transmission of Trypanosoma cruzi, which causes Chagas disease. © 2013 Royal Entomological Society.
Tsutakawa, Susan E.; Thompson, Mark J.; Arvai, Andrew S.; ...
2017-06-27
DNA replication and repair enzyme Flap Endonuclease 1 (FEN1) is vital for genome integrity, and FEN1 mutations arise in multiple cancers. FEN1 precisely cleaves single-stranded (ss) 5'-flaps one nucleotide into duplex (ds) DNA. Yet, how FEN1 selects for but does not incise the ss 5'-flap was enigmatic. Here we combine crystallographic, biochemical and genetic analyses to show that two dsDNA binding sites set the 5'polarity and to reveal unexpected control of the DNA phosphodiester backbone by electrostatic interactions. Via phosphate steering', basic residues energetically steer an inverted ss 5'-flap through a gateway over FEN1's active site and shift dsDNA formore » catalysis. Mutations of these residues cause an 18,000-fold reduction in catalytic rate in vitro and large-scale trinucleotide (GAA) n repeat expansions in vivo, implying failed phosphate-steering promotes an unanticipated lagging-strand template-switch mechanism during replication. Thus, phosphate steering is an unappreciated FEN1 function that enforces 5'-flap specificity and catalysis, preventing genomic instability.« less
☆DNA assembly technique simplifies the construction of infectious clone of fowl adenovirus.
Zou, Xiao-Hui; Bi, Zhi-Xiang; Guo, Xiao-Juan; Zhang, Zun; Zhao, Yang; Wang, Min; Zhu, Ya-Lu; Jie, Hong-Ying; Yu, Yang; Hung, Tao; Lu, Zhuo-Zhuang
2018-07-01
Plasmid bearing adenovirus genome is generally constructed with the method of homologous recombination in E. coli BJ5183 strain. Here, we utilized Gibson gene assembly technique to generate infectious clone of fowl adenovirus 4 (FAdV-4). Primers flanked with partial inverted terminal repeat (ITR) sequence of FAdV-4 were synthesized to amplify a plasmid backbone containing kanamycin-resistant gene and pBR322 origin (KAN-ORI). DNA assembly was carried out by combining the KAN-ORI fragment, virus genomic DNA and DNA assembly master mix. E. coli competent cells were transformed with the assembled product, and plasmids (pKFAV4) were extracted and confirmed to contain viral genome by restriction analysis and sequencing. Virus was successfully rescued from linear pKFAV4-transfected chicken LMH cells. This approach was further verified in cloning of human adenovirus 5 genome. Our results indicated that DNA assembly technique simplified the construction of infectious clone of adenovirus, suggesting its possible application in virus traditional or reverse genetics. Copyright © 2018 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tsutakawa, Susan E.; Thompson, Mark J.; Arvai, Andrew S.
DNA replication and repair enzyme Flap Endonuclease 1 (FEN1) is vital for genome integrity, and FEN1 mutations arise in multiple cancers. FEN1 precisely cleaves single-stranded (ss) 5'-flaps one nucleotide into duplex (ds) DNA. Yet, how FEN1 selects for but does not incise the ss 5'-flap was enigmatic. Here we combine crystallographic, biochemical and genetic analyses to show that two dsDNA binding sites set the 5'polarity and to reveal unexpected control of the DNA phosphodiester backbone by electrostatic interactions. Via phosphate steering', basic residues energetically steer an inverted ss 5'-flap through a gateway over FEN1's active site and shift dsDNA formore » catalysis. Mutations of these residues cause an 18,000-fold reduction in catalytic rate in vitro and large-scale trinucleotide (GAA) n repeat expansions in vivo, implying failed phosphate-steering promotes an unanticipated lagging-strand template-switch mechanism during replication. Thus, phosphate steering is an unappreciated FEN1 function that enforces 5'-flap specificity and catalysis, preventing genomic instability.« less
Lepetit, D; Pasquet, S; Olive, M; Thézé, N; Thiébaud, P
2000-01-01
We have characterised from Xenopus laevis two new short interspersed repetitive elements, we have named Glider and Vision, that belong to the family of miniature inverted-repeat transposable elements (MITEs). Glider was first characterised in an intronic region of the alpha-tropomyosin (alpha-TM) gene and database search has revealed the presence of this element in 10 other Xenopus laevis genes. Glider elements are about 150 bp long and for some of them, their terminal inverted repeats are flanked by potential target-site duplications. Evidence for the mobility of Glider element has been provided by the presence/absence of one element at corresponding location in duplicated alpha-TM genes. Vision element has been identified in the promoter region of the cyclin dependant kinase 2 gene (cdk2) where it is boxed in a Glider element. Vision is 284bp long and is framed by 14-bp terminal inverted repeats that are flanked by 7-bp direct repeats. We have estimated that there are about 20,000 and 300 copies of Glider and Vision respectively scattered throughout the Xenopus laevis genome. Every MITEs elements but two described in our study are found either in 5' or in 3' regulatory regions of genes suggesting a potential role in gene regulation.
Fisher, R P; Topper, J N; Clayton, D A
1987-07-17
Selective transcription of human mitochondrial DNA requires a transcription factor (mtTF) in addition to an essentially nonselective RNA polymerase. Partially purified mtTF is able to sequester promoter-containing DNA in preinitiation complexes in the absence of mitochondrial RNA polymerase, suggesting a DNA-binding mechanism for factor activity. Functional domains, required for positive transcriptional regulation by mtTF, are identified within both major promoters of human mtDNA through transcription of mutant promoter templates in a reconstituted in vitro system. These domains are essentially coextensive with DNA sequences protected from nuclease digestion by mtTF-binding. Comparison of the sequences of the two mtTF-responsive elements reveals significant homology only when one sequence is inverted; the binding sites are in opposite orientations with respect to the predominant direction of transcription. Thus mtTF may function bidirectionally, requiring additional protein-DNA interactions to dictate transcriptional polarity. The mtTF-responsive elements are arrayed as direct repeats, separated by approximately 80 bp within the displacement-loop region of human mitochondrial DNA; this arrangement may reflect duplication of an ancestral bidirectional promoter, giving rise to separate, unidirectional promoters for each strand.
Spooner, David M; Ruess, Holly; Iorizzo, Massimo; Senalik, Douglas; Simon, Philipp
2017-02-01
We explored the phylogenetic utility of entire plastid DNA sequences in Daucus and compared the results with prior phylogenetic results using plastid and nuclear DNA sequences. We used Illumina sequencing to obtain full plastid sequences of 37 accessions of 20 Daucus taxa and outgroups, analyzed the data with phylogenetic methods, and examined evidence for mitochondrial DNA transfer to the plastid ( Dc MP). Our phylogenetic trees of the entire data set were highly resolved, with 100% bootstrap support for most of the external and many of the internal clades, except for the clade of D. carota and its most closely related species D. syrticus . Subsets of the data, including regions traditionally used as phylogenetically informative regions, provide various degrees of soft congruence with the entire data set. There are areas of hard incongruence, however, with phylogenies using nuclear data. We extended knowledge of a mitochondrial to plastid DNA insertion sequence previously named Dc MP and identified the first instance in flowering plants of a sequence of potential nuclear genome origin inserted into the plastid genome. There is a relationship of inverted repeat junction classes and repeat DNA to phylogeny, but no such relationship with nonsynonymous mutations. Our data have allowed us to (1) produce a well-resolved plastid phylogeny of Daucus , (2) evaluate subsets of the entire plastid data for phylogeny, (3) examine evidence for plastid and nuclear DNA phylogenetic incongruence, and (4) examine mitochondrial and nuclear DNA insertion into the plastid. © 2017 Spooner et al. Published by the Botanical Society of America. This work is licensed under a Creative Commons public domain license (CC0 1.0).
Cordeiro, I B; Castro, D P; Nogueira, P P O; Angelo, P C S; Nogueira, P A; Gonçalves, J F C; Pereira, A M R F; Garcia, J S; Souza, G H M F; Arruda, M A Z; Eberlin, M N; Astolfi-Filho, S; Andrade, E V; López-Lozano, J L
2013-10-29
Chromobacterium violaceum is a Gram-negative proteobacteria found in water and soil; it is widely distributed in tropical and subtropical regions, such as the Amazon rainforest. We examined protein expression changes that occur in C. violaceum at different growth temperatures using electrophoresis and mass spectrometry. The total number of spots detected was 1985; the number ranged from 99 to 380 in each assay. The proteins that were identified spectrometrically were categorized as chaperones, proteins expressed exclusively under heat stress, enzymes involved in the respiratory and fermentation cycles, ribosomal proteins, and proteins related to transport and secretion. Controlling inverted repeat of chaperone expression and inverted repeat DNA binding sequences, as well as regions recognized by sigma factor 32, elements involved in the genetic regulation of the bacterial stress response, were identified in the promoter regions of several of the genes coding proteins, involved in the C. violaceum stress response. We found that 30 °C is the optimal growth temperature for C. violaceum, whereas 25, 35, and 40 °C are stressful temperatures that trigger the expression of chaperones, superoxide dismutase, a probable small heat shock protein, a probable phasing, ferrichrome-iron receptor protein, elongation factor P, and an ornithine carbamoyltransferase catabolite. This information improves our comprehension of the mechanisms involved in stress adaptation by C. violaceum.
Spielmann, A; Stutz, E
1983-01-01
The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2. PMID:6314279
Guérillot, Romain; Siguier, Patricia; Gourbeyre, Edith; Chandler, Michael; Glaser, Philippe
2014-01-01
Transposable elements (TEs) are major components of both prokaryotic and eukaryotic genomes and play a significant role in their evolution. In this study, we have identified new prokaryotic DDE transposase families related to the eukaryotic Mutator-like transposases. These genes were retrieved by cascade PSI-Blast using as initial query the transposase of the streptococcal integrative and conjugative element (ICE) TnGBS2. By combining secondary structure predictions and protein sequence alignments, we predicted the DDE catalytic triad and the DNA-binding domain recognizing the terminal inverted repeats. Furthermore, we systematically characterized the organization and the insertion specificity of the TEs relying on these prokaryotic Mutator-like transposases (p-MULT) for their mobility. Strikingly, two distant TE families target their integration upstream σA dependent promoters. This allowed us to identify a transposase sequence signature associated with this unique insertion specificity and to show that the dissymmetry between the two inverted repeats is responsible for the orientation of the insertion. Surprisingly, while DDE transposases are generally associated with small and simple transposons such as insertion sequences (ISs), p-MULT encoding TEs show an unprecedented diversity with several families of IS, transposons, and ICEs ranging in size from 1.1 to 52 kb. PMID:24418649
Gu, Cuihua; Tembrock, Luke R.; Johnson, Nels G.; Simmons, Mark P.; Wu, Zhiqiang
2016-01-01
Lagerstroemia (crape myrtle) is an important plant genus used in ornamental horticulture in temperate regions worldwide. As such, numerous hybrids have been developed. However, DNA sequence resources and genome information for Lagerstroemia are limited, hindering evolutionary inferences regarding interspecific relationships. We report the complete plastid genome of Lagerstroemia fauriei. To our knowledge, this is the first reported whole plastid genome within Lythraceae. This genome is 152,440 bp in length with 38% GC content and consists of two single-copy regions separated by a pair of 25,793 bp inverted repeats. The large single copy and the small single copy regions span 83,921 bp and 16,933 bp, respectively. The genome contains 129 genes, including 17 located in each inverted repeat. Phylogenetic analysis of genera sampled from Geraniaceae, Myrtaceae, and Onagraceae corroborated the sister relationship between Lythraceae and Onagraceae. The plastid genomes of L. fauriei and several other Lythraceae species lack the rpl2 intron, which indicating an early loss of this intron within the Lythraceae lineage. The plastid genome of L. fauriei provides a much needed genetic resource for further phylogenetic research in Lagerstroemia and Lythraceae. Highly variable markers were identified for application in phylogenetic, barcoding and conservation genetic applications. PMID:26950701
Human Xq28 inversion polymorphism: From sex linkage to Genomics--A genetic mother lode.
Kirby, Cait S; Kolber, Natalie; Salih Almohaidi, Asmaa M; Bierwert, Lou Ann; Saunders, Lori; Williams, Steven; Merritt, Robert
2016-01-01
An inversion polymorphism of the filamin and emerin genes at the tip of the long arm of the human X-chromosome serves as the basis of an investigative laboratory in which students learn something new about their own genomes. Long, nearly identical inverted repeats flanking the filamin and emerin genes illustrate how repetitive elements can lead to alterations in genome structure (inversions) through nonallelic homologous recombination. The near identity of the inverted repeats is an example of concerted evolution through gene conversion. While the laboratory in its entirety is designed for college level genetics courses, portions of the laboratory are appropriate for courses at other levels. Because the polymorphism is on the X-chromosome, the laboratory can be used in introductory biology courses to enhance understanding of sex-linkage and to test for Hardy-Weinberg equilibrium in females. More advanced topics, such as chromosome interference, the molecular model for recombination, and inversion heterozygosity suppression of recombination can be explored in upper-level genetics and evolution courses. DNA isolation, restriction digests, ligation, long PCR, and iPCR provide experience with techniques in molecular biology. This investigative laboratory weaves together topics stretching from molecular genetics to cytogenetics and sex-linkage, population genetics and evolutionary genetics. © 2016 The International Union of Biochemistry and Molecular Biology.
Shao, Zhiyong; Graf, Shannon; Chaga, Oleg Y; Lavrov, Dennis V
2006-10-15
The 16,937-nuceotide sequence of the linear mitochondrial DNA (mt-DNA) molecule of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa) - the first mtDNA sequence from the class Scypozoa and the first sequence of a linear mtDNA from Metazoa - has been determined. This sequence contains genes for 13 energy pathway proteins, small and large subunit rRNAs, and methionine and tryptophan tRNAs. In addition, two open reading frames of 324 and 969 base pairs in length have been found. The deduced amino-acid sequence of one of them, ORF969, displays extensive sequence similarity with the polymerase [but not the exonuclease] domain of family B DNA polymerases, and this ORF has been tentatively identified as dnab. This is the first report of dnab in animal mtDNA. The genes in A. aurita mtDNA are arranged in two clusters with opposite transcriptional polarities; transcription proceeding toward the ends of the molecule. The determined sequences at the ends of the molecule are nearly identical but inverted and lack any obvious potential secondary structures or telomere-like repeat elements. The acquisition of mitochondrial genomic data for the second class of Cnidaria allows us to reconstruct characteristic features of mitochondrial evolution in this animal phylum.
Jaeger, Alex M.; Makley, Leah N.; Gestwicki, Jason E.; Thiele, Dennis J.
2014-01-01
The heat shock transcription factor 1 (HSF1) activates expression of a variety of genes involved in cell survival, including protein chaperones, the protein degradation machinery, anti-apoptotic proteins, and transcription factors. Although HSF1 activation has been linked to amelioration of neurodegenerative disease, cancer cells exhibit a dependence on HSF1 for survival. Indeed, HSF1 drives a program of gene expression in cancer cells that is distinct from that activated in response to proteotoxic stress, and HSF1 DNA binding activity is elevated in cycling cells as compared with arrested cells. Active HSF1 homotrimerizes and binds to a DNA sequence consisting of inverted repeats of the pentameric sequence nGAAn, known as heat shock elements (HSEs). Recent comprehensive ChIP-seq experiments demonstrated that the architecture of HSEs is very diverse in the human genome, with deviations from the consensus sequence in the spacing, orientation, and extent of HSE repeats that could influence HSF1 DNA binding efficacy and the kinetics and magnitude of target gene expression. To understand the mechanisms that dictate binding specificity, HSF1 was purified as either a monomer or trimer and used to evaluate DNA-binding site preferences in vitro using fluorescence polarization and thermal denaturation profiling. These results were compared with quantitative chromatin immunoprecipitation assays in vivo. We demonstrate a role for specific orientations of extended HSE sequences in driving preferential HSF1 DNA binding to target loci in vivo. These studies provide a biochemical basis for understanding differential HSF1 target gene recognition and transcription in neurodegenerative disease and in cancer. PMID:25204655
Itier, Roxane J; Taylor, Margot J
2002-02-01
Using ERPs in a face recognition task, we investigated whether inversion and contrast reversal, which seem to disrupt different aspects of face configuration, differentially affected encoding and memory for faces. Upright, inverted, and negative (contrast-reversed) unknown faces were either immediately repeated (0-lag) or repeated after 1 intervening face (1-lag). The encoding condition (new) consisted of the first presentation of items correctly recognized in the two repeated conditions. 0-lag faces were recognized better and faster than 1-lag faces. Inverted and negative pictures elicited longer reaction times, lower hit rates, and higher false alarm rates than upright faces. ERP analyses revealed that negative and inverted faces affected both early (encoding) and late (recognition) stages of face processing. Early components (N170, VPP) were delayed and enhanced by both inversion and contrast reversal which also affected P1 and P2 components. Amplitudes were higher for inverted faces at frontal and parietal sites from 350 to 600 ms. Priming effects were seen at encoding stages, revealed by shorter latencies and smaller amplitudes of N170 for repeated stimuli, which did not differ depending on face type. Repeated faces yielded more positive amplitudes than new faces from 250 to 450 ms frontally and from 400 to 600 ms parietally. However, ERP differences revealed that the magnitude of this repetition effect was smaller for negative and inverted than upright faces at 0-lag but not at 1-lag condition. Thus, face encoding and recognition processes were affected by inversion and contrast-reversal differently.
Kim, Eunsoo; Lane, Christopher E; Curtis, Bruce A; Kozera, Catherine; Bowman, Sharen; Archibald, John M
2008-05-12
Cryptophytes are an enigmatic group of unicellular eukaryotes with plastids derived by secondary (i.e., eukaryote-eukaryote) endosymbiosis. Cryptophytes are unusual in that they possess four genomes-a host cell-derived nuclear and mitochondrial genome and an endosymbiont-derived plastid and 'nucleomorph' genome. The evolutionary origins of the host and endosymbiont components of cryptophyte algae are at present poorly understood. Thus far, a single complete mitochondrial genome sequence has been determined for the cryptophyte Rhodomonas salina. Here, the second complete mitochondrial genome of the cryptophyte alga Hemiselmis andersenii CCMP644 is presented. The H. andersenii mtDNA is 60,553 bp in size and encodes 30 structural RNAs and 36 protein-coding genes, all located on the same strand. A prominent feature of the genome is the presence of a approximately 20 Kbp long intergenic region comprised of numerous tandem and dispersed repeat units of between 22-336 bp. Adjacent to these repeats are 27 copies of palindromic sequences predicted to form stable DNA stem-loop structures. One such stem-loop is located near a GC-rich and GC-poor region and may have a regulatory function in replication or transcription. The H. andersenii mtDNA shares a number of features in common with the genome of the cryptophyte Rhodomonas salina, including general architecture, gene content, and the presence of a large repeat region. However, the H. andersenii mtDNA is devoid of inverted repeats and introns, which are present in R. salina. Comparative analyses of the suite of tRNAs encoded in the two genomes reveal that the H. andersenii mtDNA has lost or converted its original trnK(uuu) gene and possesses a trnS-derived 'trnK(uuu)', which appears unable to produce a functional tRNA. Mitochondrial protein coding gene phylogenies strongly support a variety of previously established eukaryotic groups, but fail to resolve the relationships among higher-order eukaryotic lineages. Comparison of the H. andersenii and R. salina mitochondrial genomes reveals a number of cryptophyte-specific genomic features, most notably the presence of a large repeat-rich intergenic region. However, unlike R. salina, the H. andersenii mtDNA does not possess introns and lacks a Lys-tRNA, which is presumably imported from the cytosol.
Kim, Eunsoo; Lane, Christopher E; Curtis, Bruce A; Kozera, Catherine; Bowman, Sharen; Archibald, John M
2008-01-01
Background Cryptophytes are an enigmatic group of unicellular eukaryotes with plastids derived by secondary (i.e., eukaryote-eukaryote) endosymbiosis. Cryptophytes are unusual in that they possess four genomes–a host cell-derived nuclear and mitochondrial genome and an endosymbiont-derived plastid and 'nucleomorph' genome. The evolutionary origins of the host and endosymbiont components of cryptophyte algae are at present poorly understood. Thus far, a single complete mitochondrial genome sequence has been determined for the cryptophyte Rhodomonas salina. Here, the second complete mitochondrial genome of the cryptophyte alga Hemiselmis andersenii CCMP644 is presented. Results The H. andersenii mtDNA is 60,553 bp in size and encodes 30 structural RNAs and 36 protein-coding genes, all located on the same strand. A prominent feature of the genome is the presence of a ~20 Kbp long intergenic region comprised of numerous tandem and dispersed repeat units of between 22–336 bp. Adjacent to these repeats are 27 copies of palindromic sequences predicted to form stable DNA stem-loop structures. One such stem-loop is located near a GC-rich and GC-poor region and may have a regulatory function in replication or transcription. The H. andersenii mtDNA shares a number of features in common with the genome of the cryptophyte Rhodomonas salina, including general architecture, gene content, and the presence of a large repeat region. However, the H. andersenii mtDNA is devoid of inverted repeats and introns, which are present in R. salina. Comparative analyses of the suite of tRNAs encoded in the two genomes reveal that the H. andersenii mtDNA has lost or converted its original trnK(uuu) gene and possesses a trnS-derived 'trnK(uuu)', which appears unable to produce a functional tRNA. Mitochondrial protein coding gene phylogenies strongly support a variety of previously established eukaryotic groups, but fail to resolve the relationships among higher-order eukaryotic lineages. Conclusion Comparison of the H. andersenii and R. salina mitochondrial genomes reveals a number of cryptophyte-specific genomic features, most notably the presence of a large repeat-rich intergenic region. However, unlike R. salina, the H. andersenii mtDNA does not possess introns and lacks a Lys-tRNA, which is presumably imported from the cytosol. PMID:18474103
Zurawski, Gerard; Bohnert, Hans J.; Whitfeld, Paul R.; Bottomley, Warwick
1982-01-01
The gene for the so-called Mr 32,000 rapidly labeled photosystem II thylakoid membrane protein (here designated psbA) of spinach (Spinacia oleracea) chloroplasts is located on the chloroplast DNA in the large single-copy region immediately adjacent to one of the inverted repeat sequences. In this paper we show that the size of the mRNA for this protein is ≈ 1.25 kilobases and that the direction of transcription is towards the inverted repeat unit. The nucleotide sequence of the gene and its flanking regions is presented. The only large open reading frame in the sequence codes for a protein of Mr 38,950. The nucleotide sequence of psbA from Nicotiana debneyi also has been determined, and comparison of the sequences from the two species shows them to be highly conserved (>95% homology) throughout the entire reading frame. Conservation of the amino acid sequence is absolute, there being no changes in a total of 353 residues. This leads us to conclude that the primary translation product of psbA must be a protein of Mr 38,950. The protein is characterized by the complete absence of lysine residues and is relatively rich in hydrophobic amino acids, which tend to be clustered. Transcription of spinach psbA starts about 86 base pairs before the first ATG codon. Immediately upstream from this point there is a sequence typical of that found in E. coli promoters. An almost identical sequence occurs in the equivalent region of N. debneyi DNA. Images PMID:16593262
Ma, Ji; Yang, Bingxian; Zhu, Wei; Sun, Lianli; Tian, Jingkui; Wang, Xumin
2013-10-10
Mahonia bealei (Berberidaceae) is a frequently-used traditional Chinese medicinal plant with efficient anti-inflammatory ability. This plant is one of the sources of berberine, a new cholesterol-lowering drug with anti-diabetic activity. We have sequenced the complete nucleotide sequence of the chloroplast (cp) genome of M. bealei. The complete cp genome of M. bealei is 164,792 bp in length, and has a typical structure with large (LSC 73,052 bp) and small (SSC 18,591 bp) single-copy regions separated by a pair of inverted repeats (IRs 36,501 bp) of large size. The Mahonia cp genome contains 111 unique genes and 39 genes are duplicated in the IR regions. The gene order and content of M. bealei are almost unarranged which is consistent with the hypothesis that large IRs stabilize cp genome and reduce gene loss-and-gain probabilities during evolutionary process. A large IR expansion of over 12 kb has occurred in M. bealei, 15 genes (rps19, rpl22, rps3, rpl16, rpl14, rps8, infA, rpl36, rps11, petD, petB, psbH, psbN, psbT and psbB) have expanded to have an additional copy in the IRs. The IR expansion rearrangement occurred via a double-strand DNA break and subsequence repair, which is different from the ordinary gene conversion mechanism. Repeat analysis identified 39 direct/inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Analysis also revealed 75 simple sequence repeat (SSR) loci and almost all are composed of A or T, contributing to a distinct bias in base composition. Comparison of protein-coding sequences with ESTs reveals 9 putative RNA edits and 5 of them resulted in non-synonymous modifications in rpoC1, rps2, rps19 and ycf1. Phylogenetic analysis using maximum parsimony (MP) and maximum likelihood (ML) was performed on a dataset composed of 65 protein-coding genes from 25 taxa, which yields an identical tree topology as previous plastid-based trees, and provides strong support for the sister relationship between Ranunculaceae and Berberidaceae. Molecular dating analyses suggest that Ranunculaceae and Berberidaceae diverged between 90 and 84 mya, which is congruent with the fossil records and with recent estimates of the divergence time of these two taxa. © 2013.
Diversity and structure of PIF/Harbinger-like elements in the genome of Medicago truncatula
Grzebelus, Dariusz; Lasota, Slawomir; Gambin, Tomasz; Kucherov, Gregory; Gambin, Anna
2007-01-01
Background Transposable elements constitute a significant fraction of plant genomes. The PIF/Harbinger superfamily includes DNA transposons (class II elements) carrying terminal inverted repeats and producing a 3 bp target site duplication upon insertion. The presence of an ORF coding for the DDE/DDD transposase, required for transposition, is characteristic for the autonomous PIF/Harbinger-like elements. Based on the above features, PIF/Harbinger-like elements were identified in several plant genomes and divided into several evolutionary lineages. Availability of a significant portion of Medicago truncatula genomic sequence allowed for mining PIF/Harbinger-like elements, starting from a single previously described element MtMaster. Results Twenty two putative autonomous, i.e. carrying an ORF coding for TPase and complete terminal inverted repeats, and 67 non-autonomous PIF/Harbinger-like elements were found in the genome of M. truncatula. They were divided into five families, MtPH-A5, MtPH-A6, MtPH-D,MtPH-E, and MtPH-M, corresponding to three previously identified and two new lineages. The largest families, MtPH-A6 and MtPH-M were further divided into four and three subfamilies, respectively. Non-autonomous elements were usually direct deletion derivatives of the putative autonomous element, however other types of rearrangements, including inversions and nested insertions were also observed. An interesting structural characteristic – the presence of 60 bp tandem repeats – was observed in a group of elements of subfamily MtPH-A6-4. Some families could be related to miniature inverted repeat elements (MITEs). The presence of empty loci (RESites), paralogous to those flanking the identified transposable elements, both autonomous and non-autonomous, as well as the presence of transposon insertion related size polymorphisms, confirmed that some of the mined elements were capable for transposition. Conclusion The population of PIF/Harbinger-like elements in the genome of M. truncatula is diverse. A detailed intra-family comparison of the elements' structure proved that they proliferated in the genome generally following the model of abortive gap repair. However, the presence of tandem repeats facilitated more pronounced rearrangements of the element internal regions. The insertion polymorphism of the MtPH elements and related MITE families in different populations of M. truncatula, if further confirmed experimentally, could be used as a source of molecular markers complementary to other marker systems. PMID:17996080
Sala-Pérez, Sergi; España-Tost, Antoni; Vidal-Bel, August
2013-01-01
Inverted ductal papilloma of the oral cavity is an infrequent benign neoplasm of papillary appearance that originates in the secretory duct of a salivary gland. The etiology is unknown, though some authors have related it to human papillomavirus (HPV) infection. We present the case of a 40-year-old woman with a tumor of the lower lip mucosa. Histopathological study of the lesion diagnosed inverted ductal papilloma of the oral cavity. Human papillomavirus DNA detection and typing based on tumor lesion DNA amplification and posterior hybridization, revealed no presence of viral DNA. The antecedents of trauma reported by the patient could have played an important role in the development of this tumor. Key words:Inverted ductal papilloma, intraductal papilloma, oral papilloma, papillary epidermoid adenoma. PMID:24455058
USDA-ARS?s Scientific Manuscript database
Plasmids that contain a disrupted genome of the Junonia coenia densovirus (JcDNV) integrate into the chromosomes of the somatic cells of insects. When subcloned individually, both the P9 inverted terminal repeat (P9-ITR) and the P93-ITR promote the chromosomal integration of vector plasmids in insec...
Timmis, K N; Cabello, F; Andrés, I; Nordheim, A; Burkhardt, H J; Cohen, S N
1978-11-16
Detailed examination of the structure of cloned DNA fragments of the R6-5 antibiotic resistance plasmid has revealed a substantial degree of polynucleotide sequence heterogeneity and indicates that sequence rearrangements in plasmids and possible other replicons occur more frequently than has hitherto been appreciated. The sequences changes in cloned R6-5 fragments were shown in some instances to have occurred prior to cloning, i.e. existing in the original population of R6-5 molecules that was obtained from a single bacterial clone and by several different criteria judged to be homogeneous, and in others to have occurred either during the cloning procedure or during subsequent propagation of hybrid molecules. The molecular changes that are described involved insertion/deletion of the previously characterized IS2 insertion element, formation of a new inverted repeat structure probably by duplication of a preexisting R6-5 DNA sequence, sequence inversion, and loss and gain of restriction endonuclease cleavage sites.
Andrade, B S; Góes-Neto, A
2015-10-30
The filamentous fungus Moniliophthora perniciosa is a hemibiotrophic basidiomycete that causes witches' broom disease of cacao (Theobroma cacao L.). Many fungal mitochondrial plasmids are DNA and RNA polymerase-encoding invertrons with terminal inverted repeats and 5'-linked proteins. The aim of this study was to carry out comparative and phylogenetic analyses of DNA and RNA polymerases for all known linear mitochondrial plasmids in fungi. We performed these analyses at both gene and protein levels and assessed differences between fungal and viral polymerases in order to test the lateral gene transfer (LGT) hypothesis. We analyzed all mitochondrial plasmids of the invertron type within the fungal clade, including five from Ascomycota, seven from Basidiomycota, and one from Chytridiomycota. All phylogenetic analyses generated similar tree topologies regardless of the methods and datasets used. It is likely that DNA and RNA polymerase genes were inserted into the mitochondrial genomes of the 13 fungal species examined in our study as a result of different LGT events. These findings are important for a better understanding of the evolutionary relationships between fungal mitochondrial plasmids.
Amemiya, Kei; Meyers, Jennifer L; Deshazer, David; Riggins, Renaldo N; Halasohoris, Stephanie; England, Marilyn; Ribot, Wilson; Norris, Sarah L; Waag, David M
2007-10-01
We examined, by enzyme-linked immunosorbent assay and Western blot analysis, the host immune response to 2 heat-shock proteins (hsps) in a patient and mice previously infected with Burkholderia mallei. The patient was the first reported human glanders case in 50 years in the United States. The expression of the groEL and dnaK operons appeared to be dependent upon a sigma(32) RNA polymerase as suggested by conserved heat-shock promoter sequences, and the groESL operon may be negatively regulated by a controlling invert repeat of chaperone expression (CIRCE) site. In the antisera, the GroEL protein was found to be more immunoreactive than the DnaK protein in both a human patient and mice previously infected with B. mallei. Examination of the supernatant of a growing culture of B. mallei showed that more GroEL protein than DnaK protein was released from the cell. This may occur similarly within an infected host causing an elevated host immune response to the B. mallei hsps.
Adenovirus sequences required for replication in vivo.
Wang, K; Pearson, G D
1985-01-01
We have studied the in vivo replication properties of plasmids carrying deletion mutations within cloned adenovirus terminal sequences. Deletion mapping located the adenovirus DNA replication origin entirely within the first 67 bp of the adenovirus inverted terminal repeat. This region could be further subdivided into two functional domains: a minimal replication origin and an adjacent auxillary region which boosted the efficiency of replication by more than 100-fold. The minimal origin occupies the first 18 to 21 bp and includes sequences conserved between all adenovirus serotypes. The adjacent auxillary region extends past nucleotide 36 but not past nucleotide 67 and contains the binding site for nuclear factor I. Images PMID:2991857
Yao, Xiaohong; Tang, Ping; Li, Zuozhou; Li, Dawei; Liu, Yifei; Huang, Hongwen
2015-01-01
Actinidia chinensis is an important economic plant belonging to the basal lineage of the asterids. Availability of a complete Actinidia chloroplast genome sequence is crucial to understanding phylogenetic relationships among major lineages of angiosperms and facilitates kiwifruit genetic improvement. We report here the complete nucleotide sequences of the chloroplast genomes for Actinidia chinensis and A. chinensis var deliciosa obtained through de novo assembly of Illumina paired-end reads produced by total DNA sequencing. The total genome size ranges from 155,446 to 157,557 bp, with an inverted repeat (IR) of 24,013 to 24,391 bp, a large single copy region (LSC) of 87,984 to 88,337 bp and a small single copy region (SSC) of 20,332 to 20,336 bp. The genome encodes 113 different genes, including 79 unique protein-coding genes, 30 tRNA genes and 4 ribosomal RNA genes, with 16 duplicated in the inverted repeats, and a tRNA gene (trnfM-CAU) duplicated once in the LSC region. Comparisons of IR boundaries among four asterid species showed that IR/LSC borders were extended into the 5' portion of the psbA gene and IR contraction occurred in Actinidia. The clap gene has been lost from the chloroplast genome in Actinidia, and may have been transferred to the nucleus during chloroplast evolution. Twenty-seven polymorphic simple sequence repeat (SSR) loci were identified in the Actinidia chloroplast genome. Maximum parsimony analyses of a 72-gene, 16 taxa angiosperm dataset strongly support the placement of Actinidiaceae in Ericales within the basal asterids.
Gelincik, Ozkan; Blecua, Pedro; Edelmann, Winfried; Kucherlapati, Raju; Zhou, Kathy; Jasin, Maria; Gümüş, Zeynep H.; Lipkin, Steven M.
2017-01-01
Homologous recombination (HR) enables precise DNA repair after DNA double strand breaks (DSBs) using identical sequence templates, whereas homeologous recombination (HeR) uses only partially homologous sequences. Homeologous recombination introduces mutations through gene conversion and genomic deletions through single-strand annealing (SSA). DNA mismatch repair (MMR) inhibits HeR, but the roles of mammalian MMR MutL homologues (MLH1, PMS2 and MLH3) proteins in HeR suppression are poorly characterized. Here, we demonstrate that mouse embryonic fibroblasts (MEFs) carrying Mlh1, Pms2, and Mlh3 mutations have higher HeR rates, by using 7,863 uniquely mapping paired direct repeat sequences (DRs) in the mouse genome as endogenous gene conversion and SSA reporters. Additionally, when DSBs are induced by gamma-radiation, Mlh1, Pms2 and Mlh3 mutant MEFs have higher DR copy number alterations (CNAs), including DR CNA hotspots previously identified in mouse MMR-deficient colorectal cancer (dMMR CRC). Analysis of The Cancer Genome Atlas CRC data revealed that dMMR CRCs have higher genome-wide DR HeR rates than MMR proficient CRCs, and that dMMR CRCs have deletion hotspots in tumor suppressors FHIT/WWOX at chromosomal fragile sites FRA3B and FRA16D (which have elevated DSB rates) flanked by paired homologous DRs and inverted repeats (IR). Overall, these data provide novel insights into the MMR-dependent HeR inhibition mechanism and its role in tumor suppression. PMID:29069730
In and out of the rRNA genes: characterization of Pokey elements in the sequenced Daphnia genome
2013-01-01
Background Only a few transposable elements are known to exhibit site-specific insertion patterns, including the well-studied R-element retrotransposons that insert into specific sites within the multigene rDNA. The only known rDNA-specific DNA transposon, Pokey (superfamily: piggyBac) is found in the freshwater microcrustacean, Daphnia pulex. Here, we present a genome-wide analysis of Pokey based on the recently completed whole genome sequencing project for D. pulex. Results Phylogenetic analysis of Pokey elements recovered from the genome sequence revealed the presence of four lineages corresponding to two divergent autonomous families and two related lineages of non-autonomous miniature inverted repeat transposable elements (MITEs). The MITEs are also found at the same 28S rRNA gene insertion site as the Pokey elements, and appear to have arisen as deletion derivatives of autonomous elements. Several copies of the full-length Pokey elements may be capable of producing an active transposase. Surprisingly, both families of Pokey possess a series of 200 bp repeats upstream of the transposase that is derived from the rDNA intergenic spacer (IGS). The IGS sequences within the Pokey elements appear to be evolving in concert with the rDNA units. Finally, analysis of the insertion sites of Pokey elements outside of rDNA showed a target preference for sites similar to the specific sequence that is targeted within rDNA. Conclusions Based on the target site preference of Pokey elements and the concerted evolution of a segment of the element with the rDNA unit, we propose an evolutionary path by which the ancestors of Pokey elements have invaded the rDNA niche. We discuss how specificity for the rDNA unit may have evolved and how this specificity has played a role in the long-term survival of these elements in the subgenus Daphnia. PMID:24059783
Ducote, Matthew J.; Prakash, Shubha; Pettis, Gregg S.
2000-01-01
Efficient interbacterial transfer of streptomycete plasmid pIJ101 requires the pIJ101 tra gene, as well as a cis-acting plasmid function known as clt. Here we show that the minimal pIJ101 clt locus consists of a sequence no greater than 54 bp in size that includes essential inverted-repeat and direct-repeat sequences and is located in close proximity to the 3′ end of the korB regulatory gene. Evidence that sequences extending beyond the minimal locus and into the korB open reading frame influence clt transfer function and demonstration that clt-korB sequences are intrinsically curved raise the possibility that higher-order structuring of DNA and protein within this plasmid region may be an inherent feature of efficient pIJ101 transfer. PMID:11073933
Ducote, M J; Prakash, S; Pettis, G S
2000-12-01
Efficient interbacterial transfer of streptomycete plasmid pIJ101 requires the pIJ101 tra gene, as well as a cis-acting plasmid function known as clt. Here we show that the minimal pIJ101 clt locus consists of a sequence no greater than 54 bp in size that includes essential inverted-repeat and direct-repeat sequences and is located in close proximity to the 3' end of the korB regulatory gene. Evidence that sequences extending beyond the minimal locus and into the korB open reading frame influence clt transfer function and demonstration that clt-korB sequences are intrinsically curved raise the possibility that higher-order structuring of DNA and protein within this plasmid region may be an inherent feature of efficient pIJ101 transfer.
Evidence for horizontal transfer of mitochondrial DNA to the plastid genome in a bamboo genus.
Ma, Peng-Fei; Zhang, Yu-Xiao; Guo, Zhen-Hua; Li, De-Zhu
2015-06-23
In flowering plants, three genomes (nuclear, mitochondrial, and plastid) coexist and intracellular horizontal transfer of DNA is prevalent, especially from the plastid to the mitochondrion genome. However, the plastid genomes are generally conserved in evolution and have long been considered immune to foreign DNA. Recently, the opposite direction of DNA transfer from the mitochondrial to the plastid genome has been reported in two eudicot lineages. Here we sequenced 6 plastid genomes of bamboos, three of which are neotropical woody species and three are herbaceous ones. Several unusual features were found, including the duplication of trnT-GGU and loss of one copy of rps19 due to contraction of inverted repeats (IRs). The most intriguing was the ~2.7 kb insertion in the plastid IR regions in the three herbaceous bamboos. Furthermore, the insertion was documented to be horizontally transferred from the mitochondrial to the plastid genome. Our study provided evidence of the mitochondrial-to-plastid DNA transfer in the monocots, demonstrating again that this rare event does occur in other angiosperm lineages. However, the mechanism underlying the transfer remains obscure, and more studies in other plants may elucidate it in the future.
The complete chloroplast genome of Capsicum annuum var. glabriusculum using Illumina sequencing.
Raveendar, Sebastin; Na, Young-Wang; Lee, Jung-Ro; Shim, Donghwan; Ma, Kyung-Ho; Lee, Sok-Young; Chung, Jong-Wook
2015-07-20
Chloroplast (cp) genome sequences provide a valuable source for DNA barcoding. Molecular phylogenetic studies have concentrated on DNA sequencing of conserved gene loci. However, this approach is time consuming and more difficult to implement when gene organization differs among species. Here we report the complete re-sequencing of the cp genome of Capsicum pepper (Capsicum annuum var. glabriusculum) using the Illumina platform. The total length of the cp genome is 156,817 bp with a 37.7% overall GC content. A pair of inverted repeats (IRs) of 50,284 bp were separated by a small single copy (SSC; 18,948 bp) and a large single copy (LSC; 87,446 bp). The number of cp genes in C. annuum var. glabriusculum is the same as that in other Capsicum species. Variations in the lengths of LSC; SSC and IR regions were the main contributors to the size variation in the cp genome of this species. A total of 125 simple sequence repeat (SSR) and 48 insertions or deletions variants were found by sequence alignment of Capsicum cp genome. These findings provide a foundation for further investigation of cp genome evolution in Capsicum and other higher plants.
Sun, Di; Zhu, Jianya; Chen, Zhi; Li, Jilun; Wen, Ying
2016-11-14
Avermectins are useful anthelmintic antibiotics produced by Streptomyces avermitilis. We demonstrated that a novel AraC-family transcriptional regulator in this species, SAV742, is a global regulator that negatively controls avermectin biosynthesis and cell growth, but positively controls morphological differentiation. Deletion of its gene, sav_742, increased avermectin production and dry cell weight, but caused delayed formation of aerial hyphae and spores. SAV742 directly inhibited avermectin production by repressing transcription of ave structural genes, and also directly regulated its own gene (sav_742) and adjacent gene sig8 (sav_741). The precise SAV742-binding site on its own promoter region was determined by DNase I footprinting assay coupled with site-directed DNA mutagenesis, and 5-nt inverted repeats (GCCGA-n 10 /n 12 -TCGGC) were found to be essential for SAV742 binding. Similar 5-nt inverted repeats separated by 3, 10 or 15 nt were found in the promoter regions of target ave genes and sig8. The SAV742 regulon was predicted based on bioinformatic analysis. Twenty-six new SAV742 targets were identified and experimentally confirmed, including genes involved in primary metabolism, secondary metabolism and development. Our findings indicate that SAV742 plays crucial roles in not only avermectin biosynthesis but also coordination of complex physiological processes in S. avermitilis.
Graft-transmissible movement of inverted-repeat-induced siRNA signals into flowers.
Zhang, Wenna; Kollwig, Gregor; Stecyk, Ewelina; Apelt, Federico; Dirks, Rob; Kragler, Friedrich
2014-10-01
In plants, small interfering RNAs (siRNA) and microRNAs move to distant tissues where they control numerous developmental and physiological processes such as morphogenesis and stress responses. Grafting techniques and transient expression systems have been employed to show that sequence-specific siRNAs with a size of 21-24 nucleotides traffic to distant organs. We used inverted-repeat constructs producing siRNA targeting the meiosis factor DISRUPTED MEIOTIC cDNA 1 (DMC1) and GFP to test whether silencing signals move into meiotically active tissues. In grafted Nicotiana tabacum, a transgenic DMC1 siRNA signal made in source tissues preferably entered the anthers formed in the first flowers. Here, the DMC1 siRNA interfered with meiotic progression and, consequently, the flowers were at least partially sterile. In agro-infiltrated N. benthamiana plants, a GFP siRNA signal produced in leaves was allocated and active in most flower tissues including anthers. In hypocotyl-grafted Arabidopsis thaliana plants, the DMC1 silencing signal consistently appeared in leaves, petioles, and stem, and only a small number of plants displayed DMC1 siRNA signals in flowers. In all three tested plant species the systemic silencing signal penetrated male sporogenic tissues suggesting that plants harbour an endogenous long-distance small RNA transport pathway facilitating siRNA signalling into meiotically active cells. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.
Weaver, David; Karoonuthaisiri, Nitsara; Tsai, Hsiu-Hwei; Huang, Chih-Hung; Ho, Mai-Lan; Gai, Shuning; Patel, Kedar G; Huang, Jianqiang; Cohen, Stanley N; Hopwood, David A; Chen, Carton W; Kao, Camilla M
2004-03-01
The chromosomes of several widely used laboratory derivatives of Streptomyces coelicolor A3(2) were found to have 1.06 Mb inverted repeat sequences at their termini (i.e. long-terminal inverted repeats; L-TIRs), which are 50 times the length of the 22 kb TIRs of the sequenced S. coelicolor strain M145. The L-TIRs include 1005 annotated genes and increase the overall chromosome size to 9.7 Mb. The 1.06 Mb L-TIRs are the longest reported thus far for an actinomycete, and are proposed to represent the chromosomal state of the original soil isolate of S. coelicolor A3(2). S. coelicolor A3(2), M600 and J1501 possess L-TIRs, whereas approximately half the examined early mutants of A3(2) generated by ultraviolet (UV) or X-ray mutagenesis have truncated their TIRs to the 22 kb length. UV radiation was found to stimulate L-TIR truncation. Two copies of a transposase gene (SCO0020) flank 1.04 Mb of DNA in the right L-TIR, and recombination between them appears to generate strains containing short TIRs. This TIR reduction mechanism may represent a general strategy by which transposable elements can modulate the structure of chromosome ends. The presence of L-TIRs in certain S. coelicolor strains represents a major chromosomal alteration in strains previously thought to be genetically similar.
Horizontal gene transfer from Agrobacterium to plants.
Matveeva, Tatiana V; Lutova, Ludmila A
2014-01-01
Most genetic engineering of plants uses Agrobacterium mediated transformation to introduce novel gene content. In nature, insertion of T-DNA in the plant genome and its subsequent transfer via sexual reproduction has been shown in several species in the genera Nicotiana and Linaria. In these natural examples of horizontal gene transfer from Agrobacterium to plants, the T-DNA donor is assumed to be a mikimopine strain of A. rhizogenes. A sequence homologous to the T-DNA of the Ri plasmid of Agrobacterium rhizogenes was found in the genome of untransformed Nicotiana glauca about 30 years ago, and was named "cellular T-DNA" (cT-DNA). It represents an imperfect inverted repeat and contains homologs of several T-DNA oncogenes (NgrolB, NgrolC, NgORF13, NgORF14) and an opine synthesis gene (Ngmis). A similar cT-DNA has also been found in other species of the genus Nicotiana. These presumably ancient homologs of T-DNA genes are still expressed, indicating that they may play a role in the evolution of these plants. Recently T-DNA has been detected and characterized in Linaria vulgaris and L. dalmatica. In Linaria vulgaris the cT-DNA is present in two copies and organized as a tandem imperfect direct repeat, containing LvORF2, LvORF3, LvORF8, LvrolA, LvrolB, LvrolC, LvORF13, LvORF14, and the Lvmis genes. All L. vulgaris and L. dalmatica plants screened contained the same T-DNA oncogenes and the mis gene. Evidence suggests that there were several independent T-DNA integration events into the genomes of these plant genera. We speculate that ancient plants transformed by A. rhizogenes might have acquired a selective advantage in competition with the parental species. Thus, the events of T-DNA insertion in the plant genome might have affected their evolution, resulting in the creation of new plant species. In this review we focus on the structure and functions of cT-DNA in Linaria and Nicotiana and discuss their possible evolutionary role.
Michlewski, Gracjan; Finnegan, David J.; Elfick, Alistair; Rosser, Susan J.
2017-01-01
Abstract Delivery of DNA to cells and its subsequent integration into the host genome is a fundamental task in molecular biology, biotechnology and gene therapy. Here we describe an IP-free one-step method that enables stable genome integration into either prokaryotic or eukaryotic cells. A synthetic mariner transposon is generated by flanking a DNA sequence with short inverted repeats. When purified recombinant Mos1 or Mboumar-9 transposase is co-transfected with transposon-containing plasmid DNA, it penetrates prokaryotic or eukaryotic cells and integrates the target DNA into the genome. In vivo integrations by purified transposase can be achieved by electroporation, chemical transfection or Lipofection of the transposase:DNA mixture, in contrast to other published transposon-based protocols which require electroporation or microinjection. As in other transposome systems, no helper plasmids are required since transposases are not expressed inside the host cells, thus leading to generation of stable cell lines. Since it does not require electroporation or microinjection, this tool has the potential to be applied for automated high-throughput creation of libraries of random integrants for purposes including gene knock-out libraries, screening for optimal integration positions or safe genome locations in different organisms, selection of the highest production of valuable compounds for biotechnology, and sequencing. PMID:28204586
Xiao, Qing; Min, Taishan; Ma, Shuangping; Hu, Lingna; Chen, Hongyan; Lu, Daru
2018-04-18
Targeted integration of transgenes facilitates functional genomic research and holds prospect for gene therapy. The established microhomology-mediated end-joining (MMEJ)-based strategy leads to the precise gene knock-in with easily constructed donor, yet the limited efficiency remains to be further improved. Here, we show that single-strand DNA (ssDNA) donor contributes to efficient increase of knock-in efficiency and establishes a method to achieve the intracellular linearization of long ssDNA donor. We identified that the CRISPR/Cas9 system is responsible for breaking double-strand DNA (dsDNA) of palindromic structure in inverted terminal repeats (ITRs) region of recombinant adeno-associated virus (AAV), leading to the inhibition of viral second-strand DNA synthesis. Combing Cas9 plasmids targeting genome and ITR with AAV donor delivery, the precise knock-in of gene cassette was achieved, with 13-14% of the donor insertion events being mediated by MMEJ in HEK 293T cells. This study describes a novel method to integrate large single-strand transgene cassettes into the genomes, increasing knock-in efficiency by 13.6-19.5-fold relative to conventional AAV-mediated method. It also provides a comprehensive solution to the challenges of complicated production and difficult delivery with large exogenous fragments.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chattopadhyay, Saket; Ely, Abdullah; Bloom, Kristie
2009-11-20
RNA interference (RNAi) may be harnessed to inhibit viral gene expression and this approach is being developed to counter chronic infection with hepatitis B virus (HBV). Compared to synthetic RNAi activators, DNA expression cassettes that generate silencing sequences have advantages of sustained efficacy and ease of propagation in plasmid DNA (pDNA). However, the large size of pDNAs and inclusion of sequences conferring antibiotic resistance and immunostimulation limit delivery efficiency and safety. To develop use of alternative DNA templates that may be applied for therapeutic gene silencing, we assessed the usefulness of PCR-generated linear expression cassettes that produce anti-HBV micro-RNA (miR)more » shuttles. We found that silencing of HBV markers of replication was efficient (>75%) in cell culture and in vivo. miR shuttles were processed to form anti-HBV guide strands and there was no evidence of induction of the interferon response. Modification of terminal sequences to include flanking human adenoviral type-5 inverted terminal repeats was easily achieved and did not compromise silencing efficacy. These linear DNA sequences should have utility in the development of gene silencing applications where modifications of terminal elements with elimination of potentially harmful and non-essential sequences are required.« less
The Effect of Syllable Repetition Rate on Vocal Characteristics
ERIC Educational Resources Information Center
Topbas, Oya; Orlikoff, Robert F.; St. Louis, Kenneth O.
2012-01-01
This study examined whether mean vocal fundamental frequency ("F"[subscript 0]) or speech sound pressure level (SPL) varies with changes in syllable repetition rate. Twenty-four young adults (12 M and 12 F) repeated the syllables/p[inverted v]/,/p[inverted v]t[schwa]/, and/p[inverted v]t[schwa]k[schwa]/at a modeled "slow" rate of approximately one…
Lewis, Leslie A; Astatke, Mekbib; Umekubo, Peter T; Alvi, Shaheen; Saby, Robert; Afrose, Jehan; Oliveira, Pedro H; Monteiro, Gabriel A; Prazeres, Duarte Mf
2012-01-26
Transposition in IS3, IS30, IS21 and IS256 insertion sequence (IS) families utilizes an unconventional two-step pathway. A figure-of-eight intermediate in Step I, from asymmetric single-strand cleavage and joining reactions, is converted into a double-stranded minicircle whose junction (the abutted left and right ends) is the substrate for symmetrical transesterification attacks on target DNA in Step II, suggesting intrinsically different synaptic complexes (SC) for each step. Transposases of these ISs bind poorly to cognate DNA and comparative biophysical analyses of SC I and SC II have proven elusive. We have prepared a native, soluble, active, GFP-tagged fusion derivative of the IS2 transposase that creates fully formed complexes with single-end and minicircle junction (MCJ) substrates and used these successfully in hydroxyl radical footprinting experiments. In IS2, Step I reactions are physically and chemically asymmetric; the left imperfect, inverted repeat (IRL), the exclusive recipient end, lacks donor function. In SC I, different protection patterns of the cleavage domains (CDs) of the right imperfect inverted repeat (IRR; extensive in cis) and IRL (selective in trans) at the single active cognate IRR catalytic center (CC) are related to their donor and recipient functions. In SC II, extensive binding of the IRL CD in trans and of the abutted IRR CD in cis at this CC represents the first phase of the complex. An MCJ substrate precleaved at the 3' end of IRR revealed a temporary transition state with the IRL CD disengaged from the protein. We propose that in SC II, sequential 3' cleavages at the bound abutted CDs trigger a conformational change, allowing the IRL CD to complex to its cognate CC, producing the second phase. Corroborating data from enhanced residues and curvature propensity plots suggest that CD to CD interactions in SC I and SC II require IRL to assume a bent structure, to facilitate binding in trans. Different transpososomes are assembled in each step of the IS2 transposition pathway. Recipient versus donor end functions of the IRL CD in SC I and SC II and the conformational change in SC II that produces the phase needed for symmetrical IRL and IRR donor attacks on target DNA highlight the differences.
2012-01-01
Background Transposition in IS3, IS30, IS21 and IS256 insertion sequence (IS) families utilizes an unconventional two-step pathway. A figure-of-eight intermediate in Step I, from asymmetric single-strand cleavage and joining reactions, is converted into a double-stranded minicircle whose junction (the abutted left and right ends) is the substrate for symmetrical transesterification attacks on target DNA in Step II, suggesting intrinsically different synaptic complexes (SC) for each step. Transposases of these ISs bind poorly to cognate DNA and comparative biophysical analyses of SC I and SC II have proven elusive. We have prepared a native, soluble, active, GFP-tagged fusion derivative of the IS2 transposase that creates fully formed complexes with single-end and minicircle junction (MCJ) substrates and used these successfully in hydroxyl radical footprinting experiments. Results In IS2, Step I reactions are physically and chemically asymmetric; the left imperfect, inverted repeat (IRL), the exclusive recipient end, lacks donor function. In SC I, different protection patterns of the cleavage domains (CDs) of the right imperfect inverted repeat (IRR; extensive in cis) and IRL (selective in trans) at the single active cognate IRR catalytic center (CC) are related to their donor and recipient functions. In SC II, extensive binding of the IRL CD in trans and of the abutted IRR CD in cis at this CC represents the first phase of the complex. An MCJ substrate precleaved at the 3' end of IRR revealed a temporary transition state with the IRL CD disengaged from the protein. We propose that in SC II, sequential 3' cleavages at the bound abutted CDs trigger a conformational change, allowing the IRL CD to complex to its cognate CC, producing the second phase. Corroborating data from enhanced residues and curvature propensity plots suggest that CD to CD interactions in SC I and SC II require IRL to assume a bent structure, to facilitate binding in trans. Conclusions Different transpososomes are assembled in each step of the IS2 transposition pathway. Recipient versus donor end functions of the IRL CD in SC I and SC II and the conformational change in SC II that produces the phase needed for symmetrical IRL and IRR donor attacks on target DNA highlight the differences. PMID:22277150
Saski, Christopher; Lee, Seung-Bum; Fjellheim, Siri; Guda, Chittibabu; Jansen, Robert K.; Luo, Hong; Tomkins, Jeffrey; Rognli, Odd Arne; Clarke, Jihong Liu
2009-01-01
Comparisons of complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera to six published grass chloroplast genomes reveal that gene content and order are similar but two microstructural changes have occurred. First, the expansion of the IR at the SSC/IRa boundary that duplicates a portion of the 5′ end of ndhH is restricted to the three genera of the subfamily Pooideae (Agrostis, Hordeum and Triticum). Second, a 6 bp deletion in ndhK is shared by Agrostis, Hordeum, Oryza and Triticum, and this event supports the sister relationship between the subfamilies Erhartoideae and Pooideae. Repeat analysis identified 19–37 direct and inverted repeats 30 bp or longer with a sequence identity of at least 90%. Seventeen of the 26 shared repeats are found in all the grass chloroplast genomes examined and are located in the same genes or intergenic spacer (IGS) regions. Examination of simple sequence repeats (SSRs) identified 16–21 potential polymorphic SSRs. Five IGS regions have 100% sequence identity among Zea mays, Saccharum officinarum and Sorghum bicolor, whereas no spacer regions were identical among Oryza sativa, Triticum aestivum, H. vulgare and A. stolonifera despite their close phylogenetic relationship. Alignment of EST sequences and DNA coding sequences identified six C–U conversions in both Sorghum bicolor and H. vulgare but only one in A. stolonifera. Phylogenetic trees based on DNA sequences of 61 protein-coding genes of 38 taxa using both maximum parsimony and likelihood methods provide moderate support for a sister relationship between the subfamilies Erhartoideae and Pooideae. PMID:17534593
Oggioni, M R; Claverys, J P
1999-10-01
A survey of all Streptococcus pneumoniae GenBank/EMBL DNA sequence entries and of the public domain sequence (representing more than 90% of the genome) of an S. pneumoniae type 4 strain allowed identification of 108 copies of a 107-bp-long highly repeated intergenic element called RUP (for repeat unit of pneumococcus). Several features of the element, revealed in this study, led to the proposal that RUP is an insertion sequence (IS)-derivative that could still be mobile. Among these features are: (1) a highly significant homology between the terminal inverted repeats (IRs) of RUPs and of IS630-Spn1, a new putative IS of S. pneumoniae; and (2) insertion at a TA dinucleotide, a characteristic target of several members of the IS630 family. Trans-mobilization of RUP is therefore proposed to be mediated by the transposase of IS630-Spn1. To account for the observation that RUPs are distributed among four subtypes which exhibit different degrees of sequence homogeneity, a scenario is invoked based on successive stages of RUP mobility and non-mobility, depending on whether an active transposase is present or absent. In the latter situation, an active transposase could be reintroduced into the species through natural transformation. Examination of sequences flanking RUP revealed a preferential association with ISs. It also provided evidence that RUPs promote sequence rearrangements, thereby contributing to genome flexibility. The possibility that RUP preferentially targets transforming DNA of foreign origin and subsequently favours disruption/rearrangement of exogenous sequences is discussed.
Xie, Qing; Shen, Kang-Ning; Hao, Xiuying; Nam, Phan Nhut; Ngoc Hieu, Bui Thi; Chen, Ching-Hung; Zhu, Changqing; Lin, Yen-Chang; Hsiao, Chung-Der
2017-03-01
abtract We decoded the complete chloroplast DNA (cpDNA) sequence of the Tianshan Snow Lotus (Saussurea involucrata), a famous traditional Chinese medicinal plant of the family Asteraceae, by using next-generation sequencing technology. The genome consists of 152 490 bp containing a pair of inverted repeats (IRs) of 25 202 bp, which was separated by a large single-copy region and a small single-copy region of 83 446 bp and 18 639 bp, respectively. The genic regions account for 57.7% of whole cpDNA, and the GC content of the cpDNA was 37.7%. The S. involucrata cpDNA encodes 114 unigenes (82 protein-coding genes, 4 rRNA genes, and 28 tRNA genes). There are eight protein-coding genes (atpF, ndhA, ndhB, rpl2, rpoC1, rps16, clpP, and ycf3) and five tRNA genes (trnA-UGC, trnI-GAU, trnK-UUU, trnL-UAA, and trnV-UAC) containing introns. A phylogenetic analysis of the 11 complete cpDNA from Asteracease showed that S. involucrata is closely related to Centaurea diffusa (Diffuse Knapweed). The complete cpDNA of S. involucrata provides essential and important DNA molecular data for further phylogenetic and evolutionary analysis for Asteraceae.
Kim, Sunggil; Park, Jee Young; Yang, Tae-Jin
2015-06-01
Intact retrotransposon and DNA transposons inserted in a single gene were characterized in onions (Allium cepa) and their transcription and copy numbers were estimated in this study. While analyzing diverse onion germplasm, large insertions in the DFR-A gene encoding dihydroflavonol 4-reductase (DFR) involved in the anthocyanin biosynthesis pathway were found in two accessions. A 5,070-bp long terminal repeat (LTR) retrotransposon inserted in the active DFR-A (R4) allele was identified from one of the large insertions and designated AcCOPIA1. An intact ORF encoded typical domains of copia-like LTR retrotransposons. However, AcCOPIA1 contained atypical 'TG' and 'TA' dinucleotides at the ends of the LTRs. A 4,615-bp DNA transposon was identified in the other large insertion. This DNA transposon, designated AcCACTA1, contained an ORF coding for a transposase showing homology with the CACTA superfamily transposable elements (TEs). Another 5,073-bp DNA transposon was identified from the DFR-A (TRN) allele. This DNA transposon, designated AchAT1, belonged to the hAT superfamily with short 4-bp terminal inverted repeats (TIRs). Finally, a 6,258-bp non-autonomous DNA transposon, designated AcPINK, was identified in the ANS-p allele encoding anthocyanidin synthase, the next downstream enzyme to DFR in the anthocyanin biosynthesis pathway. AcPINK also possessed very short 3-bp TIRs. Active transcription of AcCOPIA1, AcCACTA1, and AchAT1 was observed through RNA-Seq analysis and RT-PCR. The copy numbers of AcPINK estimated by mapping the genomic DNA reads produced by NextSeq 500 were predominantly high compared with the other TEs. A series of evidence indicated that these TEs might have transposed in these onion genes very recently, providing a stepping stone for elucidation of enormously large-sized onion genome structure.
Comparative genomics of pyridoxal 5′-phosphate-dependent transcription factor regulons in Bacteria
Suvorova, Inna A.
2016-01-01
The MocR-subfamily transcription factors (MocR-TFs) characterized by the GntR-family DNA-binding domain and aminotransferase-like sensory domain are broadly distributed among certain lineages of Bacteria. Characterized MocR-TFs bind pyridoxal 5′-phosphate (PLP) and control transcription of genes involved in PLP, gamma aminobutyric acid (GABA) and taurine metabolism via binding specific DNA operator sites. To identify putative target genes and DNA binding motifs of MocR-TFs, we performed comparative genomics analysis of over 250 bacterial genomes. The reconstructed regulons for 825 MocR-TFs comprise structural genes from over 200 protein families involved in diverse biological processes. Using the genome context and metabolic subsystem analysis we tentatively assigned functional roles for 38 out of 86 orthologous groups of studied regulators. Most of these MocR-TF regulons are involved in PLP metabolism, as well as utilization of GABA, taurine and ectoine. The remaining studied MocR-TF regulators presumably control genes encoding enzymes involved in reduction/oxidation processes, various transporters and PLP-dependent enzymes, for example aminotransferases. Predicted DNA binding motifs of MocR-TFs are generally similar in each orthologous group and are characterized by two to four repeated sequences. Identified motifs were classified according to their structures. Motifs with direct and/or inverted repeat symmetry constitute the majority of inferred DNA motifs, suggesting preferable TF dimerization in head-to-tail or head-to-head configuration. The obtained genomic collection of in silico reconstructed MocR-TF motifs and regulons in Bacteria provides a basis for future experimental characterization of molecular mechanisms for various regulators in this family. PMID:28348826
A virus of hyperthermophilic archaea with a unique architecture among DNA viruses.
Rensen, Elena Ilka; Mochizuki, Tomohiro; Quemin, Emmanuelle; Schouten, Stefan; Krupovic, Mart; Prangishvili, David
2016-03-01
Viruses package their genetic material in diverse ways. Most known strategies include encapsulation of nucleic acids into spherical or filamentous virions with icosahedral or helical symmetry, respectively. Filamentous viruses with dsDNA genomes are currently associated exclusively with Archaea. Here, we describe a filamentous hyperthermophilic archaeal virus, Pyrobaculum filamentous virus 1 (PFV1), with a type of virion organization not previously observed in DNA viruses. The PFV1 virion, 400 ± 20 × 32 ± 3 nm, contains an envelope and an inner core consisting of two structural units: a rod-shaped helical nucleocapsid formed of two 14-kDa major virion proteins and a nucleocapsid-encompassing protein sheath composed of a single major virion protein of 18 kDa. The virion organization of PFV1 is superficially similar to that of negative-sense RNA viruses of the family Filoviridae, including Ebola virus and Marburg virus. The linear dsDNA of PFV1 carries 17,714 bp, including 60-bp-long terminal inverted repeats, and contains 39 predicted ORFs, most of which do not show similarities to sequences in public databases. PFV1 is a lytic virus that completely disrupts the host cell membrane at the end of the infection cycle.
Mobilization of a plant transposon by expression of the transposon-encoded anti-silencing factor.
Fu, Yu; Kawabe, Akira; Etcheverry, Mathilde; Ito, Tasuku; Toyoda, Atsushi; Fujiyama, Asao; Colot, Vincent; Tarutani, Yoshiaki; Kakutani, Tetsuji
2013-08-28
Transposable elements (TEs) have a major impact on genome evolution, but they are potentially deleterious, and most of them are silenced by epigenetic mechanisms, such as DNA methylation. Here, we report the characterization of a TE encoding an activity to counteract epigenetic silencing by the host. In Arabidopsis thaliana, we identified a mobile copy of the Mutator-like element (MULE) with degenerated terminal inverted repeats (TIRs). This TE, named Hiun (Hi), is silent in wild-type plants, but it transposes when DNA methylation is abolished. When a Hi transgene was introduced into the wild-type background, it induced excision of the endogenous Hi copy, suggesting that Hi is the autonomously mobile copy. In addition, the transgene induced loss of DNA methylation and transcriptional activation of the endogenous Hi. Most importantly, the trans-activation of Hi depends on a Hi-encoded protein different from the conserved transposase. Proteins related to this anti-silencing factor, which we named VANC, are widespread in the non-TIR MULEs and may have contributed to the recent success of these TEs in natural Arabidopsis populations.
5-Methyldeoxycytidine in the Physarum minichromosome containing the ribosomal RNA genes.
Cooney, C A; Matthews, H R; Bradbury, E M
1984-01-01
5-Methyldeoxycytidine (5MC) was analyzed by high pressure liquid chromatography (HPLC) and by restriction enzyme digestion in rDNA isolated from Physarum polycephalum. rDNA from Physarum M3C strain microplasmodia has a significant 5MC content (about half that of the whole genomic DNA). This rDNA contains many C5MCGG sites because it is clearly digested further by Msp I than by Hpa II. However, most 5MC is in other sites. In particular, alternating CG sequences appear to be highly methylated. HPLC of deoxyribonucleosides shows tha most of the transcribed regions contain little or no 5MC. Restriction digestion indicates that there is little or no 5MC in any of the transcribed regions including the transcription origin and adjacent sequences. Over 90% of the total 5MC is in or near the central nontranscribed spacer and most methylated restriction sites are in inverted repeats of this spacer. rDNA is very heterogeneous with respect to 5MC. The 5MC pattern doesn't appear to change with inactivation of the rRNA genes during reversible differentiation from microplasmodia (growing) to microsclerotia (dormant), showing that inactivation is due to changes in other chromatin variables. The 5MC pattern is different between Physarum strains. The possible involvement of this 5MC in rDNA chromatin structure and in cruciform and Z-DNA formation is discussed. Images PMID:6322108
Spy: a new group of eukaryotic DNA transposons without target site duplications.
Han, Min-Jin; Xu, Hong-En; Zhang, Hua-Hao; Feschotte, Cédric; Zhang, Ze
2014-06-24
Class 2 or DNA transposons populate the genomes of most eukaryotes and like other mobile genetic elements have a profound impact on genome evolution. Most DNA transposons belong to the cut-and-paste types, which are relatively simple elements characterized by terminal-inverted repeats (TIRs) flanking a single gene encoding a transposase. All eukaryotic cut-and-paste transposons so far described are also characterized by target site duplications (TSDs) of host DNA generated upon chromosomal insertion. Here, we report a new group of evolutionarily related DNA transposons called Spy, which also include TIRs and DDE motif-containing transposase but surprisingly do not create TSDs upon insertion. Instead, Spy transposons appear to transpose precisely between 5'-AAA and TTT-3' host nucleotides, without duplication or modification of the AAATTT target sites. Spy transposons were identified in the genomes of diverse invertebrate species based on transposase homology searches and structure-based approaches. Phylogenetic analyses indicate that Spy transposases are distantly related to IS5, ISL2EU, and PIF/Harbinger transposases. However, Spy transposons are distinct from these and other DNA transposon superfamilies by their lack of TSD and their target site preference. Our findings expand the known diversity of DNA transposons and reveal a new group of eukaryotic DDE transposases with unusual catalytic properties. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Yamada, Kazuteru; Kaneko, Jun; Kamio, Yoshiyuki; Itoh, Yoshifumi
2008-01-01
Pectobacterium carotovorum subsp. carotovorum strain Er simultaneously produces the phage tail-like bacteriocin carotovoricin (Ctv) and pectin lyase (Pnl) in response to DNA-damaging agents. The regulatory protein RdgB of the Mor/C family of proteins activates transcription of pnl through binding to the promoter. However, the optimal temperature for the synthesis of Ctv (23°C) differs from that for synthesis of Pnl (30°C), raising the question of whether RdgB directly activates ctv transcription. Here we report that RdgB directly regulates Ctv synthesis. Gel mobility shift assays demonstrated RdgB binding to the P0, P1, and P2 promoters of the ctv operons, and DNase I footprinting determined RdgB-binding sequences (RdgB boxes) on these and on the pnl promoters. The RdgB box of the pnl promoter included a perfect 7-bp inverted repeat with high binding affinity to the regulator (Kd [dissociation constant] = 150 nM). In contrast, RdgB boxes of the ctv promoters contained an imperfect inverted repeat with two or three mismatches that consequently reduced binding affinity (Kd = 250 to 350 nM). Transcription of the rdgB and ctv genes was about doubled at 23°C compared with that at 30°C. In contrast, the amount of pnl transcription tripled at 30°C. Thus, the inverse synthesis of Ctv and Pnl as a function of temperature is apparently controlled at the transcriptional level, and reduced rdgB expression at 30°C obviously affected transcription from the ctv promoters with low-affinity RdgB boxes. Pathogenicity toward potato tubers was reduced in an rdgB knockout mutant, suggesting that the RdgAB system contributes to the pathogenicity of this bacterium, probably by activating pnl expression. PMID:18689515
Halász, Júlia; Kodad, Ossama; Hegedűs, Attila
2014-07-01
Miniature inverted-repeat transposable elements (MITEs) are known to contribute to the evolution of plants, but only limited information is available for MITEs in the Prunus genome. We identified a MITE that has been named Falling Stones, FaSt. All structural features (349-bp size, 82-bp terminal inverted repeats and 9-bp target site duplications) are consistent with this MITE being a putative member of the Mutator transposase superfamily. FaSt showed a preferential accumulation in the short AT-rich segments of the euchromatin region of the peach genome. DNA sequencing and pollination experiments have been performed to confirm that the nested insertion of FaSt into the S-haplotype-specific F-box gene of apricot resulted in the breakdown of self-incompatibility (SI). A bioinformatics-based survey of the known Rosaceae and other genomes and a newly designed polymerase chain reaction (PCR) assay verified the Prunoideae-specific occurrence of FaSt elements. Phylogenetic analysis suggested a recent activity of FaSt in the Prunus genome. The occurrence of a nested insertion in the apricot genome further supports the recent activity of FaSt in response to abiotic stress conditions. This study reports on a presumably active non-autonomous Mutator element in Prunus that exhibits a major indirect genome shaping force through inducing loss-of-function mutation in the SI locus. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.
Friedrich, Michael; Meier, Doreen; Schuster, Isabelle; Nellen, Wolfgang
2015-01-01
We have previously shown that the most abundant Dictyostelium discoideum retroelement DIRS-1 is suppressed by RNAi mechanisms. Here we provide evidence that both inverted terminal repeats have strong promoter activity and that bidirectional expression apparently generates a substrate for Dicer. A cassette containing the inverted terminal repeats and a fragment of a gene of interest was sufficient to activate the RNAi response, resulting in the generation of ~21 nt siRNAs, a reduction of mRNA and protein expression of the respective endogene. Surprisingly, no transitivity was observed on the endogene. This was in contrast to previous observations, where endogenous siRNAs caused spreading on an artificial transgene. Knock-down was successful on seven target genes that we examined. In three cases a phenotypic analysis proved the efficiency of the approach. One of the target genes was apparently essential because no knock-out could be obtained; the RNAi mediated knock-down, however, resulted in a very slow growing culture indicating a still viable reduction of gene expression. ADVANTAGES OF THE DIRS-1–RNAI SYSTEM: The knock-down system required a short DNA fragment (~400 bp) of the target gene as an initial trigger. Further siRNAs were generated by RdRPs since we have shown some siRNAs with a 5'-triphosphate group. Extrachromosomal vectors facilitate the procedure and allowed for molecular and phenotypic analysis within one week. The system provides an efficient and rapid method to reduce protein levels including those of essential genes.
Herrmann, Luise; Haase, Ilka; Blauhut, Maike; Barz, Nadine; Fischer, Markus
2014-12-17
Two cocoa types, Arriba and CCN-51, are being cultivated in Ecuador. With regard to the unique aroma, Arriba is considered a fine cocoa type, while CCN-51 is a bulk cocoa because of its weaker aroma. Because it is being assumed that Arriba is mixed with CCN-51, there is an interest in the analytical differentiation of the two types. Two methods to identify CCN-51 adulterations in Arriba cocoa were developed on the basis of differences in the chloroplast DNA. On the one hand, a different repeat of the sequence TAAAG in the inverted repeat region results in a different length of amplicons for the two cocoa types, which can be detected by agarose gel electrophoresis, capillary gel electrophoresis, and denaturing high-performance liquid chromatography. On the other hand, single nucleotide polymorphisms (SNPs) between the CCN-51 and Arriba sequences represent restriction sites, which can be used for restriction fragment length polymorphism analysis. A semi-quantitative analysis based on these SNPs is feasible. A method for an exact quantitation based on these results is not realizable. These sequence variations were confirmed for a comprehensive cultivar collection of Arriba and CCN-51, for both bean and leaf samples.
Gubser, Caroline; Smith, Geoffrey L
2002-04-01
Camelpox virus (CMPV) and variola virus (VAR) are orthopoxviruses (OPVs) that share several biological features and cause high mortality and morbidity in their single host species. The sequence of a virulent CMPV strain was determined; it is 202182 bp long, with inverted terminal repeats (ITRs) of 6045 bp and has 206 predicted open reading frames (ORFs). As for other poxviruses, the genes are tightly packed with little non-coding sequence. Most genes within 25 kb of each terminus are transcribed outwards towards the terminus, whereas genes within the centre of the genome are transcribed from either DNA strand. The central region of the genome contains genes that are highly conserved in other OPVs and 87 of these are conserved in all sequenced chordopoxviruses. In contrast, genes towards either terminus are more variable and encode proteins involved in host range, virulence or immunomodulation. In some cases, these are broken versions of genes found in other OPVs. The relationship of CMPV to other OPVs was analysed by comparisons of DNA and predicted protein sequences, repeats within the ITRs and arrangement of ORFs within the terminal regions. Each comparison gave the same conclusion: CMPV is the closest known virus to variola virus, the cause of smallpox.
DOT National Transportation Integrated Search
1963-02-01
Vestibular stimulation by repeated unilateral caloric irrigation of cats occasioned the appearance of secondary, tertiary, and inverted primary nystagmus in some animals. These inverse responses were recorded with stimulus temperatures of 5, 23.5, an...
Evidence of an inverted hexagonal phase in self-assembled phospholipid-DNA-metal complexes
NASA Astrophysics Data System (ADS)
Francescangeli, O.; Pisani, M.; Stanic, V.; Bruni, P.; Weiss, T. M.
2004-08-01
We report the first observation of an inverted hexagonal phase of phospholipid-DNA-metal complexes. These ternary complexes are formed in a self-assembled manner when water solutions of neutral lipid dioleoylphosphatidylethanolamine (DOPE), DNA and divalent metal cations (Me2+; Me=Fe, Co, Mg, Mn) are mixed, which represents a striking example of supramolecular chemistry. The structure, derived from synchrotron X-ray diffraction, consists of cylindrical DNA strands coated by neutral lipid monolayers and arranged on a two-dimensional hexagonal lattice (HIIc). Besides the fundamental aspects, DOPE-DNA-Me2+ complexes may be of great interest as efficient nonviral delivery systems in gene therapy applications because of the low inherent cytotoxicity and the potential high transfection efficiency.
How type II CRISPR-Cas establish immunity through Cas1-Cas2-mediated spacer integration.
Xiao, Yibei; Ng, Sherwin; Nam, Ki Hyun; Ke, Ailong
2017-10-05
CRISPR (clustered regularly interspaced short palindromic repeats) and the nearby Cas (CRISPR-associated) operon establish an RNA-based adaptive immunity system in prokaryotes. Molecular memory is created when a short foreign DNA-derived prespacer is integrated into the CRISPR array as a new spacer. Whereas the RNA-guided CRISPR interference mechanism varies widely among CRISPR-Cas systems, the spacer integration mechanism is essentially identical. The conserved Cas1 and Cas2 proteins form an integrase complex consisting of two distal Cas1 dimers bridged by a Cas2 dimer. The prespacer is bound by Cas1-Cas2 as a dual-forked DNA, and the terminal 3'-OH of each 3' overhang serves as an attacking nucleophile during integration. The prespacer is preferentially integrated into the leader-proximal region of the CRISPR array, guided by the leader sequence and a pair of inverted repeats inside the CRISPR repeat. Spacer integration in the well-studied Escherichia coli type I-E CRISPR system also relies on the bacterial integration host factor. In type II-A CRISPR, however, Cas1-Cas2 alone integrates spacers efficiently in vitro; other Cas proteins (such as Cas9 and Csn2) have accessory roles in the biogenesis phase of prespacers. Here we present four structural snapshots from the type II-A system of Enterococcus faecalis Cas1 and Cas2 during spacer integration. Enterococcus faecalis Cas1-Cas2 selectively binds to a splayed 30-base-pair prespacer bearing 4-nucleotide 3' overhangs. Three molecular events take place upon encountering a target: first, the Cas1-Cas2-prespacer complex searches for half-sites stochastically, then it preferentially interacts with the leader-side CRISPR repeat, and finally, it catalyses a nucleophilic attack that connects one strand of the leader-proximal repeat to the prespacer 3' overhang. Recognition of the spacer half-site requires DNA bending and leads to full integration. We derive a mechanistic framework to explain the stepwise spacer integration process and the leader-proximal preference.
How Type II CRISPR-Cas establish immunity through Cas1-Cas2 mediated spacer integration
Xiao, Yibei; Ng, Sherwin; Nam, Ki Hyun; Ke, Ailong
2017-01-01
CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and the nearby cas (CRISPR-associated) operon establish an RNA-based adaptive immunity system in prokaryotes1–5. Molecular memory is created when a short foreign DNA-derived prespacer is integrated into the CRISPR array as a new spacer6–9. Whereas the RNA-guided CRISPR interference mechanism varies widely among CRISPR-Cas systems, the spacer integration mechanism is essentially identical7–9. The conserved Cas1 and Cas2 proteins form an integrase complex consisting two distal Cas1 dimers bridged by a Cas2 dimer in the middle6,10. The prespacer is bound by Cas1-Cas2 as a dual forked DNA, and the terminal 3′-OH of each 3′-overhang serves as an attacking nucleophile during integration11–14. Importantly, the prespacer is preferentially integrated into the leader-proximal region of the CRISPR array1,7,10,15, guided by the leader sequence and a pair of inverted repeats (IRs) inside the CRISPR repeat7,15–20. Spacer integration in the most well-studied Escherichia coli Type I-E CRISPR system further relies on the bacterial Integration Host Factor (IHF)21,22. In Type II-A CRISPR, however, Cas1-Cas2 alone integrates spacer efficiently in vitro18; other Cas proteins (Cas9 and Csn2) play accessory roles in prespacer biogenesis17,23. Focusing on the Enterococcus faecalis Type II-A system24, here we report four structure snapshots of Cas1-Cas2 during spacer integration. EfaCas1-Cas2 selectively binds to a splayed 30-bp prespacer bearing 4-nt 3′-overhangs. Three molecular events take place upon encountering a target: Cas1-Cas2/prespacer first searches for half-sites stochastically, then preferentially interacts with the leader-side CRISPR repeat and catalyzes a nucleophilic attack that connects one strand of the leader-proximal repeat to the prespacer 3′-overhang. Recognition of the spacer half-site requires DNA bending and leads to full integration. We derive a mechanistic framework explaining the stepwise spacer integration process and the leader-proximal preference. PMID:28869593
Genetics, structure, and prevalence of FP967 (CDC Triffid) T-DNA in flax.
Young, Lester; Hammerlindl, Joseph; Babic, Vivijan; McLeod, Jamille; Sharpe, Andrew; Matsalla, Chad; Bekkaoui, Faouzi; Marquess, Leigh; Booker, Helen M
2015-01-01
The detection of T-DNA from a genetically modified flaxseed line (FP967, formally CDC Triffid) in a shipment of Canadian flaxseed exported to Europe resulted in a large decrease in the amount of flax planted in Canada. The Canadian flaxseed industry undertook major changes to ensure the removal of FP967 from the supply chain. This study aimed to resolve the genetics and structure of the FP967 transfer DNA (T-DNA). The FP967 T-DNA is thought to be inserted in at single genomic locus. The junction between the T-DNA and genomic DNA consisted of two inverted Right Borders with no Left Border (LB) flanking genomic DNA sequences recovered. This information was used to develop an event-specific quantitative PCR (qPCR) assay. This assay and an existing assay specific to the T-DNA construct were used to determine the genetics and prevalence of the FP967 T-DNA. These data supported the hypothesis that the T-DNA is present at a single location in the genome. The FP967 T-DNA is present at a low level (between 0.01 and 0.1%) in breeder seed lots from 2009 and 2010. None of the 11,000 and 16,000 lines selected for advancement through the Flax Breeding Program in 2010 and 2011, respectively, tested positive for the FP967 T-DNA, however. Most of the FP967 T-DNA sequence was resolved via PCR cloning and next generation sequencing. A 3,720 bp duplication of an internal portion of the T-DNA (including a Right Border) was discovered between the flanking genomic DNA and the LB. An event-specific assay, SAT2-LB, was developed for the junction between this repeat and the LB.
NASA Astrophysics Data System (ADS)
Krupovic, Mart; Koonin, Eugene V.
2014-06-01
Single-stranded (ss)DNA viruses are extremely widespread, infect diverse hosts from all three domains of life and include important pathogens. Most ssDNA viruses possess small genomes that replicate by the rolling-circle-like mechanism initiated by a distinct virus-encoded endonuclease. However, viruses of the family Bidnaviridae, instead of the endonuclease, encode a protein-primed type B DNA polymerase (PolB) and hence break this pattern. We investigated the provenance of all bidnavirus genes and uncover an unexpected turbulent evolutionary history of these unique viruses. Our analysis strongly suggests that bidnaviruses evolved from a parvovirus ancestor from which they inherit a jelly-roll capsid protein and a superfamily 3 helicase. The radiation of bidnaviruses from parvoviruses was probably triggered by integration of the ancestral parvovirus genome into a large virus-derived DNA transposon of the Polinton (polintovirus) family resulting in the acquisition of the polintovirus PolB gene along with terminal inverted repeats. Bidnavirus genes for a receptor-binding protein and a potential novel antiviral defense modulator are derived from dsRNA viruses (Reoviridae) and dsDNA viruses (Baculoviridae), respectively. The unusual evolutionary history of bidnaviruses emphasizes the key role of horizontal gene transfer, sometimes between viruses with completely different genomes but occupying the same niche, in the emergence of new viral types.
Recognizing the enemy within: licensing RNA-guided genome defense
Dumesic, Phillip A.; Madhani, Hiten D.
2014-01-01
How do cells distinguish normal genes from transposons? Although much has been learned about RNAi-related RNA silencing pathways responsible for genome defense, this fundamental question remains. The literature points to several classes of mechanisms. In some cases, double-stranded RNA structures produced by transposon inverted repeats or antisense integration trigger endo-siRNA biogenesis. In other instances, DNA features associated with transposons—such as their unusual copy number, chromosomal arrangement, and/or chromatin environment—license RNA silencing. Finally, recent studies have identified improper transcript processing events, such as stalled pre-mRNA splicing, as signals for siRNA production. Thus, the suboptimal gene expression properties of selfish elements can enable their identification by RNA silencing pathways. PMID:24280023
Importance Sampling of Word Patterns in DNA and Protein Sequences
Chan, Hock Peng; Chen, Louis H.Y.
2010-01-01
Abstract Monte Carlo methods can provide accurate p-value estimates of word counting test statistics and are easy to implement. They are especially attractive when an asymptotic theory is absent or when either the search sequence or the word pattern is too short for the application of asymptotic formulae. Naive direct Monte Carlo is undesirable for the estimation of small probabilities because the associated rare events of interest are seldom generated. We propose instead efficient importance sampling algorithms that use controlled insertion of the desired word patterns on randomly generated sequences. The implementation is illustrated on word patterns of biological interest: palindromes and inverted repeats, patterns arising from position-specific weight matrices (PSWMs), and co-occurrences of pairs of motifs. PMID:21128856
Garcia-Fernàndez, J; Bayascas-Ramírez, J R; Marfany, G; Muñoz-Mármol, A M; Casali, A; Baguñà, J; Saló, E
1995-05-01
Several DNA sequences similar to the mariner element were isolated and characterized in the platyhelminthe Dugesia (Girardia) tigrina. They were 1,288 bp long, flanked by two 32 bp-inverted repeats, and contained a single 339 amino acid open-reading frame (ORF) encoding the transposase. The number of copies of this element is approximately 8,000 per haploid genome, constituting a member of the middle-repetitive DNA of Dugesia tigrina. Sequence analysis of several elements showed a high percentage of conservation between the different copies. Most of them presented an intact ORF and the standard signals of actively expressed genes, which suggests that some of them are or have recently been functional transposons. The high degree of similarity shared with other mariner elements from some arthropods, together with the fact that this element is undetectable in other planarian species, strongly suggests a case of horizontal transfer between these two distant phyla.
DNA sequences of three beta-1,4-endoglucanase genes from Thermomonospora fusca.
Lao, G; Ghangas, G S; Jung, E D; Wilson, D B
1991-01-01
The DNA sequences of the Thermomonospora fusca genes encoding cellulases E2 and E5 and the N-terminal end of E4 were determined. Each sequence contains an identical 14-bp inverted repeat upstream of the initiation codon. There were no significant homologies between the coding regions of the three genes. The E2 gene is 73% identical to the celA gene from Microbispora bispora, but this was the only homology found with other cellulase genes. E2 belongs to a family of cellulases that includes celA from M. bispora, cenA from Cellulomonas fimi, casA from an alkalophilic Streptomyces strain, and cellobiohydrolase II from Trichoderma reesei. E4 shows 44% identity to an avocado cellulase, while E5 belongs to the Bacillus cellulase family. There were strong similarities between the amino acid sequences of the E2 and E5 cellulose binding domains, and these regions also showed homology with C. fimi and Pseudomonas fluorescens cellulose binding domains. PMID:1904434
Comparative Analysis of the Complete Chloroplast Genome of Four Endangered Herbals of Notopterygium
Yang, Jiao; Yue, Ming; Niu, Chuan; Ma, Xiong-Feng; Li, Zhong-Hu
2017-01-01
Notopterygium H. de Boissieu (Apiaceae) is an endangered perennial herb endemic to China. A good knowledge of phylogenetic evolution and population genomics is conducive to the establishment of effective management and conservation strategies of the genus Notopterygium. In this study, the complete chloroplast (cp) genomes of four Notopterygium species (N. incisum C. C. Ting ex H. T. Chang, N. oviforme R. H. Shan, N. franchetii H. de Boissieu and N. forrestii H. Wolff) were assembled and characterized using next-generation sequencing. We investigated the gene organization, order, size and repeat sequences of the cp genome and constructed the phylogenetic relationships of Notopterygium species based on the chloroplast DNA and nuclear internal transcribed spacer (ITS) sequences. Comparative analysis of plastid genome showed that the cp DNA are the standard double-stranded molecule, ranging from 157,462 bp (N. oviforme) to 159,607 bp (N. forrestii) in length. The circular DNA each contained a large single-copy (LSC) region, a small single-copy (SSC) region, and a pair of inverted repeats (IRs). The cp DNA of four species contained 85 protein-coding genes, 37 transfer RNA (tRNA) genes and 8 ribosomal RNA (rRNA) genes, respectively. We determined the marked conservation of gene content and sequence evolutionary rate in the cp genome of four Notopterygium species. Three genes (psaI, psbI and rpoA) were possibly under positive selection among the four sampled species. Phylogenetic analysis showed that four Notopterygium species formed a monophyletic clade with high bootstrap support. However, the inconsistent interspecific relationships with the genus Notopterygium were identified between the cp DNA and ITS markers. The incomplete lineage sorting, convergence evolution or hybridization, gene infiltration and different sampling strategies among species may have caused the incongruence between the nuclear and cp DNA relationships. The present results suggested that Notopterygium species may have experienced a complex evolutionary history and speciation process. PMID:28422071
De Feyter, R; Yang, Y; Gabriel, D W
1993-01-01
Six plasmid-borne avirulence (avr) genes were previously cloned from strain XcmH of the cotton pathogen, Xanthomonas campestris pv. malvacearum. We have now localized all six avr genes on the cloned fragments by subcloning and Tn5-gusA insertional mutagenesis. None of these avr genes appeared to exhibit exclusively gene-for-gene patterns of interactions with cotton R genes, and avrB4 was demonstrated to confer avr gene-for-R genes (plural) avirulence to X. c. pv. malvacearum on congenic cotton lines carrying either of two different resistance loci, B1 or B4. Furthermore, the B1 locus appeared to confer R gene-for-avr genes resistance to cotton against isogenic X. c. pv. malvacearum strains carrying any one of three avr genes: avrB4, avrb6, or avrB102. Restriction enzyme, Southern blot hybridization, and DNA sequence analyses showed that the XcmH avr genes are all highly similar to each other, to avrBs3 and avrBsP from the pepper pathogen X. c. pv. vesicatoria, and to the host-specific virulence gene pthA from the citrus pathogen X. citri. The XcmH avr genes differed primarily in the multiplicity of a tandemly repeated 102-base pair motif within the central portions of the genes, repeated from 14 to 23 times in members of this gene family. The complete nucleotide sequence of avrb6 revealed that it is 97% identical in DNA sequence to avrB4, avrBs3, avrBsP, and pthA and that 62-bp inverted terminal repeats mark the boundaries of homology between avrb6 and all members of this Xanthomonas virulence/avirulence gene family sequenced to date. The terminal 38 bp of both inverted repeats are highly similar to the 38-bp consensus terminal sequence of the Tn3 family of transposons. Up to 11 members of the avr gene family appear to be present in North American strains of X. c. pv. malvacearum, including XcmH. The high level of homology observed among these avr genes and their presence in multiple copies may explain the gene-for-genes interactions and also the observed high frequencies (10(-3) to 10(-4) per locus) of X. c. pv. malvacearum race change mutations. Five spontaneous race change mutants of XcmH suffered avr locus deletions, strongly indicating intergenic recombination as the primary mechanism for generating new races in X. c. pv. malvacearum.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ono, M.
1986-06-01
By using a DNA fragment primarily encoding the reverse transcriptase (pol) region of the Syrian hamster intracisternal A particle (IAP; type A retrovirus) gene as a probe, human endogenous retrovirus genes, tentatively termed HERV-K genes, were cloned from a fetal human liver gene library. Typical HERV-K genes were 9.1 or 9.4 kilobases in length, having long terminal repeats (LTRs) of ca. 970 base pairs. Many structural features commonly observed on the retrovirus LTRs, such as the TATAA box, polyadenylation signal, and terminal inverted repeats, were present on each LTR, and a lysine (K) tRNA having a CUU anticodon was identifiedmore » as a presumed primer tRNA. The HERV-K LTR, however, had little sequence homology to either the IAP LTR or other typical oncovirus LTRs. By filter hybridization, the number of HERV-K genes was estimated to be ca. 50 copies per haploid human genome. The cloned mouse mammary tumor virus (type B) gene was found to hybridize with both the HERV-K and IAP genes to essentially the same extent.« less
Nishihara, Hidenori; Stanyon, Roscoe; Kusumi, Junko; Hirai, Hirohisa
2018-01-01
Abstract Rod cells of many nocturnal mammals have a “non-standard” nuclear architecture, which is called the inverted nuclear architecture. Heterochromatin localizes to the central region of the nucleus. This leads to an efficient light transmission to the outer segments of photoreceptors. Rod cells of diurnal mammals have the conventional nuclear architecture. Owl monkeys (genus Aotus) are the only taxon of simian primates that has a nocturnal or cathemeral lifestyle, and this adaptation is widely thought to be secondary. Their rod cells were shown to exhibit an intermediate chromatin distribution: a spherical heterochromatin block was found in the central region of the nucleus although it was less complete than that of typical nocturnal mammals. We recently demonstrated that the primary DNA component of this heterochromatin block was OwlRep, a megasatellite DNA consisting of 187-bp-long repeat units. However, the origin of OwlRep was not known. Here we show that OwlRep was derived from HSAT6, a simple repeat sequence found in the centromere regions of human chromosomes. HSAT6 occurs widely in primates, suggesting that it was already present in the last common ancestor of extant primates. Notably, Strepsirrhini and Tarsiformes apparently carry a single HSAT6 copy, whereas many species of Simiiformes contain multiple copies. Comparison of nucleotide sequences of these copies revealed the entire process of the OwlRep formation. HSAT6, with or without flanking sequences, was segmentally duplicated in New World monkeys. Then, in the owl monkey linage after its divergence from other New World monkeys, a copy of HSAT6 was tandemly amplified, eventually forming a megasatellite DNA. PMID:29294004
Ishiai, M; Wada, C; Kawasaki, Y; Yura, T
1994-01-01
Replication of mini-F plasmid requires the plasmid-encoded RepE initiator protein and several host factors including DnaJ, DnaK, and GrpE, heat shock proteins of Escherichia coli. The RepE protein plays a crucial role in replication and exhibits two major functions: initiation of replication from the origin, ori2, and autogenous repression of repE transcription. One of the mini-F plasmid mutants that can replicate in the dnaJ-defective host produces an altered RepE (RepE54) with a markedly enhanced initiator activity but little or no repressor activity. RepE54 has been purified from cell extracts primarily in monomeric form, unlike the wild-type RepE that is recovered in dimeric form. Gel-retardation assays revealed that RepE54 monomers bind to ori2 (direct repeats) with a very high efficiency but hardly bind to the repE operator (inverted repeat), in accordance with the properties of RepE54 in vivo. Furthermore, the treatment of wild-type RepE dimers with protein denaturants enhanced their binding to ori2 but reduced binding to the operator: RepE dimers were partially converted to monomers, and the ori2 binding activity was uniquely associated with monomers. These results strongly suggest that RepE monomers represent an active form by binding to ori2 to initiate replication, whereas dimers act as an autogenous repressor by binding to the operator. We propose that RepE is structurally and functionally differentiated and that monomerization of RepE dimers, presumably mediated by heat shock protein(s), activates the initiator function and participates in regulation of mini-F DNA replication. Images PMID:8170998
2013-01-01
Background Wheat gluten has unique nutritional and technological characteristics, but is also a major trigger of allergies and intolerances. One of the most severe diseases caused by gluten is coeliac disease. The peptides produced in the digestive tract by the incomplete digestion of gluten proteins trigger the disease. The majority of the epitopes responsible reside in the gliadin fraction of gluten. The location of the multiple gliadin genes in blocks has to date complicated their elimination by classical breeding techniques or by the use of biotechnological tools. As an approach to silence multiple gliadin genes we have produced 38 transgenic lines of bread wheat containing combinations of two endosperm-specific promoters and three different inverted repeat sequences to silence three fractions of gliadins by RNA interference. Results The effects of the RNA interference constructs on the content of the gluten proteins, total protein and starch, thousand seed weights and SDSS quality tests of flour were analyzed in these transgenic lines in two consecutive years. The characteristics of the inverted repeat sequences were the main factor that determined the efficiency of silencing. The promoter used had less influence on silencing, although a synergy in silencing efficiency was observed when the two promoters were used simultaneously. Genotype and the environment also influenced silencing efficiency. Conclusions We conclude that to obtain wheat lines with an optimum reduction of toxic gluten epitopes one needs to take into account the factors of inverted repeat sequences design, promoter choice and also the wheat background used. PMID:24044767
Pang, Xiuhua; Aigle, Bertrand; Girardet, Jean-Michel; Mangenot, Sophie; Pernodet, Jean-Luc; Decaris, Bernard; Leblond, Pierre
2004-01-01
Streptomyces ambofaciens has an 8-Mb linear chromosome ending in 200-kb terminal inverted repeats. Analysis of the F6 cosmid overlapping the terminal inverted repeats revealed a locus similar to type II polyketide synthase (PKS) gene clusters. Sequence analysis identified 26 open reading frames, including genes encoding the β-ketoacyl synthase (KS), chain length factor (CLF), and acyl carrier protein (ACP) that make up the minimal PKS. These KS, CLF, and ACP subunits are highly homologous to minimal PKS subunits involved in the biosynthesis of angucycline antibiotics. The genes encoding the KS and ACP subunits are transcribed constitutively but show a remarkable increase in expression after entering transition phase. Five genes, including those encoding the minimal PKS, were replaced by resistance markers to generate single and double mutants (replacement in one and both terminal inverted repeats). Double mutants were unable to produce either diffusible orange pigment or antibacterial activity against Bacillus subtilis. Single mutants showed an intermediate phenotype, suggesting that each copy of the cluster was functional. Transformation of double mutants with a conjugative and integrative form of F6 partially restored both phenotypes. The pigmented and antibacterial compounds were shown to be two distinct molecules produced from the same biosynthetic pathway. High-pressure liquid chromatography analysis of culture extracts from wild-type and double mutants revealed a peak with an associated bioactivity that was absent from the mutants. Two additional genes encoding KS and CLF were present in the cluster. However, disruption of the second KS gene had no effect on either pigment or antibiotic production. PMID:14742212
Fricova, Dominika; Valach, Matus; Farkas, Zoltan; Pfeiffer, Ilona; Kucsera, Judit; Tomaska, Lubomir; Nosek, Jozef
2010-01-01
As a part of our initiative aimed at a large-scale comparative analysis of fungal mitochondrial genomes, we determined the complete DNA sequence of the mitochondrial genome of the yeast Candida subhashii and found that it exhibits a number of peculiar features. First, the mitochondrial genome is represented by linear dsDNA molecules of uniform length (29 795 bp), with an unusually high content of guanine and cytosine residues (52.7 %). Second, the coding sequences lack introns; thus, the genome has a relatively compact organization. Third, the termini of the linear molecules consist of long inverted repeats and seem to contain a protein covalently bound to terminal nucleotides at the 5′ ends. This architecture resembles the telomeres in a number of linear viral and plasmid DNA genomes classified as invertrons, in which the terminal proteins serve as specific primers for the initiation of DNA synthesis. Finally, although the mitochondrial genome of C. subhashii contains essentially the same set of genes as other closely related pathogenic Candida species, we identified additional ORFs encoding two homologues of the family B protein-priming DNA polymerases and an unknown protein. The terminal structures and the genes for DNA polymerases are reminiscent of linear mitochondrial plasmids, indicating that this genome architecture might have emerged from fortuitous recombination between an ancestral, presumably circular, mitochondrial genome and an invertron-like element. PMID:20395267
Chompy: an infestation of MITE-like repetitive elements in the crocodilian genome.
Ray, David A; Hedges, Dale J; Herke, Scott W; Fowlkes, Justin D; Barnes, Erin W; LaVie, Daniel K; Goodwin, Lindsey M; Densmore, Llewellyn D; Batzer, Mark A
2005-12-05
Interspersed repeats are a major component of most eukaryotic genomes and have an impact on genome size and stability, but the repetitive element landscape of crocodilian genomes has not yet been fully investigated. In this report, we provide the first detailed characterization of an interspersed repeat element in any crocodilian genome. Chompy is a putative miniature inverted-repeat transposable element (MITE) family initially recovered from the genome of Alligator mississippiensis (American alligator) but also present in the genomes of Crocodylus moreletii (Morelet's crocodile) and Gavialis gangeticus (Indian gharial). The element has all of the hallmarks of MITEs including terminal inverted repeats, possible target site duplications, and a tendency to form secondary structures. We estimate the copy number in the alligator genome to be approximately 46,000 copies. As a result of their size and unique properties, Chompy elements may provide a useful source of genomic variation for crocodilian comparative genomics.
Stoichiometry of the Cre recombinase bound to the lox recombining site.
Mack, A; Sauer, B; Abremski, K; Hoess, R
1992-01-01
The site-specific recombinase Cre from bacteriophage P1 binds and carries out recombination at a 34 bp lox site. The lox site consists of two 13 bp inverted repeats, separated by an 8 bp spacer region. Both the palindromic nature of the site and the results of footprinting and band shift experiments suggest that a minimum of two Cre molecules bind to a lox site. We report here experiments that demonstrate the absolute stoichiometry of the Cre-lox complex to be one molecule of Cre bound per inverted repeat, or two molecules per lox site. Images PMID:1408747
Design and fabrication of inverted rib waveguide Bragg grating
NASA Astrophysics Data System (ADS)
Huang, Cheng-Sheng; Wang, Wei-Chih
2009-03-01
A polymeric SU8 rib waveguide Bragg grating filterfabricated using reactive ion etching (RIE) and solvent assisted microcontact molding (SAMIM) is presented. SAMIM is one kind of soft lithography. The technique is unique in which that a composite hPDMS/PDMS stamp was used to transfer the grating pattern onto an inverted SU8 rib waveguide system. The composite grating stamp can be used repeatedly several times with degradation. Using this stamp and inverter rib waveguide structure, the Bragg grating filter fabrication can be significantly simplified.
The DNA-bending protein HMGB1 is a cellular cofactor of Sleeping Beauty transposition.
Zayed, Hatem; Izsvák, Zsuzsanna; Khare, Dheeraj; Heinemann, Udo; Ivics, Zoltán
2003-05-01
Sleeping Beauty (SB) is the most active Tc1/ mariner-type transposon in vertebrates. SB contains two transposase-binding sites (DRs) at the end of each terminal inverted repeat (IR), a feature termed the IR/DR structure. We investigated the involvement of cellular proteins in the regulation of SB transposition. Here, we establish that the DNA-bending, high-mobility group protein, HMGB1 is a host-encoded cofactor of SB transposition. Transposition was severely reduced in mouse cells deficient in HMGB1. This effect was rescued by transient over-expression of HMGB1, and was partially complemented by HMGB2, but not with the HMGA1 protein. Over-expression of HMGB1 in wild-type mouse cells enhanced transposition, indicating that HMGB1 can be a limiting factor of transposition. SB transposase was found to interact with HMGB1 in vivo, suggesting that the transposase may recruit HMGB1 to transposon DNA. HMGB1 stimulated preferential binding of the transposase to the DR further from the cleavage site, and promoted bending of DNA fragments containing the transposon IR. We propose that the role of HMGB1 is to ensure that transposase-transposon complexes are first formed at the internal DRs, and subsequently to promote juxtaposition of functional sites in transposon DNA, thereby assisting the formation of synaptic complexes.
A variant Tc4 transposable element in the nematode C. elegans could encode a novel protein.
Li, W; Shaw, J E
1993-01-01
A variant C. elegans Tc4 transposable element, Tc4-rh1030, has been sequenced and is 3483 bp long. The Tc4 element that had been analyzed previously is 1605 bp long, consists of two 774-bp nearly perfect inverted terminal repeats connected by a 57-bp loop, and lacks significant open reading frames. In Tc4-rh1030, by comparison, a 2343-bp novel sequence is present in place of a 477-bp segment in one of the inverted repeats. The novel sequence of Tc4-rh1030 is present about five times per haploid genome and is invariably associated with Tc4 elements; we have used the designation Tc4v to denote this variant subfamily of Tc4 elements. Sequence analysis of three cDNA clones suggests that a Tc4v element contains at least five exons that could encode a novel basic protein of 537 amino acid residues. On northern blots, a 1.6-kb Tc4v-specific transcript was detected in the mutator strain TR679 but not in the wild-type strain N2; Tc4 elements are known to transpose in TR679 but appear to be quiescent in N2. We have analyzed transcripts produced by an unc-33 gene that has the Tc4-rh1030 insertional mutation in its transcribed region; all or almost all of the Tc4v sequence is frequently spliced out of the mutant unc-33 transcripts, sometimes by means of non-consensus splice acceptor sites. Images PMID:8382791
Xer1-Mediated Site-Specific DNA Inversions and Excisions in Mycoplasma agalactiae▿ ‡
Czurda, Stefan; Jechlinger, Wolfgang; Rosengarten, Renate; Chopra-Dewasthaly, Rohini
2010-01-01
Surface antigen variation in Mycoplasma agalactiae, the etiologic agent of contagious agalactia in sheep and goats, is governed by site-specific recombination within the vpma multigene locus encoding the Vpma family of variable surface lipoproteins. This high-frequency Vpma phase switching was previously shown to be mediated by a Xer1 recombinase encoded adjacent to the vpma locus. In this study, it was demonstrated in Escherichia coli that the Xer1 recombinase is responsible for catalyzing vpma gene inversions between recombination sites (RS) located in the 5′-untranslated region (UTR) in all six vpma genes, causing cleavage and strand exchange within a 21-bp conserved region that serves as a recognition sequence. It was further shown that the outcome of the site-specific recombination event depends on the orientation of the two vpma RS, as direct or inverted repeats. While recombination between inverted vpma RS led to inversions, recombination between direct repeat vpma RS led to excisions. Using a newly developed excision assay based on the lacZ reporter system, we were able to successfully demonstrate under native conditions that such Xer1-mediated excisions can indeed also occur in the M. agalactiae type strain PG2, whereas they were not observed in the control xer1-disrupted VpmaY phase-locked mutant (PLMY), which lacks Xer1 recombinase. Unless there are specific regulatory mechanisms preventing such excisions, this might be the cost that the pathogen has to render at the population level for maintaining this high-frequency phase variation machinery. PMID:20562305
Niu, Zhi-Tao; Liu, Wei; Xue, Qing-Yun; Ding, Xiao-Yu
2014-01-01
The orchid family Orchidaceae is one of the largest angiosperm families, including many species of important economic value. While chloroplast genomes are very informative for systematics and species identification, there is very limited information available on chloroplast genomes in the Orchidaceae. Here, we report the complete chloroplast genomes of the medicinal plant Dendrobium officinale and the ornamental orchid Cypripedium macranthos, demonstrating their gene content and order and potential RNA editing sites. The chloroplast genomes of the above two species and five known photosynthetic orchids showed similarities in structure as well as gene order and content, but differences in the organization of the inverted repeat/small single-copy junction and ndh genes. The organization of the inverted repeat/small single-copy junctions in the chloroplast genomes of these orchids was classified into four types; we propose that inverted repeats flanking the small single-copy region underwent expansion or contraction among Orchidaceae. The AT-rich regions of the ycf1 gene in orchids could be linked to the recombination of inverted repeat/small single-copy junctions. Relative species in orchids displayed similar patterns of variation in ndh gene contents. Furthermore, fifteen highly divergent protein-coding genes were identified, which are useful for phylogenetic analyses in orchids. To test the efficiency of these genes serving as markers in phylogenetic analyses, coding regions of four genes (accD, ccsA, matK, and ycf1) were used as a case study to construct phylogenetic trees in the subfamily Epidendroideae. High support was obtained for placement of previously unlocated subtribes Collabiinae and Dendrobiinae in the subfamily Epidendroideae. Our findings expand understanding of the diversity of orchid chloroplast genomes and provide a reference for study of the molecular systematics of this family. PMID:24911363
Luo, Jing; Hou, Bei-Wei; Niu, Zhi-Tao; Liu, Wei; Xue, Qing-Yun; Ding, Xiao-Yu
2014-01-01
The orchid family Orchidaceae is one of the largest angiosperm families, including many species of important economic value. While chloroplast genomes are very informative for systematics and species identification, there is very limited information available on chloroplast genomes in the Orchidaceae. Here, we report the complete chloroplast genomes of the medicinal plant Dendrobium officinale and the ornamental orchid Cypripedium macranthos, demonstrating their gene content and order and potential RNA editing sites. The chloroplast genomes of the above two species and five known photosynthetic orchids showed similarities in structure as well as gene order and content, but differences in the organization of the inverted repeat/small single-copy junction and ndh genes. The organization of the inverted repeat/small single-copy junctions in the chloroplast genomes of these orchids was classified into four types; we propose that inverted repeats flanking the small single-copy region underwent expansion or contraction among Orchidaceae. The AT-rich regions of the ycf1 gene in orchids could be linked to the recombination of inverted repeat/small single-copy junctions. Relative species in orchids displayed similar patterns of variation in ndh gene contents. Furthermore, fifteen highly divergent protein-coding genes were identified, which are useful for phylogenetic analyses in orchids. To test the efficiency of these genes serving as markers in phylogenetic analyses, coding regions of four genes (accD, ccsA, matK, and ycf1) were used as a case study to construct phylogenetic trees in the subfamily Epidendroideae. High support was obtained for placement of previously unlocated subtribes Collabiinae and Dendrobiinae in the subfamily Epidendroideae. Our findings expand understanding of the diversity of orchid chloroplast genomes and provide a reference for study of the molecular systematics of this family.
Choi, Kyoung Su; Park, Kyu Tae; Park, SeonJoo
2017-11-16
Symplocarpus renifolius is a member of Araceae family that is extraordinarily diverse in appearance. Previous studies on chloroplast genomes in Araceae were focused on duckweeds (Lemnoideae) and root crops ( Colocasia , commonly known as taro). Here, we determined the chloroplast genome of Symplocarpus renifolius and compared the factors, such as genes and inverted repeat (IR) junctions and performed phylogenetic analysis using other Araceae species. The chloroplast genome of S. renifolius is 158,521 bp and includes 113 genes. A comparison among the Araceae chloroplast genomes showed that infA in Lemna , Spirodela , Wolffiella , Wolffia , Dieffenbachia and Colocasia has been lost or has become a pseudogene and has only been retained in Symplocarpus . In the Araceae chloroplast DNA (cpDNA), psbZ is retained. However, psbZ duplication occurred in Wolffia species and tandem repeats were noted around the duplication regions. A comparison of the IR junction in Araceae species revealed the presence of ycf1 and rps15 in the small single copy region, whereas duckweed species contained ycf1 and rps15 in the IR region. The phylogenetic analyses of the chloroplast genomes revealed that Symplocarpus are a basal group and are sister to the other Araceae species. Consequently, infA deletion or pseudogene events in Araceae occurred after the divergence of Symplocarpus and aquatic plants (duckweeds) in Araceae and duplication events of rps15 and ycf1 occurred in the IR region.
Yi, Dong-Keun; Lee, Hae-Lim; Sun, Byung-Yun; Chung, Mi Yoon; Kim, Ki-Joong
2012-05-01
This study reports the complete chloroplast (cp) DNA sequence of Eleutherococcus senticosus (GenBank: JN 637765), an endangered endemic species. The genome is 156,768 bp in length, and contains a pair of inverted repeat (IR) regions of 25,930 bp each, a large single copy (LSC) region of 86,755 bp and a small single copy (SSC) region of 18,153 bp. The structural organization, gene and intron contents, gene order, AT content, codon usage, and transcription units of the E. senticosus chloroplast genome are similar to that of typical land plant cp DNA. We aligned and analyzed the sequences of 86 coding genes, 19 introns and 113 intergenic spacers (IGS) in three different taxonomic hierarchies; Eleutherococcus vs. Panax, Eleutherococcus vs. Daucus, and Eleutherococcus vs. Nicotiana. The distribution of indels, the number of polymorphic sites and nucleotide diversity indicate that positional constraint is more important than functional constraint for the evolution of cp genome sequences in Asterids. For example, the intron sequences in the LSC region exhibited base substitution rates 5-11-times higher than that of the IR regions, while the intron sequences in the SSC region evolved 7-14-times faster than those in the IR region. Furthermore, the Ka/Ks ratio of the gene coding sequences supports a stronger evolutionary constraint in the IR region than in the LSC or SSC regions. Therefore, our data suggest that selective sweeps by base collection mechanisms more frequently eliminate polymorphisms in the IR region than in other regions. Chloroplast genome regions that have high levels of base substitutions also show higher incidences of indels. Thirty-five simple sequence repeat (SSR) loci were identified in the Eleutherococcus chloroplast genome. Of these, 27 are homopolymers, while six are di-polymers and two are tri-polymers. In addition to the SSR loci, we also identified 18 medium size repeat units ranging from 22 to 79 bp, 11 of which are distributed in the IGS or intron regions. These medium size repeats may contribute to developing a cp genome-specific gene introduction vector because the region may use for specific recombination sites.
Weng, Mao-Lun; Blazier, John C; Govindu, Madhumita; Jansen, Robert K
2014-03-01
Geraniaceae plastid genomes are highly rearranged, and each of the four genera already sequenced in the family has a distinct genome organization. This study reports plastid genome sequences of six additional species, Francoa sonchifolia, Melianthus villosus, and Viviania marifolia from Geraniales, and Pelargonium alternans, California macrophylla, and Hypseocharis bilobata from Geraniaceae. These genome sequences, combined with previously published species, provide sufficient taxon sampling to reconstruct the ancestral plastid genome organization of Geraniaceae and the rearrangements unique to each genus. The ancestral plastid genome of Geraniaceae has a 4 kb inversion and a reduced, Pelargonium-like small single copy region. Our ancestral genome reconstruction suggests that a few minor rearrangements occurred in the stem branch of Geraniaceae followed by independent rearrangements in each genus. The genomic comparison demonstrates that a series of inverted repeat boundary shifts and inversions played a major role in shaping genome organization in the family. The distribution of repeats is strongly associated with breakpoints in the rearranged genomes, and the proportion and the number of large repeats (>20 bp and >60 bp) are significantly correlated with the degree of genome rearrangements. Increases in the degree of plastid genome rearrangements are correlated with the acceleration in nonsynonymous substitution rates (dN) but not with synonymous substitution rates (dS). Possible mechanisms that might contribute to this correlation, including DNA repair system and selection, are discussed.
SU8 inverted-rib waveguide Bragg grating filter.
Huang, Cheng-Sheng; Wang, Wei-Chih
2013-08-01
A polymeric SU8 inverted-rib waveguide Bragg grating filter fabricated using reactive ion etching (RIE) and solvent assisted microcontact molding (SAMIM) is presented. SAMIM is one kind of soft lithography. The technique is unique in that a composite hard-polydimethysiloxane/polydimethysiloxane stamp is used to transfer the grating pattern onto an inverted SU8 rib waveguide system. The composite grating stamp can be used repeatedly several times without degradation. Using this stamp and inverter-rib waveguide structure, the Bragg grating filter fabrication can be significantly simplified. The experiment result shows an attenuation dip in the transmission spectra, with a value of -7 dBm at 1550 nm for a grating with a period of 0.492 μm on an inverted-rib waveguide with 6.6 μm width and 4 μm height.
van Aelst, Kara; Saikrishnan, Kayarat; Szczelkun, Mark D.
2015-01-01
The prokaryotic Type ISP restriction-modification enzymes are single-chain proteins comprising an Mrr-family nuclease, a superfamily 2 helicase-like ATPase, a coupler domain, a methyltransferase, and a DNA-recognition domain. Upon recognising an unmodified DNA target site, the helicase-like domain hydrolyzes ATP to cause site release (remodeling activity) and to then drive downstream translocation consuming 1–2 ATP per base pair (motor activity). On an invading foreign DNA, double-strand breaks are introduced at random wherever two translocating enzymes form a so-called collision complex following long-range communication between a pair of target sites in inverted (head-to-head) repeat. Paradoxically, structural models for collision suggest that the nuclease domains are too far apart (>30 bp) to dimerise and produce a double-strand DNA break using just two strand-cleavage events. Here, we examined the organisation of different collision complexes and how these lead to nuclease activation. We mapped DNA cleavage when a translocating enzyme collides with a static enzyme bound to its site. By following communication between sites in both head-to-head and head-to-tail orientations, we could show that motor activity leads to activation of the nuclease domains via distant interactions of the helicase or MTase-TRD. Direct nuclease dimerization is not required. To help explain the observed cleavage patterns, we also used exonuclease footprinting to demonstrate that individual Type ISP domains can swing off the DNA. This study lends further support to a model where DNA breaks are generated by multiple random nicks due to mobility of a collision complex with an overall DNA-binding footprint of ∼30 bp. PMID:26507855
The complete chloroplast genome sequence of Curcuma flaviflora (Curcuma).
Zhang, Yan; Deng, Jiabin; Li, Yangyi; Gao, Gang; Ding, Chunbang; Zhang, Li; Zhou, Yonghong; Yang, Ruiwu
2016-09-01
The complete chloroplast (cp) genome of Curcuma flaviflora, a medicinal plant in Southeast Asia, was sequenced. The genome size was 160 478 bp in length, with 36.3% GC content. A pair of inverted repeats (IRs) of 26 946 bp were separated by a large single copy (LSC) of 88 008 bp and a small single copy (SSC) of 18 578 bp, respectively. The cp genome contained 132 annotated genes, including 79 protein coding genes, 30 tRNA genes, and four rRNA genes. And 19 of these genes were duplicated in inverted repeat regions.
Rasty, S; Poliani, P L; Fink, D J; Glorioso, J C
1997-08-01
A distinctive feature of the genetic make-up of herpes simplex virus type 1 (HSV-1), a human neurotropic virus, is that approximately half of the 81 known viral genes are not absolutely required for productive infection in Vero cells, and most can be individually deleted without substantially impairing viral replication in cell culture. If large blocks of contiguous viral genes could be replaced with foreign DNA sequences, it would be possible to engineer highly attenuated recombinant HSV-1 gene transfer vectors capable of carrying large cellular genes or multiple genes having related functions. We report the isolation and characterization of an HSV-1 mutant, designated d311, containing a 12 kb deletion of viral DNA located between the L-S Junction a sequence and the U(S)6 gene, spanning the S component inverted repeat sequence c' and the nonessential genes U(S)1 through U(S)5. Replication of d311 was totally inhibited in rat B103 and mouse Neuro-2A neuroblastoma cell lines, and was reduced by over three orders of magnitude in human SK-N-SH neuroblastoma cells compared to wild-type (wt) HSV-1 KOS. This suggested that the deleted genes, while nonessential for replication in Vero cells, play an important role in HSV replication in neuronal cells, particularly those of rodent origin. Unlike wt KOS which replicated locally and spread to other regions of brain following stereotactic inoculation into rat hippocampus, d311 was unable to replicate and spread within the brain, and did not cause any apparent local neuronal cell damage. These results demonstrate that d311 is highly attenuated for the rat central nervous system. d311 and other mutants of HSV containing major deletions of the nonessential genes within U(S) have the potential to serve as useful tools for gene transfer applications to brain.
Lei, Wanjun; Ni, Dapeng; Wang, Yujun; Shao, Junjie; Wang, Xincun; Yang, Dan; Wang, Jinsheng; Chen, Haimei; Liu, Chang
2016-02-22
Astragalus membranaceus is an important medicinal plant in Asia. Several of its varieties have been used interchangeably as raw materials for commercial production. High resolution genetic markers are in urgent need to distinguish these varieties. Here, we sequenced and analyzed the chloroplast genome of A. membranaceus (Fisch.) Bunge var. mongholicus (Bunge) P.K. Hsiao using the next generation DNA sequencing technology. The genome was assembled using Abyss and then subjected to gene prediction using CPGAVAS and repeat analysis using MISA, Tandem Repeats Finder, and REPuter. Finally, the genome was subjected phylogenetic and comparative genomic analyses. The complete genome is 123,582 bp long, containing only one copy of the inverted repeat. Gene prediction revealed 110 genes encoding 76 proteins, 30 tRNAs, and four rRNAs. Five intra-specific hypermutation loci were identified, three of which are heteroplasmic. Furthermore, three gene losses and two large inversions were identified. Comparative genomic analyses demonstrated the dynamic nature of the Papilionoideae chloroplast genomes, which showed occurrence of numerous hypermutation loci, frequent gene losses, and fragment inversions. Results obtained herein elucidate the complex evolutionary history of chloroplast genomes and have laid the foundation for the identification of genetic markers to distinguish A. membranaceus varieties.
Epigenetic events underlie the pathogenesis of sinonasal papillomas.
Stephen, Josena K; Vaught, Lori E; Chen, Kang M; Sethi, Seema; Shah, Veena; Benninger, Michael S; Gardner, Glendon M; Schweitzer, Vanessa G; Khan, Mumtaz; Worsham, Maria J
2007-10-01
Benign inverted papillomas have been reported as monoclonal but lacking common genetic alterations identified in squamous cell carcinoma of the head and neck. Epigenetic changes alter the heritable state of gene expression and chromatin organization without change in DNA sequence. We investigated whether epigenetic events of aberrant promoter hypermethylation in genes known to be involved in squamous head and neck cancer underlie the pathogenesis of sinonasal papillomas. Ten formalin-fixed paraffin DNA samples from three inverted papilloma cases, two exophytic (everted) papilloma cases, and two cases with inverted and exophytic components were studied. DNA was obtained from microdissected areas of normal and papilloma areas and examined using a panel of 41 gene probes, designed to interrogate 35 unique genes for aberrant methylation status (22 genes) using the methylation-specific multiplex-ligation-specific polymerase assay. Methylation-specific PCR was employed to confirm aberrant methylation detected by the methylation-specific multiplex-ligation-specific polymerase assay. All seven cases indicated at least one epigenetic event of aberrant promoter hypermethylation. The CDKN2B gene was a consistent target of aberrant methylation in six of seven cases. Methylation-specific PCR confirmed hypermethylation of CDKN2B. Recurrent biopsies from two inverted papilloma cases had common epigenetic events. Promoter hypermethylation of CDKN2B was a consistent epigenetic event. Common epigenetic alterations in recurrent biopsies underscore a monoclonal origin for these lesions. Epigenetic events contribute to the underlying pathogenesis of benign inverted and exophytic papillomas. As a consistent target of aberrant promoter hypermethylation, CDKN2B may serve as an important epigenetic biomarker for gene reactivation studies.
Auvray, Frédéric; Coddeville, Michèle; Ordonez, Romy Catoira; Ritzenthaler, Paul
1999-01-01
The temperate phage mv4 integrates its genome into the chromosome of Lactobacillus delbrueckii subsp. bulgaricus by site-specific recombination within the 3′ end of a tRNASer gene. Recombination is catalyzed by the phage-encoded integrase and occurs between the phage attP site and the bacterial attB site. In this study, we show that the mv4 integrase functions in vivo in Escherichia coli and we characterize the bacterial attB site with a site-specific recombination test involving compatible plasmids carrying the recombination sites. The importance of particular nucleotides within the attB sequence was determined by site-directed mutagenesis. The structure of the attB site was found to be simple but rather unusual. A 16-bp DNA fragment was sufficient for function. Unlike most genetic elements that integrate their DNA into tRNA genes, none of the dyad symmetry elements of the tRNASer gene were present within the minimal attB site. No inverted repeats were detected within this site either, in contrast to the lambda site-specific recombination model. PMID:10572145
Smith, David Roy; Hua, Jimeng; Archibald, John M.; Lee, Robert W.
2013-01-01
Organelle DNA is no stranger to palindromic repeats. But never has a mitochondrial or plastid genome been described in which every coding region is part of a distinct palindromic unit. While sequencing the mitochondrial DNA of the nonphotosynthetic green alga Polytomella magna, we uncovered precisely this type of genic arrangement. The P. magna mitochondrial genome is linear and made up entirely of palindromes, each containing 1–7 unique coding regions. Consequently, every gene in the genome is duplicated and in an inverted orientation relative to its partner. And when these palindromic genes are folded into putative stem-loops, their predicted translational start sites are often positioned in the apex of the loop. Gel electrophoresis results support the linear, 28-kb monomeric conformation of the P. magna mitochondrial genome. Analyses of other Polytomella taxa suggest that palindromic mitochondrial genes were present in the ancestor of the Polytomella lineage and lost or retained to various degrees in extant species. The possible origins and consequences of this bizarre genomic architecture are discussed. PMID:23940100
Complete Chloroplast Genome Sequences of Important Oilseed Crop Sesamum indicum L
Yi, Dong-Keun; Kim, Ki-Joong
2012-01-01
Sesamum indicum is an important crop plant species for yielding oil. The complete chloroplast (cp) genome of S. indicum (GenBank acc no. JN637766) is 153,324 bp in length, and has a pair of inverted repeat (IR) regions consisting of 25,141 bp each. The lengths of the large single copy (LSC) and the small single copy (SSC) regions are 85,170 bp and 17,872 bp, respectively. Comparative cp DNA sequence analyses of S. indicum with other cp genomes reveal that the genome structure, gene order, gene and intron contents, AT contents, codon usage, and transcription units are similar to the typical angiosperm cp genomes. Nucleotide diversity of the IR region between Sesamum and three other cp genomes is much lower than that of the LSC and SSC regions in both the coding region and noncoding region. As a summary, the regional constraints strongly affect the sequence evolution of the cp genomes, while the functional constraints weakly affect the sequence evolution of cp genomes. Five short inversions associated with short palindromic sequences that form step-loop structures were observed in the chloroplast genome of S. indicum. Twenty-eight different simple sequence repeat loci have been detected in the chloroplast genome of S. indicum. Almost all of the SSR loci were composed of A or T, so this may also contribute to the A-T richness of the cp genome of S. indicum. Seven large repeated loci in the chloroplast genome of S. indicum were also identified and these loci are useful to developing S. indicum-specific cp genome vectors. The complete cp DNA sequences of S. indicum reported in this paper are prerequisite to modifying this important oilseed crop by cp genetic engineering techniques. PMID:22606240
Wei, Liya; Gu, Lianfeng; Song, Xianwei; Cui, Xiekui; Lu, Zhike; Zhou, Ming; Wang, Lulu; Hu, Fengyi; Zhai, Jixian; Meyers, Blake C.; Cao, Xiaofeng
2014-01-01
Transposable elements (TEs) and repetitive sequences make up over 35% of the rice (Oryza sativa) genome. The host regulates the activity of different TEs by different epigenetic mechanisms, including DNA methylation, histone H3K9 methylation, and histone H3K4 demethylation. TEs can also affect the expression of host genes. For example, miniature inverted repeat TEs (MITEs), dispersed high copy-number DNA TEs, can influence the expression of nearby genes. In plants, 24-nt small interfering RNAs (siRNAs) are mainly derived from repeats and TEs. However, the extent to which TEs, particularly MITEs associated with 24-nt siRNAs, affect gene expression remains elusive. Here, we show that the rice Dicer-like 3 homolog OsDCL3a is primarily responsible for 24-nt siRNA processing. Impairing OsDCL3a expression by RNA interference caused phenotypes affecting important agricultural traits; these phenotypes include dwarfism, larger flag leaf angle, and fewer secondary branches. We used small RNA deep sequencing to identify 535,054 24-nt siRNA clusters. Of these clusters, ∼82% were OsDCL3a-dependent and showed significant enrichment of MITEs. Reduction of OsDCL3a function reduced the 24-nt siRNAs predominantly from MITEs and elevated expression of nearby genes. OsDCL3a directly targets genes involved in gibberellin and brassinosteroid homeostasis; OsDCL3a deficiency may affect these genes, thus causing the phenotypes of dwarfism and enlarged flag leaf angle. Our work identifies OsDCL3a-dependent 24-nt siRNAs derived from MITEs as broadly functioning regulators for fine-tuning gene expression, which may reflect a conserved epigenetic mechanism in higher plants with genomes rich in dispersed repeats or TEs. PMID:24554078
Zeng, Fan-chun; Gao, Cheng-wen; Gao, Li-zhi
2016-01-01
The complete chloroplast genome sequence of American bird pepper (Capsicum annuum var. glabriusculum) is reported and characterized in this study. The genome size is 156,612 bp, containing a pair of inverted repeats (IRs) of 25,776 bp separated by a large single-copy region of 87,213 bp and a small single-copy region of 17,851 bp. The chloroplast genome harbors 130 known genes, including 89 protein-coding genes, 8 ribosomal RNA genes, and 37 tRNA genes. A total of 18 of these genes are duplicated in the inverted repeat regions, 16 genes contain 1 intron, and 2 genes and one ycf have 2 introns.
Huang, Ya-Yi; Matzke, Antonius J. M.; Matzke, Marjori
2013-01-01
Coconut, a member of the palm family (Arecaceae), is one of the most economically important trees used by mankind. Despite its diverse morphology, coconut is recognized taxonomically as only a single species (Cocos nucifera L.). There are two major coconut varieties, tall and dwarf, the latter of which displays traits resulting from selection by humans. We report here the complete chloroplast (cp) genome of a dwarf coconut plant, and describe the gene content and organization, inverted repeat fluctuations, repeated sequence structure, and occurrence of RNA editing. Phylogenetic relationships of monocots were inferred based on 47 chloroplast protein-coding genes. Potential nodes for events of gene duplication and pseudogenization related to inverted repeat fluctuation were mapped onto the tree using parsimony criteria. We compare our findings with those from other palm species for which complete cp genome sequences are available. PMID:24023703
Huang, Ya-Yi; Matzke, Antonius J M; Matzke, Marjori
2013-01-01
Coconut, a member of the palm family (Arecaceae), is one of the most economically important trees used by mankind. Despite its diverse morphology, coconut is recognized taxonomically as only a single species (Cocos nucifera L.). There are two major coconut varieties, tall and dwarf, the latter of which displays traits resulting from selection by humans. We report here the complete chloroplast (cp) genome of a dwarf coconut plant, and describe the gene content and organization, inverted repeat fluctuations, repeated sequence structure, and occurrence of RNA editing. Phylogenetic relationships of monocots were inferred based on 47 chloroplast protein-coding genes. Potential nodes for events of gene duplication and pseudogenization related to inverted repeat fluctuation were mapped onto the tree using parsimony criteria. We compare our findings with those from other palm species for which complete cp genome sequences are available.
Philippe, Cécile; Krupovic, Mart; Jaomanjaka, Fety; Claisse, Olivier; Petrel, Melina; le Marrec, Claire
2018-01-16
The Gluconobacter phage GC1 is a novel member of the Tectiviridae family isolated from a juice sample collected during dry white wine making. The bacteriophage infects Gluconobacter cerinus , an acetic acid bacterium which represents a spoilage microorganism during wine making, mainly because it is able to produce ethyl alcohol and transform it into acetic acid. Transmission electron microscopy revealed tail-less icosahedral particles with a diameter of ~78 nm. The linear double-stranded DNA genome of GC1 (16,523 base pairs) contains terminal inverted repeats and carries 36 open reading frames, only a handful of which could be functionally annotated. These encode for the key proteins involved in DNA replication (protein-primed family B DNA polymerase) as well as in virion structure and assembly (major capsid protein, genome packaging ATPase (adenosine triphosphatase) and several minor capsid proteins). GC1 is the first tectivirus infecting an alphaproteobacterial host and is thus far the only temperate tectivirus of gram-negative bacteria. Based on distinctive sequence and life-style features, we propose that GC1 represents a new genus within the Tectiviridae , which we tentatively named " Gammatectivirus ". Furthermore, GC1 helps to bridge the gap in the sequence space between alphatectiviruses and betatectiviruses.
Genomic characterization of a novel poxvirus from a flying fox: evidence for a new genus?
O'Dea, Mark A; Tu, Shin-Lin; Pang, Stanley; De Ridder, Thomas; Jackson, Bethany; Upton, Chris
2016-09-01
The carcass of an Australian little red flying fox (Pteropus scapulatus) which died following entrapment on a fence was submitted to the laboratory for Australian bat lyssavirus exclusion testing, which was negative. During post-mortem, multiple nodules were noted on the wing membranes, and therefore degenerate PCR primers targeting the poxvirus DNA polymerase gene were used to screen for poxviruses. The poxvirus PCR screen was positive and sequencing of the PCR product demonstrated very low, but significant, similarity with the DNA polymerase gene from members of the Poxviridae family. Next-generation sequencing of DNA extracted from the lesions returned a contig of 132 353 nucleotides (nt), which was further extended to produce a near full-length viral genome of 133 492 nt. Analysis of the genome revealed it to be AT-rich with inverted terminal repeats of at least 1314 nt and to contain 143 predicted genes. The genome contains a surprisingly large number (29) of genes not found in other poxviruses, one of which appears to be a homologue of the mammalian TNF-related apoptosis-inducing ligand (TRAIL) gene. Phylogenetic analysis indicates that the poxvirus described here is not closely related to any other poxvirus isolated from bats or other species, and that it likely should be placed in a new genus.
Wang, Dan; Zhao, Jieyu; Bai, Yan; Ao, You; Guo, Changhong
2017-08-10
Gametocidal (Gc) chromosomes can ensure their preferential transmission by killing the gametes without themselves through causing chromosome breakage and therefore have been exploited as an effective tool for genetic breeding. However, to date very little is known about the molecular mechanism of Gc action. In this study, we used methylation-sensitive amplified polymorphism (MSAP) technique to assess the extent and pattern of cytosine methylation alterations at the whole genome level between two lines of wheat Gc addition line and their common wheat parent. The results indicated that the overall levels of cytosine methylation of two studied Gc addition lines (CS-3C and CS-3C3C, 48.68% and 48.65%, respectively) were significantly increased when compared to common wheat CS (41.31%) and no matter fully methylated or hemimethylated rates enhanced in Gc addition lines. A set of 30 isolated fragments that showed different DNA methylation or demethylation patterns between the three lines were sequenced and the results indicated that 8 fragments showed significant homology to known sequences, of which three were homologous to MITE transposon (Miniature inverted-repeat transposable elements), LTR-retrotransposon WIS-1p and retrotransposon Gypsy , respectively. Overall, our results showed that DNA methylation could play a role in the Gc action.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shashi, V.; Allinson, P.S.; Golden, W.L.
1994-09-01
Recent studies in yeast have shown that telomeres rather than centromeres lead in chromosome movement just prior to meiosis and may have a role in recombination. Cytological studies of meiosis in Drosophila and mice have shown that in pericentric inversion heterozygotes there is lack of loop formation, with recobmination seen only outside the inversion. In a family with Duchenne muscular dystrophy (DMD) we recognized that only affected males and carrier females had a pericentric X chromosome inversion (inv X(p11.4;q26)). Since the short arm inversion breakpoint was proximal to the DMD locus, it could not be implicated in the mutational eventmore » causing DMD. There was no history of infertility, recurrent miscarriages or liveborn unbalanced females to suggest there was recombination within the inversion. We studied 22 members over three generations to understand the pattern of meiotic recombination between the normal and the inverted X chromosome. In total, 17 meioses involving the inverted X chromosome in females were studied by cytogenetic analysis and 16 CA repeat polymorphisms along the length of the X chromosome. Results: (a) There was complete concordance between the segregation of the DMD mutation and the inverted X chromosome. (b) On DNA analysis, there was complete absence of recombination within the inverted segment. We also found no recombination at the DMD locus. Recombination was seen only at Xp22 and Xq27-28. (c) Recombination was seen in the same individual at both Xp22 and Xq27-28 without recombination otherwise. Conclusions: (1) Pericentric X inversions reduce the genetic map length of the chromosome, with the physical map length being normal. (2) Meiotic X chromosome pairing in this family is initiated at the telomeres. (3) Following telomeric pairing in pericentric X chromosome inversions, there is inhibition of recombination within the inversion and adjacent regions.« less
Henke, Sarah K; Cronan, John E
2016-11-01
Group II biotin protein ligases (BPLs) are characterized by the presence of an N-terminal DNA binding domain that functions in transcriptional regulation of the genes of biotin biosynthesis and transport. The Staphylococcus aureus Group II BPL which is called BirA has been reported to bind an imperfect inverted repeat located upstream of the biotin synthesis operon. DNA binding by other Group II BPLs requires dimerization of the protein which is triggered by synthesis of biotinoyl-AMP (biotinoyl-adenylate), the intermediate in the ligation of biotin to its cognate target proteins. However, the S. aureus BirA was reported to dimerize and bind DNA in the absence of biotin or biotinoyl-AMP (Soares da Costa et al. (2014) Mol Microbiol 91: 110-120). These in vitro results argued that the protein would be unable to respond to the levels of biotin or acceptor proteins and thus would lack the regulatory properties of the other characterized BirA proteins. We tested the regulatory function of the protein using an in vivo model system and examined its DNA binding properties in vitro using electrophoretic mobility shift and fluorescence anisotropy analyses. We report that the S. aureus BirA is an effective regulator of biotin operon transcription and that the prior data can be attributed to artifacts of mobility shift analyses. We also report that deletion of the DNA binding domain of the S. aureus BirA results in loss of virtually all of its ligation activity. © 2016 John Wiley & Sons Ltd.
Park, Kyu Tae
2017-01-01
Symplocarpus renifolius is a member of Araceae family that is extraordinarily diverse in appearance. Previous studies on chloroplast genomes in Araceae were focused on duckweeds (Lemnoideae) and root crops (Colocasia, commonly known as taro). Here, we determined the chloroplast genome of Symplocarpus renifolius and compared the factors, such as genes and inverted repeat (IR) junctions and performed phylogenetic analysis using other Araceae species. The chloroplast genome of S. renifolius is 158,521 bp and includes 113 genes. A comparison among the Araceae chloroplast genomes showed that infA in Lemna, Spirodela, Wolffiella, Wolffia, Dieffenbachia and Colocasia has been lost or has become a pseudogene and has only been retained in Symplocarpus. In the Araceae chloroplast DNA (cpDNA), psbZ is retained. However, psbZ duplication occurred in Wolffia species and tandem repeats were noted around the duplication regions. A comparison of the IR junction in Araceae species revealed the presence of ycf1 and rps15 in the small single copy region, whereas duckweed species contained ycf1 and rps15 in the IR region. The phylogenetic analyses of the chloroplast genomes revealed that Symplocarpus are a basal group and are sister to the other Araceae species. Consequently, infA deletion or pseudogene events in Araceae occurred after the divergence of Symplocarpus and aquatic plants (duckweeds) in Araceae and duplication events of rps15 and ycf1 occurred in the IR region. PMID:29144427
Yi, Xuan; Gao, Lei; Wang, Bo; Su, Ying-Juan; Wang, Ting
2013-01-01
We have determined the complete chloroplast (cp) genome sequence of Cephalotaxus oliveri. The genome is 134,337 bp in length, encodes 113 genes, and lacks inverted repeat (IR) regions. Genome-wide mutational dynamics have been investigated through comparative analysis of the cp genomes of C. oliveri and C. wilsoniana. Gene order transformation analyses indicate that when distinct isomers are considered as alternative structures for the ancestral cp genome of cupressophyte and Pinaceae lineages, it is not possible to distinguish between hypotheses favoring retention of the same IR region in cupressophyte and Pinaceae cp genomes from a hypothesis proposing independent loss of IRA and IRB. Furthermore, in cupressophyte cp genomes, the highly reduced IRs are replaced by short repeats that have the potential to mediate homologous recombination, analogous to the situation in Pinaceae. The importance of repeats in the mutational dynamics of cupressophyte cp genomes is also illustrated by the accD reading frame, which has undergone extreme length expansion in cupressophytes. This has been caused by a large insertion comprising multiple repeat sequences. Overall, we find that the distribution of repeats, indels, and substitutions is significantly correlated in Cephalotaxus cp genomes, consistent with a hypothesis that repeats play a role in inducing substitutions and indels in conifer cp genomes.
The whole chloroplast genome of wild rice (Oryza australiensis).
Wu, Zhiqiang; Ge, Song
2016-01-01
The whole chloroplast genome of wild rice (Oryza australiensis) is characterized in this study. The genome size is 135,224 bp, exhibiting a typical circular structure including a pair of 25,776 bp inverted repeats (IRa,b) separated by a large single-copy region (LSC) of 82,212 bp and a small single-copy region (SSC) of 12,470 bp. The overall GC content of the genome is 38.95%. 110 unique genes were annotated, including 76 protein-coding genes, 4 ribosomal RNA genes, and 30t RNA genes. Among these, 18 are duplicated in the inverted repeat regions, 13 genes contain one intron, and 2 genes (rps12 and ycf3) have two introns.
Structure and Function of Na+-Symporters with Inverted Repeats
Abramson, Jeff; Wright, Ernest M.
2009-01-01
Summary Symporters are membrane proteins that couple energy stored in electrochemical potential gradients to drive the cotransport of molecules and ions into cells. Traditionally, proteins are classified into gene families based on sequence homology and functional properties, e.g. the sodium glucose (SLC5 or Sodium Solute Symporter Family, SSS or SSF) and GABA (SLC6 or Neurotransmitter Sodium Symporter Family, NSS or SNF) symporter families [1-4]. Recently, it has been established that four Na+-symporter proteins with unrelated sequences have a common structural core containing an inverted repeat of 5 transmembrane (TM) helices [5-8]. Analysis of these four structures reveals that they reside in different conformations along the transport cycle providing atomic insight into the mechanism of sodium solute cotransport. PMID:19631523
The complete chloroplast genome of salt cress (Eutrema salsugineum).
Guo, Xinyi; Hao, Guoqian; Ma, Tao
2016-07-01
The complete chloroplast (cp) sequence of the salt cress (Eutrema salsugineum), a plant well-adapted to salt stress, was presented in this study. The circular molecule is 153,407 bp in length and exhibit a typical quadripartite structure containing an 83,894 bp large single copy (LSC) region, a 17,607 bp small single copy (SSC) region, and the two 25,953 bp inverted repeats (IRs). The salt cress cp genome contains 135 known genes, including 87 protein-coding genes, 8 ribosomal RNA genes, and 40 tRNA genes; 21 of these are located in the inverted repeat region. As expected, phylogenetic analysis support the idea that E. salsugineum is sister to Brassiceae species within the Brassicaceae family.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2005-01-01
Background The Streptophyta comprise all land plants and six monophyletic groups of charophycean green algae. Phylogenetic analyses of four genes from three cellular compartments support the following branching order for these algal lineages: Mesostigmatales, Chlorokybales, Klebsormidiales, Zygnematales, Coleochaetales and Charales, with the last lineage being sister to land plants. Comparative analyses of the Mesostigma viride (Mesostigmatales) and land plant chloroplast genome sequences revealed that this genome experienced many gene losses, intron insertions and gene rearrangements during the evolution of charophyceans. On the other hand, the chloroplast genome of Chaetosphaeridium globosum (Coleochaetales) is highly similar to its land plant counterparts in terms of gene content, intron composition and gene order, indicating that most of the features characteristic of land plant chloroplast DNA (cpDNA) were acquired from charophycean green algae. To gain further insight into when the highly conservative pattern displayed by land plant cpDNAs originated in the Streptophyta, we have determined the cpDNA sequences of the distantly related zygnematalean algae Staurastrum punctulatum and Zygnema circumcarinatum. Results The 157,089 bp Staurastrum and 165,372 bp Zygnema cpDNAs encode 121 and 125 genes, respectively. Although both cpDNAs lack an rRNA-encoding inverted repeat (IR), they are substantially larger than Chaetosphaeridium and land plant cpDNAs. This increased size is explained by the expansion of intergenic spacers and introns. The Staurastrum and Zygnema genomes differ extensively from one another and from their streptophyte counterparts at the level of gene order, with the Staurastrum genome more closely resembling its land plant counterparts than does Zygnema cpDNA. Many intergenic regions in Zygnema cpDNA harbor tandem repeats. The introns in both Staurastrum (8 introns) and Zygnema (13 introns) cpDNAs represent subsets of those found in land plant cpDNAs. They represent 16 distinct insertion sites, only five of which are shared by the two zygnematalean genomes. Three of these insertions sites have not been identified in Chaetosphaeridium cpDNA. Conclusion The chloroplast genome experienced substantial changes in overall structure, gene order, and intron content during the evolution of the Zygnematales. Most of the features considered earlier as typical of land plant cpDNAs probably originated before the emergence of the Zygnematales and Coleochaetales. PMID:16236178
de Cambiaire, Jean-Charles; Otis, Christian; Lemieux, Claude; Turmel, Monique
2006-01-01
Background The phylum Chlorophyta contains the majority of the green algae and is divided into four classes. While the basal position of the Prasinophyceae is well established, the divergence order of the Ulvophyceae, Trebouxiophyceae and Chlorophyceae (UTC) remains uncertain. The five complete chloroplast DNA (cpDNA) sequences currently available for representatives of these classes display considerable variability in overall structure, gene content, gene density, intron content and gene order. Among these genomes, that of the chlorophycean green alga Chlamydomonas reinhardtii has retained the least ancestral features. The two single-copy regions, which are separated from one another by the large inverted repeat (IR), have similar sizes, rather than unequal sizes, and differ radically in both gene contents and gene organizations relative to the single-copy regions of prasinophyte and ulvophyte cpDNAs. To gain insights into the various changes that underwent the chloroplast genome during the evolution of chlorophycean green algae, we have sequenced the cpDNA of Scenedesmus obliquus, a member of a distinct chlorophycean lineage. Results The 161,452 bp IR-containing genome of Scenedesmus features single-copy regions of similar sizes, encodes 96 genes, i.e. only two additional genes (infA and rpl12) relative to its Chlamydomonas homologue and contains seven group I and two group II introns. It is clearly more compact than the four UTC algal cpDNAs that have been examined so far, displays the lowest proportion of short repeats among these algae and shows a stronger bias in clustering of genes on the same DNA strand compared to Chlamydomonas cpDNA. Like the latter genome, Scenedesmus cpDNA displays only a few ancestral gene clusters. The two chlorophycean genomes share 11 gene clusters that are not found in previously sequenced trebouxiophyte and ulvophyte cpDNAs as well as a few genes that have an unusual structure; however, their single-copy regions differ considerably in gene content. Conclusion Our results underscore the remarkable plasticity of the chlorophycean chloroplast genome. Owing to this plasticity, only a sketchy portrait could be drawn for the chloroplast genome of the last common ancestor of Scenedesmus and Chlamydomonas. PMID:16638149
Ait-Arkoub, Zaïna; Voujon, Delphine; Deback, Claire; Abrao, Emiliana P.; Agut, Henri; Boutolleau, David
2013-01-01
The complete 154-kbp linear double-stranded genomic DNA sequence of herpes simplex virus 2 (HSV-2), consisting of two extended regions of unique sequences bounded by a pair of inverted repeat elements, was published in 1998 and since then has been widely employed in a wide range of studies. Throughout the HSV-2 genome are scattered 150 microsatellites (also referred to as short tandem repeats) of 1- to 6-nucleotide motifs, mainly distributed in noncoding regions. Microsatellites are considered reliable markers for genetic mapping to differentiate herpesvirus strains, as shown for cytomegalovirus and HSV-1. The aim of this work was to characterize 12 polymorphic microsatellites within the HSV-2 genome by use of 3 multiplex PCR assays in combination with length polymorphism analysis for the rapid genetic differentiation of 56 HSV-2 clinical isolates and 2 HSV-2 laboratory strains (gHSV-2 and MS). This new system was applied to a specific new HSV-2 variant recently identified in HIV-1-infected patients originating from West Africa. Our results confirm that microsatellite polymorphism analysis is an accurate tool for studying the epidemiology of HSV-2 infections. PMID:23966512
Human structural variation: mechanisms of chromosome rearrangements
Weckselblatt, Brooke; Rudd, M. Katharine
2015-01-01
Chromosome structural variation (SV) is a normal part of variation in the human genome, but some classes of SV can cause neurodevelopmental disorders. Analysis of the DNA sequence at SV breakpoints can reveal mutational mechanisms and risk factors for chromosome rearrangement. Large-scale SV breakpoint studies have become possible recently owing to advances in next-generation sequencing (NGS) including whole-genome sequencing (WGS). These findings have shed light on complex forms of SV such as triplications, inverted duplications, insertional translocations, and chromothripsis. Sequence-level breakpoint data resolve SV structure and determine how genes are disrupted, fused, and/or misregulated by breakpoints. Recent improvements in breakpoint sequencing have also revealed non-allelic homologous recombination (NAHR) between paralogous long interspersed nuclear element (LINE) or human endogenous retrovirus (HERV) repeats as a cause of deletions, duplications, and translocations. This review covers the genomic organization of simple and complex constitutional SVs, as well as the molecular mechanisms of their formation. PMID:26209074
Characterization of the Fb-Nof Transposable Element of Drosophila Melanogaster
Harden, N.; Ashburner, M.
1990-01-01
FB-NOF is a composite transposable element of Drosophila melanogaster. It is composed of foldback sequences, of variable length, which flank a 4-kb NOF sequence with 308-bp inverted repeat termini. The NOF sequence could potentially code for a 120-kD polypeptide. The FB-NOF element is responsible for unstable mutations of the white gene (w(c) and w(DZL)) and is associated with the large TEs of G. Ising. Although most strains of D. melanogaster have 20-30 sites of FB insertion, FB-NOF elements are usually rare, many strains lack this composite element or have only one copy of it. A few strains, including w(DZL) and Basc have many (8-21) copies of FB-NOF, and these show a tendency to insert at ``hot-spots.'' These strains also have an increased number of FB elements. The DNA sequence of the NOF region associated with TE146(Z) has been determined. PMID:2174013
Kouprina, Natalay; Samoshkin, Alexander; Erliandri, Indri; Nakano, Megumi; Lee, Hee-Sheung; Fu, Haiging; Iida, Yuichi; Aladjem, Mirit; Oshimura, Mitsuo; Masumoto, Hiroshi; Earnshaw, William C.; Larionov, Vladimir
2012-01-01
Human artificial chromosomes (HACs) represent a novel promising episomal system for functional genomics, gene therapy and synthetic biology. HACs are engineered from natural and synthetic alphoid DNA arrays upon transfection into human cells. The use of HACs for gene expression studies requires the knowledge of their structural organization. However, none of de novo HACs constructed so far has been physically mapped in detail. Recently we constructed a synthetic alphoidtetO-HAC that was successfully used for expression of full-length genes to correct genetic deficiencies in human cells. The HAC can be easily eliminated from cell populations by inactivation of its conditional kinetochore. This unique feature provides a control for phenotypic changes attributed to expression of HAC-encoded genes. This work describes organization of a megabase-size synthetic alphoid DNA array in the alphoidtetO-HAC that has been formed from a ~50 kb synthetic alphoidtetO-construct. Our analysis showed that this array represents a 1.1 Mb continuous sequence assembled from multiple copies of input DNA, a significant part of which was rearranged before assembling. The tandem and inverted alphoid DNA repeats in the HAC range in size from 25 to 150 kb. In addition, we demonstrated that the structure and functional domains of the HAC remains unchanged after several rounds of its transfer into different host cells. The knowledge of the alphoidtetO-HAC structure provides a tool to control HAC integrity during different manipulations. Our results also shed light on a mechanism for de novo HAC formation in human cells. PMID:23411994
Breaks in the 45S rDNA Lead to Recombination-Mediated Loss of Repeats.
Warmerdam, Daniël O; van den Berg, Jeroen; Medema, René H
2016-03-22
rDNA repeats constitute the most heavily transcribed region in the human genome. Tumors frequently display elevated levels of recombination in rDNA, indicating that the repeats are a liability to the genomic integrity of a cell. However, little is known about how cells deal with DNA double-stranded breaks in rDNA. Using selective endonucleases, we show that human cells are highly sensitive to breaks in 45S but not the 5S rDNA repeats. We find that homologous recombination inhibits repair of breaks in 45S rDNA, and this results in repeat loss. We identify the structural maintenance of chromosomes protein 5 (SMC5) as contributing to recombination-mediated repair of rDNA breaks. Together, our data demonstrate that SMC5-mediated recombination can lead to error-prone repair of 45S rDNA repeats, resulting in their loss and thereby reducing cellular viability. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2016-09-19
To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G planctonica and 262,888-bp G sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2016-01-01
Abstract To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G. planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G. planctonica and 262,888-bp G. sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G. sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. PMID:27503298
Zou, Hong; Zhang, Jin; Li, Wenxiang; Wu, Shangong; Wang, Guitang
2012-01-01
The 17,922 base pairs (bp) nucleotide sequence of the linear mitochondrial DNA (mtDNA) molecule of the freshwater jellyfish Craspedacusta sowerbyi (Hydrozoa, Trachylina, Limnomedusae) has been determined. This sequence exhibits surprisingly low A+T content (57.1%), containing genes for 13 energy pathway proteins, a small and a large subunit rRNAs, and methionine and tryptophan tRNAs. Mitochondrial ancestral medusozoan gene order (AMGO) was found in the C. sowerbyi, as those found in Cubaia aphrodite (Hydrozoa, Trachylina, Limnomedusae), discomedusan Scyphozoa and Staurozoa. The genes of C. sowerbyi mtDNA are arranged in two clusters with opposite transcriptional polarities, whereby transcription proceeds toward the ends of the DNA molecule. Identical inverted terminal repeats (ITRs) flank the ends of the mitochondrial DNA molecule, a characteristic typical of medusozoans. In addition, two open reading frames (ORFs) of 354 and 1611 bp in length were found downstream of the large subunit rRNA gene, similar to the two ORFs of ORF314 and polB discovered in the linear mtDNA of C. aphrodite, discomedusan Scyphozoa and Staurozoa. Phylogenetic analyses of C. sowerbyi and other cnidarians were carried out based on both nucleotide and inferred amino acid sequences of the 13 mitochondrial energy pathway genes. Our working hypothesis supports the monophyletic Medusozoa being a sister group to Octocorallia (Cnidaria, Anthozoa). Within Medusozoa, the phylogenetic analysis suggests that Staurozoa may be the earliest diverging class and the sister group of all other medusozoans. Cubozoa and coronate Scyphozoa form a clade that is the sister group of Hydrozoa plus discomedusan Scyphozoa. Hydrozoa is the sister group of discomedusan Scyphozoa. Semaeostomeae is a paraphyletic clade with Rhizostomeae, while Limnomedusae (Trachylina) is the sister group of hydroidolinans and may be the earliest diverging lineage among Hydrozoa.
Zou, Hong; Zhang, Jin; Li, Wenxiang; Wu, Shangong; Wang, Guitang
2012-01-01
The 17,922 base pairs (bp) nucleotide sequence of the linear mitochondrial DNA (mtDNA) molecule of the freshwater jellyfish Craspedacusta sowerbyi (Hydrozoa,Trachylina, Limnomedusae) has been determined. This sequence exhibits surprisingly low A+T content (57.1%), containing genes for 13 energy pathway proteins, a small and a large subunit rRNAs, and methionine and tryptophan tRNAs. Mitochondrial ancestral medusozoan gene order (AMGO) was found in the C. sowerbyi, as those found in Cubaia aphrodite (Hydrozoa, Trachylina, Limnomedusae), discomedusan Scyphozoa and Staurozoa. The genes of C. sowerbyi mtDNA are arranged in two clusters with opposite transcriptional polarities, whereby transcription proceeds toward the ends of the DNA molecule. Identical inverted terminal repeats (ITRs) flank the ends of the mitochondrial DNA molecule, a characteristic typical of medusozoans. In addition, two open reading frames (ORFs) of 354 and 1611 bp in length were found downstream of the large subunit rRNA gene, similar to the two ORFs of ORF314 and polB discovered in the linear mtDNA of C. aphrodite, discomedusan Scyphozoa and Staurozoa. Phylogenetic analyses of C. sowerbyi and other cnidarians were carried out based on both nucleotide and inferred amino acid sequences of the 13 mitochondrial energy pathway genes. Our working hypothesis supports the monophyletic Medusozoa being a sister group to Octocorallia (Cnidaria, Anthozoa). Within Medusozoa, the phylogenetic analysis suggests that Staurozoa may be the earliest diverging class and the sister group of all other medusozoans. Cubozoa and coronate Scyphozoa form a clade that is the sister group of Hydrozoa plus discomedusan Scyphozoa. Hydrozoa is the sister group of discomedusan Scyphozoa. Semaeostomeae is a paraphyletic clade with Rhizostomeae, while Limnomedusae (Trachylina) is the sister group of hydroidolinans and may be the earliest diverging lineage among Hydrozoa. PMID:23240028
R-loops: targets for nuclease cleavage and repeat instability.
Freudenreich, Catherine H
2018-01-11
R-loops form when transcribed RNA remains bound to its DNA template to form a stable RNA:DNA hybrid. Stable R-loops form when the RNA is purine-rich, and are further stabilized by DNA secondary structures on the non-template strand. Interestingly, many expandable and disease-causing repeat sequences form stable R-loops, and R-loops can contribute to repeat instability. Repeat expansions are responsible for multiple neurodegenerative diseases, including Huntington's disease, myotonic dystrophy, and several types of ataxias. Recently, it was found that R-loops at an expanded CAG/CTG repeat tract cause DNA breaks as well as repeat instability (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Two factors were identified as causing R-loop-dependent breaks at CAG/CTG tracts: deamination of cytosines and the MutLγ (Mlh1-Mlh3) endonuclease, defining two new mechanisms for how R-loops can generate DNA breaks (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Following R-loop-dependent nicking, base excision repair resulted in repeat instability. These results have implications for human repeat expansion diseases and provide a paradigm for how RNA:DNA hybrids can cause genome instability at structure-forming DNA sequences. This perspective summarizes mechanisms of R-loop-induced fragility at G-rich repeats and new links between DNA breaks and repeat instability.
... medications or doctor visits! Yoga and Recreational Body Inversion The long-term effects of repeatedly assuming a ... shoulder and headstands or any other recreational body inversion exercises that result in head-down or inverted ...
Zhang, Huibin; Susanto, Teodorus T.; Wan, Yue
2016-01-01
Type 1 pili (T1P) are major virulence factors for uropathogenic Escherichia coli (UPEC), which cause both acute and recurrent urinary tract infections. T1P expression therefore is of direct relevance for disease. T1P are phase variable (both piliated and nonpiliated bacteria exist in a clonal population) and are controlled by an invertible DNA switch (fimS), which contains the promoter for the fim operon encoding T1P. Inversion of fimS is stochastic but may be biased by environmental conditions and other signals that ultimately converge at fimS itself. Previous studies of fimS sequences important for T1P phase variation have focused on laboratory-adapted E. coli strains and have been limited in the number of mutations or by alteration of the fimS genomic context. We surmounted these limitations by using saturating genomic mutagenesis of fimS coupled with accurate sequencing to detect both mutations and phase status simultaneously. In addition to the sequences known to be important for biasing fimS inversion, our method also identifies a previously unknown pair of 5′ UTR inverted repeats that act by altering the relative fimA levels to control phase variation. Thus we have uncovered an additional layer of T1P regulation potentially impacting virulence and the coordinate expression of multiple pilus systems. PMID:27035967
Zhang, Huibin; Susanto, Teodorus T; Wan, Yue; Chen, Swaine L
2016-04-12
Type 1 pili (T1P) are major virulence factors for uropathogenic Escherichia coli (UPEC), which cause both acute and recurrent urinary tract infections. T1P expression therefore is of direct relevance for disease. T1P are phase variable (both piliated and nonpiliated bacteria exist in a clonal population) and are controlled by an invertible DNA switch (fimS), which contains the promoter for the fim operon encoding T1P. Inversion of fimS is stochastic but may be biased by environmental conditions and other signals that ultimately converge at fimS itself. Previous studies of fimS sequences important for T1P phase variation have focused on laboratory-adapted E coli strains and have been limited in the number of mutations or by alteration of the fimS genomic context. We surmounted these limitations by using saturating genomic mutagenesis of fimS coupled with accurate sequencing to detect both mutations and phase status simultaneously. In addition to the sequences known to be important for biasing fimS inversion, our method also identifies a previously unknown pair of 5' UTR inverted repeats that act by altering the relative fimA levels to control phase variation. Thus we have uncovered an additional layer of T1P regulation potentially impacting virulence and the coordinate expression of multiple pilus systems.
Short Tandem Repeat DNA Internet Database
National Institute of Standards and Technology Data Gateway
SRD 130 Short Tandem Repeat DNA Internet Database (Web, free access) Short Tandem Repeat DNA Internet Database is intended to benefit research and application of short tandem repeat DNA markers for human identity testing. Facts and sequence information on each STR system, population data, commonly used multiplex STR systems, PCR primers and conditions, and a review of various technologies for analysis of STR alleles have been included.
Hamilton, P T; Reeve, J N
1985-01-01
DNA fragments cloned from the methanogenic archaebacterium Methanobrevibacter smithii which complement mutations in the purE and proC genes of E. coli have been sequenced. Sequence analyses, transposon mutagenesis and expression in E. coli minicells indicate that purE and proC complementations result from the synthesis of M. smithii polypeptides with molecular weights of 36,697 and 27,836 respectively. The encoding genes appear to be located in operons. The M. smithii genome contains 69% A/T basepairs (bp) which is reflected in unusual codon usages and intergenic regions containing approximately 85% A/T bp. An insertion element, designated ISM1, was found within the cloned M. smithii DNA located adjacent to the proC complementing region. ISM1 is 1381 bp in length, has 29 bp terminal inverted repeat sequences and contains one major ORF encoded in 87% of the ISM1 sequence. ISM1 is mobile, present in approximately 10 copies per genome and integration duplicates 8 bp at the site of insertion. The duplicated sequences show homology with sequences within the 29 bp terminal repeat sequence of ISM1. Comparison of our data with sequences from halophilic archaebacteria suggests that 5'GAANTTTCA and 5'TTTTAATATAAA may be consensus promoter sequences for archaebacteria. These sequences closely resemble the consensus sequences which precede Drosophila heat-shock genes (Pelham 1982; Davidson et al. 1983). Methanogens appear to employ the eubacterial system of mRNA: 16SrRNA hybridization to ensure initiation of translation; the consensus ribosome binding sequence is 5'AGGTGA.
Izsvák, Zsuzsanna; Khare, Dheeraj; Behlke, Joachim; Heinemann, Udo; Plasterk, Ronald H; Ivics, Zoltán
2002-09-13
Sleeping Beauty (SB) is the most active Tc1/mariner-like transposon in vertebrate species. Each of the terminal inverted repeats (IRs) of SB contains two transposase-binding sites (DRs). This feature, termed the IR/DR structure, is conserved in a group of Tc1-like transposons. The DNA-binding region of SB transposase, similar to the paired domain of Pax proteins, consists of two helix-turn-helix subdomains (PAI + RED = PAIRED). The N-terminal PAI subdomain was found to play a dominant role in contacting the DRs. Transposase was able to bind to mutant sites retaining the 3' part of the DRs; thus, primary DNA binding is not sufficient to determine the specificity of the transposition reaction. The PAI subdomain was also found to bind to a transpositional enhancer-like sequence within the left IR of SB, and to mediate protein-protein interactions between transposase subunits. A tetrameric form of the transposase was detected in solution, consistent with an interaction between the IR/DR structure and a transposase tetramer. We propose a model in which the transpositional enhancer and the PAI subdomain stabilize complexes formed by a transposase tetramer bound at the IR/DR. These interactions may result in enhanced stability of synaptic complexes, which might explain the efficient transposition of Sleeping Beauty in vertebrate cells.
Within-genome evolution of REPINs: a new family of miniature mobile DNA in bacteria.
Bertels, Frederic; Rainey, Paul B
2011-06-01
Repetitive sequences are a conserved feature of many bacterial genomes. While first reported almost thirty years ago, and frequently exploited for genotyping purposes, little is known about their origin, maintenance, or processes affecting the dynamics of within-genome evolution. Here, beginning with analysis of the diversity and abundance of short oligonucleotide sequences in the genome of Pseudomonas fluorescens SBW25, we show that over-represented short sequences define three distinct groups (GI, GII, and GIII) of repetitive extragenic palindromic (REP) sequences. Patterns of REP distribution suggest that closely linked REP sequences form a functional replicative unit: REP doublets are over-represented, randomly distributed in extragenic space, and more highly conserved than singlets. In addition, doublets are organized as inverted repeats, which together with intervening spacer sequences are predicted to form hairpin structures in ssDNA or mRNA. We refer to these newly defined entities as REPINs (REP doublets forming hairpins) and identify short reads from population sequencing that reveal putative transposition intermediates. The proximal relationship between GI, GII, and GIII REPINs and specific REP-associated tyrosine transposases (RAYTs), combined with features of the putative transposition intermediate, suggests a mechanism for within-genome dissemination. Analysis of the distribution of REPs in a range of RAYT-containing bacterial genomes, including Escherichia coli K-12 and Nostoc punctiforme, show that REPINs are a widely distributed, but hitherto unrecognized, family of miniature non-autonomous mobile DNA.
Stabilization of perfect and imperfect tandem repeats by single-strand DNA exonucleases
Feschenko, Vladimir V.; Rajman, Luis A.; Lovett, Susan T.
2003-01-01
Rearrangements between tandemly repeated DNA sequences are a common source of genetic instability. Such rearrangements underlie several human genetic diseases. In many organisms, the mismatch-repair (MMR) system functions to stabilize repeats when the repeat unit is short or when sequence imperfections are present between the repeats. We show here that the action of single-stranded DNA (ssDNA) exonucleases plays an additional, important role in stabilizing tandem repeats, independent of their role in MMR. For perfect repeats of ≈100 bp in Escherichia coli that are not susceptible to MMR, exonuclease (Exo)-I, ExoX, and RecJ exonuclease redundantly inhibit deletion. Our data suggest that >90% of potential deletion events are avoided by the combined action of these three exonucleases. Imperfect tandem repeats, less prone to rearrangements, are stabilized by both the MMR-pathway and ssDNA-specific exonucleases. For 100-bp repeats containing four mispairs, ExoI alone aborts most deletion events, even in the presence of a functional MMR system. By genetic analysis, we show that the inhibitory effect of ssDNA exonucleases on deletion formation is independent of the MutS and UvrD proteins. Exonuclease degradation of DNA displaced during the deletion process may abort slipped misalignment. Exonuclease action is therefore a significant force in genetic stabilization of many forms of repetitive DNA. PMID:12538867
Lee, Kyubin; Kolb, Aaron W.; Sverchkov, Yuriy; Cuellar, Jacqueline A.; Craven, Mark
2015-01-01
ABSTRACT Herpes simplex virus 1 (HSV-1) causes recurrent mucocutaneous ulcers and is the leading cause of infectious blindness and sporadic encephalitis in the United States. HSV-1 has been shown to be highly recombinogenic; however, to date, there has been no genome-wide analysis of recombination. To address this, we generated 40 HSV-1 recombinants derived from two parental strains, OD4 and CJ994. The 40 OD4-CJ994 HSV-1 recombinants were sequenced using the Illumina sequencing system, and recombination breakpoints were determined for each of the recombinants using the Bootscan program. Breakpoints occurring in the terminal inverted repeats were excluded from analysis to prevent double counting, resulting in a total of 272 breakpoints in the data set. By placing windows around the 272 breakpoints followed by Monte Carlo analysis comparing actual data to simulated data, we identified a recombination bias toward both high GC content and intergenic regions. A Monte Carlo analysis also suggested that recombination did not appear to be responsible for the generation of the spontaneous nucleotide mutations detected following sequencing. Additionally, kernel density estimation analysis across the genome found that the large, inverted repeats comprise a recombination hot spot. IMPORTANCE Herpes simplex virus 1 (HSV-1) virus is the leading cause of sporadic encephalitis and blinding keratitis in developed countries. HSV-1 has been shown to be highly recombinogenic, and recombination itself appears to be a significant component of genome replication. To date, there has been no genome-wide analysis of recombination. Here we present the findings of the first genome-wide study of recombination performed by generating and sequencing 40 HSV-1 recombinants derived from the OD4 and CJ994 parental strains, followed by bioinformatics analysis. Recombination breakpoints were determined, yielding 272 breakpoints in the full data set. Kernel density analysis determined that the large inverted repeats constitute a recombination hot spot. Additionally, Monte Carlo analyses found biases toward high GC content and intergenic and repetitive regions. PMID:25926637
Garcia, J A; Harrich, D; Soultanakis, E; Wu, F; Mitsuyasu, R; Gaynor, R B
1989-01-01
The human immunodeficiency virus (HIV) type 1 LTR is regulated at the transcriptional level by both cellular and viral proteins. Using HeLa cell extracts, multiple regions of the HIV LTR were found to serve as binding sites for cellular proteins. An untranslated region binding protein UBP-1 has been purified and fractions containing this protein bind to both the TAR and TATA regions. To investigate the role of cellular proteins binding to both the TATA and TAR regions and their potential interaction with other HIV DNA binding proteins, oligonucleotide-directed mutagenesis of both these regions was performed followed by DNase I footprinting and transient expression assays. In the TATA region, two direct repeats TC/AAGC/AT/AGCTGC surround the TATA sequence. Mutagenesis of both of these direct repeats or of the TATA sequence interrupted binding over the TATA region on the coding strand, but only a mutation of the TATA sequence affected in vivo assays for tat-activation. In addition to TAR serving as the site of binding of cellular proteins, RNA transcribed from TAR is capable of forming a stable stem-loop structure. To determine the relative importance of DNA binding proteins as compared to secondary structure, oligonucleotide-directed mutations in the TAR region were studied. Local mutations that disrupted either the stem or loop structure were defective in gene expression. However, compensatory mutations which restored base pairing in the stem resulted in complete tat-activation. This indicated a significant role for the stem-loop structure in HIV gene expression. To determine the role of TAR binding proteins, mutations were constructed which extensively changed the primary structure of the TAR region, yet left stem base pairing, stem energy and the loop sequence intact. These mutations resulted in decreased protein binding to TAR DNA and defects in tat-activation, and revealed factor binding specifically to the loop DNA sequence. Further mutagenesis which inverted this stem and loop mutation relative to the HIV LTR mRNA start site resulted in even larger decreases in tat-activation. This suggests that multiple determinants, including protein binding, the loop sequence, and RNA or DNA secondary structure, are important in tat-activation and suggests that tat may interact with cellular proteins binding to DNA to increase HIV gene expression. Images PMID:2721501
The Crystal Structure of TAL Effector PthXo1 Bound to Its DNA Target
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mak, Amanda Nga-Sze; Bradley, Philip; Cernadas, Raul A.
2012-02-10
DNA recognition by TAL effectors is mediated by tandem repeats, each 33 to 35 residues in length, that specify nucleotides via unique repeat-variable diresidues (RVDs). The crystal structure of PthXo1 bound to its DNA target was determined by high-throughput computational structure prediction and validated by heavy-atom derivatization. Each repeat forms a left-handed, two-helix bundle that presents an RVD-containing loop to the DNA. The repeats self-associate to form a right-handed superhelix wrapped around the DNA major groove. The first RVD residue forms a stabilizing contact with the protein backbone, while the second makes a base-specific contact to the DNA sense strand.more » Two degenerate amino-terminal repeats also interact with the DNA. Containing several RVDs and noncanonical associations, the structure illustrates the basis of TAL effector-DNA recognition.« less
Telomere and ribosomal DNA repeats are chromosomal targets of the bloom syndrome DNA helicase
Schawalder, James; Paric, Enesa; Neff, Norma F
2003-01-01
Background Bloom syndrome is one of the most cancer-predisposing disorders and is characterized by genomic instability and a high frequency of sister chromatid exchange. The disorder is caused by loss of function of a 3' to 5' RecQ DNA helicase, BLM. The exact role of BLM in maintaining genomic integrity is not known but the helicase has been found to associate with several DNA repair complexes and some DNA replication foci. Results Chromatin immunoprecipitation of BLM complexes recovered telomere and ribosomal DNA repeats. The N-terminus of BLM, required for NB localization, is the same as the telomere association domain of BLM. The C-terminus is required for ribosomal DNA localization. BLM localizes primarily to the non-transcribed spacer region of the ribosomal DNA repeat where replication forks initiate. Bloom syndrome cells expressing the deletion alleles lacking the ribosomal DNA and telomere association domains have altered cell cycle populations with increased S or G2/M cells relative to normal. Conclusion These results identify telomere and ribosomal DNA repeated sequence elements as chromosomal targets for the BLM DNA helicase during the S/G2 phase of the cell cycle. BLM is localized in nuclear bodies when it associates with telomeric repeats in both telomerase positive and negative cells. The BLM DNA helicase participates in genomic stability at ribosomal DNA repeats and telomeres. PMID:14577841
The genome and transcriptome of perennial ryegrass mitochondria
2013-01-01
Background Perennial ryegrass (Lolium perenne L.) is one of the most important forage and turf grass species of temperate regions worldwide. Its mitochondrial genome is inherited maternally and contains genes that can influence traits of agricultural importance. Moreover, the DNA sequence of mitochondrial genomes has been established and compared for a large number of species in order to characterize evolutionary relationships. Therefore, it is crucial to understand the organization of the mitochondrial genome and how it varies between and within species. Here, we report the first de novo assembly and annotation of the complete mitochondrial genome from perennial ryegrass. Results Intact mitochondria from perennial ryegrass leaves were isolated and used for mtDNA extraction. The mitochondrial genome was sequenced to a 167-fold coverage using the Roche 454 GS-FLX Titanium platform, and assembled into a circular master molecule of 678,580 bp. A total of 34 proteins, 14 tRNAs and 3 rRNAs are encoded by the mitochondrial genome, giving a total gene space of 48,723 bp (7.2%). Moreover, we identified 149 open reading frames larger than 300 bp and covering 67,410 bp (9.93%), 250 SSRs, 29 tandem repeats, 5 pairs of large repeats, and 96 pairs of short inverted repeats. The genes encoding subunits of the respiratory complexes – nad1 to nad9, cob, cox1 to cox3 and atp1 to atp9 – all showed high expression levels both in absolute numbers and after normalization. Conclusions The circular master molecule of the mitochondrial genome from perennial ryegrass presented here constitutes an important tool for future attempts to compare mitochondrial genomes within and between grass species. Our results also demonstrate that mitochondria of perennial ryegrass contain genes crucial for energy production that are well conserved in the mitochondrial genome of monocotyledonous species. The expression analysis gave us first insights into the transcriptome of these mitochondrial genes in perennial ryegrass. PMID:23521852
Sundararajan, Rangapriya; Freudenreich, Catherine H.
2011-01-01
Repetitive DNA elements are mutational hotspots in the genome, and their instability is linked to various neurological disorders and cancers. Although it is known that expanded trinucleotide repeats can interfere with DNA replication and repair, the cellular response to these events has not been characterized. Here, we demonstrate that an expanded CAG/CTG repeat elicits a DNA damage checkpoint response in budding yeast. Using microcolony and single cell pedigree analysis, we found that cells carrying an expanded CAG repeat frequently experience protracted cell division cycles, persistent arrests, and morphological abnormalities. These phenotypes were further exacerbated by mutations in DSB repair pathways, including homologous recombination and end joining, implicating a DNA damage response. Cell cycle analysis confirmed repeat-dependent S phase delays and G2/M arrests. Furthermore, we demonstrate that the above phenotypes are due to the activation of the DNA damage checkpoint, since expanded CAG repeats induced the phosphorylation of the Rad53 checkpoint kinase in a rad52Δ recombination deficient mutant. Interestingly, cells mutated for the MRX complex (Mre11-Rad50-Xrs2), a central component of DSB repair which is required to repair breaks at CAG repeats, failed to elicit repeat-specific arrests, morphological defects, or Rad53 phosphorylation. We therefore conclude that damage at expanded CAG/CTG repeats is likely sensed by the MRX complex, leading to a checkpoint response. Finally, we show that repeat expansions preferentially occur in cells experiencing growth delays. Activation of DNA damage checkpoints in repeat-containing cells could contribute to the tissue degeneration observed in trinucleotide repeat expansion diseases. PMID:21437275
Bergquist, Helen; Rocha, Cristina S. J.; Álvarez-Asencio, Rubén; Nguyen, Chi-Hung; Rutland, Mark. W.; Smith, C. I. Edvard; Good, Liam; Nielsen, Peter E.; Zain, Rula
2016-01-01
Expansion of (GAA)n repeats in the first intron of the Frataxin gene is associated with reduced mRNA and protein levels and the development of Friedreich’s ataxia. (GAA)n expansions form non-canonical structures, including intramolecular triplex (H-DNA), and R-loops and are associated with epigenetic modifications. With the aim of interfering with higher order H-DNA (like) DNA structures within pathological (GAA)n expansions, we examined sequence-specific interaction of peptide nucleic acid (PNA) with (GAA)n repeats of different lengths (short: n=9, medium: n=75 or long: n=115) by chemical probing of triple helical and single stranded regions. We found that a triplex structure (H-DNA) forms at GAA repeats of different lengths; however, single stranded regions were not detected within the medium size pathological repeat, suggesting the presence of a more complex structure. Furthermore, (GAA)4-PNA binding of the repeat abolished all detectable triplex DNA structures, whereas (CTT)5-PNA did not. We present evidence that (GAA)4-PNA can invade the DNA at the repeat region by binding the DNA CTT strand, thereby preventing non-canonical-DNA formation, and that triplex invasion complexes by (CTT)5-PNA form at the GAA repeats. Locked nucleic acid (LNA) oligonucleotides also inhibited triplex formation at GAA repeat expansions, and atomic force microscopy analysis showed significant relaxation of plasmid morphology in the presence of GAA-LNA. Thus, by inhibiting disease related higher order DNA structures in the Frataxin gene, such PNA and LNA oligomers may have potential for discovery of drugs aiming at recovering Frataxin expression. PMID:27846236
Molecular Dynamics Simulations of DNA-Free and DNA-Bound TAL Effectors
Wan, Hua; Hu, Jian-ping; Li, Kang-shun; Tian, Xu-hong; Chang, Shan
2013-01-01
TAL (transcriptional activator-like) effectors (TALEs) are DNA-binding proteins, containing a modular central domain that recognizes specific DNA sequences. Recently, the crystallographic studies of TALEs revealed the structure of DNA-recognition domain. In this article, molecular dynamics (MD) simulations are employed to study two crystal structures of an 11.5-repeat TALE, in the presence and absence of DNA, respectively. The simulated results indicate that the specific binding of RVDs (repeat-variable diresidues) with DNA leads to the markedly reduced fluctuations of tandem repeats, especially at the two ends. In the DNA-bound TALE system, the base-specific interaction is formed mainly by the residue at position 13 within a TAL repeat. Tandem repeats with weak RVDs are unfavorable for the TALE-DNA binding. These observations are consistent with experimental studies. By using principal component analysis (PCA), the dominant motions are open-close movements between the two ends of the superhelical structure in both DNA-free and DNA-bound TALE systems. The open-close movements are found to be critical for the recognition and binding of TALE-DNA based on the analysis of free energy landscape (FEL). The conformational analysis of DNA indicates that the 5′ end of DNA target sequence has more remarkable structural deformability than the other sites. Meanwhile, the conformational change of DNA is likely associated with the specific interaction of TALE-DNA. We further suggest that the arrangement of N-terminal repeats with strong RVDs may help in the design of efficient TALEs. This study provides some new insights into the understanding of the TALE-DNA recognition mechanism. PMID:24130757
Chillón, Isabel; Pyle, Anna M.
2016-01-01
LincRNA-p21 is a long intergenic non-coding RNA (lincRNA) involved in the p53-mediated stress response. We sequenced the human lincRNA-p21 (hLincRNA-p21) and found that it has a single exon that includes inverted repeat Alu elements (IRAlus). Sense and antisense Alu elements fold independently of one another into a secondary structure that is conserved in lincRNA-p21 among primates. Moreover, the structures formed by IRAlus are involved in the localization of hLincRNA-p21 in the nucleus, where hLincRNA-p21 colocalizes with paraspeckles. Our results underscore the importance of IRAlus structures for the function of hLincRNA-p21 during the stress response. PMID:27378782
Meyer, C; Pouteau, S; Rouzé, P; Caboche, M
1994-01-01
By Northern blot analysis of nitrate reductase-deficient mutants of Nicotiana plumbaginifolia, we identified a mutant (mutant D65), obtained after gamma-ray irradiation of protoplasts, which contained an insertion sequence in the nitrate reductase (NR) mRNA. This insertion sequence was localized by polymerase chain reaction (PCR) in the first exon of NR and was also shown to be present in the NR gene. The mutant gene contained a 565 bp insertion sequence that exhibits the sequence characteristics of a transposable element, which was thus named dTnp1. The dTnp1 element has 14 bp terminal inverted repeats and is flanked by an 8-bp target site duplication generated upon transposition. These inverted repeats have significant sequence homology with those of other transposable elements. Judging by its size and the absence of a long open reading frame, dTnp1 appears to represent a defective, although mobile, transposable element. The octamer motif TTTAGGCC was found several times in direct orientation near the 5' and 3' ends of dTnp1 together with a perfect palindrome located after the 5' inverted repeat. Southern blot analysis using an internal probe of dTnp1 suggested that this element occurs as a single copy in the genome of N. plumbaginifolia. It is also present in N. tabacum, but absent in tomato or petunia. The dTnp1 element is therefore of potential use for gene tagging in Nicotiana species.
Molecular and bioinformatic analysis of the FB-NOF transposable element.
Badal, Martí; Portela, Anna; Xamena, Noel; Cabré, Oriol
2006-04-12
The Drosophila melanogaster transposable element FB-NOF is known to play a role in genome plasticity through the generation of all sort of genomic rearrangements. Moreover, several insertional mutants due to FB mobilizations have been reported. Its structure and sequence, however, have been poorly studied mainly as a consequence of the long, complex and repetitive sequence of FB inverted repeats. This repetitive region is composed of several 154 bp blocks, each with five almost identical repeats. In this paper, we report the sequencing process of 2 kb long FB inverted repeats of a complete FB-NOF element, with high precision and reliability. This achievement has been possible using a new map of the FB repetitive region, which identifies unambiguously each repeat with new features that can be used as landmarks. With this new vision of the element, a list of FB-NOF in the D. melanogaster genomic clones has been done, improving previous works that used only bioinformatic algorithms. The availability of many FB and FB-NOF sequences allowed an analysis of the FB insertion sequences that showed no sequence specificity, but a preference for A/T rich sequences. The position of NOF into FB is also studied, revealing that it is always located after a second repeat in a random block. With the results of this analysis, we propose a model of transposition in which NOF jumps from FB to FB, using an unidentified transposase enzyme that should specifically recognize the second repeat end of the FB blocks.
Complete Mitochondrial Genome of the Medicinal Mushroom Ganoderma lucidum
Chen, Haimei; Chen, Xiangdong; Lan, Jin; Liu, Chang
2013-01-01
Ganoderma lucidum is one of the well-known medicinal basidiomycetes worldwide. The mitochondrion, referred to as the second genome, is an organelle found in most eukaryotic cells and participates in critical cellular functions. Elucidating the structure and function of this genome is important to understand completely the genetic contents of G. lucidum. In this study, we assembled the mitochondrial genome of G. lucidum and analyzed the differential expressions of its encoded genes across three developmental stages. The mitochondrial genome is a typical circular DNA molecule of 60,630 bp with a GC content of 26.67%. Genome annotation identified genes that encode 15 conserved proteins, 27 tRNAs, small and large rRNAs, four homing endonucleases, and two hypothetical proteins. Except for genes encoding trnW and two hypothetical proteins, all genes were located on the positive strand. For the repeat structure analysis, eight forward, two inverted, and three tandem repeats were detected. A pair of fragments with a total length around 5.5 kb was found in both the nuclear and mitochondrial genomes, which suggests the possible transfer of DNA sequences between two genomes. RNA-Seq data for samples derived from three stages, namely, mycelia, primordia, and fruiting bodies, were mapped to the mitochondrial genome and qualified. The protein-coding genes were expressed higher in mycelia or primordial stages compared with those in the fruiting bodies. The rRNA abundances were significantly higher in all three stages. Two regions were transcribed but did not contain any identified protein or tRNA genes. Furthermore, three RNA-editing sites were detected. Genome synteny analysis showed that significant genome rearrangements occurred in the mitochondrial genomes. This study provides valuable information on the gene contents of the mitochondrial genome and their differential expressions at various developmental stages of G. lucidum. The results contribute to the understanding of the functions and evolution of fungal mitochondrial DNA. PMID:23991034
Turmel, Monique; Otis, Christian; Lemieux, Claude
1999-01-01
Green plants seem to form two sister lineages: Chlorophyta, comprising the green algal classes Prasinophyceae, Ulvophyceae, Trebouxiophyceae, and Chlorophyceae, and Streptophyta, comprising the Charophyceae and land plants. We have determined the complete chloroplast DNA (cpDNA) sequence (200,799 bp) of Nephroselmis olivacea, a member of the class (Prasinophyceae) thought to include descendants of the earliest-diverging green algae. The 127 genes identified in this genome represent the largest gene repertoire among the green algal and land plant cpDNAs completely sequenced to date. Of the Nephroselmis genes, 2 (ycf81 and ftsI, a gene involved in peptidoglycan synthesis) have not been identified in any previously investigated cpDNA; 5 genes [ftsW, rnE, ycf62, rnpB, and trnS(cga)] have been found only in cpDNAs of nongreen algae; and 10 others (ndh genes) have been described only in land plant cpDNAs. Nephroselmis and land plant cpDNAs share the same quadripartite structure—which is characterized by the presence of a large rRNA-encoding inverted repeat and two unequal single-copy regions—and very similar sets of genes in corresponding genomic regions. Given that our phylogenetic analyses place Nephroselmis within the Chlorophyta, these structural characteristics were most likely present in the cpDNA of the common ancestor of chlorophytes and streptophytes. Comparative analyses of chloroplast genomes indicate that the typical quadripartite architecture and gene-partitioning pattern of land plant cpDNAs are ancient features that may have been derived from the genome of the cyanobacterial progenitor of chloroplasts. Our phylogenetic data also offer insight into the chlorophyte ancestor of euglenophyte chloroplasts. PMID:10468594
Sawada, Koichi; Kokeguchi, Susumu; Hongyo, Hiroshi; Sawada, Satoko; Miyamoto, Manabu; Maeda, Hiroshi; Nishimura, Fusanori; Takashiba, Shogo; Murayama, Yoji
1999-01-01
Subtractive hybridization was employed to isolate specific genes from virulent Porphyromonas gingivalis strains that are possibly related to abscess formation. The genomic DNA from the virulent strain P. gingivalis W83 was subtracted with DNA from the avirulent strain ATCC 33277. Three clones unique to strain W83 were isolated and sequenced. The cloned DNA fragments were 885, 369, and 132 bp and had slight homology with only Bacillus stearothermophilus IS5377, which is a putative transposase. The regions flanking the cloned DNA fragments were isolated and sequenced, and the gene structure around the clones was revealed. These three clones were located side-by-side in a gene reported as an outer membrane protein. The three clones interrupt the open reading frame of the outer membrane protein gene. This inserted DNA, consisting of three isolated clones, was designated IS1598, which was 1,396 bp (i.e., a 1,158-bp open reading frame) in length and was flanked by 16-bp terminal inverted repeats and a 9-bp duplicated target sequence. IS1598 was detected in P. gingivalis W83, W50, and FDC 381 by Southern hybridization. All three P. gingivalis strains have been shown to possess abscess-forming ability in animal models. However, IS1598 was not detected in avirulent strains of P. gingivalis, including ATCC 33277. The IS1598 may interrupt the synthesis of the outer membrane protein, resulting in changes in the structure of the bacterial outer membrane. The IS1598 isolated in this study is a novel insertion element which might be a specific marker for virulent P. gingivalis strains. PMID:10531208
Etiological role of human papillomavirus infection for inverted papilloma of the bladder.
Shigehara, Kazuyoshi; Sasagawa, Toshiyuki; Doorbar, John; Kawaguchi, Shohei; Kobori, Yoshitomo; Nakashima, Takao; Shimamura, Masayoshi; Maeda, Yuji; Miyagi, Tohru; Kitagawa, Yasuhide; Kadono, Yoshifumi; Konaka, Hiroyuki; Mizokami, Atsushi; Koh, Eitetsu; Namiki, Mikio
2011-02-01
The status of human papillomavirus (HPV) infection in urothelial inverted papilloma was examined in the present study. Formalin-fixed and paraffin-embedded tissues from eight cases of inverted papilloma of the bladder were studied. The presence of HPV-DNA was examined by modified GP5/6+PCR using archival tissue sections by microdissection. HPV genotype was determined with a Hybri-Max HPV genotyping kit. Immunohistochemical analysis for p16-INK4a, mcm7, HPV-E4, and L1, and in situ hybridization for the HPV genome were performed. HPV was detected in seven of eight cases (87.5%) of inverted papilloma. Three cases were diagnosed as inverted papilloma with atypia, while the remaining five were typical cases. HPV-18 was detected in two cases, including one inverted papilloma with atypia, and HPV-16 was detected in four cases, including one inverted papilloma with atypia. Multiple HPV type infection was detected in one typical case and one atypical case. High-risk HPV was present in all HPV-positive cases. Cellular proteins, p16-INK4a and mcm7, which are surrogate markers for HPV-E7 expression, were detected in all HPV-positive cases, and their levels were higher in inverted papilloma with atypia than in typical cases. In contrast, HPV-E4 and L1, which are markers for HPV propagation, were observed in some parts of the typical inverted papilloma tissue. High-risk HPV infection may be one of the causes of urothelial inverted papilloma, and inverted papilloma with atypia may have malignant potential. 2010 Wiley-Liss, Inc.
Dinsmore, P K; Klaenhammer, T R
1997-05-01
A spontaneous mutant of the lactococcal phage phi31 that is insensitive to the phage defense mechanism AbiA was characterized in an effort to identify the phage factor(s) involved in sensitivity of phi31 to AbiA. A point mutation was localized in the genome of the AbiA-insensitive phage (phi31A) by heteroduplex analysis of a 9-kb region. The mutation (G to T) was within a 738-bp open reading frame (ORF245) and resulted in an arginine-to-leucine change in the predicted amino acid sequence of the protein. The mutant phi31A-ORF245 reduced the sensitivity of phi31 to AbiA when present in trans, indicating that the mutation in ORF245 is responsible for the AbiA insensitivity of phi31A. Transcription of ORF245 occurs early in the phage infection cycles of phi31 and phi31A and is unaffected by AbiA. Expansion of the phi31 sequence revealed ORF169 (immediately upstream of ORF245) and ORF71 (which ends 84 bp upstream of ORF169). Two inverted repeats lie within the 84-bp region between ORF71 and ORF169. Sequence analysis of an independently isolated AbiA-insensitive phage, phi31B, identified a mutation (G to A) in one of the inverted repeats. A 118-bp fragment from phi31, encompassing the 84-bp region between ORF71 and ORF169, eliminates AbiA activity against phi31 when present in trans, establishing a relationship between AbiA and this fragment. The study of this region of phage phi31 has identified an open reading frame (ORF245) and a 118-bp DNA fragment that interact with AbiA and are likely to be involved in the sensitivity of this phage to AbiA.
Expansion of inverted repeat does not decrease substitution rates in Pelargonium plastid genomes.
Weng, Mao-Lun; Ruhlman, Tracey A; Jansen, Robert K
2017-04-01
For species with minor inverted repeat (IR) boundary changes in the plastid genome (plastome), nucleotide substitution rates were previously shown to be lower in the IR than the single copy regions (SC). However, the impact of large-scale IR expansion/contraction on plastid nucleotide substitution rates among closely related species remains unclear. We included plastomes from 22 Pelargonium species, including eight newly sequenced genomes, and used both pairwise and model-based comparisons to investigate the impact of the IR on sequence evolution in plastids. Ten types of plastome organization with different inversions or IR boundary changes were identified in Pelargonium. Inclusion in the IR was not sufficient to explain the variation of nucleotide substitution rates. Instead, the rate heterogeneity in Pelargonium plastomes was a mixture of locus-specific, lineage-specific and IR-dependent effects. Our study of Pelargonium plastomes that vary in IR length and gene content demonstrates that the evolutionary consequences of retaining these repeats are more complicated than previously suggested. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Wang, Jianye; Huang, Yu; Zhou, Mingxu; Zhu, Guoqiang
2016-09-01
Genomic information about Muscovy duck parvovirus is still limited. In this study, the genome of the pathogenic MDPV strain YY was sequenced. The full-length genome of YY is 5075 nucleotides (nt) long, 57 nt shorter than that of strain FM. Sequence alignment indicates that the 5' and 3' inverted terminal repeats (ITR) of strain YY contain a 14-nucleotide-pair deletion in the stem of the palindromic hairpin structure in comparison to strain FM and FZ91-30. The deleted region contains one "E-box" site and one repeated motif with the sequence "TTCCGGT" or "ACCGGAA". Phylogenetic trees constructed based the protein coding genes concordantly showed that YY, together with nine other MDPV isolates from various places, clustered in a separate branch, distinct from the branch formed by goose parvovirus (GPV) strains. These results demonstrate that, despite the distinctive deletion, the YY strain still belongs to the classical MDPV group. Moreover, the deletion of ITR may contribute to the genome evolution of MDPV under immunization pressure.
Beaudet, Denis; Terrat, Yves; Halary, Sébastien; de la Providencia, Ivan Enrique; Hijri, Mohamed
2013-01-01
Comparative mitochondrial genomics of arbuscular mycorrhizal fungi (AMF) provide new avenues to overcome long-lasting obstacles that have hampered studies aimed at understanding the community structure, diversity, and evolution of these multinucleated and genetically polymorphic organisms.AMF mitochondrial (mt) genomes are homogeneous within isolates, and their intergenic regions harbor numerous mobile elements that have rapidly diverged, including homing endonuclease genes, small inverted repeats, and plasmid-related DNA polymerase genes (dpo), making them suitable targets for the development of reliable strain-specific markers. However, these elements may also lead to genome rearrangements through homologous recombination, although this has never previously been reported in this group of obligate symbiotic fungi. To investigate whether such rearrangements are present and caused by mobile elements in AMF, the mitochondrial genomes from two Glomeraceae members (i.e., Glomus cerebriforme and Glomus sp.) with substantial mtDNA synteny divergence,were sequenced and compared with available glomeromycotan mitochondrial genomes. We used an extensive nucleotide/protein similarity network-based approach to investigated podiversity in AMF as well as in other organisms for which sequences are publicly available. We provide strong evidence of dpo-induced inter-haplotype recombination, leading to a reshuffled mitochondrial genome in Glomus sp. These findings raise questions as to whether AMF single spore cultivations artificially underestimate mtDNA genetic diversity.We assessed potential dpo dispersal mechanisms in AMF and inferred a robust phylogenetic relationship with plant mitochondrial plasmids. Along with other indirect evidence, our analyses indicate that members of the Glomeromycota phylum are potential donors of mitochondrial plasmids to plants.
Beaudet, Denis; Terrat, Yves; Halary, Sébastien; de la Providencia, Ivan Enrique; Hijri, Mohamed
2013-01-01
Comparative mitochondrial genomics of arbuscular mycorrhizal fungi (AMF) provide new avenues to overcome long-lasting obstacles that have hampered studies aimed at understanding the community structure, diversity, and evolution of these multinucleated and genetically polymorphic organisms. AMF mitochondrial (mt) genomes are homogeneous within isolates, and their intergenic regions harbor numerous mobile elements that have rapidly diverged, including homing endonuclease genes, small inverted repeats, and plasmid-related DNA polymerase genes (dpo), making them suitable targets for the development of reliable strain-specific markers. However, these elements may also lead to genome rearrangements through homologous recombination, although this has never previously been reported in this group of obligate symbiotic fungi. To investigate whether such rearrangements are present and caused by mobile elements in AMF, the mitochondrial genomes from two Glomeraceae members (i.e., Glomus cerebriforme and Glomus sp.) with substantial mtDNA synteny divergence, were sequenced and compared with available glomeromycotan mitochondrial genomes. We used an extensive nucleotide/protein similarity network-based approach to investigate dpo diversity in AMF as well as in other organisms for which sequences are publicly available. We provide strong evidence of dpo-induced inter-haplotype recombination, leading to a reshuffled mitochondrial genome in Glomus sp. These findings raise questions as to whether AMF single spore cultivations artificially underestimate mtDNA genetic diversity. We assessed potential dpo dispersal mechanisms in AMF and inferred a robust phylogenetic relationship with plant mitochondrial plasmids. Along with other indirect evidence, our analyses indicate that members of the Glomeromycota phylum are potential donors of mitochondrial plasmids to plants. PMID:23925788
Rabah, Samar O; Lee, Chaehee; Hajrah, Nahid H; Makki, Rania M; Alharby, Hesham F; Alhebshi, Alawiah M; Sabir, Jamal S M; Jansen, Robert K; Ruhlman, Tracey A
2017-11-01
In plant evolution, intracellular gene transfer (IGT) is a prevalent, ongoing process. While nuclear and mitochondrial genomes are known to integrate foreign DNA via IGT and horizontal gene transfer (HGT), plastid genomes (plastomes) have resisted foreign DNA incorporation and only recently has IGT been uncovered in the plastomes of a few land plants. In this study, we completed plastome sequences for l0 crop species and describe a number of structural features including variation in gene and intron content, inversions, and expansion and contraction of the inverted repeat (IR). We identified a putative in cinnamon ( J. Presl) and other sequenced Lauraceae and an apparent functional transfer of to the nucleus of quinoa ( Willd.). In the orchard tree cashew ( L.), we report the insertion of an ∼6.7-kb fragment of mitochondrial DNA into the plastome IR. BLASTn analyses returned high identity hits to mitogenome sequences including an intact open reading frame. Using three plastome markers for five species of , we generated a phylogeny to investigate the distribution and timing of the insertion. Four species share the insertion, suggesting that this event occurred <20 million yr ago in a single clade in the genus. Our study extends the observation of mitochondrial to plastome IGT to include long-lived tree species. While previous studies have suggested possible mechanisms facilitating IGT to the plastome, more examples of this phenomenon, along with more complete mitogenome sequences, will be required before a common, or variable, mechanism can be elucidated. Copyright © 2017 Crop Science Society of America.
Hashikawa, Naoya; Yamamoto, Noritaka; Sakurai, Hiroshi
2007-04-06
The hydrophobic repeat is a conserved structural motif of eukaryotic heat shock transcription factor (HSF) that enables HSF to form a homotrimer. Homotrimeric HSF binds to heat shock elements (HSEs) consisting of three inverted repeats of the sequence nGAAn. Sequences consisting of four or more nGAAn units are bound cooperatively by two HSF trimers. We show that in Saccharomyces cerevisiae cells oligomerization-defective Hsf1 is not able to bind HSEs with three units and is not extensively phosphorylated in response to stress; it is therefore unable to activate genes containing this type of HSE. Several lines of evidence indicate that oligomerization is a prerequisite for stress-induced hyperphosphorylation of Hsf1. In contrast, oligomerization and hyperphosphorylation are not necessary for gene activation via HSEs with four units. Intragenic suppressor screening of oligomerization-defective hsf1 showed that an interface between adjacent DNA-binding domains is important for the binding of Hsf1 to the HSE. We suggest that Saccharomyces cerevisiae HSEs with different structures are regulated differently; HSEs with three units require Hsf1 to be both oligomerized and hyperphosphorylated, whereas HSEs with four or more units do not require either.
2015-01-01
Conformational polymorphism of DNA is a major causative factor behind several incurable trinucleotide repeat expansion disorders that arise from overexpansion of trinucleotide repeats located in coding/non-coding regions of specific genes. Hairpin DNA structures that are formed due to overexpansion of CAG repeat lead to Huntington’s disorder and spinocerebellar ataxias. Nonetheless, DNA hairpin stem structure that generally embraces B-form with canonical base pairs is poorly understood in the context of periodic noncanonical A…A mismatch as found in CAG repeat overexpansion. Molecular dynamics simulations on DNA hairpin stems containing A…A mismatches in a CAG repeat overexpansion show that A…A dictates local Z-form irrespective of starting glycosyl conformation, in sharp contrast to canonical DNA duplex. Transition from B-to-Z is due to the mechanistic effect that originates from its pronounced nonisostericity with flanking canonical base pairs facilitated by base extrusion, backbone and/or base flipping. Based on these structural insights we envisage that such an unusual DNA structure of the CAG hairpin stem may have a role in disease pathogenesis. As this is the first study that delineates the influence of a single A…A mismatch in reversing DNA helicity, it would further have an impact on understanding DNA mismatch repair. PMID:25876062
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats
de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas
2015-01-01
Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. PMID:26481363
Spectroscopic insights into quadruplexes of five-repeat telomere DNA sequences upon G-block damage.
Dvořáková, Zuzana; Vorlíčková, Michaela; Renčiuk, Daniel
2017-11-01
The DNA lesions, resulting from oxidative damage, were shown to destabilize human telomere four-repeat quadruplex and to alter its structure. Long telomere DNA, as a repetitive sequence, offers, however, other mechanisms of dealing with the lesion: extrusion of the damaged repeat into loop or shifting the quadruplex position by one repeat. Using circular dichroism and UV absorption spectroscopy and polyacrylamide electrophoresis, we studied consequences of lesions at different positions of the model five-repeat human telomere DNA sequences on the structure and stability of their quadruplexes in sodium and in potassium. The repeats affected by lesion are preferentially positioned as terminal overhangs of the core quadruplex structurally similar to the four-repeat one. Forced affecting of the inner repeats leads to presence of variety of more parallel folds in potassium. In sodium the designed models form mixture of two dominant antiparallel quadruplexes whose population varies with the position of the affected repeat. The shapes of quadruplex CD spectra, namely the height of dominant peaks, significantly correlate with melting temperatures. Lesion in one guanine tract of a more than four repeats long human telomere DNA sequence may cause re-positioning of its quadruplex arrangement associated with a shift of the structure to less common quadruplex conformations. The type of the quadruplex depends on the loop position and external conditions. The telomere DNA quadruplexes are quite resistant to the effect of point mutations due to the telomere DNA repetitive nature, although their structure and, consequently, function might be altered. Copyright © 2017. Published by Elsevier B.V.
DNA Replication Dynamics of the GGGGCC Repeat of the C9orf72 Gene.
Thys, Ryan Griffin; Wang, Yuh-Hwa
2015-11-27
DNA has the ability to form a variety of secondary structures in addition to the normal B-form DNA, including hairpins and quadruplexes. These structures are implicated in a number of neurological diseases and cancer. Expansion of a GGGGCC repeat located at C9orf72 is associated with familial amyotrophic lateral sclerosis and frontotemporal dementia. This repeat expands from two to 24 copies in normal individuals to several hundreds or thousands of repeats in individuals with the disease. Biochemical studies have demonstrated that as little as four repeats have the ability to form a stable DNA secondary structure known as a G-quadruplex. Quadruplex structures have the ability to disrupt normal DNA processes such as DNA replication and transcription. Here we examine the role of GGGGCC repeat length and orientation on DNA replication using an SV40 replication system in human cells. Replication through GGGGCC repeats leads to a decrease in overall replication efficiency and an increase in instability in a length-dependent manner. Both repeat expansions and contractions are observed, and replication orientation is found to influence the propensity for expansions or contractions. The presence of replication stress, such as low-dose aphidicolin, diminishes replication efficiency but has no effect on instability. Two-dimensional gel electrophoresis analysis demonstrates a replication stall with as few as 20 GGGGCC repeats. These results suggest that replication of the GGGGCC repeat at C9orf72 is perturbed by the presence of expanded repeats, which has the potential to result in further expansion, leading to disease. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Structure and molecular mechanism of a nucleobase-cation-symport-1 family transporter.
Weyand, Simone; Shimamura, Tatsuro; Yajima, Shunsuke; Suzuki, Shun'ichi; Mirza, Osman; Krusong, Kuakarun; Carpenter, Elisabeth P; Rutherford, Nicholas G; Hadden, Jonathan M; O'Reilly, John; Ma, Pikyee; Saidijam, Massoud; Patching, Simon G; Hope, Ryan J; Norbertczak, Halina T; Roach, Peter C J; Iwata, So; Henderson, Peter J F; Cameron, Alexander D
2008-10-31
The nucleobase-cation-symport-1 (NCS1) transporters are essential components of salvage pathways for nucleobases and related metabolites. Here, we report the 2.85-angstrom resolution structure of the NCS1 benzyl-hydantoin transporter, Mhp1, from Microbacterium liquefaciens. Mhp1 contains 12 transmembrane helices, 10 of which are arranged in two inverted repeats of five helices. The structures of the outward-facing open and substrate-bound occluded conformations were solved, showing how the outward-facing cavity closes upon binding of substrate. Comparisons with the leucine transporter LeuT(Aa) and the galactose transporter vSGLT reveal that the outward- and inward-facing cavities are symmetrically arranged on opposite sides of the membrane. The reciprocal opening and closing of these cavities is synchronized by the inverted repeat helices 3 and 8, providing the structural basis of the alternating access model for membrane transport.
Nezha, a novel active miniature inverted-repeat transposable element in cyanobacteria
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou Fengfeng; Tran Thao; Xu Ying
2008-01-25
Miniature inverted-repeat transposable elements (MITEs) were first identified in plants and exerted extensive proliferations throughout eukaryotic and archaeal genomes. But very few MITEs have been characterized in bacteria. We identified a novel MITE, called Nezha, in cyanobacteria Anabaena variabilis ATCC 29413 and Nostoc sp. PCC 7120. Nezha, like most previously known MITEs in other organisms, is small in size, non-coding, carrying TIR and DR signals, and of potential to form a stable RNA secondary structure, and it tends to insert into A+T-rich regions. Recent transpositions of Nezha were observed in A. variabilis ATCC 29413 and Nostoc sp. PCC 7120, respectively.more » Nezha might have proliferated recently with aid from the transposase encoded by ISNpu3-like elements. A possible horizontal transfer event of Nezha from cyanobacteria to Polaromonas JS666 is also observed.« less
Demura, Masashi; Takeda, Yoshiyu; Yoneda, Takashi; Furukawa, Kenji; Usukura, Mikiya; Itoh, Yuji; Mabuchi, Hiroshi
2002-01-01
Study of two families containing individuals with nephrogenic diabetes insipidus (NDI) indicated different types of 21.3 kb and 26.3 kb deletions involving the AVPR2 and ARHGAP4 (RhoGAP C1) genes. In the case of the 21.3 kb deletion, the deletion consensus motif (5'-TGAAGG-3') and polypurine runs, known as the arrest site of polymerase alpha, were detected in the vicinity of the deletion junction. Inverted repeats (7/8 matches), believed to potentiate DNA loop formation, flank the deletion breakpoint. We propose this deletion to be the result of slipped mispairing during DNA replication. In the case of the 26.3 kb deletion, the 12,945 bp inverted region with the 10,003 bp internal deletion was accompanied with the 2,509 bp deletion in the 5'-side and the 13,785 bp deletion in the 3'-side. We defined three deletion junctions in this rearrangement (DJ1, DJ2, and DJ3) from the 5'-side. The surrounding sequence of DJ1 (5'-CCC-3') closely resembled that of DJ3 (5'-AGGG-3') (DJ1; 5'-cCCCgaggg-3', DJ3; 5'-ccccAGGG-3'), and DJ1 was located in the 5'-side of DJ3 without any overlapping in sequence. The immunoglobulin class switch (ICS) motif (5'-TGGGG-3') was found around the complementary sequence of DJ3. There was a 10-base palindrome (5'-aGACAtgtct-3') in the alignment of the DJ2 (5'-GACA-3') region. From these findings, we propose a novel mutation process with the rearrangement probably resulting from stem-loop induced non-homologous recombination in an ICS-like fashion. Both patients, despite lacking ARHGAP4, had no morphological, clinical, or laboratory abnormalities except for those usually found in patients with NDI. Copyright 2001 Wiley-Liss, Inc.
Methods for sequencing GC-rich and CCT repeat DNA templates
Robinson, Donna L.
2007-02-20
The present invention is directed to a PCR-based method of cycle sequencing DNA and other polynucleotide sequences having high CG content and regions of high GC content, and includes for example DNA strands with a high Cytosine and/or Guanosine content and repeated motifs such as CCT repeats.
Genomic and genetic analyses of diversity and plant interactions of Pseudomonas fluorescens
Silby, Mark W; Cerdeño-Tárraga, Ana M; Vernikos, Georgios S; Giddens, Stephen R; Jackson, Robert W; Preston, Gail M; Zhang, Xue-Xian; Moon, Christina D; Gehrig, Stefanie M; Godfrey, Scott AC; Knight, Christopher G; Malone, Jacob G; Robinson, Zena; Spiers, Andrew J; Harris, Simon; Challis, Gregory L; Yaxley, Alice M; Harris, David; Seeger, Kathy; Murphy, Lee; Rutter, Simon; Squares, Rob; Quail, Michael A; Saunders, Elizabeth; Mavromatis, Konstantinos; Brettin, Thomas S; Bentley, Stephen D; Hothersall, Joanne; Stephens, Elton; Thomas, Christopher M; Parkhill, Julian; Levy, Stuart B; Rainey, Paul B; Thomson, Nicholas R
2009-01-01
Background Pseudomonas fluorescens are common soil bacteria that can improve plant health through nutrient cycling, pathogen antagonism and induction of plant defenses. The genome sequences of strains SBW25 and Pf0-1 were determined and compared to each other and with P. fluorescens Pf-5. A functional genomic in vivo expression technology (IVET) screen provided insight into genes used by P. fluorescens in its natural environment and an improved understanding of the ecological significance of diversity within this species. Results Comparisons of three P. fluorescens genomes (SBW25, Pf0-1, Pf-5) revealed considerable divergence: 61% of genes are shared, the majority located near the replication origin. Phylogenetic and average amino acid identity analyses showed a low overall relationship. A functional screen of SBW25 defined 125 plant-induced genes including a range of functions specific to the plant environment. Orthologues of 83 of these exist in Pf0-1 and Pf-5, with 73 shared by both strains. The P. fluorescens genomes carry numerous complex repetitive DNA sequences, some resembling Miniature Inverted-repeat Transposable Elements (MITEs). In SBW25, repeat density and distribution revealed 'repeat deserts' lacking repeats, covering approximately 40% of the genome. Conclusions P. fluorescens genomes are highly diverse. Strain-specific regions around the replication terminus suggest genome compartmentalization. The genomic heterogeneity among the three strains is reminiscent of a species complex rather than a single species. That 42% of plant-inducible genes were not shared by all strains reinforces this conclusion and shows that ecological success requires specialized and core functions. The diversity also indicates the significant size of genetic information within the Pseudomonas pan genome. PMID:19432983
Turmel, Monique; Otis, Christian; Lemieux, Claude
2015-07-01
Previous studies of trebouxiophycean chloroplast genomes revealed little information regarding the evolutionary dynamics of this genome because taxon sampling was too sparse and the relationships between the sampled taxa were unknown. We recently sequenced the chloroplast genomes of 27 trebouxiophycean and 2 pedinophycean green algae to resolve the relationships among the main lineages recognized for the Trebouxiophyceae. These taxa and the previously sampled members of the Pedinophyceae and Trebouxiophyceae are included in the comparative chloroplast genome analysis we report here. The 38 genomes examined display considerable variability at all levels, except gene content. Our results highlight the high propensity of the rDNA-containing large inverted repeat (IR) to vary in size, gene content and gene order as well as the repeated losses it experienced during trebouxiophycean evolution. Of the seven predicted IR losses, one event demarcates a superclade of 11 taxa representing 5 late-diverging lineages. IR expansions/contractions account not only for changes in gene content in this region but also for changes in gene order and gene duplications. Inversions also led to gene rearrangements within the IR, including the reversal or disruption of the rDNA operon in some lineages. Most of the 20 IR-less genomes are more rearranged compared with their IR-containing homologs and tend to show an accelerated rate of sequence evolution. In the IR-less superclade, several ancestral operons were disrupted, a few genes were fragmented, and a subgroup of taxa features a G+C-biased nucleotide composition. Our analyses also unveiled putative cases of gene acquisitions through horizontal transfer. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Tembrock, Luke R.; Zheng, Shaoyu; Wu, Zhiqiang
2018-01-01
Qat (Catha edulis, Celastraceae) is a woody evergreen species with great economic and cultural importance. It is cultivated for its stimulant alkaloids cathine and cathinone in East Africa and southwest Arabia. However, genome information, especially DNA sequence resources, for C. edulis are limited, hindering studies regarding interspecific and intraspecific relationships. Herein, the complete chloroplast (cp) genome of Catha edulis is reported. This genome is 157,960 bp in length with 37% GC content and is structurally arranged into two 26,577 bp inverted repeats and two single-copy areas. The size of the small single-copy and the large single-copy regions were 18,491 bp and 86,315 bp, respectively. The C. edulis cp genome consists of 129 coding genes including 37 transfer RNA (tRNA) genes, 8 ribosomal RNA (rRNA) genes, and 84 protein coding genes. For those genes, 112 are single copy genes and 17 genes are duplicated in two inverted regions with seven tRNAs, four rRNAs, and six protein coding genes. The phylogenetic relationships resolved from the cp genome of qat and 32 other species confirms the monophyly of Celastraceae. The cp genomes of C. edulis, Euonymus japonicus and seven Celastraceae species lack the rps16 intron, which indicates an intron loss took place among an ancestor of this family. The cp genome of C. edulis provides a highly valuable genetic resource for further phylogenomic research, barcoding and cp transformation in Celastraceae. PMID:29425128
RNAi triggered by symmetrically transcribed transgenes in Drosophila melanogaster.
Giordano, Ennio; Rendina, Rosaria; Peluso, Ivana; Furia, Maria
2002-01-01
Specific silencing of target genes can be induced in a variety of organisms by providing homologous double-stranded RNA molecules. In vivo, these molecules can be generated either by transcription of sequences having an inverted-repeat (IR) configuration or by simultaneous transcription of sense-antisense strands. Since IR constructs are difficult to prepare and can stimulate genomic rearrangements, we investigated the silencing potential of symmetrically transcribed sequences. We report that Drosophila transgenes whose sense-antisense transcription was driven by two convergent arrays of Gal4-dependent UAS sequences can induce specific, dominant, and heritable repression of target genes. This effect is not dependent on a mechanism based on homology-dependent DNA/DNA interactions, but is directly triggered by transcriptional activation and is accompanied by specific depletion of the endogenous target RNA. Tissue-specific induction of these transgenes restricts the target gene silencing to selected body domains, and spreading phenomena described in other cases of post-transcriptional gene silencing (PTGS) were not observed. In addition to providing an additional tool useful for Drosophila functional genomic analysis, these results add further strength to the view that events of sense-antisense transcription may readily account for some, if not all, PTGS-cosuppression phenomena and can potentially play a relevant role in gene regulation. PMID:11861567
Unusually Long Palindromes Are Abundant in Mitochondrial Control Regions of Insects and Nematodes
Arunkumar, K. P.; Nagaraju, Javaregowda
2006-01-01
Background Palindromes are known to be involved in a variety of biological processes. In the present investigation we carried out a comprehensive analysis of palindromes in the mitochondrial control regions (CRs) of several animal groups to study their frequency, distribution and architecture to gain insights into the origin of replication of mtDNA. Methodology/Principal Findings Many species of Arthropoda, Nematoda, Mollusca and Annelida harbor palindromes and inverted repeats (IRs) in their CRs. Lower animals like cnidarians and higher animal groups like chordates are almost devoid of palindromes and IRs. The study revealed that palindrome occurrence is positively correlated with the AT content of CRs, and that IRs are likely to give rise to longer palindromes. Conclusions/Significance The present study attempts to explain possible reasons and gives in silico evidence for absence of palindromes and IRs from CR of vertebrate mtDNA and acquisition and retention of the same in insects. Study of CRs of different animal phyla uncovered unique architecture of this locus, be it high abundance of long palindromes and IRs in CRs of Insecta and Nematoda, or short IRs of 10–20 nucleotides with a spacer region of 12–14 bases in subphylum Chelicerata, or nearly complete of absence of any long palindromes and IRs in Vertebrata, Cnidaria and Echinodermata. PMID:17205114
Tsukamoto, Mariko; Yamashita, Kentaro; Miyazaki, Toshiko; Shinohara, Miki; Shinohara, Akira
2003-01-01
In Saccharomyces cerevisiae, the Rad52 protein plays a role in both RAD51-dependent and RAD51-independent recombination pathways. We characterized a rad52 mutant, rad52-329, which lacks the C-terminal Rad51-interacting domain, and studied its role in RAD51-independent recombination. The rad52-329 mutant is completely defective in mating-type switching, but partially proficient in recombination between inverted repeats. We also analyzed the effect of the rad52-329 mutant on telomere recombination. Yeast cells lacking telomerase maintain telomere length by recombination. The rad52-329 mutant is deficient in RAD51-dependent telomere recombination, but is proficient in RAD51-independent telomere recombination. In addition, we examined the roles of other recombination genes in the telomere recombination. The RAD51-independent recombination in the rad52-329 mutant is promoted by a paralogue of Rad52, Rad59. All components of the Rad50-Mre11-Xrs2 complex are also important, but not essential, for RAD51-independent telomere recombination. Interestingly, RAD51 inhibits the RAD51-independent, RAD52-dependent telomere recombination. These findings indicate that Rad52 itself, and more precisely its N-terminal DNA-binding domain, promote an essential reaction in recombination in the absence of RAD51. PMID:14704160
Within-Genome Evolution of REPINs: a New Family of Miniature Mobile DNA in Bacteria
Bertels, Frederic; Rainey, Paul B.
2011-01-01
Repetitive sequences are a conserved feature of many bacterial genomes. While first reported almost thirty years ago, and frequently exploited for genotyping purposes, little is known about their origin, maintenance, or processes affecting the dynamics of within-genome evolution. Here, beginning with analysis of the diversity and abundance of short oligonucleotide sequences in the genome of Pseudomonas fluorescens SBW25, we show that over-represented short sequences define three distinct groups (GI, GII, and GIII) of repetitive extragenic palindromic (REP) sequences. Patterns of REP distribution suggest that closely linked REP sequences form a functional replicative unit: REP doublets are over-represented, randomly distributed in extragenic space, and more highly conserved than singlets. In addition, doublets are organized as inverted repeats, which together with intervening spacer sequences are predicted to form hairpin structures in ssDNA or mRNA. We refer to these newly defined entities as REPINs (REP doublets forming hairpins) and identify short reads from population sequencing that reveal putative transposition intermediates. The proximal relationship between GI, GII, and GIII REPINs and specific REP-associated tyrosine transposases (RAYTs), combined with features of the putative transposition intermediate, suggests a mechanism for within-genome dissemination. Analysis of the distribution of REPs in a range of RAYT–containing bacterial genomes, including Escherichia coli K-12 and Nostoc punctiforme, show that REPINs are a widely distributed, but hitherto unrecognized, family of miniature non-autonomous mobile DNA. PMID:21698139
Identification of Genetic Elements Associated with EPSPS Gene Amplification
Gaines, Todd A.; Wright, Alice A.; Molin, William T.; Lorentz, Lothar; Riggins, Chance W.; Tranel, Patrick J.; Beffa, Roland; Westra, Philip; Powles, Stephen B.
2013-01-01
Weed populations can have high genetic plasticity and rapid responses to environmental selection pressures. For example, 100-fold amplification of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene evolved in the weed species Amaranthus palmeri to confer resistance to glyphosate, the world’s most important herbicide. However, the gene amplification mechanism is unknown. We sequenced the EPSPS gene and genomic regions flanking EPSPS loci in A. palmeri, and searched for mobile genetic elements or repetitive sequences. The EPSPS gene was 10,229 bp, containing 8 exons and 7 introns. The gene amplification likely proceeded through a DNA-mediated mechanism, as introns exist in the amplified gene copies and the entire amplified sequence is at least 30 kb in length. Our data support the presence of two EPSPS loci in susceptible (S) A. palmeri, and that only one of these was amplified in glyphosate-resistant (R) A. palmeri. The EPSPS gene amplification event likely occurred recently, as no sequence polymorphisms were found within introns of amplified EPSPS copies from R individuals. Sequences with homology to miniature inverted-repeat transposable elements (MITEs) were identified next to EPSPS gene copies only in R individuals. Additionally, a putative Activator (Ac) transposase and a repetitive sequence region were associated with amplified EPSPS genes. The mechanism controlling this DNA-mediated amplification remains unknown. Further investigation is necessary to determine if the gene amplification may have proceeded via DNA transposon-mediated replication, and/or unequal recombination between different genomic regions resulting in replication of the EPSPS gene. PMID:23762434
A transposable element in a NAC gene is associated with drought tolerance in maize seedlings
Mao, Hude; Wang, Hongwei; Liu, Shengxue; Li, Zhigang; Yang, Xiaohong; Yan, Jianbing; Li, Jiansheng; Tran, Lam-Son Phan; Qin, Feng
2015-01-01
Drought represents a major constraint on maize production worldwide. Understanding the genetic basis for natural variation in drought tolerance of maize may facilitate efforts to improve this trait in cultivated germplasm. Here, using a genome-wide association study, we show that a miniature inverted-repeat transposable element (MITE) inserted in the promoter of a NAC gene (ZmNAC111) is significantly associated with natural variation in maize drought tolerance. The 82-bp MITE represses ZmNAC111 expression via RNA-directed DNA methylation and H3K9 dimethylation when heterologously expressed in Arabidopsis. Increasing ZmNAC111 expression in transgenic maize enhances drought tolerance at the seedling stage, improves water-use efficiency and induces upregulation of drought-responsive genes under water stress. The MITE insertion in the ZmNAC111 promoter appears to have occurred after maize domestication and spread among temperate germplasm. The identification of this MITE insertion provides insight into the genetic basis for natural variation in maize drought tolerance. PMID:26387805
Zaba: a novel miniature transposable element present in genomes of legume plants.
Macas, J; Neumann, P; Pozárková, D
2003-08-01
A novel family of miniature transposable elements, named Zaba, was identified in pea (Pisum sativum) and subsequently also in other legume species using computer analysis of their DNA sequences. Zaba elements are 141-190 bp long, generate 10-bp target site duplications, and their terminal inverted repeats make up most of the sequence. Zaba elements thus resemble class 3 foldback transposons. The elements are only moderately repetitive in pea (tens to hundreds copies per haploid genome), but they are present in up to thousands of copies in the genomes of several Medicago and Vicia species. More detailed analysis of the elements from pea, including isolation of new sequences from a genomic library, revealed that a fraction of these elements are truncated, and that their last transposition probably did not occur recently. A search for Zaba sequences in EST databases showed that at least some elements are transcribed, most probably due to their association with genic regions.
Plastid genome sequence of an ornamental and editable fruit tree of Rosaceae, Prunus mume.
Wang, Shuo; Gao, Cheng-Wen; Gao, Li-Zhi
2016-11-01
Here we assembled and analyzed the complete chloroplast genome of Prunus mume, a popular ornamental and editable fruit tree of Rosaceae. The cp genome exhibited a circular DNA molecule of 157 712 bp with a typical quadripartite structure consisted of two inverted repeat regions (IRa and IRb) of 26 394 bp separated by large (LSC) and small (SSC) single-copy regions of 85 861 and 19 063 bp, respectively. It encoded 112 unique genes, 19 of which were duplicated in the IR regions, giving a total of 131 genes. Eighteen of these genes harbored one or two introns. GC content was 38.9%, and coding regions accounted for 51.3% of the genome. Phylogenetic analysis showed that P. mume clustered with P. persica and P. kansuensis in the genus Punus. This newly determined chloroplast genome will enhance modern breeding programs for the purpose of genetic improvement of this valuable plant.
Isolation and characterization of a water stress-specific genomic gene, pwsi 18, from rice.
Joshee, N; Kisaka, H; Kitagawa, Y
1998-01-01
One of the water stress-specific cDNA clones of rice characterised previously, wsi18, was selected for further study. The wsi18 gene can be induced by water stress conditions such as mannitol, NaCl, and dryness, but not by ABA, cold, or heat. A genomic clone for wsi18, pwsi18, contained about 1.7 kbp of the 5' upstream sequence, two introns, and the full coding sequence. The 5'-upstream sequence of pwsi18 contained putative cis-acting elements, namely an ABA-responsive element (ABRE), three G-boxes, three E-boxes, a MEF-2 sequence, four direct and two inverted repeats, and four sequences similar to DRE, which is involved in the dehydration response of Arabidopsis genes. The gusA reporter gene under the control of the pwsi18 promoter showed transient expression in response to water stress. Deletion of the downstream DRE-like sequence between the distal G-boxes-2 and -3 resulted in rather low GUS expression.
Complete plastid genome sequence of goosegrass (Eleusine indica) and comparison with other Poaceae.
Zhang, Hui; Hall, Nathan; McElroy, J Scott; Lowe, Elijah K; Goertzen, Leslie R
2017-02-05
Eleusine indica, also known as goosegrass, is a serious weed in at least 42 countries. In this paper we report the complete plastid genome sequence of goosegrass obtained by de novo assembly of paired-end and mate-paired reads generated by Illumina sequencing of total genomic DNA. The goosegrass plastome is a circular molecule of 135,151bp in length, consisting of two single-copy regions separated by a pair of inverted repeats (IRs) of 20,919 bases. The large (LSC) and the small (SSC) single-copy regions span 80,667 bases and 12,646 bases, respectively. The plastome of goosegrass has 38.19% GC content and includes 108 unique genes, of which 76 are protein-coding, 28 are transfer RNA, and 4 are ribosomal RNA. The goosegrass plastome sequence was compared to eight other species of Poaceae. Although generally conserved with respect to Poaceae, this genomic resource will be useful for evolutionary studies within this weed species and the genus Eleusine. Copyright © 2016. Published by Elsevier B.V.
Molecular architecture of classical cytological landmarks: Centromeres and telomeres
DOE Office of Scientific and Technical Information (OSTI.GOV)
Meyne, J.
1994-11-01
Both the human telomere repeat and the pericentromeric repeat sequence (GGAAT)n were isolated based on evolutionary conservation. Their isolation was based on the premise that chromosomal features as structurally and functionally important as telomeres and centromeres should be highly conserved. Both sequences were isolated by high stringency screening of a human repetitive DNA library with rodent repetitive DNA. The pHuR library (plasmid Human Repeat) used for this project was enriched for repetitive DNA by using a modification of the standard DNA library preparation method. Usually DNA for a library is cut with restriction enzymes, packaged, infected, and the library ismore » screened. A problem with this approach is that many tandem repeats don`t have any (or many) common restriction sites. Therefore, many of the repeat sequences will not be represented in the library because they are not restricted to a viable length for the vector used. To prepare the pHuR library, human DNA was mechanically sheared to a small size. These relatively short DNA fragments were denatured and then renatured to C{sub o}t 50. Theoretically only repetitive DNA sequences should renature under C{sub o}t 50 conditions. The single-stranded regions were digested using S1 nuclease, leaving the double-stranded, renatured repeat sequences.« less
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.
de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas
2015-11-16
Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Wachter, Shaun; Raghavan, Rahul; Wachter, Jenny; Minnick, Michael F
2018-04-11
Coxiella burnetii is a Gram-negative gammaproteobacterium and zoonotic agent of Q fever. C. burnetii's genome contains an abundance of pseudogenes and numerous selfish genetic elements. MITEs (miniature inverted-repeat transposable elements) are non-autonomous transposons that occur in all domains of life and are thought to be insertion sequences (ISs) that have lost their transposase function. Like most transposable elements (TEs), MITEs are thought to play an active role in evolution by altering gene function and expression through insertion and deletion activities. However, information regarding bacterial MITEs is limited. We describe two MITE families discovered during research on small non-coding RNAs (sRNAs) of C. burnetii. Two sRNAs, Cbsr3 and Cbsr13, were found to originate from a novel MITE family, termed QMITE1. Another sRNA, CbsR16, was found to originate from a separate and novel MITE family, termed QMITE2. Members of each family occur ~ 50 times within the strains evaluated. QMITE1 is a typical MITE of 300-400 bp with short (2-3 nt) direct repeats (DRs) of variable sequence and is often found overlapping annotated open reading frames (ORFs). Additionally, QMITE1 elements possess sigma-70 promoters and are transcriptionally active at several loci, potentially influencing expression of nearby genes. QMITE2 is smaller (150-190 bps), but has longer (7-11 nt) DRs of variable sequences and is mainly found in the 3' untranslated region of annotated ORFs and intergenic regions. QMITE2 contains a GTAG repetitive extragenic palindrome (REP) that serves as a target for IS1111 TE insertion. Both QMITE1 and QMITE2 display inter-strain linkage and sequence conservation, suggesting that they are adaptive and existed before divergence of C. burnetii strains. We have discovered two novel MITE families of C. burnetii. Our finding that MITEs serve as a source for sRNAs is novel. QMITE2 has a unique structure and occurs in large or small versions with unique DRs that display linkage and sequence conservation between strains, allowing for tracking of genomic rearrangements. QMITE1 and QMITE2 copies are hypothesized to influence expression of neighboring genes involved in DNA repair and virulence through transcriptional interference and ribonuclease processing.
Contrasting Patterns of rDNA Homogenization within the Zygosaccharomyces rouxii Species Complex
Chand Dakal, Tikam; Giudici, Paolo; Solieri, Lisa
2016-01-01
Arrays of repetitive ribosomal DNA (rDNA) sequences are generally expected to evolve as a coherent family, where repeats within such a family are more similar to each other than to orthologs in related species. The continuous homogenization of repeats within individual genomes is a recombination process termed concerted evolution. Here, we investigated the extent and the direction of concerted evolution in 43 yeast strains of the Zygosaccharomyces rouxii species complex (Z. rouxii, Z. sapae, Z. mellis), by analyzing two portions of the 35S rDNA cistron, namely the D1/D2 domains at the 5’ end of the 26S rRNA gene and the segment including the internal transcribed spacers (ITS) 1 and 2 (ITS regions). We demonstrate that intra-genomic rDNA sequence variation is unusually frequent in this clade and that rDNA arrays in single genomes consist of an intermixing of Z. rouxii, Z. sapae and Z. mellis-like sequences, putatively evolved by reticulate evolutionary events that involved repeated hybridization between lineages. The levels and distribution of sequence polymorphisms vary across rDNA repeats in different individuals, reflecting four patterns of rDNA evolution: I) rDNA repeats that are homogeneous within a genome but are chimeras derived from two parental lineages via recombination: Z. rouxii in the ITS region and Z. sapae in the D1/D2 region; II) intra-genomic rDNA repeats that retain polymorphisms only in ITS regions; III) rDNA repeats that vary only in their D1/D2 domains; IV) heterogeneous rDNA arrays that have both polymorphic ITS and D1/D2 regions. We argue that an ongoing process of homogenization following allodiplodization or incomplete lineage sorting gave rise to divergent evolutionary trajectories in different strains, depending upon temporal, structural and functional constraints. We discuss the consequences of these findings for Zygosaccharomyces species delineation and, more in general, for yeast barcoding. PMID:27501051
Deng, Dong; Yan, Chuangye; Wu, Jianping; Pan, Xiaojing; Yan, Nieng
2014-04-01
Transcription activator-like (TAL) effectors specifically bind to double stranded (ds) DNA through a central domain of tandem repeats. Each TAL effector (TALE) repeat comprises 33-35 amino acids and recognizes one specific DNA base through a highly variable residue at a fixed position in the repeat. Structural studies have revealed the molecular basis of DNA recognition by TALE repeats. Examination of the overall structure reveals that the basic building block of TALE protein, namely a helical hairpin, is one-helix shifted from the previously defined TALE motif. Here we wish to suggest a structure-based re-demarcation of the TALE repeat which starts with the residues that bind to the DNA backbone phosphate and concludes with the base-recognition hyper-variable residue. This new numbering system is consistent with the α-solenoid superfamily to which TALE belongs, and reflects the structural integrity of TAL effectors. In addition, it confers integral number of TALE repeats that matches the number of bound DNA bases. We then present fifteen crystal structures of engineered dHax3 variants in complex with target DNA molecules, which elucidate the structural basis for the recognition of bases adenine (A) and guanine (G) by reported or uncharacterized TALE codes. Finally, we analyzed the sequence-structure correlation of the amino acid residues within a TALE repeat. The structural analyses reported here may advance the mechanistic understanding of TALE proteins and facilitate the design of TALEN with improved affinity and specificity.
Pritham, Ellen J; Putliwala, Tasneem; Feschotte, Cédric
2007-04-01
We previously identified a group of atypical mobile elements designated Mavericks from the nematodes Caenorhabditis elegans and C. briggsae and the zebrafish Danio rerio. Here we present the results of comprehensive database searches of the genome sequences available, which reveal that Mavericks are widespread in invertebrates and non-mammalian vertebrates but show a patchy distribution in non-animal species, being present in the fungi Glomus intraradices and Phakopsora pachyrhizi and in several single-celled eukaryotes such as the ciliate Tetrahymena thermophila, the stramenopile Phytophthora infestans and the trichomonad Trichomonas vaginalis, but not detectable in plants. This distribution, together with comparative and phylogenetic analyses of Maverick-encoded proteins, is suggestive of an ancient origin of these elements in eukaryotes followed by lineage-specific losses and/or recurrent episodes of horizontal transmission. In addition, we report that Maverick elements have amplified recently to high copy numbers in T. vaginalis where they now occupy as much as 30% of the genome. Sequence analysis confirms that most Mavericks encode a retroviral-like integrase, but lack other open reading frames typically found in retroelements. Nevertheless, the length and conservation of the target site duplication created upon Maverick insertion (5- or 6-bp) is consistent with a role of the integrase-like protein in the integration of a double-stranded DNA transposition intermediate. Mavericks also display long terminal-inverted repeats but do not contain ORFs similar to proteins encoded by DNA transposons. Instead, Mavericks encode a conserved set of 5 to 9 genes (in addition to the integrase) that are predicted to encode proteins with homology to replication and packaging proteins of some bacteriophages and diverse eukaryotic double-stranded DNA viruses, including a DNA polymerase B homolog and putative capsid proteins. Based on these and other structural similarities, we speculate that Mavericks represent an evolutionary missing link between seemingly disparate invasive DNA elements that include bacteriophages, adenoviruses and eukaryotic linear plasmids.
Evidence that human papillomavirus causes inverted papilloma is sparse.
Justice, Jeb M; Davis, Kern M; Saenz, Daniel A; Lanza, Donald C
2014-12-01
Controversy exists regarding the pathogenesis of inverted papilloma as it relates to the involvement of human papillomavirus (HPV). The purpose of this report is to describe the prevalence of HPV in nondysplastic, "early inverted papilloma" and to summarize HPV detection rates in the general population and in other HPV related neoplasia. This case series report characterizes consecutive inverted papilloma patients from January 2005 to August 2012 with regard to smoking history, dysplasia, and HPV detection rates. Presence or absence of low/high risk HPV was determined by standardized in situ hybridization DNA probes. Medline literature review was performed to determine the prevalence of HPV in inverted papilloma without moderate or severe dysplasia. Thirty-six consecutive patients were identified with an average age of 63.6 (range, 40-84) years; gender: 23 men, 13 women. More than half (55%) were active or former smokers (14% active and 41% former). High/low risk HPV was present in 1 in 36 (2.7%) patients and 1 in 36 (2.7%) had mild dysplasia. In the literature review: (1) HPV was detected in 16.4% of inverted papilloma without dysplasia; (2) oral cavity HPV detection was 4.2% to 11.4% in the normal population; and (3) HPV was normally detected in 85% to 95% of HPV-related neoplasia. Given histological features of inverted papilloma and comparatively low detection rates of HPV in inverted papilloma without dysplasia (2.7%), as well as the summary of the world literature, HPV is not related to the initial pathogenesis of inverted papilloma or inverted papilloma's tendency to persist or recur. It is postulated that since inverted papilloma is more an inflammatory polyp, it is susceptible to secondary HPV infection because of its metaplasia. Tobacco and other causes of respiratory epithelium remodeling are more plausible explanations for the initial tissue transformation to inverted papilloma. © 2014 ARS-AAOA, LLC.
Linkage map of the fragments of herpesvirus papio DNA.
Lee, Y S; Tanaka, A; Lau, R Y; Nonoyama, M; Rabin, H
1981-01-01
Herpesvirus papio (HVP), an Epstein-Barr-like virus, causes lymphoblastoid disease in baboons. The physical map of HVP DNA was constructed for the fragments produced by cleavage of HVP DNA with restriction endonucleases EcoRI, HindIII, SalI, and PvuI, which produced 12, 12, 10, and 4 fragments, respectively. The total molecular size of HVP DNA was calculated as close to 110 megadaltons. The following methods were used for construction of the map; (i) fragments near the ends of HVP DNA were identified by treating viral DNA with lambda exonuclease before restriction enzyme digestion; (ii) fragments containing nucleotide sequences in common with fragments from the second enzyme digest of HVP DNA were examined by Southern blot hybridization; and (iii) the location of some fragments was determined by isolating individual fragments from agarose gels and redigesting the isolated fragments with a second restriction enzyme. Terminal heterogeneity and internal repeats were found to be unique features of HVP DNA molecule. One to five repeats of 0.8 megadaltons were found at both terminal ends. Although the repeats of both ends shared a certain degree of homology, it was not determined whether they were identical repeats. The internal repeat sequence of HVP DNA was found in the EcoRI-C region, which extended from 8.4 to 23 megadaltons from the left end of the molecule. The average number of the repeats was calculated to be seven, and the molecular size was determined to be 1.8 megadaltons. Similar unique features have been reported in EBV DNA (D. Given and E. Kieff, J. Virol. 28:524-542, 1978). Images PMID:6261015
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peng, Jamy C.
Heterochromatin constitutes a significant portion of the genome in higher eukaryotes; approximately 30% in Drosophila and human. Heterochromatin contains a high repeat DNA content and a low density of protein-encoding genes. In contrast, euchromatin is composed mostly of unique sequences and contains the majority of single-copy genes. Genetic and cytological studies demonstrated that heterochromatin exhibits regulatory roles in chromosome organization, centromere function and telomere protection. As an epigenetically regulated structure, heterochromatin formation is not defined by any DNA sequence consensus. Heterochromatin is characterized by its association with nucleosomes containing methylated-lysine 9 of histone H3 (H3K9me), heterochromatin protein 1 (HP1) thatmore » binds H3K9me, and Su(var)3-9, which methylates H3K9 and binds HP1. Heterochromatin formation and functions are influenced by HP1, Su(var)3-9, and the RNA interference (RNAi) pathway. My thesis project investigates how heterochromatin formation and function impact nuclear architecture, repeated DNA organization, and genome stability in Drosophila melanogaster. H3K9me-based chromatin reduces extrachromosomal DNA formation; most likely by restricting the access of repair machineries to repeated DNAs. Reducing extrachromosomal ribosomal DNA stabilizes rDNA repeats and the nucleolus structure. H3K9me-based chromatin also inhibits DNA damage in heterochromatin. Cells with compromised heterochromatin structure, due to Su(var)3-9 or dcr-2 (a component of the RNAi pathway) mutations, display severe DNA damage in heterochromatin compared to wild type. In these mutant cells, accumulated DNA damage leads to chromosomal defects such as translocations, defective DNA repair response, and activation of the G2-M DNA repair and mitotic checkpoints that ensure cellular and animal viability. My thesis research suggests that DNA replication, repair, and recombination mechanisms in heterochromatin differ from those in euchromatin. Remarkably, human euchromatin and fly heterochromatin share similar features; such as repeated DNA content, intron lengths and open reading frame sizes. Human cells likely stabilize their DNA content via mechanisms and factors similar to those in Drosophila heterochromatin. Furthermore, my thesis work raises implications for H3K9me and chromatin functions in complex-DNA genome stability, repeated DNA homogenization by molecular drive, and in genome reorganization through evolution.« less
Complex structure of knob DNA on maize chromosome 9. Retrotransposon invasion into heterochromatin.
Ananiev, E V; Phillips, R L; Rines, H W
1998-01-01
The recovery of maize (Zea mays L.) chromosome addition lines of oat (Avena sativa L.) from oat x maize crosses enables us to analyze the structure and composition of specific regions, such as knobs, of individual maize chromosomes. A DNA hybridization blot panel of eight individual maize chromosome addition lines revealed that 180-bp repeats found in knobs are present in each of these maize chromosomes, but the copy number varies from approximately 100 to 25, 000. Cosmid clones with knob DNA segments were isolated from a genomic library of an oat-maize chromosome 9 addition line with the help of the 180-bp knob-associated repeated DNA sequence used as a probe. Cloned knob DNA segments revealed a complex organization in which blocks of tandemly arranged 180-bp repeating units are interrupted by insertions of other repeated DNA sequences, mostly represented by individual full size copies of retrotransposable elements. There is an obvious preference for the integration of retrotransposable elements into certain sites (hot spots) of the 180-bp repeat. Sequence microheterogeneity including point mutations and duplications was found in copies of 180-bp repeats. The 180-bp repeats within an array all had the same polarity. Restriction maps constructed for 23 cloned knob DNA fragments revealed the positions of polymorphic sites and sites of integration of insertion elements. Discovery of the interspersion of retrotransposable elements among blocks of tandem repeats in maize and some other organisms suggests that this pattern may be basic to heterochromatin organization for eukaryotes. PMID:9691055
2013-01-01
Background Adenosine-to-inosine (A-to-I) RNA editing is recognized as a cellular mechanism for generating both RNA and protein diversity. Inosine base pairs with cytidine during reverse transcription and therefore appears as guanosine during sequencing of cDNA. Current approaches of RNA editing identification largely depend on the comparison between transcriptomes and genomic DNA (gDNA) sequencing datasets from the same individuals, and it has been challenging to identify editing candidates from transcriptomes in the absence of gDNA information. Results We have developed a new strategy to accurately predict constitutive RNA editing sites from publicly available human RNA-seq datasets in the absence of relevant genomic sequences. Our approach establishes new parameters to increase the ability to map mismatches and to minimize sequencing/mapping errors and unreported genome variations. We identified 695 novel constitutive A-to-I editing sites that appear in clusters (named “editing boxes”) in multiple samples and which exhibit spatial and dynamic regulation across human tissues. Some of these editing boxes are enriched in non-repetitive regions lacking inverted repeat structures and contain an extremely high conversion frequency of As to Is. We validated a number of editing boxes in multiple human cell lines and confirmed that ADAR1 is responsible for the observed promiscuous editing events in non-repetitive regions, further expanding our knowledge of the catalytic substrate of A-to-I RNA editing by ADAR enzymes. Conclusions The approach we present here provides a novel way of identifying A-to-I RNA editing events by analyzing only RNA-seq datasets. This method has allowed us to gain new insights into RNA editing and should also aid in the identification of more constitutive A-to-I editing sites from additional transcriptomes. PMID:23537002
Wu, Chung-Shien; Chaw, Shu-Miaw
2014-04-01
Although conifers are of immense ecological and economic value, bioengineering of their chloroplasts remains undeveloped. Understanding the chloroplast genomic organization of conifers can facilitate their bioengineering. Members of the conifer II clade (or cupressophytes) are highly diverse in both morphologic features and chloroplast genomic organization. We compared six cupressophyte chloroplast genomes (cpDNAs) that represent four of the five cupressophyte families, including three genomes that are first reported here (Agathis dammara, Calocedrus formosana and Nageia nagi). The six cupressophyte cpDNAs have lost a pair of large inverted repeats (IRs) and vary greatly in size, organization and tRNA copies. We demonstrate that cupressophyte cpDNAs have evolved towards reduced size, largely due to shrunken intergenic spacers. In cupressophytes, cpDNA rearrangements are capable of extending intergenic spacers, and synonymous mutations are negatively associated with the size and frequency of rearrangements. The variable cpDNA sizes of cupressophytes may have been shaped by mutational burden and genomic rearrangements. On the basis of cpDNA organization, our analyses revealed that in gymnosperms, cpDNA rearrangements are phylogenetically informative, which supports the 'gnepines' clade. In addition, removal of a specific IR influences the minimal rearrangements required for the gnepines and cupressophyte clades, whereby Pinaceae favours the removal of IRB but cupressophytes exclusion of IRA. This result strongly suggests that different IR copies have been lost from conifers I and II. Our data help understand the complexity and evolution of cupressophyte cpDNAs. © 2013 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology, The Association of Applied Biologists and John Wiley & Sons Ltd.
Yu, Xiang-Qin; Drew, Bryan T; Yang, Jun-Bo; Gao, Lian-Ming; Li, De-Zhu
2017-01-01
Schima is an ecologically and economically important woody genus in tea family (Theaceae). Unresolved species delimitations and phylogenetic relationships within Schima limit our understanding of the genus and hinder utilization of the genus for economic purposes. In the present study, we conducted comparative analysis among the complete chloroplast (cp) genomes of 11 Schima species. Our results indicate that Schima cp genomes possess a typical quadripartite structure, with conserved genomic structure and gene order. The size of the Schima cp genome is about 157 kilo base pairs (kb). They consistently encode 114 unique genes, including 80 protein-coding genes, 30 tRNAs, and 4 rRNAs, with 17 duplicated in the inverted repeat (IR). These cp genomes are highly conserved and do not show obvious expansion or contraction of the IR region. The percent variability of the 68 coding and 93 noncoding (>150 bp) fragments is consistently less than 3%. The seven most widely touted DNA barcode regions as well as one promising barcode candidate showed low sequence divergence. Eight mutational hotspots were identified from the 11 cp genomes. These hotspots may potentially be useful as specific DNA barcodes for species identification of Schima. The 58 cpSSR loci reported here are complementary to the microsatellite markers identified from the nuclear genome, and will be leveraged for further population-level studies. Phylogenetic relationships among the 11 Schima species were resolved with strong support based on the cp genome data set, which corresponds well with the species distribution pattern. The data presented here will serve as a foundation to facilitate species identification, DNA barcoding and phylogenetic reconstructions for future exploration of Schima.
Genetic manipulation of Bacillus methanolicus, a gram-positive, thermotolerant methylotroph.
Cue, D; Lam, H; Dillingham, R L; Hanson, R S; Flickinger, M C
1997-01-01
We report the fist genetic transformation system, shuttle vectors, and integrative vectors for the thermotolerant, methylotrophic bacterium Bacillus methanolicus. By using a polyethylene glycol-mediated transformation procedure, we have successfully transformed B. methanolicus with both integrative and multicopy plasmids. For plasmids with a single BmeTI recognition site, dam methylation of plasmid DNA (in vivo or in vitro) was found to enhance transformation efficiency from 7- to 11-fold. Two low-copy-number Escherichia coli-B, methanolicus shuttle plasmids, pDQ507 and pDQ508, are described. pDQ508 caries the replication origin cloned from a 17-kb endogenous B. methanolicus plasmid, pBM1. pDQ507 carries a cloned B. methanolicus DNA fragment, pmr-1, possibly of chromosomal origin, that supports maintenance of pDQ507 as a circular, extrachromosomal DNA molecule. Deletion analysis of pDQ507 indicated two regions required for replication, i.e., a 90-bp AT-rich segment containing a 46-bp imperfect, inverted repeat sequence and a second region 65% homologous to the B. subtilis dpp operon. We also evaluated two E. coli-B. subtilis vectors, pEN1 and pHP13, for use as E. coli-B. methanolicus shuttle vectors. The plasmids pHP13, pDQ507, and pDQ508 were segregationally and structurally stable in B. methanolicus for greater than 60 generations of growth under nonselective conditions; pEN1 was segregationally unstable. Single-stranded plasmid DNA was detected in B. methanolicus transformants carrying either pEN1, pHP13, or pDQ508, suggesting that pDQ508, like the B. subtilis plasmids, is replicated by a rolling-circle mechanism. These studies provide the basic tools for the genetic manipulation of B. methanolicus. PMID:9097439
Genetic manipulation of Bacillus methanolicus, a gram-positive, thermotolerant methylotroph.
Cue, D; Lam, H; Dillingham, R L; Hanson, R S; Flickinger, M C
1997-04-01
We report the fist genetic transformation system, shuttle vectors, and integrative vectors for the thermotolerant, methylotrophic bacterium Bacillus methanolicus. By using a polyethylene glycol-mediated transformation procedure, we have successfully transformed B. methanolicus with both integrative and multicopy plasmids. For plasmids with a single BmeTI recognition site, dam methylation of plasmid DNA (in vivo or in vitro) was found to enhance transformation efficiency from 7- to 11-fold. Two low-copy-number Escherichia coli-B, methanolicus shuttle plasmids, pDQ507 and pDQ508, are described. pDQ508 caries the replication origin cloned from a 17-kb endogenous B. methanolicus plasmid, pBM1. pDQ507 carries a cloned B. methanolicus DNA fragment, pmr-1, possibly of chromosomal origin, that supports maintenance of pDQ507 as a circular, extrachromosomal DNA molecule. Deletion analysis of pDQ507 indicated two regions required for replication, i.e., a 90-bp AT-rich segment containing a 46-bp imperfect, inverted repeat sequence and a second region 65% homologous to the B. subtilis dpp operon. We also evaluated two E. coli-B. subtilis vectors, pEN1 and pHP13, for use as E. coli-B. methanolicus shuttle vectors. The plasmids pHP13, pDQ507, and pDQ508 were segregationally and structurally stable in B. methanolicus for greater than 60 generations of growth under nonselective conditions; pEN1 was segregationally unstable. Single-stranded plasmid DNA was detected in B. methanolicus transformants carrying either pEN1, pHP13, or pDQ508, suggesting that pDQ508, like the B. subtilis plasmids, is replicated by a rolling-circle mechanism. These studies provide the basic tools for the genetic manipulation of B. methanolicus.
Shashi, V.; Golden, W. L.; Allinson, P. S.; Blanton, S. H.; von Kap-Herr, C.; Kelly, T. E.
1996-01-01
It has been demonstrated in animal studies that, in animals heterozygous for pericentric chromosomal inversions, loop formation is greatly reduced during meiosis. This results in absence of recombination within the inverted segment, with recombination seen only outside the inversion. A recent study in yeast has shown that telomeres, rather than centromeres, lead in chromosome movement just prior to meiosis and may be involved in promoting recombination. We studied by cytogenetic analysis and DNA polymorphisms the nature of meiotic recombination in a three-generation family with a large pericentric X chromosome inversion, inv(X)(p21.1q26), in which Duchenne muscular dystrophy (DMD) was cosegregating with the inversion. On DNA analysis there was no evidence of meiotic recombination between the inverted and normal X chromosomes in the inverted segment. Recombination was seen at the telomeric regions, Xp22 and Xq27-28. No deletion or point mutation was found on analysis of the DMD gene. On the basis of the FISH results, we believe that the X inversion is the mutation responsible for DMD in this family. Our results indicate that (1) pericentric X chromosome inversions result in reduction of recombination between the normal and inverted X chromosomes; (2) meiotic X chromosome pairing in these individuals is likely initiated at the telomeres; and (3) in this family DMD is caused by the pericentric inversion. Images Figure 2 Figure 5 Figure 6 Figure 7 PMID:8651300
Han, Limin; Chen, Chen; Wang, Zhezhi
2018-01-01
Epipremnum aureum is an important foliage plant in the Araceae family. In this study, we have sequenced the complete chloroplast genome of E. aureum by using Illumina Hiseq sequencing platforms. This genome is a double-stranded circular DNA sequence of 164,831 bp that contains 35.8% GC. The two inverted repeats (IRa and IRb; 26,606 bp) are spaced by a small single-copy region (22,868 bp) and a large single-copy region (88,751 bp). The chloroplast genome has 131 (113 unique) functional genes, including 86 (79 unique) protein-coding genes, 37 (30 unique) tRNA genes, and eight (four unique) rRNA genes. Tandem repeats comprise the majority of the 43 long repetitive sequences. In addition, 111 simple sequence repeats are present, with mononucleotides being the most common type and di- and tetranucleotides being infrequent events. Positive selection pressure on rps12 in the E. aureum chloroplast has been demonstrated via synonymous and nonsynonymous substitution rates and selection pressure sites analyses. Ycf15 and infA are pseudogenes in this species. We constructed a Maximum Likelihood phylogenetic tree based on the complete chloroplast genomes of 38 species from 13 families. Those results strongly indicated that E. aureum is positioned as the sister of Colocasia esculenta within the Araceae family. This work may provide information for further study of the molecular phylogenetic relationships within Araceae, as well as molecular markers and breeding novel varieties by chloroplast genetic-transformation of E. aureum in particular. PMID:29529038
Kuipers, A G J; Kamstra, S A; de Jeu, M J; Visser, R G F
2002-01-01
Highly repetitive DNA sequences were isolated from genomic DNA libraries of Alstroemeria psittacina and A. inodora. Among the repetitive sequences that were isolated, tandem repeats as well as dispersed repeats could be discerned. The tandem repeats belonged to a family of interlinked Sau3A subfragments with sizes varying from 68-127 bp, and constituted a larger HinfI repeat of approximately 400 bp. Southern hybridization showed a similar molecular organization of the tandem repeats in each of the Brazilian Alstroemeria species tested. None of the repeats hybridized with DNA from Chilean Alstroemeria species, which indicates that they are specific for the Brazilian species. In-situ localization studies revealed the tandem repeats to be localized in clusters on the chromosomes of A. inodora and A. psittacina: distal hybridization sites were found on chromosome arms 2PS, 6PL, 7PS, 7PL and 8PL, interstitial sites on chromosome arms 2PL, 3PL, 4PL and 5PL. The applicability of the tandem repeats for cytogenetic analysis of interspecific hybrids and their role in heterochromatin organization are discussed.
Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.
Benslimane, A A; Dron, M; Hartmann, C; Rode, A
1986-01-01
Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology between one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (respectively 64% and 60%). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammalians. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. Images PMID:3774553
Su, Ming; Lee, Daniel; Ganss, Bernhard; Sodek, Jaro
2006-04-14
Basal transcription of the bone sialoprotein gene is mediated by highly conserved inverted CCAAT (ICE; ATTGG) and TATA elements (TTTATA) separated by precisely 21 nucleotides. Here we studied the importance of the relative position and orientation of the CCAAT and TATA elements in the proximal promoter by measuring the transcriptional activity of a series of mutated reporter constructs in transient transfection assays. Whereas inverting the TTTATA (wild type) to a TATAAA (consensus TATA) sequence increased transcription slightly, transcription was reduced when the flanking dinucleotides were also inverted. In contrast, reversing the ATTGG (wild type; ICE) to a CCAAT (RICE) sequence caused a marked reduction in transcription, whereas both transcription and NF-Y binding were progressively increased with the simultaneous inversion of flanking nucleotides (f-RICE-f). Reducing the distance between the ICE and TATA elements produced cyclical changes in transcriptional activity that correlated with progressive alterations in the relative positions of the CCAAT and TATA elements on the face of the DNA helix. Minimal transcription was observed after 5 nucleotides were deleted (equivalent to approximately one half turn of the helix), whereas transcription was fully restored after deleting 10 nucleotides (approximately one full turn of the DNA helix), transcriptional activity being progressively lost with deletions beyond 10 nucleotides. In comparison, when deletions were made with the ICE in the reversed (f-RICE-f) orientation transcriptional activity was progressively lost with no recovery. These results show that, although transcription can still occur when the CCAAT box is reversed and/or displaced relative to the TATA box, the activity is dependent upon the flexibility of the intervening DNA helix needed to align the NF-Y complex on the CCAAT box with preinitiation complex proteins that bind to the TATA box. Thus, the precise location and orientation of the CCAAT element is necessary for optimizing basal transcription of the bone sialoprotein gene.
Vergara-Jaque, Ariela; Fenollar-Ferrer, Cristina; Kaufmann, Desirée; Forrest, Lucy R.
2015-01-01
Secondary active transporters are critical for neurotransmitter clearance and recycling during synaptic transmission and uptake of nutrients. These proteins mediate the movement of solutes against their concentration gradients, by using the energy released in the movement of ions down pre-existing concentration gradients. To achieve this, transporters conform to the so-called alternating-access hypothesis, whereby the protein adopts at least two conformations in which the substrate binding sites are exposed to one or other side of the membrane, but not both simultaneously. Structures of a bacterial homolog of neuronal glutamate transporters, GltPh, in several different conformational states have revealed that the protein structure is asymmetric in the outward- and inward-open states, and that the conformational change connecting them involves a elevator-like movement of a substrate binding domain across the membrane. The structural asymmetry is created by inverted-topology repeats, i.e., structural repeats with similar overall folds whose transmembrane topologies are related to each other by two-fold pseudo-symmetry around an axis parallel to the membrane plane. Inverted repeats have been found in around three-quarters of secondary transporter folds. Moreover, the (a)symmetry of these systems has been successfully used as a bioinformatic tool, called “repeat-swap modeling” to predict structural models of a transporter in one conformation using the known structure of the transporter in the complementary conformation as a template. Here, we describe an updated repeat-swap homology modeling protocol, and calibrate the accuracy of the method using GltPh, for which both inward- and outward-facing conformations are known. We then apply this repeat-swap homology modeling procedure to a concentrative nucleoside transporter, VcCNT, which has a three-dimensional arrangement related to that of GltPh. The repeat-swapped model of VcCNT predicts that nucleoside transport also occurs via an elevator-like mechanism. PMID:26388773
BAC Modification through Serial or Simultaneous Use of CRE/Lox Technology
Parrish, Mark; Unruh, Jay; Krumlauf, Robb
2011-01-01
Bacterial Artificial Chromosomes (BACs) are vital tools in mouse genomic analyses because of their ability to propagate large inserts. The size of these constructs, however, prevents the use of conventional molecular biology techniques for modification and manipulation. Techniques such as recombineering and Cre/Lox methodologies have thus become heavily relied upon for such purposes. In this work, we investigate the applicability of Lox variant sites for serial and/or simultaneous manipulations of BACs. We show that Lox spacer mutants are very specific, and inverted repeat variants reduce Lox reaction rates through reducing the affinity of Cre for the site, while retaining some functionality. Employing these methods, we produced serial modifications encompassing four independent changes which generated a mouse HoxB BAC with fluorescent reporter proteins inserted into four adjacent Hox genes. We also generated specific, simultaneous deletions using combinations of spacer variants and inverted repeat variants. These techniques will facilitate BAC manipulations and open a new repertoire of methods for BAC and genome manipulation. PMID:21197414
NASA Astrophysics Data System (ADS)
Ito, Masami; Kari, Lila; Kincaid, Zachary; Seki, Shinnosuke
The duplication and repeat-deletion operations are the basis of a formal language theoretic model of errors that can occur during DNA replication. During DNA replication, subsequences of a strand of DNA may be copied several times (resulting in duplications) or skipped (resulting in repeat-deletions). As formal language operations, iterated duplication and repeat-deletion of words and languages have been well studied in the literature. However, little is known about single-step duplications and repeat-deletions. In this paper, we investigate several properties of these operations, including closure properties of language families in the Chomsky hierarchy and equations involving these operations. We also make progress toward a characterization of regular languages that are generated by duplicating a regular language.
Scholze, Heidi; Boch, Jens
2010-01-01
TAL effectors are important virulence factors of bacterial plant pathogenic Xanthomonas, which infect a wide variety of plants including valuable crops like pepper, rice, and citrus. TAL proteins are translocated via the bacterial type III secretion system into host cells and induce transcription of plant genes by binding to target gene promoters. Members of the TAL effector family differ mainly in their central domain of tandemly arranged repeats of typically 34 amino acids each with hypervariable di-amino acids at positions 12 and 13. We recently showed that target DNA-recognition specificity of TAL effectors is encoded in a modular and clearly predictable mode. The repeats of TAL effectors feature a surprising one repeat-to-one-bp correlation with different repeat types exhibiting a different DNA base pair specificity. Accordingly, we predicted DNA specificities of TAL effectors and generated artificial TAL proteins with novel DNA recognition specificities. We describe here novel artificial TALs and discuss implications for the DNA recognition specificity. The unique TAL-DNA binding domain allows design of proteins with potentially any given DNA recognition specificity enabling many uses for biotechnology.
Modular synthetic inverters from zinc finger proteins and small RNAs
Hsia, Justin; Holtz, William J.; Maharbiz, Michel M.; ...
2016-02-17
Synthetic zinc finger proteins (ZFPs) can be created to target promoter DNA sequences, repressing transcription. The binding of small RNA (sRNA) to ZFP mRNA creates an ultrasensitive response to generate higher effective Hill coefficients. Here we combined three “off the shelf” ZFPs and three sRNAs to create new modular inverters in E. coli and quantify their behavior using induction fold. We found a general ordering of the effects of the ZFPs and sRNAs on induction fold that mostly held true when combining these parts. We then attempted to construct a ring oscillator using our new inverters. In conclusion, our chosenmore » parts performed insufficiently to create oscillations, but we include future directions for improvement upon our work presented here.« less
Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using
Weier, H.U.G.; Gray, J.W.
1995-06-27
A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.
Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using
Weier, Heinz-Ulrich G.; Gray, Joe W.
1995-01-01
A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers ard probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.
Sowa, Yoshihiro; Itsukage, Sizu; Morita, Daiki; Numajiri, Toshiaki
2017-10-01
An inverted nipple is a common congenital condition in young women that may cause breastfeeding difficulty, psychological distress, repeated inflammation, and loss of sensation. Various surgical techniques have been reported for correction of inverted nipples, and all have advantages and disadvantages. Here, we report a new technique for correction of an inverted nipple using an operative microscope and traction that results in low recurrence and preserves lactation function and sensation. Between January 2010 and January 2013, we treated eight inverted nipples in seven patients with selective lactiferous duct dissection using an operative microscope. An opposite Z-plasty was added at the junction of the nipple and areola. Postoperatively, traction was applied through an apparatus made from a rubber gasket attached to a sterile syringe. Patients were followed up for 15-48 months. Adequate projection was achieved in all patients, and there was no wound dehiscence or complications such as infection. Three patients had successful pregnancies and subsequent breastfeeding that was not adversely affected by the treatment. There was no loss of sensation in any patient during the postoperative period. Our technique for treating an inverted nipple is effective and preserves lactation function and nipple sensation. The method maintains traction for a longer period, which we believe increases the success rate of the surgery for correction of severely inverted nipples. This journal requires that authors assign a level of evidence to each article. For a full description of these Evidence-Based Medicine ratings, please refer to the Table of Contents or the online Instructions to Authors www.springer.com/00266 .
Preferential Nucleosome Assembly at DNA Triplet Repeats from the Myotonic Dystrophy Gene
NASA Astrophysics Data System (ADS)
Wang, Yuh-Hwa; Amirhaeri, Sorour; Kang, Seongman; Wells, Robert D.; Griffith, Jack D.
1994-07-01
The expansion of CTG repeats in DNA occurs in or near genes involved in several human diseases, including myotonic dystrophy and Huntington's disease. Nucleosomes, the basic structural element of chromosomes, consist of 146 base pairs of DNA coiled about an octamer of histone proteins and mediate general transcriptional repression. Electron microscopy was used to examine in vitro the nucleosome assembly of DNA containing repeating CTG triplets. The efficiency of nucleosome formation increased with expanded triplet blocks, suggesting that such blocks may repress transcription through the creation of stable nucleosomes.
Inverted drop testing and neck injury potential.
Forrest, Stephen; Herbst, Brian; Meyer, Steve; Sances, Anthony; Kumaresan, Srirangam
2003-01-01
Inverted drop testing of vehicles is a methodology that has long been used by the automotive industry and researchers to test roof integrity and is currently being considered by the National Highway Traffic Safety Administration as a roof strength test. In 1990 a study was reported which involved 8 dolly rollover tests and 5 inverted drop tests. These studies were conducted with restrained Hybrid III instrumented Anthropometric Test Devices (ATD) in production and rollcaged vehicles to investigate the relationship between roof strength and occupant injury potential. The 5 inverted drop tests included in the study provided a methodology producing "repeatable roof impacts" exposing the ATDs to the similar impact environment as those seen in the dolly rollover tests. Authors have conducted two inverted drop test sets as part of an investigation of two real world rollover accidents. Hybrid-III ATD's were used in each test with instrumented head and necks. Both test sets confirm that reduction of roof intrusion and increased headroom can significantly enhance occupant protection. In both test pairs, the neck force of the dummy in the vehicle with less crush and more survival space was significantly lower. Reduced roof crush and dynamic preservation of the occupant survival space resulted in only minor occupant contact and minimal occupant loading, establishing a clear causal relationship between roof crush and neck injuries.
Medium-sized tandem repeats represent an abundant component of the Drosophila virilis genome.
Abdurashitov, Murat A; Gonchar, Danila A; Chernukhin, Valery A; Tomilov, Victor N; Tomilova, Julia E; Schostak, Natalia G; Zatsepina, Olga G; Zelentsova, Elena S; Evgen'ev, Michael B; Degtyarev, Sergey K H
2013-11-09
Previously, we developed a simple method for carrying out a restriction enzyme analysis of eukaryotic DNA in silico, based on the known DNA sequences of the genomes. This method allows the user to calculate lengths of all DNA fragments that are formed after a whole genome is digested at the theoretical recognition sites of a given restriction enzyme. A comparison of the observed peaks in distribution diagrams with the results from DNA cleavage using several restriction enzymes performed in vitro have shown good correspondence between the theoretical and experimental data in several cases. Here, we applied this approach to the annotated genome of Drosophila virilis which is extremely rich in various repeats. Here we explored the combined approach to perform the restriction analysis of D. virilis DNA. This approach enabled to reveal three abundant medium-sized tandem repeats within the D. virilis genome. While the 225 bp repeats were revealed previously in intergenic non-transcribed spacers between ribosomal genes of D. virilis, two other families comprised of 154 bp and 172 bp repeats were not described. Tandem Repeats Finder search demonstrated that 154 bp and 172 bp units are organized in multiple clusters in the genome of D. virilis. Characteristically, only 154 bp repeats derived from Helitron transposon are transcribed. Using in silico digestion in combination with conventional restriction analysis and sequencing of repeated DNA fragments enabled us to isolate and characterize three highly abundant families of medium-sized repeats present in the D. virilis genome. These repeats comprise a significant portion of the genome and may have important roles in genome function and structural integrity. Therefore, we demonstrated an approach which makes possible to investigate in detail the gross arrangement and expression of medium-sized repeats basing on sequencing data even in the case of incompletely assembled and/or annotated genomes.
Financsek, I; Mizumoto, K; Mishima, Y; Muramatsu, M
1982-01-01
The transcription initiation site of the human ribosomal RNA gene (rDNA) was located by using the single-strand specific nuclease protection method and by determining the first nucleotide of the in vitro capped 45S preribosomal RNA. The sequence of 1,211 nucleotides surrounding the initiation site was determined. The sequenced region was found to consist of 75% G and C and to contain a number of short direct and inverted repeats and palindromes. By comparison of the corresponding initiation regions of three mammalian species, several conserved sequences were found upstream and downstream from the transcription starting point. Two short A + T-rich sequences are present on human, mouse, and rat ribosomal RNA genes between the initiation site and 40 nucleotides upstream, and a C + T cluster is located at a position around -60. At and downstream from the initiation site, a common sequence, T-AG-C-T-G-A-C-A-C-G-C-T-G-T-C-C-T-CT-T, was found in the three genes from position -1 through +18. The strong conservation of these sequences suggests their functional significance in rDNA. The S1 nuclease protection experiments with cloned rDNA fragments indicated the presence in human 45S RNA of molecules several hundred nucleotides shorter than the supposed primary transcript. The first 19 nucleotides of these molecules appear identical--except for one mismatch--to the nucleotide sequence of the 5' end of a supposed early processing product of the mouse 45S RNA. Images PMID:6954460
The expanding universe of transposon technologies for gene and cell engineering.
Ivics, Zoltán; Izsvák, Zsuzsanna
2010-12-07
Transposable elements can be viewed as natural DNA transfer vehicles that, similar to integrating viruses, are capable of efficient genomic insertion. The mobility of class II transposable elements (DNA transposons) can be controlled by conditionally providing the transposase component of the transposition reaction. Thus, a DNA of interest (be it a fluorescent marker, a small hairpin (sh)RNA expression cassette, a mutagenic gene trap or a therapeutic gene construct) cloned between the inverted repeat sequences of a transposon-based vector can be used for stable genomic insertion in a regulated and highly efficient manner. This methodological paradigm opened up a number of avenues for genome manipulations in vertebrates, including transgenesis for the generation of transgenic cells in tissue culture, the production of germline transgenic animals for basic and applied research, forward genetic screens for functional gene annotation in model species, and therapy of genetic disorders in humans. Sleeping Beauty (SB) was the first transposon shown to be capable of gene transfer in vertebrate cells, and recent results confirm that SB supports a full spectrum of genetic engineering including transgenesis, insertional mutagenesis, and therapeutic somatic gene transfer both ex vivo and in vivo. The first clinical application of the SB system will help to validate both the safety and efficacy of this approach. In this review, we describe the major transposon systems currently available (with special emphasis on SB), discuss the various parameters and considerations pertinent to their experimental use, and highlight the state of the art in transposon technology in diverse genetic applications.
Development of marker-free transgenic lettuce resistant to Mirafiori lettuce big-vein virus.
Kawazu, Yoichi; Fujiyama, Ryoi; Imanishi, Shunsuke; Fukuoka, Hiroyuki; Yamaguchi, Hirotaka; Matsumoto, Satoru
2016-10-01
Lettuce big-vein disease caused by Mirafiori lettuce big-vein virus (MLBVV) is found in major lettuce production areas worldwide, but highly resistant cultivars have not yet been developed. To produce MLBVV-resistant marker-free transgenic lettuce that would have a transgene with a promoter and terminator of lettuce origin, we constructed a two T-DNA binary vector, in which the first T-DNA contained the selectable marker gene neomycin phosphotransferase II, and the second T-DNA contained the lettuce ubiquitin gene promoter and terminator and inverted repeats of the coat protein (CP) gene of MLBVV. This vector was introduced into lettuce cultivars 'Watson' and 'Fuyuhikari' by Agrobacterium tumefaciens-mediated transformation. Regenerated plants (T0 generation) that were CP gene-positive by PCR analysis were self-pollinated, and 312 T1 lines were analyzed for resistance to MLBVV. Virus-negative plants were checked for the CP gene and the marker gene, and nine lines were obtained which were marker-free and resistant to MLBVV. Southern blot analysis showed that three of the nine lines had two copies of the CP gene, whereas six lines had a single copy and were used for further analysis. Small interfering RNAs, which are indicative of RNA silencing, were detected in all six lines. MLBVV infection was inhibited in all six lines in resistance tests performed in a growth chamber and a greenhouse, resulting in a high degree of resistance to lettuce big-vein disease. Transgenic lettuce lines produced in this study could be used as resistant cultivars or parental lines for breeding.
Solution properties of the archaeal CRISPR DNA repeat-binding homeodomain protein Cbp2
Kenchappa, Chandra S.; Heidarsson, Pétur O.; Kragelund, Birthe B.; Garrett, Roger A.; Poulsen, Flemming M.
2013-01-01
Clustered regularly interspaced short palindromic repeats (CRISPR) form the basis of diverse adaptive immune systems directed primarily against invading genetic elements of archaea and bacteria. Cbp1 of the crenarchaeal thermoacidophilic order Sulfolobales, carrying three imperfect repeats, binds specifically to CRISPR DNA repeats and has been implicated in facilitating production of long transcripts from CRISPR loci. Here, a second related class of CRISPR DNA repeat-binding protein, denoted Cbp2, is characterized that contains two imperfect repeats and is found amongst members of the crenarchaeal thermoneutrophilic order Desulfurococcales. DNA repeat-binding properties of the Hyperthermus butylicus protein Cbp2Hb were characterized and its three-dimensional structure was determined by NMR spectroscopy. The two repeats generate helix-turn-helix structures separated by a basic linker that is implicated in facilitating high affinity DNA binding of Cbp2 by tethering the two domains. Structural studies on mutant proteins provide support for Cys7 and Cys28 enhancing high thermal stability of Cbp2Hb through disulphide bridge formation. Consistent with their proposed CRISPR transcriptional regulatory role, Cbp2Hb and, by inference, other Cbp1 and Cbp2 proteins are closely related in structure to homeodomain proteins with linked helix-turn-helix (HTH) domains, in particular the paired domain Pax and Myb family proteins that are involved in eukaryal transcriptional regulation. PMID:23325851
Structure of chromatin and the linking number of DNA.
Worcel, A; Strogatz, S; Riley, D
1981-01-01
Recent observations suggest that the basic supranucleosomal structure of chromatin is a zigzag helical ribbon with a repeat unit made of two nucleosomes connected by a relaxed spacer DNA. A remarkable feature of one particular ribbon is that it solves the apparent paradox between the number of DNA turns per nucleosome and the total linking number of a nucleosome-containing closed circular DNA molecule. We show here that the repeat unit of the proposed structure, which contains two nucleosomes with -1 3/4 DNA turns per nucleosome and one spacer crossover per repeat, contributes -2 to the linking number of closed circular DNA. Space-filling models show that the cylindrical 250-A chromatin fiber can be generated by twisting the ribbon. Images PMID:6940168
Birth and death of genes linked to chromosomal inversion
Furuta, Yoshikazu; Kawai, Mikihiko; Yahara, Koji; Takahashi, Noriko; Handa, Naofumi; Tsuru, Takeshi; Oshima, Kenshiro; Yoshida, Masaru; Azuma, Takeshi; Hattori, Masahira; Uchiyama, Ikuo; Kobayashi, Ichizo
2011-01-01
The birth and death of genes is central to adaptive evolution, yet the underlying genome dynamics remain elusive. The availability of closely related complete genome sequences helps to follow changes in gene contents and clarify their relationship to overall genome organization. Helicobacter pylori, bacteria in our stomach, are known for their extreme genome plasticity through mutation and recombination and will make a good target for such an analysis. In comparing their complete genome sequences, we found that gain and loss of genes (loci) for outer membrane proteins, which mediate host interaction, occurred at breakpoints of chromosomal inversions. Sequence comparison there revealed a unique mechanism of DNA duplication: DNA duplication associated with inversion. In this process, a DNA segment at one chromosomal locus is copied and inserted, in an inverted orientation, into a distant locus on the same chromosome, while the entire region between these two loci is also inverted. Recognition of this and three more inversion modes, which occur through reciprocal recombination between long or short sequence similarity or adjacent to a mobile element, allowed reconstruction of synteny evolution through inversion events in this species. These results will guide the interpretation of extensive DNA sequencing results for understanding long- and short-term genome evolution in various organisms and in cancer cells. PMID:21212362
Disease-associated repeat instability and mismatch repair.
Schmidt, Monika H M; Pearson, Christopher E
2016-02-01
Expanded tandem repeat sequences in DNA are associated with at least 40 human genetic neurological, neurodegenerative, and neuromuscular diseases. Repeat expansion can occur during parent-to-offspring transmission, and arise at variable rates in specific tissues throughout the life of an affected individual. Since the ongoing somatic repeat expansions can affect disease age-of-onset, severity, and progression, targeting somatic expansion holds potential as a therapeutic target. Thus, understanding the factors that regulate this mutation is crucial. DNA repair, in particular mismatch repair (MMR), is the major driving force of disease-associated repeat expansions. In contrast to its anti-mutagenic roles, mammalian MMR curiously drives the expansion mutations of disease-associated (CAG)·(CTG) repeats. Recent advances have broadened our knowledge of both the MMR proteins involved in disease repeat expansions, including: MSH2, MSH3, MSH6, MLH1, PMS2, and MLH3, as well as the types of repeats affected by MMR, now including: (CAG)·(CTG), (CGG)·(CCG), and (GAA)·(TTC) repeats. Mutagenic slipped-DNA structures have been detected in patient tissues, and the size of the slip-out and their junction conformation can determine the involvement of MMR. Furthermore, the formation of other unusual DNA and R-loop structures is proposed to play a key role in MMR-mediated instability. A complex correlation is emerging between tissues showing varying amounts of repeat instability and MMR expression levels. Notably, naturally occurring polymorphic variants of DNA repair genes can have dramatic effects upon the levels of repeat instability, which may explain the variation in disease age-of-onset, progression and severity. An increasing grasp of these factors holds prognostic and therapeutic potential. Copyright © 2015 Elsevier B.V. All rights reserved.
Marzo, Mar; Puig, Marta; Ruiz, Alfredo
2008-02-26
Galileo is the only transposable element (TE) known to have generated natural chromosomal inversions in the genus Drosophila. It was discovered in Drosophila buzzatii and classified as a Foldback-like element because of its long, internally repetitive, terminal inverted repeats (TIRs) and lack of coding capacity. Here, we characterized a seemingly complete copy of Galileo from the D. buzzatii genome. It is 5,406 bp long, possesses 1,229-bp TIRs, and encodes a 912-aa transposase similar to those of the Drosophila melanogaster 1360 (Hoppel) and P elements. We also searched the recently available genome sequences of 12 Drosophila species for elements similar to Dbuz\\Galileo by using bioinformatic tools. Galileo was found in six species (ananassae, willistoni, peudoobscura, persimilis, virilis, and mojavensis) from the two main lineages within the Drosophila genus. Our observations place Galileo within the P superfamily of cut-and-paste transposons and extend considerably its phylogenetic distribution. The interspecific distribution of Galileo indicates an ancient presence in the genus, but the phylogenetic tree built with the transposase amino acid sequences contrasts significantly with that of the species, indicating lineage sorting and/or horizontal transfer events. Our results also suggest that Foldback-like elements such as Galileo may evolve from DNA-based transposon ancestors by loss of the transposase gene and disproportionate elongation of TIRs.
Marzo, Mar; Puig, Marta; Ruiz, Alfredo
2008-01-01
Galileo is the only transposable element (TE) known to have generated natural chromosomal inversions in the genus Drosophila. It was discovered in Drosophila buzzatii and classified as a Foldback-like element because of its long, internally repetitive, terminal inverted repeats (TIRs) and lack of coding capacity. Here, we characterized a seemingly complete copy of Galileo from the D. buzzatii genome. It is 5,406 bp long, possesses 1,229-bp TIRs, and encodes a 912-aa transposase similar to those of the Drosophila melanogaster 1360 (Hoppel) and P elements. We also searched the recently available genome sequences of 12 Drosophila species for elements similar to Dbuz\\Galileo by using bioinformatic tools. Galileo was found in six species (ananassae, willistoni, peudoobscura, persimilis, virilis, and mojavensis) from the two main lineages within the Drosophila genus. Our observations place Galileo within the P superfamily of cut-and-paste transposons and extend considerably its phylogenetic distribution. The interspecific distribution of Galileo indicates an ancient presence in the genus, but the phylogenetic tree built with the transposase amino acid sequences contrasts significantly with that of the species, indicating lineage sorting and/or horizontal transfer events. Our results also suggest that Foldback-like elements such as Galileo may evolve from DNA-based transposon ancestors by loss of the transposase gene and disproportionate elongation of TIRs. PMID:18287066
M.N. lslam-Faridi; C.D. Nelson; S.P. DiFazio; L.E. Gunter; G.A. Tuskan
2009-01-01
The 185-285 rDNA and 55 rDNA loci in Populus trichocarpa were localized using fluorescent in situ hybridization (FISH). Two 185-285 rDNA sites and one 55 rDNA site were identified and located at the ends of 3 different chromosomes. FISH signals from the Arabidopsis-type telomere repeat sequence were observed at the distal ends of each chromosome. Six BAC clones...
The nucleoid protein Dps binds genomic DNA of Escherichia coli in a non-random manner
Kondrashov, F. A.; Toshchakov, S. V.; Dominova, I.; Shvyreva, U. S.; Vrublevskaya, V. V.; Morenkov, O. S.; Panyukov, V. V.
2017-01-01
Dps is a multifunctional homododecameric protein that oxidizes Fe2+ ions accumulating them in the form of Fe2O3 within its protein cavity, interacts with DNA tightly condensing bacterial nucleoid upon starvation and performs some other functions. During the last two decades from discovery of this protein, its ferroxidase activity became rather well studied, but the mechanism of Dps interaction with DNA still remains enigmatic. The crucial role of lysine residues in the unstructured N-terminal tails led to the conventional point of view that Dps binds DNA without sequence or structural specificity. However, deletion of dps changed the profile of proteins in starved cells, SELEX screen revealed genomic regions preferentially bound in vitro and certain affinity of Dps for artificial branched molecules was detected by atomic force microscopy. Here we report a non-random distribution of Dps binding sites across the bacterial chromosome in exponentially growing cells and show their enrichment with inverted repeats prone to form secondary structures. We found that the Dps-bound regions overlap with sites occupied by other nucleoid proteins, and contain overrepresented motifs typical for their consensus sequences. Of the two types of genomic domains with extensive protein occupancy, which can be highly expressed or transcriptionally silent only those that are enriched with RNA polymerase molecules were preferentially occupied by Dps. In the dps-null mutant we, therefore, observed a differentially altered expression of several targeted genes and found suppressed transcription from the dps promoter. In most cases this can be explained by the relieved interference with Dps for nucleoid proteins exploiting sequence-specific modes of DNA binding. Thus, protecting bacterial cells from different stresses during exponential growth, Dps can modulate transcriptional integrity of the bacterial chromosome hampering RNA biosynthesis from some genes via competition with RNA polymerase or, vice versa, competing with inhibitors to activate transcription. PMID:28800583
Tajaddod, Mansoureh; Tanzer, Andrea; Licht, Konstantin; Wolfinger, Michael T; Badelt, Stefan; Huber, Florian; Pusch, Oliver; Schopoff, Sandy; Janisiw, Michael; Hofacker, Ivo; Jantsch, Michael F
2016-10-25
Short interspersed elements (SINEs) represent the most abundant group of non-long-terminal repeat transposable elements in mammalian genomes. In primates, Alu elements are the most prominent and homogenous representatives of SINEs. Due to their frequent insertion within or close to coding regions, SINEs have been suggested to play a crucial role during genome evolution. Moreover, Alu elements within mRNAs have also been reported to control gene expression at different levels. Here, we undertake a genome-wide analysis of insertion patterns of human Alus within transcribed portions of the genome. Multiple, nearby insertions of SINEs within one transcript are more abundant in tandem orientation than in inverted orientation. Indeed, analysis of transcriptome-wide expression levels of 15 ENCODE cell lines suggests a cis-repressive effect of inverted Alu elements on gene expression. Using reporter assays, we show that the negative effect of inverted SINEs on gene expression is independent of known sensors of double-stranded RNAs. Instead, transcriptional elongation seems impaired, leading to reduced mRNA levels. Our study suggests that there is a bias against multiple SINE insertions that can promote intramolecular base pairing within a transcript. Moreover, at a genome-wide level, mRNAs harboring inverted SINEs are less expressed than mRNAs harboring single or tandemly arranged SINEs. Finally, we demonstrate a novel mechanism by which inverted SINEs can impact on gene expression by interfering with RNA polymerase II.
DNA replication stress restricts ribosomal DNA copy number.
Salim, Devika; Bradford, William D; Freeland, Amy; Cady, Gillian; Wang, Jianmin; Pruitt, Steven C; Gerton, Jennifer L
2017-09-01
Ribosomal RNAs (rRNAs) in budding yeast are encoded by ~100-200 repeats of a 9.1kb sequence arranged in tandem on chromosome XII, the ribosomal DNA (rDNA) locus. Copy number of rDNA repeat units in eukaryotic cells is maintained far in excess of the requirement for ribosome biogenesis. Despite the importance of the repeats for both ribosomal and non-ribosomal functions, it is currently not known how "normal" copy number is determined or maintained. To identify essential genes involved in the maintenance of rDNA copy number, we developed a droplet digital PCR based assay to measure rDNA copy number in yeast and used it to screen a yeast conditional temperature-sensitive mutant collection of essential genes. Our screen revealed that low rDNA copy number is associated with compromised DNA replication. Further, subculturing yeast under two separate conditions of DNA replication stress selected for a contraction of the rDNA array independent of the replication fork blocking protein, Fob1. Interestingly, cells with a contracted array grew better than their counterparts with normal copy number under conditions of DNA replication stress. Our data indicate that DNA replication stresses select for a smaller rDNA array. We speculate that this liberates scarce replication factors for use by the rest of the genome, which in turn helps cells complete DNA replication and continue to propagate. Interestingly, tumors from mini chromosome maintenance 2 (MCM2)-deficient mice also show a loss of rDNA repeats. Our data suggest that a reduction in rDNA copy number may indicate a history of DNA replication stress, and that rDNA array size could serve as a diagnostic marker for replication stress. Taken together, these data begin to suggest the selective pressures that combine to yield a "normal" rDNA copy number.
Gladyshev, Eugene; Kleckner, Nancy
2017-01-01
Eukaryotic genomes contain substantial amounts of repetitive DNA organized in the form of constitutive heterochromatin and associated with repressive epigenetic modifications, such as H3K9me3 and C5-cytosine methylation (5mC). In the fungus Neurospora crassa, H3K9me3 and 5mC are catalyzed, respectively, by a conserved SUV39 histone methyltransferase DIM-5 and a DNMT1-like cytosine methyltransferase DIM-2. Here we show that DIM-2 can also mediate Repeat-Induced Point mutation (RIP) of repetitive DNA in N. crassa. We further show that DIM-2-dependent RIP requires DIM-5, HP1, and other known heterochromatin factors, implying the role of a repeat-induced heterochromatin-related process. Our previous findings suggest that the mechanism of repeat recognition for RIP involves direct interactions between homologous double-stranded (ds) DNA segments. We thus now propose that, in somatic cells, homologous dsDNA/dsDNA interactions between a small number of repeat copies can nucleate a transient heterochromatic state, which, on longer repeat arrays, may lead to the formation of constitutive heterochromatin. PMID:28459455
Faraldo-Gómez, José D.
2017-01-01
The membrane transporter anion exchanger 1 (AE1), or band 3, is a key component in the processes of carbon-dioxide transport in the blood and urinary acidification in the renal collecting duct. In both erythrocytes and the basolateral membrane of the collecting-duct α-intercalated cells, the role of AE1 is to catalyze a one-for-one exchange of chloride for bicarbonate. After decades of biochemical and functional studies, the structure of the transmembrane region of AE1, which catalyzes the anion-exchange reaction, has finally been determined. Each protomer of the AE1 dimer comprises two repeats with inverted transmembrane topologies, but the structures of these repeats differ. This asymmetry causes the putative substrate-binding site to be exposed only to the extracellular space, consistent with the expectation that anion exchange occurs via an alternating-access mechanism. Here, we hypothesize that the unknown, inward-facing conformation results from inversion of this asymmetry, and we propose a model of this state constructed using repeat-swap homology modeling. By comparing this inward-facing model with the outward-facing experimental structure, we predict that the mechanism of AE1 involves an elevator-like motion of the substrate-binding domain relative to the nearly stationary dimerization domain and to the membrane plane. This hypothesis is in qualitative agreement with a wide range of biochemical and functional data, which we review in detail, and suggests new avenues of experimentation. PMID:29167180
USDA-ARS?s Scientific Manuscript database
Creeping bentgrass (Agrostis stolonifera L.) is an important species to the turfgrass industry because of its adaptation for use in high quality turf stands such as golf course putting greens, tees, and fairways. A. stolonifera is a highly outcrossing allotetraploid making genetic marker developmen...
Molecular epidemiology of infectious laryngotracheitis: a review
USDA-ARS?s Scientific Manuscript database
Falconid herpesvirus type 1 (FHV-1) is the causative agent of falcon inclusion body disease, an acute, highly contagious disease of raptors. The complete nucleotide sequence of the genome of FHV-1 has been determined. The genome is arranged as a D-type genome with large inverted repeats flanking a ...
Evolutional dynamics of 45S and 5S ribosomal DNA in ancient allohexaploid Atropa belladonna.
Volkov, Roman A; Panchuk, Irina I; Borisjuk, Nikolai V; Hosiawa-Baranska, Marta; Maluszynska, Jolanta; Hemleben, Vera
2017-01-23
Polyploid hybrids represent a rich natural resource to study molecular evolution of plant genes and genomes. Here, we applied a combination of karyological and molecular methods to investigate chromosomal structure, molecular organization and evolution of ribosomal DNA (rDNA) in nightshade, Atropa belladonna (fam. Solanaceae), one of the oldest known allohexaploids among flowering plants. Because of their abundance and specific molecular organization (evolutionarily conserved coding regions linked to variable intergenic spacers, IGS), 45S and 5S rDNA are widely used in plant taxonomic and evolutionary studies. Molecular cloning and nucleotide sequencing of A. belladonna 45S rDNA repeats revealed a general structure characteristic of other Solanaceae species, and a very high sequence similarity of two length variants, with the only difference in number of short IGS subrepeats. These results combined with the detection of three pairs of 45S rDNA loci on separate chromosomes, presumably inherited from both tetraploid and diploid ancestor species, example intensive sequence homogenization that led to substitution/elimination of rDNA repeats of one parent. Chromosome silver-staining revealed that only four out of six 45S rDNA sites are frequently transcriptionally active, demonstrating nucleolar dominance. For 5S rDNA, three size variants of repeats were detected, with the major class represented by repeats containing all functional IGS elements required for transcription, the intermediate size repeats containing partially deleted IGS sequences, and the short 5S repeats containing severe defects both in the IGS and coding sequences. While shorter variants demonstrate increased rate of based substitution, probably in their transition into pseudogenes, the functional 5S rDNA variants are nearly identical at the sequence level, pointing to their origin from a single parental species. Localization of the 5S rDNA genes on two chromosome pairs further supports uniparental inheritance from the tetraploid progenitor. The obtained molecular, cytogenetic and phylogenetic data demonstrate complex evolutionary dynamics of rDNA loci in allohexaploid species of Atropa belladonna. The high level of sequence unification revealed in 45S and 5S rDNA loci of this ancient hybrid species have been seemingly achieved by different molecular mechanisms.
Mlinarec, Jelena; Chester, Mike; Siljak-Yakovlev, Sonja; Papes, Drazena; Leitch, Andrew R; Besendorfer, Visnja
2009-01-01
The structure, abundance and location of repetitive DNA sequences on chromosomes can characterize the nature of higher plant genomes. Here we report on three new repeat DNA families isolated from Anemone hortensis L.; (i) AhTR1, a family of satellite DNA (stDNA) composed of a 554-561 bp long EcoRV monomer; (ii) AhTR2, a stDNA family composed of a 743 bp long HindIII monomer and; (iii) AhDR, a repeat family composed of a 945 bp long HindIII fragment that exhibits some sequence similarity to Ty3/gypsy-like retroelements. Fluorescence in-situ hybridization (FISH) to metaphase chromosomes of A. hortensis (2n = 16) revealed that both AhTR1 and AhTR2 sequences co-localized with DAPI-positive AT-rich heterochromatic regions. AhTR1 sequences occur at intercalary DAPI bands while AhTR2 sequences occur at 8-10 terminally located heterochromatic blocks. In contrast AhDR sequences are dispersed over all chromosomes as expected of a Ty3/gypsy-like element. AhTR2 and AhTR1 repeat families include polyA- and polyT-tracks, AT/TA-motifs and a pentanucleotide sequence (CAAAA) that may have consequences for chromatin packing and sequence homogeneity. AhTR2 repeats also contain TTTAGGG motifs and degenerate variants. We suggest that they arose by interspersion of telomeric repeats with subtelomeric repeats, before hybrid unit(s) amplified through the heterochromatic domain. The three repetitive DNA families together occupy approximately 10% of the A. hortensis genome. Comparative analyses of eight Anemone species revealed that the divergence of the A. hortensis genome was accompanied by considerable modification and/or amplification of repeats.
Lim, K Yoong; Kovarik, Ales; Matyasek, Roman; Chase, Mark W; Knapp, Sandra; McCarthy, Elizabeth; Clarkson, James J; Leitch, Andrew R
2006-12-01
Combining phylogenetic reconstructions of species relationships with comparative genomic approaches is a powerful way to decipher evolutionary events associated with genome divergence. Here, we reconstruct the history of karyotype and tandem repeat evolution in species of diploid Nicotiana section Alatae. By analysis of plastid DNA, we resolved two clades with high bootstrap support, one containing N. alata, N. langsdorffii, N. forgetiana and N. bonariensis (called the n = 9 group) and another containing N. plumbaginifolia and N. longiflora (called the n = 10 group). Despite little plastid DNA sequence divergence, we observed, via fluorescent in situ hybridization, substantial chromosomal repatterning, including altered chromosome numbers, structure and distribution of repeats. Effort was focussed on 35S and 5S nuclear ribosomal DNA (rDNA) and the HRS60 satellite family of tandem repeats comprising the elements HRS60, NP3R and NP4R. We compared divergence of these repeats in diploids and polyploids of Nicotiana. There are dramatic shifts in the distribution of the satellite repeats and complete replacement of intergenic spacers (IGSs) of 35S rDNA associated with divergence of the species in section Alatae. We suggest that sequence homogenization has replaced HRS60 family repeats at sub-telomeric regions, but that this process may not occur, or occurs more slowly, when the repeats are found at intercalary locations. Sequence homogenization acts more rapidly (at least two orders of magnitude) on 35S rDNA than 5S rDNA and sub-telomeric satellite sequences. This rapid rate of divergence is analogous to that found in polyploid species, and is therefore, in plants, not only associated with polyploidy.
Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus.
Lakshmikumaran, M S; D'Ambrosio, E; Laimins, L A; Lin, D T; Furano, A V
1985-09-01
The insulin 1, but not the insulin 2, locus is polymorphic (i.e., exhibits allelic variation) in rats. Restriction enzyme analysis and hybridization studies showed that the polymorphic region is 2.2 kilobases upstream of the insulin 1 coding region and is due to the presence or absence of an approximately 2.7-kilobase repeated DNA element. DNA sequence determination showed that this DNA element is a member of a long interspersed repeated DNA family (LINE) that is highly repeated (greater than 50,000 copies) and highly transcribed in the rat. Although the presence or absence of LINE sequences at the insulin 1 locus occurs in both the homozygous and heterozygous states, LINE-containing insulin 1 alleles are more prevalent in the rat population than are alleles without LINEs. Restriction enzyme analysis of the LINE-containing alleles indicated that at least two versions of the LINE sequence may be present at the insulin 1 locus in different rats. Either repeated transposition of LINE sequences or gene conversion between the resident insulin 1 LINE and other sequences in the genome are possible explanations for this.
A novel site-specific recombination system derived from bacteriophage phiMR11.
Rashel, Mohammad; Uchiyama, Jumpei; Ujihara, Takako; Takemura, Iyo; Hoshiba, Hiroshi; Matsuzaki, Shigenobu
2008-04-04
We report identification of a novel site-specific DNA recombination system that functions in both in vivo and in vitro, derived from lysogenic Staphylococcus aureus phage phiMR11. In silico analysis of the phiMR11 genome indicated orf1 as a putative integrase gene. Phage and bacterial attachment sites (attP and attB, respectively) and attachment junctions were determined and their nucleotide sequences decoded. Sequences of attP and attB were mostly different to each other except for a two bp common core that was the crossover point. We found several inverted repeats adjacent to the core sequence of attP as potential protein binding sites. The precise and efficient integration properties of phiMR11 integrase were shown on attP and attB in Escherichia coli and the minimum size of attP was found to be 34bp. In in vitro assays using crude or purified integrase, only buffer and substrate DNAs were required for the recombination reaction, indicating that other bacterially encoded factors are not essential for activity.
Egan, Muireann; O'Connell Motherway, Mary; van Sinderen, Douwe
2015-02-01
Bifidobacterium breve strains are numerically prevalent among the gut microbiota of healthy, breast-fed infants. The metabolism of sialic acid, a ubiquitous monosaccharide in the infant and adult gut, by B. breve UCC2003 is dependent on a large gene cluster, designated the nan/nag cluster. This study describes the transcriptional regulation of the nan/nag cluster and thus sialic acid metabolism in B. breve UCC2003. Insertion mutagenesis and transcriptome analysis revealed that the nan/nag cluster is regulated by a GntR family transcriptional repressor, designated NanR. Crude cell extract of Escherichia coli EC101 in which the nanR gene had been cloned and overexpressed was shown to bind to two promoter regions within this cluster, each of which containing an imperfect inverted repeat that is believed to act as the NanR operator sequence. Formation of the DNA-NanR complex is prevented in the presence of sialic acid, which we had previously shown to induce transcription of this gene cluster. © FEMS 2014. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Saathoff, Aaron J.; Sarath, Gautam; Chow, Elaine K.; Dien, Bruce S.; Tobias, Christian M.
2011-01-01
Cinnamyl alcohol dehydrogenase (CAD) catalyzes the last step in monolignol biosynthesis and genetic evidence indicates CAD deficiency in grasses both decreases overall lignin, alters lignin structure and increases enzymatic recovery of sugars. To ascertain the effect of CAD downregulation in switchgrass, RNA mediated silencing of CAD was induced through Agrobacterium mediated transformation of cv. “Alamo” with an inverted repeat construct containing a fragment derived from the coding sequence of PviCAD2. The resulting primary transformants accumulated less CAD RNA transcript and protein than control transformants and were demonstrated to be stably transformed with between 1 and 5 copies of the T-DNA. CAD activity against coniferaldehyde, and sinapaldehyde in stems of silenced lines was significantly reduced as was overall lignin and cutin. Glucose release from ground samples pretreated with ammonium hydroxide and digested with cellulases was greater than in control transformants. When stained with the lignin and cutin specific stain phloroglucinol-HCl the staining intensity of one line indicated greater incorporation of hydroxycinnamyl aldehydes in the lignin. PMID:21298014
Saathoff, Aaron J; Sarath, Gautam; Chow, Elaine K; Dien, Bruce S; Tobias, Christian M
2011-01-27
Cinnamyl alcohol dehydrogenase (CAD) catalyzes the last step in monolignol biosynthesis and genetic evidence indicates CAD deficiency in grasses both decreases overall lignin, alters lignin structure and increases enzymatic recovery of sugars. To ascertain the effect of CAD downregulation in switchgrass, RNA mediated silencing of CAD was induced through Agrobacterium mediated transformation of cv. "Alamo" with an inverted repeat construct containing a fragment derived from the coding sequence of PviCAD2. The resulting primary transformants accumulated less CAD RNA transcript and protein than control transformants and were demonstrated to be stably transformed with between 1 and 5 copies of the T-DNA. CAD activity against coniferaldehyde, and sinapaldehyde in stems of silenced lines was significantly reduced as was overall lignin and cutin. Glucose release from ground samples pretreated with ammonium hydroxide and digested with cellulases was greater than in control transformants. When stained with the lignin and cutin specific stain phloroglucinol-HCl the staining intensity of one line indicated greater incorporation of hydroxycinnamyl aldehydes in the lignin.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bolognese, F.; Di Lecce, C.; Galli, E.
The arrangement of the genes involved in o-xylene, m-xylene, and p-xylene catabolism was investigated in three Pseudomonas stutzeri strains: the wild-type strain OX1, which is able to grow on o-xylene but not on the meta and para isomers; the mutant M1, which grows on m-xylene and p-xylene but is unable to utilize the ortho isomer; and the revertant R1, which can utilize all the three isomers of xylene. A 3-kb insertion sequence (IS) termed ISPs1, which inactivates the m-xylene and p-xylene catabolic pathway in P. stutzeri OX1 and the o-xylene catabolic genes in P. stutzeri M1, was detected. No ISmore » was detected in the corresponding catabolic regions of the P. stutzeri R1 genome. ISPs1 is present in several copies in the genomes of the three strains. It is flanked by 24-bp imperfect inverted repeats, causes the direct duplication of 8 bp in the target DNA, and seems to be related to the ISL3 family.« less
Laser mass spectrometry for DNA fingerprinting for forensic applications
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chen, C.H.; Tang, K.; Taranenko, N.I.
The application of DNA fingerprinting has become very broad in forensic analysis, patient identification, diagnostic medicine, and wildlife poaching, since every individual`s DNA structure is identical within all tissues of their body. DNA fingerprinting was initiated by the use of restriction fragment length polymorphisms (RFLP). In 1987, Nakamura et al. found that a variable number of tandem repeats (VNTR) often occurred in the alleles. The probability of different individuals having the same number of tandem repeats in several different alleles is very low. Thus, the identification of VNTR from genomic DNA became a very reliable method for identification of individuals.more » DNA fingerprinting is a reliable tool for forensic analysis. In DNA fingerprinting, knowledge of the sequence of tandem repeats and restriction endonuclease sites can provide the basis for identification. The major steps for conventional DNA fingerprinting include (1) specimen processing (2) amplification of selected DNA segments by PCR, and (3) gel electrophoresis to do the final DNA analysis. In this work we propose to use laser desorption mass spectrometry for fast DNA fingerprinting. The process and advantages are discussed.« less
Kinnevey, Peter M.; Shore, Anna C.; Brennan, Grainne I.; Sullivan, Derek J.; Ehricht, Ralf; Monecke, Stefan; Slickers, Peter
2013-01-01
Methicillin-resistant Staphylococcus aureus (MRSA) has been a major cause of nosocomial infection in Irish hospitals for 4 decades, and replacement of predominant MRSA clones has occurred several times. An MRSA isolate recovered in 2006 as part of a larger study of sporadic MRSA exhibited a rare spa (t878) and multilocus sequence (ST779) type and was nontypeable by PCR- and DNA microarray-based staphylococcal cassette chromosome mec (SCCmec) element typing. Whole-genome sequencing revealed the presence of a novel 51-kb composite island (CI) element with three distinct domains, each flanked by direct repeat and inverted repeat sequences, including (i) a pseudo SCCmec element (16.3 kb) carrying mecA with a novel mec class region, a fusidic acid resistance gene (fusC), and two copper resistance genes (copB and copC) but lacking ccr genes; (ii) an SCC element (17.5 kb) carrying a novel ccrAB4 allele; and (iii) an SCC element (17.4 kb) carrying a novel ccrC allele and a clustered regularly interspaced short palindromic repeat (CRISPR) region. The novel CI was subsequently identified by PCR in an additional 13 t878/ST779 MRSA isolates, six from bloodstream infections, recovered between 2006 and 2011 in 11 hospitals. Analysis of open reading frames (ORFs) carried by the CI showed amino acid sequence similarity of 44 to 100% to ORFs from S. aureus and coagulase-negative staphylococci (CoNS). These findings provide further evidence of genetic transfer between S. aureus and CoNS and show how this contributes to the emergence of novel SCCmec elements and MRSA strains. Ongoing surveillance of this MRSA strain is warranted and will require updating of currently used SCCmec typing methods. PMID:23147725
Structural and biophysical properties of h-FANCI ARM repeat protein.
Siddiqui, Mohd Quadir; Choudhary, Rajan Kumar; Thapa, Pankaj; Kulkarni, Neha; Rajpurohit, Yogendra S; Misra, Hari S; Gadewal, Nikhil; Kumar, Satish; Hasan, Syed K; Varma, Ashok K
2017-11-01
Fanconi anemia complementation groups - I (FANCI) protein facilitates DNA ICL (Inter-Cross-link) repair and plays a crucial role in genomic integrity. FANCI is a 1328 amino acids protein which contains armadillo (ARM) repeats and EDGE motif at the C-terminus. ARM repeats are functionally diverse and evolutionarily conserved domain that plays a pivotal role in protein-protein and protein-DNA interactions. Considering the importance of ARM repeats, we have explored comprehensive in silico and in vitro approach to examine folding pattern. Size exclusion chromatography, dynamic light scattering (DLS) and glutaraldehyde crosslinking studies suggest that FANCI ARM repeat exist as monomer as well as in oligomeric forms. Circular dichroism (CD) and fluorescence spectroscopy results demonstrate that protein has predominantly α- helices and well-folded tertiary structure. DNA binding was analysed using electrophoretic mobility shift assay by autoradiography. Temperature-dependent CD, Fluorescence spectroscopy and DLS studies concluded that protein unfolds and start forming oligomer from 30°C. The existence of stable portion within FANCI ARM repeat was examined using limited proteolysis and mass spectrometry. The normal mode analysis, molecular dynamics and principal component analysis demonstrated that helix-turn-helix (HTH) motif present in ARM repeat is highly dynamic and has anti-correlated motion. Furthermore, FANCI ARM repeat has HTH structural motif which binds to double-stranded DNA.
Environmental stress induces trinucleotide repeat mutagenesis in human cells
Chatterjee, Nimrat; Lin, Yunfu; Santillan, Beatriz A.; Yotnda, Patricia; Wilson, John H.
2015-01-01
The dynamic mutability of microsatellite repeats is implicated in the modification of gene function and disease phenotype. Studies of the enhanced instability of long trinucleotide repeats (TNRs)—the cause of multiple human diseases—have revealed a remarkable complexity of mutagenic mechanisms. Here, we show that cold, heat, hypoxic, and oxidative stresses induce mutagenesis of a long CAG repeat tract in human cells. We show that stress-response factors mediate the stress-induced mutagenesis (SIM) of CAG repeats. We show further that SIM of CAG repeats does not involve mismatch repair, nucleotide excision repair, or transcription, processes that are known to promote TNR mutagenesis in other pathways of instability. Instead, we find that these stresses stimulate DNA rereplication, increasing the proportion of cells with >4 C-value (C) DNA content. Knockdown of the replication origin-licensing factor CDT1 eliminates both stress-induced rereplication and CAG repeat mutagenesis. In addition, direct induction of rereplication in the absence of stress also increases the proportion of cells with >4C DNA content and promotes repeat mutagenesis. Thus, environmental stress triggers a unique pathway for TNR mutagenesis that likely is mediated by DNA rereplication. This pathway may impact normal cells as they encounter stresses in their environment or during development or abnormal cells as they evolve metastatic potential. PMID:25775519
Environmental stress induces trinucleotide repeat mutagenesis in human cells.
Chatterjee, Nimrat; Lin, Yunfu; Santillan, Beatriz A; Yotnda, Patricia; Wilson, John H
2015-03-24
The dynamic mutability of microsatellite repeats is implicated in the modification of gene function and disease phenotype. Studies of the enhanced instability of long trinucleotide repeats (TNRs)-the cause of multiple human diseases-have revealed a remarkable complexity of mutagenic mechanisms. Here, we show that cold, heat, hypoxic, and oxidative stresses induce mutagenesis of a long CAG repeat tract in human cells. We show that stress-response factors mediate the stress-induced mutagenesis (SIM) of CAG repeats. We show further that SIM of CAG repeats does not involve mismatch repair, nucleotide excision repair, or transcription, processes that are known to promote TNR mutagenesis in other pathways of instability. Instead, we find that these stresses stimulate DNA rereplication, increasing the proportion of cells with >4 C-value (C) DNA content. Knockdown of the replication origin-licensing factor CDT1 eliminates both stress-induced rereplication and CAG repeat mutagenesis. In addition, direct induction of rereplication in the absence of stress also increases the proportion of cells with >4C DNA content and promotes repeat mutagenesis. Thus, environmental stress triggers a unique pathway for TNR mutagenesis that likely is mediated by DNA rereplication. This pathway may impact normal cells as they encounter stresses in their environment or during development or abnormal cells as they evolve metastatic potential.
Sunflower centromeres consist of a centromere-specific LINE and a chromosome-specific tandem repeat.
Nagaki, Kiyotaka; Tanaka, Keisuke; Yamaji, Naoki; Kobayashi, Hisato; Murata, Minoru
2015-01-01
The kinetochore is a protein complex including kinetochore-specific proteins that plays a role in chromatid segregation during mitosis and meiosis. The complex associates with centromeric DNA sequences that are usually species-specific. In plant species, tandem repeats including satellite DNA sequences and retrotransposons have been reported as centromeric DNA sequences. In this study on sunflowers, a cDNA-encoding centromere-specific histone H3 (CENH3) was isolated from a cDNA pool from a seedling, and an antibody was raised against a peptide synthesized from the deduced cDNA. The antibody specifically recognized the sunflower CENH3 (HaCENH3) and showed centromeric signals by immunostaining and immunohistochemical staining analysis. The antibody was also applied in chromatin immunoprecipitation (ChIP)-Seq to isolate centromeric DNA sequences and two different types of repetitive DNA sequences were identified. One was a long interspersed nuclear element (LINE)-like sequence, which showed centromere-specific signals on almost all chromosomes in sunflowers. This is the first report of a centromeric LINE sequence, suggesting possible centromere targeting ability. Another type of identified repetitive DNA was a tandem repeat sequence with a 187-bp unit that was found only on a pair of chromosomes. The HaCENH3 content of the tandem repeats was estimated to be much higher than that of the LINE, which implies centromere evolution from LINE-based centromeres to more stable tandem-repeat-based centromeres. In addition, the epigenetic status of the sunflower centromeres was investigated by immunohistochemical staining and ChIP, and it was found that centromeres were heterochromatic.
Evidence of birth-and-death evolution of 5S rRNA gene in Channa species (Teleostei, Perciformes).
Barman, Anindya Sundar; Singh, Mamta; Singh, Rajeev Kumar; Lal, Kuldeep Kumar
2016-12-01
In higher eukaryotes, minor rDNA family codes for 5S rRNA that is arranged in tandem arrays and comprises of a highly conserved 120 bp long coding sequence with a variable non-transcribed spacer (NTS). Initially the 5S rDNA repeats are considered to be evolved by the process of concerted evolution. But some recent reports, including teleost fishes suggested that evolution of 5S rDNA repeat does not fit into the concerted evolution model and evolution of 5S rDNA family may be explained by a birth-and-death evolution model. In order to study the mode of evolution of 5S rDNA repeats in Perciformes fish species, nucleotide sequence and molecular organization of five species of genus Channa were analyzed in the present study. Molecular analyses revealed several variants of 5S rDNA repeats (four types of NTS) and networks created by a neighbor net algorithm for each type of sequences (I, II, III and IV) did not show a clear clustering in species specific manner. The stable secondary structure is predicted and upstream and downstream conserved regulatory elements were characterized. Sequence analyses also shown the presence of two putative pseudogenes in Channa marulius. Present study supported that 5S rDNA repeats in genus Channa were evolved under the process of birth-and-death.
Bio-recognitive photonics of a DNA-guided organic semiconductor.
Back, Seung Hyuk; Park, Jin Hyuk; Cui, Chunzhi; Ahn, Dong June
2016-01-04
Incorporation of duplex DNA with higher molecular weights has attracted attention for a new opportunity towards a better organic light-emitting diode (OLED) capability. However, biological recognition by OLED materials is yet to be addressed. In this study, specific oligomeric DNA-DNA recognition is successfully achieved by tri (8-hydroxyquinoline) aluminium (Alq3), an organic semiconductor. Alq3 rods crystallized with guidance from single-strand DNA molecules show, strikingly, a unique distribution of the DNA molecules with a shape of an 'inverted' hourglass. The crystal's luminescent intensity is enhanced by 1.6-fold upon recognition of the perfect-matched target DNA sequence, but not in the case of a single-base mismatched one. The DNA-DNA recognition forming double-helix structure is identified to occur only in the rod's outer periphery. This study opens up new opportunities of Alq3, one of the most widely used OLED materials, enabling biological recognition.
Bio-recognitive photonics of a DNA-guided organic semiconductor
NASA Astrophysics Data System (ADS)
Back, Seung Hyuk; Park, Jin Hyuk; Cui, Chunzhi; Ahn, Dong June
2016-01-01
Incorporation of duplex DNA with higher molecular weights has attracted attention for a new opportunity towards a better organic light-emitting diode (OLED) capability. However, biological recognition by OLED materials is yet to be addressed. In this study, specific oligomeric DNA-DNA recognition is successfully achieved by tri (8-hydroxyquinoline) aluminium (Alq3), an organic semiconductor. Alq3 rods crystallized with guidance from single-strand DNA molecules show, strikingly, a unique distribution of the DNA molecules with a shape of an `inverted' hourglass. The crystal's luminescent intensity is enhanced by 1.6-fold upon recognition of the perfect-matched target DNA sequence, but not in the case of a single-base mismatched one. The DNA-DNA recognition forming double-helix structure is identified to occur only in the rod's outer periphery. This study opens up new opportunities of Alq3, one of the most widely used OLED materials, enabling biological recognition.
Kim, Ji Eun; Lee, Min Hee; Cho, Eun Ju; Kim, Ji Hong; Chung, Byung Yeoup; Kim, Jin-Hong
2013-12-01
Ionizing radiation causes various epigenetic changes, as well as a variety of DNA lesions such as strand breaks, cross-links, oxidative damages, etc., in genomes. However, radiation-induced epigenetic changes have rarely been substantiated in plant genomes. The current study investigates whether DNA methylation of Arabidopsis thaliana genome is altered by gamma rays. We found that genomic DNA methylation decreased in wild-type plants with increasing doses of gamma rays (5, 50 and 200 Gy). Irradiation with 200 Gy significantly increased the expression of transcriptionally inactive centromeric 180-bp (CEN) and transcriptionally silent information (TSI) repeats. This increase suggested that there was a substantial release of transcriptional gene silencing by gamma rays, probably by induction of DNA hypomethylation. High expression of the DNA demethylase ROS1 and low expression of the DNA methyltransferase CMT3 supported this hypothesis. Moreover, Southern blot analysis following digestion of genomic DNA with methylation-sensitive enzymes revealed that the DNA hypomethylation occured preferentially at CHG or CHH sites rather than CG sites, depending on the radiation dose. Unlike CEN and TSI repeats, the number of Ta3, AtSN1 and FWA repeats decreased in transcription but increased in non-CG methylation. In addition, the cmt3-11 mutant showed neither DNA hypomethylation nor transcriptional activation of silenced repeats upon gamma irradiation. Furthermore, profiles of genome-wide transcriptomes in response to gamma rays differed between the wild-type and cmt3-11 mutant. These results suggest that gamma irradiation induced DNA hypomethylation preferentially at non-CG sites of transcriptionally inactive repeats in a locus-specific manner, which depends on CMT3 activity.
Brouard, Jean-Simon; Turmel, Monique; Otis, Christian; Lemieux, Claude
2016-01-01
The chloroplast genome sustained extensive changes in architecture during the evolution of the Chlorophyceae, a morphologically and ecologically diverse class of green algae belonging to the Chlorophyta; however, the forces driving these changes are poorly understood. The five orders recognized in the Chlorophyceae form two major clades: the CS clade consisting of the Chlamydomonadales and Sphaeropleales, and the OCC clade consisting of the Oedogoniales, Chaetophorales, and Chaetopeltidales. In the OCC clade, considerable variations in chloroplast DNA (cpDNA) structure, size, gene order, and intron content have been observed. The large inverted repeat (IR), an ancestral feature characteristic of most green plants, is present in Oedogonium cardiacum (Oedogoniales) but is lacking in the examined members of the Chaetophorales and Chaetopeltidales. Remarkably, the Oedogonium 35.5-kb IR houses genes that were putatively acquired through horizontal DNA transfer. To better understand the dynamics of chloroplast genome evolution in the Oedogoniales, we analyzed the cpDNA of a second representative of this order, Oedocladium carolinianum . The Oedocladium cpDNA was sequenced and annotated. The evolutionary distances separating Oedocladium and Oedogonium cpDNAs and two other pairs of chlorophycean cpDNAs were estimated using a 61-gene data set. Phylogenetic analysis of an alignment of group IIA introns from members of the OCC clade was performed. Secondary structures and insertion sites of oedogonialean group IIA introns were analyzed. The 204,438-bp Oedocladium genome is 7.9 kb larger than the Oedogonium genome, but its repertoire of conserved genes is remarkably similar and gene order differs by only one reversal. Although the 23.7-kb IR is missing the putative foreign genes found in Oedogonium , it contains sequences coding for a putative phage or bacterial DNA primase and a hypothetical protein. Intergenic sequences are 1.5-fold longer and dispersed repeats are more abundant, but a smaller fraction of the Oedocladium genome is occupied by introns. Six additional group II introns are present, five of which lack ORFs and carry highly similar sequences to that of the ORF-less IIA intron shared with Oedogonium . Secondary structure analysis of the group IIA introns disclosed marked differences in the exon-binding sites; however, each intron showed perfect or nearly perfect base pairing interactions with its target site. Our results suggest that chloroplast genes rearrange more slowly in the Oedogoniales than in the Chaetophorales and raise questions as to what was the nature of the foreign coding sequences in the IR of the common ancestor of the Oedogoniales. They provide the first evidence for intragenomic proliferation of group IIA introns in the Viridiplantae, revealing that intron spread in the Oedocladium lineage likely occurred by retrohoming after sequence divergence of the exon-binding sites.
Symonová, Radka; Ocalewicz, Konrad; Kirtiklis, Lech; Delmastro, Giovanni Battista; Pelikánová, Šárka; Garcia, Sonia; Kovařík, Aleš
2017-05-18
Pikes represent an important genus (Esox) harbouring a pre-duplication karyotype (2n = 2x = 50) of economically important salmonid pseudopolyploids. Here, we have characterized the 5S ribosomal RNA genes (rDNA) in Esox lucius and its closely related E. cisalpinus using cytogenetic, molecular and genomic approaches. Intragenomic homogeneity and copy number estimation was carried out using Illumina reads. The higher-order structure of rDNA arrays was investigated by the analysis of long PacBio reads. Position of loci on chromosomes was determined by FISH. DNA methylation was analysed by methylation-sensitive restriction enzymes. The 5S rDNA loci occupy exclusively (peri)centromeric regions on 30-38 acrocentric chromosomes in both E. lucius and E. cisalpinus. The large number of loci is accompanied by extreme amplification of genes (>20,000 copies), which is to the best of our knowledge one of the highest copy number of rRNA genes in animals ever reported. Conserved secondary structures of predicted 5S rRNAs indicate that most of the amplified genes are potentially functional. Only few SNPs were found in genic regions indicating their high homogeneity while intergenic spacers were more heterogeneous and several families were identified. Analysis of 10-30 kb-long molecules sequenced by the PacBio technology (containing about 40% of total 5S rDNA) revealed that the vast majority (96%) of genes are organised in large several kilobase-long blocks. Dispersed genes or short tandems were less common (4%). The adjacent 5S blocks were directly linked, separated by intervening DNA and even inverted. The 5S units differing in the intergenic spacers formed both homogeneous and heterogeneous (mixed) blocks indicating variable degree of homogenisation between the loci. Both E. lucius and E. cisalpinus 5S rDNA was heavily methylated at CG dinucleotides. Extreme amplification of 5S rRNA genes in the Esox genome occurred in the absence of significant pseudogenisation suggesting its recent origin and/or intensive homogenisation processes. The dense methylation of units indicates that powerful epigenetic mechanisms have evolved in this group of fish to silence amplified genes. We discuss how the higher-order repeat structures impact on homogenisation of 5S rDNA in the genome.
The Repeat Expansion Diseases: the dark side of DNA repair?
Zhao, Xiao-Nan; Usdin, Karen
2015-01-01
DNA repair normally protects the genome against mutations that threaten genome integrity and thus cell viability. However, growing evidence suggests that in the case of the Repeat Expansion Diseases, disorders that result from an increase in the size of a disease-specific microsatellite, the disease-causing mutation is actually the result of aberrant DNA repair. A variety of proteins from different DNA repair pathways have thus far been implicated in this process. This review will summarize recent findings from patients and from mouse models of these diseases that shed light on how these pathways may interact to cause repeat expansion. PMID:26002199
Kelly, Laura J; Renny-Byfield, Simon; Pellicer, Jaume; Macas, Jiří; Novák, Petr; Neumann, Pavel; Lysak, Martin A; Day, Peter D; Berger, Madeleine; Fay, Michael F; Nichols, Richard A; Leitch, Andrew R; Leitch, Ilia J
2015-10-01
Plants exhibit an extraordinary range of genome sizes, varying by > 2000-fold between the smallest and largest recorded values. In the absence of polyploidy, changes in the amount of repetitive DNA (transposable elements and tandem repeats) are primarily responsible for genome size differences between species. However, there is ongoing debate regarding the relative importance of amplification of repetitive DNA versus its deletion in governing genome size. Using data from 454 sequencing, we analysed the most repetitive fraction of some of the largest known genomes for diploid plant species, from members of Fritillaria. We revealed that genomic expansion has not resulted from the recent massive amplification of just a handful of repeat families, as shown in species with smaller genomes. Instead, the bulk of these immense genomes is composed of highly heterogeneous, relatively low-abundance repeat-derived DNA, supporting a scenario where amplified repeats continually accumulate due to infrequent DNA removal. Our results indicate that a lack of deletion and low turnover of repetitive DNA are major contributors to the evolution of extremely large genomes and show that their size cannot simply be accounted for by the activity of a small number of high-abundance repeat families. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Inverted ILM flap, free ILM flap and conventional ILM peeling for large macular holes.
Velez-Montoya, Raul; Ramirez-Estudillo, J Abel; Sjoholm-Gomez de Liano, Carl; Bejar-Cornejo, Francisco; Sanchez-Ramos, Jorge; Guerrero-Naranjo, Jose Luis; Morales-Canton, Virgilio; Hernandez-Da Mota, Sergio E
2018-01-01
To assess closure rate after a single surgery of large macular holes and their visual recovery in the short term with three different surgical techniques. Prospective multicenter randomized controlled trial. We included treatment-naïve patients with diagnosis of large macular hole (minimum diameter of > 400 µm). All patients underwent a comprehensive ophthalmological examination. Before surgery, the patients were randomized into three groups: group A: conventional internal limiting membrane peeling, group B: inverted-flap technique and group C: free-flap technique. All study measurements were repeated within the period of 1 and 3 months after surgery. Continuous variables were assessed with a Kruskal-Wallis test, change in visual acuity was assessed with analysis of variance for repeated measurements with a Bonferroni correction for statistical significance. Thirty-eight patients were enrolled (group A: 12, group B: 12, group C: 14). The closure rate was in group A and B: 91.6%; 95% CI 61.52-99.79%. In group C: 85.71%; 95% CI 57.19-98.22%. There were no differences in the macular hole closure rate between groups ( p = 0.85). All groups improved ≈ 0.2 logMAR, but only group B reached statistical significance ( p < 0.007). Despite all techniques displayed a trend toward visual improvement, the inverted-flap technique seems to induce a faster and more significant recovery in the short term.
Examining impulse-variability in overarm throwing.
Urbin, M A; Stodden, David; Boros, Rhonda; Shannon, David
2012-01-01
The purpose of this study was to examine variability in overarm throwing velocity and spatial output error at various percentages of maximum to test the prediction of an inverted-U function as predicted by impulse-variability theory and a speed-accuracy trade-off as predicted by Fitts' Law Thirty subjects (16 skilled, 14 unskilled) were instructed to throw a tennis ball at seven percentages of their maximum velocity (40-100%) in random order (9 trials per condition) at a target 30 feet away. Throwing velocity was measured with a radar gun and interpreted as an index of overall systemic power output. Within-subject throwing velocity variability was examined using within-subjects repeated-measures ANOVAs (7 repeated conditions) with built-in polynomial contrasts. Spatial error was analyzed using mixed model regression. Results indicated a quadratic fit with variability in throwing velocity increasing from 40% up to 60%, where it peaked, and then decreasing at each subsequent interval to maximum (p < .001, η2 = .555). There was no linear relationship between speed and accuracy. Overall, these data support the notion of an inverted-U function in overarm throwing velocity variability as both skilled and unskilled subjects approach maximum effort. However, these data do not support the notion of a speed-accuracy trade-off. The consistent demonstration of an inverted-U function associated with systemic power output variability indicates an enhanced capability to regulate aspects of force production and relative timing between segments as individuals approach maximum effort, even in a complex ballistic skill.
Komosa, Martin; Root, Heather; Meyn, M. Stephen
2015-01-01
Current methods for characterizing extrachromosomal nuclear DNA in mammalian cells do not permit single-cell analysis, are often semi-quantitative and frequently biased toward the detection of circular species. To overcome these limitations, we developed Halo-FISH to visualize and quantitatively analyze extrachromosomal DNA in single cells. We demonstrate Halo-FISH by using it to analyze extrachromosomal telomere-repeat (ECTR) in human cells that use the Alternative Lengthening of Telomeres (ALT) pathway(s) to maintain telomere lengths. We find that GM847 and VA13 ALT cells average ∼80 detectable G/C-strand ECTR DNA molecules/nucleus, while U2OS ALT cells average ∼18 molecules/nucleus. In comparison, human primary and telomerase-positive cells contain <5 ECTR DNA molecules/nucleus. ECTR DNA in ALT cells exhibit striking cell-to-cell variations in number (<20 to >300), range widely in length (<1 to >200 kb) and are composed of primarily G- or C-strand telomere-repeat DNA. Halo-FISH enables, for the first time, the simultaneous analysis of ECTR DNA and chromosomal telomeres in a single cell. We find that ECTR DNA comprises ∼15% of telomere-repeat DNA in GM847 and VA13 cells, but <4% in U2OS cells. In addition to its use in ALT cell analysis, Halo-FISH can facilitate the study of a wide variety of extrachromosomal DNA in mammalian cells. PMID:25662602
High-throughput analysis of the satellitome illuminates satellite DNA evolution
NASA Astrophysics Data System (ADS)
Ruiz-Ruano, Francisco J.; López-León, María Dolores; Cabrero, Josefa; Camacho, Juan Pedro M.
2016-07-01
Satellite DNA (satDNA) is a major component yet the great unknown of eukaryote genomes and clearly underrepresented in genome sequencing projects. Here we show the high-throughput analysis of satellite DNA content in the migratory locust by means of the bioinformatic analysis of Illumina reads with the RepeatExplorer and RepeatMasker programs. This unveiled 62 satDNA families and we propose the term “satellitome” for the whole collection of different satDNA families in a genome. The finding that satDNAs were present in many contigs of the migratory locust draft genome indicates that they show many genomic locations invisible by fluorescent in situ hybridization (FISH). The cytological pattern of five satellites showing common descent (belonging to the SF3 superfamily) suggests that non-clustered satDNAs can become into clustered through local amplification at any of the many genomic loci resulting from previous dissemination of short satDNA arrays. The fact that all kinds of satDNA (micro- mini- and satellites) can show the non-clustered and clustered states suggests that all these elements are mostly similar, except for repeat length. Finally, the presence of VNTRs in bacteria, showing similar properties to non-clustered satDNAs in eukaryotes, suggests that this kind of tandem repeats show common properties in all living beings.
Mapping Simple Repeated DNA Sequences in Heterochromatin of Drosophila Melanogaster
Lohe, A. R.; Hilliker, A. J.; Roberts, P. A.
1993-01-01
Heterochromatin in Drosophila has unusual genetic, cytological and molecular properties. Highly repeated DNA sequences (satellites) are the principal component of heterochromatin. Using probes from cloned satellites, we have constructed a chromosome map of 10 highly repeated, simple DNA sequences in heterochromatin of mitotic chromosomes of Drosophila melanogaster. Despite extensive sequence homology among some satellites, chromosomal locations could be distinguished by stringent in situ hybridizations for each satellite. Only two of the localizations previously determined using gradient-purified bulk satellite probes are correct. Eight new satellite localizations are presented, providing a megabase-level chromosome map of one-quarter of the genome. Five major satellites each exhibit a multichromosome distribution, and five minor satellites hybridize to single sites on the Y chromosome. Satellites closely related in sequence are often located near one another on the same chromosome. About 80% of Y chromosome DNA is composed of nine simple repeated sequences, in particular (AAGAC)(n) (8 Mb), (AAGAG)(n) (7 Mb) and (AATAT)(n) (6 Mb). Similarly, more than 70% of the DNA in chromosome 2 heterochromatin is composed of five simple repeated sequences. We have also generated a high resolution map of satellites in chromosome 2 heterochromatin, using a series of translocation chromosomes whose breakpoints in heterochromatin were ordered by N-banding. Finally, staining and banding patterns of heterochromatic regions are correlated with the locations of specific repeated DNA sequences. The basis for the cytochemical heterogeneity in banding appears to depend exclusively on the different satellite DNAs present in heterochromatin. PMID:8375654
[Polymorphic loci and polymorphism analysis of short tandem repeats within XNP gene].
Liu, Qi-Ji; Gong, Yao-Qin; Guo, Chen-Hong; Chen, Bing-Xi; Li, Jiang-Xia; Guo, Yi-Shou
2002-01-01
To select polymorphic short tandem repeat markers within X-linked nuclear protein (XNP) gene, genomic clones which contain XNP gene were recognized by homologous analysis with XNP cDNA. By comparing the cDNA with genomic DNA, non-exonic sequences were identified, and short tandem repeats were selected from non-exonic sequences by using BCM search Launcher. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five short tandem repeats were identified from XNP gene, two of which were polymorphic. Four and 11 alleles were observed in Chinese population for XNPSTR1 and XNPSTR4, respectively. Heterozygosities were 47% for XNPSTR1 and 70% for XNPSTR4. XNPSTR1 and XNPSTR4 localized within 3' end and intron 10, respectively. Two polymorphic short tandem repeats have been identified within XNP gene and will be useful for linkage analysis and gene diagnosis of XNP gene.
Molecular characterization of the complete genome of falconid herpesvirus strain S-18
USDA-ARS?s Scientific Manuscript database
Falconid herpesvirus type 1 (FHV-1) is the causative agent of falcon inclusion body disease, an acute, highly contagious disease of raptors. The complete nucleotide sequence of the genome of FHV-1 has been determined. The genome is arranged as a D-type genome with large inverted repeats flanking a ...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Haberle, Rosemarie C.; Fourcade, Matthew L.; Boore, Jeffrey L.
2006-01-09
Chloroplast genome structure, gene order and content arehighly conserved in land plants. We sequenced the complete chloroplastgenome sequence of Trachelium caeruleum (Campanulaceae) a member of anangiosperm family known for highly rearranged chloroplast genomes. Thetotal genome size is 162,321 bp with an IR of 27,273 bp, LSC of 100,113bp and SSC of 7,661 bp. The genome encodes 115 unique genes, with 19duplicated in the IR, a tRNA (trnI-CAU) duplicated once in the LSC and aprotein coding gene (psbJ) duplicated twice, for a total of 137 genes.Four genes (ycf15, rpl23, infA and accD) are truncated and likelynonfunctional; three others (clpP, ycf1 andmore » ycf2) are so highly divergedthat they may now be pseudogenes. The most conspicuous feature of theTrachelium genome is the presence of eighteen internally unrearrangedblocks of genes that have been inverted or relocated within the genome,relative to the typical gene order of most angiosperm chloroplastgenomes. Recombination between repeats or tRNAs has been suggested as twomeans of chloroplast genome rearrangements. We compared the relativenumber of repeats in Trachelium to eight other angiosperm chloroplastgenomes, and evaluated the location of repeats and tRNAs in relation torearrangements. Trachelium has the highest number and largest repeats,which are concentrated near inversion endpoints or other rearrangements.tRNAs occur at many but not all inversion endpoints. There is likely nosingle mechanism responsible for the remarkable number of alterations inthis genome, but both repeats and tRNAs are clearly associated with theserearrangements. Land plant chloroplast genomes are highly conserved instructure, gene order and content. The chloroplast genomes of ferns, thegymnosperm Ginkgo, and most angiosperms are nearly collinear, reflectingthe gene order in lineages that diverged from lycopsids and the ancestralchloroplast gene order over 350 million years ago (Raubeson and Jansen,1992). Although earlier mapping studies identified a number of taxa inwhich several rearrangements have occurred (reviewed in Raubeson andJansen, 2005), an extraordinary number of chloroplast genome alterationsare concentrated in several families in the angiosperm order Asterales(sensu APGII, Bremer et al., 2003). Gene mapping studies ofrepresentatives of the Campanulaceae (Cosner, 1993; Cosner et al.,1997,2004) and Lobeliaceae (Knox et al., 1993; Knox and Palmer, 1999)identified large inversions, contraction and expansion of the invertedrepeat regions, and several insertions and deletions in the cpDNAs ofthese closely related taxa. Detailed restriction site and gene mapping ofthe chloroplast genome of Trachelium caeruleum (Campanulaceae) identifiedseven to ten large inversions, families of repeats associated withrearrangements, possible transpositions, and even the disruption ofoperons (Cosner et al., 1997). Seventeen other members of theCampanulaceae were mapped and exhibit many additional rearrangements(Cosner et al., 2004). What happened in this lineage that made itsusceptible to so many chloroplast genome rearrangements? How do normallyvery conserved chloroplast genomes change? The cause of rearrangements inthis group is unclear based on the limited resolution available withmapping techniques. Several mechanisms have been proposed to explain howrearrangements occur: recombination between repeats, transposition, ortemporary instability due to loss of the inverted repeat (Raubeson andJansen, 2005). Sequencing whole chloroplast genomes within theCampanulaceae offers a unique opportunity to examine both the extent andmechanisms of rearrangements within a phylogenetic framework.We reporthere the first complete chloroplast genome sequence of a member of theCampanulaceae, Trachelium caeruleum. This work will serve as a benchmarkfor subsequent, comparative sequencing and analysis of other members ofthis family and close relatives, with the goal of further understandingchloroplast genome evolution. We confirmed features previously identifiedthrough mapping, and discovered many additional structural changes,including several partial to entire gene duplications, deterioration ofat least four normally conserved chloroplast genes into gene fragments,and the nature and position of numerous repeat elements at or nearinversion endpoints. The focus of this paper is on analyses of sequencesat or near these rearrangements in Trachelium caeruleum. Inversions arebelieved to occur due to the presence of repeat elements subject tohomologous recombination (Palmer, 1991; Knox et al., 1993). Repeats mayfacilitate inversions or other genome rearrangements (Achaz et al.,2003), and higher incidences of repeats have been correlated with greaternumbers of rearrangements (Rocha, 2003). Alternatively, repeats mayproliferate within a genome asa result of DNA strand repair mechanismsfollowing a rearrangement event such as an inversion. Gene« less
Large diversity of the piggyBac-like elements in the genome of Tribolium castaneum
Wang, Jianjun; Du, Yuzhou; Wang, Suzhi; Brown, Sue; Park, Yoonseong
2011-01-01
The piggyBac transposable element, originally discovered in the cabbage looper, Trichoplusia ni, has been widely used in insect transgenesis including the red flour beetle Tribolium castaneum. We surveyed piggyBac-like (PLE) sequences in the genome of Tribolium castaneum by homology searches using as queries the diverse PLE sequences that have been described previously. The search yielded a total of 32 piggyBac-like elements (TcPLEs) which were classified into 14 distinct groups. Most of the TcPLEs contain defective functional motifs in that they are lacking inverted terminal repeats or have disrupted open reading frames. Only one single copy of TcPLE1 appears to be intact with imperfect 16 bp inverted terminal repeats flanking an open reading frame encoding a transposase of 571 amino acid residues. Many copies of TcPLEs were found to be inserted into or close to other transposon-like sequences. This large diversity of TcPLEs with generally low copy numbers suggests multiple invasions of the TcPLEs over a long evolutionary time without extensive multiplications or occurrence of rapid loss of TcPLEs copies. PMID:18342253
NASA Astrophysics Data System (ADS)
Fuh, Yiin-Kuen; Lai, Zheng-Hong
2017-02-01
A fast processing route of aspheric polydimethylsiloxane (PDMS) lenses array (APLA) is proposed via the combined effect of inverted gravitational and heat-assisted forces. The fabrication time can be dramatically reduced to 30 s, compared favorably to the traditional duration of 2 hours of repeated cycles of addition-curing processes. In this paper, a low-cost flexible lens can be fabricated by repeatedly depositing, inverting, curing a hanging transparent PDMS elastomer droplet on a previously deposited curved structure. Complex structures with aspheric curve features and various focal lengths can be successfully produced and the fabricated 4 types of APLA have various focal lengths in the range of 7.03 mm, 6.00 mm, 5.33 mm, and 4.43 mm, respectively. Empirically, a direct relationship between the PDMS volume and focal lengths of the lenses can be experimentally deducted. Using these fabricated APLA, an ordinary commercial smartphone camera can be easily transformed to a low-cost, portable digital microscopy (50×magnification) such that point of care diagnostic can be implemented pervasively.
Rathi, Preeti; Maurer, Sara; Summerer, Daniel
2018-06-05
The epigenetic DNA nucleobases 5-methylcytosine (5mC) and N 4-methylcytosine (4mC) coexist in bacterial genomes and have important functions in host defence and transcription regulation. To better understand the individual biological roles of both methylated nucleobases, analytical strategies for distinguishing unmodified cytosine (C) from 4mC and 5mC are required. Transcription-activator-like effectors (TALEs) are programmable DNA-binding repeat proteins, which can be re-engineered for the direct detection of epigenetic nucleobases in user-defined DNA sequences. We here report the natural, cytosine-binding TALE repeat to not strongly differentiate between 5mC and 4mC. To engineer repeats with selectivity in the context of C, 5mC and 4mC, we developed a homogeneous fluorescence assay and screened a library of size-reduced TALE repeats for binding to all three nucleobases. This provided insights into the requirements of size-reduced TALE repeats for 4mC binding and revealed a single mutant repeat as a selective binder of 4mC. Employment of a TALE with this repeat in affinity enrichment enabled the isolation of a user-defined DNA sequence containing a single 4mC but not C or 5mC from the background of a bacterial genome. Comparative enrichments with TALEs bearing this or the natural C-binding repeat provides an approach for the complete, programmable decoding of all cytosine nucleobases found in bacterial genomes.This article is part of a discussion meeting issue 'Frontiers in epigenetic chemical biology'. © 2018 The Author(s).
Andriollo, Paolo; Hind, Charlotte K; Picconi, Pietro; Nahar, Kazi S; Jamshidi, Shirin; Varsha, Amrit; Clifford, Melanie; Sutton, J Mark; Rahman, Khondaker Miraz
2018-02-09
Antimicrobial resistance has become a major global concern. Development of novel antimicrobial agents for the treatment of infections caused by multidrug resistant (MDR) pathogens is an urgent priority. Pyrrolobenzodiazepines (PBDs) are a promising class of antibacterial agents initially discovered and isolated from natural sources. Recently, C8-linked PBD biaryl conjugates have been shown to be active against some MDR Gram-positive strains. To explore the role of building block orientations on antibacterial activity and obtain structure activity relationship (SAR) information, four novel structures were synthesized in which the building blocks of previously reported compounds were inverted, and their antibacterial activity was studied. The compounds showed minimum inhibitory concentrations (MICs) in the range of 0.125-32 μg/mL against MDR Gram-positive strains with a bactericidal mode of action. The results showed that a single inversion of amide bonds reduces the activity while the double inversion restores the activity against MDR pathogens. All inverted compounds did not stabilize DNA and lacked eukaryotic toxicity. The compounds inhibit DNA gyrase in vitro, and the most potent compound was equally active against both wild-type and mutant DNA gyrase in a biochemical assay. The observed activity of the compounds against methicillin resistant S. aureus (MRSA) strains with equivalent gyrase mutations is consistent with gyrase inhibition being the mechanism of action in vivo, although this has not been definitively confirmed in whole cells. This conclusion is supported by a molecular modeling study showing interaction of the compounds with wild-type and mutant gyrases. This study provides important SAR information about this new class of antibacterial agents.
DNA replication stress restricts ribosomal DNA copy number
Salim, Devika; Bradford, William D.; Freeland, Amy; Cady, Gillian; Wang, Jianmin
2017-01-01
Ribosomal RNAs (rRNAs) in budding yeast are encoded by ~100–200 repeats of a 9.1kb sequence arranged in tandem on chromosome XII, the ribosomal DNA (rDNA) locus. Copy number of rDNA repeat units in eukaryotic cells is maintained far in excess of the requirement for ribosome biogenesis. Despite the importance of the repeats for both ribosomal and non-ribosomal functions, it is currently not known how “normal” copy number is determined or maintained. To identify essential genes involved in the maintenance of rDNA copy number, we developed a droplet digital PCR based assay to measure rDNA copy number in yeast and used it to screen a yeast conditional temperature-sensitive mutant collection of essential genes. Our screen revealed that low rDNA copy number is associated with compromised DNA replication. Further, subculturing yeast under two separate conditions of DNA replication stress selected for a contraction of the rDNA array independent of the replication fork blocking protein, Fob1. Interestingly, cells with a contracted array grew better than their counterparts with normal copy number under conditions of DNA replication stress. Our data indicate that DNA replication stresses select for a smaller rDNA array. We speculate that this liberates scarce replication factors for use by the rest of the genome, which in turn helps cells complete DNA replication and continue to propagate. Interestingly, tumors from mini chromosome maintenance 2 (MCM2)-deficient mice also show a loss of rDNA repeats. Our data suggest that a reduction in rDNA copy number may indicate a history of DNA replication stress, and that rDNA array size could serve as a diagnostic marker for replication stress. Taken together, these data begin to suggest the selective pressures that combine to yield a “normal” rDNA copy number. PMID:28915237
Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus.
Lakshmikumaran, M S; D'Ambrosio, E; Laimins, L A; Lin, D T; Furano, A V
1985-01-01
The insulin 1, but not the insulin 2, locus is polymorphic (i.e., exhibits allelic variation) in rats. Restriction enzyme analysis and hybridization studies showed that the polymorphic region is 2.2 kilobases upstream of the insulin 1 coding region and is due to the presence or absence of an approximately 2.7-kilobase repeated DNA element. DNA sequence determination showed that this DNA element is a member of a long interspersed repeated DNA family (LINE) that is highly repeated (greater than 50,000 copies) and highly transcribed in the rat. Although the presence or absence of LINE sequences at the insulin 1 locus occurs in both the homozygous and heterozygous states, LINE-containing insulin 1 alleles are more prevalent in the rat population than are alleles without LINEs. Restriction enzyme analysis of the LINE-containing alleles indicated that at least two versions of the LINE sequence may be present at the insulin 1 locus in different rats. Either repeated transposition of LINE sequences or gene conversion between the resident insulin 1 LINE and other sequences in the genome are possible explanations for this. Images PMID:3016521
de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas
2014-01-01
The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. PMID:24792163
Primary analysis of repeat elements of the Asian seabass (Lates calcarifer) transcriptome and genome
Kuznetsova, Inna S.; Thevasagayam, Natascha M.; Sridatta, Prakki S. R.; Komissarov, Aleksey S.; Saju, Jolly M.; Ngoh, Si Y.; Jiang, Junhui; Shen, Xueyan; Orbán, László
2014-01-01
As part of our Asian seabass genome project, we are generating an inventory of repeat elements in the genome and transcriptome. The karyotype showed a diploid number of 2n = 24 chromosomes with a variable number of B-chromosomes. The transcriptome and genome of Asian seabass were searched for repetitive elements with experimental and bioinformatics tools. Six different types of repeats constituting 8–14% of the genome were characterized. Repetitive elements were clustered in the pericentromeric heterochromatin of all chromosomes, but some of them were preferentially accumulated in pretelomeric and pericentromeric regions of several chromosomes pairs and have chromosomes specific arrangement. From the dispersed class of fish-specific non-LTR retrotransposon elements Rex1 and MAUI-like repeats were analyzed. They were wide-spread both in the genome and transcriptome, accumulated on the pericentromeric and peritelomeric areas of all chromosomes. Every analyzed repeat was represented in the Asian seabass transcriptome, some showed differential expression between the gonads. The other group of repeats analyzed belongs to the rRNA multigene family. FISH signal for 5S rDNA was located on a single pair of chromosomes, whereas that for 18S rDNA was found on two pairs. A BAC-derived contig containing rDNA was sequenced and assembled into a scaffold containing incomplete fragments of 18S rDNA. Their assembly and chromosomal position revealed that this part of Asian seabass genome is extremely rich in repeats containing evolutionarily conserved and novel sequences. In summary, transcriptome assemblies and cDNA data are suitable for the identification of repetitive DNA from unknown genomes and for comparative investigation of conserved elements between teleosts and other vertebrates. PMID:25120555
Heterochromatic siRNAs and DDM1 Independently Silence Aberrant 5S rDNA Transcripts in Arabidopsis
Blevins, Todd; Pontes, Olga; Pikaard, Craig S.; Meins, Frederick
2009-01-01
5S ribosomal RNA gene repeats are arranged in heterochromatic arrays (5S rDNA) situated near the centromeres of Arabidopsis chromosomes. The chromatin remodeling factor DDM1 is known to maintain 5S rDNA methylation patterns while silencing transcription through 5S rDNA intergenic spacers (IGS). We mapped small-interfering RNAs (siRNA) to a composite 5S rDNA repeat, revealing a high density of siRNAs matching silenced IGS transcripts. IGS transcript repression requires proteins of the heterochromatic siRNA pathway, including RNA polymerase IV (Pol IV), RNA-DEPENDENT RNA POLYMERASE 2 (RDR2) and DICER-LIKE 3 (DCL3). Using molecular and cytogenetic approaches, we show that the DDM1 and siRNA-dependent silencing effects are genetically independent. DDM1 suppresses production of the siRNAs, however, thereby limiting RNA-directed DNA methylation at 5S rDNA repeats. We conclude that DDM1 and siRNA-dependent silencing are overlapping processes that both repress aberrant 5S rDNA transcription and contribute to the heterochromatic state of 5S rDNA arrays. PMID:19529764
Yang, Yingjie; Kurokawa, Toru; Takahama, Yoshifumi; Nindita, Yosi; Mochizuki, Susumu; Arakawa, Kenji; Endo, Satoru; Kinashi, Haruyasu
2011-01-01
The 113,463-bp nucleotide sequence of the linear plasmid pSLA2-M of Streptomyces rochei 7434AN4 was determined. pSLA2-M had a 69.7% overall GC content, 352-bp terminal inverted repeats with 91% (321/352) identity at both ends, and 121 open reading frames. The rightmost 14.6-kb sequence was almost (14,550/14,555) identical to that of the coexisting 211-kb linear plasmid pSLA2-L. Adjacent to this homologous region an 11.8-kb CRISPR cluster was identified, which is known to function against phage infection in prokaryotes. This cluster region as well as another one containing two large membrane protein genes (orf78 and orf79) were flanked by direct repeats of 194 and 566 bp respectively. Hence the insertion of circular DNAs containing each cluster by homologous recombination was suggested. In addition, the orf71 encoded a Ku70/Ku80-like protein, known to function in the repair of double-strand DNA breaks in eukaryotes, but disruption of it did not affect the radiation sensitivity of the mutant. A pair of replication initiation genes (orf1-orf2) were identified at the extreme left end. Thus, pSLA2-M proved to be a composite linear plasmid characterized by self-defense genes and homology with pSLA2-L that might have been generated by multiple recombination events.
Yan, Fan; Di, Shaokang; Takahashi, Ryoji
2015-08-01
The R gene of soybean, presumably encoding a MYB transcription factor, controls seed coat color. The gene consists of multiple alleles, R (black), r-m (black spots and (or) concentric streaks on brown seed), and r (brown seed). This study was conducted to determine the structure of the MYB transcription factor gene in a near-isogenic line (NIL) having r-m allele. PCR amplification of a fragment of the candidate gene Glyma.09G235100 generated a fragment of about 1 kb in the soybean cultivar Clark, whereas a fragment of about 14 kb in addition to fragments of 1 and 1.4 kb were produced in L72-2040, a Clark 63 NIL with the r-m allele. Clark 63 is a NIL of Clark with the rxp and Rps1 alleles. A DNA fragment of 13 060 bp was inserted in the intron of Glyma.09G235100 in L72-2040. The fragment had the CACTA motif at both ends, imperfect terminal inverted repeats (TIR), inverse repetition of short sequence motifs close to the 5' and 3' ends, and a duplication of three nucleotides at the site of integration, indicating that it belongs to a CACTA-superfamily transposable element. We designated the element as Tgm11. Overall nucleotide sequence, motifs of TIR, and subterminal repeats were similar to those of Tgm1 and Tgs1, suggesting that these elements comprise a family.
Ehrmann, M A; Vogel, R E
2001-11-01
An insertion sequence has been identified in the genome of Lactobacillus sanfranciscensis DSM 20451T as segment of 1351 nucleotides containing 37-bp imperfect terminal inverted repeats. The sequence of this element encodes two out of phase, overlapping open reading frames, orfA and orfB, from which three putative proteins are produced. OrfAB is a transframe protein produced by -1 translational frame shifting between orf A and orf B that is presumed to be the transposase. The large orfAB of this element encodes a 342 amino acid protein that displays similarities with transposases encoded by bacterial insertion sequences belonging to the IS3 family. In L. sanfranciscensis type strain DSM 20451T multiple truncated IS elements were identified. Inverse PCR was used to analyze target sites of four of these elements, but except of their highly AT rich character not any sequence specificity was identified so far. Moreover, no flanking direct repeats were identified. Multiple copies of IS153 were detected by hybridization in other strains of L. sanfranciscensis. Resulting hybridization patterns were shown to differentiate between organisms at strain level rather than a probe targeted against the 16S rDNA. With a PCR based approach IS153 or highly similar sequences were detected in L. acidophilus, L. casei, L. malefermentans, L. plantarum, L. hilgardii, L. collinoides L. farciminis L. sakei and L. salivarius, L. reuteri as well as in Enterococcus faecium, Pediococcus acidilactici and P. pentosaceus.
Tandemly repeated sequences in mtDNA control region of whitefish, Coregonus lavaretus.
Brzuzan, P
2000-06-01
Length variation of the mitochondrial DNA control region was observed with PCR amplification of a sample of 138 whitefish (Coregonus lavaretus). Nucleotide sequences of representative PCR products showed that the variation was due to the presence of an approximately 100-bp motif tandemly repeated two, three, or five times in the region between the conserved sequence block-3 (CSB-3) and the gene for phenylalanine tRNA. This is the first report on the tandem array composed of long repeat units in mitochondrial DNA of salmonids.
Dynamics and biological relevance of DNA demethylation in Arabidopsis antibacterial defense.
Yu, Agnès; Lepère, Gersende; Jay, Florence; Wang, Jingyu; Bapaume, Laure; Wang, Yu; Abraham, Anne-Laure; Penterman, Jon; Fischer, Robert L; Voinnet, Olivier; Navarro, Lionel
2013-02-05
DNA methylation is an epigenetic mark that silences transposable elements (TEs) and repeats. Whereas the establishment and maintenance of DNA methylation are relatively well understood, little is known about their dynamics and biological relevance in plant and animal innate immunity. Here, we show that some TEs are demethylated and transcriptionally reactivated during antibacterial defense in Arabidopsis. This effect is correlated with the down-regulation of key transcriptional gene silencing factors and is partly dependent on an active demethylation process. DNA demethylation restricts multiplication and vascular propagation of the bacterial pathogen Pseudomonas syringae in leaves and, accordingly, some immune-response genes, containing repeats in their promoter regions, are negatively regulated by DNA methylation. This study provides evidence that DNA demethylation is part of a plant-induced immune response, potentially acting to prime transcriptional activation of some defense genes linked to TEs/repeats.
Willwand, K; Baldauf, A Q; Deleu, L; Mumtsidu, E; Costello, E; Beard, P; Rommelaere, J
1997-10-01
The right-end telomere of replicative form (RF) DNA of the autonomous parvovirus minute virus of mice (MVM) consists of a sequence that is self-complementary except for a three nucleotide loop around the axis of symmetry and an interior bulge of three unpaired nucleotides on one strand (designated the right-end 'bubble'). This right-end inverted repeat can exist in the form of a folded-back strand (hairpin conformation) or in an extended form, base-paired to a copy strand (duplex conformation). We recently reported that the right-end telomere is processed in an A9 cell extract supplemented with the MVM nonstructural protein NS1. This processing is shown here to result from the NS1-dependent nicking of the complementary strand at a unique position 21 nt inboard of the folded-back genomic 5' end. DNA species terminating in duplex or hairpin configurations, or in a mutated structure that has lost the right-end bulge, are all cleaved in the presence of NS1, indicating that features distinguishing these structures are not prerequisites for nicking under the in vitro conditions tested. Cleavage of the hairpin structure is followed by strand-displacement synthesis, generating the right-end duplex conformation, while processing of the duplex structure leads to the release of free right-end telomeres. In the majority of molecules, displacement synthesis at the right terminus stops a few nucleotides before reaching the end of the template strand, possibly due to NS1 which is covalently bound to this end. A fraction of the right-end duplex product undergoes melting and re-folding into hairpin structures (formation of a 'rabbit-ear' structure).
rbcL and matK earn two thumbs up as the core DNA barcode for ferns.
Li, Fay-Wei; Kuo, Li-Yaung; Rothfels, Carl J; Ebihara, Atsushi; Chiou, Wen-Liang; Windham, Michael D; Pryer, Kathleen M
2011-01-01
DNA barcoding will revolutionize our understanding of fern ecology, most especially because the accurate identification of the independent but cryptic gametophyte phase of the fern's life history--an endeavor previously impossible--will finally be feasible. In this study, we assess the discriminatory power of the core plant DNA barcode (rbcL and matK), as well as alternatively proposed fern barcodes (trnH-psbA and trnL-F), across all major fern lineages. We also present plastid barcode data for two genera in the hyperdiverse polypod clade--Deparia (Woodsiaceae) and the Cheilanthes marginata group (currently being segregated as a new genus of Pteridaceae)--to further evaluate the resolving power of these loci. Our results clearly demonstrate the value of matK data, previously unavailable in ferns because of difficulties in amplification due to a major rearrangement of the plastid genome. With its high sequence variation, matK complements rbcL to provide a two-locus barcode with strong resolving power. With sequence variation comparable to matK, trnL-F appears to be a suitable alternative barcode region in ferns, and perhaps should be added to the core barcode region if universal primer development for matK fails. In contrast, trnH-psbA shows dramatically reduced sequence variation for the majority of ferns. This is likely due to the translocation of this segment of the plastid genome into the inverted repeat regions, which are known to have a highly constrained substitution rate. Our study provides the first endorsement of the two-locus barcode (rbcL+matK) in ferns, and favors trnL-F over trnH-psbA as a potential back-up locus. Future work should focus on gathering more fern matK sequence data to facilitate universal primer development.
Processing of double-R-loops in (CAG)·(CTG) and C9orf72 (GGGGCC)·(GGCCCC) repeats causes instability
Reddy, Kaalak; Schmidt, Monika H.M.; Geist, Jaimie M.; Thakkar, Neha P.; Panigrahi, Gagan B.; Wang, Yuh-Hwa; Pearson, Christopher E.
2014-01-01
R-loops, transcriptionally-induced RNA:DNA hybrids, occurring at repeat tracts (CTG)n, (CAG)n, (CGG)n, (CCG)n and (GAA)n, are associated with diseases including myotonic dystrophy, Huntington's disease, fragile X and Friedreich's ataxia. Many of these repeats are bidirectionally transcribed, allowing for single- and double-R-loop configurations, where either or both DNA strands may be RNA-bound. R-loops can trigger repeat instability at (CTG)·(CAG) repeats, but the mechanism of this is unclear. We demonstrate R-loop-mediated instability through processing of R-loops by HeLa and human neuron-like cell extracts. Double-R-loops induced greater instability than single-R-loops. Pre-treatment with RNase H only partially suppressed instability, supporting a model in which R-loops directly generate instability by aberrant processing, or via slipped-DNA formation upon RNA removal and its subsequent aberrant processing. Slipped-DNAs were observed to form following removal of the RNA from R-loops. Since transcriptionally-induced R-loops can occur in the absence of DNA replication, R-loop processing may be a source of repeat instability in the brain. Double-R-loop formation and processing to instability was extended to the expanded C9orf72 (GGGGCC)·(GGCCCC) repeats, known to cause amyotrophic lateral sclerosis and frontotemporal dementia, providing the first suggestion through which these repeats may become unstable. These findings provide a mechanistic basis for R-loop-mediated instability at disease-associated repeats. PMID:25147206
Ni, ZhouXian; Ye, YouJu; Bai, Tiandao; Xu, Meng; Xu, Li-An
2017-09-11
The chloroplast genome (CPG) of Pinus massoniana belonging to the genus Pinus (Pinaceae), which is a primary source of turpentine, was sequenced and analyzed in terms of gene rearrangements, ndh genes loss, and the contraction and expansion of short inverted repeats (IRs). P. massoniana CPG has a typical quadripartite structure that includes large single copy (LSC) (65,563 bp), small single copy (SSC) (53,230 bp) and two IRs (IRa and IRb, 485 bp). The 108 unique genes were identified, including 73 protein-coding genes, 31 tRNAs, and 4 rRNAs. Most of the 81 simple sequence repeats (SSRs) identified in CPG were mononucleotides motifs of A/T types and located in non-coding regions. Comparisons with related species revealed an inversion (21,556 bp) in the LSC region; P. massoniana CPG lacks all 11 intact ndh genes (four ndh genes lost completely; the five remained truncated as pseudogenes; and the other two ndh genes remain as pseudogenes because of short insertions or deletions). A pair of short IRs was found instead of large IRs, and size variations among pine species were observed, which resulted from short insertions or deletions and non-synchronized variations between "IRa" and "IRb". The results of phylogenetic analyses based on whole CPG sequences of 16 conifers indicated that the whole CPG sequences could be used as a powerful tool in phylogenetic analyses.
Price, G Dean; Howitt, Susan M
2014-09-01
This mini-review addresses advances in understanding the transmembrane topologies of two unrelated, single-subunit bicarbonate transporters from cyanobacteria, namely BicA and SbtA. BicA is a Na(+)-dependent bicarbonate transporter that belongs to the SulP/SLC26 family that is widespread in both eukaryotes and prokaryotes. Topology mapping of BicA via the phoA/lacZ fusion reporter method identified 12 transmembrane helices with an unresolved hydrophobic region just beyond helix 8. Re-interpreting this data in the light of a recent topology study on rat prestin leads to a consensus topology of 14 transmembrane domains with a 7+7 inverted repeat structure. SbtA is also a Na(+)-dependent bicarbonate transporter, but of considerably higher affinity (Km 2-5 μM versus >100 μM for BicA). Whilst SbtA is widespread in cyanobacteria and a few bacteria, it appears to be absent from eukaryotes. Topology mapping of SbtA via the phoA/lacZ fusion reporter method identified 10 transmembrane helices. The topology consists of a 5+5 inverted repeat, with the two repeats separated by a large intracellular loop. The unusual location of the N and C-termini outside the cell raises the possibility that SbtA forms a novel fold, not so far identified by structural and topological studies on transport proteins.
Sabir, Jamal; Schwarz, Erika; Ellison, Nicholas; Zhang, Jin; Baeshen, Nabih A; Mutwakil, Muhammed; Jansen, Robert; Ruhlman, Tracey
2014-08-01
Land plant plastid genomes (plastomes) provide a tractable model for evolutionary study in that they are relatively compact and gene dense. Among the groups that display an appropriate level of variation for structural features, the inverted-repeat-lacking clade (IRLC) of papilionoid legumes presents the potential to advance general understanding of the mechanisms of genomic evolution. Here, are presented six complete plastome sequences from economically important species of the IRLC, a lineage previously represented by only five completed plastomes. A number of characters are compared across the IRLC including gene retention and divergence, synteny, repeat structure and functional gene transfer to the nucleus. The loss of clpP intron 2 was identified in one newly sequenced member of IRLC, Glycyrrhiza glabra. Using deeply sequenced nuclear transcriptomes from two species helped clarify the nature of the functional transfer of accD to the nucleus in Trifolium, which likely occurred in the lineage leading to subgenus Trifolium. Legumes are second only to cereal crops in agricultural importance based on area harvested and total production. Genetic improvement via plastid transformation of IRLC crop species is an appealing proposition. Comparative analyses of intergenic spacer regions emphasize the need for complete genome sequences for developing transformation vectors for plastid genetic engineering of legume crops. © 2014 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Alu repeated DNAs are differentially methylated in primate germ cells.
Rubin, C M; VandeVoort, C A; Teplitz, R L; Schmid, C W
1994-01-01
A significant fraction of Alu repeats in human sperm DNA, previously found to be unmethylated, is nearly completely methylated in DNA from many somatic tissues. A similar fraction of unmethylated Alus is observed here in sperm DNA from rhesus monkey. However, Alus are almost completely methylated at the restriction sites tested in monkey follicular oocyte DNA. The Alu methylation patterns in mature male and female monkey germ cells are consistent with Alu methylation in human germ cell tumors. Alu sequences are hypomethylated in seminoma DNAs and more methylated in a human ovarian dysgerminoma. These results contrast with methylation patterns reported for germ cell single-copy, CpG island, satellite, and L1 sequences. The function of Alu repeats is not known, but differential methylation of Alu repeats in the male and female germ lines suggests that they may serve as markers for genomic imprinting or in maintaining differences in male and female meiosis. Images PMID:7800508
Length and sequence heterogeneity in 5S rDNA of Populus deltoides.
Negi, Madan S; Rajagopal, Jyothi; Chauhan, Neeti; Cronn, Richard; Lakshmikumaran, Malathi
2002-12-01
The 5S rRNA genes and their associated non-transcribed spacer (NTS) regions are present as repeat units arranged in tandem arrays in plant genomes. Length heterogeneity in 5S rDNA repeats was previously identified in Populus deltoides and was also observed in the present study. Primers were designed to amplify the 5S rDNA NTS variants from the P. deltoides genome. The PCR-amplified products from the two accessions of P. deltoides (G3 and G48) suggested the presence of length heterogeneity of 5S rDNA units within and among accessions, and the size of the spacers ranged from 385 to 434 bp. Sequence analysis of the non-transcribed spacer (NTS) revealed two distinct classes of 5S rDNA within both accessions: class 1, which contained GAA trinucleotide microsatellite repeats, and class 2, which lacked the repeats. The class 1 spacer shows length variation owing to the microsatellite, with two clones exhibiting 10 GAA repeat units and one clone exhibiting 16 such repeat units. However, distance analysis shows that class 1 spacer sequences are highly similar inter se, yielding nucleotide diversity (pi) estimates that are less than 0.15% of those obtained for class 2 spacers (pi = 0.0183 vs. 0.1433, respectively). The presence of microsatellite in the NTS region leading to variation in spacer length is reported and discussed for the first time in P. deltoides.
Electronic Transport in Single-Stranded DNA Molecule Related to Huntington's Disease
NASA Astrophysics Data System (ADS)
Sarmento, R. G.; Silva, R. N. O.; Madeira, M. P.; Frazão, N. F.; Sousa, J. O.; Macedo-Filho, A.
2018-04-01
We report a numerical analysis of the electronic transport in single chain DNA molecule consisting of 182 nucleotides. The DNA chains studied were extracted from a segment of the human chromosome 4p16.3, which were modified by expansion of CAG (cytosine-adenine-guanine) triplet repeats to mimics Huntington's disease. The mutated DNA chains were connected between two platinum electrodes to analyze the relationship between charge propagation in the molecule and Huntington's disease. The computations were performed within a tight-binding model, together with a transfer matrix technique, to investigate the current-voltage (I-V) of 23 types of DNA sequence and compare them with the distributions of the related CAG repeat numbers with the disease. All DNA sequences studied have a characteristic behavior of a semiconductor. In addition, the results showed a direct correlation between the current-voltage curves and the distributions of the CAG repeat numbers, suggesting possible applications in the development of DNA-based biosensors for molecular diagnostics.
Parvovirus infection-induced DNA damage response
Luo, Yong; Qiu, Jianming
2014-01-01
Parvoviruses are a group of small DNA viruses with ssDNA genomes flanked by two inverted terminal structures. Due to a limited genetic resource they require host cellular factors and sometimes a helper virus for efficient viral replication. Recent studies have shown that parvoviruses interact with the DNA damage machinery, which has a significant impact on the life cycle of the virus as well as the fate of infected cells. In addition, due to special DNA structures of the viral genomes, parvoviruses are useful tools for the study of the molecular mechanisms underlying viral infection-induced DNA damage response (DDR). This review aims to summarize recent advances in parvovirus-induced DDR, with a focus on the diverse DDR pathways triggered by different parvoviruses and the consequences of DDR on the viral life cycle as well as the fate of infected cells. PMID:25429305
ERIC Educational Resources Information Center
McNamara-Schroeder, Kathleen; Olonan, Cheryl; Chu, Simon; Montoya, Maria C.; Alviri, Mahta; Ginty, Shannon; Love, John J.
2006-01-01
We have devised and implemented a DNA fingerprinting module for an upper division undergraduate laboratory based on the amplification and analysis of three of the 13 short tandem repeat loci that are required by the Federal Bureau of Investigation Combined DNA Index System (FBI CODIS) data base. Students first collect human epithelial (cheek)…
Bio-recognitive photonics of a DNA-guided organic semiconductor
Back, Seung Hyuk; Park, Jin Hyuk; Cui, Chunzhi; Ahn, Dong June
2016-01-01
Incorporation of duplex DNA with higher molecular weights has attracted attention for a new opportunity towards a better organic light-emitting diode (OLED) capability. However, biological recognition by OLED materials is yet to be addressed. In this study, specific oligomeric DNA–DNA recognition is successfully achieved by tri (8-hydroxyquinoline) aluminium (Alq3), an organic semiconductor. Alq3 rods crystallized with guidance from single-strand DNA molecules show, strikingly, a unique distribution of the DNA molecules with a shape of an ‘inverted' hourglass. The crystal's luminescent intensity is enhanced by 1.6-fold upon recognition of the perfect-matched target DNA sequence, but not in the case of a single-base mismatched one. The DNA–DNA recognition forming double-helix structure is identified to occur only in the rod's outer periphery. This study opens up new opportunities of Alq3, one of the most widely used OLED materials, enabling biological recognition. PMID:26725969
Wang, Hetong; He, Lei; Song, Jie; Cui, Weina; Zhang, Yanzhao; Jia, Chunyun; Francis, Dennis; Rogers, Hilary J; Sun, Lizong; Tai, Peidong; Hui, Xiujuan; Yang, Yuesuo; Liu, Wan
2016-05-01
Microsatellite instability (MSI) analysis, random-amplified polymorphic DNA (RAPD), and methylation-sensitive arbitrarily primed PCR (MSAP-PCR) are methods to evaluate the toxicity of environmental pollutants in stress-treated plants and human cancer cells. Here, we evaluate these techniques to screen for genetic and epigenetic alterations of Arabidopsis plantlets exposed to 0-5.0 mg L(-1) cadmium (Cd) for 15 d. There was a substantial increase in RAPD polymorphism of 24.5, and in genomic methylation polymorphism of 30.5-34.5 at CpG and of 14.5-20 at CHG sites under Cd stress of 5.0 mg L(-1) by RAPD and of 0.25-5.0 mg L(-1) by MSAP-PCR, respectively. However, only a tiny increase of 1.5 loci by RAPD occurred under Cd stress of 4.0 mg L(-1), and an additional high dose (8.0 mg L(-1)) resulted in one repeat by MSI analysis. MSAP-PCR detected the most significant epigenetic modifications in plantlets exposed to Cd stress, and the patterns of hypermethylation and polymorphisms were consistent with inverted U-shaped dose responses. The presence of genomic methylation polymorphism in Cd-treated seedlings, prior to the onset of RAPD polymorphism, MSI and obvious growth effects, suggests that these altered DNA methylation loci are the most sensitive biomarkers for early diagnosis and risk assessment of genotoxic effects of Cd pollution in ecotoxicology. Copyright © 2016 Elsevier Ltd. All rights reserved.
Pietan, Lucas L.; Spradling, Theresa A.
2016-01-01
In animals, mitochondrial DNA (mtDNA) typically occurs as a single circular chromosome with 13 protein-coding genes and 22 tRNA genes. The various species of lice examined previously, however, have shown mitochondrial genome rearrangements with a range of chromosome sizes and numbers. Our research demonstrates that the mitochondrial genomes of two species of chewing lice found on pocket gophers, Geomydoecus aurei and Thomomydoecus minor, are fragmented with the 1,536 base-pair (bp) cytochrome-oxidase subunit I (cox1) gene occurring as the only protein-coding gene on a 1,916–1,964 bp minicircular chromosome in the two species, respectively. The cox1 gene of T. minor begins with an atypical start codon, while that of G. aurei does not. Components of the non-protein coding sequence of G. aurei and T. minor include a tRNA (isoleucine) gene, inverted repeat sequences consistent with origins of replication, and an additional non-coding region that is smaller than the non-coding sequence of other lice with such fragmented mitochondrial genomes. Sequences of cox1 minichromosome clones for each species reveal extensive length and sequence heteroplasmy in both coding and noncoding regions. The highly variable non-gene regions of G. aurei and T. minor have little sequence similarity with one another except for a 19-bp region of phylogenetically conserved sequence with unknown function. PMID:27589589
Inverter Load Rejection Over-Voltage Testing: SolarCity CRADA Task 1a Final Report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nelson, A.; Hoke, A.; Chakraborty, S.
Various interconnection challenges exist when connecting distributed PV into the electrical distribution grid in terms of safety, reliability, and stability of electric power systems. One of the urgent areas for additional research - as identified by inverter manufacturers, installers, and utilities - is the potential for transient over-voltage from PV inverters. In one stage of a cooperative tests were repeated a total of seven times. The maximum over-voltage measured in any test did not exceed 200% of nominal, and typical over-voltage levels were significantly lower. The total voltage duration and the maximum continuous time above each threshold are presented here,more » as well as the time to disconnect for each test. Finally, we present a brief investigation into the effect of DC input voltage as well as a series of no-load tests. This report describes testing conducted at NREL to determine the duration and magnitude of transient over-voltages created by several commercial PV inverters during load-rejection conditions. For this work, a test plan that is currently under development by the Forum on Inverter Grid Integration Issues (FIGII) has been implemented in a custom test setup at NREL. Through a cooperative research and development agreement, NREL is working with SolarCity to address two specific types of transient overvoltage: load rejection overvoltage (LRO) and ground fault overvoltage (GFO). Additional partners in this effort include the Hawaiian Electric Companies, Northern Plains Power Technologies, and the Electric Power Research Institute.« less
de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas
2014-06-01
The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ying-Tai Wang; Zhao-Cai Wang; Bajalica, S.
We present the first case of direct and inverted reciprocal chromosome insertions between human chromosomes 7 and 14, ascertained because of repeated spontaneous abortions. Prometaphase GTG banding analysis showed the karyotype to be 46, XX, inv ins (7;14)(7pter {yields} 7q11.23::14q32.2 {yields} 14q22::7q21.2 {yields} 7qter), dir ins(14;7)(14pter {yields} 14q22::7q11.23 {yields} 7q21.2::14q32.2 {yields} 14qter). Origins of the insertion have been confirmed by chromosome painting with libraries specific for chromosomes 7 and 14 using fluorescence in situ hybridization. 5 refs., 3 figs.
Wang, Qianqian; Li, Lanlan; Wang, Xiaoting; Liu, Huanxiang; Yao, Xiaojun
2014-11-01
The Z-DNA-binding domain of human double-stranded RNA adenosine deaminase I (hZαADAR1) can specifically recognize the left-handed Z-DNA which preferentially occurs at alternating purine-pyrimidine repeats, especially the CG-repeats. The interactions of hZαADAR1 and Z-DNAs in different sequence contexts can affect many important biological functions including gene regulation and chromatin remodeling. Therefore it is of great necessity to fully understand their recognition mechanisms. However, most existing studies are aimed at the standard CG-repeat Z-DNA rather than the non-CG-repeats, and whether the molecular basis of hZαADAR1 binding to various Z-DNAs are identical or not is still unclear on the atomic level. Here, based on the recently determined crystal structures of three representative non-CG-repeat Z-DNAs (d(CACGTG)2, d(CGTACG)2 and d(CGGCCG)2) in complex with hZαADAR1, 40 ns molecular dynamics simulation together with binding free energy calculation were performed for each system. For comparison, the standard CG-repeat Z-DNA (d(CGCGCG)2) complexed with hZαADAR1 was also simulated. The consistent results demonstrate that nonpolar interaction is the driving force during the protein-DNA binding process, and that polar interaction mainly from helix α3 also provides important contributions. Five common hot-spot residues were identified, namely Lys169, Lys170, Asn173, Arg174 and Tyr177. Hydrogen bond analysis coupled with surface charge distribution further reveal the interfacial information between hZαADAR1 and Z-DNA in detail. All of the analysis illustrate that four complexes share the common key features and the similar binding modes irrespective of Z-DNA sequences, suggesting that Z-DNA recognition by hZαADAR1 is conformation-specific rather than sequence-specific. Additionally, by analyzing the conformational changes of hZαADAR1, we found that the binding of Z-DNA could effectively stabilize hZαADAR1 protein. Our study can provide some valuable information for better understanding the binding mechanism between hZαADAR1 or even other Z-DNA-binding protein and Z-DNA.
Linking actions and objects: Context-specific learning of novel weight priors.
Trewartha, Kevin M; Flanagan, J Randall
2017-06-01
Distinct explicit and implicit memory processes support weight predictions used when lifting objects and making perceptual judgments about weight, respectively. The first time that an object is encountered weight is predicted on the basis of learned associations, or priors, linking size and material to weight. A fundamental question is whether the brain maintains a single, global representation of priors, or multiple representations that can be updated in a context specific way. A second key question is whether the updating of priors, or the ability to scale lifting forces when repeatedly lifting unusually weighted objects requires focused attention. To investigate these questions we compared the adaptability of weight predictions used when lifting objects and judging their weights in different groups of participants who experienced size-weight inverted objects passively (with the objects placed on the hands) or actively (where participants lift the objects) under full or divided attention. To assess weight judgments we measured the size-weight illusion after every 20 trials of experience with the inverted objects both passively and actively. The attenuation of the illusion that arises when lifting inverted object was found to be context-specific such that the attenuation was larger when the mode of interaction with the inverted objects matched the method of assessment of the illusion. Dividing attention during interaction with the inverted objects had no effect on attenuation of the illusion, but did slow the rate at which lifting forces were scaled to the weight inverted objects. These findings suggest that the brain stores multiple representations of priors that are context specific, and that focused attention is important for scaling lifting forces, but not for updating weight predictions used when judging object weight. Copyright © 2017 Elsevier B.V. All rights reserved.
C9orf72 nucleotide repeat structures initiate molecular cascades of disease.
Haeusler, Aaron R; Donnelly, Christopher J; Periz, Goran; Simko, Eric A J; Shaw, Patrick G; Kim, Min-Sik; Maragakis, Nicholas J; Troncoso, Juan C; Pandey, Akhilesh; Sattler, Rita; Rothstein, Jeffrey D; Wang, Jiou
2014-03-13
A hexanucleotide repeat expansion (HRE), (GGGGCC)n, in C9orf72 is the most common genetic cause of the neurodegenerative diseases amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). Here we identify a molecular mechanism by which structural polymorphism of the HRE leads to ALS/FTD pathology and defects. The HRE forms DNA and RNA G-quadruplexes with distinct structures and promotes RNA•DNA hybrids (R-loops). The structural polymorphism causes a repeat-length-dependent accumulation of transcripts aborted in the HRE region. These transcribed repeats bind to ribonucleoproteins in a conformation-dependent manner. Specifically, nucleolin, an essential nucleolar protein, preferentially binds the HRE G-quadruplex, and patient cells show evidence of nucleolar stress. Our results demonstrate that distinct C9orf72 HRE structural polymorphism at both DNA and RNA levels initiates molecular cascades leading to ALS/FTD pathologies, and provide the basis for a mechanistic model for repeat-associated neurodegenerative diseases.
Røsby, O; Berg, K
2000-01-01
In order to search for factors influencing the Lp(a) lipoprotein level, we have examined the apolipoprotein(a) (apo(a)) size polymorphism as well as a pentanucleotide (TTTTA) repeat polymorphism in the 5' control region of the LPA gene. Lp(a) lipoprotein levels were compared between individuals with different genotypes as defined by pulsed field gel electrophoresis of DNA plugs, and PCR of DNA samples followed by polyacrylamide gel electrophoresis. DNA plugs and DNA were prepared from blood samples collected from blood donors. Twenty-seven different K IV repeat alleles were observed in the 71 women and 92 men from which apo(a) size polymorphism results were obtained. Alleles encoding 26-32 Kringle IV repeats were the most frequent. Alleles encoding seven to 11 TTTTA repeats were detected in the 84 women and 122 men included in the pentanucleotide polymorphism study, and homozygosity for eight TTTTA repeats was the most common genotype. The eight TTTTA repeat allele occurred with almost any apo(a) allele. An inverse relationship between number of K IV repeats and Lp(a) concentration was confirmed. The contributions of the apo(a) size polymorphism and the pentanucleotide repeat polymorphism to the interindividual variance of Lp(a) lipoprotein concentrations were 9.7 and 3.5%, respectively (type IV sum of squares). Nineteen per cent of the variance in Lp(a) lipoprotein level appeared to be the result of the multiplication product (interaction) between the apo(a) size polymorphism and the pentanucleotide repeat polymorphism. The contribution of the apo(a) size polymorphism alone to the variation in Lp(a) lipoprotein level was lower than previously reported. However, the multiplicative interaction effect between the K IV repeat polymorphism and the pentanucleotide repeat polymorphism may be an important factor explaining the variation in Lp(a) lipoprotein levels among the populations.
Wang, Rui; Li, Ming; Gong, Luyao; Hu, Songnian; Xiang, Hua
2016-01-01
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) acquire new spacers to generate adaptive immunity in prokaryotes. During spacer integration, the leader-preceded repeat is always accurately duplicated, leading to speculations of a repeat-length ruler. Here in Haloarcula hispanica, we demonstrate that the accurate duplication of its 30-bp repeat requires two conserved mid-repeat motifs, AACCC and GTGGG. The AACCC motif was essential and needed to be ∼10 bp downstream from the leader-repeat junction site, where duplication consistently started. Interestingly, repeat duplication terminated sequence-independently and usually with a specific distance from the GTGGG motif, which seemingly served as an anchor site for a molecular ruler. Accordingly, altering the spacing between the two motifs led to an aberrant duplication size (29, 31, 32 or 33 bp). We propose the adaptation complex may recognize these mid-repeat elements to enable measuring the repeat DNA for spacer integration. PMID:27085805
DNA mismatch repair complex MutSβ promotes GAA·TTC repeat expansion in human cells.
Halabi, Anasheh; Ditch, Scott; Wang, Jeffrey; Grabczyk, Ed
2012-08-24
While DNA repair has been implicated in CAG·CTG repeat expansion, its role in the GAA·TTC expansion of Friedreich ataxia (FRDA) is less clear. We have developed a human cellular model that recapitulates the DNA repeat expansion found in FRDA patient tissues. In this model, GAA·TTC repeats expand incrementally and continuously. We have previously shown that the expansion rate is linked to transcription within the repeats. Our working hypothesis is that structures formed within the GAA·TTC repeat during transcription attract DNA repair enzymes that then facilitate the expansion process. MutSβ, a heterodimer of MSH2 and MSH3, is known to have a role in CAG·CTG repeat expansion. We now show that shRNA knockdown of either MSH2 or MSH3 slowed GAA·TTC expansion in our system. We further characterized the role of MutSβ in GAA·TTC expansion using a functional assay in primary FRDA patient-derived fibroblasts. These fibroblasts have no known propensity for instability in their native state. Ectopic expression of MSH2 and MSH3 induced GAA·TTC repeat expansion in the native FXN gene. MSH2 is central to mismatch repair and its absence or reduction causes a predisposition to cancer. Thus, despite its essential role in GAA·TTC expansion, MSH2 is not an attractive therapeutic target. The absence or reduction of MSH3 is not strongly associated with cancer predisposition. Accordingly, MSH3 has been suggested as a therapeutic target for CAG·CTG repeat expansion disorders. Our results suggest that MSH3 may also serve as a therapeutic target to slow the expansion of GAA·TTC repeats in the future.
DNA Mismatch Repair Complex MutSβ Promotes GAA·TTC Repeat Expansion in Human Cells*
Halabi, Anasheh; Ditch, Scott; Wang, Jeffrey; Grabczyk, Ed
2012-01-01
While DNA repair has been implicated in CAG·CTG repeat expansion, its role in the GAA·TTC expansion of Friedreich ataxia (FRDA) is less clear. We have developed a human cellular model that recapitulates the DNA repeat expansion found in FRDA patient tissues. In this model, GAA·TTC repeats expand incrementally and continuously. We have previously shown that the expansion rate is linked to transcription within the repeats. Our working hypothesis is that structures formed within the GAA·TTC repeat during transcription attract DNA repair enzymes that then facilitate the expansion process. MutSβ, a heterodimer of MSH2 and MSH3, is known to have a role in CAG·CTG repeat expansion. We now show that shRNA knockdown of either MSH2 or MSH3 slowed GAA·TTC expansion in our system. We further characterized the role of MutSβ in GAA·TTC expansion using a functional assay in primary FRDA patient-derived fibroblasts. These fibroblasts have no known propensity for instability in their native state. Ectopic expression of MSH2 and MSH3 induced GAA·TTC repeat expansion in the native FXN gene. MSH2 is central to mismatch repair and its absence or reduction causes a predisposition to cancer. Thus, despite its essential role in GAA·TTC expansion, MSH2 is not an attractive therapeutic target. The absence or reduction of MSH3 is not strongly associated with cancer predisposition. Accordingly, MSH3 has been suggested as a therapeutic target for CAG·CTG repeat expansion disorders. Our results suggest that MSH3 may also serve as a therapeutic target to slow the expansion of GAA·TTC repeats in the future. PMID:22787155
Kar, Anirban; Jones, Nathan; Arat, N Özlem; Fishel, Richard; Griffith, Jack
2018-04-19
Conformations adopted by long stretches of single stranded DNA (ssDNA) are of central interest in understanding the architecture of replication forks, R loops, and other structures generated during DNA metabolism in vivo. This is particularly so if the ssDNA consists of short nucleotide repeats. Such studies have been hampered by the lack of defined substrates greater than ~150 nt, and the absence of high-resolution biophysical approaches. Here we describe the generation of very long ssDNA consisting of the mammalian telomeric repeat (5'-TTAGGG-3')n as well as the interrogation of its structure by electron microscopy (EM) and single molecule magnetic tweezers (smMT). This repeat is of particular interest as it contains a run of 3 contiguous guanine residues capable of forming G quartets as ssDNA. Fluorescent-dye exclusion assays confirmed that this G-strand ssDNA forms ubiquitous G-quadruplex folds. EM revealed thick bead-like filaments that condensed the DNA ~12 fold. The bead-like structures were 5 nm and 8 nm in diameter and linked by thin filaments. The G-strand ssDNA displayed initial stability to smMT force extension that ultimately released in steps that were multiples ~28 nm at forces between 6-12 pN; well below the >20 pN required to unravel G-quadruplexes. Most smMT steps were consistent with the disruption of the beads seen by EM. Binding by RAD51 distinctively altered the force extension properties of the G-strand ssDNA, suggesting a stochastic G-quadruplex-dependent condensation model that is discussed. Published under license by The American Society for Biochemistry and Molecular Biology, Inc.
Molecular characterization of the canine mitochondrial DNA control region for forensic applications.
Eichmann, Cordula; Parson, Walther
2007-09-01
The canine mitochondrial DNA (mtDNA) control region of 133 dogs living in the area around Innsbruck, Austria was sequenced. A total of 40 polymorphic sites were observed in the first hypervariable segment and 15 in the second, which resulted in the differentiation of 40 distinct haplotypes. We observed five nucleotide positions that were highly polymorphic within different haplogroups, and they represent good candidates for mtDNA screening. We found five point heteroplasmic positions; all located in HVS-I and a polythymine region in HVS-II, the latter often being associated with length heteroplasmy. In contrast to human mtDNA, the canine control region contains a hypervariable 10 nucleotide repeat region, which is located between the two hypervariable regions. In our population sample, we observed eight different repeat types, which we characterized by direct sequencing and fragment length analysis. The discrimination power of the canine mtDNA control region was 0.93, not taking the polymorphic repeat region into consideration.
STRBase: a short tandem repeat DNA database for the human identity testing community
Ruitberg, Christian M.; Reeder, Dennis J.; Butler, John M.
2001-01-01
The National Institute of Standards and Technology (NIST) has compiled and maintained a Short Tandem Repeat DNA Internet Database (http://www.cstl.nist.gov/biotech/strbase/) since 1997 commonly referred to as STRBase. This database is an information resource for the forensic DNA typing community with details on commonly used short tandem repeat (STR) DNA markers. STRBase consolidates and organizes the abundant literature on this subject to facilitate on-going efforts in DNA typing. Observed alleles and annotated sequence for each STR locus are described along with a review of STR analysis technologies. Additionally, commercially available STR multiplex kits are described, published polymerase chain reaction (PCR) primer sequences are reported, and validation studies conducted by a number of forensic laboratories are listed. To supplement the technical information, addresses for scientists and hyperlinks to organizations working in this area are available, along with the comprehensive reference list of over 1300 publications on STRs used for DNA typing purposes. PMID:11125125
Pilotte, Nils; Papaiakovou, Marina; Grant, Jessica R; Bierwert, Lou Ann; Llewellyn, Stacey; McCarthy, James S; Williams, Steven A
2016-03-01
The soil transmitted helminths are a group of parasitic worms responsible for extensive morbidity in many of the world's most economically depressed locations. With growing emphasis on disease mapping and eradication, the availability of accurate and cost-effective diagnostic measures is of paramount importance to global control and elimination efforts. While real-time PCR-based molecular detection assays have shown great promise, to date, these assays have utilized sub-optimal targets. By performing next-generation sequencing-based repeat analyses, we have identified high copy-number, non-coding DNA sequences from a series of soil transmitted pathogens. We have used these repetitive DNA elements as targets in the development of novel, multi-parallel, PCR-based diagnostic assays. Utilizing next-generation sequencing and the Galaxy-based RepeatExplorer web server, we performed repeat DNA analysis on five species of soil transmitted helminths (Necator americanus, Ancylostoma duodenale, Trichuris trichiura, Ascaris lumbricoides, and Strongyloides stercoralis). Employing high copy-number, non-coding repeat DNA sequences as targets, novel real-time PCR assays were designed, and assays were tested against established molecular detection methods. Each assay provided consistent detection of genomic DNA at quantities of 2 fg or less, demonstrated species-specificity, and showed an improved limit of detection over the existing, proven PCR-based assay. The utilization of next-generation sequencing-based repeat DNA analysis methodologies for the identification of molecular diagnostic targets has the ability to improve assay species-specificity and limits of detection. By exploiting such high copy-number repeat sequences, the assays described here will facilitate soil transmitted helminth diagnostic efforts. We recommend similar analyses when designing PCR-based diagnostic tests for the detection of other eukaryotic pathogens.
Villahermosa, Desirée; Christensen, Olaf; Knapp, Karen; Fleck, Oliver
2017-01-01
Defective mismatch repair (MMR) in humans is associated with colon cancer and instability of microsatellites, that is, DNA sequences with one or several nucleotides repeated. Key factors of eukaryotic MMR are the heterodimers MutSα (Msh2-Msh6), which recognizes base-base mismatches and unpaired nucleotides in DNA, and MutLα (Mlh1-Pms1), which facilitates downstream steps. In addition, MutSβ (Msh2-Msh3) recognizes DNA loops of various sizes, although our previous data and the data presented here suggest that Msh3 of Schizosaccharomyces pombe does not play a role in MMR. To test microsatellite stability in S. pombe and hence DNA loop repair, we have inserted tetra-, penta-, and hepta-nucleotide repeats in the ade6 gene and determined their Ade+ reversion rates and spectra in wild type and various mutants. Our data indicate that loops with four unpaired nucleotides in the nascent and the template strand are the upper limit of MutSα- and MutLα-mediated MMR in S. pombe. Stability of hepta-nucleotide repeats requires Msh3 and Exo1 in MMR-independent processes as well as the DNA repair proteins Rad50, Rad51, and Rad2FEN1. Most strikingly, mutation rates in the double mutants msh3 exo1 and msh3 rad51 were decreased when compared to respective single mutants, indicating that Msh3 prevents error prone processes carried out by Exo1 and Rad51. We conclude that Msh3 has no obvious function in MMR in S. pombe, but contributes to DNA repeat stability in MMR-independent processes. PMID:28341698
Villahermosa, Desirée; Christensen, Olaf; Knapp, Karen; Fleck, Oliver
2017-05-05
Defective mismatch repair (MMR) in humans is associated with colon cancer and instability of microsatellites, that is, DNA sequences with one or several nucleotides repeated. Key factors of eukaryotic MMR are the heterodimers MutSα (Msh2-Msh6), which recognizes base-base mismatches and unpaired nucleotides in DNA, and MutLα (Mlh1-Pms1), which facilitates downstream steps. In addition, MutSβ (Msh2-Msh3) recognizes DNA loops of various sizes, although our previous data and the data presented here suggest that Msh3 of Schizosaccharomyces pombe does not play a role in MMR. To test microsatellite stability in S. pombe and hence DNA loop repair, we have inserted tetra-, penta-, and hepta-nucleotide repeats in the ade6 gene and determined their Ade + reversion rates and spectra in wild type and various mutants. Our data indicate that loops with four unpaired nucleotides in the nascent and the template strand are the upper limit of MutSα- and MutLα-mediated MMR in S. pombe Stability of hepta-nucleotide repeats requires Msh3 and Exo1 in MMR-independent processes as well as the DNA repair proteins Rad50, Rad51, and Rad2 FEN1 Most strikingly, mutation rates in the double mutants msh3 exo1 and msh3 rad51 were decreased when compared to respective single mutants, indicating that Msh3 prevents error prone processes carried out by Exo1 and Rad51. We conclude that Msh3 has no obvious function in MMR in S. pombe , but contributes to DNA repeat stability in MMR-independent processes. Copyright © 2017 Villahermosa et al.
Hirata, Satoshi; Kojima, Kaname; Misawa, Kazuharu; Gervais, Olivier; Kawai, Yosuke; Nagasaki, Masao
2018-05-01
Forensic DNA typing is widely used to identify missing persons and plays a central role in forensic profiling. DNA typing usually uses capillary electrophoresis fragment analysis of PCR amplification products to detect the length of short tandem repeat (STR) markers. Here, we analyzed whole genome data from 1,070 Japanese individuals generated using massively parallel short-read sequencing of 162 paired-end bases. We have analyzed 843,473 STR loci with two to six basepair repeat units and cataloged highly polymorphic STR loci in the Japanese population. To evaluate the performance of the cataloged STR loci, we compared 23 STR loci, widely used in forensic DNA typing, with capillary electrophoresis based STR genotyping results in the Japanese population. Seventeen loci had high correlations and high call rates. The other six loci had low call rates or low correlations due to either the limitations of short-read sequencing technology, the bioinformatics tool used, or the complexity of repeat patterns. With these analyses, we have also purified the suitable 218 STR loci with four basepair repeat units and 53 loci with five basepair repeat units both for short read sequencing and PCR based technologies, which would be candidates to the actual forensic DNA typing in Japanese population.
Brouard, Jean-Simon; Otis, Christian; Lemieux, Claude; Turmel, Monique
2008-01-01
Background To gain insight into the branching order of the five main lineages currently recognized in the green algal class Chlorophyceae and to expand our understanding of chloroplast genome evolution, we have undertaken the sequencing of chloroplast DNA (cpDNA) from representative taxa. The complete cpDNA sequences previously reported for Chlamydomonas (Chlamydomonadales), Scenedesmus (Sphaeropleales), and Stigeoclonium (Chaetophorales) revealed tremendous variability in their architecture, the retention of only few ancestral gene clusters, and derived clusters shared by Chlamydomonas and Scenedesmus. Unexpectedly, our recent phylogenies inferred from these cpDNAs and the partial sequences of three other chlorophycean cpDNAs disclosed two major clades, one uniting the Chlamydomonadales and Sphaeropleales (CS clade) and the other uniting the Oedogoniales, Chaetophorales and Chaetopeltidales (OCC clade). Although molecular signatures provided strong support for this dichotomy and for the branching of the Oedogoniales as the earliest-diverging lineage of the OCC clade, more data are required to validate these phylogenies. We describe here the complete cpDNA sequence of Oedogonium cardiacum (Oedogoniales). Results Like its three chlorophycean homologues, the 196,547-bp Oedogonium chloroplast genome displays a distinctive architecture. This genome is one of the most compact among photosynthetic chlorophytes. It has an atypical quadripartite structure, is intron-rich (17 group I and 4 group II introns), and displays 99 different conserved genes and four long open reading frames (ORFs), three of which are clustered in the spacious inverted repeat of 35,493 bp. Intriguingly, two of these ORFs (int and dpoB) revealed high similarities to genes not usually found in cpDNA. At the gene content and gene order levels, the Oedogonium genome most closely resembles its Stigeoclonium counterpart. Characters shared by these chlorophyceans but missing in members of the CS clade include the retention of psaM, rpl32 and trnL(caa), the loss of petA, the disruption of three ancestral clusters and the presence of five derived gene clusters. Conclusion The Oedogonium chloroplast genome disclosed additional characters that bolster the evidence for a close alliance between the Oedogoniales and Chaetophorales. Our unprecedented finding of int and dpoB in this cpDNA provides a clear example that novel genes were acquired by the chloroplast genome through horizontal transfers, possibly from a mitochondrial genome donor. PMID:18558012
Lakshmanan, Lakshmi Narayanan; Gruber, Jan; Halliwell, Barry; Gunawan, Rudiyanto
2015-01-01
Non D-loop direct repeats (DRs) in mitochondrial DNA (mtDNA) have been commonly implicated in the mutagenesis of mtDNA deletions associated with neuromuscular disease and ageing. Further, these DRs have been hypothesized to put a constraint on the lifespan of mammals and are under a negative selection pressure. Using a compendium of 294 mammalian mtDNA, we re-examined the relationship between species lifespan and the mutagenicity of such DRs. Contradicting the prevailing hypotheses, we found no significant evidence that long-lived mammals possess fewer mutagenic DRs than short-lived mammals. By comparing DR counts in human mtDNA with those in selectively randomized sequences, we also showed that the number of DRs in human mtDNA is primarily determined by global mtDNA properties, such as the bias in synonymous codon usage (SCU) and nucleotide composition. We found that SCU bias in mtDNA positively correlates with DR counts, where repeated usage of a subset of codons leads to more frequent DR occurrences. While bias in SCU and nucleotide composition has been attributed to nucleotide mutational bias, mammalian mtDNA still exhibit higher SCU bias and DR counts than expected from such mutational bias, suggesting a lack of negative selection against non D-loop DRs. PMID:25855815
Broxson, Christopher; Beckett, Joshua; Tornaletti, Silvia
2011-05-17
Non canonical DNA structures correspond to genomic regions particularly susceptible to genetic instability. The transcription process facilitates formation of these structures and plays a major role in generating the instability associated with these genomic sites. However, little is known about how non canonical structures are processed when encountered by an elongating RNA polymerase. Here we have studied the behavior of T7 RNA polymerase (T7RNAP) when encountering a G quadruplex forming-(GGA)(4) repeat located in the human c-myb proto-oncogene. To make direct correlations between formation of the structure and effects on transcription, we have taken advantage of the ability of the T7 polymerase to transcribe single-stranded substrates and of G4 DNA to form in single-stranded G-rich sequences in the presence of potassium ions. Under physiological KCl concentrations, we found that T7 RNAP transcription was arrested at two sites that mapped to the c-myb (GGA)(4) repeat sequence. The extent of arrest did not change with time, indicating that the c-myb repeat represented an absolute block and not a transient pause to T7 RNAP. Consistent with G4 DNA formation, arrest was not observed in the absence of KCl or in the presence of LiCl. Furthermore, mutations in the c-myb (GGA)(4) repeat, expected to prevent transition to G4, also eliminated the transcription block. We show T7 RNAP arrest at the c-myb repeat in double-stranded DNA under conditions mimicking the cellular concentration of biomolecules and potassium ions, suggesting that the G4 structure formed in the c-myb repeat may represent a transcription roadblock in vivo. Our results support a mechanism of transcription-coupled DNA repair initiated by arrest of transcription at G4 structures.
Molecular Cytogenetic Analysis of Deschampsia antarctica Desv. (Poaceae), Maritime Antarctic.
Amosova, Alexandra V; Bolsheva, Nadezhda L; Samatadze, Tatiana E; Twardovska, Maryana O; Zoshchuk, Svyatoslav A; Andreev, Igor O; Badaeva, Ekaterina D; Kunakh, Viktor A; Muravenko, Olga V
2015-01-01
Deschampsia antarctica Desv. (Poaceae) (2n = 26) is one of the two vascular plants adapted to the harshest environment of the Antarctic. Although the species is a valuable model for study of environmental stress tolerance in plants, its karyotype is still poorly investigated. We firstly conducted a comprehensive molecular cytogenetic analysis of D. antarctica collected on four islands of the Maritime Antarctic. D. antarctica karyotypes were studied by Giemsa C- and DAPI/C-banding, Ag-NOR staining, multicolour fluorescence in situ hybridization with repeated DNA probes (pTa71, pTa794, telomere repeats, pSc119.2, pAs1) and the GAA simple sequence repeat probe. We also performed sequential rapid in situ hybridization with genomic DNA of D. caespitosa. Two chromosome pairs bearing transcriptionally active 45S rDNA loci and five pairs with 5S rDNA sites were detected. A weak intercalary site of telomere repeats was revealed on the largest chromosome in addition to telomere hybridization signals at terminal positions. This fact confirms indirectly the hypothesis that chromosome fusion might have been the cause of the unusual for cereals chromosome number in this species. Based on patterns of distribution of the examined molecular cytogenetic markers, all chromosomes in karyotypes were identified, and chromosome idiograms of D. antarctica were constructed. B chromosomes were found in most karyotypes of plants from Darboux Island. A mixoploid plant with mainly triploid cells bearing a Robertsonian rearrangement was detected among typical diploid specimens from Great Jalour Island. The karyotype variability found in D. antarctica is probably an expression of genome instability induced by environmental stress factors. The differences in C-banding patterns and in chromosome distribution of rDNA loci as well as homologous highly repeated DNA sequences detected between genomes of D. antarctica and its related species D. caespitosa indicate that genome reorganization involving coding and noncoding repeated DNA sequences had occurred during the divergence of these species.
Molecular Cytogenetic Analysis of Deschampsia antarctica Desv. (Poaceae), Maritime Antarctic
Amosova, Alexandra V.; Bolsheva, Nadezhda L.; Samatadze, Tatiana E.; Twardovska, Maryana O.; Zoshchuk, Svyatoslav A.; Andreev, Igor O.; Badaeva, Ekaterina D.; Kunakh, Viktor A.; Muravenko, Olga V.
2015-01-01
Deschampsia antarctica Desv. (Poaceae) (2n = 26) is one of the two vascular plants adapted to the harshest environment of the Antarctic. Although the species is a valuable model for study of environmental stress tolerance in plants, its karyotype is still poorly investigated. We firstly conducted a comprehensive molecular cytogenetic analysis of D. antarctica collected on four islands of the Maritime Antarctic. D. antarctica karyotypes were studied by Giemsa C- and DAPI/C-banding, Ag-NOR staining, multicolour fluorescence in situ hybridization with repeated DNA probes (pTa71, pTa794, telomere repeats, pSc119.2, pAs1) and the GAA simple sequence repeat probe. We also performed sequential rapid in situ hybridization with genomic DNA of D. caespitosa. Two chromosome pairs bearing transcriptionally active 45S rDNA loci and five pairs with 5S rDNA sites were detected. A weak intercalary site of telomere repeats was revealed on the largest chromosome in addition to telomere hybridization signals at terminal positions. This fact confirms indirectly the hypothesis that chromosome fusion might have been the cause of the unusual for cereals chromosome number in this species. Based on patterns of distribution of the examined molecular cytogenetic markers, all chromosomes in karyotypes were identified, and chromosome idiograms of D. antarctica were constructed. B chromosomes were found in most karyotypes of plants from Darboux Island. A mixoploid plant with mainly triploid cells bearing a Robertsonian rearrangement was detected among typical diploid specimens from Great Jalour Island. The karyotype variability found in D. antarctica is probably an expression of genome instability induced by environmental stress factors. The differences in C-banding patterns and in chromosome distribution of rDNA loci as well as homologous highly repeated DNA sequences detected between genomes of D. antarctica and its related species D. caespitosa indicate that genome reorganization involving coding and noncoding repeated DNA sequences had occurred during the divergence of these species. PMID:26394331
USDA-ARS?s Scientific Manuscript database
Marek’s disease (MD) is the leading cause of losses in chicken production in the world. Over the past 40 years significant progress has been made in the control of MD through the use of vaccines which reduce or delay tumor formation in vaccinated flocks. However, these vaccines fail to induce an imm...
Characterization of the complete chloroplast genome of Platycarya strobilacea (Juglandaceae)
Jing Yan; Kai Han; Shuyun Zeng; Peng Zhao; Keith Woeste; Jianfang Li; Zhan-Lin Liu
2017-01-01
The whole chloroplast genome (cp genome) sequence of Platycarya strobilacea was characterized from Illumina pair-end sequencing data. The complete cp genome was 160,994 bp in length and contained a large single copy region (LSC) of 90,225 bp and a small single copy region (SSC) of 18,371 bp, which were separated by a pair of inverted repeat regions...
Human Xq28 Inversion Polymorphism: From Sex Linkage to Genomics--A Genetic Mother Lode
ERIC Educational Resources Information Center
Kirby, Cait S.; Kolber, Natalie; Salih Almohaidi, Asmaa M.; Bierwert, Lou Ann; Saunders, Lori; Williams, Steven; Merritt, Robert
2016-01-01
An inversion polymorphism of the filamin and emerin genes at the tip of the long arm of the human X-chromosome serves as the basis of an investigative laboratory in which students learn something new about their own genomes. Long, nearly identical inverted repeats flanking the filamin and emerin genes illustrate how repetitive elements can lead to…
S Elements: A Family of Tc1-like Transposons in the Genome of Drosophila Melanogaster
Merriman, P. J.; Grimes, C. D.; Ambroziak, J.; Hackett, D. A.; Skinner, P.; Simmons, M. J.
1995-01-01
The S elements form a diverse family of long-inverted-repeat transposons within the genome of Drosophila melanogaster. These elements vary in size and sequence, the longest consisting of 1736 bp with 234-bp inverted terminal repeats. The longest open reading frame in an intact S element could encode a 345-amino acid polypeptide. This polypeptide is homologous to the transposases of the mariner-Tc1 superfamily of transposable elements. S elements are ubiquitous in D. melanogaster populations and also appear to be present in the genomes of two sibling species; however, they seem to be absent from 17 other Drosophila species that were examined. Within D. melanogaster strains, there are, on average, 37.4 cytologically detectable S elements per diploid genome. These elements are scattered throughout the chromosomes, but several sites in both the euchromatin and β heterochromatin are consistently occupied. The discovery of an S-element-insertion mutation and a reversion of this mutation indicates that S elements are at least occasionally mobile in the D. melanogaster genome. These elements seem to insert at an AT dinucleotide within a short palindrome and apparently duplicate that dinucleotide upon insertion. PMID:8601484
Genomic organization of the canine herpesvirus US region.
Haanes, E J; Tomlinson, C C
1998-02-01
Canine herpesvirus (CHV) is an alpha-herpesvirus of limited pathogenicity in healthy adult dogs and infectivity of the virus appears to be largely limited to cells of canine origin. CHV's low virulence and species specificity make it an attractive candidate for a recombinant vaccine vector to protect dogs against a variety of pathogens. As part of the analysis of the CHV genome, the authors determined the complete nucleotide sequence of the CHV US region as well as portions of the flanking inverted repeats. Seven full open reading frames (ORFs) encoding proteins larger than 100 amino acids were identified within, or partially within the CHV US: cUS2, cUS3, cUS4, cUS6, cUS7, cUS8 and cUS9; which are homologs of the herpes simplex virus type-1 US2; protein kinase; gG, gD, gI, gE; and US9 genes, respectively. An eighth ORF was identified in the inverted repeat region, cIR6, a homolog of the equine herpesvirus type-1 IR6 gene. The authors identified and mapped most of the major transcripts for the predicted CHV US ORFs by Northern analysis.
Begum, Rabeya; Zakrzewski, Falk; Menzel, Gerhard; Weber, Beatrice; Alam, Sheikh Shamimul; Schmidt, Thomas
2013-07-01
The cultivated jute species Corchorus olitorius and Corchorus capsularis are important fibre crops. The analysis of repetitive DNA sequences, comprising a major part of plant genomes, has not been carried out in jute but is useful to investigate the long-range organization of chromosomes. The aim of this study was the identification of repetitive DNA sequences to facilitate comparative molecular and cytogenetic studies of two jute cultivars and to develop a fluorescent in situ hybridization (FISH) karyotype for chromosome identification. A plasmid library was generated from C. olitorius and C. capsularis with genomic restriction fragments of 100-500 bp, which was complemented by targeted cloning of satellite DNA by PCR. The diversity of the repetitive DNA families was analysed comparatively. The genomic abundance and chromosomal localization of different repeat classes were investigated by Southern analysis and FISH, respectively. The cytosine methylation of satellite arrays was studied by immunolabelling. Major satellite repeats and retrotransposons have been identified from C. olitorius and C. capsularis. The satellite family CoSat I forms two undermethylated species-specific subfamilies, while the long terminal repeat (LTR) retrotransposons CoRetro I and CoRetro II show similarity to the Metaviridea of plant retroelements. FISH karyotypes were developed by multicolour FISH using these repetitive DNA sequences in combination with 5S and 18S-5·8S-25S rRNA genes which enable the unequivocal chromosome discrimination in both jute species. The analysis of the structure and diversity of the repeated DNA is crucial for genome sequence annotation. The reference karyotypes will be useful for breeding of jute and provide the basis for karyotyping homeologous chromosomes of wild jute species to reveal the genetic and evolutionary relationship between cultivated and wild Corchorus species.
Characterization of species-specific repeated DNA sequences from B. nigra.
Gupta, V; Lakshmisita, G; Shaila, M S; Jagannathan, V; Lakshmikumaran, M S
1992-07-01
The construction and characterization of two genome-specific recombinant DNA clones from B. nigra are described. Southern analysis showed that the two clones belong to a dispersed repeat family. They differ from each other in their length, distribution and sequence, though the average GC content is nearly the same (45%). These B genome-specific repeats have been used to analyse the phylogenetic relationships between cultivated and wild species of the family Brassicaceae.
Venieraki, Anastasia; Dimou, Maria; Vezyri, Eleni; Vamvakas, Alexandros; Katinaki, Pagona-Artemis; Chatzipavlidis, Iordanis; Tampakaki, Anastasia; Katinakis, Panagiotis
2014-01-01
The presence of nitrogen fixers within the genus Pseudomonas has been established and so far most isolated strains are phylogenetically affiliated to Pseudomonas stutzeri. A gene ortholog neighborhood analysis of the nitrogen fixation island (NFI) in four diazotrophic P. stutzeri strains and Pseudomonas azotifigens revealed that all are flanked by genes coding for cobalamin synthase (cobS) and glutathione peroxidise (gshP). The putative NFIs lack all the features characterizing a mobilizable genomic island. Nevertheless, bioinformatic analysis P. stutzeri DSM 4166 NFI demonstrated the presence of short inverted and/or direct repeats within both flanking regions. The other P. stutzeri strains carry only one set of repeats. The genetic diversity of eleven diazotrophic Pseudomonas isolates was also investigated. Multilocus sequence typing grouped nine isolates along with P. stutzeri and two isolates are grouped in a separate clade. A Rep-PCR fingerprinting analysis grouped the eleven isolates into four distinct genotypes. We also provided evidence that the putative NFI in our diazotrophic Pseudomonas isolates is flanked by cobS and gshP genes. Furthermore, we demonstrated that the putative NFI of Pseudomonas sp. Gr65 is flanked by inverted repeats identical to those found in P. stutzeri DSM 4166 and while the other P. stutzeri isolates harbor the repeats located in the intergenic region between cobS and glutaredoxin genes as in the case of P. stutzeri A1501. Taken together these data suggest that all putative NFIs of diazotrophic Pseudomonas isolates are anchored in an intergenic region between cobS and gshP genes and their flanking regions are designated by distinct repeats patterns. Moreover, the presence of almost identical NFIs in diazotrophic Pseudomonas strains isolated from distal geographical locations around the world suggested that this horizontal gene transfer event may have taken place early in the evolution. PMID:25251496
Venieraki, Anastasia; Dimou, Maria; Vezyri, Eleni; Vamvakas, Alexandros; Katinaki, Pagona-Artemis; Chatzipavlidis, Iordanis; Tampakaki, Anastasia; Katinakis, Panagiotis
2014-01-01
The presence of nitrogen fixers within the genus Pseudomonas has been established and so far most isolated strains are phylogenetically affiliated to Pseudomonas stutzeri. A gene ortholog neighborhood analysis of the nitrogen fixation island (NFI) in four diazotrophic P. stutzeri strains and Pseudomonas azotifigens revealed that all are flanked by genes coding for cobalamin synthase (cobS) and glutathione peroxidise (gshP). The putative NFIs lack all the features characterizing a mobilizable genomic island. Nevertheless, bioinformatic analysis P. stutzeri DSM 4166 NFI demonstrated the presence of short inverted and/or direct repeats within both flanking regions. The other P. stutzeri strains carry only one set of repeats. The genetic diversity of eleven diazotrophic Pseudomonas isolates was also investigated. Multilocus sequence typing grouped nine isolates along with P. stutzeri and two isolates are grouped in a separate clade. A Rep-PCR fingerprinting analysis grouped the eleven isolates into four distinct genotypes. We also provided evidence that the putative NFI in our diazotrophic Pseudomonas isolates is flanked by cobS and gshP genes. Furthermore, we demonstrated that the putative NFI of Pseudomonas sp. Gr65 is flanked by inverted repeats identical to those found in P. stutzeri DSM 4166 and while the other P. stutzeri isolates harbor the repeats located in the intergenic region between cobS and glutaredoxin genes as in the case of P. stutzeri A1501. Taken together these data suggest that all putative NFIs of diazotrophic Pseudomonas isolates are anchored in an intergenic region between cobS and gshP genes and their flanking regions are designated by distinct repeats patterns. Moreover, the presence of almost identical NFIs in diazotrophic Pseudomonas strains isolated from distal geographical locations around the world suggested that this horizontal gene transfer event may have taken place early in the evolution.
Chen, Xiaochen; Li, Qiushi; Li, Ying; Qian, Jun; Han, Jianping
2015-01-01
The chloroplast genome (cp genome) of Aconitum barbatum var. puberulum was sequenced using the third-generation sequencing platform based on the single-molecule real-time (SMRT) sequencing approach. To our knowledge, this is the first reported complete cp genome of Aconitum, and we anticipate that it will have great value for phylogenetic studies of the Ranunculaceae family. In total, 23,498 CCS reads and 20,685,462 base pairs were generated, the mean read length was 880 bp, and the longest read was 2,261 bp. Genome coverage of 100% was achieved with a mean coverage of 132× and no gaps. The accuracy of the assembled genome is 99.973%; the assembly was validated using Sanger sequencing of six selected genes from the cp genome. The complete cp genome of A. barbatum var. puberulum is 156,749 bp in length, including a large single-copy region of 87,630 bp and a small single-copy region of 16,941 bp separated by two inverted repeats of 26,089 bp. The cp genome contains 130 genes, including 84 protein-coding genes, 34 tRNA genes and eight rRNA genes. Four forward, five inverted and eight tandem repeats were identified. According to the SSR analysis, the longest poly structure is a 20-T repeat. Our results presented in this paper will facilitate the phylogenetic studies and molecular authentication on Aconitum.
RNA editing of non-coding RNA and its role in gene regulation.
Daniel, Chammiran; Lagergren, Jens; Öhman, Marie
2015-10-01
It has for a long time been known that repetitive elements, particularly Alu sequences in human, are edited by the adenosine deaminases acting on RNA, ADAR, family. The functional interpretation of these events has been even more difficult than that of editing events in coding sequences, but today there is an emerging understanding of their downstream effects. A surprisingly large fraction of the human transcriptome contains inverted Alu repeats, often forming long double stranded structures in RNA transcripts, typically occurring in introns and UTRs of protein coding genes. Alu repeats are also common in other primates, and similar inverted repeats can frequently be found in non-primates, although the latter are less prone to duplex formation. In human, as many as 700,000 Alu elements have been identified as substrates for RNA editing, of which many are edited at several sites. In fact, recent advancements in transcriptome sequencing techniques and bioinformatics have revealed that the human editome comprises at least a hundred million adenosine to inosine (A-to-I) editing sites in Alu sequences. Although substantial additional efforts are required in order to map the editome, already present knowledge provides an excellent starting point for studying cis-regulation of editing. In this review, we will focus on editing of long stem loop structures in the human transcriptome and how it can effect gene expression. Copyright © 2015 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.
Chen, Xiaochen; Li, Qiushi; Li, Ying; Qian, Jun; Han, Jianping
2015-01-01
The chloroplast genome (cp genome) of Aconitum barbatum var. puberulum was sequenced using the third-generation sequencing platform based on the single-molecule real-time (SMRT) sequencing approach. To our knowledge, this is the first reported complete cp genome of Aconitum, and we anticipate that it will have great value for phylogenetic studies of the Ranunculaceae family. In total, 23,498 CCS reads and 20,685,462 base pairs were generated, the mean read length was 880 bp, and the longest read was 2,261 bp. Genome coverage of 100% was achieved with a mean coverage of 132× and no gaps. The accuracy of the assembled genome is 99.973%; the assembly was validated using Sanger sequencing of six selected genes from the cp genome. The complete cp genome of A. barbatum var. puberulum is 156,749 bp in length, including a large single-copy region of 87,630 bp and a small single-copy region of 16,941 bp separated by two inverted repeats of 26,089 bp. The cp genome contains 130 genes, including 84 protein-coding genes, 34 tRNA genes and eight rRNA genes. Four forward, five inverted and eight tandem repeats were identified. According to the SSR analysis, the longest poly structure is a 20-T repeat. Our results presented in this paper will facilitate the phylogenetic studies and molecular authentication on Aconitum. PMID:25705213
Shukla, Sanjay K; Kislow, Jennifer; Briska, Adam; Henkhaus, John; Dykes, Colin
2009-09-01
Staphylococcus aureus is a highly versatile and evolving bacterium of great clinical importance. S. aureus can evolve by acquiring single nucleotide polymorphisms and mobile genetic elements and by recombination events. Identification and location of novel genomic elements in a bacterial genome are not straightforward, unless the whole genome is sequenced. Optical mapping is a new tool that creates a high-resolution, in situ ordered restriction map of a bacterial genome. These maps can be used to determine genomic organization and perform comparative genomics to identify genomic rearrangements, such as insertions, deletions, duplications, and inversions, compared to an in silico (virtual) restriction map of a known genome sequence. Using this technology, we report here the identification, approximate location, and characterization of a genetic inversion of approximately 500 kb of a DNA element between the NRS387 (USA800) and FPR3757 (USA300) strains. The presence of the inversion and location of its junction sites were confirmed by site-specific PCR and sequencing. At both the left and right junction sites in NRS387, an IS1181 element and a 73-bp sequence were identified as inverted repeats, which could explain the possible mechanism of the inversion event.
Estevez, Carlos; Villegas, Pedro
2006-06-01
Recombinant avian adeno-associated viruses coding for the LacZ gene were used to inoculate embryonating chicken eggs, to assess the usefulness of the system for the expression of a transgene in vivo. The results obtained indicate significantly higher levels of expression of the reporter gene at various time intervals in the embryos inoculated with the recombinant virus in comparison with the mock-inoculated controls. At the embryo level, significant differences were evident at 120 hr postinoculation; hatched chicks showed transgene expression up to 14 days of age. In a second experiment, different cell-line cultures were transfected with plasmids encoding for a reporter gene flanked by the avian adeno-associated virus inverted terminal repeats (ITR), either alone or in the presence of the major nonstructural proteins of the virus (Rep 78/68) to assess the ability of these proteins and DNA elements to enhance gene expression. Results indicate that the inclusion of the viral ITR alone or during coexpression of the Rep proteins significantly enhances the expression of the transgene in all cell lines tested, as evidenced by the detection of the beta-galacrosidase protein through chemiluminescence reactions and staining of transfected monolayers.
Precision platform for convex lens-induced confinement microscopy
NASA Astrophysics Data System (ADS)
Berard, Daniel; McFaul, Christopher M. J.; Leith, Jason S.; Arsenault, Adriel K. J.; Michaud, François; Leslie, Sabrina R.
2013-10-01
We present the conception, fabrication, and demonstration of a versatile, computer-controlled microscopy device which transforms a standard inverted fluorescence microscope into a precision single-molecule imaging station. The device uses the principle of convex lens-induced confinement [S. R. Leslie, A. P. Fields, and A. E. Cohen, Anal. Chem. 82, 6224 (2010)], which employs a tunable imaging chamber to enhance background rejection and extend diffusion-limited observation periods. Using nanopositioning stages, this device achieves repeatable and dynamic control over the geometry of the sample chamber on scales as small as the size of individual molecules, enabling regulation of their configurations and dynamics. Using microfluidics, this device enables serial insertion as well as sample recovery, facilitating temporally controlled, high-throughput measurements of multiple reagents. We report on the simulation and experimental characterization of this tunable chamber geometry, and its influence upon the diffusion and conformations of DNA molecules over extended observation periods. This new microscopy platform has the potential to capture, probe, and influence the configurations of single molecules, with dramatically improved imaging conditions in comparison to existing technologies. These capabilities are of immediate interest to a wide range of research and industry sectors in biotechnology, biophysics, materials, and chemistry.
Yu, Xuefei; Zheng, Wei; Bhat, Somanath; Aquilina, J. Andrew
2015-01-01
Bacillus sp. CDB3 possesses a novel eight-gene ars cluster (ars1, arsRYCDATorf7orf8) with some unusual features in regard to expression regulation. This study demonstrated that the cluster is a single operon but can also produce a short three-gene arsRYC transcript. A hairpin structure formed by internal inverted repeats between arsC and arsD was shown to diminish the expression of the full operon, thereby probably acting as a transcription attenuator. A degradation product of the arsRYC transcript was also identified. Electrophoretic mobility shift analysis demonstrated that ArsR interacts with the ars1 promoter forming a protein-DNA complex that could be impaired by arsenite. However, no interaction was detected between ArsD and the ars1 promoter, suggesting that the CDB3 ArsD protein may not play a regulatory role. Compared to other ars gene clusters, regulation of the Bacillus sp. CDB3 ars1 operon is more complex. It represents another example of specific mRNA degradation in the transporter gene region and possibly the first case of attenuator-mediated regulation of ars operons. PMID:26355338
Analysis of hairpin RNA transgene-induced gene silencing in Fusarium oxysporum
2013-01-01
Background Hairpin RNA (hpRNA) transgenes can be effective at inducing RNA silencing and have been exploited as a powerful tool for gene function analysis in many organisms. However, in fungi, expression of hairpin RNA transcripts can induce post-transcriptional gene silencing, but in some species can also lead to transcriptional gene silencing, suggesting a more complex interplay of the two pathways at least in some fungi. Because many fungal species are important pathogens, RNA silencing is a powerful technique to understand gene function, particularly when gene knockouts are difficult to obtain. We investigated whether the plant pathogenic fungus Fusarium oxysporum possesses a functional gene silencing machinery and whether hairpin RNA transcripts can be employed to effectively induce gene silencing. Results Here we show that, in the phytopathogenic fungus F. oxysporum, hpRNA transgenes targeting either a β-glucuronidase (Gus) reporter transgene (hpGus) or the endogenous gene Frp1 (hpFrp) did not induce significant silencing of the target genes. Expression analysis suggested that the hpRNA transgenes are prone to transcriptional inactivation, resulting in low levels of hpRNA and siRNA production. However, the hpGus RNA can be efficiently transcribed by promoters acquired either by recombination with a pre-existing, actively transcribed Gus transgene or by fortuitous integration near an endogenous gene promoter allowing siRNA production. These siRNAs effectively induced silencing of a target Gus transgene, which in turn appeared to also induce secondary siRNA production. Furthermore, our results suggested that hpRNA transcripts without poly(A) tails are efficiently processed into siRNAs to induce gene silencing. A convergent promoter transgene, designed to express poly(A)-minus sense and antisense Gus RNAs, without an inverted-repeat DNA structure, induced consistent Gus silencing in F. oxysporum. Conclusions These results indicate that F. oxysporum possesses functional RNA silencing machineries for siRNA production and target mRNA cleavage, but hpRNA transgenes may induce transcriptional self-silencing due to its inverted-repeat structure. Our results suggest that F. oxysporum possesses a similar gene silencing pathway to other fungi like fission yeast, and indicate a need for developing more effective RNA silencing technology for gene function studies in this fungal pathogen. PMID:23819794
Fernández-Tajes, Juan; Méndez, Josefina
2009-12-01
For a study of 5S ribosomal genes (rDNA) in the razor clam Ensis macha, the 5S rDNA region was amplified and sequenced. Two variants, so-called type I or short repeat (approximately 430 bp) and type II or long repeat (approximately 735 bp), appeared to be the main components of the 5S rDNA of this species. Their spacers differed markedly, both in length and nucleotide composition. The organization of the two variants was investigated by amplifying the genomic DNA with primers based on the sequence of the type I and type II spacers. PCR amplification products with primers EMLbF and EMSbR showed that the long and short repeats are associated within the same tandem array, suggesting an intermixed arrangement of both spacers. Nevertheless, amplifications carried out with inverse primers EMSinvF/R and EMLinvF/R revealed that some short and long repeats are contiguous in the same tandem array. This is the first report of the coexistence of two variable spacers in the same tandem array in bivalve mollusks.
Characterization of (CA)n microsatellite repeats from large-insert clones.
Litt, M; Browne, D
2001-05-01
The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit determination of sequences flanking the microsatellites. When cosmids or large-insert phage clones are used as primary sources of (CA)n repeat markers, they have traditionally been subcloned into plasmid vectors such as pUC18 or M13 mp 18/19 cloning vectors to obtain fragments of suitable size for DNA sequencing. This unit presents an alternative approach whereby a set of degenerate sequencing primers that anneal directly to (CA)n microsatellites can be used to determine sequences that are inaccessible with vector-derived primers. Because the primers anneal to the repeat and not to the vector, they can be used with subclones containing inserts of several kilobases and should, in theory, always give sequence in the regions directly flanking the repeat. Degeneracy at the 3 end of each of these primers prevents elongation of primers that have annealed out-of-register. The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit.
Melters, Daniël P; Bradnam, Keith R; Young, Hugh A; Telis, Natalie; May, Michael R; Ruby, J Graham; Sebra, Robert; Peluso, Paul; Eid, John; Rank, David; Garcia, José Fernando; DeRisi, Joseph L; Smith, Timothy; Tobias, Christian; Ross-Ibarra, Jeffrey; Korf, Ian; Chan, Simon W L
2013-01-30
Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.
2013-01-01
Background Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes. PMID:23363705
Genetic diversity at variable-number-tandem-repeat (VNTR) loci was examined in the common cattail, Typha latifolia (Typhaceae), using three synthetic DNA probes composed of tandemly repeated "core" sequences (GACA, GATA, and GCAC). The principal objectives of this investigation w...
Lin, Hai; Lin, Dong; Xiong, Xi-Sheng
2016-02-01
The purpose of this study was to investigate roles of human papillomavirus (HPV) infection and stathmin in sinonasal inverted papilloma (SNIP). HPV DNA detection was performed by the fluorescence-based polymerase chain reaction (PCR) method. Stathmin protein expression was investigated by the immunohistochemistry method and mRNA expression of stathmin, Kif2a, and cyclin D1 were assessed by real-time PCR in SNIP and control subjects. The positive rate of HPV DNA detected in SNIP was about 53.6% (15 of 28). Recurrent cases showed a higher rate of HPV infection compared with initial cases and higher Krouse stage (T3 + T4) cases showed higher rate of HPV infection than lower Krouse stage (T1 + T2) cases. Stronger expression of stathmin, Kif2a, and cyclin D1 were observed in SNIP, especially HPV(+) SNIP. HPV infection was closely associated with recurrence and progression of SNIP. Stathmin is a valuable prognostic marker and could be considered as a therapeutic target in patients with SNIP. © 2015 Wiley Periodicals, Inc.
Rubinson, Emily H.; Metz, Audrey H.; O'Quin, Jami; Eichman, Brandt F.
2013-01-01
Summary DNA glycosylases safeguard the genome by locating and excising chemically modified bases from DNA. AlkD is a recently discovered bacterial DNA glycosylase that removes positively charged methylpurines from DNA, and was predicted to adopt a protein fold distinct from other DNA repair proteins. The crystal structure of Bacillus cereus AlkD presented here shows that the protein is composed exclusively of helical HEAT-like repeats, which form a solenoid perfectly shaped to accommodate a DNA duplex on the concave surface. Structural analysis of the variant HEAT repeats in AlkD provides a rationale for how this protein scaffolding motif has been modified to bind DNA. We report 7mG excision and DNA binding activities of AlkD mutants, along with a comparison of alkylpurine DNA glycosylase structures. Together, these data provide important insight into the requirements for alkylation repair within DNA and suggest that AlkD utilizes a novel strategy to manipulate DNA in its search for alkylpurine bases. PMID:18585735
Modeling the Volcanic Source at Long Valley, CA, Using a Genetic Algorithm Technique
NASA Technical Reports Server (NTRS)
Tiampo, Kristy F.
1999-01-01
In this project, we attempted to model the deformation pattern due to the magmatic source at Long Valley caldera using a real-value coded genetic algorithm (GA) inversion similar to that found in Michalewicz, 1992. The project has been both successful and rewarding. The genetic algorithm, coded in the C programming language, performs stable inversions over repeated trials, with varying initial and boundary conditions. The original model used a GA in which the geophysical information was coded into the fitness function through the computation of surface displacements for a Mogi point source in an elastic half-space. The program was designed to invert for a spherical magmatic source - its depth, horizontal location and volume - using the known surface deformations. It also included the capability of inverting for multiple sources.
Taylor, J S; Breden, F
2000-01-01
The standard slipped-strand mispairing (SSM) model for the formation of variable number tandem repeats (VNTRs) proposes that a few tandem repeats, produced by chance mutations, provide the "raw material" for VNTR expansion. However, this model is unlikely to explain the formation of VNTRs with long motifs (e.g., minisatellites), because the likelihood of a tandem repeat forming by chance decreases rapidly as the length of the repeat motif increases. Phylogenetic reconstruction of the birth of a mitochondrial (mt) DNA minisatellite in guppies suggests that VNTRs with long motifs can form as a consequence of SSM at noncontiguous repeats. VNTRs formed in this manner have motifs longer than the noncontiguous repeat originally formed by chance and are flanked by one unit of the original, noncontiguous repeat. SSM at noncontiguous repeats can therefore explain the birth of VNTRs with long motifs and the "imperfect" or "short direct" repeats frequently observed adjacent to both mtDNA and nuclear VNTRs. PMID:10880490
C9orf72 Nucleotide Repeat Structures Initiate Molecular Cascades of Disease
Haeusler, Aaron R.; Donnelly, Christopher J.; Periz, Goran; Simko, Eric A.J.; Shaw, Patrick G.; Kim, Min-Sik; Maragakis, Nicholas J.; Troncoso, Juan C.; Pandey, Akhilesh; Sattler, Rita; Rothstein, Jeffrey D.; Wang, Jiou
2014-01-01
Summary A hexanucleotide repeat expansion (HRE), (GGGGCC)n, in C9orf72 is the most common genetic cause of the neurodegenerative diseases amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). Here we identify a molecular mechanism by which structural polymorphism of the HRE leads to ALS/FTD pathology and defects. The HRE forms DNA and RNA G-quadruplexes with distinct structures and promotes RNA•DNA hybrids (R-loops). The structural polymorphism causes a repeat length-dependent accumulation of transcripts aborted in the HRE region. These transcribed repeats bind to ribonucleoproteins in a conformationdependent manner. Specifically, nucleolin (NCL), an essential nucleolar protein, preferentially binds the HRE G-quadruplex, and patient cells show evidence of nucleolar stress. Our results demonstrate that distinct C9orf72 HRE structural polymorphism at both DNA and RNA levels initiates molecular cascades leading to ALS/FTD pathologies, and provide the basis for a mechanistic model for repeat-associated neurodegenerative diseases. PMID:24598541
Non-radioactive detection of trinucleotide repeat size variability.
Tomé, Stéphanie; Nicole, Annie; Gomes-Pereira, Mario; Gourdon, Genevieve
2014-03-06
Many human diseases are associated with the abnormal expansion of unstable trinucleotide repeat sequences. The mechanisms of trinucleotide repeat size mutation have not been fully dissected, and their understanding must be grounded on the detailed analysis of repeat size distributions in human tissues and animal models. Small-pool PCR (SP-PCR) is a robust, highly sensitive and efficient PCR-based approach to assess the levels of repeat size variation, providing both quantitative and qualitative data. The method relies on the amplification of a very low number of DNA molecules, through sucessive dilution of a stock genomic DNA solution. Radioactive Southern blot hybridization is sensitive enough to detect SP-PCR products derived from single template molecules, separated by agarose gel electrophoresis and transferred onto DNA membranes. We describe a variation of the detection method that uses digoxigenin-labelled locked nucleic acid probes. This protocol keeps the sensitivity of the original method, while eliminating the health risks associated with the manipulation of radiolabelled probes, and the burden associated with their regulation, manipulation and waste disposal.
Lee, Hae-Lim; Jansen, Robert K; Chumley, Timothy W; Kim, Ki-Joong
2007-05-01
The chloroplast (cp) DNA sequence of Jasminum nudiflorum (Oleaceae-Jasmineae) is completed and compared with the large single-copy region sequences from 6 related species. The cp genomes of the tribe Jasmineae (Jasminum and Menodora) show several distinctive rearrangements, including inversions, gene duplications, insertions, inverted repeat expansions, and gene and intron losses. The ycf4-psaI region in Jasminum section Primulina was relocated as a result of 2 overlapping inversions of 21,169 and 18,414 bp. The 1st, larger inversion is shared by all members of the Jasmineae indicating that it occurred in the common ancestor of the tribe. Similar rearrangements were also identified in the cp genome of Menodora. In this case, 2 fragments including ycf4 and rps4-trnS-ycf3 genes were moved by 2 additional inversions of 14 and 59 kb that are unique to Menodora. Other rearrangements in the Oleaceae are confined to certain regions of the Jasminum and Menodora cp genomes, including the presence of highly repeated sequences and duplications of coding and noncoding sequences that are inserted into clpP and between rbcL and psaI. These insertions are correlated with the loss of 2 introns in clpP and a serial loss of segments of accD. The loss of the accD gene and clpP introns in both the monocot family Poaceae and the eudicot family Oleaceae are clearly independent evolutionary events. However, their genome organization is surprisingly similar despite the distant relationship of these 2 angiosperm families.
Nie, Xiaojun; Lv, Shuzuo; Zhang, Yingxin; Du, Xianghong; Wang, Le; Biradar, Siddanagouda S; Tan, Xiufang; Wan, Fanghao; Weining, Song
2012-01-01
Crofton weed (Ageratina adenophora) is one of the most hazardous invasive plant species, which causes serious economic losses and environmental damages worldwide. However, the sequence resource and genome information of A. adenophora are rather limited, making phylogenetic identification and evolutionary studies very difficult. Here, we report the complete sequence of the A. adenophora chloroplast (cp) genome based on Illumina sequencing. The A. adenophora cp genome is 150, 689 bp in length including a small single-copy (SSC) region of 18, 358 bp and a large single-copy (LSC) region of 84, 815 bp separated by a pair of inverted repeats (IRs) of 23, 755 bp. The genome contains 130 unique genes and 18 duplicated in the IR regions, with the gene content and organization similar to other Asteraceae cp genomes. Comparative analysis identified five DNA regions (ndhD-ccsA, psbI-trnS, ndhF-ycf1, ndhI-ndhG and atpA-trnR) containing parsimony-informative characters higher than 2%, which may be potential informative markers for barcoding and phylogenetic analysis. Repeat structure, codon usage and contraction of the IR were also investigated to reveal the pattern of evolution. Phylogenetic analysis demonstrated a sister relationship between A. adenophora and Guizotia abyssinica and supported a monophyly of the Asterales. We have assembled and analyzed the chloroplast genome of A. adenophora in this study, which was the first sequenced plastome in the Eupatorieae tribe. The complete chloroplast genome information is useful for plant phylogenetic and evolutionary studies within this invasive species and also within the Asteraceae family.
Distribution and sequence homogeneity of an abundant satellite DNA in the beetle, Tenebrio molitor.
Davis, C A; Wyatt, G R
1989-01-01
The mealworm beetle, Tenebrio molitor, contains an unusually abundant and homogeneous satellite DNA which constitutes up to 60% of its genome. The satellite DNA is shown to be present in all of the chromosomes by in situ hybridization. 18 dimers of the repeat unit were cloned and sequenced. The consensus sequence is 142 nt long and lacks any internal repeat structure. Monomers of the sequence are very similar, showing on average a 2% divergence from the calculated consensus. Variant nucleotides are scattered randomly throughout the sequence although some variants are more common than others. Neighboring repeat units are no more alike than randomly chosen ones. The results suggest that some mechanism, perhaps gene conversion, is acting to maintain the homogeneity of the satellite DNA despite its abundance and distribution on all of the chromosomes. Images PMID:2762148
Ha, Sung Chul; Choi, Jongkeun; Hwang, Hye-Yeon; Rich, Alexander; Kim, Yang-Gyun; Kim, Kyeong Kyu
2009-02-01
The Z-DNA conformation preferentially occurs at alternating purine-pyrimidine repeats, and is specifically recognized by Z alpha domains identified in several Z-DNA-binding proteins. The binding of Z alpha to foreign or chromosomal DNA in various sequence contexts is known to influence various biological functions, including the DNA-mediated innate immune response and transcriptional modulation of gene expression. For these reasons, understanding its binding mode and the conformational diversity of Z alpha bound Z-DNAs is of considerable importance. However, structural studies of Z alpha bound Z-DNA have been mostly limited to standard CG-repeat DNAs. Here, we have solved the crystal structures of three representative non-CG repeat DNAs, d(CACGTG)(2), d(CGTACG)(2) and d(CGGCCG)(2) complexed to hZ alpha(ADAR1) and compared those structures with that of hZ alpha(ADAR1)/d(CGCGCG)(2) and the Z alpha-free Z-DNAs. hZ alpha(ADAR1) bound to each of the three Z-DNAs showed a well conserved binding mode with very limited structural deviation irrespective of the DNA sequence, although varying numbers of residues were in contact with Z-DNA. Z-DNAs display less structural alterations in the Z alpha-bound state than in their free form, thereby suggesting that conformational diversities of Z-DNAs are restrained by the binding pocket of Z alpha. These data suggest that Z-DNAs are recognized by Z alpha through common conformational features regardless of the sequence and structural alterations.
Coordinated DNA dynamics during the human telomerase catalytic cycle
NASA Astrophysics Data System (ADS)
Parks, Joseph W.; Stone, Michael D.
2014-06-01
The human telomerase reverse transcriptase (hTERT) utilizes a template within the integral RNA subunit (hTR) to direct extension of telomeres. Telomerase exhibits repeat addition processivity (RAP) and must therefore translocate the nascent DNA product into a new RNA:DNA hybrid register to prime each round of telomere repeat synthesis. Here, we use single-molecule FRET and nuclease protection assays to monitor telomere DNA structure and dynamics during the telomerase catalytic cycle. DNA translocation during RAP proceeds through a previously uncharacterized kinetic substep during which the 3‧-end of the DNA substrate base pairs downstream within the hTR template. The rate constant for DNA primer realignment reveals this step is not rate limiting for RAP, suggesting a second slow conformational change repositions the RNA:DNA hybrid into the telomerase active site and drives the extrusion of the 5‧-end of the DNA primer out of the enzyme complex.
NASA Astrophysics Data System (ADS)
Li, Qi; Akihiro, Kijima
2007-01-01
The microsatellite-enriched library was constructed using magnetic bead hybridization selection method, and the microsatellite DNA sequences were analyzed in Pacific abalone Haliotis discus hannai. Three hundred and fifty white colonies were screened using PCR-based technique, and 84 clones were identified to potentially contain microsatellite repeat motif. The 84 clones were sequenced, and 42 microsatellites and 4 minisatellites with a minimum of five repeats were found (13.1% of white colonies screened). Besides the motif of CA contained in the oligoprobe, we also found other 16 types of microsatellite repeats including a dinucleotide repeat, two tetranucleotide repeats, twelve pentanucleotide repeats and a hexanucleotide repeat. According to Weber (1990), the microsatellite sequences obtained could be categorized structurally into perfect repeats (73.3%), imperfect repeats (13.3%), and compound repeats (13.4%). Among the microsatellite repeats, relatively short arrays (<20 repeats) were most abundant, accounting for 75.0%. The largest length of microsatellites was 48 repeats, and the average number of repeats was 13.4. The data on the composition and length distribution of microsatellites obtained in the present study can be useful for choosing the repeat motifs for microsatellite isolation in other abalone species.
Bhatia, S; Singh Negi, M; Lakshmikumaran, M
1996-11-01
EcoRI restriction of the B. nigra rDNA recombinants, isolated from a lambda genomic library, showed that the 3.9-kb fragment corresponded to the Intergenic Spacer (IGS), which was sequenced and found to be 3,928 bp in size. Sequence and dot-matrix analyses showed that the organization of the B. nigra rDNA IGS was typical of most rDNA spacers, consisting of a central repetitive region and flanking unique sequences on either side. The repetitive region was composed of two repeat families-RF 'A' and RF 'B.' The B. nigra RF 'A' consisted of a tandem array of three full-length copies of a 106-bp sequence element. RF 'B' was composed of 66 tandemly repeated elements. Each 'B' element was only 21-bp in size and this is the smallest repeat unit identified in plant rDNA to date. The putative transcription initiation site (TIS) was identified as nucleotide position 3,110. Based on the sequence analysis it was suggested that the present organization of the repeat families was generated by successive cycles of deletions and amplifications and was being maintained by homogenization processes such as gene conversion and crossing-over.A detailed comparison of the rDNA IGS sequences of the three diploid Brassica species-namely, B. nigra, B. campestris, and B. oleracea-was carried out. First, comparisons revealed that B. campestris and B. oleracea were close to each other as the repeat families in both showed high sequence homology between each other. Second, the repeat elements in both the species were organized in an interspersed manner. Third, a 52-bp sequence, present just downstream of the repeats in B. campestris, was found to be identical to the B. oleracea repeats, thereby suggesting a common progenitor. On the other hand, in B. nigra no interspersion pattern of organization of repeats was observed. Further, the B. nigra RF 'A' was identified as distinct from the repeat families of B. campestris and B. oleracea. Based on this analysis, it was suggested that during speciation B. campestris and B. oleracea evolved in one lineage whereas B. nigra diverged into a separate lineage. The comparative analysis of the IGS helped in identifying not only conserved ancestral sequence motifs of possible functional significance such as promoters and enhancers, but also sequences which showed variation between the three diploid species and were therefore identified as species-specific sequences.
APE1 incision activity at abasic sites in tandem repeat sequences.
Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M
2014-05-29
Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.
Two synthetic tandem repetitive DNA probes were used to compare genetic variation at variable-number-tandem-repeat (VNTR) loci among Rubus idaeus L. var. strigosus (Michx.) Maxim. (Rosaceae) individuals sampled at eight sites contaminated by pollutants (N = 39) and eight adjacent...
Hall, Amanda C.; Ostrowski, Lauren A.; Mekhail, Karim
2017-01-01
ABSTRACT Cells have evolved intricate mechanisms to maintain genome stability despite allowing mutational changes to drive evolutionary adaptation. Repetitive DNA sequences, which represent the bulk of most genomes, are a major threat to genome stability often driving chromosome rearrangements and disease. The major source of repetitive DNA sequences and thus the most vulnerable constituents of the genome are the rDNA (rDNA) repeats, telomeres, and transposable elements. Maintaining the stability of these loci is critical to overall cellular fitness and lifespan. Therefore, cells have evolved mechanisms to regulate rDNA copy number, telomere length and transposon activity, as well as DNA repair at these loci. In addition, non-canonical structure-forming DNA motifs can also modulate the function of these repetitive DNA loci by impacting their transcription, replication, and stability. Here, we discuss key mechanisms that maintain rDNA repeats, telomeres, and transposons in yeast and human before highlighting emerging roles for non-canonical DNA structures at these repetitive loci. PMID:28406751
Comparative Genomics and Phylogenomics of East Asian Tulips (Amana, Liliaceae)
Li, Pan; Lu, Rui-Sen; Xu, Wu-Qin; Ohi-Toma, Tetsuo; Cai, Min-Qi; Qiu, Ying-Xiong; Cameron, Kenneth M.; Fu, Cheng-Xin
2017-01-01
The genus Amana Honda (Liliaceae), when it is treated as separate from Tulipa, comprises six perennial herbaceous species that are restricted to China, Japan and the Korean Peninsula. Although all six Amana species have important medicinal and horticultural uses, studies focused on species identification and molecular phylogenetics are few. Here we report the nucleotide sequences of six complete Amana chloroplast (cp) genomes. The cp genomes of Amana range from 150,613 bp to 151,136 bp in length, all including a pair of inverted repeats (25,629–25,859 bp) separated by the large single-copy (81,482–82,218 bp) and small single-copy (17,366–17,465 bp) regions. Each cp genome equivalently contains 112 unique genes consisting of 30 transfer RNA genes, four ribosomal RNA genes, and 78 protein coding genes. Gene content, gene order, AT content, and IR/SC boundary structure are nearly identical among all Amana cp genomes. However, the relative contraction and expansion of the IR/SC borders among the six Amana cp genomes results in length variation among them. Simple sequence repeat (SSR) analyses of these Amana cp genomes indicate that the richest SSRs are A/T mononucleotides. The number of repeats among the six Amana species varies from 54 (A. anhuiensis) to 69 (Amana kuocangshanica) with palindromic (28–35) and forward repeats (23–30) as the most common types. Phylogenomic analyses based on these complete cp genomes and 74 common protein-coding genes strongly support the monophyly of the genus, and a sister relationship between Amana and Erythronium, rather than a shared common ancestor with Tulipa. Nine DNA markers (rps15–ycf1, accD–psaI, petA–psbJ, rpl32–trnL, atpH–atpI, petD–rpoA, trnS–trnG, psbM–trnD, and ycf4–cemA) with number of variable sites greater than 0.9% were identified, and these may be useful for future population genetic and phylogeographic studies of Amana species. PMID:28421090
Majeský, Ľuboš; Schwarzacher, Trude; Gornall, Richard; Heslop-Harrison, Pat
2017-01-01
Chloroplast DNA sequences show substantial variation between higher plant species, and less variation within species, so are typically excellent markers to investigate evolutionary, population and genetic relationships and phylogenies. We sequenced the plastomes of Taraxacum obtusifrons Markl. (O978); T. stridulum Trávniček ined. (S3); and T. amplum Markl. (A978), three apomictic triploid (2n = 3x = 24) dandelions from the T. officinale agg. We aimed to characterize the variation in plastomes, define relationships and correlations with the apomictic microspecies status, and refine placement of the microspecies in the evolutionary or phylogenetic context of the Asteraceae. The chloroplast genomes of accessions O978 and S3 were identical and 151,322 bp long (where the nuclear genes are known to show variation), while A978 was 151,349 bp long. All three genomes contained 135 unique genes, with an additional copy of the trnF-GGA gene in the LSC region and 20 duplicated genes in the IR region, along with short repeats, the typical major Inverted Repeats (IR1 and IR2, 24,431bp long), and Large and Small Single Copy regions (LSC 83,889bp and SSC 18,571bp in O978). Between the two Taraxacum plastomes types, we identified 28 SNPs. The distribution of polymorphisms suggests some parts of the Taraxacum plastome are evolving at a slower rate. There was a hemi-nested inversion in the LSC region that is common to Asteraceae, and an SSC inversion from ndhF to rps15 found only in some Asteraceae lineages. A comparative repeat analysis showed variation between Taraxacum and the phylogenetically close genus Lactuca, with many more direct repeats of 40bp or more in Lactuca (1% larger plastome than Taraxacum). When individual genes and non-coding regions were for Asteraceae phylogeny reconstruction, not all showed the same evolutionary scenario suggesting care is needed for interpretation of relationships if a limited number of markers are used. Studying genotypic diversity in plastomes is important to characterize the nature of evolutionary processes in nuclear and cytoplasmic genomes with the different selection pressures, population structures and breeding systems. PMID:28182646
M Salih, Rubar Hussein; Majeský, Ľuboš; Schwarzacher, Trude; Gornall, Richard; Heslop-Harrison, Pat
2017-01-01
Chloroplast DNA sequences show substantial variation between higher plant species, and less variation within species, so are typically excellent markers to investigate evolutionary, population and genetic relationships and phylogenies. We sequenced the plastomes of Taraxacum obtusifrons Markl. (O978); T. stridulum Trávniček ined. (S3); and T. amplum Markl. (A978), three apomictic triploid (2n = 3x = 24) dandelions from the T. officinale agg. We aimed to characterize the variation in plastomes, define relationships and correlations with the apomictic microspecies status, and refine placement of the microspecies in the evolutionary or phylogenetic context of the Asteraceae. The chloroplast genomes of accessions O978 and S3 were identical and 151,322 bp long (where the nuclear genes are known to show variation), while A978 was 151,349 bp long. All three genomes contained 135 unique genes, with an additional copy of the trnF-GGA gene in the LSC region and 20 duplicated genes in the IR region, along with short repeats, the typical major Inverted Repeats (IR1 and IR2, 24,431bp long), and Large and Small Single Copy regions (LSC 83,889bp and SSC 18,571bp in O978). Between the two Taraxacum plastomes types, we identified 28 SNPs. The distribution of polymorphisms suggests some parts of the Taraxacum plastome are evolving at a slower rate. There was a hemi-nested inversion in the LSC region that is common to Asteraceae, and an SSC inversion from ndhF to rps15 found only in some Asteraceae lineages. A comparative repeat analysis showed variation between Taraxacum and the phylogenetically close genus Lactuca, with many more direct repeats of 40bp or more in Lactuca (1% larger plastome than Taraxacum). When individual genes and non-coding regions were for Asteraceae phylogeny reconstruction, not all showed the same evolutionary scenario suggesting care is needed for interpretation of relationships if a limited number of markers are used. Studying genotypic diversity in plastomes is important to characterize the nature of evolutionary processes in nuclear and cytoplasmic genomes with the different selection pressures, population structures and breeding systems.
Low abundance of microsatellite repeats in the genome of the Brown-headed Cowbird (Molothrus ater)
Longmire, Jonathan L.; Hahn, D.C.; Roach, J.L.
1999-01-01
A cosmid library made from brown-headed cowbird (Molothrus ater) DNA was examined for representation of 17 distinct microsatellite motifs including all possible mono-, di-, and trinucleotide microsatellites, and the tetranucleotide repeat (GATA)n. The overall density of microsatellites within cowbird DNA was found to be one repeat per 89 kb and the frequency of the most abundant motif, (AGC)n, was once every 382 kb. The abundance of microsatellites within the cowbird genome is estimated to be reduced approximately 15-fold compared to humans. The reduced frequency of microsatellites seen in this study is consistent with previous observations indicating reduced numbers of microsatellites and other interspersed repeats in avian DNA. In addition to providing new information concerning the abundance of microsatellites within an avian genome, these results provide useful insights for selecting cloning strategies that might be used in the development of locus-specific microsatellite markers for avian studies.
Expression of human papillomavirus 6 in inverted papilloma arising in a renal transplant recipient.
Harris, M O; Beck, J C; Terrell, J E; McClatchey, K D; Carey, T E; Bradford, C R
1998-01-01
A 36-year-old renal transplant recipient taking cyclosporin A presented with bilateral nasal polypoid lesions involving the nasal septum and lateral nasal walls. Pathologic findings from surgical excision demonstrated inverted papilloma (IP) with focal atypia and mild dysplasia. DNA extracted from the tissue was tested with the polymerase chain reaction (PCR) using human papillomavirus (HPV) E6 and L1 consensus primers. This revealed amplification of the expected size fragment consistent with the presence of HPV DNA. Hybridization of PCR products with HPV type-specific oligonucleotide probes revealed a strong signal with only HPV 6. This result was confirmed by PCR amplification with HPV 6 type-specific primers. RNA extracted from the tissue was subjected to reverse transcription PCR (RT-PCR) with a primer pair specific for viral E6/E7 transcripts. The HPV early proteins, E6 and E7, are the transforming proteins implicated as critical for tumorigenesis. RT-PCR experiments generated products representing the E1/E4 spliced transcript originating from the E6/E6 promoter and a smaller unclassified fragment. These results provide evidence for HPV 6 E6/E7 expression in IP, lending credence to the concept that HPV may play a role in the origin of this neoplasm. Histologically normal nasal tissue from the same patient contained HPV DNA and similar transcripts to those described in the IP specimen.
Lednicky, J; Folk, W R
1992-01-01
The 21-bp repeat region of simian virus 40 (SV40) activates viral transcription and DNA replication and contains binding sites for many cellular proteins, including Sp1, LSF, ETF, Ap2, Ap4, GT-1B, H16, and p53, and for the SV40 large tumor antigen. We have attempted to reduce the complexity of this region while maintaining its growth-promoting capacity. Deletion of the 21-bp repeat region from the SV40 genome delays the expression of viral early proteins and DNA replication and reduces virus production in CV-1 cells. Replacement of the 21-bp repeat region with two copies of DNA sequence motifs bound with high affinities by Sp1 promotes SV40 growth in CV-1 cells to nearly wild-type levels, but substitution by motifs bound less avidly by Sp1 or bound by other activator proteins does not restore growth. This indicates that Sp1 or a protein with similar sequence specificity is primarily responsible for the function of the 21-bp repeat region. We speculate about how Sp1 activates both SV40 transcription and DNA replication. Images PMID:1328672
Quantitative analysis of TALE-DNA interactions suggests polarity effects.
Meckler, Joshua F; Bhakta, Mital S; Kim, Moon-Soo; Ovadia, Robert; Habrian, Chris H; Zykovich, Artem; Yu, Abigail; Lockwood, Sarah H; Morbitzer, Robert; Elsäesser, Janett; Lahaye, Thomas; Segal, David J; Baldwin, Enoch P
2013-04-01
Transcription activator-like effectors (TALEs) have revolutionized the field of genome engineering. We present here a systematic assessment of TALE DNA recognition, using quantitative electrophoretic mobility shift assays and reporter gene activation assays. Within TALE proteins, tandem 34-amino acid repeats recognize one base pair each and direct sequence-specific DNA binding through repeat variable di-residues (RVDs). We found that RVD choice can affect affinity by four orders of magnitude, with the relative RVD contribution in the order NG > HD ≈ NN > NI > NK. The NN repeat preferred the base G over A, whereas the NK repeat bound G with 10(3)-fold lower affinity. We compared AvrBs3, a naturally occurring TALE that recognizes its target using some atypical RVD-base combinations, with a designed TALE that precisely matches 'standard' RVDs with the target bases. This comparison revealed unexpected differences in sensitivity to substitutions of the invariant 5'-T. Another surprising observation was that base mismatches at the 5' end of the target site had more disruptive effects on affinity than those at the 3' end, particularly in designed TALEs. These results provide evidence that TALE-DNA recognition exhibits a hitherto un-described polarity effect, in which the N-terminal repeats contribute more to affinity than C-terminal ones.
The Genome of Melanoplus sanguinipes Entomopoxvirus
Afonso, C. L.; Tulman, E. R.; Lu, Z.; Oma, E.; Kutish, G. F.; Rock, D. L.
1999-01-01
The family Poxviridae contains two subfamilies: the Entomopoxvirinae (poxviruses of insects) and the Chordopoxvirinae (poxviruses of vertebrates). Here we present the first characterization of the genome of an entomopoxvirus (EPV) which infects the North American migratory grasshopper Melanoplus sanguinipes and other important orthopteran pests. The 236-kbp M. sanguinipes EPV (MsEPV) genome consists of a central coding region bounded by 7-kbp inverted terminal repeats and contains 267 open reading frames (ORFs), of which 107 exhibit similarity to previously described genes. The presence of genes not previously described in poxviruses, and in some cases in any other known virus, suggests significant viral adaptation to the arthropod host and the external environment. Genes predicting interactions with host cellular mechanisms include homologues of the inhibitor of apoptosis protein, stress response protein phosphatase 2C, extracellular matrixin metalloproteases, ubiquitin, calcium binding EF-hand protein, glycosyltransferase, and a triacylglyceride lipase. MsEPV genes with putative functions in prevention and repair of DNA damage include a complete base excision repair pathway (uracil DNA glycosylase, AP endonuclease, DNA polymerase β, and an NAD+-dependent DNA ligase), a photoreactivation repair pathway (cyclobutane pyrimidine dimer photolyase), a LINE-type reverse transcriptase, and a mutT homologue. The presence of these specific repair pathways may represent viral adaptation for repair of environmentally induced DNA damage. The absence of previously described poxvirus enzymes involved in nucleotide metabolism and the presence of a novel thymidylate synthase homologue suggest that MsEPV is heavily reliant on host cell nucleotide pools and the de novo nucleotide biosynthesis pathway. MsEPV and lepidopteran genus B EPVs lack genome colinearity and exhibit a low level of amino acid identity among homologous genes (20 to 59%), perhaps reflecting a significant evolutionary distance between lepidopteran and orthopteran viruses. Divergence between MsEPV and the Chordopoxvirinae is indicated by the presence of only 49 identifiable chordopoxvirus homologues, low-level amino acid identity among these genes (20 to 48%), and the presence in MsEPV of 43 novel ORFs in five gene families. Genes common to both poxvirus subfamilies, which include those encoding enzymes involved in RNA transcription and modification, DNA replication, protein processing, virion assembly, and virion structural proteins, define the genetic core of the Poxviridae. PMID:9847359
Rebrikov, Denis V; Bulina, Maria E; Bogdanova, Ekaterina A; Vagner, Loura L; Lukyanov, Sergey A
2002-01-01
Background Freshwater planarians are widely used as models for investigation of pattern formation and studies on genetic variation in populations. Despite extensive information on the biology and genetics of planaria, the occurrence and distribution of viruses in these animals remains an unexplored area of research. Results Using a combination of Suppression Subtractive Hybridization (SSH) and Mirror Orientation Selection (MOS), we compared the genomes of two strains of freshwater planarian, Girardia tigrina. The novel extrachromosomal DNA-containing virus-like element denoted PEVE (Planarian Extrachromosomal Virus-like Element) was identified in one planarian strain. The PEVE genome (about 7.5 kb) consists of two unique regions (Ul and Us) flanked by inverted repeats. Sequence analyses reveal that PEVE comprises two helicase-like sequences in the genome, of which the first is a homolog of a circoviral replication initiator protein (Rep), and the second is similar to the papillomavirus E1 helicase domain. PEVE genome exists in at least two variant forms with different arrangements of single-stranded and double-stranded DNA stretches that correspond to the Us and Ul regions. Using PCR analysis and whole-mount in situ hybridization, we characterized PEVE distribution and expression in the planarian body. Conclusions PEVE is the first viral element identified in free-living flatworms. This element differs from all known viruses and viral elements, and comprises two potential helicases that are homologous to proteins from distant viral phyla. PEVE is unevenly distributed in the worm body, and is detected in specific parenchyma cells. PMID:12065025
Isolation of human simple repeat loci by hybridization selection.
Armour, J A; Neumann, R; Gobert, S; Jeffreys, A J
1994-04-01
We have isolated short tandem repeat arrays from the human genome, using a rapid method involving filter hybridization to enrich for tri- or tetranucleotide tandem repeats. About 30% of clones from the enriched library cross-hybridize with probes containing trimeric or tetrameric tandem arrays, facilitating the rapid isolation of large numbers of clones. In an initial analysis of 54 clones, 46 different tandem arrays were identified. Analysis of these tandem repeat loci by PCR showed that 24 were polymorphic in length; substantially higher levels of polymorphism were displayed by the tetrameric repeat loci isolated than by the trimeric repeats. Primary mapping of these loci by linkage analysis showed that they derive from 17 chromosomes, including the X chromosome. We anticipate the use of this strategy for the efficient isolation of tandem repeats from other sources of genomic DNA, including DNA from flow-sorted chromosomes, and from other species.
Characterization of proviruses cloned from mink cell focus-forming virus-infected cellular DNA.
Khan, A S; Repaske, R; Garon, C F; Chan, H W; Rowe, W P; Martin, M A
1982-01-01
Two proviruses were cloned from EcoRI-digested DNA extracted from mink cells chronically infected with AKR mink cell focus-forming (MCF) 247 murine leukemia virus (MuLV), using a lambda phage host vector system. One cloned MuLV DNA fragment (designated MCF 1) contained sequences extending 6.8 kilobases from an EcoRI restriction site in the 5' long terminal repeat (LTR) to an EcoRI site located in the envelope (env) region and was indistinguishable by restriction endonuclease mapping for 5.1 kilobases (except for the EcoRI site in the LTR) from the 5' end of AKR ecotropic proviral DNA. The DNA segment extending from 5.1 to 6.8 kilobases contained several restriction sites that were not present in the AKR ecotropic provirus. A 0.5-kilobase DNA segment located at the 3' end of MCF 1 DNA contained sequences which hybridized to a xenotropic env-specific DNA probe but not to labeled ecotropic env-specific DNA. This dual character of MCF 1 proviral DNA was also confirmed by analyzing heteroduplex molecules by electron microscopy. The second cloned proviral DNA (designated MCF 2) was a 6.9-kilobase EcoRI DNA fragment which contained LTR sequences at each end and a 2.0-kilobase deletion encompassing most of the env region. The MCF 2 proviral DNA proved to be a useful reagent for detecting LTRs electron microscopically due to the presence of nonoverlapping, terminally located LTR sequences which effected its circularization with DNAs containing homologous LTR sequences. Nucleotide sequence analysis demonstrated the presence of a 104-base-pair direct repeat in the LTR of MCF 2 DNA. In contrast, only a single copy of the reiterated component of the direct repeat was present in MCF 1 DNA. Images PMID:6281459
[Variability of nuclear 18S-25S rDNA of Gentiana lutea L. in nature and in tissue culture in vitro].
Mel'nyk, V M; Spiridonova, K V; Andrieiev, I O; Strashniuk, N M; Kunakh, V A
2004-01-01
18S-25S rDNA sequence in genomes of G. lutea plants from different natural populations and from tissue culture has been studied with blot-hybridization method. It was shown that ribosomal repeats are represented by the variants which differ for their size and for the presence of additional HindIII restriction site. Genome of individual plant usually possesses several variants of DNA repeats. Interpopulation variability according to their quantitative ratio and to the presence of some of them has been shown. Modifications of the range of rDNA repeats not exceeding intraspecific variability were observed in callus tissues in comparison with the plants of initial population. Non-randomness of genome modifications in the course of cell adaptation to in vitro conditions makes it possible to some extent to forecast these modifications in tissue culture.
The DL1 repeats in the genome of Diphyllobothrium latum.
Usmanova, Nadezhda M; Kazakov, Vasiliy I
2010-07-01
Diphyllobothrium latum is a widespread intestinal parasite, which has a great clinical relevance, but there are no sequences of its nuclear genome. In this paper, a repetitive element in the D. latum genome is firstly described. The adult D. latum was obtained in the result of expulsion from intestinum of a patient suffering from diphyllobothriasis. Genomic DNA was isolated from several proglottids of this individual. PstI restriction products of D. latum genomic DNA were sequenced. Polymerase chain reaction (PCR) amplification of these products using genomic DNA and selected primers was carried out. Thereby a cluster of a repetitive element, called DL1, was discovered. For precise identification of a beginning and an end of the repeat, a product of PCR amplification of D. latum genomic DNA with one specific primer was sequenced. In discussion, several evidences that DL1 repeat is a member of the SINE family of retroposons were adduced.
Rathi, Preeti; Witte, Anna; Summerer, Daniel
2017-11-08
Transcription activator-like effectors (TALEs) are DNA major-groove binding proteins widely used for genome targeting. TALEs contain an N-terminal region (NTR) and a central repeat domain (CRD). Repeats of the CRD selectively recognize each one DNA nucleobase, offering programmability. Moreover, repeats with selectivity for 5-methylcytosine (5mC) and its oxidized derivatives can be designed for analytical applications. However, both TALE domains also nonspecifically interact with DNA phosphates via basic amino acids. To enhance the 5mC selectivity of TALEs, we aimed to decrease the nonselective binding energy of TALEs. We substituted basic amino acids with alanine in the NTR and identified TALE mutants with increased selectivity. We then analysed conserved, DNA phosphate-binding KQ diresidues in CRD repeats and identified further improved mutants. Combination of mutations in the NTR and CRD was highly synergetic and resulted in TALE scaffolds with up to 4.3-fold increased selectivity in genomic 5mC analysis via affinity enrichment. Moreover, transcriptional activation in HEK293T cells by a TALE-VP64 construct based on this scaffold design exhibited a 3.5-fold increased 5mC selectivity. This provides perspectives for improved 5mC analysis and for the 5mC-conditional control of TALE-based editing constructs in vivo.
Ribeiro, Tiago; Marques, André; Novák, Petr; Schubert, Veit; Vanzela, André L L; Macas, Jiri; Houben, Andreas; Pedrosa-Harand, Andrea
2017-03-01
Satellite DNA repeats (or satDNA) are fast-evolving sequences usually associated with condensed heterochromatin. To test whether the chromosomal organisation of centromeric and non-centromeric satDNA differs in species with holocentric chromosomes, we identified and characterised the major satDNA families in the holocentric Cyperaceae species Rhynchospora ciliata (2n = 10), R. globosa (2n = 50) and R. tenuis (2n = 2x = 4 and 2n = 4x = 8). While conserved centromeric repeats (present in R. ciliata and R. tenuis) revealed linear signals at both chromatids, non-centromeric, species-specific satDNAs formed distinct clusters along the chromosomes. Colocalisation of both repeat types resulted in a ladder-like hybridisation pattern at mitotic chromosomes. In interphase, the centromeric satDNA was dispersed while non-centromeric satDNA clustered and partly colocalised to chromocentres. Despite the banding-like hybridisation patterns of the clustered satDNA, the identification of chromosome pairs was impaired due to the irregular hybridisation patterns of the homologues in R. tenuis and R. ciliata. These differences are probably caused by restricted or impaired meiotic recombination as reported for R. tenuis, or alternatively by complex chromosome rearrangements or unequal condensation of homologous metaphase chromosomes. Thus, holocentricity influences the chromosomal organisation leading to differences in the distribution patterns and condensation dynamics of centromeric and non-centromeric satDNA.
Johzuka, Katsuki; Terasawa, Masahiro; Ogawa, Hideyuki; Ogawa, Tomoko; Horiuchi, Takashi
2006-03-01
An average of 200 copies of the rRNA gene (rDNA) is clustered in a long tandem array in Saccharomyces cerevisiae. FOB1 is known to be required for expansion/contraction of the repeats by stimulating recombination, thereby contributing to the maintenance of the average copy number. In Deltafob1 cells, the repeats are still maintained without any fluctuation in the copy number, suggesting that another, unknown system acts to prevent repeat contraction. Here, we show that condensin acts together with FOB1 in a functionally complemented fashion to maintain the long tandem repeats. Six condensin mutants possessing severely contracted rDNA repeats were isolated in Deltafob1 cells but not in FOB1+ cells. We also found that the condensin complex associated with the nontranscribed spacer region of rDNA with a major peak coincided with the replication fork barrier (RFB) site in a FOB1-dependent fashion. Surprisingly, condensin association with the RFB site was established during S phase and was maintained until anaphase. These results indicate that FOB1 plays a novel role in preventing repeat contraction by regulating condensin association and suggest a link between replication termination and chromosome condensation and segregation.
Usdin, K; Furano, A V
1988-01-01
The L family (long interspersed repeated DNA) of mobile genetic elements is a persistent feature of the mammalian genome. In rats, this family contains approximately equal to 40,000 members and accounts for approximately equal to 10% of the haploid genome. We demonstrate here that the guanine-rich homopurine stretches located at the right end of L-DNA induce oligonucleotide uptake by contiguous duplex DNA. The uptake is dependent on negative supercoiling and the length of the homopurine stretch and occurs even when the L-DNA homopurine stretches are introduced into a different DNA environment. The bound oligomer primes DNA synthesis when DNA polymerase and deoxyribonucleoside triphosphates are added, resulting in a faithful copy of the template to which the oligonucleotide had bound. The implications of this property of the L-DNA guanine-rich homopurine stretches in the amplification, recombination, and dispersal of L elements is discussed. Images PMID:2837766
Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing.
Hribová, Eva; Neumann, Pavel; Matsumoto, Takashi; Roux, Nicolas; Macas, Jirí; Dolezel, Jaroslav
2010-09-16
Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic diversity studies and in marker-assisted selection.
Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing
2010-01-01
Background Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. Results In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. Conclusion A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic diversity studies and in marker-assisted selection. PMID:20846365
Le, Nam Cao Hoai; Yokokawa, Ryuji; Dao, Dzung Viet; Nguyen, Thien Duy; Wells, John C; Sugiyama, Susumu
2009-01-21
A poly(dimethylsiloxane) (PDMS) chip for Total Internal Reflection (TIR)-based imaging and detection has been developed using Si bulk micromachining and PDMS casting. In this paper, we report the applications of the chip on both inverted and upright fluorescent microscopes and confirm that two types of sample delivery platforms, PDMS microchannel and glass microchannel, can be easily integrated depending on the magnification of an objective lens needed to visualize a sample. Although any device configuration can be achievable, here we performed two experiments to demonstrate the versatility of the microfluidic TIR-based devices. The first experiment was velocity measurement of Nile red microbeads with nominal diameter of 500 nm in a pressure-driven flow. The time-sequenced fluorescent images of microbeads, illuminated by an evanescent field, were cross-correlated by a Particle Image Velocimetry (PIV) program to obtain near-wall velocity field of the microbeads at various flow rates from 500 nl/min to 3000 nl/min. We then evaluated the capabilities of the device for Single Molecule Detection (SMD) of fluorescently labeled DNA molecules from 30 bp to 48.5 kbp and confirm that DNA molecules as short as 1105 bp were detectable. Our versatile, integrated device could provide low-cost and fast accessibility to Total Internal Reflection Fluorescent Microscopy (TIRFM) on both conventional upright and inverted microscopes. It could also be a useful component in a Micro-Total Analysis System (micro-TAS) to analyze nanoparticles or biomolecules near-wall transport or motion.
Ordered mapping of 3 alphoid DNA subsets on human chromosome 22
DOE Office of Scientific and Technical Information (OSTI.GOV)
Antonacci, R.; Baldini, A.; Archidiacono, N.
1994-09-01
Alpha satellite DNA consists of tandemly repeated monomers of 171 bp clustered in the centromeric region of primate chromosomes. Sequence divergence between subsets located in different human chromosomes is usually high enough to ensure chromosome-specific hybridization. Alphoid probes specific for almost every human chromosome have been reported. A single chromosome can carry different subsets of alphoid DNA and some alphoid subsets can be shared by different chromosomes. We report the physical order of three alphoid DNA subsets on human chromosome 22 determined by a combination of low and high resolution cytological mapping methods. Results visually demonstrate the presence of threemore » distinct alphoid DNA domains at the centromeric region of chromosome 22. We have measured the interphase distances between the three probes in three-color FISH experiments. Statistical analysis of the results indicated the order of the subsets. Two color experiments on prometaphase chromosomes established the order of the three domains relative to the arms of chromosome 22 and confirmed the results obtained using interphase mapping. This demonstrates the applicability of interphase mapping for alpha satellite DNA orderering. However, in our experiments, interphase mapping did not provide any information about the relationship between extremities of the repeat arrays. This information was gained from extended chromatin hybridization. The extremities of two of the repeat arrays were seen to be almost overlapping whereas the third repeat array was clearly separated from the other two. Our data show the value of extended chromatin hybridization as a complement of other cytological techniques for high resolution mapping of repetitive DNA sequences.« less
An annotated genetic map of loblolly pine based on microsatellite and cDNA markers
USDA-ARS?s Scientific Manuscript database
Previous loblolly pine (Pinus taeda L.) genetic linkage maps have been based on a variety of DNA polymorphisms, such as AFLPs, RAPDs, RFLPs, and ESTPs, but only a few SSRs (simple sequence repeats), also known as simple tandem repeats or microsatellites, have been mapped in P. taeda. The objective o...
Graw, J; Liebstein, A; Pietrowski, D; Schmitt-John, T; Werner, T
1993-12-22
The murine genes, gamma B-cry and gamma C-cry, encoding the gamma B- and gamma C-crystallins, were isolated from a genomic DNA library. The complete nucleotide (nt) sequences of both genes were determined from 661 and 711 bp, respectively, upstream from the first exon to the corresponding polyadenylation sites, comprising more than 2650 and 2890 bp, respectively. The new sequences were compared to the partial cDNA sequences available for the murine gamma B-cry and gamma C-cry, as well as to the corresponding genomic sequences from rat and man, at both the nt and predicted amino acid (aa) sequence levels. In the gamma B-cry promoter region, a canonical CCAAT-box, a TATA-box, putative NF-I and C/EBP sites were detected. An R-repeat is inserted 366 bp upstream from the transcription start point. In contrast, the gamma C-cry promoter does not contain a CCAAT-box, but some other putative binding sites for transcription factors (AP-2, UBP-1, LBP-1) were located by computer analysis. The promoter regions of all six gamma-cry from mouse, rat and human, except human psi gamma F-cry, were analyzed for common sequence elements. A complex sequence element of about 70-80 bp was found in the proximal promoter, which contains a gamma-cry-specific and almost invariant sequence (crygpel) of 14 nt, and ends with the also invariant TATA-box. Within the complex sequence element, a minimum of three further features specific for the gamma A-, gamma B- and gamma D/E/F-cry genes can be defined, at least two of which were recently shown to be functional. In addition to these four sequence elements, a subtype-specific structure of inverted repeats with different-sized spacers can be deduced from the multiple sequence alignment. A phylogenetic analysis based on the promoter region, as well as the complete exon 3 of all gamma-cry from mouse, rat and man, suggests separation of only five gamma-cry subtypes (gamma A-, gamma B-, gamma C-, gamma D- and gamma E/F-cry) prior to species separation.
Direct Imaging of Gene-Carrier Complexes in Animal Cells
NASA Astrophysics Data System (ADS)
Lin, Alison J.; Slack, Nelle L.; Ahmad, Ayesha; Matsumoto, Brian; Safinya, Cyrus R.
1998-03-01
Cationic lipids are promising gene carriers for DNA transfection. Establishing the correlations between structures of cationic lipid/DNA complexes (CL-DNA) and pathways of transfection will greatly aid us in achieving the optimal CL-DNA transfections. Our first step is to determine the uptake mechanism of DNA by studying the interactions and structures of DNA and cationic lipids. X-ray diffraction shows that the CL-DNA undergoes structural phase transitions from lamellar( J. Raedler, I. Koltover, T. Salditt, C. R. Safinya, Science 275, 810 (1997).) to inverted hexagonal self-assemblies as we change the lipid composition. X-ray diffraction and optical microscopy techniques are used to directly image the progress of the CL-DNA in mouse L-cells and unravel the complex structure in-situ. Fluorescence and confocal optical microscopy techniques allow us to monitor the interactions between the complexes and different organelles in the cell cytoplasm. Current results indicate that once inside cells, complexes containing DOPE follow a different pathway from those containing DOPC. This research is funded by NSF-DMR-9624091, PRF-31352-AC7, and Los Alamos-STB/UC:96-108.
The effects of spatially displaced visual feedback on remote manipulator performance
NASA Technical Reports Server (NTRS)
Smith, Randy L.; Stuart, Mark A.
1989-01-01
The effects of spatially displaced visual feedback on the operation of a camera viewed remote manipulation task are analyzed. A remote manipulation task is performed by operators exposed to the following different viewing conditions: direct view of the work site; normal camera view; reversed camera view; inverted/reversed camera view; and inverted camera view. The task completion performance times are statistically analyzed with a repeated measures analysis of variance, and a Newman-Keuls pairwise comparison test is administered to the data. The reversed camera view is ranked third out of four camera viewing conditions, while the normal viewing condition is found significantly slower than the direct viewing condition. It is shown that generalization to remote manipulation applications based upon the results of direct manipulation studies are quite useful, but they should be made cautiously.
Begum, Rabeya; Zakrzewski, Falk; Menzel, Gerhard; Weber, Beatrice; Alam, Sheikh Shamimul; Schmidt, Thomas
2013-01-01
Background and Aims The cultivated jute species Corchorus olitorius and Corchorus capsularis are important fibre crops. The analysis of repetitive DNA sequences, comprising a major part of plant genomes, has not been carried out in jute but is useful to investigate the long-range organization of chromosomes. The aim of this study was the identification of repetitive DNA sequences to facilitate comparative molecular and cytogenetic studies of two jute cultivars and to develop a fluorescent in situ hybridization (FISH) karyotype for chromosome identification. Methods A plasmid library was generated from C. olitorius and C. capsularis with genomic restriction fragments of 100–500 bp, which was complemented by targeted cloning of satellite DNA by PCR. The diversity of the repetitive DNA families was analysed comparatively. The genomic abundance and chromosomal localization of different repeat classes were investigated by Southern analysis and FISH, respectively. The cytosine methylation of satellite arrays was studied by immunolabelling. Key Results Major satellite repeats and retrotransposons have been identified from C. olitorius and C. capsularis. The satellite family CoSat I forms two undermethylated species-specific subfamilies, while the long terminal repeat (LTR) retrotransposons CoRetro I and CoRetro II show similarity to the Metaviridea of plant retroelements. FISH karyotypes were developed by multicolour FISH using these repetitive DNA sequences in combination with 5S and 18S–5·8S–25S rRNA genes which enable the unequivocal chromosome discrimination in both jute species. Conclusions The analysis of the structure and diversity of the repeated DNA is crucial for genome sequence annotation. The reference karyotypes will be useful for breeding of jute and provide the basis for karyotyping homeologous chromosomes of wild jute species to reveal the genetic and evolutionary relationship between cultivated and wild Corchorus species. PMID:23666888
Alu elements shape the primate transcriptome by cis-regulation of RNA editing
2014-01-01
Background RNA editing by adenosine to inosine deamination is a widespread phenomenon, particularly frequent in the human transcriptome, largely due to the presence of inverted Alu repeats and their ability to form double-stranded structures – a requisite for ADAR editing. While several hundred thousand editing sites have been identified within these primate-specific repeats, the function of Alu-editing has yet to be elucidated. Results We show that inverted Alu repeats, expressed in the primate brain, can induce site-selective editing in cis on sites located several hundred nucleotides from the Alu elements. Furthermore, a computational analysis, based on available RNA-seq data, finds that site-selective editing occurs significantly closer to edited Alu elements than expected. These targets are poorly edited upon deletion of the editing inducers, as well as in homologous transcripts from organisms lacking Alus. Sequences surrounding sites near edited Alus in UTRs, have been subjected to a lesser extent of evolutionary selection than those far from edited Alus, indicating that their editing generally depends on cis-acting Alus. Interestingly, we find an enrichment of primate-specific editing within encoded sequence or the UTRs of zinc finger-containing transcription factors. Conclusions We propose a model whereby primate-specific editing is induced by adjacent Alu elements that function as recruitment elements for the ADAR editing enzymes. The enrichment of site-selective editing with potentially functional consequences on the expression of transcription factors indicates that editing contributes more profoundly to the transcriptomic regulation and repertoire in primates than previously thought. PMID:24485196
Chien, Maw-Sheng; Gilbert , Teresa L.; Huang, Chienjin; Landolt, Marsha L.; O'Hara, Patrick J.; Winton, James R.
1992-01-01
The complete sequence coding for the 57-kDa major soluble antigen of the salmonid fish pathogen, Renibacterium salmoninarum, was determined. The gene contained an opening reading frame of 1671 nucleotides coding for a protein of 557 amino acids with a calculated Mr value of 57190. The first 26 amino acids constituted a signal peptide. The deduced sequence for amino acid residues 27–61 was in agreement with the 35 N-terminal amino acid residues determined by microsequencing, suggesting the protein in synthesized as a 557-amino acid precursor and processed to produce a mature protein of Mr 54505. Two regions of the protein contained imperfect direct repeats. The first region contained two copies of an 81-residue repeat, the second contained five copies of an unrelated 25-residue repeat. Also, a perfect inverted repeat (including three in-frame UAA stop codons) was observed at the carboxyl-terminus of the gene.
Franco, Bernardo; González-Cerón, Gabriela; Servín-González, Luis
2003-11-01
The functionality of direct and inverted repeat sequences inside the cis acting locus of transfer (clt) of the Streptomyces plasmid pJV1 was determined by testing the effect of different deletions on plasmid transfer. The results show that the single most important element for pJV1 clt function is a series of evenly spaced 9 bp long direct repeats which match the consensus CCGCACA(C/G)(C/G), since their deletion caused a dramatic reduction in plasmid transfer. The presence of these repeats in the absence of any other clt sequences allowed plasmid transfer to occur at a frequency that was at least two orders of magnitude higher than that obtained in the complete absence of clt. A database search revealed regions with a similar organization, and in the same position, in Streptomyces plasmids pSN22 and pSLS, which have transfer proteins homologous to those of pJV1.
Tek, Ahmet L; Kashihara, Kazunari; Murata, Minoru; Nagaki, Kiyotaka
2011-11-01
The centromere plays an essential role for proper chromosome segregation during cell division and usually harbors long arrays of tandem repeated satellite DNA sequences. Although this function is conserved among eukaryotes, the sequences of centromeric DNA repeats are variable. Most of our understanding of functional centromeres, which are defined by localization of a centromere-specific histone H3 (CENH3) protein, comes from model organisms. The components of the functional centromere in legumes are poorly known. The genus Astragalus is a member of the legumes and bears the largest numbers of species among angiosperms. Therefore, we studied the components of centromeres in Astragalus sinicus. We identified the CenH3 homolog of A. sinicus, AsCenH3 that is the most compact in size among higher eukaryotes. A CENH3-based assay revealed the functional centromeric DNA sequences from A. sinicus, called CentAs. The CentAs repeat is localized in A. sinicus centromeres, and comprises an AT-rich tandem repeat with a monomer size of 20 nucleotides.
Divergent copies of the large inverted repeat in the chloroplast genomes of ulvophycean green algae.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2017-04-20
The chloroplast genomes of many algae and almost all land plants carry two identical copies of a large inverted repeat (IR) sequence that can pair for flip-flop recombination and undergo expansion/contraction. Although the IR has been lost multiple times during the evolution of the green algae, the underlying mechanisms are still largely unknown. A recent comparison of IR-lacking and IR-containing chloroplast genomes of chlorophytes from the Ulvophyceae (Ulotrichales) suggested that differential elimination of genes from the IR copies might lead to IR loss. To gain deeper insights into the evolutionary history of the chloroplast genome in the Ulvophyceae, we analyzed the genomes of Ignatius tetrasporus and Pseudocharacium americanum (Ignatiales, an order not previously sampled), Dangemannia microcystis (Oltmannsiellopsidales), Pseudoneochloris marina (Ulvales) and also Chamaetrichon capsulatum and Trichosarcina mucosa (Ulotrichales). Our comparison of these six chloroplast genomes with those previously reported for nine ulvophyceans revealed unsuspected variability. All newly examined genomes feature an IR, but remarkably, the copies of the IR present in the Ignatiales, Pseudoneochloris, and Chamaetrichon diverge in sequence, with the tRNA genes from the rRNA operon missing in one IR copy. The implications of this unprecedented finding for the mechanism of IR loss and flip-flop recombination are discussed.
Power, Imana L; Dang, Phat M; Sobolev, Victor S; Orner, Valerie A; Powell, Joseph L; Lamb, Marshall C; Arias, Renee S
2017-04-01
Aflatoxin contamination is a major constraint in food production worldwide. In peanut (Arachis hypogaea L.), these toxic and carcinogenic aflatoxins are mainly produced by Aspergillus flavus Link and A. parasiticus Speare. The use of RNA interference (RNAi) is a promising method to reduce or prevent the accumulation of aflatoxin in peanut seed. In this study, we performed high-throughput sequencing of small RNA populations in a control line and in two transformed peanut lines that expressed an inverted repeat targeting five genes involved in the aflatoxin-biosynthesis pathway and that showed up to 100% less aflatoxin B 1 than the controls. The objective was to determine the putative involvement of the small RNA populations in aflatoxin reduction. In total, 41 known microRNA (miRNA) families and many novel miRNAs were identified. Among those, 89 known and 10 novel miRNAs were differentially expressed in the transformed lines. We furthermore found two small interfering RNAs derived from the inverted repeat, and 39 sRNAs that mapped without mismatches to the genome of A. flavus and were present only in the transformed lines. This information will increase our understanding of the effectiveness of RNAi and enable the possible improvement of the RNAi technology for the control of aflatoxins. Copyright © 2017 Elsevier B.V. All rights reserved.
Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor.
Kohany, Oleksiy; Gentles, Andrew J; Hankus, Lukasz; Jurka, Jerzy
2006-10-25
Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Updating and maintenance of the database requires specialized tools, which we have created and made available for use with Repbase, and which may be useful as a template for other curated databases. We describe the software tools RepbaseSubmitter and Censor, which are designed to facilitate updating and screening the content of Repbase. RepbaseSubmitter is a java-based interface for formatting and annotating Repbase entries. It eliminates many common formatting errors, and automates actions such as calculation of sequence lengths and composition, thus facilitating curation of Repbase sequences. In addition, it has several features for predicting protein coding regions in sequences; searching and including Pubmed references in Repbase entries; and searching the NCBI taxonomy database for correct inclusion of species information and taxonomic position. Censor is a tool to rapidly identify repetitive elements by comparison to known repeats. It uses WU-BLAST for speed and sensitivity, and can conduct DNA-DNA, DNA-protein, or translated DNA-translated DNA searches of genomic sequence. Defragmented output includes a map of repeats present in the query sequence, with the options to report masked query sequence(s), repeat sequences found in the query, and alignments. Censor and RepbaseSubmitter are available as both web-based services and downloadable versions. They can be found at http://www.girinst.org/repbase/submission.html (RepbaseSubmitter) and http://www.girinst.org/censor/index.php (Censor).
Ribosomal DNA copy loss and repeat instability in ATRX-mutated cancers
Udugama, Maheshi; Sanij, Elaine; Voon, Hsiao P. J.; Son, Jinbae; Hii, Linda; Henson, Jeremy D.; Chan, F. Lyn; Chang, Fiona T. M.; Liu, Yumei; Pearson, Richard B.; Kalitsis, Paul; Mann, Jeffrey R.; Collas, Philippe; Hannan, Ross D.; Wong, Lee H.
2018-01-01
ATRX (alpha thalassemia/mental retardation X-linked) complexes with DAXX to deposit histone variant H3.3 into repetitive heterochromatin. Recent genome sequencing studies in cancers have revealed mutations in ATRX and their association with ALT (alternative lengthening of telomeres) activation. Here we report depletion of ATRX in mouse ES cells leads to selective loss in ribosomal RNA gene (rDNA) copy number. Supporting this, ATRX-mutated human ALT-positive tumors also show a substantially lower rDNA copy than ALT-negative tumors. Further investigation shows that the rDNA copy loss and repeat instability are caused by a disruption in H3.3 deposition and thus a failure in heterochromatin formation at rDNA repeats in the absence of ATRX. We also find that ATRX-depleted cells are reduced in ribosomal RNA transcription output and show increased sensitivity to RNA polymerase I (Pol I) transcription inhibitor CX5461. In addition, human ALT-positive cancer cell lines are also more sensitive to CX5461 treatment. Our study provides insights into the contribution of ATRX loss of function to tumorigenesis through the loss of rDNA stability and suggests the therapeutic potential of targeting Pol I transcription in ALT cancers. PMID:29669917
Short intronic repeat sequences facilitate circular RNA production
Liang, Dongming
2014-01-01
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
Hoarau, Galice; Holla, Suzanne; Lescasse, Rachel; Stam, Wytze T; Olsen, Jeanine L
2002-12-01
The general assumption that mitochondrial DNA (mtDNA) does not undergo recombination has been challenged recently in invertebrates. Here we present the first direct evidence for recombination in the mtDNA of a vertebrate, the flounder Platichthys flesus. The control region in the mtDNA of this flatfish is characterized by the presence of a variable number of tandem repeats and a high level of heteroplasmy. Two types of repeats were recognized, differing by two C-T point mutations. Most individuals carry a pure "C" or a pure "T" array, but one individual showed a compound "CT" array. Such a compound array is evidence for recombination in the mtDNA control region from the flounder.
Wang, Yongming; Lin, Xiuyun; Dong, Bo; Wang, Yingdian; Liu, Bao
2004-01-01
RAPD (randomly amplified polymorphic DNA) and ISSR (inter-simple sequence repeat) fingerprinting on HpaII/MspI-digested genomic DNA of nine elite japonica rice cultivars implies inter-cultivar DNA methylation polymorphism. Using both DNA fragments isolated from RAPD or ISSR gels and selected low-copy sequences as probes, methylation-sensitive Southern blot analysis confirms the existence of extensive DNA methylation polymorphism in both genes and DNA repeats among the rice cultivars. The cultivar-specific methylation patterns are stably maintained, and can be used as reliable molecular markers. Transcriptional analysis of four selected sequences (RdRP, AC9, HSP90 and MMR) on leaves and roots from normal and 5-azacytidine-treated seedlings of three representative cultivars shows an association between the transcriptional activity of one of the genes, the mismatch repair (MMR) gene, and its CG methylation patterns.
Biological sequence compression algorithms.
Matsumoto, T; Sadakane, K; Imai, H
2000-01-01
Today, more and more DNA sequences are becoming available. The information about DNA sequences are stored in molecular biology databases. The size and importance of these databases will be bigger and bigger in the future, therefore this information must be stored or communicated efficiently. Furthermore, sequence compression can be used to define similarities between biological sequences. The standard compression algorithms such as gzip or compress cannot compress DNA sequences, but only expand them in size. On the other hand, CTW (Context Tree Weighting Method) can compress DNA sequences less than two bits per symbol. These algorithms do not use special structures of biological sequences. Two characteristic structures of DNA sequences are known. One is called palindromes or reverse complements and the other structure is approximate repeats. Several specific algorithms for DNA sequences that use these structures can compress them less than two bits per symbol. In this paper, we improve the CTW so that characteristic structures of DNA sequences are available. Before encoding the next symbol, the algorithm searches an approximate repeat and palindrome using hash and dynamic programming. If there is a palindrome or an approximate repeat with enough length then our algorithm represents it with length and distance. By using this preprocessing, a new program achieves a little higher compression ratio than that of existing DNA-oriented compression algorithms. We also describe new compression algorithm for protein sequences.
The fission yeast CENP-B protein Abp1 prevents pervasive transcription of repetitive DNA elements.
Daulny, Anne; Mejía-Ramírez, Eva; Reina, Oscar; Rosado-Lugo, Jesus; Aguilar-Arnal, Lorena; Auer, Herbert; Zaratiegui, Mikel; Azorin, Fernando
2016-10-01
It is well established that eukaryotic genomes are pervasively transcribed producing cryptic unstable transcripts (CUTs). However, the mechanisms regulating pervasive transcription are not well understood. Here, we report that the fission yeast CENP-B homolog Abp1 plays an important role in preventing pervasive transcription. We show that loss of abp1 results in the accumulation of CUTs, which are targeted for degradation by the exosome pathway. These CUTs originate from different types of genomic features, but the highest increase corresponds to Tf2 retrotransposons and rDNA repeats, where they map along the entire elements. In the absence of abp1, increased RNAPII-Ser5P occupancy is observed throughout the Tf2 coding region and, unexpectedly, RNAPII-Ser5P is enriched at rDNA repeats. Loss of abp1 also results in Tf2 derepression and increased nucleolus size. Altogether these results suggest that Abp1 prevents pervasive RNAPII transcription of repetitive DNA elements (i.e., Tf2 and rDNA repeats) from internal cryptic sites. Copyright © 2016 Elsevier B.V. All rights reserved.
DNA triplet repeats mediate heterochromatin-protein-1-sensitive variegated gene silencing.
Saveliev, Alexander; Everett, Christopher; Sharpe, Tammy; Webster, Zoë; Festenstein, Richard
2003-04-24
Gene repression is crucial to the maintenance of differentiated cell types in multicellular organisms, whereas aberrant silencing can lead to disease. The organization of DNA into chromatin and heterochromatin is implicated in gene silencing. In chromatin, DNA wraps around histones, creating nucleosomes. Further condensation of chromatin, associated with large blocks of repetitive DNA sequences, is known as heterochromatin. Position effect variegation (PEV) occurs when a gene is located abnormally close to heterochromatin, silencing the affected gene in a proportion of cells. Here we show that the relatively short triplet-repeat expansions found in myotonic dystrophy and Friedreich's ataxia confer variegation of expression on a linked transgene in mice. Silencing was correlated with a decrease in promoter accessibility and was enhanced by the classical PEV modifier heterochromatin protein 1 (HP1). Notably, triplet-repeat-associated variegation was not restricted to classical heterochromatic regions but occurred irrespective of chromosomal location. Because the phenomenon described here shares important features with PEV, the mechanisms underlying heterochromatin-mediated silencing might have a role in gene regulation at many sites throughout the mammalian genome and modulate the extent of gene silencing and hence severity in several triplet-repeat diseases.
Lomaeva, M G; Fomenko, L A; Vasil'eva, G V; Bezlepkin, V G
2016-01-01
Evidence is presented indicating the differences in the polymorphism of microsatellite (MCS) repeats in DNA of somatic tissues in the offspring of BALB/c mice of different sex born from preconceptionally irradiated males or females. Brother-sister groups of the offspring born by non-irradiated parental pairs were compared with the offspring obtained after the irradiation of one parent in the same pairs. The number of MCS repeats in DNA of somatic tissues of the offspring from irradiated males or females was compared by a polymerase chain reaction using an arbitrary primer. It was found that changes in the polymorphism of the number of MCS repeats in the offspring from the males irradiated at a dose of 2 Gy was insignificant as compared with the offspring from control animals. In the offspring born by the females irradiated at a dose of 2 Gy (which does not impair the reproductive capacity), a statistically significant increase in the polymorphism was observed. Changes in the polymorphism were different in the offspring of different sex. A higher level of polymorphism was revealed in the female offspring born from the females of the F0 generation after their irradiation at a dose of 2 Gy. The increase in the polymorphism of the number of MCS repeats in DNA was more pronounced in postmitotic tissues compared with proliferating tissues.
Singh, Deepak K.; Rath, Pramod C.
2012-01-01
We report strong somatic and germ line expression of LINE RNAs in eight different tissues of rat by using a novel ~2.8 kb genomic PstI-LINE DNA (P1-LINE) isolated from the rat brain. P1-LINE is present in a 93 kb LINE-SINE-cluster in sub-telomeric region of chromosome 12 (12p12) and as multiple truncated copies interspersed in all rat chromosomes. P1-LINEs occur as inverted repeats at multiple genomic loci in tissue-specific and mosaic patterns. P1-LINE RNAs are strongly expressed in brain, liver, lungs, heart, kidney, testes, spleen and thymus into large to small heterogeneous RNAs (~5.0 to 0.2 kb) in tissue-specific and dynamic patterns in individual rats. P1-LINE DNA is strongly methylated at CpG-dinucleotides in most genomic copies in all the tissues and weakly hypomethylated in few copies in some tissues. Small (700–75 nt) P1-LINE RNAs expressed in all tissues may be possible precursors for small regulatory RNAs (PIWI-interacting/piRNAs) bioinformatically derived from P1-LINE. The strong and dynamic expression of LINE RNAs from multiple chromosomal loci and the putative piRNAs in somatic tissues of rat under normal physiological conditions may define functional chromosomal domains marked by LINE RNAs as long noncoding RNAs (lncRNAs) unrestricted by DNA methylation. The tissue-specific, dynamic RNA expression and mosaic genomic distribution of LINEs representing a steady-state genomic flux of retrotransposon RNAs suggest for biological role of LINE RNAs as long ncRNAs and small piRNAs in mammalian tissues independent of their cellular fate for translation, reverse-transcription and retrotransposition. This may provide evolutionary advantages to LINEs and mammalian genomes. PMID:23064113
Ponnazhagan, Selvarangan; Weigel, Kirsten A.; Raikwar, Sudhanshu P.; Mukherjee, Pinku; Yoder, Mervin C.; Srivastava, Arun
1998-01-01
A novel packaging strategy combining the salient features of two human parvoviruses, namely the pathogenic parvovirus B19 and the nonpathogenic adeno-associated virus type 2 (AAV), was developed to achieve erythroid cell-specific delivery as well as expression of the transduced gene. The development of such a chimeric vector system was accomplished by packaging heterologous DNA sequences cloned within the inverted terminal repeats of AAV and subsequently packaging the DNA inside the capsid structure of B19 virus. Recombinant B19 virus particles were assembled, as evidenced by electron microscopy as well as DNA slot blot analyses. The hybrid vector failed to transduce nonerythroid human cells, such as 293 cells, as expected. However, MB-02 cells, a human megakaryocytic leukemia cell line which can be infected by B19 virus following erythroid differentiation with erythropoietin (N. C. Munshi, S. Z. Zhou, M. J. Woody, D. A. Morgan, and A. Srivastava, J. Virol. 67:562–566, 1993) but lacks the putative receptor for AAV (S. Ponnazhagan, X.-S. Wang, M. J. Woody, F. Luo, L. Y. Kang, M. L. Nallari, N. C. Munshi, S. Z. Zhou, and A. Srivastava, J. Gen. Virol. 77:1111–1122, 1996), were readily transduced by this vector. The hybrid vector was also found to specifically target the erythroid population in primary human bone marrow cells as well as more immature hematopoietic progenitor cells following erythroid differentiation, as evidenced by selective expression of the transduced gene in these target cells. Preincubation with anticapsid antibodies against B19 virus, but not anticapsid antibodies against AAV, inhibited transduction of primary human erythroid cells. The efficiency of transduction of primary human erythroid cells by the recombinant B19 virus vector was significantly higher than that by the recombinant AAV vector. Further development of the AAV-B19 virus hybrid vector system should prove beneficial in gene therapy protocols aimed at the correction of inherited and acquired human diseases affecting cells of erythroid lineage. PMID:9573295
rbcL and matK Earn Two Thumbs Up as the Core DNA Barcode for Ferns
Li, Fay-Wei; Kuo, Li-Yaung; Rothfels, Carl J.; Ebihara, Atsushi; Chiou, Wen-Liang; Windham, Michael D.; Pryer, Kathleen M.
2011-01-01
Background DNA barcoding will revolutionize our understanding of fern ecology, most especially because the accurate identification of the independent but cryptic gametophyte phase of the fern's life history—an endeavor previously impossible—will finally be feasible. In this study, we assess the discriminatory power of the core plant DNA barcode (rbcL and matK), as well as alternatively proposed fern barcodes (trnH-psbA and trnL-F), across all major fern lineages. We also present plastid barcode data for two genera in the hyperdiverse polypod clade—Deparia (Woodsiaceae) and the Cheilanthes marginata group (currently being segregated as a new genus of Pteridaceae)—to further evaluate the resolving power of these loci. Principal Findings Our results clearly demonstrate the value of matK data, previously unavailable in ferns because of difficulties in amplification due to a major rearrangement of the plastid genome. With its high sequence variation, matK complements rbcL to provide a two-locus barcode with strong resolving power. With sequence variation comparable to matK, trnL-F appears to be a suitable alternative barcode region in ferns, and perhaps should be added to the core barcode region if universal primer development for matK fails. In contrast, trnH-psbA shows dramatically reduced sequence variation for the majority of ferns. This is likely due to the translocation of this segment of the plastid genome into the inverted repeat regions, which are known to have a highly constrained substitution rate. Conclusions Our study provides the first endorsement of the two-locus barcode (rbcL+matK) in ferns, and favors trnL-F over trnH-psbA as a potential back-up locus. Future work should focus on gathering more fern matK sequence data to facilitate universal primer development. PMID:22028918
Van Kreijl, C F; Bos, J L
1977-01-01
The repeating nucleotide sequence of 68 base pairs in the mtDNA from an ethidium-induced cytoplasmic petite mutant of yeast has been determined. For sequence analysis specifically primed and terminated RNA copies, obtained by in vitro transcription of the separated strands, were use. The sequence consists of 66 consecutive AT base pairs flanked by two GC pairs and comprises nearly all of the mutant mitochondrial genome. The sequence, moreover, also represents the first part of wild-type mtDNA sequence so far. Images PMID:198740
NASA Technical Reports Server (NTRS)
Smith, G. K.; Jie, J.; Fox, G. E.; Gao, X.
1995-01-01
DNA triplet repeats, 5'-d(CTG)n and 5'-d(CAG)n, are present in genes which have been implicated in several neurodegenerative disorders. To investigate possible stable structures formed by these repeating sequences, we have examined d(CTG)n, d(CAG)n and d(CTG).d(CAG)n (n = 2 and 3) using NMR and UV optical spectroscopy. These studies reveal that single stranded (CTG)n (n > 2) forms stable, antiparallel helical duplexes, while the single stranded (CAG)n requires at least three repeating units to form a duplex. NMR and UV melting experiments show that the Tm increases in the order of [(CAG)3]2 < [(CTG)3]2 << (CAG)3.(CTG)3. The (CTG)3 duplex is stable and exhibits similar NMR spectra in solutions containing 0.1-4 M NaCl and at a pH range from 4.6 to 8.8. The (CTG)3 duplex, which contains multiple-T.T mismatches, displays many NMR spectral characteristics similar to those of B-form DNA. However, unique NOE and 1H-31P coupling patterns associated with the repetitive T.T mismatches in the CTG repeats are discerned. These results, in conjunction with recent in vitro studies suggest that longer CTG repeats may form hairpin structures, which can potentially cause interruption in replication, leading to dynamic expansion or deletion of triplet repeats.
Pietras, D F; Bennett, K L; Siracusa, L D; Woodworth-Gutai, M; Chapman, V M; Gross, K W; Kane-Haas, C; Hastie, N D
1983-01-01
We report the construction of a small library of recombinant plasmids containing Mus musculus repetitive DNA inserts. The repetitive cloned fraction was derived from denatured genomic DNA by reassociation to a Cot value at which repetitive, but not unique, sequences have reannealed followed by exhaustive S1 nuclease treatment to degrade single stranded DNA. Initial characterizations of this library by colony filter hybridizations have led to the identification of a previously undetected M. musculus minor satellite as well as to clones containing M. musculus major satellite sequences. This new satellite is repeated 10-20 times less than the major satellite in the M. musculus genome. It has a repeat length of 130 nucleotides compared with the M. musculus major satellite with a repeat length of 234 nucleotides. Sequence analysis of the minor satellite has shown that it has a 29 base pair region with extensive homology to one of the major satellite repeating subunits. We also show by in situ hybridization that this minor satellite sequence is located at the centromeres and possibly the arms of at least half the M musculus chromosomes. Sequences related to the minor satellite have been found in the DNA of a related Mus species, Mus spretus, and may represent the major satellite of that species. Images PMID:6314268
Litovchick, Alexander; Clark, Matthew A; Keefe, Anthony D
2014-01-01
The affinity-mediated selection of large libraries of DNA-encoded small molecules is increasingly being used to initiate drug discovery programs. We present universal methods for the encoding of such libraries using the chemical ligation of oligonucleotides. These methods may be used to record the chemical history of individual library members during combinatorial synthesis processes. We demonstrate three different chemical ligation methods as examples of information recording processes (writing) for such libraries and two different cDNA-generation methods as examples of information retrieval processes (reading) from such libraries. The example writing methods include uncatalyzed and Cu(I)-catalyzed alkyne-azide cycloadditions and a novel photochemical thymidine-psoralen cycloaddition. The first reading method “relay primer-dependent bypass” utilizes a relay primer that hybridizes across a chemical ligation junction embedded in a fixed-sequence and is extended at its 3′-terminus prior to ligation to adjacent oligonucleotides. The second reading method “repeat-dependent bypass” utilizes chemical ligation junctions that are flanked by repeated sequences. The upstream repeat is copied prior to a rearrangement event during which the 3′-terminus of the cDNA hybridizes to the downstream repeat and polymerization continues. In principle these reading methods may be used with any ligation chemistry and offer universal strategies for the encoding (writing) and interpretation (reading) of DNA-encoded chemical libraries. PMID:25483841
Rizk, Francine; Laverdure, Sylvain; d'Alençon, Emmanuelle; Bossin, Hervé; Dupressoir, Thierry
2018-01-01
The Lepidopteran ambidensovirus 1 isolated from Junonia coenia (hereafter JcDV) is an invertebrate parvovirus considered as a viral transduction vector as well as a potential tool for the biological control of insect pests. Previous works showed that JcDV-based circular plasmids experimentally integrate into insect cells genomic DNA. In order to approach the natural conditions of infection and possible integration, we generated linear JcDV- gfp based molecules which were transfected into non permissive Spodoptera frugiperda ( Sf9 ) cultured cells. Cells were monitored for the expression of green fluorescent protein (GFP) and DNA was analyzed for integration of transduced viral sequences. Non-structural protein modulation of the VP-gene cassette promoter activity was additionally assayed. We show that linear JcDV-derived molecules are capable of long term genomic integration and sustained transgene expression in Sf9 cells. As expected, only the deletion of both inverted terminal repeats (ITR) or the polyadenylation signals of NS and VP genes dramatically impairs the global transduction/expression efficiency. However, all the integrated viral sequences we characterized appear "scrambled" whatever the viral content of the transfected vector. Despite a strong GFP expression, we were unable to recover any full sequence of the original constructs and found rearranged viral and non-viral sequences as well. Cellular flanking sequences were identified as non-coding ones. On the other hand, the kinetics of GFP expression over time led us to investigate the apparent down-regulation by non-structural proteins of the VP-gene cassette promoter. Altogether, our results show that JcDV-derived sequences included in linear DNA molecules are able to drive efficiently the integration and expression of a foreign gene into the genome of insect cells, whatever their composition, provided that at least one ITR is present. However, the transfected sequences were extensively rearranged with cellular DNA during or after random integration in the host cell genome. Lastly, the non-structural proteins seem to participate in the regulation of p9 promoter activity rather than to the integration of viral sequences.
Homology-dependent repair is involved in 45S rDNA loss in plant CAF-1 mutants
Muchová, Veronika; Amiard, Simon; Mozgová, Iva; Dvořáčková, Martina; Gallego, Maria E; White, Charles; Fajkus, Jiří
2015-01-01
Arabidopsis thaliana mutants in FAS1 and FAS2 subunits of chromatin assembly factor 1 (CAF1) show progressive loss of 45S rDNA copies and telomeres. We hypothesized that homology-dependent DNA damage repair (HDR) may contribute to the loss of these repeats in fas mutants. To test this, we generated double mutants by crossing fas mutants with knock-out mutants in RAD51B, one of the Rad51 paralogs of A. thaliana. Our results show that the absence of RAD51B decreases the rate of rDNA loss, confirming the implication of RAD51B-dependent recombination in rDNA loss in the CAF1 mutants. Interestingly, this effect is not observed for telomeric repeat loss, which thus differs from that acting in rDNA loss. Involvement of DNA damage repair in rDNA dynamics in fas mutants is further supported by accumulation of double-stranded breaks (measured as γ-H2AX foci) in 45S rDNA. Occurrence of the foci is not specific for S-phase, and is ATM-independent. While the foci in fas mutants occur both in the transcribed (intranucleolar) and non-transcribed (nucleoplasmic) fraction of rDNA, double fas rad51b mutants show a specific increase in the number of the intranucleolar foci. These results suggest that the repair of double-stranded breaks present in the transcribed rDNA region is RAD51B dependent and that this contributes to rDNA repeat loss in fas mutants, presumably via the single-stranded annealing recombination pathway. Our results also highlight the importance of proper chromatin assembly in the maintenance of genome stability. PMID:25359579
Danilowicz, Claudia; Hermans, Laura; Coljee, Vincent; Prévost, Chantal
2017-01-01
Abstract During DNA recombination and repair, RecA family proteins must promote rapid joining of homologous DNA. Repeated sequences with >100 base pair lengths occupy more than 1% of bacterial genomes; however, commitment to strand exchange was believed to occur after testing ∼20–30 bp. If that were true, pairings between different copies of long repeated sequences would usually become irreversible. Our experiments reveal that in the presence of ATP hydrolysis even 75 bp sequence-matched strand exchange products remain quite reversible. Experiments also indicate that when ATP hydrolysis is present, flanking heterologous dsDNA regions increase the reversibility of sequence matched strand exchange products with lengths up to ∼75 bp. Results of molecular dynamics simulations provide insight into how ATP hydrolysis destabilizes strand exchange products. These results inspired a model that shows how pairings between long repeated sequences could be efficiently rejected even though most homologous pairings form irreversible products. PMID:28854739
Arrieta-Montiel, Maria P; Shedge, Vikas; Davila, Jaime; Christensen, Alan C; Mackenzie, Sally A
2009-12-01
The plant mitochondrial genome is recombinogenic, with DNA exchange activity controlled to a large extent by nuclear gene products. One nuclear gene, MSH1, appears to participate in suppressing recombination in Arabidopsis at every repeated sequence ranging in size from 108 to 556 bp. Present in a wide range of plant species, these mitochondrial repeats display evidence of successful asymmetric DNA exchange in Arabidopsis when MSH1 is disrupted. Recombination frequency appears to be influenced by repeat sequence homology and size, with larger size repeats corresponding to increased DNA exchange activity. The extensive mitochondrial genomic reorganization of the msh1 mutant produced altered mitochondrial transcription patterns. Comparison of mitochondrial genomes from the Arabidopsis ecotypes C24, Col-0, and Ler suggests that MSH1 activity accounts for most or all of the polymorphisms distinguishing these genomes, producing ecotype-specific stoichiometric changes in each line. Our observations suggest that MSH1 participates in mitochondrial genome evolution by influencing the lineage-specific pattern of mitochondrial genetic variation in higher plants.
Plasmid P1 replication: negative control by repeated DNA sequences.
Chattoraj, D; Cordes, K; Abeles, A
1984-01-01
The incompatibility locus, incA, of the unit-copy plasmid P1 is contained within a fragment that is essentially a set of nine 19-base-pair repeats. One or more copies of the fragment destabilizes the plasmid when present in trans. Here we show that extra copies of incA interfere with plasmid DNA replication and that a deletion of most of incA increases plasmid copy number. Thus, incA is not essential for replication but is required for its control. When cloned in a high-copy-number vector, pieces of the incA fragment that each contain only three repeats destabilize P1 plasmids efficiently. This result makes it unlikely that incA specifies a regulatory product. Our in vivo results suggest that the repeating DNA sequence itself negatively controls replication by titrating a P1-determined protein, RepA, that is essential for replication. Consistent with this hypothesis is the observation that the RepA protein binds to the incA fragment in vitro. Images PMID:6387706
Kim, Min Jung; Hwang, Kyung Hwan; Lee, Young-Seok; Park, Jae-Yoon; Kook, Joong-Ki
2011-03-01
The aim of this study was to develop Prevotella intermedia-specific PCR primers based on the P. intermedia-specific DNA probe. The P. intermedia-specific DNA probe was screened by inverted dot blot hybridization and confirmed by Southern blot hybridization. The nucleotide sequences of the species-specific DNA probes were determined using a chain termination method. Southern blot analysis showed that the DNA probe, Pig27, detected only the genomic DNA of P. intermedia strains. PCR showed that the PCR primers, Pin-F1/Pin-R1, had species-specificity for P. intermedia. The detection limits of the PCR primer sets were 0.4pg of the purified genomic DNA of P. intermedia ATCC 49046. These results suggest that the PCR primers, Pin-F1/Pin-R1, could be useful in the detection of P. intermedia as well as in the development of a PCR kit in epidemiological studies related to periodontal diseases. Crown Copyright © 2010. Published by Elsevier B.V. All rights reserved.
Zhu, Zhixuan; Gui, Songtao; Jin, Jing; Yi, Rong; Wu, Zhihua; Qian, Qian; Ding, Yi
2016-09-01
Centromeres on eukaryotic chromosomes consist of large arrays of DNA repeats that undergo very rapid evolution. Nelumbo nucifera Gaertn. (sacred lotus) is a phylogenetic relict and an aquatic perennial basal eudicot. Studies concerning the centromeres of this basal eudicot species could provide ancient evolutionary perspectives. In this study, we characterized the centromeric marker protein NnCenH3 (sacred lotus centromere-specific histone H3 variant), and used a chromatin immunoprecipitation (ChIP)-based technique to recover the NnCenH3 nucleosome-associated sequences of sacred lotus. The properties of the centromere-binding protein and DNA sequences revealed notable divergence between sacred lotus and other flowering plants, including the following factors: (i) an NnCenH3 alternative splicing variant comprising only a partial centromere-targeting domain, (ii) active genes with low transcription levels in the NnCenH3 nucleosomal regions, and (iii) the prevalence of the Ty1/copia class of long terminal repeat (LTR) retrotransposons in the centromeres of sacred lotus chromosomes. In addition, the dynamic natures of the centromeric region showed that some of the centromeric repeat DNA sequences originated from telomeric repeats, and a pair of centromeres on the dicentric chromosome 1 was inactive in the metaphase cells of sacred lotus. Our characterization of the properties of centromeric DNA structure within the sacred lotus genome describes a centromeric profile in ancient basal eudicots and might provide evidence of the origins and evolution of centromeres. Furthermore, the identification of centromeric DNA sequences is of great significance for the assembly of the sacred lotus genome. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
Slean, Meghan M.; Panigrahi, Gagan B.; Castel, Arturo López; Pearson, August B.; Tomkinson, Alan E.; Pearson, Christopher E.
2016-01-01
Typically disease-causing CAG/CTG repeats expand, but rare affected families can display high levels of contraction of the expanded repeat amongst offspring. Understanding instability is important since arresting expansions or enhancing contractions could be clinically beneficial. The MutSβ mismatch repair complex is required for CAG/CTG expansions in mice and patients. Oddly, by unknown mechanisms MutSβ-deficient mice incur contractions instead of expansions. Replication using CTG or CAG as the lagging strand template is known to cause contractions or expansions respectively; however, the interplay between replication and repair leading to this instability remains unclear. Towards understanding how repeat contractions may arise, we performed in vitro SV40-mediated replication of repeat-containing plasmids in the presence or absence of mismatch repair. Specifically, we separated repair from replication: Replication mediated by MutSβ- and MutSα-deficient human cells or cell extracts produced slipped-DNA heteroduplexes in the contraction- but not expansion-biased replication direction. Replication in the presence of MutSβ disfavoured the retention of replication products harbouring slipped-DNA heteroduplexes. Post-replication repair of slipped-DNAs by MutSβ-proficient extracts eliminated slipped-DNAs. Thus, a MutSβ-deficiency likely enhances repeat contractions because MutSβ protects against contractions by repairing template strand slip-outs. Replication deficient in LigaseI or PCNA-interaction mutant LigaseI revealed slipped-DNA formation at lagging strands. Our results reveal that distinct mechanisms lead to expansions or contractions and support inhibition of MutSβ as a therapeutic strategy to enhance the contraction of expanded repeats. PMID:27155933
Structural analysis of two length variants of the rDNA intergenic spacer from Eruca sativa.
Lakshmikumaran, M; Negi, M S
1994-03-01
Restriction enzyme analysis of the rRNA genes of Eruca sativa indicated the presence of many length variants within a single plant and also between different cultivars which is unusual for most crucifers studied so far. Two length variants of the rDNA intergenic spacer (IGS) from a single individual E. sativa (cv. Itsa) plant were cloned and characterized. The complete nucleotide sequences of both the variants (3 kb and 4 kb) were determined. The intergenic spacer contains three families of tandemly repeated DNA sequences denoted as A, B and C. However, the long (4 kb) variant shows the presence of an additional repeat, denoted as D, which is a duplication of a 224 bp sequence just upstream of the putative transcription initiation site. Repeat units belonging to the three different families (A, B and C) were in the size range of 22 to 30 bp. Such short repeat elements are present in the IGS of most of the crucifers analysed so far. Sequence analysis of the variants (3 kb and 4 kb) revealed that the length heterogeneity of the spacer is located at three different regions and is due to the varying copy numbers of repeat units belonging to families A and B. Length variation of the spacer is also due to the presence of a large duplication (D repeats) in the 4 kb variant which is absent in the 3 kb variant. The putative transcription initiation site was identified by comparisons with the rDNA sequences from other plant species.
Li, Lixin; Piatek, Marek J; Atef, Ahmed; Piatek, Agnieszka; Wibowo, Anjar; Fang, Xiaoyun; Sabir, J S M; Zhu, Jian-Kang; Mahfouz, Magdy M
2012-03-01
Transcription activator-like effectors (TALEs) can be used as DNA-targeting modules by engineering their repeat domains to dictate user-selected sequence specificity. TALEs have been shown to function as site-specific transcriptional activators in a variety of cell types and organisms. TALE nucleases (TALENs), generated by fusing the FokI cleavage domain to TALE, have been used to create genomic double-strand breaks. The identity of the TALE repeat variable di-residues, their number, and their order dictate the DNA sequence specificity. Because TALE repeats are nearly identical, their assembly by cloning or even by synthesis is challenging and time consuming. Here, we report the development and use of a rapid and straightforward approach for the construction of designer TALE (dTALE) activators and nucleases with user-selected DNA target specificity. Using our plasmid set of 100 repeat modules, researchers can assemble repeat domains for any 14-nucleotide target sequence in one sequential restriction-ligation cloning step and in only 24 h. We generated several custom dTALEs and dTALENs with new target sequence specificities and validated their function by transient expression in tobacco leaves and in vitro DNA cleavage assays, respectively. Moreover, we developed a web tool, called idTALE, to facilitate the design of dTALENs and the identification of their genomic targets and potential off-targets in the genomes of several model species. Our dTALE repeat assembly approach along with the web tool idTALE will expedite genome-engineering applications in a variety of cell types and organisms including plants.
Developmental validation of a Cannabis sativa STR multiplex system for forensic analysis.
Howard, Christopher; Gilmore, Simon; Robertson, James; Peakall, Rod
2008-09-01
A developmental validation study based on recommendations of the Scientific Working Group on DNA Analysis Methods (SWGDAM) was conducted on a multiplex system of 10 Cannabis sativa short tandem repeat loci. Amplification of the loci in four multiplex reactions was tested across DNA from dried root, stem, and leaf sources, and DNA from fresh, frozen, and dried leaf tissue with a template DNA range of 10.0-0.01 ng. The loci were amplified and scored consistently for all DNA sources when DNA template was in the range of 10.0-1.0 ng. Some allelic dropout and PCR failure occurred in reactions with lower template DNA amounts. Overall, amplification was best using 10.0 ng of template DNA from dried leaf tissue indicating that this is the optimal source material. Cross species amplification was observed in Humulus lupulus for three loci but there was no allelic overlap. This is the first study following SWGDAM validation guidelines to validate short tandem repeat markers for forensic use in plants.
Roh, Hwan-Jung; Mun, Sue Jean; Cho, Kyu-Sup; Hong, Sung-Lyong
2016-01-01
The recurrence rate of sinonasal inverted papillomas (SNIP) is 15-20%. However, few studies have investigated patient-dependent factors related to recurrence of SNIPs. To analyze risk factors, including human papilloma virus (HPV) infection and smoking, as well as other factors, for recurrence of SNIPs. Fifty-four patients who were diagnosed with SNIP and underwent surgery were enrolled: 39 men and 15 women, with the mean age of 54.0 years. Their mean follow-up was 40.6 months. Demographics and information about the history of smoking, previous surgery, tumor extent, follow-up, and recurrence were reviewed retrospectively. Those patients whose tumors were associated with malignant transformation were excluded in this study. HPV detection and genotyping in the tumor specimens were performed with the HPV DNA chip, a polymerase chain reaction-based DNA microarray system. Seven patients (13.0%) had recurrence, with a mean time to recurrence of 39.8 months. Recurrence rates in T1, T2, T3, and T4 of the Krouse staging system were 0% (0/4), 8.3% (2/24), 17.4% (4/23), and 33.3% (1/3), respectively (p > 0.5). Eight patients (14.8%) were positive for HPV DNA. All of these patients belonged to the group without recurrence (p > 0.5). However, recurrence rates according to HPV DNA positivity were not statistically different (0% versus 15.2%). Three (42.9%) in the group with recurrence and four (8.5%) in the group without recurrence were smokers (p < 0.5). Smoking was associated with recurrence of SNIP. However, HPV infection is not a recurrence of SNIP risk factor.
Variable presence of the inverted repeat and plastome stability in Erodium
Blazier, John C.; Jansen, Robert K.; Mower, Jeffrey P.; Govindu, Madhu; Zhang, Jin; Weng, Mao-Lun; Ruhlman, Tracey A.
2016-01-01
Background and Aims Several unrelated lineages such as plastids, viruses and plasmids, have converged on quadripartite genomes of similar size with large and small single copy regions and a large inverted repeat (IR). Except for Erodium (Geraniaceae), saguaro cactus and some legumes, the plastomes of all photosynthetic angiosperms display this structure. The functional significance of the IR is not understood and Erodium provides a system to examine the role of the IR in the long-term stability of these genomes. We compared the degree of genomic rearrangement in plastomes of Erodium that differ in the presence and absence of the IR. Methods We sequenced 17 new Erodium plastomes. Using 454, Illumina, PacBio and Sanger sequences, 16 genomes were assembled and categorized along with one incomplete and two previously published Erodium plastomes. We conducted phylogenetic analyses among these species using a dataset of 19 protein-coding genes and determined if significantly higher evolutionary rates had caused the long branch seen previously in phylogenetic reconstructions within the genus. Bioinformatic comparisons were also performed to evaluate plastome evolution across the genus. Key Results Erodium plastomes fell into four types (Type 1–4) that differ in their substitution rates, short dispersed repeat content and degree of genomic rearrangement, gene and intron content and GC content. Type 4 plastomes had significantly higher rates of synonymous substitutions (dS) for all genes and for 14 of the 19 genes non-synonymous substitutions (dN) were significantly accelerated. We evaluated the evidence for a single IR loss in Erodium and in doing so discovered that Type 4 plastomes contain a novel IR. Conclusions The presence or absence of the IR does not affect plastome stability in Erodium. Rather, the overall repeat content shows a negative correlation with genome stability, a pattern in agreement with other angiosperm groups and recent findings on genome stability in bacterial endosymbionts. PMID:27192713
Variable presence of the inverted repeat and plastome stability in Erodium.
Blazier, John C; Jansen, Robert K; Mower, Jeffrey P; Govindu, Madhu; Zhang, Jin; Weng, Mao-Lun; Ruhlman, Tracey A
2016-06-01
Several unrelated lineages such as plastids, viruses and plasmids, have converged on quadripartite genomes of similar size with large and small single copy regions and a large inverted repeat (IR). Except for Erodium (Geraniaceae), saguaro cactus and some legumes, the plastomes of all photosynthetic angiosperms display this structure. The functional significance of the IR is not understood and Erodium provides a system to examine the role of the IR in the long-term stability of these genomes. We compared the degree of genomic rearrangement in plastomes of Erodium that differ in the presence and absence of the IR. We sequenced 17 new Erodium plastomes. Using 454, Illumina, PacBio and Sanger sequences, 16 genomes were assembled and categorized along with one incomplete and two previously published Erodium plastomes. We conducted phylogenetic analyses among these species using a dataset of 19 protein-coding genes and determined if significantly higher evolutionary rates had caused the long branch seen previously in phylogenetic reconstructions within the genus. Bioinformatic comparisons were also performed to evaluate plastome evolution across the genus. Erodium plastomes fell into four types (Type 1-4) that differ in their substitution rates, short dispersed repeat content and degree of genomic rearrangement, gene and intron content and GC content. Type 4 plastomes had significantly higher rates of synonymous substitutions (dS) for all genes and for 14 of the 19 genes non-synonymous substitutions (dN) were significantly accelerated. We evaluated the evidence for a single IR loss in Erodium and in doing so discovered that Type 4 plastomes contain a novel IR. The presence or absence of the IR does not affect plastome stability in Erodium. Rather, the overall repeat content shows a negative correlation with genome stability, a pattern in agreement with other angiosperm groups and recent findings on genome stability in bacterial endosymbionts. © The Author 2016. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Characterization of mango (Mangifera indica L.) transcriptome and chloroplast genome.
Azim, M Kamran; Khan, Ishtaiq A; Zhang, Yong
2014-05-01
We characterized mango leaf transcriptome and chloroplast genome using next generation DNA sequencing. The RNA-seq output of mango transcriptome generated >12 million reads (total nucleotides sequenced >1 Gb). De novo transcriptome assembly generated 30,509 unigenes with lengths in the range of 300 to ≥3,000 nt and 67× depth of coverage. Blast searching against nonredundant nucleotide databases and several Viridiplantae genomic datasets annotated 24,593 mango unigenes (80% of total) and identified Citrus sinensis as closest neighbor of mango with 9,141 (37%) matched sequences. The annotation with gene ontology and Clusters of Orthologous Group terms categorized unigene sequences into 57 and 25 classes, respectively. More than 13,500 unigenes were assigned to 293 KEGG pathways. Besides major plant biology related pathways, KEGG based gene annotation pointed out active presence of an array of biochemical pathways involved in (a) biosynthesis of bioactive flavonoids, flavones and flavonols, (b) biosynthesis of terpenoids and lignins and (c) plant hormone signal transduction. The mango transcriptome sequences revealed 235 proteases belonging to five catalytic classes of proteolytic enzymes. The draft genome of mango chloroplast (cp) was obtained by a combination of Sanger and next generation sequencing. The draft mango cp genome size is 151,173 bp with a pair of inverted repeats of 27,093 bp separated by small and large single copy regions, respectively. Out of 139 genes in mango cp genome, 91 found to be protein coding. Sequence analysis revealed cp genome of C. sinensis as closest neighbor of mango. We found 51 short repeats in mango cp genome supposed to be associated with extensive rearrangements. This is the first report of transcriptome and chloroplast genome analysis of any Anacardiaceae family member.
Martin, Guillaume; Baurens, Franc-Christophe; Cardi, Céline; Aury, Jean-Marc; D’Hont, Angélique
2013-01-01
Background Banana (genus Musa) is a crop of major economic importance worldwide. It is a monocotyledonous member of the Zingiberales, a sister group of the widely studied Poales. Most cultivated bananas are natural Musa inter-(sub-)specific triploid hybrids. A Musa acuminata reference nuclear genome sequence was recently produced based on sequencing of genomic DNA enriched in nucleus. Methodology/Principal Findings The Musa acuminata chloroplast genome was assembled with chloroplast reads extracted from whole-genome-shotgun sequence data. The Musa chloroplast genome is a circular molecule of 169,972 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC, 88,338 bp) and a Small Single Copy region (SSC, 10,768 bp) separated by Inverted Repeat regions (IRs, 35,433 bp). Two forms of the chloroplast genome relative to the orientation of SSC versus LSC were found. The Musa chloroplast genome shows an extreme IR expansion at the IR/SSC boundary relative to the most common structures found in angiosperms. This expansion consists of the integration of three additional complete genes (rps15, ndhH and ycf1) and part of the ndhA gene. No such expansion has been observed in monocots so far. Simple Sequence Repeats were identified in the Musa chloroplast genome and a new set of Musa chloroplastic markers was designed. Conclusion The complete sequence of M. acuminata ssp malaccensis chloroplast we reported here is the first one for the Zingiberales order. As such it provides new insight in the evolution of the chloroplast of monocotyledons. In particular, it reinforces that IR/SSC expansion has occurred independently several times within monocotyledons. The discovery of new polymorphic markers within Musa chloroplast opens new perspectives to better understand the origin of cultivated triploid bananas. PMID:23840670
Martin, Guillaume; Baurens, Franc-Christophe; Cardi, Céline; Aury, Jean-Marc; D'Hont, Angélique
2013-01-01
Banana (genus Musa) is a crop of major economic importance worldwide. It is a monocotyledonous member of the Zingiberales, a sister group of the widely studied Poales. Most cultivated bananas are natural Musa inter-(sub-)specific triploid hybrids. A Musa acuminata reference nuclear genome sequence was recently produced based on sequencing of genomic DNA enriched in nucleus. The Musa acuminata chloroplast genome was assembled with chloroplast reads extracted from whole-genome-shotgun sequence data. The Musa chloroplast genome is a circular molecule of 169,972 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC, 88,338 bp) and a Small Single Copy region (SSC, 10,768 bp) separated by Inverted Repeat regions (IRs, 35,433 bp). Two forms of the chloroplast genome relative to the orientation of SSC versus LSC were found. The Musa chloroplast genome shows an extreme IR expansion at the IR/SSC boundary relative to the most common structures found in angiosperms. This expansion consists of the integration of three additional complete genes (rps15, ndhH and ycf1) and part of the ndhA gene. No such expansion has been observed in monocots so far. Simple Sequence Repeats were identified in the Musa chloroplast genome and a new set of Musa chloroplastic markers was designed. The complete sequence of M. acuminata ssp malaccensis chloroplast we reported here is the first one for the Zingiberales order. As such it provides new insight in the evolution of the chloroplast of monocotyledons. In particular, it reinforces that IR/SSC expansion has occurred independently several times within monocotyledons. The discovery of new polymorphic markers within Musa chloroplast opens new perspectives to better understand the origin of cultivated triploid bananas.
2010-01-01
Background The cultivated olive (Olea europaea L.) is the most agriculturally important species of the Oleaceae family. Although many studies have been performed on plastid polymorphisms to evaluate taxonomy, phylogeny and phylogeography of Olea subspecies, only few polymorphic regions discriminating among the agronomically and economically important olive cultivars have been identified. The objective of this study was to sequence the entire plastome of olive and analyze many potential polymorphic regions to develop new inter-cultivar genetic markers. Results The complete plastid genome of the olive cultivar Frantoio was determined by direct sequence analysis using universal and novel PCR primers designed to amplify all overlapping regions. The chloroplast genome of the olive has an organisation and gene order that is conserved among numerous Angiosperm species and do not contain any of the inversions, gene duplications, insertions, inverted repeat expansions and gene/intron losses that have been found in the chloroplast genomes of the genera Jasminum and Menodora, from the same family as Olea. The annotated sequence was used to evaluate the content of coding genes, the extent, and distribution of repeated and long dispersed sequences and the nucleotide composition pattern. These analyses provided essential information for structural, functional and comparative genomic studies in olive plastids. Furthermore, the alignment of the olive plastome sequence to those of other varieties and species identified 30 new organellar polymorphisms within the cultivated olive. Conclusions In addition to identifying mutations that may play a functional role in modifying the metabolism and adaptation of olive cultivars, the new chloroplast markers represent a valuable tool to assess the level of olive intercultivar plastome variation for use in population genetic analysis, phylogenesis, cultivar characterisation and DNA food tracking. PMID:20868482
2012-01-01
Background The complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annotating the genome sequences are in urgent need. Results We have developed a web server CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR) are identified using tRNAscan, ARAGORN and vmatch respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extractions of protein and mRNA sequences for given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analyses repeatedly. Using known chloroplast genome sequences as test set, we show that CPGAVAS performs comparably to another application DOGMA, while having several superior functionalities. Conclusions CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensible tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas. PMID:23256920
Analysis of repeat-mediated deletions in the mitochondrial genome of Saccharomyces cerevisiae.
Phadnis, Naina; Sia, Rey A; Sia, Elaine A
2005-12-01
Mitochondrial DNA deletions and point mutations accumulate in an age-dependent manner in mammals. The mitochondrial genome in aging humans often displays a 4977-bp deletion flanked by short direct repeats. Additionally, direct repeats flank two-thirds of the reported mitochondrial DNA deletions. The mechanism by which these deletions arise is unknown, but direct-repeat-mediated deletions involving polymerase slippage, homologous recombination, and nonhomologous end joining have been proposed. We have developed a genetic reporter to measure the rate at which direct-repeat-mediated deletions arise in the mitochondrial genome of Saccharomyces cerevisiae. Here we analyze the effect of repeat size and heterology between repeats on the rate of deletions. We find that the dependence on homology for repeat-mediated deletions is linear down to 33 bp. Heterology between repeats does not affect the deletion rate substantially. Analysis of recombination products suggests that the deletions are produced by at least two different pathways, one that generates only deletions and one that appears to generate both deletions and reciprocal products of recombination. We discuss how this reporter may be used to identify the proteins in yeast that have an impact on the generation of direct-repeat-mediated deletions.
Analysis of Repeat-Mediated Deletions in the Mitochondrial Genome of Saccharomyces cerevisiae
Phadnis, Naina; Sia, Rey A.; Sia, Elaine A.
2005-01-01
Mitochondrial DNA deletions and point mutations accumulate in an age-dependent manner in mammals. The mitochondrial genome in aging humans often displays a 4977-bp deletion flanked by short direct repeats. Additionally, direct repeats flank two-thirds of the reported mitochondrial DNA deletions. The mechanism by which these deletions arise is unknown, but direct-repeat-mediated deletions involving polymerase slippage, homologous recombination, and nonhomologous end joining have been proposed. We have developed a genetic reporter to measure the rate at which direct-repeat-mediated deletions arise in the mitochondrial genome of Saccharomyces cerevisiae. Here we analyze the effect of repeat size and heterology between repeats on the rate of deletions. We find that the dependence on homology for repeat-mediated deletions is linear down to 33 bp. Heterology between repeats does not affect the deletion rate substantially. Analysis of recombination products suggests that the deletions are produced by at least two different pathways, one that generates only deletions and one that appears to generate both deletions and reciprocal products of recombination. We discuss how this reporter may be used to identify the proteins in yeast that have an impact on the generation of direct-repeat-mediated deletions. PMID:16157666