Sample records for insertion sequence element

  1. Compositions and methods for the expression of selenoproteins in eukaryotic cells

    DOEpatents

    Gladyshev, Vadim [Lincoln, NE; Novoselov, Sergey [Puschino, RU

    2012-09-25

    Recombinant nucleic acid constructs for the efficient expression of eukaryotic selenoproteins and related methods for production of recombinant selenoproteins are provided. The nucleic acid constructs comprise novel selenocysteine insertion sequence (SECIS) elements. Certain novel SECIS elements of the invention contain non-canonical quartet sequences. Other novel SECIS elements provided by the invention are chimeric SECIS elements comprising a canonical SECIS element that contains a non-canonical quartet sequence and chimeric SECIS elements comprising a non-canonical SECIS element that contains a canonical quartet sequence. The novel SECIS elements of the invention facilitate the insertion of selenocysteine residues into recombinant polypeptides.

  2. The Genome Sequence of Avibacterium paragallinarum Strain CL Has a Large Repertoire of Insertion Sequence Elements.

    PubMed

    Horta-Valerdi, Guillermo; Sanchez-Alonso, Maria Patricia; Perez-Marquez, Victor M; Negrete-Abascal, Erasmo; Vaca-Pacheco, Sergio; Hernandez-Gonzalez, Ismael; Gomez-Lunar, Zulema; Olmedo-Álvarez, Gabriela; Vázquez-Cruz, Candelario

    2017-04-13

    The draft genome sequence of Avibacterium paragallinarum strain CL serovar C is reported here. The genome comprises 154 contigs corresponding to 2.4 Mb with 41% G+C content and many insertion sequence (IS) elements, a characteristic not previously reported in A. paragallinarum . Copyright © 2017 Horta-Valerdi et al.

  3. Effects of a Transposable Element Insertion on Alcohol Dehydrogenase Expression in Drosophila Melanogaster

    PubMed Central

    Dunn, R. C.; Laurie, C. C.

    1995-01-01

    Variation in the DNA sequence and level of alcohol dehydrogenase (Adh) gene expression in Drosophila melanogaster have been studied to determine what types of DNA polymorphisms contribute to phenotypic variation in natural populations. The Adh gene, like many others, shows a high level of variability in both DNA sequence and quantitative level of expression. A number of transposable element insertions occur in the Adh region and one of these, a copia insertion in the 5' flanking region, is associated with unusually low Adh expression. To determine whether this insertion (called RI42) causes the low expression level, the insertion was excised from the cloned RI42 Adh gene and the effect was assessed by P-element transformation. Removal of this insertion causes a threefold increase in the level of ADH, clearly showing that it contributes to the naturally occurring variation in expression at this locus. Removal of all but one LTR also causes a threefold increase, indicating that the mechanism is not a simple sequence disruption. Furthermore, this copia insertion, which is located between the two Adh promoters and their upstream enhancer sequences, has differential effects on the levels of proximal and distal transcripts. Finally, a test for the possible modifying effects of two suppressor loci, su(w(a)) and su(f), on this insertional mutation was negative, in contrast to a previous report in the literature. PMID:7498745

  4. Gift from statistical learning: Visual statistical learning enhances memory for sequence elements and impairs memory for items that disrupt regularities.

    PubMed

    Otsuka, Sachio; Saiki, Jun

    2016-02-01

    Prior studies have shown that visual statistical learning (VSL) enhances familiarity (a type of memory) of sequences. How do statistical regularities influence the processing of each triplet element and inserted distractors that disrupt the regularity? Given that increased attention to triplets induced by VSL and inhibition of unattended triplets, we predicted that VSL would promote memory for each triplet constituent, and degrade memory for inserted stimuli. Across the first two experiments, we found that objects from structured sequences were more likely to be remembered than objects from random sequences, and that letters (Experiment 1) or objects (Experiment 2) inserted into structured sequences were less likely to be remembered than those inserted into random sequences. In the subsequent two experiments, we examined an alternative account for our results, whereby the difference in memory for inserted items between structured and random conditions is due to individuation of items within random sequences. Our findings replicated even when control letters (Experiment 3A) or objects (Experiment 3B) were presented before or after, rather than inserted into, random sequences. Our findings suggest that statistical learning enhances memory for each item in a regular set and impairs memory for items that disrupt the regularity. Copyright © 2015 Elsevier B.V. All rights reserved.

  5. Transposon Insertion Finder (TIF): a novel program for detection of de novo transpositions of transposable elements.

    PubMed

    Nakagome, Mariko; Solovieva, Elena; Takahashi, Akira; Yasue, Hiroshi; Hirochika, Hirohiko; Miyao, Akio

    2014-03-14

    Transposition event detection of transposable element (TE) in the genome using short reads from the next-generation sequence (NGS) was difficult, because the nucleotide sequence of TE itself is repetitive, making it difficult to identify locations of its insertions by alignment programs for NGS. We have developed a program with a new algorithm to detect the transpositions from NGS data. In the process of tool development, we used next-generation sequence (NGS) data of derivative lines (ttm2 and ttm5) of japonica rice cv. Nipponbare, regenerated through cell culture. The new program, called a transposon insertion finder (TIF), was applied to detect the de novo transpositions of Tos17 in the regenerated lines. TIF searched 300 million reads of a line within 20 min, identifying 4 and 12 de novo transposition in ttm2 and ttm5 lines, respectively. All of the transpositions were confirmed by PCR/electrophoresis and sequencing. Using the program, we also detected new transposon insertions of P-element from NGS data of Drosophila melanogaster. TIF operates to find the transposition of any elements provided that target site duplications (TSDs) are generated by their transpositions.

  6. In and out of the rRNA genes: characterization of Pokey elements in the sequenced Daphnia genome

    PubMed Central

    2013-01-01

    Background Only a few transposable elements are known to exhibit site-specific insertion patterns, including the well-studied R-element retrotransposons that insert into specific sites within the multigene rDNA. The only known rDNA-specific DNA transposon, Pokey (superfamily: piggyBac) is found in the freshwater microcrustacean, Daphnia pulex. Here, we present a genome-wide analysis of Pokey based on the recently completed whole genome sequencing project for D. pulex. Results Phylogenetic analysis of Pokey elements recovered from the genome sequence revealed the presence of four lineages corresponding to two divergent autonomous families and two related lineages of non-autonomous miniature inverted repeat transposable elements (MITEs). The MITEs are also found at the same 28S rRNA gene insertion site as the Pokey elements, and appear to have arisen as deletion derivatives of autonomous elements. Several copies of the full-length Pokey elements may be capable of producing an active transposase. Surprisingly, both families of Pokey possess a series of 200 bp repeats upstream of the transposase that is derived from the rDNA intergenic spacer (IGS). The IGS sequences within the Pokey elements appear to be evolving in concert with the rDNA units. Finally, analysis of the insertion sites of Pokey elements outside of rDNA showed a target preference for sites similar to the specific sequence that is targeted within rDNA. Conclusions Based on the target site preference of Pokey elements and the concerted evolution of a segment of the element with the rDNA unit, we propose an evolutionary path by which the ancestors of Pokey elements have invaded the rDNA niche. We discuss how specificity for the rDNA unit may have evolved and how this specificity has played a role in the long-term survival of these elements in the subgenus Daphnia. PMID:24059783

  7. Insertion sequences enrichment in extreme Red sea brine pool vent.

    PubMed

    Elbehery, Ali H A; Aziz, Ramy K; Siam, Rania

    2017-03-01

    Mobile genetic elements are major agents of genome diversification and evolution. Limited studies addressed their characteristics, including abundance, and role in extreme habitats. One of the rare natural habitats exposed to multiple-extreme conditions, including high temperature, salinity and concentration of heavy metals, are the Red Sea brine pools. We assessed the abundance and distribution of different mobile genetic elements in four Red Sea brine pools including the world's largest known multiple-extreme deep-sea environment, the Red Sea Atlantis II Deep. We report a gradient in the abundance of mobile genetic elements, dramatically increasing in the harshest environment of the pool. Additionally, we identified a strong association between the abundance of insertion sequences and extreme conditions, being highest in the harshest and deepest layer of the Red Sea Atlantis II Deep. Our comparative analyses of mobile genetic elements in secluded, extreme and relatively non-extreme environments, suggest that insertion sequences predominantly contribute to polyextremophiles genome plasticity.

  8. The site-specific ribosomal DNA insertion element R1Bm belongs to a class of non-long-terminal-repeat retrotransposons.

    PubMed Central

    Xiong, Y; Eickbush, T H

    1988-01-01

    Two types of insertion elements, R1 and R2 (previously called type I and type II), are known to interrupt the 28S ribosomal genes of several insect species. In the silkmoth, Bombyx mori, each element occupies approximately 10% of the estimated 240 ribosomal DNA units, while at most only a few copies are located outside the ribosomal DNA units. We present here the complete nucleotide sequence of an R1 insertion from B. mori (R1Bm). This 5.1-kilobase element contains two overlapping open reading frames (ORFs) which together occupy 88% of its length. ORF1 is 461 amino acids in length and exhibits characteristics of retroviral gag genes. ORF2 is 1,051 amino acids in length and contains homology to reverse transcriptase-like enzymes. The analysis of 3' and 5' ends of independent isolates from the ribosomal locus supports the suggestion that R1 is still functioning as a transposable element. The precise location of the element within the genome implies that its transposition must occur with remarkable insertion sequence specificity. Comparison of the deduced amino acid sequences from six retrotransposons, R1 and R2 of B. mori, I factor and F element of Drosophila melanogaster, L1 of Mus domesticus, and Ingi of Trypanosoma brucei, reveals a relatively high level of sequence homology in the reverse transcriptase region. Like R1, these elements lack long terminal repeats. We have therefore named this class of related elements the non-long-terminal-repeat (non-LTR) retrotransposons. Images PMID:2447482

  9. Germ line insertion of mtDNA at the breakpoint junction of a reciprocal constitutional translocation.

    PubMed

    Willett-Brozick, J E; Savul, S A; Richey, L E; Baysal, B E

    2001-08-01

    Constitutional chromosomal translocations are relatively common causes of human morbidity, yet the DNA double-strand break (DSB) repair mechanisms that generate them are incompletely understood. We cloned, sequenced and analyzed the breakpoint junctions of a familial constitutional reciprocal translocation t(9;11)(p24;q23). Within the 10-kb region flanking the breakpoints, chromosome 11 had 25% repeat elements, whereas chromosome 9 had 98% repeats, 95% of which were L1-type LINE elements. The breakpoints occurred within an L1-type repeat element at 9p24 and at the 3'-end of an Alu sequence at 11q23. At the breakpoint junction of derivative chromosome 9, we discovered an unusually large 41-bp insertion, which showed 100% identity to 12S mitochondrial DNA (mtDNA) between nucleotides 896 and 936 of the mtDNA sequence. Analysis of the human genome failed to show the preexistence of the inserted sequence at normal chromosomes 9 and 11 breakpoint junctions or elsewhere in the genome, strongly suggesting that the insertion was derived from human mtDNA and captured into the junction during the DSB repair process. To our knowledge, these findings represent the first observation of spontaneous germ line insertion of modern human mtDNA sequences and suggest that DSB repair may play a role in inter-organellar gene transfer in vivo. Our findings also provide evidence for a previously unrecognized insertional mechanism in human, by which non-mobile extra-chromosomal fragments can be inserted into the genome at DSB repair junctions.

  10. Characterization of the Fb-Nof Transposable Element of Drosophila Melanogaster

    PubMed Central

    Harden, N.; Ashburner, M.

    1990-01-01

    FB-NOF is a composite transposable element of Drosophila melanogaster. It is composed of foldback sequences, of variable length, which flank a 4-kb NOF sequence with 308-bp inverted repeat termini. The NOF sequence could potentially code for a 120-kD polypeptide. The FB-NOF element is responsible for unstable mutations of the white gene (w(c) and w(DZL)) and is associated with the large TEs of G. Ising. Although most strains of D. melanogaster have 20-30 sites of FB insertion, FB-NOF elements are usually rare, many strains lack this composite element or have only one copy of it. A few strains, including w(DZL) and Basc have many (8-21) copies of FB-NOF, and these show a tendency to insert at ``hot-spots.'' These strains also have an increased number of FB elements. The DNA sequence of the NOF region associated with TE146(Z) has been determined. PMID:2174013

  11. Mobile element biology – new possibilities with high-throughput sequencing

    PubMed Central

    Xing, Jinchuan; Witherspoon, David J.; Jorde, Lynn B.

    2014-01-01

    Mobile elements compose more than half of the human genome, but until recently their large-scale detection was time-consuming and challenging. With the development of new high-throughput sequencing technologies, the complete spectrum of mobile element variation in humans can now be identified and analyzed. Thousands of new mobile element insertions have been discovered, yielding new insights into mobile element biology, evolution, and genomic variation. We review several high-throughput methods, with an emphasis on techniques that specifically target mobile element insertions in humans, and we highlight recent applications of these methods in evolutionary studies and in the analysis of somatic alterations in human cancers. PMID:23312846

  12. Vector for IS element entrapment and functional characterization based on turning on expression of distal promoterless genes.

    PubMed

    Szeverényi, I; Hodel, A; Arber, W; Olasz, F

    1996-09-26

    We constructed and characterized a novel trap vector for rapid isolation of insertion sequences. The strategy used for the isolation of IS elements is based on the ability of many IS elements to turn on the expression of otherwise silent genes distal to some sites of insertion. The simple transposition of an IS element can sometimes cause the constitutive expression of promoterless antibiotic resistance genes resulting in selectable phenotypes. The trap vector pAW1326 is based on a pBR322 replicon, it carries ampicillin and streptomycin resistance genes, and also silenced genes that confer chloramphenicol and kanamycin resistance once activated. The trap vector pAW1326 proved to be efficient and 85 percent of all isolated mutations were insertions. The majority of IS elements resident in the studied Escherichia coli strains tested became trapped, namely IS2, IS3, IS5, IS150, IS186 and Tn1000. We also encountered an insertion sequence, called IS10L/R-2, which is a hybrid of the two IS variants IS10L and IS10R. IS10L/R-2 is absent from most E. coli strains, but it is detectable in some strains such as JM109 which had been submitted to Tn10 mutagenesis. The distribution of the insertion sequences within the trap region was not random. Rather, the integration of chromosomal mobile genetic elements into the offered target sequence occurred in element-specific clusters. This is explained both by the target specificity and by the specific requirements for the activation of gene transcription by the DNA rearrangement. The employed trap vector pAW1326 proved to be useful for the isolation of mobile genetic elements, for a demonstration of their transposition activity as well as for the further characterization of some of the functional parameters of transposition.

  13. Retrotransposon insertion targeting: a mechanism for homogenization of centromere sequences on nonhomologous chromosomes.

    PubMed

    Birchler, James A; Presting, Gernot G

    2012-04-01

    The centromeres of most eukaryotic organisms consist of highly repetitive arrays that are similar across nonhomologous chromosomes. These sequences evolve rapidly, thus posing a mystery as to how such arrays can be homogenized. Recent work in species in which centromere-enriched retrotransposons occur indicates that these elements preferentially insert into the centromeric regions. In two different Arabidopsis species, a related element was recognized in which the specificity for such targeting was altered. These observations provide a partial explanation for how homogenization of centromere DNA sequences occurs.

  14. Orangutan Alu quiescence reveals possible source element: support for ancient backseat drivers

    PubMed Central

    2012-01-01

    Background Sequence analysis of the orangutan genome revealed that recent proliferative activity of Alu elements has been uncharacteristically quiescent in the Pongo (orangutan) lineage, compared with all previously studied primate genomes. With relatively few young polymorphic insertions, the genomic landscape of the orangutan seemed like the ideal place to search for a driver, or source element, of Alu retrotransposition. Results Here we report the identification of a nearly pristine insertion possessing all the known putative hallmarks of a retrotranspositionally competent Alu element. It is located in an intronic sequence of the DGKB gene on chromosome 7 and is highly conserved in Hominidae (the great apes), but absent from Hylobatidae (gibbon and siamang). We provide evidence for the evolution of a lineage-specific subfamily of this shared Alu insertion in orangutans and possibly the lineage leading to humans. In the orangutan genome, this insertion contains three orangutan-specific diagnostic mutations which are characteristic of the youngest polymorphic Alu subfamily, AluYe5b5_Pongo. In the Homininae lineage (human, chimpanzee and gorilla), this insertion has acquired three different mutations which are also found in a single human-specific Alu insertion. Conclusions This seemingly stealth-like amplification, ongoing at a very low rate over millions of years of evolution, suggests that this shared insertion may represent an ancient backseat driver of Alu element expansion. PMID:22541534

  15. Orangutan Alu quiescence reveals possible source element: support for ancient backseat drivers.

    PubMed

    Walker, Jerilyn A; Konkel, Miriam K; Ullmer, Brygg; Monceaux, Christopher P; Ryder, Oliver A; Hubley, Robert; Smit, Arian Fa; Batzer, Mark A

    2012-04-30

    Sequence analysis of the orangutan genome revealed that recent proliferative activity of Alu elements has been uncharacteristically quiescent in the Pongo (orangutan) lineage, compared with all previously studied primate genomes. With relatively few young polymorphic insertions, the genomic landscape of the orangutan seemed like the ideal place to search for a driver, or source element, of Alu retrotransposition. Here we report the identification of a nearly pristine insertion possessing all the known putative hallmarks of a retrotranspositionally competent Alu element. It is located in an intronic sequence of the DGKB gene on chromosome 7 and is highly conserved in Hominidae (the great apes), but absent from Hylobatidae (gibbon and siamang). We provide evidence for the evolution of a lineage-specific subfamily of this shared Alu insertion in orangutans and possibly the lineage leading to humans. In the orangutan genome, this insertion contains three orangutan-specific diagnostic mutations which are characteristic of the youngest polymorphic Alu subfamily, AluYe5b5_Pongo. In the Homininae lineage (human, chimpanzee and gorilla), this insertion has acquired three different mutations which are also found in a single human-specific Alu insertion. This seemingly stealth-like amplification, ongoing at a very low rate over millions of years of evolution, suggests that this shared insertion may represent an ancient backseat driver of Alu element expansion.

  16. Diagnostic use of computational retrotransposon detection: Successful definition of pathogenetic mechanism in a ciliopathy phenotype.

    PubMed

    Takenouchi, Toshiki; Kuchikata, Tomu; Yoshihashi, Hiroshi; Fujiwara, Mineko; Uehara, Tomoko; Miyama, Sahoko; Yamada, Shiro; Kosaki, Kenjiro

    2017-05-01

    Among more than 5,000 human monogenic disorders with known causative genes, transposable element insertion of a Long Interspersed Nuclear Element 1 (LINE1, L1) is known as the mechanistic basis in only 13 genetic conditions. Meckel-Gruber syndrome is a rare ciliopathy characterized by occipital encephalocele and cystic kidney disease. Here, we document a boy with occipital encephalocele, post-axial polydactyly, and multicystic renal disease. A medical exome analysis detected a heterozygous frameshift mutation, c.4582_4583delCG p.(Arg1528Serfs*17) in CC2D2A in the maternally derived allele. The further use of a dedicated bioinformatics algorithm for detecting retrotransposon insertions led to the detection of an L1 insertion affecting exon 7 in the paternally derived allele. The complete sequencing and sequence homology analysis of the inserted L1 element showed that the L1 element was classified as L1HS (L1 human specific) and that the element had intact open reading frames in the two L1-encoded proteins. This observation ranks Meckel-Gruber syndrome as only the 14th disorder to be caused by an L1 insertion among more than 5,000 known human genetic disorders. Although a transposable element detection algorithm is not included in the current best-practice next-generation sequencing analysis, the present observation illustrates the utility of such an algorithm, which would require modest computational time and resources. Whether the seemingly infrequent recognition of L1 insertion in the pathogenesis of human genetic diseases might simply reflect a lack of appropriate detection methods remains to be seen. © 2017 Wiley Periodicals, Inc.

  17. The site-specific ribosomal insertion element type II of Bombyx mori (R2Bm) contains the coding sequence for a reverse transcriptase-like enzyme.

    PubMed Central

    Burke, W D; Calalang, C C; Eickbush, T H

    1987-01-01

    Two classes of DNA elements interrupt a fraction of the rRNA repeats of Bombyx mori. We have analyzed by genomic blotting and sequence analysis one class of these elements which we have named R2. These elements occupy approximately 9% of the rDNA units of B. mori and appear to be homologous to the type II rDNA insertions detected in Drosophila melanogaster. Approximately 25 copies of R2 exist within the B. mori genome, of which at least 20 are located at a precise location within otherwise typical rDNA units. Nucleotide sequence analysis has revealed that the 4.2-kilobase-pair R2 element has a single large open reading frame, occupying over 82% of the total length of the element. The central region of this 1,151-amino-acid open reading frame shows homology to the reverse transcriptase enzymes found in retroviruses and certain transposable elements. Amino acid homology of this region is highest to the mobile line 1 elements of mammals, followed by the mitochondrial type II introns of fungi, and the pol gene of retroviruses. Less homology exists with transposable elements of D. melanogaster and Saccharomyces cerevisiae. Two additional regions of sequence homology between L1 and R2 elements were also found outside the reverse transcriptase region. We suggest that the R2 elements are retrotransposons that are site specific in their insertion into the genome. Such mobility would enable these elements to occupy a small fraction of the rDNA units of B. mori despite their continual elimination from the rDNA locus by sequence turnover. Images PMID:2439905

  18. Isolation and molecular characterization of dTnp1, a mobile and defective transposable element of Nicotiana plumbaginifolia.

    PubMed

    Meyer, C; Pouteau, S; Rouzé, P; Caboche, M

    1994-01-01

    By Northern blot analysis of nitrate reductase-deficient mutants of Nicotiana plumbaginifolia, we identified a mutant (mutant D65), obtained after gamma-ray irradiation of protoplasts, which contained an insertion sequence in the nitrate reductase (NR) mRNA. This insertion sequence was localized by polymerase chain reaction (PCR) in the first exon of NR and was also shown to be present in the NR gene. The mutant gene contained a 565 bp insertion sequence that exhibits the sequence characteristics of a transposable element, which was thus named dTnp1. The dTnp1 element has 14 bp terminal inverted repeats and is flanked by an 8-bp target site duplication generated upon transposition. These inverted repeats have significant sequence homology with those of other transposable elements. Judging by its size and the absence of a long open reading frame, dTnp1 appears to represent a defective, although mobile, transposable element. The octamer motif TTTAGGCC was found several times in direct orientation near the 5' and 3' ends of dTnp1 together with a perfect palindrome located after the 5' inverted repeat. Southern blot analysis using an internal probe of dTnp1 suggested that this element occurs as a single copy in the genome of N. plumbaginifolia. It is also present in N. tabacum, but absent in tomato or petunia. The dTnp1 element is therefore of potential use for gene tagging in Nicotiana species.

  19. QuickMap: a public tool for large-scale gene therapy vector insertion site mapping and analysis.

    PubMed

    Appelt, J-U; Giordano, F A; Ecker, M; Roeder, I; Grund, N; Hotz-Wagenblatt, A; Opelz, G; Zeller, W J; Allgayer, H; Fruehauf, S; Laufs, S

    2009-07-01

    Several events of insertional mutagenesis in pre-clinical and clinical gene therapy studies have created intense interest in assessing the genomic insertion profiles of gene therapy vectors. For the construction of such profiles, vector-flanking sequences detected by inverse PCR, linear amplification-mediated-PCR or ligation-mediated-PCR need to be mapped to the host cell's genome and compared to a reference set. Although remarkable progress has been achieved in mapping gene therapy vector insertion sites, public reference sets are lacking, as are the possibilities to quickly detect non-random patterns in experimental data. We developed a tool termed QuickMap, which uniformly maps and analyzes human and murine vector-flanking sequences within seconds (available at www.gtsg.org). Besides information about hits in chromosomes and fragile sites, QuickMap automatically determines insertion frequencies in +/- 250 kb adjacency to genes, cancer genes, pseudogenes, transcription factor and (post-transcriptional) miRNA binding sites, CpG islands and repetitive elements (short interspersed nuclear elements (SINE), long interspersed nuclear elements (LINE), Type II elements and LTR elements). Additionally, all experimental frequencies are compared with the data obtained from a reference set, containing 1 000 000 random integrations ('random set'). Thus, for the first time a tool allowing high-throughput profiling of gene therapy vector insertion sites is available. It provides a basis for large-scale insertion site analyses, which is now urgently needed to discover novel gene therapy vectors with 'safe' insertion profiles.

  20. [Construction of a general AAV vector regulated by minimal and artificial hypoxic-responsive element].

    PubMed

    Nie, Xiao-wei; Sun, Li-jun; Hao, Yue-wen; Yang, Guang-xiao; Wang, Quan-ying

    2011-03-01

    To synthesize the minimal and artificial HRE, and to insert it into the anterior extremity of CMV promoter of a AAV plasmid, and then to construct the AAV regulated by hypoxic-responsive element which was introduced into 293 cell by method of Ca3(PO4)2 using three plasmids. Thus obtaining the adenoassociated virus vector regulated by hypoxic-responsive element was possibly used for gene therapy in ischemia angiocardiopathy and cerebrovascular disease. Artificially synthesize the 36 bp nucleotide sequences of four connection in series HIF-binding sites A/GCGTG(4×HBS)and a 35 bp nucleotide sequences spacing inserted into anterior extremity of CMV promoter TATA Box, then amplified by PCR. The cDNA fragment was confirmed to be right by DNA sequencing. Molecular biology routine method was used to construct a AAV vector regulated by minimal hypoxic-responsive element after the normal CMV promoter in AAV vector was replaced by the CMV promoter included minimal hypoxic-responsive element. Then, NT4-6His-PR39 fusogenic peptide was inserted into MCS of the plasmid, the recombinant AAV vector was obtained by three plasmid co-transfection in 293 cells, in which we can also investigate the expression of 6×His using immunochemistry in hypoxia environment. Artificial HRE was inserted into anterior extremity of CMV promoter and there was a correct spacing between the HRE and the TATA-box. The DNA sequencing and restriction enzyme digestion results indicated that the AAV regulated by hypoxic-responsive element was successfully constructed. Compared to the control group, the expressions of 6×His was significantly increased in the experimental groups in hypoxia environment, which confirmed that the AAV effectually regulated by the minimal HRE was inserted into anterior extremity of CMV promoter. The HRE is inserted into anterior extremity of CMV promoter to lack incision enzyme recognition site by PCR. And eukaryotic expression vector regulated by hypoxic-responsive is constructed. The AAV effectually regulated by the minimal HRE inserted into anterior extremity of CMV promoter. The vector is successfully constructed and it has important theoretical and practical value in the synteresis and therapy of ischemia angiocardiopathy and cerebrovascular disease.

  1. A ribosomal orphon sequence from Xenopus laevis flanked by novel low copy number repetitive elements.

    PubMed

    Guimond, A; Moss, T

    1999-02-01

    We have used a differential cloning approach to isolate ribosomal/non-ribosomal frontier sequences from Xenopus laevis. A ribosomal intergenic spacer sequence (IGS) was cloned and shown not to be physically linked with the ribosomal locus. This ribosomal orphon contained the IGS sequences found immediately downstream of the 28S gene and included an array of enhancer repetitions and a non-functional spacer promoter. The orphon sequence was flanked by a member of the novel 'Frt' low copy repetitive element family. Three individual Frt repeats were sequenced and all members of this family were shown to lie clustered at two chromosomal sites, one of which contained the ribosomal orphon. One of the Frt elements contained an insertion of 297 bp that showed extensive homology to sequences within at least three other Xenopus genes. Each homology region was flanked by members of the T2 family of short interspersed repetitive elements, (SINEs), and by its target insertion sequence, suggesting multiple translocation events. The data are discussed in terms of the evolution of the ribosomal gene locus.

  2. TEs or not TEs? That is the evolutionary question.

    PubMed

    Vaknin, Keren; Goren, Amir; Ast, Gil

    2009-10-23

    Transposable elements (TEs) have contributed a wide range of functional sequences to their host genomes. A recent paper in BMC Molecular Biology discusses the creation of new transcripts by transposable element insertion upstream of retrocopies and the involvement of such insertions in tissue-specific post-transcriptional regulation.

  3. Cloning of a CACTA transposon-like insertion in intron I of tomato invertase Lin5 gene and identification of transposase-like sequences of Solanaceae species.

    PubMed

    Proels, Reinhard K; Roitsch, Thomas

    2006-03-01

    Very few CACTA transposon-like sequences have been described in Solanaceae species. Sequence information has been restricted to partial transposase (TPase)-like fragments, and no target gene of CACTA-like transposon insertion has been described in tomato to date. In this manuscript, we report on a CACTA transposon-like insertion in intron I of tomato (Lycopersicon esculentum) invertase gene Lin5 and TPase-like sequences of several Solanaceae species. Consensus primers deduced from the TPase region of the tomato CACTA transposon-like element allowed the amplification of similar sequences from various Solanaceae species of different subfamilies including Solaneae (Solanum tuberosum), Cestreae (Nicotiana tabacum) and Datureae (Datura stramonium). This demonstrates the ubiquitous presence of CACTA-like elements in Solanaceae genomes. The obtained partial sequences are highly conserved, and allow further detection and detailed analysis of CACTA-like transposons throughout Solanaceae species. CACTA-like transposon sequences make possible the evaluation of their use for genome analysis, functional studies of genes and the evolutionary relationships between plant species.

  4. RNA from the 5' end of the R2 retrotransposon controls R2 protein binding to and cleavage of its DNA target site.

    PubMed

    Christensen, Shawn M; Ye, Junqiang; Eickbush, Thomas H

    2006-11-21

    Non-LTR retrotransposons insert into eukaryotic genomes by target-primed reverse transcription (TPRT), a process in which cleaved DNA targets are used to prime reverse transcription of the element's RNA transcript. Many of the steps in the integration pathway of these elements can be characterized in vitro for the R2 element because of the rigid sequence specificity of R2 for both its DNA target and its RNA template. R2 retrotransposition involves identical subunits of the R2 protein bound to different DNA sequences upstream and downstream of the insertion site. The key determinant regulating which DNA-binding conformation the protein adopts was found to be a 320-nt RNA sequence from near the 5' end of the R2 element. In the absence of this 5' RNA the R2 protein binds DNA sequences upstream of the insertion site, cleaves the first DNA strand, and conducts TPRT when RNA containing the 3' untranslated region of the R2 transcript is present. In the presence of the 320-nt 5' RNA, the R2 protein binds DNA sequences downstream of the insertion site. Cleavage of the second DNA strand by the downstream subunit does not appear to occur until after the 5' RNA is removed from this subunit. We postulate that the removal of the 5' RNA normally occurs during reverse transcription, and thus provides a critical temporal link to first- and second-strand DNA cleavage in the R2 retrotransposition reaction.

  5. Isolation of an insertion sequence (IS1051) from Xanthomonas campestris pv. dieffenbachiae with potential use for strain identification and characterization.

    PubMed Central

    Berthier, Y; Thierry, D; Lemattre, M; Guesdon, J L

    1994-01-01

    A new insertion sequence was isolated from Xanthomonas campestris pv. dieffenbachiae. Sequence analysis showed that this element is 1,158 bp long and has 15-bp inverted repeat ends containing two mismatches. Comparison of this sequence with sequences in data bases revealed significant homology with Escherichia coli IS5. IS1051, which detected multiple restriction fragment length polymorphisms, was used as a probe to characterize strains from the pathovar dieffenbachiae. Images PMID:7906933

  6. Vertical Transmission of the Retrotransposable Elements R1 and R2 during the Evolution of the Drosophila Melanogaster Species Subgroup

    PubMed Central

    Eickbush, D. G.; Eickbush, T. H.

    1995-01-01

    R1 and R2 are non-long-terminal repeat retrotransposable elements that insert into specific sequences of insect 28S ribosomal RNA genes. These elements have been extensively described in Drosophila melanogaster. To determine whether these elements have been horizontally or vertically transmitted, we characterized R1 and R2 elements from the seven other members of the melanogaster species subgroup by genomic blotting and nucleotide sequencing. Each species was found to have homogeneous families of R1 and R2 elements with the exception of erecta and orena, which have no R2 elements. The DNA sequences of multiple R1 and R2 copies from each species indicated nucleotide divergence within each species averaged only 0.48% for R1 and 0.35% for R2, well below the level of divergence among the species. Most copies of R1 and R2 (40 of 47) sequenced from the seven species were potentially functional, as indicated by the absence of premature termination codons or translational frameshifts that would destroy the open reading frame of the element. The sequence relationships of both the R1 and R2 elements from the various members of the melanogaster subgroup closely followed that of the species phylogeny, suggesting that R1 and R2 have been stably maintained by vertical transmission since the origin of this species subgroup 17-20 million years ago. The remarkable stability of R1 and R2, compared to what has been suggested for transposable elements that insert at multiple locations in these same species, may be due to their unique specificity for sites in the rRNA gene locus. Under low copy number conditions, when it is essential for any mobile element to transpose, the insertion specificities of R1 and R2 ensure uniform developmentally regulated target sites that can be occupied with little or no detrimental effect on the host. PMID:7713424

  7. Characterisation of IS153, an IS3-family insertion sequence isolated from Lactobacillus sanfranciscensis and its use for strain differentiation.

    PubMed

    Ehrmann, M A; Vogel, R E

    2001-11-01

    An insertion sequence has been identified in the genome of Lactobacillus sanfranciscensis DSM 20451T as segment of 1351 nucleotides containing 37-bp imperfect terminal inverted repeats. The sequence of this element encodes two out of phase, overlapping open reading frames, orfA and orfB, from which three putative proteins are produced. OrfAB is a transframe protein produced by -1 translational frame shifting between orf A and orf B that is presumed to be the transposase. The large orfAB of this element encodes a 342 amino acid protein that displays similarities with transposases encoded by bacterial insertion sequences belonging to the IS3 family. In L. sanfranciscensis type strain DSM 20451T multiple truncated IS elements were identified. Inverse PCR was used to analyze target sites of four of these elements, but except of their highly AT rich character not any sequence specificity was identified so far. Moreover, no flanking direct repeats were identified. Multiple copies of IS153 were detected by hybridization in other strains of L. sanfranciscensis. Resulting hybridization patterns were shown to differentiate between organisms at strain level rather than a probe targeted against the 16S rDNA. With a PCR based approach IS153 or highly similar sequences were detected in L. acidophilus, L. casei, L. malefermentans, L. plantarum, L. hilgardii, L. collinoides L. farciminis L. sakei and L. salivarius, L. reuteri as well as in Enterococcus faecium, Pediococcus acidilactici and P. pentosaceus.

  8. The ATRX cDNA is prone to bacterial IS10 element insertions that alter its structure.

    PubMed

    Valle-García, David; Griffiths, Lyra M; Dyer, Michael A; Bernstein, Emily; Recillas-Targa, Félix

    2014-01-01

    The SWI/SNF-like chromatin-remodeling protein ATRX has emerged as a key factor in the regulation of α-globin gene expression, incorporation of histone variants into the chromatin template and, more recently, as a frequently mutated gene across a wide spectrum of cancers. Therefore, the availability of a functional ATRX cDNA for expression studies is a valuable tool for the scientific community. We have identified two independent transposon insertions of a bacterial IS10 element into exon 8 of ATRX isoform 2 coding sequence in two different plasmids derived from a single source. We demonstrate that these insertion events are common and there is an insertion hotspot within the ATRX cDNA. Such IS10 insertions produce a truncated form of ATRX, which significantly compromises its nuclear localization. In turn, we describe ways to prevent IS10 insertion during propagation and cloning of ATRX-containing vectors, including optimal growth conditions, bacterial strains, and suggested sequencing strategies. Finally, we have generated an insertion-free plasmid that is available to the community for expression studies of ATRX.

  9. P-Element Insertion Alleles of Essential Genes on the Third Chromosome of Drosophila Melanogaster: Correlation of Physical and Cytogenetic Maps in Chromosomal Region 86e-87f

    PubMed Central

    Deak, P.; Omar, M. M.; Saunders, RDC.; Pal, M.; Komonyi, O.; Szidonya, J.; Maroy, P.; Zhang, Y.; Ashburner, M.; Benos, P.; Savakis, C.; Siden-Kiamos, I.; Louis, C.; Bolshakov, V. N.; Kafatos, F. C.; Madueno, E.; Modolell, J.; Glover, D. M.

    1997-01-01

    We have established a collection of 2460 lethal or semi-lethal mutant lines using a procedure thought to insert single P elements into vital genes on the third chromosome of Drosophila melanogaster. More than 1200 randomly selected lines were examined by in situ hybridization and 90% found to contain single insertions at sites that mark 89% of all lettered subdivisions of the Bridges' map. A set of chromosomal deficiencies that collectively uncover ~25% of the euchromatin of chromosome 3 reveal lethal mutations in 468 lines corresponding to 145 complementation groups. We undertook a detailed analysis of the cytogenetic interval 86E-87F and identified 87 P-element-induced mutations falling into 38 complementation groups, 16 of which correspond to previously known genes. Twenty-one of these 38 complementation groups have at least one allele that has a P-element insertion at a position consistent with the cytogenetics of the locus. We have rescued P elements and flanking chromosomal sequences from the 86E-87F region in 35 lines with either lethal or genetically silent P insertions, and used these as probes to identify cosmids and P1 clones from the Drosophila genome projects. This has tied together the physical and genetic maps and has linked 44 previously identified cosmid contigs into seven ``supercontigs'' that span the interval. STS data for sequences flanking one side of the P-element insertions in 49 lines has identified insertions in the αγ element at 87C, two known transposable elements, and the open reading frames of seven putative single copy genes. These correspond to five known genes in this interval, and two genes identified by the homology of their predicted products to known proteins from other organisms. PMID:9409831

  10. Enhanced production of recombinant proteins with Corynebacterium glutamicum by deletion of insertion sequences (IS elements).

    PubMed

    Choi, Jae Woong; Yim, Sung Sun; Kim, Min Jeong; Jeong, Ki Jun

    2015-12-29

    In most bacteria, various jumping genetic elements including insertion sequences elements (IS elements) cause a variety of genetic rearrangements resulting in harmful effects such as genome and recombinant plasmid instability. The genetic stability of a plasmid in a host is critical for high-level production of recombinant proteins, and in this regard, the development of an IS element-free strain could be a useful strategy for the enhanced production of recombinant proteins. Corynebacterium glutamicum, which is a workhorse in the industrial-scale production of various biomolecules including recombinant proteins, also has several IS elements, and it is necessary to identify the critical IS elements and to develop IS element deleted strain. From the cultivation of C. glutamicum harboring a plasmid for green fluorescent protein (GFP) gene expression, non-fluorescent clones were isolated by FACS (fluorescent activated cell sorting). All the isolated clones had insertions of IS elements in the GFP coding region, and two major IS elements (ISCg1 and ISCg2 families) were identified. By co-cultivating cells harboring either the isolated IS element-inserted plasmid or intact plasmid, it was clearly confirmed that cells harboring the IS element-inserted plasmids became dominant during the cultivation due to their growth advantage over cells containing intact plasmids, which can cause a significant reduction in recombinant protein production during cultivation. To minimize the harmful effects of IS elements on the expression of heterologous genes in C. glutamicum, two IS element free C. glutamicum strains were developed in which each major IS element was deleted, and enhanced productivity in the engineered C. glutamicum strain was successfully demonstrated with three models: GFP, poly(3-hydroxybutyrate) [P(3HB)] and γ-aminobutyrate (GABA). Our findings clearly indicate that the hopping of IS elements could be detrimental to the production of recombinant proteins in C. glutamicum, emphasizing the importance of developing IS element free host strains.

  11. Activity of genes with functions in human Williams-Beuren Syndrome are impacted by mobile element insertions in the gray wolf genome.

    PubMed

    vonHoldt, Bridgett M; Ji, Sarah S; Aardema, Matthew L; Stahler, Daniel; Udell, Monique A R; Sinsheimer, Janet S

    2018-06-01

    In canines, transposon dynamics have been associated with a hyper-social behavioral syndrome, although the functional mechanism has yet to be described. We investigate the epigenetic and transcriptional consequences of these behavior-associated mobile element insertions in dogs and Yellowstone wolves. We posit that the transposons themselves may not be the causative feature; rather, their transcriptional regulation may exert the functional impact. We survey four outlier transposons associated with hyper-sociability, with the expectation that they are targeted for epigenetic silencing. We predict hyper-methylation of mobile element insertions (MEIs), suggestive that the epigenetic silencing of and not the MEIs themselves may be driving dysregulation of nearby genes. We found that transposon-derived sequences are significantly hyper-methylated, regardless of their copy number or species. Further, we have assessed transcriptome sequence data and found evidence that mobile element insertions impact the expression levels of six genes (WBSCR17, LIMK1, GTF2I, WBSCR27, BAZ1B, and BCL7B), all of which have known roles in human Williams-Beuren syndrome due to changes in copy number, typically hemizygosity. Although further evidence is needed, our results suggest that a few insertions alter local expression at multiple genes, likely through a cis-regulatory mechanism that excludes proximal methylation.

  12. Miniature Transposable Sequences Are Frequently Mobilized in the Bacterial Plant Pathogen Pseudomonas syringae pv. phaseolicola

    PubMed Central

    Bardaji, Leire; Añorga, Maite; Jackson, Robert W.; Martínez-Bilbao, Alejandro; Yanguas-Casás, Natalia; Murillo, Jesús

    2011-01-01

    Mobile genetic elements are widespread in Pseudomonas syringae, and often associate with virulence genes. Genome reannotation of the model bean pathogen P. syringae pv. phaseolicola 1448A identified seventeen types of insertion sequences and two miniature inverted-repeat transposable elements (MITEs) with a biased distribution, representing 2.8% of the chromosome, 25.8% of the 132-kb virulence plasmid and 2.7% of the 52-kb plasmid. Employing an entrapment vector containing sacB, we estimated that transposition frequency oscillated between 2.6×10−5 and 1.1×10−6, depending on the clone, although it was stable for each clone after consecutive transfers in culture media. Transposition frequency was similar for bacteria grown in rich or minimal media, and from cells recovered from compatible and incompatible plant hosts, indicating that growth conditions do not influence transposition in strain 1448A. Most of the entrapped insertions contained a full-length IS801 element, with the remaining insertions corresponding to sequences smaller than any transposable element identified in strain 1448A, and collectively identified as miniature sequences. From these, fragments of 229, 360 and 679-nt of the right end of IS801 ended in a consensus tetranucleotide and likely resulted from one-ended transposition of IS801. An average 0.7% of the insertions analyzed consisted of IS801 carrying a fragment of variable size from gene PSPPH_0008/PSPPH_0017, showing that IS801 can mobilize DNA in vivo. Retrospective analysis of complete plasmids and genomes of P. syringae suggests, however, that most fragments of IS801 are likely the result of reorganizations rather than one-ended transpositions, and that this element might preferentially contribute to genome flexibility by generating homologous regions of recombination. A further miniature sequence previously found to affect host range specificity and virulence, designated MITEPsy1 (100-nt), represented an average 2.4% of the total number of insertions entrapped in sacB, demonstrating for the first time the mobilization of a MITE in bacteria. PMID:22016774

  13. Miniature transposable sequences are frequently mobilized in the bacterial plant pathogen Pseudomonas syringae pv. phaseolicola.

    PubMed

    Bardaji, Leire; Añorga, Maite; Jackson, Robert W; Martínez-Bilbao, Alejandro; Yanguas-Casás, Natalia; Murillo, Jesús

    2011-01-01

    Mobile genetic elements are widespread in Pseudomonas syringae, and often associate with virulence genes. Genome reannotation of the model bean pathogen P. syringae pv. phaseolicola 1448A identified seventeen types of insertion sequences and two miniature inverted-repeat transposable elements (MITEs) with a biased distribution, representing 2.8% of the chromosome, 25.8% of the 132-kb virulence plasmid and 2.7% of the 52-kb plasmid. Employing an entrapment vector containing sacB, we estimated that transposition frequency oscillated between 2.6×10(-5) and 1.1×10(-6), depending on the clone, although it was stable for each clone after consecutive transfers in culture media. Transposition frequency was similar for bacteria grown in rich or minimal media, and from cells recovered from compatible and incompatible plant hosts, indicating that growth conditions do not influence transposition in strain 1448A. Most of the entrapped insertions contained a full-length IS801 element, with the remaining insertions corresponding to sequences smaller than any transposable element identified in strain 1448A, and collectively identified as miniature sequences. From these, fragments of 229, 360 and 679-nt of the right end of IS801 ended in a consensus tetranucleotide and likely resulted from one-ended transposition of IS801. An average 0.7% of the insertions analyzed consisted of IS801 carrying a fragment of variable size from gene PSPPH_0008/PSPPH_0017, showing that IS801 can mobilize DNA in vivo. Retrospective analysis of complete plasmids and genomes of P. syringae suggests, however, that most fragments of IS801 are likely the result of reorganizations rather than one-ended transpositions, and that this element might preferentially contribute to genome flexibility by generating homologous regions of recombination. A further miniature sequence previously found to affect host range specificity and virulence, designated MITEPsy1 (100-nt), represented an average 2.4% of the total number of insertions entrapped in sacB, demonstrating for the first time the mobilization of a MITE in bacteria.

  14. Read count-based method for high-throughput allelic genotyping of transposable elements and structural variants.

    PubMed

    Kuhn, Alexandre; Ong, Yao Min; Quake, Stephen R; Burkholder, William F

    2015-07-08

    Like other structural variants, transposable element insertions can be highly polymorphic across individuals. Their functional impact, however, remains poorly understood. Current genome-wide approaches for genotyping insertion-site polymorphisms based on targeted or whole-genome sequencing remain very expensive and can lack accuracy, hence new large-scale genotyping methods are needed. We describe a high-throughput method for genotyping transposable element insertions and other types of structural variants that can be assayed by breakpoint PCR. The method relies on next-generation sequencing of multiplex, site-specific PCR amplification products and read count-based genotype calls. We show that this method is flexible, efficient (it does not require rounds of optimization), cost-effective and highly accurate. This method can benefit a wide range of applications from the routine genotyping of animal and plant populations to the functional study of structural variants in humans.

  15. Alu element insertion in PKLR gene as a novel cause of pyruvate kinase deficiency in Middle Eastern patients.

    PubMed

    Lesmana, Harry; Dyer, Lisa; Li, Xia; Denton, James; Griffiths, Jenna; Chonat, Satheesh; Seu, Katie G; Heeney, Matthew M; Zhang, Kejian; Hopkin, Robert J; Kalfa, Theodosia A

    2018-03-01

    Pyruvate kinase deficiency (PKD) is the most frequent red blood cell enzyme abnormality of the glycolytic pathway and the most common cause of hereditary nonspherocytic hemolytic anemia. Over 250 PKLR-gene mutations have been described, including missense/nonsense, splicing and regulatory mutations, small insertions, small and gross deletions, causing PKD and hemolytic anemia of variable severity. Alu retrotransposons are the most abundant mobile DNA sequences in the human genome, contributing to almost 11% of its mass. Alu insertions have been associated with a number of human diseases either by disrupting a coding region or a splice signal. Here, we report on two unrelated Middle Eastern patients, both born from consanguineous parents, with transfusion-dependent hemolytic anemia, where sequence analysis revealed a homozygous insertion of AluYb9 within exon 6 of the PKLR gene, causing precipitous decrease of PKLR RNA levels. This Alu element insertion consists a previously unrecognized mechanism underlying pathogenesis of PKD. © 2017 Wiley Periodicals, Inc.

  16. The 3'-end region of the human PDGFR-β core promoter nuclease hypersensitive element forms a mixture of two unique end-insertion G-quadruplexes.

    PubMed

    Onel, Buket; Carver, Megan; Agrawal, Prashansa; Hurley, Laurence H; Yang, Danzhou

    2018-04-01

    While the most stable G-quadruplex formed in the human PDGFR-β promoter nuclease hypersensitive element (NHE) is the 5'-mid G-quadruplex, the 3'-end sequence that contains a 3'-GGA run forms a less stable G-quadruplex. Recently, the 3'-end G-quadruplex was found to be a transcriptional repressor and can be selectively targeted by a small molecule for PDGFR-β downregulation. We use 1D and 2D high-field NMR, in combination with Dimethylsulfate Footprinting, Circular Dichroism Spectroscopy, and Electrophoretic Mobility Shift Assay. We determine that the PDGFR-β extended 3'-end NHE sequence forms two novel end-insertion intramolecular G-quadruplexes that co-exist in equilibrium under physiological salt conditions. One G-quadruplex has a 3'-non-adjacent flanking guanine inserted into the 3'-external tetrad (3'-insertion-G4), and another has a 5'-non-adjacent flanking guanine inserted into the 5'-external tetrad (5'-insertion-G4). The two guanines in the GGA-run move up or down within the G-quadruplex to accommodate the inserted guanine. Each end-insertion G-quadruplex has a low thermal stability as compared to the 5'-mid G-quadruplex, but the selective stabilization of GSA1129 shifts the equilibrium toward the 3'-end G-quadruplex in the PDGFR-β NHE. An equilibrium mixture of two unique end-insertion intramolecular G-quadruplexes forms in the PDGFR-β NHE 3'-end sequence that contains a GGA-run and non-adjacent guanines in both the 3'- and 5'- flanking segments; the novel end-insertion structures of the 3'-end G-quadruplex are selectively stabilized by GSA1129. We show for the first time that an equilibrium mixture of two unusual end-insertion G-quadruplexes forms in a native promoter sequence and appears to be the molecular recognition for PDGFR-β downregulation. Copyright © 2017 Elsevier B.V. All rights reserved.

  17. Palindromic repetitive DNA elements with coding potential in Methanocaldococcus jannaschii.

    PubMed

    Suyama, Mikita; Lathe, Warren C; Bork, Peer

    2005-10-10

    We have identified 141 novel palindromic repetitive elements in the genome of euryarchaeon Methanocaldococcus jannaschii. The total length of these elements is 14.3kb, which corresponds to 0.9% of the total genomic sequence and 6.3% of all extragenic regions. The elements can be divided into three groups (MJRE1-3) based on the sequence similarity. The low sequence identity within each of the groups suggests rather old origin of these elements in M. jannaschii. Three MJRE2 elements were located within the protein coding regions without disrupting the coding potential of the host genes, indicating that insertion of repeats might be a widespread mechanism to enhance sequence diversity in coding regions.

  18. Evolutionary genomics of miniature inverted-repeat transposable elements (MITEs) in Brassica.

    PubMed

    Nouroz, Faisal; Noreen, Shumaila; Heslop-Harrison, J S

    2015-12-01

    Miniature inverted-repeat transposable elements (MITEs) are truncated derivatives of autonomous DNA transposons, and are dispersed abundantly in most eukaryotic genomes. We aimed to characterize various MITEs families in Brassica in terms of their presence, sequence characteristics and evolutionary activity. Dot plot analyses involving comparison of homoeologous bacterial artificial chromosome (BAC) sequences allowed identification of 15 novel families of mobile MITEs. Of which, 5 were Stowaway-like with TA Target Site Duplications (TSDs), 4 Tourist-like with TAA/TTA TSDs, 5 Mutator-like with 9-10 bp TSDs and 1 novel MITE (BoXMITE1) flanked by 3 bp TSDs. Our data suggested that there are about 30,000 MITE-related sequences in Brassica rapa and B. oleracea genomes. In situ hybridization showed one abundant family was dispersed in the A-genome, while another was located near 45S rDNA sites. PCR analysis using primers flanking sequences of MITE elements detected MITE insertion polymorphisms between and within the three Brassica (AA, BB, CC) genomes, with many insertions being specific to single genomes and others showing evidence of more recent evolutionary insertions. Our BAC sequence comparison strategy enables identification of evolutionarily active MITEs with no prior knowledge of MITE sequences. The details of MITE families reported in Brassica enable their identification, characterization and annotation. Insertion polymorphisms of MITEs and their transposition activity indicated important mechanism of genome evolution and diversification. MITE families derived from known Mariner, Harbinger and Mutator DNA transposons were discovered, as well as some novel structures. The identification of Brassica MITEs will have broad applications in Brassica genomics, breeding, hybridization and phylogeny through their use as DNA markers.

  19. DNA sequence analysis of ARS elements from chromosome III of Saccharomyces cerevisiae: identification of a new conserved sequence.

    PubMed Central

    Palzkill, T G; Oliver, S G; Newlon, C S

    1986-01-01

    Four fragments of Saccharomyces cerevisiae chromosome III DNA which carry ARS elements have been sequenced. Each fragment contains multiple copies of sequences that have at least 10 out of 11 bases of homology to a previously reported 11 bp core consensus sequence. A survey of these new ARS sequences and previously reported sequences revealed the presence of an additional 11 bp conserved element located on the 3' side of the T-rich strand of the core consensus. Subcloning analysis as well as deletion and transposon insertion mutagenesis of ARS fragments support a role for 3' conserved sequence in promoting ARS activity. PMID:3529036

  20. CHARACTERIZATION AND NUCLEOTIDE SEQUENCE DETERMINATION OF A REPEAT ELEMENT ISOLATED FROM A 2,4,5,-T DEGRADING STRAIN OF PSEUDOMONAS CEPACIA

    EPA Science Inventory

    Pseudomonas cepacia strain AC1100, capable of growth on 2,4,5-trichlorophenoxyacetic acid (2,4,5-T), was mutated to the 2,4,5-T− strain PT88 by a ColE1 :: Tn5 chromosomal insertion. Using cloned DNA from the region flanking the insertion, a 1477-bp sequence (designated RS1100) wa...

  1. Retrotransposon Capture Sequencing (RC-Seq): A Targeted, High-Throughput Approach to Resolve Somatic L1 Retrotransposition in Humans.

    PubMed

    Sanchez-Luque, Francisco J; Richardson, Sandra R; Faulkner, Geoffrey J

    2016-01-01

    Mobile genetic elements (MGEs) are of critical importance in genomics and developmental biology. Polymorphic and somatic MGE insertions have the potential to impact the phenotype of an individual, depending on their genomic locations and functional consequences. However, the identification of polymorphic and somatic insertions among the plethora of copies residing in the genome presents a formidable technical challenge. Whole genome sequencing has the potential to address this problem; however, its efficacy depends on the abundance of cells carrying the new insertion. Robust detection of somatic insertions present in only a subset of cells within a given sample can also be prohibitively expensive due to a requirement for high sequencing depth. Here, we describe retrotransposon capture sequencing (RC-seq), a sequence capture approach in which Illumina libraries are enriched for fragments containing the 5' and 3' termini of specific MGEs. RC-seq allows the detection of known polymorphic insertions present in an individual, as well as the identification of rare or private germline insertions not previously described. Furthermore, RC-seq can be used to detect and characterize somatic insertions, providing a valuable tool to elucidate the extent and characteristics of MGE activity in healthy tissues and in various disease states.

  2. The Diversity of Prokaryotic DDE Transposases of the Mutator Superfamily, Insertion Specificity, and Association with Conjugation Machineries

    PubMed Central

    Guérillot, Romain; Siguier, Patricia; Gourbeyre, Edith; Chandler, Michael; Glaser, Philippe

    2014-01-01

    Transposable elements (TEs) are major components of both prokaryotic and eukaryotic genomes and play a significant role in their evolution. In this study, we have identified new prokaryotic DDE transposase families related to the eukaryotic Mutator-like transposases. These genes were retrieved by cascade PSI-Blast using as initial query the transposase of the streptococcal integrative and conjugative element (ICE) TnGBS2. By combining secondary structure predictions and protein sequence alignments, we predicted the DDE catalytic triad and the DNA-binding domain recognizing the terminal inverted repeats. Furthermore, we systematically characterized the organization and the insertion specificity of the TEs relying on these prokaryotic Mutator-like transposases (p-MULT) for their mobility. Strikingly, two distant TE families target their integration upstream σA dependent promoters. This allowed us to identify a transposase sequence signature associated with this unique insertion specificity and to show that the dissymmetry between the two inverted repeats is responsible for the orientation of the insertion. Surprisingly, while DDE transposases are generally associated with small and simple transposons such as insertion sequences (ISs), p-MULT encoding TEs show an unprecedented diversity with several families of IS, transposons, and ICEs ranging in size from 1.1 to 52 kb. PMID:24418649

  3. Molecular characterization and genomic distribution of Isis: a new retrotransposon of Drosophila buzzatii.

    PubMed

    García Guerreiro, M P; Fontdevila, A

    2007-01-01

    A new transposable element, Isis, is identified as a LTR retrotransposon in Drosophila buzzatii. DNA sequence analysis shows that Isis contains three long ORFs similar to gag, pol and env genes of retroviruses. The ORF1 exhibits sequence homology to matrix, capsid and nucleocapsid gag proteins and ORF2 encodes a putative protease (PR), a reverse transcriptase (RT), an Rnase H (RH) and an integrase (IN) region. The analysis of a putative env product, encoded by the env ORF3, shows a degenerated protein containing several stop codons. The molecular study of the putative proteins coded by this new element shows striking similarities to both Ulysses and Osvaldo elements, two LTR retrotransposons, present in D. virilis and D. buzzatii, respectively. Comparisons of the predicted Isis RT to several known retrotransposons show strong phylogenetic relationships to gypsy-like elements, particulary to Ulysses retrotransposon. Studies of Isis chromosomal distribution show a strong hybridization signal in centromeric and pericentromeric regions, and a scattered distribution along all chromosomal arms. The existence of insertional polymorphisms between different strains and high molecular weight bands by Southern blot suggests the existence of full-sized copies that have been active recently. The presence of euchromatic insertion sites coincident between Isis and Osvaldo could indicate preferential insertion sites of Osvaldo element into Isis sequence or vice versa. Moreover, the presence of Isis in different species of the buzzatii complex indicates the ancient origin of this element.

  4. Molecular Population Genetics of the Alcohol Dehydrogenase Gene Region of DROSOPHILA MELANOGASTER

    PubMed Central

    Aquadro, Charles F.; Desse, Susan F.; Bland, Molly M.; Langley, Charles H.; Laurie-Ahlberg, Cathy C.

    1986-01-01

    Variation in the DNA restriction map of a 13-kb region of chromosome II including the alcohol dehydrogenase structural gene (Adh) was examined in Drosophila melanogaster from natural populations. Detailed analysis of 48 D. melanogaster lines representing four eastern United States populations revealed extensive DNA sequence variation due to base substitutions, insertions and deletions. Cloning of this region from several lines allowed characterization of length variation as due to unique sequence insertions or deletions [nine sizes; 21–200 base pairs (bp)] or transposable element insertions (several sizes, 340 bp to 10.2 kb, representing four different elements). Despite this extensive variation in sequences flanking the Adh gene, only one length polymorphism is clearly associated with altered Adh expression (a copia element approximately 250 bp 5' to the distal transcript start site). Nonetheless, the frequency spectra of transposable elements within and between Drosophila species suggests they are slightly deleterious. Strong nonrandom associations are observed among Adh region sequence variants, ADH allozyme (Fast vs. Slow), ADH enzyme activity and the chromosome inversion ln(2L) t. Phylogenetic analysis of restriction map haplotypes suggest that the major twofold component of ADH activity variation (high vs. low, typical of Fast and Slow allozymes, respectively) is due to sequence variation tightly linked to and possibly distinct from that underlying the allozyme difference. The patterns of nucleotide and haplotype variation for Fast and Slow allozyme lines are consistent with the recent increase in frequency and spread of the Fast haplotype associated with high ADH activity. These data emphasize the important role of evolutionary history and strong nonrandom associations among tightly linked sequence variation as determinants of the patterns of variation observed in natural populations. PMID:3026893

  5. Functional impact of the human mobilome.

    PubMed

    Babatz, Timothy D; Burns, Kathleen H

    2013-06-01

    The human genome is replete with interspersed repetitive sequences derived from the propagation of mobile DNA elements. Three families of human retrotransposons remain active today: LINE1, Alu, and SVA elements. Since 1988, de novo insertions at previously recognized disease loci have been shown to generate highly penetrant alleles in Mendelian disorders. Only recently has the extent of germline-transmitted retrotransposon insertion polymorphism (RIP) in human populations been fully realized. Also exciting are recent studies of somatic retrotransposition in human tissues and reports of tumor-specific insertions, suggesting roles in tissue heterogeneity and tumorigenesis. Here we discuss mobile elements in human disease with an emphasis on exciting developments from the last several years. Copyright © 2013 Elsevier Ltd. All rights reserved.

  6. Ty1-copia elements reveal diverse insertion sites linked to polymorphisms among flax (Linum usitatissimum L.) accessions.

    PubMed

    Galindo-González, Leonardo; Mhiri, Corinne; Grandbastien, Marie-Angèle; Deyholos, Michael K

    2016-12-07

    Initial characterization of the flax genome showed that Ty1-copia retrotransposons are abundant, with several members being recently inserted, and in close association with genes. Recent insertions indicate a potential for ongoing transpositional activity that can create genomic diversity among accessions, cultivars or varieties. The polymorphisms generated constitute a good source of molecular markers that may be associated with phenotype if the insertions alter gene activity. Flax, where accessions are bred mainly for seed nutritional properties or for fibers, constitutes a good model for studying the relationship of transpositional activity with diversification and breeding. In this study, we estimated copy number and used a type of transposon display known as Sequence-Specific Amplification Polymorphisms (SSAPs), to characterize six families of Ty1-copia elements across 14 flax accessions. Polymorphic insertion sites were sequenced to find insertions that could potentially alter gene expression, and a preliminary test was performed with selected genes bearing transposable element (TE) insertions. Quantification of six families of Ty1-copia elements indicated different abundances among TE families and between flax accessions, which suggested diverse transpositional histories. SSAPs showed a high level of polymorphism in most of the evaluated retrotransposon families, with a trend towards higher levels of polymorphism in low-copy number families. Ty1-copia insertion polymorphisms among cultivars allowed a general distinction between oil and fiber types, and between spring and winter types, demonstrating their utility in diversity studies. Characterization of polymorphic insertions revealed an overwhelming association with genes, with insertions disrupting exons, introns or within 1 kb of coding regions. A preliminary test on the potential transcriptional disruption by TEs of four selected genes evaluated in three different tissues, showed one case of significant impact of the insertion on gene expression. We demonstrated that specific Ty1-copia families have been active since breeding commenced in flax. The retrotransposon-derived polymorphism can be used to separate flax types, and the close association of many insertions with genes defines a good source of potential mutations that could be associated with phenotypic changes, resulting in diversification processes.

  7. ISEScan: automated identification of insertion sequence elements in prokaryotic genomes.

    PubMed

    Xie, Zhiqun; Tang, Haixu

    2017-11-01

    The insertion sequence (IS) elements are the smallest but most abundant autonomous transposable elements in prokaryotic genomes, which play a key role in prokaryotic genome organization and evolution. With the fast growing genomic data, it is becoming increasingly critical for biology researchers to be able to accurately and automatically annotate ISs in prokaryotic genome sequences. The available automatic IS annotation systems are either providing only incomplete IS annotation or relying on the availability of existing genome annotations. Here, we present a new IS elements annotation pipeline to address these issues. ISEScan is a highly sensitive software pipeline based on profile hidden Markov models constructed from manually curated IS elements. ISEScan performs better than existing IS annotation systems when tested on prokaryotic genomes with curated annotations of IS elements. Applying it to 2784 prokaryotic genomes, we report the global distribution of IS families across taxonomic clades in Archaea and Bacteria. ISEScan is implemented in Python and released as an open source software at https://github.com/xiezhq/ISEScan. hatang@indiana.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  8. Insertion Sequences

    PubMed Central

    Mahillon, Jacques; Chandler, Michael

    1998-01-01

    Insertion sequences (ISs) constitute an important component of most bacterial genomes. Over 500 individual ISs have been described in the literature to date, and many more are being discovered in the ongoing prokaryotic and eukaryotic genome-sequencing projects. The last 10 years have also seen some striking advances in our understanding of the transposition process itself. Not least of these has been the development of various in vitro transposition systems for both prokaryotic and eukaryotic elements and, for several of these, a detailed understanding of the transposition process at the chemical level. This review presents a general overview of the organization and function of insertion sequences of eubacterial, archaebacterial, and eukaryotic origins with particular emphasis on bacterial elements and on different aspects of the transposition mechanism. It also attempts to provide a framework for classification of these elements by assigning them to various families or groups. A total of 443 members of the collection have been grouped in 17 families based on combinations of the following criteria: (i) similarities in genetic organization (arrangement of open reading frames); (ii) marked identities or similarities in the enzymes which mediate the transposition reactions, the recombinases/transposases (Tpases); (iii) similar features of their ends (terminal IRs); and (iv) fate of the nucleotide sequence of their target sites (generation of a direct target duplication of determined length). A brief description of the mechanism(s) involved in the mobility of individual ISs in each family and of the structure-function relationships of the individual Tpases is included where available. PMID:9729608

  9. The distribution of a phage-related insertion sequence element in the cyanobacterium, Microcystis aeruginosa.

    PubMed

    Kuno, Sotaro; Yoshida, Takashi; Kamikawa, Ryoma; Hosoda, Naohiko; Sako, Yoshihiko

    2010-01-01

    The cyanophage Ma-LMM01, specifically-infecting Microcystis aeruginosa, has an insertion sequence (IS) element that we named IS607-cp showing high nucleotide similarity to a counterpart in the genome of the cyanobacterium Cyanothece sp. We tested 21 strains of M. aeruginosa for the presence of IS607-cp using PCR and detected the element in strains NIES90, NIES112, NIES604, and RM6. Thermal asymmetric interlaced PCR (TAIL-PCR) revealed each of these strains has multiple copies of IS607-cp. Some of the ISs were classified into three types based on their inserted positions; IS607-cp-1 is common in strains NIES90, NIES112 and NIES604, whereas IS607-cp-2 and IS607-cp-3 are specific to strains NIES90 and RM6, respectively. This multiplicity may reflect the replicative transposition of IS607-cp. The sequence of IS607-cp in Ma-LMM01 showed robust affinity to those found in M. aeruginosa and Cyanothece spp. in a phylogenetic tree inferred from counterparts of various bacteria. This suggests the transfer of IS607-cp between the cyanobacterium and its cyanophage. We discuss the potential role of Ma-LMM01-related phages as donors of IS elements that may mediate the transfer of IS607-cp; and thereby partially contribute to the genome plasticity of M. aeruginosa.

  10. Structure of genes and an insertion element in the methane producing archaebacterium Methanobrevibacter smithii.

    PubMed

    Hamilton, P T; Reeve, J N

    1985-01-01

    DNA fragments cloned from the methanogenic archaebacterium Methanobrevibacter smithii which complement mutations in the purE and proC genes of E. coli have been sequenced. Sequence analyses, transposon mutagenesis and expression in E. coli minicells indicate that purE and proC complementations result from the synthesis of M. smithii polypeptides with molecular weights of 36,697 and 27,836 respectively. The encoding genes appear to be located in operons. The M. smithii genome contains 69% A/T basepairs (bp) which is reflected in unusual codon usages and intergenic regions containing approximately 85% A/T bp. An insertion element, designated ISM1, was found within the cloned M. smithii DNA located adjacent to the proC complementing region. ISM1 is 1381 bp in length, has 29 bp terminal inverted repeat sequences and contains one major ORF encoded in 87% of the ISM1 sequence. ISM1 is mobile, present in approximately 10 copies per genome and integration duplicates 8 bp at the site of insertion. The duplicated sequences show homology with sequences within the 29 bp terminal repeat sequence of ISM1. Comparison of our data with sequences from halophilic archaebacteria suggests that 5'GAANTTTCA and 5'TTTTAATATAAA may be consensus promoter sequences for archaebacteria. These sequences closely resemble the consensus sequences which precede Drosophila heat-shock genes (Pelham 1982; Davidson et al. 1983). Methanogens appear to employ the eubacterial system of mRNA: 16SrRNA hybridization to ensure initiation of translation; the consensus ribosome binding sequence is 5'AGGTGA.

  11. Turnover of R1 (Type I) and R2 (Type Ii) Retrotransposable Elements in the Ribosomal DNA of Drosophila Melanogaster

    PubMed Central

    Jakubczak, J. L.; Zenni, M. K.; Woodruff, R. C.; Eickbush, T. H.

    1992-01-01

    R1 and R2 are distantly related non-long terminal repeat retrotransposable elements each of which inserts into a specific site in the 28S rRNA genes of most insects. We have analyzed aspects of R1 and R2 abundance and sequence variation in 27 geographical isolates of Drosophila melanogaster. The fraction of 28S rRNA genes containing these elements varied greatly between strains, 17-67% for R1 elements and 2-28% for R2 elements. The total percentage of the rDNA repeats inserted ranged from 32 to 77%. The fraction of the rDNA repeats that contained both of these elements suggested that R1 and R2 exhibit neither an inhibition of nor preference for insertion into a 28S gene already containing the other type of element. Based on the conservation of restriction sites in the elements of all strains, and sequence analysis of individual elements from three strains, nucleotide divergence is very low for R1 and R2 elements within or between strains (<0.6%). This sequence uniformity is the expected result of the forces of concerted evolution (unequal crossovers and gene conversion) which act on the rRNA genes themselves. Evidence for the role of retrotransposition in the turnover of R1 and R2 was obtained by using naturally occurring 5' length polymorphisms of the elements as markers for independent transposition events. The pattern of these different length 5' truncations of R1 and R2 was found to be diverse and unique to most strains analyzed. Because recombination can only, with time, amplify or eliminate those length variants already present, the diversity found in each strain suggests that retrotransposition has played a critical role in maintaining these elements in the rDNA repeats of D. melanogaster. PMID:1317313

  12. Characterization of IS1515, a Functional Insertion Sequence in Streptococcus pneumoniae

    PubMed Central

    Muñoz, Rosario; López, Rubens; García, Ernesto

    1998-01-01

    We describe the characterization of a new insertion sequence, IS1515, identified in the genome of Streptococcus pneumoniae I41R, an unencapsulated mutant isolated many years ago (R. Austrian, H. P. Bernheimer, E. E. B. Smith, and G. T. Mills, J. Exp. Med. 110:585–602, 1959). A copy of this element located in the cap1EI41R gene was sequenced. The 871-bp-long IS1515 element possesses 12-bp perfect inverted repeats and generates a 3-bp target duplication upon insertion. The IS encodes a protein of 271 amino acid residues similar to the putative transposases of other insertion sequences, namely IS1381 from S. pneumoniae, ISL2 from Lactobacillus helveticus, IS702 from the cyanobacterium Calothrix sp. strain PCC 7601, and IS112 from Streptomyces albus G. IS1515 appears to be present in the genome of most type 1 pneumococci in a maximum of 13 copies, although it has also been found in the chromosome of pneumococcal isolates belonging to other serotypes. We have found that the unencapsulated phenotype of strain I41R is the result of both the presence of an IS1515 copy and a frameshift mutation in the cap1EI41R gene. Precise excision of the IS was observed in the type 1 encapsulated transformants isolated in experiments designed to repair the frameshift. These results reveal that IS1515 behaves quite differently from other previously described pneumococcal insertion sequences. Several copies of IS1515 were also able to excise and move to another locations in the chromosome of S. pneumoniae. To our knowledge, this is the first report of a functional IS in pneumococcus. PMID:9580131

  13. The Transposable Element Mariner Mediates Germline Transformation in Drosophila Melanogaster

    PubMed Central

    Lidholm, D. A.; Lohe, A. R.; Hartl, D. L.

    1993-01-01

    A vector for germline transformation in Drosophila melanogaster was constructed using the transposable element mariner. The vector, denoted pMlwB, contains a mariner element disrupted by an insertion containing the wild-type white gene from D. melanogaster, the β-galactosidase gene from Escherichia coli and sequences that enable plasmid replication and selection in E. coli. The white gene is controlled by the promoter of the D. melanogaster gene for heat-shock protein 70, and the β-galactosidase gene is flanked upstream by the promoter of the transposable element P as well as that of mariner. The MlwB element was introduced into the germline of D. melanogaster by co-injection into embryos with an active mariner element, Mos1, which codes for a functional transposase and serves as a helper. Two independent germline insertions were isolated and characterized. The results show that the MlwB element inserted into the genome in a mariner-dependent manner with the termini of the inverted repeats inserted at a TA dinucleotide. Both insertions exhibit an unexpected degree of germline and somatic stability, even in the presence of an active mariner element in the genetic background. These results demonstrate that the mariner transposable element, which is small (1286 bp) and relatively homogeneous in size among different copies, is nevertheless capable of promoting the insertion of the large (13.2 kb) MlwB element. Because of the widespread phylogenetic distribution of mariner among insects, these results suggest that mariner might provide a wide hostrange transformation vector for insects. PMID:8394264

  14. LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences.

    PubMed

    Disdero, Eric; Filée, Jonathan

    2017-01-01

    Population genomic analysis of transposable elements has greatly benefited from recent advances of sequencing technologies. However, the short size of the reads and the propensity of transposable elements to nest in highly repeated regions of genomes limits the efficiency of bioinformatic tools when Illumina or 454 technologies are used. Fortunately, long read sequencing technologies generating read length that may span the entire length of full transposons are now available. However, existing TE population genomic softwares were not designed to handle long reads and the development of new dedicated tools is needed. LoRTE is the first tool able to use PacBio long read sequences to identify transposon deletions and insertions between a reference genome and genomes of different strains or populations. Tested against simulated and genuine Drosophila melanogaster PacBio datasets, LoRTE appears to be a reliable and broadly applicable tool to study the dynamic and evolutionary impact of transposable elements using low coverage, long read sequences. LoRTE is an efficient and accurate tool to identify structural genomic variants caused by TE insertion or deletion. LoRTE is available for download at http://www.egce.cnrs-gif.fr/?p=6422.

  15. Alu repeat discovery and characterization within human genomes

    PubMed Central

    Hormozdiari, Fereydoun; Alkan, Can; Ventura, Mario; Hajirasouliha, Iman; Malig, Maika; Hach, Faraz; Yorukoglu, Deniz; Dao, Phuong; Bakhshi, Marzieh; Sahinalp, S. Cenk; Eichler, Evan E.

    2011-01-01

    Human genomes are now being rapidly sequenced, but not all forms of genetic variation are routinely characterized. In this study, we focus on Alu retrotransposition events and seek to characterize differences in the pattern of mobile insertion between individuals based on the analysis of eight human genomes sequenced using next-generation sequencing. Applying a rapid read-pair analysis algorithm, we discover 4342 Alu insertions not found in the human reference genome and show that 98% of a selected subset (63/64) experimentally validate. Of these new insertions, 89% correspond to AluY elements, suggesting that they arose by retrotransposition. Eighty percent of the Alu insertions have not been previously reported and more novel events were detected in Africans when compared with non-African samples (76% vs. 69%). Using these data, we develop an experimental and computational screen to identify ancestry informative Alu retrotransposition events among different human populations. PMID:21131385

  16. Comparison of Ultra-Conserved Elements in Drosophilids and Vertebrates

    PubMed Central

    Makunin, Igor V.; Shloma, Viktor V.; Stephen, Stuart J.; Pheasant, Michael; Belyakin, Stepan N.

    2013-01-01

    Metazoan genomes contain many ultra-conserved elements (UCEs), long sequences identical between distant species. In this study we identified UCEs in drosophilid and vertebrate species with a similar level of phylogenetic divergence measured at protein-coding regions, and demonstrated that both the length and number of UCEs are larger in vertebrates. The proportion of non-exonic UCEs declines in distant drosophilids whilst an opposite trend was observed in vertebrates. We generated a set of 2,126 Sophophora UCEs by merging elements identified in several drosophila species and compared these to the eutherian UCEs identified in placental mammals. In contrast to vertebrates, the Sophophora UCEs are depleted around transcription start sites. Analysis of 52,954 P-element, piggyBac and Minos insertions in the D. melanogaster genome revealed depletion of the P-element and piggyBac insertions in and around the Sophophora UCEs. We examined eleven fly strains with transposon insertions into the intergenic UCEs and identified associated phenotypes in five strains. Four insertions behave as recessive lethals, and in one case we observed a suppression of the marker gene within the transgene, presumably by silenced chromatin around the integration site. To confirm the lethality is caused by integration of transposons we performed a phenotype rescue experiment for two stocks and demonstrated that the excision of the transposons from the intergenic UCEs restores viability. Sequencing of DNA after the transposon excision in one fly strain with the restored viability revealed a 47 bp insertion at the original transposon integration site suggesting that the nature of the mutation is important for the appearance of the phenotype. Our results suggest that the UCEs in flies and vertebrates have both common and distinct features, and demonstrate that a significant proportion of intergenic drosophila UCEs are sensitive to disruption. PMID:24349264

  17. Rotifer rDNA-specific R9 retrotransposable elements generate an exceptionally long target site duplication upon insertion.

    PubMed

    Gladyshev, Eugene A; Arkhipova, Irina R

    2009-12-15

    Ribosomal DNA genes in many eukaryotes contain insertions of non-LTR retrotransposable elements belonging to the R2 clade. These elements persist in the host genomes by inserting site-specifically into multicopy target sites, thereby avoiding random disruption of single-copy host genes. Here we describe R9 retrotransposons from the R2 clade in the 28S RNA genes of bdelloid rotifers, small freshwater invertebrate animals best known for their long-term asexuality and for their ability to survive repeated cycles of desiccation and rehydration. While the structural organization of R9 elements is highly similar to that of other members of the R2 clade, they are characterized by two distinct features: site-specific insertion into a previously unreported target sequence within the 28S gene, and an unusually long target site duplication of 126 bp. We discuss the implications of these findings in the context of bdelloid genome organization and the mechanisms of target-primed reverse transcription.

  18. The Plasmodium selenoproteome

    PubMed Central

    Lobanov, Alexey V.; Delgado, Cesar; Rahlfs, Stefan; Novoselov, Sergey V.; Kryukov, Gregory V.; Gromer, Stephan; Hatfield, Dolph L.; Becker, Katja; Gladyshev, Vadim N.

    2006-01-01

    The use of selenocysteine (Sec) as the 21st amino acid in the genetic code has been described in all three major domains of life. However, within eukaryotes, selenoproteins are only known in animals and algae. In this study, we characterized selenoproteomes and Sec insertion systems in protozoan Apicomplexa parasites. We found that among these organisms, Plasmodium and Toxoplasma utilized Sec, whereas Cryptosporidium did not. However, Plasmodium had no homologs of known selenoproteins. By searching computationally for evolutionarily conserved selenocysteine insertion sequence (SECIS) elements, which are RNA structures involved in Sec insertion, we identified four unique Plasmodium falciparum selenoprotein genes. These selenoproteins were incorrectly annotated in PlasmoDB, were conserved in other Plasmodia and had no detectable homologs in other species. We provide evidence that two Plasmodium SECIS elements supported Sec insertion into parasite and endogenous selenoproteins when they were expressed in mammalian cells, demonstrating that the Plasmodium SECIS elements are functional and indicating conservation of Sec insertion between Apicomplexa and animals. Dependence of the plasmodial parasites on selenium suggests possible strategies for antimalarial drug development. PMID:16428245

  19. Targeted gene insertion for molecular medicine.

    PubMed

    Voigt, Katrin; Izsvák, Zsuzsanna; Ivics, Zoltán

    2008-11-01

    Genomic insertion of a functional gene together with suitable transcriptional regulatory elements is often required for long-term therapeutical benefit in gene therapy for several genetic diseases. A variety of integrating vectors for gene delivery exist. Some of them exhibit random genomic integration, whereas others have integration preferences based on attributes of the targeted site, such as primary DNA sequence and physical structure of the DNA, or through tethering to certain DNA sequences by host-encoded cellular factors. Uncontrolled genomic insertion bears the risk of the transgene being silenced due to chromosomal position effects, and can lead to genotoxic effects due to mutagenesis of cellular genes. None of the vector systems currently used in either preclinical experiments or clinical trials displays sufficient preferences for target DNA sequences that would ensure appropriate and reliable expression of the transgene and simultaneously prevent hazardous side effects. We review in this paper the advantages and disadvantages of both viral and non-viral gene delivery technologies, discuss mechanisms of target site selection of integrating genetic elements (viruses and transposons), and suggest distinct molecular strategies for targeted gene delivery.

  20. Molecular and bioinformatic analysis of the FB-NOF transposable element.

    PubMed

    Badal, Martí; Portela, Anna; Xamena, Noel; Cabré, Oriol

    2006-04-12

    The Drosophila melanogaster transposable element FB-NOF is known to play a role in genome plasticity through the generation of all sort of genomic rearrangements. Moreover, several insertional mutants due to FB mobilizations have been reported. Its structure and sequence, however, have been poorly studied mainly as a consequence of the long, complex and repetitive sequence of FB inverted repeats. This repetitive region is composed of several 154 bp blocks, each with five almost identical repeats. In this paper, we report the sequencing process of 2 kb long FB inverted repeats of a complete FB-NOF element, with high precision and reliability. This achievement has been possible using a new map of the FB repetitive region, which identifies unambiguously each repeat with new features that can be used as landmarks. With this new vision of the element, a list of FB-NOF in the D. melanogaster genomic clones has been done, improving previous works that used only bioinformatic algorithms. The availability of many FB and FB-NOF sequences allowed an analysis of the FB insertion sequences that showed no sequence specificity, but a preference for A/T rich sequences. The position of NOF into FB is also studied, revealing that it is always located after a second repeat in a random block. With the results of this analysis, we propose a model of transposition in which NOF jumps from FB to FB, using an unidentified transposase enzyme that should specifically recognize the second repeat end of the FB blocks.

  1. IS30-related transposon mediated insertional inactivation of bile salt hydrolase (bsh1) gene of Lactobacillus plantarum strain Lp20.

    PubMed

    Kumar, Rajesh; Grover, Sunita; Kaushik, Jai K; Batish, Virender Kumar

    2014-01-01

    Lactobacillus plantarum is a flexible and versatile microorganism that inhabits a variety of niches, and its genome may express up to four bsh genes to maximize its survival in the mammalian gut. However, the ecological significance of multiple bsh genes in L. plantarum is still not clearly understood. Hence, this study demonstrated the disruption of bile salt hydrolase (bsh1) gene due to the insertion of a transposable element in L. plantarum Lp20 - a wild strain of human fecal origin. Surprisingly, L. plantarum strain Lp20 produced a ∼2.0 kb bsh1 amplicon against the normal size (∼1.0 kb) bsh1 amplicon of Bsh(+)L. plantarum Lp21. Strain Lp20 exhibited minimal Bsh activity in spite of having intact bsh2, bsh3 and bsh4 genes in its genome and hence had a Bsh(-) phenotype. Cloning and sequence characterization of Lp20 bsh1 gene predicted four individual open reading frames (ORFs) within this region. BLAST analysis of ORF1 and ORF2 revealed significant sequence similarity to the L. plantarum bsh1 gene while ORF3 and ORF4 showed high sequence homology to IS30-family transposases. Since, IS30-related transposon element was inserted within Lp20 bsh1 gene in reverse orientation (3'-5'), it introduced several stop codons and disrupted the protein reading frames of both Bsh1 and transposase. Inverted terminal repeats (GGCAGATTG) of transposon, mediated its insertion at 255-263 nt and 1301-1309 nt positions of Lp20 bsh1 gene. In conclusion, insertion of IS30 related-transposon within the bsh1 gene sequence of L. plantarum strain Lp20 demolished the integrity and functionality of Bsh1 enzyme. Additionally, this transposon DNA sequence remains active among various Lactobacillus spp. and hence harbors the potential to be explored in the development of efficient insertion mutagenesis system. Copyright © 2013 Elsevier GmbH. All rights reserved.

  2. Construction and sequence sampling of deep-coverage, large-insert BAC libraries for three model lepidopteran species

    PubMed Central

    Wu, Chengcang; Proestou, Dina; Carter, Dorothy; Nicholson, Erica; Santos, Filippe; Zhao, Shaying; Zhang, Hong-Bin; Goldsmith, Marian R

    2009-01-01

    Background Manduca sexta, Heliothis virescens, and Heliconius erato represent three widely-used insect model species for genomic and fundamental studies in Lepidoptera. Large-insert BAC libraries of these insects are critical resources for many molecular studies, including physical mapping and genome sequencing, but not available to date. Results We report the construction and characterization of six large-insert BAC libraries for the three species and sampling sequence analysis of the genomes. The six BAC libraries were constructed with two restriction enzymes, two libraries for each species, and each has an average clone insert size ranging from 152–175 kb. We estimated that the genome coverage of each library ranged from 6–9 ×, with the two combined libraries of each species being equivalent to 13.0–16.3 × haploid genomes. The genome coverage, quality and utility of the libraries were further confirmed by library screening using 6~8 putative single-copy probes. To provide a first glimpse into these genomes, we sequenced and analyzed the BAC ends of ~200 clones randomly selected from the libraries of each species. The data revealed that the genomes are AT-rich, contain relatively small fractions of repeat elements with a majority belonging to the category of low complexity repeats, and are more abundant in retro-elements than DNA transposons. Among the species, the H. erato genome is somewhat more abundant in repeat elements and simple repeats than those of M. sexta and H. virescens. The BLAST analysis of the BAC end sequences suggested that the evolution of the three genomes is widely varied, with the genome of H. virescens being the most conserved as a typical lepidopteran, whereas both genomes of H. erato and M. sexta appear to have evolved significantly, resulting in a higher level of species- or evolutionary lineage-specific sequences. Conclusion The high-quality and large-insert BAC libraries of the insects, together with the identified BACs containing genes of interest, provide valuable information, resources and tools for comprehensive understanding and studies of the insect genomes and for addressing many fundamental questions in Lepidoptera. The sample of the genomic sequences provides the first insight into the constitution and evolution of the insect genomes. PMID:19558662

  3. The Role(s) of Heparan Sulfate Proteoglycan(s) in the wnt-1 Signaling Pathway

    DTIC Science & Technology

    1998-08-01

    First , the sequence of the cDNA, when compared to the genomic site of insertion of the P-element, revealed that the P-element is inserted 686 bp...stages 8 to 13 (Yoffe et al. 1995). We first examined whether ectopic expression of Wgts effectively restores the naked cuticle as it does in wg and...by Kjell~n and Lindahl, 1991) . HS/heparin N-deacetylase/N-sulfotransferase catalyzes N-deacetylation and N-sulfation that is the first and key step

  4. Insertion sequence typing of Mycobacterium tuberculosis: characterization of a widespread subtype with a single copy of IS6110.

    PubMed

    Fomukong, N G; Tang, T H; al-Maamary, S; Ibrahim, W A; Ramayah, S; Yates, M; Zainuddin, Z F; Dale, J W

    1994-12-01

    DNA fingerprinting with the insertion sequence IS6110 (also known as IS986) has become established as a major tool for investigating the spread of tuberculosis. Most strains of Mycobacterium tuberculosis have multiple copies of IS6110, but a small minority carry a single copy only. We have examined selected strains from Malaysia, Tanzania and Oman, in comparison with M. bovis isolates and BCG strains carrying one or two copies of IS6110. The insertion sequence appears to be present in the same position in all these strains, which suggests that in these organisms the element is defective in transposition and that the loss of transposability may have occurred at an early stage in the evolution of the M. tuberculosis complex.

  5. Efficient transposition of the Tnt1 tobacco retrotransposon in the model legume Medicago truncatula.

    PubMed

    d'Erfurth, Isabelle; Cosson, Viviane; Eschstruth, Alexis; Lucas, Helene; Kondorosi, Adam; Ratet, P

    2003-04-01

    The tobacco element, Tnt1, is one of the few active retrotransposons in plants. Its transposition is activated during protoplast culture in tobacco and tissue culture in the heterologous host Arabidopsis thaliana. Here, we report its transposition in the R108 line of Medicago truncatula during the early steps of the in vitro transformation-regeneration process. Two hundred and twenty-five primary transformants containing Tnt1 were obtained. Among them, 11.2% contained only transposed copies of the element, indicating that Tnt1 transposed very early and efficiently during the in vitro transformation process, possibly even before the T-DNA integration. The average number of insertions per transgenic line was estimated to be about 15. These insertions were stable in the progeny and could be separated by segregation. Inspection of the sequences flanking the insertion sites revealed that Tnt1 had no insertion site specificity and often inserted in genes (one out of three insertions). Thus, our work demonstrates the functioning of an efficient transposable element in leguminous plants. These results indicate that Tnt1 can be used as a powerful tool for insertion mutagenesis in M. truncatula.

  6. Widespread and evolutionary analysis of a MITE family Monkey King in Brassicaceae.

    PubMed

    Dai, Shutao; Hou, Jinna; Long, Yan; Wang, Jing; Li, Cong; Xiao, Qinqin; Jiang, Xiaoxue; Zou, Xiaoxiao; Zou, Jun; Meng, Jinling

    2015-06-19

    Miniature inverted repeat transposable elements (MITEs) are important components of eukaryotic genomes, with hundreds of families and many copies, which may play important roles in gene regulation and genome evolution. However, few studies have investigated the molecular mechanisms involved. In our previous study, a Tourist-like MITE, Monkey King, was identified from the promoter region of a flowering time gene, BnFLC.A10, in Brassica napus. Based on this MITE, the characteristics and potential roles on gene regulation of the MITE family were analyzed in Brassicaceae. The characteristics of the Tourist-like MITE family Monkey King in Brassicaceae, including its distribution, copies and insertion sites in the genomes of major Brassicaceae species were analyzed in this study. Monkey King was actively amplified in Brassica after divergence from Arabidopsis, which was indicated by the prompt increase in copy number and by phylogenetic analysis. The genomic variations caused by Monkey King insertions, both intra- and inter-species in Brassica, were traced by PCR amplification. Genomic sequence analysis showed that most complete Monkey King elements are located in gene-rich regions, less than 3kb from genes, in both the B. rapa and A. thaliana genomes. Sixty-seven Brassica expressed sequence tags carrying Monkey King fragments were also identified from the NCBI database. Bisulfite sequencing identified specific DNA methylation of cytosine residues in the Monkey King sequence. A fragment containing putative TATA-box motifs in the MITE sequence could bind with nuclear protein(s) extracted from leaves of B. napus plants. A Monkey King-related microRNA, bna-miR6031, was identified in the microRNA database. In transgenic A. thaliana, when the Monkey King element was inserted upstream of 35S promoter, the promoter activity was weakened. Monkey King, a Brassicaceae Tourist-like MITE family, has amplified relatively recently and has induced intra- and inter-species genomic variations in Brassica. Monkey King elements are most abundant in the vicinity of genes and may have a substantial effect on genome-wide gene regulation in Brassicaceae. Monkey King insertions potentially regulate gene expression and genome evolution through epigenetic modification and new regulatory motif production.

  7. Novel insertion sequence- and transposon-mediated genetic rearrangements in genomic island SGI1 of Salmonella enterica serovar Kentucky.

    PubMed

    Doublet, Benoît; Praud, Karine; Bertrand, Sophie; Collard, Jean-Marc; Weill, François-Xavier; Cloeckaert, Axel

    2008-10-01

    Salmonella genomic island 1 (SGI1) is an integrative mobilizable element that harbors a multidrug resistance (MDR) gene cluster. Since its identification in epidemic Salmonella enterica serovar Typhimurium DT104 strains, variant SGI1 MDR gene clusters conferring different MDR phenotypes have been identified in several S. enterica serovars and classified as SGI1-A to -O. A study was undertaken to characterize SGI1 from serovar Kentucky strains isolated from travelers returning from Africa. Several strains tested were found to contain the partially characterized variant SGI1-K, recently described in a serovar Kentucky strain isolated in Australia. This variant contained only one cassette array, aac(3)-Id-aadA7, and an adjacent mercury resistance module. Here, the uncharacterized part of SGI1-K was sequenced. Downstream of the mer module similar to that found in Tn21, a mosaic genetic structure was found, comprising (i) part of Tn1721 containing the tetracycline resistance genes tetR and tet(A); (ii) part of Tn5393 containing the streptomycin resistance genes strAB, IS1133, and a truncated tnpR gene; and (iii) a Tn3-like region containing the tnpR gene and the beta-lactamase bla(TEM-1) gene flanked by two IS26 elements in opposite orientations. The rightmost IS26 element was shown to be inserted into the S044 open reading frame of the SGI1 backbone. This variant MDR region was named SGI1-K1 according to the previously described variant SGI1-K. Other SGI1-K MDR regions due to different IS26 locations, inversion, and partial deletions were characterized and named SGI1-K2 to -K5. Two new SGI1 variants named SGI1-P1 and -P2 contained only the Tn3-like region comprising the beta-lactamase bla(TEM-1) gene flanked by the two IS26 elements inserted into the SGI1 backbone. Three other new variants harbored only one IS26 element inserted in place of the MDR region of SGI1 and were named SGI1-Q1 to -Q3. Thus, in serovar Kentucky, the SGI1 MDR region undergoes recombinational and insertional events of transposon and insertion sequences, resulting in a higher diversity of MDR gene clusters than previously reported and consequently a higher diversity of MDR phenotypes.

  8. Alterations in the 5 'untranslated region of the EPSPS gene influence EPSPS overexpression in glyphosate-resistant Eleusine indica.

    PubMed

    Zhang, Chun; Feng, Li; Tian, Xing-Shan

    2018-04-26

    The herbicide glyphosate inhibits the enzyme 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS). Overexpression of the EPSPS gene is one of the molecular mechanisms conferring glyphosate resistance in weeds, but the transcriptional regulation of this gene is poorly understood. The EPSPS gene was found to be significantly up-regulated following glyphosate treatment in a glyphosate- resistant Eleusine indica population from South China. To further investigate the regulation of EPSPS overexpression, the promoter of the EPSPS gene from this E. indica population was cloned and analyzed. Two upstream regulatory sequences, Epro-S (862 bp) and Epro-R (877 bp) of EPSPS were obtained from glyphosate-susceptible (S) and -resistant (R) E. indica plants respectively by HiTAIL-PCR. The Epro-S and Epro-R sequences were 99% homologous, except for the two insertions (3 bp and12 bp) in the R sequence. The 12-base insertion of the Epro-R sequence was located in the 5'-UTR-Py-rich stretch element. The promoter activity tests showed that the 12-base insertion resulted in significant enhancement of the Epro-R promoter activity, whereas the 3-base insertion had little effect on Epro-R promoter activity. Alterations in the 5'-UTR-Py-rich stretch element of EPSPS are responsible for glyphosate induced EPSPS overexpression. Therefore, EPSPS transcriptional regulation confers glyphosate resistance in this E. indica population. This article is protected by copyright. All rights reserved.

  9. Combinatorial events of insertion sequences and ICE in Gram-negative bacteria.

    PubMed

    Toleman, Mark A; Walsh, Timothy R

    2011-09-01

    The emergence of antibiotic and antimicrobial resistance in Gram-negative bacteria is incremental and linked to genetic elements that function in a so-called 'one-ended transposition' manner, including ISEcp1, ISCR elements and Tn3-like transposons. The power of these elements lies in their inability to consistently recognize one of their own terminal sequences, while recognizing more genetically distant surrogate sequences. This has the effect of mobilizing the DNA sequence found adjacent to their initial location. In general, resistance in Gram-negatives is closely linked to a few one-off events. These include the capture of the class 1 integron by a Tn5090-like transposon; the formation of the 3' conserved segment (3'-CS); and the fusion of the ISCR1 element to the 3'-CS. The structures formed by these rare events have been massively amplified and disseminated in Gram-negative bacteria, but hitherto, are rarely found in Gram-positives. Such events dominate current resistance gene acquisition and are instrumental in the construction of large resistance gene islands on chromosomes and plasmids. Similar combinatorial events appear to have occurred between conjugative plasmids and phages constructing hybrid elements called integrative and conjugative elements or conjugative transposons. These elements are beginning to be closely linked to some of the more powerful resistance mechanisms such as the extended spectrum β-lactamases, metallo- and AmpC type β-lactamases. Antibiotic resistance in Gram-negative bacteria is dominated by unusual combinatorial mistakes of Insertion sequences and gene fusions which have been selected and amplified by antibiotic pressure enabling the formation of extended resistance islands. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  10. Ac-immobilized, a stable source of Activator transposase that mediates sporophytic and gametophytic excision of Dissociation elements in maize.

    PubMed

    Conrad, Liza J; Brutnell, Thomas P

    2005-12-01

    We have identified and characterized a novel Activator (Ac) element that is incapable of excision yet contributes to the canonical negative dosage effect of Ac. Cloning and sequence analysis of this immobilized Ac (Ac-im) revealed that it is identical to Ac with the exception of a 10-bp deletion of sequences at the left end of the element. In screens of approximately 6800 seeds, no germinal transpositions of Ac-im were detected. Importantly, Ac-im catalyzes germinal excisions of a Ds element resident at the r1 locus resulting in the recovery of independent transposed Ds insertions in approximately 4.5% of progeny kernels. Many of these transposition events occur during gametophytic development. Furthermore, we demonstrate that Ac-im transactivates multiple Ds insertions in somatic tissues including those in reporter alleles at bronze1, anthocyaninless1, and anthocyaninless2. We propose a model for the generation of Ac-im as an aberrant transposition event that failed to generate an 8-bp target site duplication and resulted in the deletion of Ac end sequences. We also discuss the utility of Ac-im in two-component Ac/Ds gene-tagging programs in maize.

  11. S Elements: A Family of Tc1-like Transposons in the Genome of Drosophila Melanogaster

    PubMed Central

    Merriman, P. J.; Grimes, C. D.; Ambroziak, J.; Hackett, D. A.; Skinner, P.; Simmons, M. J.

    1995-01-01

    The S elements form a diverse family of long-inverted-repeat transposons within the genome of Drosophila melanogaster. These elements vary in size and sequence, the longest consisting of 1736 bp with 234-bp inverted terminal repeats. The longest open reading frame in an intact S element could encode a 345-amino acid polypeptide. This polypeptide is homologous to the transposases of the mariner-Tc1 superfamily of transposable elements. S elements are ubiquitous in D. melanogaster populations and also appear to be present in the genomes of two sibling species; however, they seem to be absent from 17 other Drosophila species that were examined. Within D. melanogaster strains, there are, on average, 37.4 cytologically detectable S elements per diploid genome. These elements are scattered throughout the chromosomes, but several sites in both the euchromatin and β heterochromatin are consistently occupied. The discovery of an S-element-insertion mutation and a reversion of this mutation indicates that S elements are at least occasionally mobile in the D. melanogaster genome. These elements seem to insert at an AT dinucleotide within a short palindrome and apparently duplicate that dinucleotide upon insertion. PMID:8601484

  12. Familial retinoblastoma due to intronic LINE-1 insertion causes aberrant and noncanonical mRNA splicing of the RB1 gene.

    PubMed

    Rodríguez-Martín, Carlos; Cidre, Florencia; Fernández-Teijeiro, Ana; Gómez-Mariano, Gema; de la Vega, Leticia; Ramos, Patricia; Zaballos, Ángel; Monzón, Sara; Alonso, Javier

    2016-05-01

    Retinoblastoma (RB, MIM 180200) is the paradigm of hereditary cancer. Individuals harboring a constitutional mutation in one allele of the RB1 gene have a high predisposition to develop RB. Here, we present the first case of familial RB caused by a de novo insertion of a full-length long interspersed element-1 (LINE-1) into intron 14 of the RB1 gene that caused a highly heterogeneous splicing pattern of RB1 mRNA. LINE-1 insertion was inferred by mRNA studies and full-length sequenced by massive parallel sequencing. Some of the aberrant mRNAs were produced by noncanonical acceptor splice sites, a new finding that up to date has not been described to occur upon LINE-1 retrotransposition. Our results clearly show that RNA-based strategies have the potential to detect disease-causing transposon insertions. It also confirms that the incorporation of new genetic approaches, such as massive parallel sequencing, contributes to characterize at the sequence level these unique and exceptional genetic alterations.

  13. Evolutionary force of AT-rich repeats to trap genomic and episomal DNAs into the rice genome: lessons from endogenous pararetrovirus.

    PubMed

    Liu, Ruifang; Koyanagi, Kanako O; Chen, Sunlu; Kishima, Yuji

    2012-12-01

    In plant genomes, the incorporation of DNA segments is not a common method of artificial gene transfer. Nevertheless, various segments of pararetroviruses have been found in plant genomes in recent decades. The rice genome contains a number of segments of endogenous rice tungro bacilliform virus-like sequences (ERTBVs), many of which are present between AT dinucleotide repeats (ATrs). Comparison of genomic sequences between two closely related rice subspecies, japonica and indica, allowed us to verify the preferential insertion of ERTBVs into ATrs. In addition to ERTBVs, the comparative analyses showed that ATrs occasionally incorporate repeat sequences including transposable elements, and a wide range of other sequences. Besides the known genomic sequences, the insertion sequences also represented DNAs of unclear origins together with ERTBVs, suggesting that ATrs have integrated episomal DNAs that would have been suspended in the nucleus. Such insertion DNAs might be trapped by ATrs in the genome in a host-dependent manner. Conversely, other simple mono- and dinucleotide sequence repeats (SSR) were less frequently involved in insertion events relative to ATrs. Therefore, ATrs could be regarded as hot spots of double-strand breaks that induce non-homologous end joining. The insertions within ATrs occasionally generated new gene-related sequences or involved structural modifications of existing genes. Likewise, in a comparison between Arabidopsis thaliana and Arabidopsis lyrata, the insertions preferred ATrs to other SSRs. Therefore ATrs in plant genomes could be considered as genomic dumping sites that have trapped various DNA molecules and may have exerted a powerful evolutionary force. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.

  14. Analysis of plastid and mitochondrial DNA insertions in the nucleus (NUPTs and NUMTs) of six plant species: size, relative age and chromosomal localization.

    PubMed

    Michalovova, M; Vyskot, B; Kejnovsky, E

    2013-10-01

    We analysed the size, relative age and chromosomal localization of nuclear sequences of plastid and mitochondrial origin (NUPTs-nuclear plastid DNA and NUMTs-nuclear mitochondrial DNA) in six completely sequenced plant species. We found that the largest insertions showed lower divergence from organelle DNA than shorter insertions in all species, indicating their recent origin. The largest NUPT and NUMT insertions were localized in the vicinity of the centromeres in the small genomes of Arabidopsis and rice. They were also present in other chromosomal regions in the large genomes of soybean and maize. Localization of NUPTs and NUMTs correlated positively with distribution of transposable elements (TEs) in Arabidopsis and sorghum, negatively in grapevine and soybean, and did not correlate in rice or maize. We propose a model where new plastid and mitochondrial DNA sequences are inserted close to centromeres and are later fragmented by TE insertions and reshuffled away from the centromere or removed by ectopic recombination. The mode and tempo of TE dynamism determines the turnover of NUPTs and NUMTs resulting in their species-specific chromosomal distributions.

  15. Dead Element Replicating: Degenerate R2 Element Replication and rDNA Genomic Turnover in the Bacillus rossius Stick Insect (Insecta: Phasmida)

    PubMed Central

    Martoni, Francesco; Eickbush, Danna G.; Scavariello, Claudia; Luchetti, Andrea; Mantovani, Barbara

    2015-01-01

    R2 is an extensively investigated non-LTR retrotransposon that specifically inserts into the 28S rRNA gene sequences of a wide range of metazoans, disrupting its functionality. During R2 integration, first strand synthesis can be incomplete so that 5’ end deleted copies are occasionally inserted. While active R2 copies repopulate the locus by retrotransposing, the non-functional truncated elements should frequently be eliminated by molecular drive processes leading to the concerted evolution of the rDNA array(s). Although, multiple R2 lineages have been discovered in the genome of many animals, the rDNA of the stick insect Bacillus rossius exhibits a peculiar situation: it harbors both a canonical, functional R2 element (R2Brfun) as well as a full-length but degenerate element (R2Brdeg). An intensive sequencing survey in the present study reveals that all truncated variants in stick insects are present in multiple copies suggesting they were duplicated by unequal recombination. Sequencing results also demonstrate that all R2Brdeg copies are full-length, i. e. they have no associated 5' end deletions, and functional assays indicate they have lost the active ribozyme necessary for R2 RNA maturation. Although it cannot be completely ruled out, it seems unlikely that the degenerate elements replicate via reverse transcription, exploiting the R2Brfun element enzymatic machinery, but rather via genomic amplification of inserted 28S by unequal recombination. That inactive copies (both R2Brdeg or 5'-truncated elements) are not eliminated in a short term in stick insects contrasts with findings for the Drosophila R2, suggesting a widely different management of rDNA loci and a lower efficiency of the molecular drive while achieving the concerted evolution. PMID:25799008

  16. A major insertion accounts for a significant proportion of mutations underlying human lipoprotein lipase deficiency

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Langlois, S.; Kastelein, J.J.; Hayden, M.R.

    1989-02-01

    Lipoprotein lipase is an important enzyme involved in triacylglycerol metabolism. Primary LPL deficiency is a genetic disorder that is usually manifested by a severe elevation in triacylglycerol levels. The authors have used a recently isolated LPL cDNA clone to study 15 probands from 11 families with this inherited disorder. Surprisingly, 7 of the probands from 4 families, of different ancestries, had a similar insertion in their LPL gene. In contrast to other human genetic disorders, where insertions are rare causes of mutation, this insertion accounts for a significant proportion of the alleles causing LPL deficiency. Detailed restriction mapping of themore » insertion revealed that it was unlikely to be a duplication of neighboring DNA and that it was not similar to the consensus sequence of human L1 repetitive elements. This suggests that there must be other mechanisms of insertional mutagenesis in human genetic disease besides transposition of mobile L1 repetitive elements.« less

  17. Insertion of an SVA element, a nonautonomous retrotransposon, in PMS2 intron 7 as a novel cause of Lynch syndrome.

    PubMed

    van der Klift, Heleen M; Tops, Carli M; Hes, Frederik J; Devilee, Peter; Wijnen, Juul T

    2012-07-01

    Heterozygous germline mutations in the mismatch repair gene PMS2 predispose carriers for Lynch syndrome, an autosomal dominant predisposition to cancer. Here, we present a LINE-1-mediated retrotranspositional insertion in PMS2 as a novel mutation type for Lynch syndrome. This insertion, detected with Southern blot analysis in the genomic DNA of the patient, is characterized as a 2.2 kb long 5' truncated SVA_F element. The insertion is not detectable by current diagnostic testing limited to MLPA and direct Sanger sequencing on genomic DNA. The molecular nature of this insertion could only be resolved in RNA from cultured lymphocytes in which nonsense-mediated RNA decay was inhibited. Our report illustrates the technical problems encountered in the detection of this mutation type. Especially large heterozygous insertions will remain unnoticed because of preferential amplification of the smaller wild-type allele in genomic DNA, and are probably underreported in the mutation spectra of autosomal dominant disorders. © 2012 Wiley Periodicals, Inc.

  18. Transposable elements in cancer.

    PubMed

    Burns, Kathleen H

    2017-07-01

    Transposable elements give rise to interspersed repeats, sequences that comprise most of our genomes. These mobile DNAs have been historically underappreciated - both because they have been presumed to be unimportant, and because their high copy number and variability pose unique technical challenges. Neither impediment now seems steadfast. Interest in the human mobilome has never been greater, and methods enabling its study are maturing at a fast pace. This Review describes the activity of transposable elements in human cancers, particularly long interspersed element-1 (LINE-1). LINE-1 sequences are self-propagating, protein-coding retrotransposons, and their activity results in somatically acquired insertions in cancer genomes. Altered expression of transposable elements and animation of genomic LINE-1 sequences appear to be hallmarks of cancer, and can be responsible for driving mutations in tumorigenesis.

  19. pTC Plasmids from Sulfolobus Species in the Geothermal Area of Tengchong, China: Genomic Conservation and Naturally-Occurring Variations as a Result of Transposition by Mobile Genetic Elements

    PubMed Central

    Xiang, Xiaoyu; Huang, Xiaoxing; Wang, Haina; Huang, Li

    2015-01-01

    Plasmids occur frequently in Archaea. A novel plasmid (denoted pTC1) containing typical conjugation functions has been isolated from Sulfolobus tengchongensis RT8-4, a strain obtained from a hot spring in Tengchong, China, and characterized. The plasmid is a circular double-stranded DNA molecule of 20,417 bp. Among a total of 26 predicted pTC1 ORFs, 23 have homologues in other known Sulfolobus conjugative plasmids (CPs). pTC1 resembles other Sulfolobus CPs in genome architecture, and is most highly conserved in the genomic region encoding conjugation functions. However, attempts to demonstrate experimentally the capacity of the plasmid for conjugational transfer were unsuccessful. A survey revealed that pTC1 and its closely related plasmid variants were widespread in the geothermal area of Tengchong. Variations of the plasmids at the target sites for transposition by an insertion sequence (IS) and a miniature inverted-repeat transposable element (MITE) were readily detected. The IS was efficiently inserted into the pTC1 genome, and the inserted sequence was inactivated and degraded more frequently in an imprecise manner than in a precise manner. These results suggest that the host organism has evolved a strategy to maintain a balance between the insertion and elimination of mobile genetic elements to permit genomic plasticity while inhibiting their fast spreading. PMID:25686154

  20. pTC Plasmids from Sulfolobus Species in the Geothermal Area of Tengchong, China: Genomic Conservation and Naturally-Occurring Variations as a Result of Transposition by Mobile Genetic Elements.

    PubMed

    Xiang, Xiaoyu; Huang, Xiaoxing; Wang, Haina; Huang, Li

    2015-02-12

    Plasmids occur frequently in Archaea. A novel plasmid (denoted pTC1) containing typical conjugation functions has been isolated from Sulfolobus tengchongensis RT8-4, a strain obtained from a hot spring in Tengchong, China, and characterized. The plasmid is a circular double-stranded DNA molecule of 20,417 bp. Among a total of 26 predicted pTC1 ORFs, 23 have homologues in other known Sulfolobus conjugative plasmids (CPs). pTC1 resembles other Sulfolobus CPs in genome architecture, and is most highly conserved in the genomic region encoding conjugation functions. However, attempts to demonstrate experimentally the capacity of the plasmid for conjugational transfer were unsuccessful. A survey revealed that pTC1 and its closely related plasmid variants were widespread in the geothermal area of Tengchong. Variations of the plasmids at the target sites for transposition by an insertion sequence (IS) and a miniature inverted-repeat transposable element (MITE) were readily detected. The IS was efficiently inserted into the pTC1 genome, and the inserted sequence was inactivated and degraded more frequently in an imprecise manner than in a precise manner. These results suggest that the host organism has evolved a strategy to maintain a balance between the insertion and elimination of mobile genetic elements to permit genomic plasticity while inhibiting their fast spreading.

  1. Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health

    PubMed Central

    Martin, William F.

    2017-01-01

    Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. PMID:28444372

  2. Discovery of rare, diagnostic AluYb8/9 elements in diverse human populations.

    PubMed

    Feusier, Julie; Witherspoon, David J; Scott Watkins, W; Goubert, Clément; Sasani, Thomas A; Jorde, Lynn B

    2017-01-01

    Polymorphic human Alu elements are excellent tools for assessing population structure, and new retrotransposition events can contribute to disease. Next-generation sequencing has greatly increased the potential to discover Alu elements in human populations, and various sequencing and bioinformatics methods have been designed to tackle the problem of detecting these highly repetitive elements. However, current techniques for Alu discovery may miss rare, polymorphic Alu elements. Combining multiple discovery approaches may provide a better profile of the polymorphic Alu mobilome. Alu Yb8/9 elements have been a focus of our recent studies as they are young subfamilies (~2.3 million years old) that contribute ~30% of recent polymorphic Alu retrotransposition events. Here, we update our ME-Scan methods for detecting Alu elements and apply these methods to discover new insertions in a large set of individuals with diverse ancestral backgrounds. We identified 5,288 putative Alu insertion events, including several hundred novel Alu Yb8/9 elements from 213 individuals from 18 diverse human populations. Hundreds of these loci were specific to continental populations, and 23 non-reference population-specific loci were validated by PCR. We provide high-quality sequence information for 68 rare Alu Yb8/9 elements, of which 11 have hallmarks of an active source element. Our subfamily distribution of rare Alu Yb8/9 elements is consistent with previous datasets, and may be representative of rare loci. We also find that while ME-Scan and low-coverage, whole-genome sequencing (WGS) detect different Alu elements in 41 1000 Genomes individuals, the two methods yield similar population structure results. Current in-silico methods for Alu discovery may miss rare, polymorphic Alu elements. Therefore, using multiple techniques can provide a more accurate profile of Alu elements in individuals and populations. We improved our false-negative rate as an indicator of sample quality for future ME-Scan experiments. In conclusion, we demonstrate that ME-Scan is a good supplement for next-generation sequencing methods and is well-suited for population-level analyses.

  3. Tnt1 Retrotransposon Mutagenesis: A Tool for Soybean Functional Genomics1[W][OA

    PubMed Central

    Cui, Yaya; Barampuram, Shyam; Stacey, Minviluz G.; Hancock, C. Nathan; Findley, Seth; Mathieu, Melanie; Zhang, Zhanyuan; Parrott, Wayne A.; Stacey, Gary

    2013-01-01

    Insertional mutagenesis is a powerful tool for determining gene function in both model and crop plant species. Tnt1, the transposable element of tobacco (Nicotiana tabacum) cell type 1, is a retrotransposon that replicates via an RNA copy that is reverse transcribed and integrated elsewhere in the plant genome. Based on studies in a variety of plants, Tnt1 appears to be inactive in normal plant tissue but can be reactivated by tissue culture. Our goal was to evaluate the utility of the Tnt1 retrotransposon as a mutagenesis strategy in soybean (Glycine max). Experiments showed that the Tnt1 element was stably transformed into soybean plants by Agrobacterium tumefaciens-mediated transformation. Twenty-seven independent transgenic lines carrying Tnt1 insertions were generated. Southern-blot analysis revealed that the copy number of transposed Tnt1 elements ranged from four to 19 insertions, with an average of approximately eight copies per line. These insertions showed Mendelian segregation and did not transpose under normal growth conditions. Analysis of 99 Tnt1 flanking sequences revealed insertions into 62 (62%) annotated genes, indicating that the element preferentially inserts into protein-coding regions. Tnt1 insertions were found in all 20 soybean chromosomes, indicating that Tnt1 transposed throughout the soybean genome. Furthermore, fluorescence in situ hybridization experiments validated that Tnt1 inserted into multiple chromosomes. Passage of transgenic lines through two different tissue culture treatments resulted in Tnt1 transposition, significantly increasing the number of insertions per line. Thus, our data demonstrate the Tnt1 retrotransposon to be a powerful system that can be used for effective large-scale insertional mutagenesis in soybean. PMID:23124322

  4. Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia.

    PubMed

    Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W

    1996-02-15

    Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U).

  5. Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia.

    PubMed Central

    Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W

    1996-01-01

    Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U). PMID:8604302

  6. Nuclear Mitochondrial DNA Activates Replication in Saccharomyces cerevisiae

    PubMed Central

    Chatre, Laurent; Ricchetti, Miria

    2011-01-01

    The nuclear genome of eukaryotes is colonized by DNA fragments of mitochondrial origin, called NUMTs. These insertions have been associated with a variety of germ-line diseases in humans. The significance of this uptake of potentially dangerous sequences into the nuclear genome is unclear. Here we provide functional evidence that sequences of mitochondrial origin promote nuclear DNA replication in Saccharomyces cerevisiae. We show that NUMTs are rich in key autonomously replicating sequence (ARS) consensus motifs, whose mutation results in the reduction or loss of DNA replication activity. Furthermore, 2D-gel analysis of the mrc1 mutant exposed to hydroxyurea shows that several NUMTs function as late chromosomal origins. We also show that NUMTs located close to or within ARS provide key sequence elements for replication. Thus NUMTs can act as independent origins, when inserted in an appropriate genomic context or affect the efficiency of pre-existing origins. These findings show that migratory mitochondrial DNAs can impact on the replication of the nuclear region they are inserted in. PMID:21408151

  7. Nuclear mitochondrial DNA activates replication in Saccharomyces cerevisiae.

    PubMed

    Chatre, Laurent; Ricchetti, Miria

    2011-03-08

    The nuclear genome of eukaryotes is colonized by DNA fragments of mitochondrial origin, called NUMTs. These insertions have been associated with a variety of germ-line diseases in humans. The significance of this uptake of potentially dangerous sequences into the nuclear genome is unclear. Here we provide functional evidence that sequences of mitochondrial origin promote nuclear DNA replication in Saccharomyces cerevisiae. We show that NUMTs are rich in key autonomously replicating sequence (ARS) consensus motifs, whose mutation results in the reduction or loss of DNA replication activity. Furthermore, 2D-gel analysis of the mrc1 mutant exposed to hydroxyurea shows that several NUMTs function as late chromosomal origins. We also show that NUMTs located close to or within ARS provide key sequence elements for replication. Thus NUMTs can act as independent origins, when inserted in an appropriate genomic context or affect the efficiency of pre-existing origins. These findings show that migratory mitochondrial DNAs can impact on the replication of the nuclear region they are inserted in.

  8. The Nucleotide Excision Repair Pathway Limits L1 Retrotransposition

    PubMed Central

    Servant, Geraldine; Streva, Vincent A.; Derbes, Rebecca S.; Wijetunge, Madushani I.; Neeland, Marc; White, Travis B.; Belancio, Victoria P.; Roy-Engel, Astrid M.; Deininger, Prescott L.

    2017-01-01

    Long interspersed elements 1 (L1) are active mobile elements that constitute almost 17% of the human genome. They amplify through a “copy-and-paste” mechanism termed retrotransposition, and de novo insertions related to these elements have been reported to cause 0.2% of genetic diseases. Our previous data demonstrated that the endonuclease complex ERCC1-XPF, which cleaves a 3′ DNA flap structure, limits L1 retrotransposition. Although the ERCC1-XPF endonuclease participates in several different DNA repair pathways, such as single-strand annealing, or in telomere maintenance, its recruitment to DNA lesions is best characterized in the nucleotide excision repair (NER) pathway. To determine if the NER pathway prevents the insertion of retroelements in the genome, we monitored the retrotransposition efficiencies of engineered L1 elements in NER-deficient cells and in their complemented versions. Core proteins of the NER pathway, XPD and XPA, and the lesion binding protein, XPC, are involved in limiting L1 retrotransposition. In addition, sequence analysis of recovered de novo L1 inserts and their genomic locations in NER-deficient cells demonstrated the presence of abnormally large duplications at the site of insertion, suggesting that NER proteins may also play a role in the normal L1 insertion process. Here, we propose new functions for the NER pathway in the maintenance of genome integrity: limitation of insertional mutations caused by retrotransposons and the prevention of potentially mutagenic large genomic duplications at the site of retrotransposon insertion events. PMID:28049704

  9. Mobile elements reveal small population size in the ancient ancestors of Homo sapiens.

    PubMed

    Huff, Chad D; Xing, Jinchuan; Rogers, Alan R; Witherspoon, David; Jorde, Lynn B

    2010-02-02

    The genealogies of different genetic loci vary in depth. The deeper the genealogy, the greater the chance that it will include a rare event, such as the insertion of a mobile element. Therefore, the genealogy of a region that contains a mobile element is on average older than that of the rest of the genome. In a simple demographic model, the expected time to most recent common ancestor (TMRCA) is doubled if a rare insertion is present. We test this expectation by examining single nucleotide polymorphisms around polymorphic Alu insertions from two completely sequenced human genomes. The estimated TMRCA for regions containing a polymorphic insertion is two times larger than the genomic average (P < <10(-30)), as predicted. Because genealogies that contain polymorphic mobile elements are old, they are shaped largely by the forces of ancient population history and are insensitive to recent demographic events, such as bottlenecks and expansions. Remarkably, the information in just two human DNA sequences provides substantial information about ancient human population size. By comparing the likelihood of various demographic models, we estimate that the effective population size of human ancestors living before 1.2 million years ago was 18,500, and we can reject all models where the ancient effective population size was larger than 26,000. This result implies an unusually small population for a species spread across the entire Old World, particularly in light of the effective population sizes of chimpanzees (21,000) and gorillas (25,000), which each inhabit only one part of a single continent.

  10. Long interspersed element-1 (LINE-1): passenger or driver in human neoplasms?

    PubMed

    Rodić, Nemanja; Burns, Kathleen H

    2013-03-01

    LINE-1 (L1) retrotransposons make up a significant portion of human genomes, with an estimated 500,000 copies per genome. Like other retrotransposons, L1 retrotransposons propagate through RNA sequences that are reverse transcribed into DNA sequences, which are integrated into new genomic loci. L1 somatic insertions have the potential to disrupt the transcriptome by inserting into or nearby genes. By mutating genes and playing a role in epigenetic dysregulation, L1 transposons may contribute to tumorigenesis. Studies of the "mobilome" have lagged behind other tumor characterizations at the sequence, transcript, and epigenetic levels. Here, we consider evidence that L1 retrotransposons may sometimes drive human tumorigenesis.

  11. Lineage-specific genomics: Frequent birth and death in the human genome: The human genome contains many lineage-specific elements created by both sequence and functional turnover.

    PubMed

    Young, Robert S

    2016-07-01

    Frequent evolutionary birth and death events have created a large quantity of biologically important, lineage-specific DNA within mammalian genomes. The birth and death of DNA sequences is so frequent that the total number of these insertions and deletions in the human population remains unknown, although there are differences between these groups, e.g. transposable elements contribute predominantly to sequence insertion. Functional turnover - where the activity of a locus is specific to one lineage, but the underlying DNA remains conserved - can also drive birth and death. However, this does not appear to be a major driver of divergent transcriptional regulation. Both sequence and functional turnover have contributed to the birth and death of thousands of functional promoters in the human and mouse genomes. These findings reveal the pervasive nature of evolutionary birth and death and suggest that lineage-specific regions may play an important but previously underappreciated role in human biology and disease. © 2016 The Authors BioEssays Published by WILEY Periodicals, Inc.

  12. Identification and Characterization of Domesticated Bacterial Transposases

    PubMed Central

    Gallie, Jenna; Rainey, Paul B.

    2017-01-01

    Abstract Selfish genetic elements, such as insertion sequences and transposons are found in most genomes. Transposons are usually identifiable by their high copy number within genomes. In contrast, REP-associated tyrosine transposases (RAYTs), a recently described class of bacterial transposase, are typically present at just one copy per genome. This suggests that RAYTs no longer copy themselves and thus they no longer function as a typical transposase. Motivated by this possibility we interrogated thousands of fully sequenced bacterial genomes in order to determine patterns of RAYT diversity, their distribution across chromosomes and accessory elements, and rate of duplication. RAYTs encompass exceptional diversity and are divisible into at least five distinct groups. They possess features more similar to housekeeping genes than insertion sequences, are predominantly vertically transmitted and have persisted through evolutionary time to the point where they are now found in 24% of all species for which at least one fully sequenced genome is available. Overall, the genomic distribution of RAYTs suggests that they have been coopted by host genomes to perform a function that benefits the host cell. PMID:28910967

  13. Repeated extragenic sequences in prokaryotic genomes: a proposal for the origin and dynamics of the RUP element in Streptococcus pneumoniae.

    PubMed

    Oggioni, M R; Claverys, J P

    1999-10-01

    A survey of all Streptococcus pneumoniae GenBank/EMBL DNA sequence entries and of the public domain sequence (representing more than 90% of the genome) of an S. pneumoniae type 4 strain allowed identification of 108 copies of a 107-bp-long highly repeated intergenic element called RUP (for repeat unit of pneumococcus). Several features of the element, revealed in this study, led to the proposal that RUP is an insertion sequence (IS)-derivative that could still be mobile. Among these features are: (1) a highly significant homology between the terminal inverted repeats (IRs) of RUPs and of IS630-Spn1, a new putative IS of S. pneumoniae; and (2) insertion at a TA dinucleotide, a characteristic target of several members of the IS630 family. Trans-mobilization of RUP is therefore proposed to be mediated by the transposase of IS630-Spn1. To account for the observation that RUPs are distributed among four subtypes which exhibit different degrees of sequence homogeneity, a scenario is invoked based on successive stages of RUP mobility and non-mobility, depending on whether an active transposase is present or absent. In the latter situation, an active transposase could be reintroduced into the species through natural transformation. Examination of sequences flanking RUP revealed a preferential association with ISs. It also provided evidence that RUPs promote sequence rearrangements, thereby contributing to genome flexibility. The possibility that RUP preferentially targets transforming DNA of foreign origin and subsequently favours disruption/rearrangement of exogenous sequences is discussed.

  14. Selfish DNA in protein-coding genes of Rickettsia.

    PubMed

    Ogata, H; Audic, S; Barbe, V; Artiguenave, F; Fournier, P E; Raoult, D; Claverie, J M

    2000-10-13

    Rickettsia conorii, the aetiological agent of Mediterranean spotted fever, is an intracellular bacterium transmitted by ticks. Preliminary analyses of the nearly complete genome sequence of R. conorii have revealed 44 occurrences of a previously undescribed palindromic repeat (150 base pairs long) throughout the genome. Unexpectedly, this repeat was found inserted in-frame within 19 different R. conorii open reading frames likely to encode functional proteins. We found the same repeat in proteins of other Rickettsia species. The finding of a mobile element inserted in many unrelated genes suggests the potential role of selfish DNA in the creation of new protein sequences.

  15. Alu repeats: A source for the genesis of primate microsatellites

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Arcot, S.S.; Batzer, M.A.; Wang, Zhenyuan

    1995-09-01

    As a result of their abundance, relatively uniform distribution, and high degree of polymorphism, microsatellites and minisatellites have become valuable tools in genetic mapping, forensic identity testing, and population studies. In recent years, a number of microsatellite repeats have been found to be associated with Alu interspersed repeated DNA elements. The association of an Alu element with a microsatellite repeat could result from the integration of an Alu element within a preexisting microsatellite repeat. Alternatively, Alu elements could have a direct role in the origin of microsatellite repeats. Errors introduced during reverse transcription of the primary transcript derived from anmore » Alu {open_quotes}master{close_quote} gene or the accumulation of random mutations in the middle A-rich regions and oligo(dA)-rich tails of Alu elements after insertion and subsequent expansion and contraction of these sequences could result in the genesis of a microsatellite repeat. We have tested these hypotheses by a direct evolutionary comparison of the sequences of some recent Alu elements that are found only in humans and are absent from nonhuman primates, as well as some older Alu elements that are present at orthologous positions in a number of nonhuman primates. The origin of {open_quotes}young{close_quotes} Alu insertions, absence of sequences that resemble microsatellite repeats at the orthologous loci in chimpanzees, and the gradual expansion of microsatellite repeats in some old Alu repeats at orthologous positions within the genomes of a number of nonhuman primates suggest that Alu elements are a source for the genesis of primate microsatellite repeats. 48 refs., 5 figs., 3 tabs.« less

  16. Germline Transformation of Drosophila Virilis Mediated by the Transposable Element Hobo

    PubMed Central

    Lozovskaya, E. R.; Nurminsky, D. I.; Hartl, D. L.; Sullivan, D. T.

    1996-01-01

    A laboratory strain of Drosophila virilis was genetically transformed with a hobo vector carrying the miniwhite cassette using a helper plasmid with an hsp70-driven hobo transposase-coding sequence. The rate of transformation was 0.5% per fertile G0 animal. Three transgenic insertions were cloned and characterized and found to be authentic hobo insertions. These results, together with the known wide-spread distribution of hobo in diverse insect species, suggest that hobo and related transposable elements may be of considerable utility in the germline transformation of insects other than D. melanogaster. PMID:8770594

  17. Successful Gene Tagging in Lettuce Using the Tnt1 Retrotransposon from Tobacco

    PubMed Central

    Mazier, Marianne; Botton, Emmanuel; Flamain, Fabrice; Bouchet, Jean-Paul; Courtial, Béatrice; Chupeau, Marie-Christine; Chupeau, Yves; Maisonneuve, Brigitte; Lucas, Hélène

    2007-01-01

    The tobacco (Nicotiana tabacum) element Tnt1 is one of the few identified active retrotransposons in plants. These elements possess unique properties that make them ideal genetic tools for gene tagging. Here, we demonstrate the feasibility of gene tagging using the retrotransposon Tnt1 in lettuce (Lactuca sativa), which is the largest genome tested for retrotransposon mutagenesis so far. Of 10 different transgenic bushes carrying a complete Tnt1 containing T-DNA, eight contained multiple transposed copies of Tnt1. The number of transposed copies of the element per plant was particularly high, the smallest number being 28. Tnt1 transposition in lettuce can be induced by a very simple in vitro culture protocol. Tnt1 insertions were stable in the progeny of the primary transformants and could be segregated genetically. Characterization of the sequences flanking some insertion sites revealed that Tnt1 often inserted into genes. The progeny of some primary transformants showed phenotypic alterations due to recessive mutations. One of these mutations was due to Tnt1 insertion in the gibberellin 3β-hydroxylase gene. Taken together, these results indicate that Tnt1 is a powerful tool for insertion mutagenesis especially in plants with a large genome. PMID:17351058

  18. A VNTR element associated with steroid sulfatase gene deletions stimulates recombination in cultured cells

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gong, Y.; Li, X.M.; Shapiro, L.J.

    1994-09-01

    Steroid sulfatase deficiency is a common genetic disorder, with a prevalence of approximately one in every 3500 males world wide. About 90% of these patients have complete gene deletions, which appear to result from recombination between members of a low-copy repeat family (CRI-232 is the prototype) that flank the gene. RU1 and RU2 are two VNTR elements found within each of these family members. RU1 consists of 30 bp repeating units and its length shows minimal variation among individuals. The RU2 element consists of repeating sequences which are highly asymmetric, with about 90% purines and no C`s on one strand,more » and range from 0.6 kb to over 23 kb among different individuals. We conducted a study to determine if the RU1 or RU2 elements can promote recombination in an in vivo test system. We inserted these elements adjacent to the neo gene in each of two pSV2neo derivatives, one of which has a deletion in the 5{prime} portion of the neo gene and the other having a deletion in the 3{prime} portion. These plasmids were combined and used to transfect EJ cells. Survival of cells in G418 indicates restoration of a functional neo gene by recombination between two deletion constructs. Thus counting G418 resistant colonies gives a quantitative measure of the enhancement of recombination by the inserted VNTR elements. The results showed no effect on recombination by the inserted RU1 element (compared to the insertion of a nonspecific sequence), while the RU2 element stimulated recombination by 3.5-fold (P<0.01). A separate set of constructs placed RU1 or RU2 within the intron of an exon trapping vector. Following tranfection of cells, recombination events were monitored by a PCR assay that detected the approximation of primer binding sites (as a result of recombination). These studies showed that, as in the first set of experiments, the highly variable RU2 element is capable of stimulating somatic recombination in mammalian cells.« less

  19. CMV-promoter driven codon-optimized expression alters the assembly type and morphology of a reconstituted HERV-K(HML-2).

    PubMed

    Hohn, Oliver; Hanke, Kirsten; Lausch, Veronika; Zimmermann, Anja; Mostafa, Saeed; Bannert, Norbert

    2014-11-11

    The HERV-K(HML-2) family contains the most recently integrated and best preserved endogenized proviral sequences in the human genome. All known elements have nevertheless been subjected to mutations or deletions that render expressed particles non-infectious. Moreover, these post-insertional mutations hamper the analysis of the general biological properties of this ancient virus family. The expression of consensus sequences and sequences of elements with reverted post-insertional mutations has therefore been very instrumental in overcoming this limitation. We investigated the particle morphology of a recently reconstituted HERV-K113 element termed oriHERV-K113 using thin-section electron microscopy (EM) and could demonstrate that strong overexpression by substitution of the 5'LTR for a CMV promoter and partial codon optimization altered the virus assembly type and morphology. This included a conversion from the regular C-type to an A-type morphology with a mass of cytoplasmic immature cores tethered to the cell membrane and the membranes of vesicles. Overexpression permitted the release and maturation of virions but reduced the envelope content. A weaker boost of virus expression by Staufen-1 was not sufficient to induce these morphological alterations.

  20. CMV-Promoter Driven Codon-Optimized Expression Alters the Assembly Type and Morphology of a Reconstituted HERV-K(HML-2)

    PubMed Central

    Hohn, Oliver; Hanke, Kirsten; Lausch, Veronika; Zimmermann, Anja; Mostafa, Saeed; Bannert, Norbert

    2014-01-01

    The HERV-K(HML-2) family contains the most recently integrated and best preserved endogenized proviral sequences in the human genome. All known elements have nevertheless been subjected to mutations or deletions that render expressed particles non-infectious. Moreover, these post-insertional mutations hamper the analysis of the general biological properties of this ancient virus family. The expression of consensus sequences and sequences of elements with reverted post-insertional mutations has therefore been very instrumental in overcoming this limitation. We investigated the particle morphology of a recently reconstituted HERV-K113 element termed oriHERV-K113 using thin-section electron microscopy (EM) and could demonstrate that strong overexpression by substitution of the 5'LTR for a CMV promoter and partial codon optimization altered the virus assembly type and morphology. This included a conversion from the regular C-type to an A-type morphology with a mass of cytoplasmic immature cores tethered to the cell membrane and the membranes of vesicles. Overexpression permitted the release and maturation of virions but reduced the envelope content. A weaker boost of virus expression by Staufen-1 was not sufficient to induce these morphological alterations. PMID:25393897

  1. Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health.

    PubMed

    Hazkani-Covo, Einat; Martin, William F

    2017-05-01

    Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  2. Expression of homing endonuclease gene and insertion-like element in sea anemone mitochondrial genomes: Lesson learned from Anemonia viridis.

    PubMed

    Chi, Sylvia Ighem; Urbarova, Ilona; Johansen, Steinar D

    2018-04-30

    The mitochondrial genomes of sea anemones are dynamic in structure. Invasion by genetic elements, such as self-catalytic group I introns or insertion-like sequences, contribute to sea anemone mitochondrial genome expansion and complexity. By using next generation sequencing we investigated the complete mtDNAs and corresponding transcriptomes of the temperate sea anemone Anemonia viridis and its closer tropical relative Anemonia majano. Two versions of fused homing endonuclease gene (HEG) organization were observed among the Actiniidae sea anemones; in-frame gene fusion and pseudo-gene fusion. We provided support for the pseudo-gene fusion organization in Anemonia species, resulting in a repressed HEG from the COI-884 group I intron. orfA, a putative protein-coding gene with insertion-like features, was present in both Anemonia species. Interestingly, orfA and COI expression were significantly up-regulated upon long-term environmental stress corresponding to low seawater pH conditions. This study provides new insights to the dynamics of sea anemone mitochondrial genome structure and function. Copyright © 2018 Elsevier B.V. All rights reserved.

  3. SECIS elements in the coding regions of selenoprotein transcripts are functional in higher eukaryotes

    PubMed Central

    Mix, Heiko; Lobanov, Alexey V.; Gladyshev, Vadim N.

    2007-01-01

    Expression of selenocysteine (Sec)-containing proteins requires the presence of a cis-acting mRNA structure, called selenocysteine insertion sequence (SECIS) element. In bacteria, this structure is located in the coding region immediately downstream of the Sec-encoding UGA codon, whereas in eukaryotes a completely different SECIS element has evolved in the 3′-untranslated region. Here, we report that SECIS elements in the coding regions of selenoprotein mRNAs support Sec insertion in higher eukaryotes. Comprehensive computational analysis of all available viral genomes revealed a SECIS element within the ORF of a naturally occurring selenoprotein homolog of glutathione peroxidase 4 in fowlpox virus. The fowlpox SECIS element supported Sec insertion when expressed in mammalian cells as part of the coding region of viral or mammalian selenoproteins. In addition, readthrough at UGA was observed when the viral SECIS element was located upstream of the Sec codon. We also demonstrate successful de novo design of a functional SECIS element in the coding region of a mammalian selenoprotein. Our data provide evidence that the location of the SECIS element in the untranslated region is not a functional necessity but rather is an evolutionary adaptation to enable a more efficient synthesis of selenoproteins. PMID:17169995

  4. Mycobacterium smegmatis strain for detection of Mycobacterium tuberculosis by PCR used as internal control for inhibition of amplification and for quantification of bacteria.

    PubMed Central

    Kolk, A H; Noordhoek, G T; de Leeuw, O; Kuijper, S; van Embden, J D

    1994-01-01

    For the detection of Mycobacterium tuberculosis by PCR, the IS6110 sequence was used. A modified target was constructed by insertion of 56 nucleotides in the IS6110 insertion element of Mycobacterium bovis BCG. This modified insertion sequence was integrated into the genome of Mycobacterium smegmatis, a mycobacterium species which does not contain the IS6110 element. When DNA from the modified M. smegmatis 1008 strain was amplified with IS6110-specific primers INS1 and INS2, a band of 301 bp was seen on agarose gel, whereas the PCR product of M. tuberculosis complex DNA was a 245-bp fragment with these primers. The addition of a small number of M. smegmatis 1008 cells to clinical samples before DNA purification enables the detection of problems which may be due to the loss of DNA in the isolation procedure or to the presence of inhibitors. The presence of inhibitors of the amplification reaction can be confirmed by the addition of M. smegmatis 1008 DNA after the DNA isolation procedure. Furthermore, competition between the different target DNAs of M. smegmatis 1008 DNA and M. tuberculosis complex DNA enables the estimation of the number of IS6110 elements in the clinical sample. Images PMID:8051267

  5. New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome.

    PubMed

    Garazha, Andrew; Ivanova, Alena; Suntsova, Maria; Malakhova, Galina; Roumiantsev, Sergey; Zhavoronkov, Alex; Buzdin, Anton

    2015-01-01

    Endogenous retroviruses (ERVs) and LTR retrotransposons (LRs) occupy ∼8% of human genome. Deep sequencing technologies provide clues to understanding of functional relevance of individual ERVs/LRs by enabling direct identification of transcription factor binding sites (TFBS) and other landmarks of functional genomic elements. Here, we performed the genome-wide identification of human ERVs/LRs containing TFBS according to the ENCODE project. We created the first interactive ERV/LRs database that groups the individual inserts according to their familial nomenclature, number of mapped TFBS and divergence from their consensus sequence. Information on any particular element can be easily extracted by the user. We also created a genome browser tool, which enables quick mapping of any ERV/LR insert according to genomic coordinates, known human genes and TFBS. These tools can be used to easily explore functionally relevant individual ERV/LRs, and for studying their impact on the regulation of human genes. Overall, we identified ∼110,000 ERV/LR genomic elements having TFBS. We propose a hypothesis of "domestication" of ERV/LR TFBS by the genome milieu including subsequent stages of initial epigenetic repression, partial functional release, and further mutation-driven reshaping of TFBS in tight coevolution with the enclosing genomic loci.

  6. Genome Sequence of Lactobacillus delbrueckii subsp. lactis CNRZ327, a Dairy Bacterium with Anti-Inflammatory Properties.

    PubMed

    El Kafsi, Hela; Binesse, Johan; Loux, Valentin; Buratti, Julien; Boudebbouze, Samira; Dervyn, Rozenn; Hammani, Amal; Maguin, Emmanuelle; van de Guchte, Maarten

    2014-07-17

    Lactobacillus delbrueckii subsp. lactis CNRZ327 is a dairy bacterium with anti-inflammatory properties both in vitro and in vivo. Here, we report the genome sequence of this bacterium, which appears to contain no less than 215 insertion sequence (IS) elements, an exceptionally high number regarding the small genome size of the strain. Copyright © 2014 El Kafsi et al.

  7. A Forward Genetic Screening for Prostate Cancer Progression Genes

    DTIC Science & Technology

    2012-10-01

    sequence  reads. For verifying  the  prevalence of insertions in tumors, PCR was  performed on  genomic  DNA corresponding to 15 insertional mutations using...and has been utilized with great effect in many organisms, from the bacterium to the fruit fly Drosophila melanogaster [1,2]. The Sleeping Beauty (SB...TX SL JC TN. References 1. Cooley L, Kelley R, Spradling A (1988) Insertional mutagenesis of the Drosophila genome with single P elements. Science

  8. Three new insertion sequence elements ISLdl2, ISLdl3, and ISLdl4 in Lactobacillus delbrueckii: isolation, molecular characterization, and potential use for strain identification.

    PubMed

    Ravin, Victor; Alatossava, Tapani

    2003-05-01

    A group of new insertion sequence (IS) elements, ISLdl2, ISLdl3, and ISLdl4, from Lactobacillus delbrueckii subsp. lactis ATCC 15808 was isolated, characterized, and used for strain identification together with ISLdl1, recently characterized as an L. delbrueckii IS element belonging to the ISL3 family. ISLdl2 was 1367 bp in size and had a 24 bp IR and an 8 bp DR. The single ORF of ISLdl2 encoded a protein of 392 aa similar to transposases of the IS256 family. ISLdl3 had a single ORF encoding a protein of 343 aa similar to transposases of the IS30 family. Finally, ISLdl4 had a single ORF encoding a protein of 406 aa and displayed homology to the transposases of the IS110 family. ISLdl4 was only slight different from ISL4 (Accession No. AY040213). ISLdl1, ISLdl2, and ISLdl4 were present in all of the 10 L. delbrueckii subsp. lactis and subsp. delbrueckii strains tested, as well as in three of the 11 L. delbrueckii subsp. bulgaricus strains tested. ISLdl3 was present only in four closely related strains of L. delbrueckii subsp. lactis. These IS elements were not observed in Lactobacillus rhamnosus, Lactobacillus acidophilus, Lactobacillus helveticus, or Lactobacillus plantarum. A cluster of IS elements, ISLdl1, ISLdl2, ISLdl3, ISLdl4, and ISL6, was observed in L. delbrueckii subsp. lactis strain ATCC 15808. Within this cluster, ISLdl4 was inserted into ISLdl1 between the left IR and the start codon of ORF455, encoding a putative transposase. Most of the integration sites of the IS elements were strain-specific. We have observed that IS elements can migrate from one strain to another as integral parts of bacterial DNA by using phage LL-H as a vehicle. We demonstrate for the first time that inverse PCR and vectorette PCR methods with primers based on sequences of the IS elements could be used for identification of L. delbrueckii strains.

  9. The application of the high throughput sequencing technology in the transposable elements.

    PubMed

    Liu, Zhen; Xu, Jian-hong

    2015-09-01

    High throughput sequencing technology has dramatically improved the efficiency of DNA sequencing, and decreased the costs to a great extent. Meanwhile, this technology usually has advantages of better specificity, higher sensitivity and accuracy. Therefore, it has been applied to the research on genetic variations, transcriptomics and epigenomics. Recently, this technology has been widely employed in the studies of transposable elements and has achieved fruitful results. In this review, we summarize the application of high throughput sequencing technology in the fields of transposable elements, including the estimation of transposon content, preference of target sites and distribution, insertion polymorphism and population frequency, identification of rare copies, transposon horizontal transfers as well as transposon tagging. We also briefly introduce the major common sequencing strategies and algorithms, their advantages and disadvantages, and the corresponding solutions. Finally, we envision the developing trends of high throughput sequencing technology, especially the third generation sequencing technology, and its application in transposon studies in the future, hopefully providing a comprehensive understanding and reference for related scientific researchers.

  10. Insertion sequence ISRP10 inactivation of the oprD gene in imipenem-resistant Pseudomonas aeruginosa clinical isolates.

    PubMed

    Sun, Qinghui; Ba, Zhaofen; Wu, Guoying; Wang, Wei; Lin, Shuxiang; Yang, Hongjiang

    2016-05-01

    Carbapenem resistance mechanisms were investigated in 32 imipenem-resistant Pseudomonas aeruginosa clinical isolates recovered from hospitalised children. Sequence analysis revealed that 31 of the isolates had an insertion sequence element ISRP10 disrupting the porin gene oprD, demonstrating that ISRP10 inactivation of oprD conferred imipenem resistance in the majority of the isolates. Multilocus sequence typing (MLST) was used to discriminate the isolates. In total, 11 sequence types (STs) were identified including 3 novel STs, and 68.3% (28/41) of the tested strains were characterised as clone ST253. In combination with random amplified polymorphic DNA (RAPD) analysis, the imipenem-resistant isolates displayed a relatively high degree of genetic variability and were unlikely associated with nosocomial infections. Copyright © 2016 Elsevier B.V. and the International Society of Chemotherapy. All rights reserved.

  11. Short interspersed elements (SINEs) are a major source of canine genomic diversity.

    PubMed

    Wang, Wei; Kirkness, Ewen F

    2005-12-01

    SINEs are retrotransposons that have enjoyed remarkable reproductive success during the course of mammalian evolution, and have played a major role in shaping mammalian genomes. Previously, an analysis of survey-sequence data from an individual dog (a poodle) indicated that canine genomes harbor a high frequency of alleles that differ only by the absence or presence of a SINEC_Cf repeat. Comparison of this survey-sequence data with a draft genome sequence of a distinct dog (a boxer) has confirmed this prediction, and revealed the chromosomal coordinates for >10,000 loci that are bimorphic for SINEC_Cf insertions. Analysis of SINE insertion sites from the genomes of nine additional dogs indicates that 3%-5% are absent from either the poodle or boxer genome sequences--suggesting that an additional 10,000 bimorphic loci could be readily identified in the general dog population. We describe a methodology that can be used to identify these loci, and could be adapted to exploit these bimorphic loci for genotyping purposes. Approximately half of all annotated canine genes contain SINEC_Cf repeats, and these elements are occasionally transcribed. When transcribed in the antisense orientation, they provide splice acceptor sites that can result in incorporation of novel exons. The high frequency of bimorphic SINE insertions in the dog population is predicted to provide numerous examples of allele-specific transcription patterns that will be valuable for the study of differential gene expression among multiple dog breeds.

  12. Disruption of tetR type regulator adeN by mobile genetic element confers elevated virulence in Acinetobacter baumannii.

    PubMed

    Saranathan, Rajagopalan; Pagal, Sudhakar; Sawant, Ajit R; Tomar, Archana; Madhangi, M; Sah, Suresh; Satti, Annapurna; Arunkumar, K P; Prashanth, K

    2017-10-03

    Acinetobacter baumannii is an important human pathogen and considered as a major threat due to its extreme drug resistance. In this study, the genome of a hyper-virulent MDR strain PKAB07 of A. baumannii isolated from an Indian patient was sequenced and analyzed to understand its mechanisms of virulence, resistance and evolution. Comparative genome analysis of PKAB07 revealed virulence and resistance related genes scattered throughout the genome, instead of being organized as an island, indicating the highly mosaic nature of the genome. Many intermittent horizontal gene transfer events, insertion sequence (IS) element insertions identified were augmenting resistance machinery and elevating the SNP densities in A. baumannii eventually aiding in their swift evolution. ISAba1, the most widely distributed insertion sequence in A. baumannii was found in multiple sites in PKAB07. Out of many ISAba1 insertions, we identified novel insertions in 9 different genes wherein insertional inactivation of adeN (tetR type regulator) was significant. To assess the significance of this disruption in A. baumannii, adeN mutant and complement strains were constructed in A. baumannii ATCC 17978 strain and studied. Biofilm levels were abrogated in the adeN knockout when compared with the wild type and complemented strain of adeN knockout. Virulence of the adeN knockout mutant strain was observed to be high, which was validated by in vitro experiments and Galleria mellonella infection model. The overexpression of adeJ, a major component of AdeIJK efflux pump observed in adeN knockout strain could be the possible reason for the elevated virulence in adeN mutant and PKB07 strain. Knocking out of adeN in ATCC strain led to increased resistance and virulence at par with the PKAB07. Disruption of tetR type regulator adeN by ISAba1 consequently has led to elevated virulence in this pathogen.

  13. Lesion bypass activity of DNA polymerase θ (POLQ) is an intrinsic property of the pol domain and depends on unique sequence inserts.

    PubMed

    Hogg, Matthew; Seki, Mineaki; Wood, Richard D; Doublié, Sylvie; Wallace, Susan S

    2011-01-21

    DNA polymerase θ (POLQ, polθ) is a large, multidomain DNA polymerase encoded in higher eukaryotic genomes. It is important for maintaining genetic stability in cells and helping protect cells from DNA damage caused by ionizing radiation. POLQ contains an N-terminal helicase-like domain, a large central domain of indeterminate function, and a C-terminal polymerase domain with sequence similarity to the A-family of DNA polymerases. The enzyme has several unique properties, including low fidelity and the ability to insert and extend past abasic sites and thymine glycol lesions. It is not known whether the abasic site bypass activity is an intrinsic property of the polymerase domain or whether helicase activity is also required. Three "insertion" sequence elements present in POLQ are not found in any other A-family DNA polymerase, and it has been proposed that they may lend some unique properties to POLQ. Here, we analyzed the activity of the DNA polymerase in the absence of each sequence insertion. We found that the pol domain is capable of highly efficient bypass of abasic sites in the absence of the helicase-like or central domains. Insertion 1 increases the processivity of the polymerase but has little, if any, bearing on the translesion synthesis properties of the enzyme. However, removal of insertions 2 and 3 reduces activity on undamaged DNA and completely abrogates the ability of the enzyme to bypass abasic sites or thymine glycol lesions. Copyright © 2010 Elsevier Ltd. All rights reserved.

  14. Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project.

    PubMed

    Konkel, Miriam K; Walker, Jerilyn A; Hotard, Ashley B; Ranck, Megan C; Fontenot, Catherine C; Storer, Jessica; Stewart, Chip; Marth, Gabor T; Batzer, Mark A

    2015-08-29

    The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variations such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. During the pilot phase of the project we experimentally validated 645 (611 intergenic and 34 exon targeted) polymorphic "young" Alu insertion events, absent from the human reference genome. Here, we report high resolution sequencing of 343 (322 unique) recent Alu insertion events, along with their respective target site duplications, precise genomic breakpoint coordinates, subfamily assignment, percent divergence, and estimated A-rich tail lengths. All the sequenced Alu loci were derived from the AluY lineage with no evidence of retrotransposition activity involving older Alu families (e.g., AluJ and AluS). AluYa5 is currently the most active Alu subfamily in the human lineage, followed by AluYb8, and many others including three newly identified subfamilies we have termed AluYb7a3, AluYb8b1, and AluYa4a1. This report provides the structural details of 322 unique Alu variants from individual human genomes collectively adding about 100 kb of genomic variation. Many Alu subfamilies are currently active in human populations, including a surprising level of AluY retrotransposition. Human Alu subfamilies exhibit continuous evolution with potential drivers sprouting new Alu lineages. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  15. Identification of novel MITEs (miniature inverted-repeat transposable elements) in Coxiella burnetii: implications for protein and small RNA evolution.

    PubMed

    Wachter, Shaun; Raghavan, Rahul; Wachter, Jenny; Minnick, Michael F

    2018-04-11

    Coxiella burnetii is a Gram-negative gammaproteobacterium and zoonotic agent of Q fever. C. burnetii's genome contains an abundance of pseudogenes and numerous selfish genetic elements. MITEs (miniature inverted-repeat transposable elements) are non-autonomous transposons that occur in all domains of life and are thought to be insertion sequences (ISs) that have lost their transposase function. Like most transposable elements (TEs), MITEs are thought to play an active role in evolution by altering gene function and expression through insertion and deletion activities. However, information regarding bacterial MITEs is limited. We describe two MITE families discovered during research on small non-coding RNAs (sRNAs) of C. burnetii. Two sRNAs, Cbsr3 and Cbsr13, were found to originate from a novel MITE family, termed QMITE1. Another sRNA, CbsR16, was found to originate from a separate and novel MITE family, termed QMITE2. Members of each family occur ~ 50 times within the strains evaluated. QMITE1 is a typical MITE of 300-400 bp with short (2-3 nt) direct repeats (DRs) of variable sequence and is often found overlapping annotated open reading frames (ORFs). Additionally, QMITE1 elements possess sigma-70 promoters and are transcriptionally active at several loci, potentially influencing expression of nearby genes. QMITE2 is smaller (150-190 bps), but has longer (7-11 nt) DRs of variable sequences and is mainly found in the 3' untranslated region of annotated ORFs and intergenic regions. QMITE2 contains a GTAG repetitive extragenic palindrome (REP) that serves as a target for IS1111 TE insertion. Both QMITE1 and QMITE2 display inter-strain linkage and sequence conservation, suggesting that they are adaptive and existed before divergence of C. burnetii strains. We have discovered two novel MITE families of C. burnetii. Our finding that MITEs serve as a source for sRNAs is novel. QMITE2 has a unique structure and occurs in large or small versions with unique DRs that display linkage and sequence conservation between strains, allowing for tracking of genomic rearrangements. QMITE1 and QMITE2 copies are hypothesized to influence expression of neighboring genes involved in DNA repair and virulence through transcriptional interference and ribonuclease processing.

  16. Retrotransposons of the Tnt1B family are mobile in Nicotiana plumbaginifolia and can induce alternative splicing of the host gene upon insertion.

    PubMed

    Leprinc, A S; Grandbastien, M A; Christian, M

    2001-11-01

    Active retrotransposons have been identified in Nicotiana plumbaginifolia by their ability to disrupt the nitrate reductase gene in chlorate-resistant mutants selected from protoplast-derived cultures. In mutants E23 and F97, two independent insertions of Tnp2, a new retrotransposon closely related to the tobacco Tnt1 elements, were detected in the nitrate reductase gene. These two Tnp2 elements are members of the Tnt1B subfamily which shows that Tnt1B elements can be active and mutagenic in the N. plumbaginifolia genome. Furthermore, these results suggest that Tnt1B is the most active family of Tntl elements in N. plumbaginifolia, whereas in tobacco only members of the Tnt1A subfamily were found inserted in the nitrate reductase gene. The transcriptional regulations of Tnp2 and Tnt1A elements are most probably different due to non-conserved U3 regions. Our results thus support the hypothesis that different Nicotiana species contain different active Tntl subfamilies and that only one active Tntl subfamily might be maintained in each of these species. The Tnp2 insertion found in the F97 mutant was found to be spliced out of the nitrate reductase mRNA by activation of cryptic donor and acceptor sites in the nitrate reductase and the Tnp2 sequences respectively.

  17. Instability of plasmid DNA sequences: macro and micro evolution of the antibiotic resistance plasmid R6-5.

    PubMed

    Timmis, K N; Cabello, F; Andrés, I; Nordheim, A; Burkhardt, H J; Cohen, S N

    1978-11-16

    Detailed examination of the structure of cloned DNA fragments of the R6-5 antibiotic resistance plasmid has revealed a substantial degree of polynucleotide sequence heterogeneity and indicates that sequence rearrangements in plasmids and possible other replicons occur more frequently than has hitherto been appreciated. The sequences changes in cloned R6-5 fragments were shown in some instances to have occurred prior to cloning, i.e. existing in the original population of R6-5 molecules that was obtained from a single bacterial clone and by several different criteria judged to be homogeneous, and in others to have occurred either during the cloning procedure or during subsequent propagation of hybrid molecules. The molecular changes that are described involved insertion/deletion of the previously characterized IS2 insertion element, formation of a new inverted repeat structure probably by duplication of a preexisting R6-5 DNA sequence, sequence inversion, and loss and gain of restriction endonuclease cleavage sites.

  18. Recent amplification and impact of MITEs on the genome of grapevine (Vitis vinifera L.)

    PubMed Central

    Benjak, Andrej; Boué, Stéphanie; Forneck, Astrid

    2009-01-01

    Miniature inverted-repeat transposable elements (MITEs) are a particular type of defective class II transposons present in genomes as highly homogeneous populations of small elements. Their high copy number and close association to genes make their potential impact on gene evolution particularly relevant. Here, we present a detailed analysis of the MITE families directly related to grapevine “cut-and-paste” transposons. Our results show that grapevine MITEs have transduplicated and amplified genomic sequences, including gene sequences and fragments of other mobile elements. Our results also show that although some of the MITE families were already present in the ancestor of the European and American Vitis wild species, they have been amplified and have been actively transposing accompanying grapevine domestication and breeding. We show that MITEs are abundant in grapevine and some of them are frequently inserted within the untranslated regions of grapevine genes. MITE insertions are highly polymorphic among grapevine cultivars, which frequently generate transcript variability. The data presented here show that MITEs have greatly contributed to the grapevine genetic diversity which has been used for grapevine domestication and breeding. PMID:20333179

  19. What makes up plant genomes: The vanishing line between transposable elements and genes.

    PubMed

    Zhao, Dongyan; Ferguson, Ann A; Jiang, Ning

    2016-02-01

    The ultimate source of evolution is mutation. As the largest component in plant genomes, transposable elements (TEs) create numerous types of mutations that cannot be mimicked by other genetic mechanisms. When TEs insert into genomic sequences, they influence the expression of nearby genes as well as genes unlinked to the insertion. TEs can duplicate, mobilize, and recombine normal genes or gene fragments, with the potential to generate new genes or modify the structure of existing genes. TEs also donate their transposase coding regions for cellular functions in a process called TE domestication. Despite the host defense against TE activity, a subset of TEs survived and thrived through discreet selection of transposition activity, target site, element size, and the internal sequence. Finally, TEs have established strategies to reduce the efficacy of host defense system by increasing the cost of silencing TEs. This review discusses the recent progress in the area of plant TEs with a focus on the interaction between TEs and genes. Copyright © 2015 Elsevier B.V. All rights reserved.

  20. Genome-Wide Estimates of Transposable Element Insertion and Deletion Rates in Drosophila Melanogaster

    PubMed Central

    Adrion, Jeffrey R.; Song, Michael J.; Schrider, Daniel R.; Hahn, Matthew W.

    2017-01-01

    Abstract Knowing the rate at which transposable elements (TEs) insert and delete is critical for understanding their role in genome evolution. We estimated spontaneous rates of insertion and deletion for all known, active TE superfamilies present in a set of Drosophila melanogaster mutation-accumulation (MA) lines using whole genome sequence data. Our results demonstrate that TE insertions far outpace TE deletions in D. melanogaster. We found a significant effect of background genotype on TE activity, with higher rates of insertions in one MA line. We also found significant rate heterogeneity between the chromosomes, with both insertion and deletion rates elevated on the X relative to the autosomes. Further, we identified significant associations between TE activity and chromatin state, and tested for associations between TE activity and other features of the local genomic environment such as TE content, exon content, GC content, and recombination rate. Our results provide the most detailed assessment of TE mobility in any organism to date, and provide a useful benchmark for both addressing theoretical predictions of TE dynamics and for exploring large-scale patterns of TE movement in D. melanogaster and other species. PMID:28338986

  1. Genomic characterization of two large Alu-mediated rearrangements of the BRCA1 gene.

    PubMed

    Peixoto, Ana; Pinheiro, Manuela; Massena, Lígia; Santos, Catarina; Pinto, Pedro; Rocha, Patrícia; Pinto, Carla; Teixeira, Manuel R

    2013-02-01

    To determine whether a large genomic rearrangement is actually novel and to gain insight about the mutational mechanism responsible for its occurrence, molecular characterization with breakpoint identification is mandatory. We here report the characterization of two large deletions involving the BRCA1 gene. The first rearrangement harbored a 89,664-bp deletion comprising exon 7 of the BRCA1 gene to exon 11 of the NBR1 gene (c.441+1724_oNBR1:c.1073+480del). Two highly homologous Alu elements were found in the genomic sequences flanking the deletion breakpoints. Furthermore, a 20-bp overlapping sequence at the breakpoint junction was observed, suggesting that the most likely mechanism for the occurrence of this rearrangement was nonallelic homologous recombination. The second rearrangement fully characterized at the nucleotide level was a BRCA1 exons 11-15 deletion (c.671-319_4677-578delinsAlu). The case harbored a 23,363-bp deletion with an Alu element inserted at the breakpoints of the deleted region. As the Alu element inserted belongs to a still active AluY family, the observed rearrangement could be due to an insertion-mediated deletion mechanism caused by Alu retrotransposition. To conclude, we describe the breakpoints of two novel large deletions involving the BRCA1 gene and analysis of their genomic context allowed us to gain insight about the respective mutational mechanism.

  2. Potential Links between Hepadnavirus and Bornavirus Sequences in the Host Genome and Cancer.

    PubMed

    Honda, Tomoyuki

    2017-01-01

    Various viruses leave their sequences in the host genomes during infection. Such events occur mainly in retrovirus infection but also sometimes in DNA and non-retroviral RNA virus infections. If viral sequences are integrated into the genomes of germ line cells, the sequences can become inherited as endogenous viral elements (EVEs). The integration events of viral sequences may have oncogenic potential. Because proviral integrations of some retroviruses and/or reactivation of endogenous retroviruses are closely linked to cancers, viral insertions related to non-retroviral viruses also possibly contribute to cancer development. This article focuses on genomic viral sequences derived from two non-retroviral viruses, whose endogenization is already reported, and discusses their possible contributions to cancer. Viral insertions of hepatitis B virus play roles in the development of hepatocellular carcinoma. Endogenous bornavirus-like elements, the only non-retroviral RNA virus-related EVEs found in the human genome, may also be involved in cancer formation. In addition, the possible contribution of the interactions between viruses and retrotransposons, which seem to be a major driving force for generating EVEs related to non-retroviral RNA viruses, to cancers will be discussed. Future studies regarding the possible links described here may open a new avenue for the development of novel therapeutics for tumor virus-related cancers and/or provide novel insights into EVE functions.

  3. Evolution of Sphingomonad Gene Clusters Related to Pesticide Catabolism Revealed by Genome Sequence and Mobilomics of Sphingobium herbicidovorans MH

    PubMed Central

    Nielsen, Tue Kjærgaard; Rasmussen, Morten; Demanèche, Sandrine; Cecillon, Sébastien; Vogel, Timothy M.

    2017-01-01

    Abstract Bacterial degraders of chlorophenoxy herbicides have been isolated from various ecosystems, including pristine environments. Among these degraders, the sphingomonads constitute a prominent group that displays versatile xenobiotic-degradation capabilities. Four separate sequencing strategies were required to provide the complete sequence of the complex and plastic genome of the canonical chlorophenoxy herbicide-degrading Sphingobium herbicidovorans MH. The genome has an intricate organization of the chlorophenoxy-herbicide catabolic genes sdpA, rdpA, and cadABCD that encode the (R)- and (S)-enantiomer-specific 2,4-dichlorophenoxypropionate dioxygenases and four subunits of a Rieske non-heme iron oxygenase involved in 2-methyl-chlorophenoxyacetic acid degradation, respectively. Several major genomic rearrangements are proposed to help understand the evolution and mobility of these important genes and their genetic context. Single-strain mobilomic sequence analysis uncovered plasmids and insertion sequence-associated circular intermediates in this environmentally important bacterium and enabled the description of evolutionary models for pesticide degradation in strain MH and related organisms. The mobilome presented a complex mosaic of mobile genetic elements including four plasmids and several circular intermediate DNA molecules of insertion-sequence elements and transposons that are central to the evolution of xenobiotics degradation. Furthermore, two individual chromosomally integrated prophages were shown to excise and form free circular DNA molecules. This approach holds great potential for improving the understanding of genome plasticity, evolution, and microbial ecology. PMID:28961970

  4. The Paramecium germline genome provides a niche for intragenic parasitic DNA: evolutionary dynamics of internal eliminated sequences.

    PubMed

    Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda

    2012-01-01

    Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of -45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a -10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated.

  5. The Paramecium Germline Genome Provides a Niche for Intragenic Parasitic DNA: Evolutionary Dynamics of Internal Eliminated Sequences

    PubMed Central

    Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E.; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda

    2012-01-01

    Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of ∼45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a ∼10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated. PMID:23071448

  6. Rates and patterns of great ape retrotransposition.

    PubMed

    Hormozdiari, Fereydoun; Konkel, Miriam K; Prado-Martinez, Javier; Chiatante, Giorgia; Herraez, Irene Hernando; Walker, Jerilyn A; Nelson, Benjamin; Alkan, Can; Sudmant, Peter H; Huddleston, John; Catacchio, Claudia R; Ko, Arthur; Malig, Maika; Baker, Carl; Marques-Bonet, Tomas; Ventura, Mario; Batzer, Mark A; Eichler, Evan E

    2013-08-13

    We analyzed 83 fully sequenced great ape genomes for mobile element insertions, predicting a total of 49,452 fixed and polymorphic Alu and long interspersed element 1 (L1) insertions not present in the human reference assembly and assigning each retrotransposition event to a different time point during great ape evolution. We used these homoplasy-free markers to construct a mobile element insertions-based phylogeny of humans and great apes and demonstrate their differential power to discern ape subspecies and populations. Within this context, we find a good correlation between L1 diversity and single-nucleotide polymorphism heterozygosity (r(2) = 0.65) in contrast to Alu repeats, which show little correlation (r(2) = 0.07). We estimate that the "rate" of Alu retrotransposition has differed by a factor of 15-fold in these lineages. Humans, chimpanzees, and bonobos show the highest rates of Alu accumulation--the latter two since divergence 1.5 Mya. The L1 insertion rate, in contrast, has remained relatively constant, with rates differing by less than a factor of three. We conclude that Alu retrotransposition has been the most variable form of genetic variation during recent human-great ape evolution, with increases and decreases occurring over very short periods of evolutionary time.

  7. Insertion and deletion polymorphisms of the ancient AluS family in the human genome.

    PubMed

    Kryatova, Maria S; Steranka, Jared P; Burns, Kathleen H; Payer, Lindsay M

    2017-01-01

    Polymorphic Alu elements account for 17% of structural variants in the human genome. The majority of these belong to the youngest AluY subfamilies, and most structural variant discovery efforts have focused on identifying Alu polymorphisms from these currently retrotranspositionally active subfamilies. In this report we analyze polymorphisms from the evolutionarily older AluS subfamily, whose peak activity was tens of millions of years ago. We annotate the AluS polymorphisms, assess their likely mechanism of origin, and evaluate their contribution to structural variation in the human genome. Of 52 previously reported polymorphic AluS elements ascertained for this study, 48 were confirmed to belong to the AluS subfamily using high stringency subfamily classification criteria. Of these, the majority (77%, 37/48) appear to be deletion polymorphisms. Two polymorphic AluS elements (4%) have features of non-classical Alu insertions and one polymorphic AluS element (2%) likely inserted by a mechanism involving internal priming. Seven AluS polymorphisms (15%) appear to have arisen by the classical target-primed reverse transcription (TPRT) retrotransposition mechanism. These seven TPRT products are 3' intact with 3' poly-A tails, and are flanked by target site duplications; L1 ORF2p endonuclease cleavage sites were also observed, providing additional evidence that these are L1 ORF2p endonuclease-mediated TPRT insertions. Further sequence analysis showed strong conservation of both the RNA polymerase III promoter and SRP9/14 binding sites, important for mediating transcription and interaction with retrotransposition machinery, respectively. This conservation of functional features implies that some of these are fairly recent insertions since they have not diverged significantly from their respective retrotranspositionally competent source elements. Of the polymorphic AluS elements evaluated in this report, 15% (7/48) have features consistent with TPRT-mediated insertion, thus suggesting that some AluS elements have been more active recently than previously thought, or that fixation of AluS insertion alleles remains incomplete. These data expand the potential significance of polymorphic AluS elements in contributing to structural variation in the human genome. Future discovery efforts focusing on polymorphic AluS elements are likely to identify more such polymorphisms, and approaches tailored to identify deletion alleles may be warranted.

  8. A genomic library-based amplification approach (GL-PCR) for the mapping of multiple IS6110 insertion sites and strain differentiation of Mycobacterium tuberculosis.

    PubMed

    Namouchi, Amine; Mardassi, Helmi

    2006-11-01

    Evidence suggests that insertion of the IS6110 element is not without consequence to the biology of Mycobacterium tuberculosis complex strains. Thus, mapping of multiple IS6110 insertion sites in the genome of biomedically relevant clinical isolates would result in a better understanding of the role of this mobile element, particularly with regard to transmission, adaptability and virulence. In the present paper, we describe a versatile strategy, referred to as GL-PCR, that amplifies IS6110-flanking sequences based on the construction of a genomic library. M. tuberculosis chromosomal DNA is fully digested with HincII and then ligated into a plasmid vector between T7 and T3 promoter sequences. The ligation reaction product is transformed into Escherichia coli and selective PCR amplification targeting both 5' and 3' IS6110-flanking sequences are performed on the plasmid library DNA. For this purpose, four separate PCR reactions are performed, each combining an outward primer specific for one IS6110 end with either T7 or T3 primer. Determination of the nucleotide sequence of the PCR products generated from a single ligation reaction allowed mapping of 21 out of the 24 IS6110 copies of two 12 banded M. tuberculosis strains, yielding an overall sensitivity of 87,5%. Furthermore, by simply comparing the migration pattern of GL-PCR-generated products, the strategy proved to be as valuable as IS6110 RFLP for molecular typing of M. tuberculosis complex strains. Importantly, GL-PCR was able to discriminate between strains differing by a single IS6110 band.

  9. Physiological impact of transposable elements encoding DDE transposases in the environmental adaptation of Streptococcus agalactiae.

    PubMed

    Fléchard, Maud; Gilot, Philippe

    2014-07-01

    We have referenced and described Streptococcus agalactiae transposable elements encoding DDE transposases. These elements belonged to nine families of insertion sequences (ISs) and to a family of conjugative transposons (TnGBSs). An overview of the physiological impact of the insertion of all these elements is provided. DDE-transposable elements affect S. agalactiae in a number of aspects of its capability to adapt to various environments and modulate the expression of several virulence genes, the scpB-lmB genomic region and the genes involved in capsule expression and haemolysin transport being the targets of several different mobile elements. The referenced mobile elements modify S. agalactiae behaviour by transferring new gene(s) to its genome, by modifying the expression of neighbouring genes at the integration site or by promoting genomic rearrangements. Transposition of some of these elements occurs in vivo, suggesting that by dynamically regulating some adaptation and/or virulence genes, they improve the ability of S. agalactiae to reach different niches within its host and ensure the 'success' of the infectious process. © 2014 The Authors.

  10. Mechanism for DNA transposons to generate introns on genomic scales

    PubMed Central

    Huff, Jason T.; Zilberman, Daniel; Roy, Scott W.

    2017-01-01

    Discovered four decades ago, the existence of introns was one of the most unexpected findings in molecular biology1. Introns are sequences interrupting genes that must be removed as part of mRNA production. Genome sequencing projects have documented that most eukaryotic genes contain at least one and frequently many introns2,3. Comparison of these genomes reveals a history of long evolutionary periods with little intron gain punctuated by episodes of rapid, extensive gain2,3. However, no detailed mechanism for such episodic intron generation has been empirically supported on a sufficient scale, despite several proposals4–8. Here we show how short non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from gene sequence duplicated upon transposon insertion, allowing perfect splicing out of RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between preexisting nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases2 and prevalence of nucleosome-sized exons9 observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism plausibly accounting for episodes of rapid, extensive intron gain during eukaryotic evolution2,3. PMID:27760113

  11. New Insights into the Classification and Integration Specificity of Streptococcus Integrative Conjugative Elements through Extensive Genome Exploration

    PubMed Central

    Ambroset, Chloé; Coluzzi, Charles; Guédon, Gérard; Devignes, Marie-Dominique; Loux, Valentin; Lacroix, Thomas; Payot, Sophie; Leblond-Bourget, Nathalie

    2016-01-01

    Recent genome analyses suggest that integrative and conjugative elements (ICEs) are widespread in bacterial genomes and therefore play an essential role in horizontal transfer. However, only a few of these elements are precisely characterized and correctly delineated within sequenced bacterial genomes. Even though previous analysis showed the presence of ICEs in some species of Streptococci, the global prevalence and diversity of ICEs was not analyzed in this genus. In this study, we searched for ICEs in the completely sequenced genomes of 124 strains belonging to 27 streptococcal species. These exhaustive analyses revealed 105 putative ICEs and 26 slightly decayed elements whose limits were assessed and whose insertion site was identified. These ICEs were grouped in seven distinct unrelated or distantly related families, according to their conjugation modules. Integration of these streptococcal ICEs is catalyzed either by a site-specific tyrosine integrase, a low-specificity tyrosine integrase, a site-specific single serine integrase, a triplet of site-specific serine integrases or a DDE transposase. Analysis of their integration site led to the detection of 18 target-genes for streptococcal ICE insertion including eight that had not been identified previously (ftsK, guaA, lysS, mutT, rpmG, rpsI, traG, and ebfC). It also suggests that all specificities have evolved to minimize the impact of the insertion on the host. This overall analysis of streptococcal ICEs emphasizes their prevalence and diversity and demonstrates that exchanges or acquisitions of conjugation and recombination modules are frequent. PMID:26779141

  12. New Insights into the Classification and Integration Specificity of Streptococcus Integrative Conjugative Elements through Extensive Genome Exploration.

    PubMed

    Ambroset, Chloé; Coluzzi, Charles; Guédon, Gérard; Devignes, Marie-Dominique; Loux, Valentin; Lacroix, Thomas; Payot, Sophie; Leblond-Bourget, Nathalie

    2015-01-01

    Recent genome analyses suggest that integrative and conjugative elements (ICEs) are widespread in bacterial genomes and therefore play an essential role in horizontal transfer. However, only a few of these elements are precisely characterized and correctly delineated within sequenced bacterial genomes. Even though previous analysis showed the presence of ICEs in some species of Streptococci, the global prevalence and diversity of ICEs was not analyzed in this genus. In this study, we searched for ICEs in the completely sequenced genomes of 124 strains belonging to 27 streptococcal species. These exhaustive analyses revealed 105 putative ICEs and 26 slightly decayed elements whose limits were assessed and whose insertion site was identified. These ICEs were grouped in seven distinct unrelated or distantly related families, according to their conjugation modules. Integration of these streptococcal ICEs is catalyzed either by a site-specific tyrosine integrase, a low-specificity tyrosine integrase, a site-specific single serine integrase, a triplet of site-specific serine integrases or a DDE transposase. Analysis of their integration site led to the detection of 18 target-genes for streptococcal ICE insertion including eight that had not been identified previously (ftsK, guaA, lysS, mutT, rpmG, rpsI, traG, and ebfC). It also suggests that all specificities have evolved to minimize the impact of the insertion on the host. This overall analysis of streptococcal ICEs emphasizes their prevalence and diversity and demonstrates that exchanges or acquisitions of conjugation and recombination modules are frequent.

  13. The American cranberry mitochondrial genome reveals the presence of selenocysteine (tRNA-Sec and SECIS) insertion machinery in land plants.

    PubMed

    Fajardo, Diego; Schlautman, Brandon; Steffan, Shawn; Polashock, James; Vorsa, Nicholi; Zalapa, Juan

    2014-02-25

    This is the first de novo assembly and annotation of a complete mitochondrial genome in the Ericales order from the American cranberry (Vaccinium macrocarpon Ait.). Moreover, only four complete Asterid mitochondrial genomes have been made publicly available. The cranberry mitochondrial genome was assembled and reconstructed from whole genome 454 Roche GS-FLX and Illumina shotgun sequences. Compared with other Asterids, the reconstruction of the genome revealed an average size mitochondrion (459,678 nt) with relatively little repetitive sequences and DNA of plastid origin. The complete mitochondrial genome of cranberry was annotated obtaining a total of 34 genes classified based on their putative function, plus three ribosomal RNAs, and 17 transfer RNAs. Maternal organellar cranberry inheritance was inferred by analyzing gene variation in the cranberry mitochondria and plastid genomes. The annotation of cranberry mitochondrial genome revealed the presence of two copies of tRNA-Sec and a selenocysteine insertion sequence (SECIS) element which were lost in plants during evolution. This is the first report of a land plant possessing selenocysteine insertion machinery at the sequence level. Published by Elsevier B.V.

  14. Intracisternal A-Particle Element Transposition into the Murine β-Glucuronidase Gene Correlates with Loss of Enzyme Activity: a New Model for β-Glucuronidase Deficiency in the C3H Mouse†

    PubMed Central

    Gwynn, Babette; Lueders, Kira; Sands, Mark S.; Birkenmeier, Edward H.

    1998-01-01

    The severity of human mucopolysaccharidosis type VII (MPS VII), or Sly syndrome, depends on the relative activity of the enzyme β-glucuronidase. Loss of β-glucuronidase activity can cause hydrops fetalis, with in utero or postnatal death of the patient. In this report, we show that β-glucuronidase activity is not detectable by a standard fluorometric assay in C3H/HeOuJ (C3H) mice homozygous for a new mutation, gusmps2J. These gusmps2J/gusmps2J mice are born and survive much longer than the previously characterized β-glucuronidase-null B6.C-H-2bm1/ByBir-gusmps (gusmps/gusmps) mice. Northern blot analysis of liver from gusmps2J/gusmps2J mice demonstrates a 750-bp reduction in size of β-glucuronidase mRNA. A 5.4-kb insertion in the Gus-sh nucleotide sequence from these mice was localized by Southern blot analysis to intron 8. The ends of the inserted sequences were cloned by inverse PCR and revealed an intracisternal A-particle (IAP) element inserted near the 3′ end of the intron. The sequence of the long terminal repeat (LTR) regions of the IAP most closely matches that of a composite LTR found in transposed IAPs previously identified in the C3H strain. The inserted IAP may contribute to diminished β-glucuronidase activity either by interfering with transcription or by destabilizing the message. The resulting phenotype is much less severe than that previously described in the gusmps/gusmps mouse and provides an opportunity to study MPS VII on a genetic background that clearly modulates disease severity. PMID:9774663

  15. Persistence and Epidemic Propagation of a Pseudomonas aeruginosa Sequence Type 235 Clone Harboring an IS26 Composite Transposon Carrying the blaIMP-1 Integron in Hiroshima, Japan, 2005 to 2012

    PubMed Central

    Shimizu, Wataru; Kayama, Shizuo; Kouda, Shuntaro; Ogura, Yoshitoshi; Kobayashi, Kanao; Shigemoto, Norifumi; Shimada, Norimitsu; Yano, Raita; Hisatsune, Junzo; Kato, Fuminori; Hayashi, Tetsuya; Sueda, Taijiro; Ohge, Hiroki

    2015-01-01

    A 9-year surveillance for multidrug-resistant (MDR) Pseudomonas aeruginosa in the Hiroshima region showed that the number of isolates harboring the metallo-β-lactamase gene blaIMP-1 abruptly increased after 2004, recorded the highest peak in 2006, and showed a tendency to decline afterwards, indicating a history of an epidemic. PCR mapping of the variable regions of the integrons showed that this epidemic was caused by the clonal persistence and propagation of an MDR P. aeruginosa strain harboring the blaIMP-1 gene and an aminoglycoside 6′-N-acetyltransferase gene, aac(6′)-Iae in a class I integron (In113), whose integrase gene intl1 was disrupted by an IS26 insertion. Sequence analysis of the representative strain PA058447 resistance element containing the In113-derived gene cassette array showed that the element forms an IS26 transposon embedded in the chromosome. It has a Tn21 backbone and is composed of two segments sandwiched by three IS26s. In Japan, clonal nationwide expansion of an MDR P. aeruginosa NCGM2.S1 harboring chromosomally encoded In113 with intact intl1 is reported. Multilocus sequence typing and genomic comparison strongly suggest that PA058447 and NCGM2.S1 belong to the same clonal lineage. Moreover, the structures of the resistance element in the two strains are very similar, but the sites of insertion into the chromosome are different. Based on tagging information of the IS26 present in both resistance elements, we suggest that the MDR P. aeruginosa clone causing the epidemic in Hiroshima for the past 9 years originated from a common ancestor genome of PA058447 and NCGM2.S1 through an IS26 insertion into intl1 of In113 and through IS26-mediated genomic rearrangements. PMID:25712351

  16. DNA transposon activity is associated with increased mutation rates in genes of rice and other grasses

    PubMed Central

    Wicker, Thomas; Yu, Yeisoo; Haberer, Georg; Mayer, Klaus F. X.; Marri, Pradeep Reddy; Rounsley, Steve; Chen, Mingsheng; Zuccolo, Andrea; Panaud, Olivier; Wing, Rod A.; Roffler, Stefan

    2016-01-01

    DNA (class 2) transposons are mobile genetic elements which move within their ‘host' genome through excising and re-inserting elsewhere. Although the rice genome contains tens of thousands of such elements, their actual role in evolution is still unclear. Analysing over 650 transposon polymorphisms in the rice species Oryza sativa and Oryza glaberrima, we find that DNA repair following transposon excisions is associated with an increased number of mutations in the sequences neighbouring the transposon. Indeed, the 3,000 bp flanking the excised transposons can contain over 10 times more mutations than the genome-wide average. Since DNA transposons preferably insert near genes, this is correlated with increases in mutation rates in coding sequences and regulatory regions. Most importantly, we find this phenomenon also in maize, wheat and barley. Thus, these findings suggest that DNA transposon activity is a major evolutionary force in grasses which provide the basis of most food consumed by humankind. PMID:27599761

  17. Determination of the Optimal Chromosomal Location(s) for a DNA Element in Escherichia coli Using a Novel Transposon-mediated Approach.

    PubMed

    Frimodt-Møller, Jakob; Charbon, Godefroid; Krogfelt, Karen A; Løbner-Olesen, Anders

    2017-09-11

    The optimal chromosomal position(s) of a given DNA element was/were determined by transposon-mediated random insertion followed by fitness selection. In bacteria, the impact of the genetic context on the function of a genetic element can be difficult to assess. Several mechanisms, including topological effects, transcriptional interference from neighboring genes, and/or replication-associated gene dosage, may affect the function of a given genetic element. Here, we describe a method that permits the random integration of a DNA element into the chromosome of Escherichia coli and select the most favorable locations using a simple growth competition experiment. The method takes advantage of a well-described transposon-based system of random insertion, coupled with a selection of the fittest clone(s) by growth advantage, a procedure that is easily adjustable to experimental needs. The nature of the fittest clone(s) can be determined by whole-genome sequencing on a complex multi-clonal population or by easy gene walking for the rapid identification of selected clones. Here, the non-coding DNA region DARS2, which controls the initiation of chromosome replication in E. coli, was used as an example. The function of DARS2 is known to be affected by replication-associated gene dosage; the closer DARS2 gets to the origin of DNA replication, the more active it becomes. DARS2 was randomly inserted into the chromosome of a DARS2-deleted strain. The resultant clones containing individual insertions were pooled and competed against one another for hundreds of generations. Finally, the fittest clones were characterized and found to contain DARS2 inserted in close proximity to the original DARS2 location.

  18. Genome-Wide Comparison of Magnaporthe Species Reveals a Host-Specific Pattern of Secretory Proteins and Transposable Elements

    PubMed Central

    Gowda, Malali

    2016-01-01

    Blast disease caused by the Magnaporthe species is a major factor affecting the productivity of rice, wheat and millets. This study was aimed at generating genomic information for rice and non-rice Magnaporthe isolates to understand the extent of genetic variation. We have sequenced the whole genome of the Magnaporthe isolates, infecting rice (leaf and neck), finger millet (leaf and neck), foxtail millet (leaf) and buffel grass (leaf). Rice and finger millet isolates infecting both leaf and neck tissues were sequenced, since the damage and yield loss caused due to neck blast is much higher as compared to leaf blast. The genome-wide comparison was carried out to study the variability in gene content, candidate effectors, repeat element distribution, genes involved in carbohydrate metabolism and SNPs. The analysis of repeat element footprints revealed some genes such as naringenin, 2-oxoglutarate 3-dioxygenase being targeted by Pot2 and Occan, in isolates from different host species. Some repeat insertions were host-specific while other insertions were randomly shared between isolates. The distributions of repeat elements, secretory proteins, CAZymes and SNPs showed significant variation across host-specific lineages of Magnaporthe indicating an independent genome evolution orchestrated by multiple genomic factors. PMID:27658241

  19. Premature terminator analysis sheds light on a hidden world of bacterial transcriptional attenuation.

    PubMed

    Naville, Magali; Gautheret, Daniel

    2010-01-01

    Bacterial transcription attenuation occurs through a variety of cis-regulatory elements that control gene expression in response to a wide range of signals. The signal-sensing structures in attenuators are so diverse and rapidly evolving that only a small fraction have been properly annotated and characterized to date. Here we apply a broad-spectrum detection tool in order to achieve a more complete view of the transcriptional attenuation complement of key bacterial species. Our protocol seeks gene families with an unusual frequency of 5' terminators found across multiple species. Many of the detected attenuators are part of annotated elements, such as riboswitches or T-boxes, which often operate through transcriptional attenuation. However, a significant fraction of candidates were not previously characterized in spite of their unmistakable footprint. We further characterized some of these new elements using sequence and secondary structure analysis. We also present elements that may control the expression of several non-homologous genes, suggesting co-transcription and response to common signals. An important class of such elements, which we called mobile attenuators, is provided by 3' terminators of insertion sequences or prophages that may be exapted as 5' regulators when inserted directly upstream of a cellular gene. We show here that attenuators involve a complex landscape of signal-detection structures spanning the entire bacterial domain. We discuss possible scenarios through which these diverse 5' regulatory structures may arise or evolve.

  20. CRISPR-based screening of genomic island excision events in bacteria.

    PubMed

    Selle, Kurt; Klaenhammer, Todd R; Barrangou, Rodolphe

    2015-06-30

    Genomic analysis of Streptococcus thermophilus revealed that mobile genetic elements (MGEs) likely contributed to gene acquisition and loss during evolutionary adaptation to milk. Clustered regularly interspaced short palindromic repeats-CRISPR-associated genes (CRISPR-Cas), the adaptive immune system in bacteria, limits genetic diversity by targeting MGEs including bacteriophages, transposons, and plasmids. CRISPR-Cas systems are widespread in streptococci, suggesting that the interplay between CRISPR-Cas systems and MGEs is one of the driving forces governing genome homeostasis in this genus. To investigate the genetic outcomes resulting from CRISPR-Cas targeting of integrated MGEs, in silico prediction revealed four genomic islands without essential genes in lengths from 8 to 102 kbp, totaling 7% of the genome. In this study, the endogenous CRISPR3 type II system was programmed to target the four islands independently through plasmid-based expression of engineered CRISPR arrays. Targeting lacZ within the largest 102-kbp genomic island was lethal to wild-type cells and resulted in a reduction of up to 2.5-log in the surviving population. Genotyping of Lac(-) survivors revealed variable deletion events between the flanking insertion-sequence elements, all resulting in elimination of the Lac-encoding island. Chimeric insertion sequence footprints were observed at the deletion junctions after targeting all of the four genomic islands, suggesting a common mechanism of deletion via recombination between flanking insertion sequences. These results established that self-targeting CRISPR-Cas systems may direct significant evolution of bacterial genomes on a population level, influencing genome homeostasis and remodeling.

  1. Structure and Expression of Hybrid Dysgenesis-Induced Alleles of the Ovarian Tumor (Otu) Gene in Drosophila Melanogaster

    PubMed Central

    Sass, G. L.; Mohler, J. D.; Walsh, R. C.; Kalfayan, L. J.; Searles, L. L.

    1993-01-01

    Mutations at the ovarian tumor (otu) gene of Drosophila melanogaster cause female sterility and generate a range of ovarian phenotypes. Quiescent (QUI) mutants exhibit reduced germ cell proliferation; in oncogenic (ONC) mutants germ cells undergo uncontrolled proliferation generating excessive numbers of undifferentiated cells; the egg chambers of differentiated (DIF) mutants differentiate to variable degrees but fail to complete oogenesis. We have examined mutations caused by insertion and deletion of P elements at the otu gene. The P element insertion sites are upstream of the major otu transcription start sites. In deletion derivatives, the P element, regulatory regions and/or protein coding sequences have been removed. In both insertion and deletion mutants, the level of otu expression correlates directly with the severity of the phenotype: the absence of otu function produces the most severe QUI phenotype while the ONC mutants express lower levels of otu than those which are DIF. The results of this study demonstrate that the diverse mutant phenotypes of otu are the consequence of different levels of otu function. PMID:8436274

  2. Seedling lethality in Nicotiana plumbaginifolia conferred by Ds transposable element insertion into a plant-specific gene.

    PubMed

    Majira, Amel; Domin, Monique; Grandjean, Olivier; Gofron, Krystyna; Houba-Hérin, Nicole

    2002-10-01

    A seedling lethal mutant of Nicotiana plumbaginifolia (sdl-1) was isolated by transposon tagging using a maize Dissociation (Ds) element. The insertion mutation was produced by direct co-transformation of protoplasts with two plasmids: one containing Ds and a second with an Ac transposase gene. sdl-1 seedlings exhibit several phenotypes: swollen organs, short hypocotyls in light and dark conditions, and enlarged and multinucleated cells, that altogether suggest cell growth defects. Mutant cells are able to proliferate under in vitro culture conditions. Genomic DNA sequences bordering the transposon were used to recover cDNA from the normal allele. Complementation of the mutant phenotype with the cDNA confirmed that the transposon had caused the mutation. The Ds element was inserted into the first exon of the open reading frame and the homozygous mutant lacked detectable transcript. Phenocopies of the mutant were obtained by an antisense approach. SDL-1 encodes a novel protein found in several plant genomes but apparently missingfrom animal and fungal genomes; the protein is highly conserved and has a potential plastid targeting motif.

  3. Interfamilial recombination between viruses led to acquisition of a novel translation-enhancing RNA element that allows resistance breaking

    PubMed Central

    Miras, Manuel; Sempere, Raquel N.; Kraft, Jelena J.; Miller, W. Allen; Aranda, Miguel A.; Truniger, Veronica

    2015-01-01

    Summary Many plant viruses depend on functional RNA elements, called 3′-UTR cap-independent translation enhancers (3′-CITEs), for translation of their RNAs. In this manuscript we provide direct proof for the existing hypothesis that 3′-CITEs are modular and transferable by recombination in nature, and that this is associated with an advantage for the created virus. By characterizing a newly identified Melon necrotic spot virus (MNSV; Tombusviridae) isolate, which is able to overcome eukaryotic translation initiation factor 4E (eIF4E)-mediated resistance, we found that it contains a 55 nucleotide insertion in its 3′-UTR. We provide strong evidence that this insertion was acquired by interfamilial recombination with the 3′-UTR of an Asiatic Cucurbit aphid-borne yellows virus (CABYV; Luteoviridae). By constructing chimeric viruses, we showed that this recombined sequence is responsible for resistance breaking. Analysis of the translational efficiency of reporter constructs showed that this sequence functions as a novel 3′-CITE in both resistant and susceptible plants, being essential for translation control in resistant plants. In conclusion, we showed that a recombination event between two clearly identified viruses from different families led to the transfer of exactly the sequence corresponding to a functional RNA element, giving rise to a new isolate with the capacity to infect an otherwise non-susceptible host. PMID:24372390

  4. Interfamilial recombination between viruses led to acquisition of a novel translation-enhancing RNA element that allows resistance breaking.

    PubMed

    Miras, Manuel; Sempere, Raquel N; Kraft, Jelena J; Miller, W Allen; Aranda, Miguel A; Truniger, Veronica

    2014-04-01

    Many plant viruses depend on functional RNA elements, called 3'-UTR cap-independent translation enhancers (3'-CITEs), for translation of their RNAs. In this manuscript we provide direct proof for the existing hypothesis that 3'-CITEs are modular and transferable by recombination in nature, and that this is associated with an advantage for the created virus. By characterizing a newly identified Melon necrotic spot virus (MNSV; Tombusviridae) isolate, which is able to overcome eukaryotic translation initiation factor 4E (eIF4E)-mediated resistance, we found that it contains a 55 nucleotide insertion in its 3'-UTR. We provide strong evidence that this insertion was acquired by interfamilial recombination with the 3'-UTR of an Asiatic Cucurbit aphid-borne yellows virus (CABYV; Luteoviridae). By constructing chimeric viruses, we showed that this recombined sequence is responsible for resistance breaking. Analysis of the translational efficiency of reporter constructs showed that this sequence functions as a novel 3'-CITE in both resistant and susceptible plants, being essential for translation control in resistant plants. In conclusion, we showed that a recombination event between two clearly identified viruses from different families led to the transfer of exactly the sequence corresponding to a functional RNA element, giving rise to a new isolate with the capacity to infect an otherwise nonsusceptible host. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.

  5. Lineage-specific expansions of retroviral insertions within the genomes of African great apes but not humans and orangutans.

    PubMed

    Yohn, Chris T; Jiang, Zhaoshi; McGrath, Sean D; Hayden, Karen E; Khaitovich, Philipp; Johnson, Matthew E; Eichler, Marla Y; McPherson, John D; Zhao, Shaying; Pääbo, Svante; Eichler, Evan E

    2005-04-01

    Retroviral infections of the germline have the potential to episodically alter gene function and genome structure during the course of evolution. Horizontal transmissions between species have been proposed, but little evidence exists for such events in the human/great ape lineage of evolution. Based on analysis of finished BAC chimpanzee genome sequence, we characterize a retroviral element (Pan troglodytes endogenous retrovirus 1 [PTERV1]) that has become integrated in the germline of African great ape and Old World monkey species but is absent from humans and Asian ape genomes. We unambiguously map 287 retroviral integration sites and determine that approximately 95.8% of the insertions occur at non-orthologous regions between closely related species. Phylogenetic analysis of the endogenous retrovirus reveals that the gorilla and chimpanzee elements share a monophyletic origin with a subset of the Old World monkey retroviral elements, but that the average sequence divergence exceeds neutral expectation for a strictly nuclear inherited DNA molecule. Within the chimpanzee, there is a significant integration bias against genes, with only 14 of these insertions mapping within intronic regions. Six out of ten of these genes, for which there are expression data, show significant differences in transcript expression between human and chimpanzee. Our data are consistent with a retroviral infection that bombarded the genomes of chimpanzees and gorillas independently and concurrently, 3-4 million years ago. We speculate on the potential impact of such recent events on the evolution of humans and great apes.

  6. Recognition of the CDEI motif GTCACATG by mouse nuclear proteins and interference with the early development of the mouse embryo.

    PubMed Central

    Blangy, A; Léopold, P; Vidal, F; Rassoulzadegan, M; Cuzin, F

    1991-01-01

    We have reported previously (1) two unexpected consequences of the microinjection into fertilized mouse eggs of a recombinant plasmid designated p12B1, carrying a 343 bp insert of non-repetitive mouse DNA. Injected at very low concentrations, this plasmid could be established as an extrachromosomal genetic element. When injected in greater concentration, an early arrest of embryonic development resulted. In the present work, we have studied this toxic effect in more detail by microinjecting short synthetic oligonucleotides with sequences from the mouse insert. Lethality was associated with the nucleotide sequence GTCACATG, identical with the CDEl element of yeast centromeres. Development of injected embryos was arrested between the one-cell and the early morula stages, with abnormal structures and DNA contents. Electrophoretic mobility shift and DNAse foot-printing assays demonstrated the binding of mouse nuclear protein(s) to the CDEl-like box. Base changes within the CDEl sequence prevented both the toxic effects in embryos and the formation of protein complex in vitro, suggesting that protein binding at such sites in chromosomal DNA plays an important role in early development. Images PMID:1766880

  7. Evolution of Sphingomonad Gene Clusters Related to Pesticide Catabolism Revealed by Genome Sequence and Mobilomics of Sphingobium herbicidovorans MH.

    PubMed

    Nielsen, Tue Kjærgaard; Rasmussen, Morten; Demanèche, Sandrine; Cecillon, Sébastien; Vogel, Timothy M; Hansen, Lars Hestbjerg

    2017-09-01

    Bacterial degraders of chlorophenoxy herbicides have been isolated from various ecosystems, including pristine environments. Among these degraders, the sphingomonads constitute a prominent group that displays versatile xenobiotic-degradation capabilities. Four separate sequencing strategies were required to provide the complete sequence of the complex and plastic genome of the canonical chlorophenoxy herbicide-degrading Sphingobium herbicidovorans MH. The genome has an intricate organization of the chlorophenoxy-herbicide catabolic genes sdpA, rdpA, and cadABCD that encode the (R)- and (S)-enantiomer-specific 2,4-dichlorophenoxypropionate dioxygenases and four subunits of a Rieske non-heme iron oxygenase involved in 2-methyl-chlorophenoxyacetic acid degradation, respectively. Several major genomic rearrangements are proposed to help understand the evolution and mobility of these important genes and their genetic context. Single-strain mobilomic sequence analysis uncovered plasmids and insertion sequence-associated circular intermediates in this environmentally important bacterium and enabled the description of evolutionary models for pesticide degradation in strain MH and related organisms. The mobilome presented a complex mosaic of mobile genetic elements including four plasmids and several circular intermediate DNA molecules of insertion-sequence elements and transposons that are central to the evolution of xenobiotics degradation. Furthermore, two individual chromosomally integrated prophages were shown to excise and form free circular DNA molecules. This approach holds great potential for improving the understanding of genome plasticity, evolution, and microbial ecology. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  8. Dynamics and Differential Proliferation of Transposable Elements During the Evolution of the B and A Genomes of Wheat

    PubMed Central

    Charles, Mathieu; Belcram, Harry; Just, Jérémy; Huneau, Cécile; Viollet, Agnès; Couloux, Arnaud; Segurens, Béatrice; Carter, Meredith; Huteau, Virginie; Coriton, Olivier; Appels, Rudi; Samain, Sylvie; Chalhoub, Boulos

    2008-01-01

    Transposable elements (TEs) constitute >80% of the wheat genome but their dynamics and contribution to size variation and evolution of wheat genomes (Triticum and Aegilops species) remain unexplored. In this study, 10 genomic regions have been sequenced from wheat chromosome 3B and used to constitute, along with all publicly available genomic sequences of wheat, 1.98 Mb of sequence (from 13 BAC clones) of the wheat B genome and 3.63 Mb of sequence (from 19 BAC clones) of the wheat A genome. Analysis of TE sequence proportions (as percentages), ratios of complete to truncated copies, and estimation of insertion dates of class I retrotransposons showed that specific types of TEs have undergone waves of differential proliferation in the B and A genomes of wheat. While both genomes show similar rates and relatively ancient proliferation periods for the Athila retrotransposons, the Copia retrotransposons proliferated more recently in the A genome whereas Gypsy retrotransposon proliferation is more recent in the B genome. It was possible to estimate for the first time the proliferation periods of the abundant CACTA class II DNA transposons, relative to that of the three main retrotransposon superfamilies. Proliferation of these TEs started prior to and overlapped with that of the Athila retrotransposons in both genomes. However, they also proliferated during the same periods as Gypsy and Copia retrotransposons in the A genome, but not in the B genome. As estimated from their insertion dates and confirmed by PCR-based tracing analysis, the majority of differential proliferation of TEs in B and A genomes of wheat (87 and 83%, respectively), leading to rapid sequence divergence, occurred prior to the allotetraploidization event that brought them together in Triticum turgidum and Triticum aestivum, <0.5 million years ago. More importantly, the allotetraploidization event appears to have neither enhanced nor repressed retrotranspositions. We discuss the apparent proliferation of TEs as resulting from their insertion, removal, and/or combinations of both evolutionary forces. PMID:18780739

  9. Short Interspersed Nuclear Element (SINE) Sequences in the Genome of the Human Pathogenic Fungus Aspergillus fumigatus Af293

    PubMed Central

    Kanhayuwa, Lakkhana; Coutts, Robert H. A.

    2016-01-01

    Novel families of short interspersed nuclear element (SINE) sequences in the human pathogenic fungus Aspergillus fumigatus, clinical isolate Af293, were identified and categorised into tRNA-related and 5S rRNA-related SINEs. Eight predicted tRNA-related SINE families originating from different tRNAs, and nominated as AfuSINE2 sequences, contained target site duplications of short direct repeat sequences (4–14 bp) flanking the elements, an extended tRNA-unrelated region and typical features of RNA polymerase III promoter sequences. The elements ranged in size from 140–493 bp and were present in low copy number in the genome and five out of eight were actively transcribed. One putative tRNAArg-derived sequence, AfuSINE2-1a possessed a unique feature of repeated trinucleotide ACT residues at its 3’-terminus. This element was similar in sequence to the I-4_AO element found in A. oryzae and an I-1_AF long nuclear interspersed element-like sequence identified in A. fumigatus Af293. Families of 5S rRNA-related SINE sequences, nominated as AfuSINE3, were also identified and their 5'-5S rRNA-related regions show 50–65% and 60–75% similarity to respectively A. fumigatus 5S rRNAs and SINE3-1_AO found in A. oryzae. A. fumigatus Af293 contains five copies of AfuSINE3 sequences ranging in size from 259–343 bp and two out of five AfuSINE3 sequences were actively transcribed. Investigations on AfuSINE distribution in the fungal genome revealed that the elements are enriched in pericentromeric and subtelomeric regions and inserted within gene-rich regions. We also demonstrated that some, but not all, AfuSINE sequences are targeted by host RNA silencing mechanisms. Finally, we demonstrated that infection of the fungus with mycoviruses had no apparent effects on SINE activity. PMID:27736869

  10. Short Interspersed Nuclear Element (SINE) Sequences in the Genome of the Human Pathogenic Fungus Aspergillus fumigatus Af293.

    PubMed

    Kanhayuwa, Lakkhana; Coutts, Robert H A

    2016-01-01

    Novel families of short interspersed nuclear element (SINE) sequences in the human pathogenic fungus Aspergillus fumigatus, clinical isolate Af293, were identified and categorised into tRNA-related and 5S rRNA-related SINEs. Eight predicted tRNA-related SINE families originating from different tRNAs, and nominated as AfuSINE2 sequences, contained target site duplications of short direct repeat sequences (4-14 bp) flanking the elements, an extended tRNA-unrelated region and typical features of RNA polymerase III promoter sequences. The elements ranged in size from 140-493 bp and were present in low copy number in the genome and five out of eight were actively transcribed. One putative tRNAArg-derived sequence, AfuSINE2-1a possessed a unique feature of repeated trinucleotide ACT residues at its 3'-terminus. This element was similar in sequence to the I-4_AO element found in A. oryzae and an I-1_AF long nuclear interspersed element-like sequence identified in A. fumigatus Af293. Families of 5S rRNA-related SINE sequences, nominated as AfuSINE3, were also identified and their 5'-5S rRNA-related regions show 50-65% and 60-75% similarity to respectively A. fumigatus 5S rRNAs and SINE3-1_AO found in A. oryzae. A. fumigatus Af293 contains five copies of AfuSINE3 sequences ranging in size from 259-343 bp and two out of five AfuSINE3 sequences were actively transcribed. Investigations on AfuSINE distribution in the fungal genome revealed that the elements are enriched in pericentromeric and subtelomeric regions and inserted within gene-rich regions. We also demonstrated that some, but not all, AfuSINE sequences are targeted by host RNA silencing mechanisms. Finally, we demonstrated that infection of the fungus with mycoviruses had no apparent effects on SINE activity.

  11. Transposon integration enhances expression of stress response genes.

    PubMed

    Feng, Gang; Leem, Young-Eun; Levin, Henry L

    2013-01-01

    Transposable elements possess specific patterns of integration. The biological impact of these integration profiles is not well understood. Tf1, a long-terminal repeat retrotransposon in Schizosaccharomyces pombe, integrates into promoters with a preference for the promoters of stress response genes. To determine the biological significance of Tf1 integration, we took advantage of saturated maps of insertion activity and studied how integration at hot spots affected the expression of the adjacent genes. Our study revealed that Tf1 integration did not reduce gene expression. Importantly, the insertions activated the expression of 6 of 32 genes tested. We found that Tf1 increased gene expression by inserting enhancer activity. Interestingly, the enhancer activity of Tf1 could be limited by Abp1, a host surveillance factor that sequesters transposon sequences into structures containing histone deacetylases. We found the Tf1 promoter was activated by heat treatment and, remarkably, only genes that themselves were induced by heat could be activated by Tf1 integration, suggesting a synergy of Tf1 enhancer sequence with the stress response elements of target promoters. We propose that the integration preference of Tf1 for the promoters of stress response genes and the ability of Tf1 to enhance the expression of these genes co-evolved to promote the survival of cells under stress.

  12. Transposon integration enhances expression of stress response genes

    PubMed Central

    Feng, Gang; Leem, Young-Eun; Levin, Henry L.

    2013-01-01

    Transposable elements possess specific patterns of integration. The biological impact of these integration profiles is not well understood. Tf1, a long-terminal repeat retrotransposon in Schizosaccharomyces pombe, integrates into promoters with a preference for the promoters of stress response genes. To determine the biological significance of Tf1 integration, we took advantage of saturated maps of insertion activity and studied how integration at hot spots affected the expression of the adjacent genes. Our study revealed that Tf1 integration did not reduce gene expression. Importantly, the insertions activated the expression of 6 of 32 genes tested. We found that Tf1 increased gene expression by inserting enhancer activity. Interestingly, the enhancer activity of Tf1 could be limited by Abp1, a host surveillance factor that sequesters transposon sequences into structures containing histone deacetylases. We found the Tf1 promoter was activated by heat treatment and, remarkably, only genes that themselves were induced by heat could be activated by Tf1 integration, suggesting a synergy of Tf1 enhancer sequence with the stress response elements of target promoters. We propose that the integration preference of Tf1 for the promoters of stress response genes and the ability of Tf1 to enhance the expression of these genes co-evolved to promote the survival of cells under stress. PMID:23193295

  13. Rates and patterns of great ape retrotransposition

    PubMed Central

    Hormozdiari, Fereydoun; Konkel, Miriam K.; Prado-Martinez, Javier; Chiatante, Giorgia; Herraez, Irene Hernando; Walker, Jerilyn A.; Nelson, Benjamin; Alkan, Can; Sudmant, Peter H.; Huddleston, John; Catacchio, Claudia R.; Ko, Arthur; Malig, Maika; Baker, Carl; Genome Project, Great Ape; Marques-Bonet, Tomas; Ventura, Mario; Batzer, Mark A.; Eichler, Evan E.

    2013-01-01

    We analyzed 83 fully sequenced great ape genomes for mobile element insertions, predicting a total of 49,452 fixed and polymorphic Alu and long interspersed element 1 (L1) insertions not present in the human reference assembly and assigning each retrotransposition event to a different time point during great ape evolution. We used these homoplasy-free markers to construct a mobile element insertions-based phylogeny of humans and great apes and demonstrate their differential power to discern ape subspecies and populations. Within this context, we find a good correlation between L1 diversity and single-nucleotide polymorphism heterozygosity (r2 = 0.65) in contrast to Alu repeats, which show little correlation (r2 = 0.07). We estimate that the “rate” of Alu retrotransposition has differed by a factor of 15-fold in these lineages. Humans, chimpanzees, and bonobos show the highest rates of Alu accumulation—the latter two since divergence 1.5 Mya. The L1 insertion rate, in contrast, has remained relatively constant, with rates differing by less than a factor of three. We conclude that Alu retrotransposition has been the most variable form of genetic variation during recent human–great ape evolution, with increases and decreases occurring over very short periods of evolutionary time. PMID:23884656

  14. The novel conjugative transposon tn1207.3 carries the macrolide efflux gene mef(A) in Streptococcus pyogenes.

    PubMed

    Santagati, Maria; Iannelli, Francesco; Cascone, Carmela; Campanile, Floriana; Oggioni, Marco R; Stefani, Stefania; Pozzi, Gianni

    2003-01-01

    The macrolide efflux gene mef(A) of the Streptococcus pyogenes clinical strain 2812A was found to be carried by a 52-kb chromosomal genetic element that could be transferred by conjugation to the chromosome of other streptococcal species. The characteristics of this genetic element are typical of conjugative transposons and was named Tn1207.3. The size of Tn1207.3 was established by pulsed-field gel electrophoresis (PFGE), and DNA sequencing analysis showed that the 7,244 bp at the left end of Tn1207.3 were identical to those of the pneumococcal Tn1207.1 element. Tn1207.3-like genetic elements were found to be inserted at a single specific chromosomal site in 12 different clinical isolates S. pyogenes exhibiting the M phenotype of resistance to macrolides and carrying the mef(A) gene. Tn1207.3 was transferred from S. pyogenes 2812A to Streptococcus pneumoniae, and sequence analysis carried out on six independent transconjugants showed that insertion of Tn1207.3 in the pneumococcal genome always occurred at a single specific site as in Tn1207.1. Using MF2, a representative S. pneumoniae transconjugant, as a donor, Tn1207.3 was transferred again by conjugation to S. pyogenes and Streptococcus gordonii. The previously described nonconjugative element Tn1207.1 of S. pneumoniae appears to be a defective element, part of a longer conjugative transposon that carries mef(A) and is found in clinical isolates of S. pyogenes.

  15. Insertion sequence diversity in archaea.

    PubMed

    Filée, J; Siguier, P; Chandler, M

    2007-03-01

    Insertion sequences (ISs) can constitute an important component of prokaryotic (bacterial and archaeal) genomes. Over 1,500 individual ISs are included at present in the ISfinder database (www-is.biotoul.fr), and these represent only a small portion of those in the available prokaryotic genome sequences and those that are being discovered in ongoing sequencing projects. In spite of this diversity, the transposition mechanisms of only a few of these ubiquitous mobile genetic elements are known, and these are all restricted to those present in bacteria. This review presents an overview of ISs within the archaeal kingdom. We first provide a general historical summary of the known properties and behaviors of archaeal ISs. We then consider how transposition might be regulated in some cases by small antisense RNAs and by termination codon readthrough. This is followed by an extensive analysis of the IS content in the sequenced archaeal genomes present in the public databases as of June 2006, which provides an overview of their distribution among the major archaeal classes and species. We show that the diversity of archaeal ISs is very great and comparable to that of bacteria. We compare archaeal ISs to known bacterial ISs and find that most are clearly members of families first described for bacteria. Several cases of lateral gene transfer between bacteria and archaea are clearly documented, notably for methanogenic archaea. However, several archaeal ISs do not have bacterial equivalents but can be grouped into Archaea-specific groups or families. In addition to ISs, we identify and list nonautonomous IS-derived elements, such as miniature inverted-repeat transposable elements. Finally, we present a possible scenario for the evolutionary history of ISs in the Archaea.

  16. Transposon variation by order during allopolyploidisation between Brassica oleracea and Brassica rapa.

    PubMed

    An, Z; Tang, Z; Ma, B; Mason, A S; Guo, Y; Yin, J; Gao, C; Wei, L; Li, J; Fu, D

    2014-07-01

    Although many studies have shown that transposable element (TE) activation is induced by hybridisation and polyploidisation in plants, much less is known on how different types of TE respond to hybridisation, and the impact of TE-associated sequences on gene function. We investigated the frequency and regularity of putative transposon activation for different types of TE, and determined the impact of TE-associated sequence variation on the genome during allopolyploidisation. We designed different types of TE primers and adopted the Inter-Retrotransposon Amplified Polymorphism (IRAP) method to detect variation in TE-associated sequences during the process of allopolyploidisation between Brassica rapa (AA) and Brassica oleracea (CC), and in successive generations of self-pollinated progeny. In addition, fragments with TE insertions were used to perform Blast2GO analysis to characterise the putative functions of the fragments with TE insertions. Ninety-two primers amplifying 548 loci were used to detect variation in sequences associated with four different orders of TE sequences. TEs could be classed in ascending frequency into LTR-REs, TIRs, LINEs, SINEs and unknown TEs. The frequency of novel variation (putative activation) detected for the four orders of TEs was highest from the F1 to F2 generations, and lowest from the F2 to F3 generations. Functional annotation of sequences with TE insertions showed that genes with TE insertions were mainly involved in metabolic processes and binding, and preferentially functioned in organelles. TE variation in our study severely disturbed the genetic compositions of the different generations, resulting in inconsistencies in genetic clustering. Different types of TE showed different patterns of variation during the process of allopolyploidisation. © 2013 German Botanical Society and The Royal Botanical Society of the Netherlands.

  17. Identification, variation and transcription of pneumococcal repeat sequences

    PubMed Central

    2011-01-01

    Background Small interspersed repeats are commonly found in many bacterial chromosomes. Two families of repeats (BOX and RUP) have previously been identified in the genome of Streptococcus pneumoniae, a nasopharyngeal commensal and respiratory pathogen of humans. However, little is known about the role they play in pneumococcal genetics. Results Analysis of the genome of S. pneumoniae ATCC 700669 revealed the presence of a third repeat family, which we have named SPRITE. All three repeats are present at a reduced density in the genome of the closely related species S. mitis. However, they are almost entirely absent from all other streptococci, although a set of elements related to the pneumococcal BOX repeat was identified in the zoonotic pathogen S. suis. In conjunction with information regarding their distribution within the pneumococcal chromosome, this suggests that it is unlikely that these repeats are specialised sequences performing a particular role for the host, but rather that they constitute parasitic elements. However, comparing insertion sites between pneumococcal sequences indicates that they appear to transpose at a much lower rate than IS elements. Some large BOX elements in S. pneumoniae were found to encode open reading frames on both strands of the genome, whilst another was found to form a composite RNA structure with two T box riboswitches. In multiple cases, such BOX elements were demonstrated as being expressed using directional RNA-seq and RT-PCR. Conclusions BOX, RUP and SPRITE repeats appear to have proliferated extensively throughout the pneumococcal chromosome during the species' past, but novel insertions are currently occurring at a relatively slow rate. Through their extensive secondary structures, they seem likely to affect the expression of genes with which they are co-transcribed. Software for annotation of these repeats is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/strep_repeats/. PMID:21333003

  18. Occurrence of Can-SINEs and intron sequence evolution supports robust phylogeny of pinniped carnivores and their terrestrial relatives.

    PubMed

    Schröder, Christiane; Bleidorn, Christoph; Hartmann, Stefanie; Tiedemann, Ralph

    2009-12-15

    Investigating the dog genome we found 178965 introns with a moderate length of 200-1000 bp. A screening of these sequences against 23 different repeat libraries to find insertions of short interspersed elements (SINEs) detected 45276 SINEs. Virtually all of these SINEs (98%) belong to the tRNA-derived Can-SINE family. Can-SINEs arose about 55 million years ago before Carnivora split into two basal groups, the Caniformia (dog-like carnivores) and the Feliformia (cat-like carnivores). Genome comparisons of dog and cat recovered 506 putatively informative SINE loci for caniformian phylogeny. In this study we show how to use such genome information of model organisms to research the phylogeny of related non-model species of interest. Investigating a dataset including representatives of all major caniformian lineages, we analysed 24 randomly chosen loci for 22 taxa. All loci were amplifiable and revealed 17 parsimony-informative SINE insertions. The screening for informative SINE insertions yields a large amount of sequence information, in particular of introns, which contain reliable phylogenetic information as well. A phylogenetic analysis of intron- and SINE sequence data provided a statistically robust phylogeny which is congruent with the absence/presence pattern of our SINE markers. This phylogeny strongly supports a sistergroup relationship of Musteloidea and Pinnipedia. Within Pinnipedia, we see strong support from bootstrapping and the presence of a SINE insertion for a sistergroup relationship of the walrus with the Otariidae.

  19. Tn5401, a new class II transposable element from Bacillus thuringiensis.

    PubMed Central

    Baum, J A

    1994-01-01

    A new class II (Tn3-like) transposable element, designated Tn5401, was recovered from a sporulation-deficient variant of Bacillus thuringiensis subsp. morrisoni EG2158 following its insertion into a recombinant plasmid. Sequence analysis of the insert revealed a 4,837-bp transposon with two large open reading frames, in the same orientation, encoding proteins of 36 kDa (306 residues) and 116 kDa (1,005 residues) and 53-bp terminal inverted repeats. The deduced amino acid sequence for the 36-kDa protein shows 24% sequence identity with the TnpI recombinase of the B. thuringiensis transposon Tn4430, a member of the phage integrase family of site-specific recombinases. The deduced amino acid sequence for the 116-kDa protein shows 42% sequence identity with the transposase of Tn3 but only 28% identity with the TnpA transposase of Tn4430. Two small open reading frames of unknown function, designated orf1 (85 residues) and orf2 (74 residues), were also identified. Southern blot analysis indicated that Tn5401, in contrast to Tn4430, is not commonly found among different subspecies of B. thuringiensis and is not typically associated with known insecticidal crystal protein genes. Transposition was studied with B. thuringiensis by using plasmid pEG922, a temperature-sensitive shuttle vector containing Tn5401. Tn5401 transposed to both chromosomal and plasmid target sites but displayed an apparent preference for plasmid sites. Transposition was replicative and resulted in the generation of a 5-bp duplication at the target site. Transcriptional start sites within Tn5401 were mapped by primer extension analysis. Two promoters, designated PL and PR, direct the transcription of orf1-orf2 and tnpI-tnpA, respectively, and are negatively regulated by TnpI. Sequence comparison of the promoter regions of Tn5401 and Tn4430 suggests that the conserved sequence element ATGTCCRCTAAY mediates TnpI binding and cointegrate resolution. The same element is contained within the 53-bp terminal inverted repeats, thus accounting for their unusual lengths and suggesting an additional role for TnpI in regulating Tn5401 transposition. Images PMID:7514590

  20. Complex structure of knob DNA on maize chromosome 9. Retrotransposon invasion into heterochromatin.

    PubMed Central

    Ananiev, E V; Phillips, R L; Rines, H W

    1998-01-01

    The recovery of maize (Zea mays L.) chromosome addition lines of oat (Avena sativa L.) from oat x maize crosses enables us to analyze the structure and composition of specific regions, such as knobs, of individual maize chromosomes. A DNA hybridization blot panel of eight individual maize chromosome addition lines revealed that 180-bp repeats found in knobs are present in each of these maize chromosomes, but the copy number varies from approximately 100 to 25, 000. Cosmid clones with knob DNA segments were isolated from a genomic library of an oat-maize chromosome 9 addition line with the help of the 180-bp knob-associated repeated DNA sequence used as a probe. Cloned knob DNA segments revealed a complex organization in which blocks of tandemly arranged 180-bp repeating units are interrupted by insertions of other repeated DNA sequences, mostly represented by individual full size copies of retrotransposable elements. There is an obvious preference for the integration of retrotransposable elements into certain sites (hot spots) of the 180-bp repeat. Sequence microheterogeneity including point mutations and duplications was found in copies of 180-bp repeats. The 180-bp repeats within an array all had the same polarity. Restriction maps constructed for 23 cloned knob DNA fragments revealed the positions of polymorphic sites and sites of integration of insertion elements. Discovery of the interspersion of retrotransposable elements among blocks of tandem repeats in maize and some other organisms suggests that this pattern may be basic to heterochromatin organization for eukaryotes. PMID:9691055

  1. Gene conversion as a secondary mechanism of short interspersed element (SINE) evolution

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kass, D.H.; Batzer, M.A.; Deininger, P.L.

    The Alu repetitive family of short interspersed elements (SINEs) in primates can be subdivided into distinct subfamilies by specific diagnostic nucleotide changes. The older subfamilies are generally very abundant, while the younger subfamilies have fewer copies. Some of the youngest Alu elements are absent in the orthologous loci of nonhuman primates, indicative of recent retroposition events, the primary mode of SINE evolutions. PCR analysis of one young Alu subfamily (Sb2) member found in the low-density lipoprotein receptor gene apparently revealed the presence of this element in the green monkey, orangutan, gorilla, and chimpanzee genomes, as well as the human genome.more » However, sequence analysis of these genomes revealed a highly mutated, older, primate-specific Alu element was present at this position in the nonhuman primates. Comparison of the flanking DNA sequences upstream of this Alu insertion corresponded to evolution expected for standard primate phylogeny, but comparison of the Alu repeat sequences revealed that the human element departed from this phylogeny. The change in the human sequence apparently occurred by a gene conversion event only within the Alu element itself, converting it from one of the oldest to one of the youngest Alu subfamilies. Although gene conversions of Alu elements are clearly very rare, this finding shows that such events can occur and contribute to specific cases of SINE subfamily evolution.« less

  2. Telomeric P elements associated with cytotype regulation of the P transposon family in Drosophila melanogaster.

    PubMed Central

    Stuart, Jeremy R; Haley, Kevin J; Swedzinski, Douglas; Lockner, Samuel; Kocian, Paul E; Merriman, Peter J; Simmons, Michael J

    2002-01-01

    P elements inserted at the left end of the Drosophila X chromosome were isolated genetically from wild-type P strains. Stocks carrying these elements were tested for repression of P-strain-induced gonadal dysgenesis in females and for repression of transposase-catalyzed P-element excision in males and females. Both traits were repressed by stocks carrying either complete or incomplete P elements inserted near the telomere of the X chromosome in cytological region 1A, but not by stocks carrying only nontelomeric X-linked P elements. All three of the telomeric P elements that were analyzed at the molecular level were inserted in one of the 1.8-kb telomere-associated sequence (TAS) repeats near the end of the X chromosome. Stocks with these telomeric P elements strongly repressed P-element excision induced in the male germline by a P strain or by the transposase-producing transgenes H(hsp/CP)2, H(hsp/CP)3, a combination of these two transgenes, and P(ry(+), delta2-3)99B. For H(hsp/CP)2 and P(ry(+), delta2-3)99B, the repression was also effective when the flies were subjected to heat-shock treatments. However, these stocks did not repress the somatic transposase activity of P(ry(+), delta2-3)99B. Repression of transposase activity in the germline required maternal transmission of the telomeric P elements themselves. Paternal transmission of these elements, or maternal transmission of the cytoplasm from carriers, both were insufficient to repress transposase activity. Collectively, these findings indicate that the regulatory abilities of telomeric P elements are similar to those of the P cytotype. PMID:12524339

  3. Chromosomal insertion and excision of a 30 kb unstable genetic element is responsible for phase variation of lipopolysaccharide and other virulence determinants in Legionella pneumophila.

    PubMed

    Lüneberg, E; Mayer, B; Daryab, N; Kooistra, O; Zähringer, U; Rohde, M; Swanson, J; Frosch, M

    2001-03-01

    We recently described the phase-variable expression of a virulence-associated lipopolysaccharide (LPS) epitope in Legionella pneumophila. In this study, the molecular mechanism for phase variation was investigated. We identified a 30 kb unstable genetic element as the molecular origin for LPS phase variation. Thirty putative genes were encoded on the 30 kb sequence, organized in two putative opposite transcription units. Some of the open reading frames (ORFs) shared homologies with bacteriophage genes, suggesting that the 30 kb element was of phage origin. In the virulent wild-type strain, the 30 kb element was located on the chromosome, whereas excision from the chromosome and replication as a high-copy plasmid resulted in the mutant phenotype, which is characterized by alteration of an LPS epitope and loss of virulence. Mapping and sequencing of the insertion site in the genome revealed that the chromosomal attachment site was located in an intergenic region flanked by genes of unknown function. As phage release could not be induced by mitomycin C, it is conceivable that the 30 kb element is a non-functional phage remnant. The protein encoded by ORF T on the 30 kb plasmid could be isolated by an outer membrane preparation, indicating that the genes encoded on the 30 kb element are expressed in the mutant phenotype. Therefore, it is conceivable that the phenotypic alterations seen in the mutant depend on high-copy replication of the 30 kb element and expression of the encoded genes. Excision of the 30 kb element from the chromosome was found to occur in a RecA-independent pathway, presumably by the involvement of RecE, RecT and RusA homologues that are encoded on the 30 kb element.

  4. Tol2 transposon-mediated transgenesis in Xenopus tropicalis.

    PubMed

    Hamlet, Michelle R Johnson; Yergeau, Donald A; Kuliyev, Emin; Takeda, Masatoshi; Taira, Masanori; Kawakami, Koichi; Mead, Paul E

    2006-09-01

    The diploid frog Xenopus tropicalis is becoming a powerful developmental genetic model system. Sequencing of the X. tropicalis genome is nearing completion and several labs are embarking on mutagenesis screens. We are interested in developing insertional mutagenesis strategies in X. tropicalis. Transposon-mediated insertional mutagenesis, once used exclusively in plants and invertebrate systems, is now more widely applicable to vertebrates. The first step in developing transposons as tools for mutagenesis is to demonstrate that these mobile elements function efficiently in the target organism. Here, we show that the Medaka fish transposon, Tol2, is able to stably integrate into the X. tropicalis genome and will serve as a powerful tool for insertional mutagenesis strategies in the frog.

  5. Genetic and DNA sequence analysis of the kanamycin resistance transposon Tn903.

    PubMed Central

    Grindley, N D; Joyce, C M

    1980-01-01

    The kanamycin resistance transposon Tn903 consists of a unique region of about 1000 base pairs bounded by a pair of 1050-base-pair inverted repeat sequences. Each repeat contains two Pvu II endonuclease cleavage sites separated by 520 base pairs. We have constructed derivatives of Tn903 in which this 520-base-pair fragment is deleted from one or both repeats. Those derivatives that lack both 520-base-pair fragments cannot transpose, whereas those that lack just one remain transposition proficient. One such transposable derivative, Tn903 delta I, has been selected for further study. We have determined the sequence of the intact inverted repeat. The 18 base pairs at each end are identical and inverted relative to one another, a structure characteristic of insertion sequences. Additional experiments indicate that a single inverted repeat from Tn903 can, in fact, transpose; we propose that this element be called IS903. To correlate the DNA sequence with genetic activities, we have created mutations by inserting a 10-base-pair DNA fragment at several sites within the intact repeat of Tn903 delta 1, and we have examined the effect of such insertions on transposability. The results suggest that IS903 encodes a 307-amino-acid polypeptide (a "transposase") that is absolutely required for transposition of IS903 or Tn903. Images PMID:6261245

  6. Complete genome analysis of three Acinetobacter baumannii clinical isolates in China for insight into the diversification of drug resistance elements.

    PubMed

    Zhu, Lingxiang; Yan, Zhongqiang; Zhang, Zhaojun; Zhou, Qiming; Zhou, Jinchun; Wakeland, Edward K; Fang, Xiangdong; Xuan, Zhenyu; Shen, Dingxia; Li, Quan-Zhen

    2013-01-01

    The emergence and rapid spreading of multidrug-resistant Acinetobacter baumannii strains has become a major health threat worldwide. To better understand the genetic recombination related with the acquisition of drug-resistant elements during bacterial infection, we performed complete genome analysis on three newly isolated multidrug-resistant A. baumannii strains from Beijing using next-generation sequencing technology. Whole genome comparison revealed that all 3 strains share some common drug resistant elements including carbapenem-resistant bla OXA-23 and tetracycline (tet) resistance islands, but the genome structures are diversified among strains. Various genomic islands intersperse on the genome with transposons and insertions, reflecting the recombination flexibility during the acquisition of the resistant elements. The blood-isolated BJAB07104 and ascites-isolated BJAB0868 exhibit high similarity on their genome structure with most of the global clone II strains, suggesting these two strains belong to the dominant outbreak strains prevalent worldwide. A large resistance island (RI) of about 121-kb, carrying a cluster of resistance-related genes, was inserted into the ATPase gene on BJAB07104 and BJAB0868 genomes. A 78-kb insertion element carrying tra-locus and bla OXA-23 island, can be either inserted into one of the tniB gene in the 121-kb RI on the chromosome, or transformed to conjugative plasmid in the two BJAB strains. The third strains of this study, BJAB0715, which was isolated from spinal fluid, exhibit much more divergence compared with above two strains. It harbors multiple drug-resistance elements including a truncated AbaR-22-like RI on its genome. One of the unique features of this strain is that it carries both bla OXA-23 and bla OXA-58 genes on its genome. Besides, an Acinetobacter lwoffii adeABC efflux element was found inserted into the ATPase position in BJAB0715. Our comparative analysis on currently completed Acinetobacter baumannii genomes revealed extensive and dynamic genome organizations, which may facilitate the bacteria to acquire drug-resistance elements into their genomes.

  7. Transcription initiation from the dihydrofolate reductase promoter is positioned by HIP1 binding at the initiation site.

    PubMed

    Means, A L; Farnham, P J

    1990-02-01

    We have identified a sequence element that specifies the position of transcription initiation for the dihydrofolate reductase gene. Unlike the functionally analogous TATA box that directs RNA polymerase II to initiate transcription 30 nucleotides downstream, the positioning element of the dihydrofolate reductase promoter is located directly at the site of transcription initiation. By using DNase I footprint analysis, we have shown that a protein binds to this initiator element. Transcription initiated at the dihydrofolate reductase initiator element when 28 nucleotides were inserted between it and all other upstream sequences, or when it was placed on either side of the DNA helix, suggesting that there is no strict spatial requirement between the initiator and an upstream element. Although neither a single Sp1-binding site nor a single initiator element was sufficient for transcriptional activity, the combination of one Sp1-binding site and the dihydrofolate reductase initiator element cloned into a plasmid vector resulted in transcription starting at the initiator element. We have also shown that the simian virus 40 late major initiation site has striking sequence homology to the dihydrofolate reductase initiation site and that the same, or a similar, protein binds to both sites. Examination of the sequences at other RNA polymerase II initiation sites suggests that we have identified an element that is important in the transcription of other housekeeping genes. We have thus named the protein that binds to the initiator element HIP1 (Housekeeping Initiator Protein 1).

  8. Genome of the Actinomycete Plant Pathogen Clavibacter michiganensis subsp. sepedonicus Suggests Recent Niche Adaptation▿ †

    PubMed Central

    Bentley, Stephen D.; Corton, Craig; Brown, Susan E.; Barron, Andrew; Clark, Louise; Doggett, Jon; Harris, Barbara; Ormond, Doug; Quail, Michael A.; May, Georgiana; Francis, David; Knudson, Dennis; Parkhill, Julian; Ishimaru, Carol A.

    2008-01-01

    Clavibacter michiganensis subsp. sepedonicus is a plant-pathogenic bacterium and the causative agent of bacterial ring rot, a devastating agricultural disease under strict quarantine control and zero tolerance in the seed potato industry. This organism appears to be largely restricted to an endophytic lifestyle, proliferating within plant tissues and unable to persist in the absence of plant material. Analysis of the genome sequence of C. michiganensis subsp. sepedonicus and comparison with the genome sequences of related plant pathogens revealed a dramatic recent evolutionary history. The genome contains 106 insertion sequence elements, which appear to have been active in extensive rearrangement of the chromosome compared to that of Clavibacter michiganensis subsp. michiganensis. There are 110 pseudogenes with overrepresentation in functions associated with carbohydrate metabolism, transcriptional regulation, and pathogenicity. Genome comparisons also indicated that there is substantial gene content diversity within the species, probably due to differential gene acquisition and loss. These genomic features and evolutionary dating suggest that there was recent adaptation for life in a restricted niche where nutrient diversity and perhaps competition are low, correlated with a reduced ability to exploit previously occupied complex niches outside the plant. Toleration of factors such as multiplication and integration of insertion sequence elements, genome rearrangements, and functional disruption of many genes and operons seems to indicate that there has been general relaxation of selective pressure on a large proportion of the genome. PMID:18192393

  9. Expressing genes do not forget their LINEs: transposable elements and gene expression

    PubMed Central

    Kines, Kristine J.; Belancio, Victoria P.

    2012-01-01

    1. ABSTRACT Historically the accumulated mass of mammalian transposable elements (TEs), particularly those located within gene boundaries, was viewed as a genetic burden potentially detrimental to the genomic landscape. This notion has been strengthened by the discovery that transposable sequences can alter the architecture of the transcriptome, not only through insertion, but also long after the integration process is completed. Insertions previously considered harmless are now known to impact the expression of host genes via modification of the transcript quality or quantity, transcriptional interference, or by the control of pathways that affect the mRNA life-cycle. Conversely, several examples of the evolutionary advantageous impact of TEs on the host gene structure that diversified the cellular transcriptome are reported. TE-induced changes in gene expression can be tissue-or disease-specific, raising the possibility that the impact of TE sequences may vary during development, among normal cell types, and between normal and disease-affected tissues. The understanding of the rules and abundance of TE-interference with gene expression is in its infancy, and its contribution to human disease and/or evolution remains largely unexplored. PMID:22201807

  10. Dictyostelium mobile elements: strategies to amplify in a compact genome.

    PubMed

    Winckler, T; Dingermann, T; Glöckner, G

    2002-12-01

    Dictyostelium discoideum is a eukaryotic microorganism that is attractive for the study of fundamental biological phenomena such as cell-cell communication, formation of multicellularity, cell differentiation and morphogenesis. Large-scale sequencing of the D. discoideum genome has provided new insights into evolutionary strategies evolved by transposable elements (TEs) to settle in compact microbial genomes and to maintain active populations over evolutionary time. The high gene density (about 1 gene/2.6 kb) of the D. discoideum genome leaves limited space for selfish molecular invaders to move and amplify without causing deleterious mutations that eradicate their host. Targeting of transfer RNA (tRNA) gene loci appears to be a generally successful strategy for TEs residing in compact genomes to insert away from coding regions. In D. discoideum, tRNA gene-targeted retrotransposition has evolved independently at least three times by both non-long terminal repeat (LTR) retrotransposons and retrovirus-like LTR retrotransposons. Unlike the nonspecifically inserting D. discoideum TEs, which have a strong tendency to insert into preexisting TE copies and form large and complex clusters near the ends of chromosomes, the tRNA gene-targeted retrotransposons have managed to occupy 75% of the tRNA gene loci spread on chromosome 2 and represent 80% of the TEs recognized on the assembled central 6.5-Mb part of chromosome 2. In this review we update the available information about D. discoideum TEs which emerges both from previous work and current large-scale genome sequencing, with special emphasis on the fact that tRNA genes are principal determinants of retrotransposon insertions into the D. discoideum genome.

  11. Transposon Invasion of the Paramecium Germline Genome Countered by a Domesticated PiggyBac Transposase and the NHEJ Pathway

    PubMed Central

    Dubois, Emeline; Bischerour, Julien; Marmignon, Antoine; Mathy, Nathalie; Régnier, Vinciane; Bétermier, Mireille

    2012-01-01

    Sequences related to transposons constitute a large fraction of extant genomes, but insertions within coding sequences have generally not been tolerated during evolution. Thanks to their unique nuclear dimorphism and to their original mechanism of programmed DNA elimination from their somatic nucleus (macronucleus), ciliates are emerging model organisms for the study of the impact of transposable elements on genomes. The germline genome of the ciliate Paramecium, located in its micronucleus, contains thousands of short intervening sequences, the IESs, which interrupt 47% of genes. Recent data provided support to the hypothesis that an evolutionary link exists between Paramecium IESs and Tc1/mariner transposons. During development of the macronucleus, IESs are excised precisely thanks to the coordinated action of PiggyMac, a domesticated piggyBac transposase, and of the NHEJ double-strand break repair pathway. A PiggyMac homolog is also required for developmentally programmed DNA elimination in another ciliate, Tetrahymena. Here, we present an overview of the life cycle of these unicellular eukaryotes and of the developmentally programmed genome rearrangements that take place at each sexual cycle. We discuss how ancient domestication of a piggyBac transposase might have allowed Tc1/mariner elements to spread throughout the germline genome of Paramecium, without strong counterselection against insertion within genes. PMID:22888464

  12. Characterization of a new high copy Stowaway family MITE, BRAMI-1 in Brassica genome

    PubMed Central

    2013-01-01

    Background Miniature inverted-repeat transposable elements (MITEs) are expected to play important roles in evolution of genes and genome in plants, especially in the highly duplicated plant genomes. Various MITE families and their roles in plants have been characterized. However, there have been fewer studies of MITE families and their potential roles in evolution of the recently triplicated Brassica genome. Results We identified a new MITE family, BRAMI-1, belonging to the Stowaway super-family in the Brassica genome. In silico mapping revealed that 697 members are dispersed throughout the euchromatic regions of the B. rapa pseudo-chromosomes. Among them, 548 members (78.6%) are located in gene-rich regions, less than 3 kb from genes. In addition, we identified 516 and 15 members in the 470 Mb and 15 Mb genomic shotgun sequences currently available for B. oleracea and B. napus, respectively. The resulting estimated copy numbers for the entire genomes were 1440, 1464 and 2490 in B. rapa, B. oleracea and B. napus, respectively. Concurrently, only 70 members of the related Arabidopsis ATTIRTA-1 MITE family were identified in the Arabidopsis genome. Phylogenetic analysis revealed that BRAMI-1 elements proliferated in the Brassica genus after divergence from the Arabidopsis lineage. MITE insertion polymorphism (MIP) was inspected for 50 BRAMI-1 members, revealing high levels of insertion polymorphism between and within species of Brassica that clarify BRAMI-1 activation periods up to the present. Comparative analysis of the 71 genes harbouring the BRAMI-1 elements with their non-insertion paralogs (NIPs) showed that the BRAMI-1 insertions mainly reside in non-coding sequences and that the expression levels of genes with the elements differ from those of their NIPs. Conclusion A Stowaway family MITE, named as BRAMI-1, was gradually amplified and remained present in over than 1400 copies in each of three Brassica species. Overall, 78% of the members were identified in gene-rich regions, and it is assumed that they may contribute to the evolution of duplicated genes in the highly duplicated Brassica genome. The resulting MIPs can serve as a good source of DNA markers for Brassica crops because the insertions are highly dispersed in the gene-rich euchromatin region and are polymorphic between or within species. PMID:23547712

  13. Optical mapping reveals a large genetic inversion between two methicillin-resistant Staphylococcus aureus strains.

    PubMed

    Shukla, Sanjay K; Kislow, Jennifer; Briska, Adam; Henkhaus, John; Dykes, Colin

    2009-09-01

    Staphylococcus aureus is a highly versatile and evolving bacterium of great clinical importance. S. aureus can evolve by acquiring single nucleotide polymorphisms and mobile genetic elements and by recombination events. Identification and location of novel genomic elements in a bacterial genome are not straightforward, unless the whole genome is sequenced. Optical mapping is a new tool that creates a high-resolution, in situ ordered restriction map of a bacterial genome. These maps can be used to determine genomic organization and perform comparative genomics to identify genomic rearrangements, such as insertions, deletions, duplications, and inversions, compared to an in silico (virtual) restriction map of a known genome sequence. Using this technology, we report here the identification, approximate location, and characterization of a genetic inversion of approximately 500 kb of a DNA element between the NRS387 (USA800) and FPR3757 (USA300) strains. The presence of the inversion and location of its junction sites were confirmed by site-specific PCR and sequencing. At both the left and right junction sites in NRS387, an IS1181 element and a 73-bp sequence were identified as inverted repeats, which could explain the possible mechanism of the inversion event.

  14. Dissemination of streptococcal pyrogenic exotoxin G (spegg) with an IS-like element in fish isolates of Streptococcus dysgalactiae.

    PubMed

    Abdelsalam, Mohamed; Chen, Shih-Chu; Yoshida, Terutoyo

    2010-08-01

    The Lancefield group C alpha-hemolytic Streptococcus dysgalactiae ssp. dysgalactiae (GCSD) causes systemic granulomatous inflammatory disease and high mortality rates in infected fish. Superantigen and streptolysin S genes are the most important virulence factors contributing to an invasive streptococcal infection. PCR amplification revealed that all strains isolated from moribund fish harbored the streptolysin S structural gene (sagA). GCSD fish isolates were PCR negative for emm, speA, speB, speC, speM, smeZ, and ssa. However, the size of the streptococcal pyrogenic exotoxin G (spegg) locus, a superantigen, in positive S. dysgalactiae fish and pig strains was variable. The ORF of the spegg locus of 26 GCSD fish strains and one GCSD pig strain was inserted with IS981SC. Interestingly, the ORF of the spegg locus of two fish strains of GCSD collected in Malaysia was inserted with an IS981SC-IS1161 hybrid IS element. The hybrid IS element was found in all of the GCSD fish isolates and one GCSD pig through PCR screening. Although no insertion sequence (IS) was detected in the spegg locus of S. dysgalactiae ssp. equisimilis (GCSE) strains, a five-nucleotide deletion mutation was detected in the ORF of the spegg locus of one GCSE strain at the supposed site of IS981SC insertion, resulting in a frameshift mutation.

  15. Exaptation of Transposable Elements into Novel Cis-Regulatory Elements: Is the Evidence Always Strong?

    PubMed Central

    de Souza, Flávio S.J.; Franchini, Lucía F.; Rubinstein, Marcelo

    2013-01-01

    Transposable elements (TEs) are mobile genetic sequences that can jump around the genome from one location to another, behaving as genomic parasites. TEs have been particularly effective in colonizing mammalian genomes, and such heavy TE load is expected to have conditioned genome evolution. Indeed, studies conducted both at the gene and genome levels have uncovered TE insertions that seem to have been co-opted—or exapted—by providing transcription factor binding sites (TFBSs) that serve as promoters and enhancers, leading to the hypothesis that TE exaptation is a major factor in the evolution of gene regulation. Here, we critically review the evidence for exaptation of TE-derived sequences as TFBSs, promoters, enhancers, and silencers/insulators both at the gene and genome levels. We classify the functional impact attributed to TE insertions into four categories of increasing complexity and argue that so far very few studies have conclusively demonstrated exaptation of TEs as transcriptional regulatory regions. We also contend that many genome-wide studies dealing with TE exaptation in recent lineages of mammals are still inconclusive and that the hypothesis of rapid transcriptional regulatory rewiring mediated by TE mobilization must be taken with caution. Finally, we suggest experimental approaches that may help attributing higher-order functions to candidate exapted TEs. PMID:23486611

  16. An interferon regulatory factor binding site in the U5 region of the bovine leukemia virus long terminal repeat stimulates Tax-independent gene expression.

    PubMed

    Kiermer, V; Van Lint, C; Briclet, D; Vanhulle, C; Kettmann, R; Verdin, E; Burny, A; Droogmans, L

    1998-07-01

    Bovine leukemia virus (BLV) replication is controlled by both cis- and trans-acting elements. The virus-encoded transactivator, Tax, is necessary for efficient transcription from the BLV promoter, although it is not present during the early stages of infection. Therefore, sequences that control Tax-independent transcription must play an important role in the initiation of viral gene expression. This study demonstrates that the R-U5 sequence of BLV stimulates Tax-independent reporter gene expression directed by the BLV promoter. R-U5 was also stimulatory when inserted immediately downstream from the transcription initiation site of a heterologous promoter. Progressive deletion analysis of this region revealed that a 46-bp element corresponding to the 5' half of U5 is principally responsible for the stimulation. This element exhibited enhancer activity when inserted upstream or downstream from the herpes simplex virus thymidine kinase promoter. This enhancer contains a binding site for the interferon regulatory factors IRF-1 and IRF-2. A 3-bp mutation that destroys the IRF recognition site caused a twofold decrease in Tax-independent BLV long terminal repeat-driven gene expression. These observations suggest that the IRF binding site in the U5 region of BLV plays a role in the initiation of virus replication.

  17. Exonization of an Intronic LINE-1 Element Causing Becker Muscular Dystrophy as a Novel Mutational Mechanism in Dystrophin Gene.

    PubMed

    Gonçalves, Ana; Oliveira, Jorge; Coelho, Teresa; Taipa, Ricardo; Melo-Pires, Manuel; Sousa, Mário; Santos, Rosário

    2017-10-03

    A broad mutational spectrum in the dystrophin ( DMD ) gene, from large deletions/duplications to point mutations, causes Duchenne/Becker muscular dystrophy (D/BMD). Comprehensive genotyping is particularly relevant considering the mutation-centered therapies for dystrophinopathies. We report the genetic characterization of a patient with disease onset at age 13 years, elevated creatine kinase levels and reduced dystrophin labeling, where multiplex-ligation probe amplification (MLPA) and genomic sequencing failed to detect pathogenic variants. Bioinformatic, transcriptomic (real time PCR, RT-PCR), and genomic approaches (Southern blot, long-range PCR, and single molecule real-time sequencing) were used to characterize the mutation. An aberrant transcript was identified, containing a 103-nucleotide insertion between exons 51 and 52, with no similarity with the DMD gene. This corresponded to the partial exonization of a long interspersed nuclear element (LINE-1), disrupting the open reading frame. Further characterization identified a complete LINE-1 (~6 kb with typical hallmarks) deeply inserted in intron 51. Haplotyping and segregation analysis demonstrated that the mutation had a de novo origin. Besides underscoring the importance of mRNA studies in genetically unsolved cases, this is the first report of a disease-causing fully intronic LINE-1 element in DMD , adding to the diversity of mutational events that give rise to D/BMD.

  18. Phylogenetic Conflict in Bears Identified by Automated Discovery of Transposable Element Insertions in Low-Coverage Genomes

    PubMed Central

    Gallus, Susanne; Janke, Axel

    2017-01-01

    Abstract Phylogenetic reconstruction from transposable elements (TEs) offers an additional perspective to study evolutionary processes. However, detecting phylogenetically informative TE insertions requires tedious experimental work, limiting the power of phylogenetic inference. Here, we analyzed the genomes of seven bear species using high-throughput sequencing data to detect thousands of TE insertions. The newly developed pipeline for TE detection called TeddyPi (TE detection and discovery for Phylogenetic Inference) identified 150,513 high-quality TE insertions in the genomes of ursine and tremarctine bears. By integrating different TE insertion callers and using a stringent filtering approach, the TeddyPi pipeline produced highly reliable TE insertion calls, which were confirmed by extensive in vitro validation experiments. Analysis of single nucleotide substitutions in the flanking regions of the TEs shows that these substitutions correlate with the phylogenetic signal from the TE insertions. Our phylogenomic analyses show that TEs are a major driver of genomic variation in bears and enabled phylogenetic reconstruction of a well-resolved species tree, despite strong signals for incomplete lineage sorting and introgression. The analyses show that the Asiatic black, sun, and sloth bear form a monophyletic clade, in which phylogenetic incongruence originates from incomplete lineage sorting. TeddyPi is open source and can be adapted to various TE and structural variation callers. The pipeline makes it possible to confidently extract thousands of TE insertions even from low-coverage genomes (∼10×) of nonmodel organisms. This opens new possibilities for biologists to study phylogenies and evolutionary processes as well as rates and patterns of (retro-)transposition and structural variation. PMID:28985298

  19. Diversity and structure of PIF/Harbinger-like elements in the genome of Medicago truncatula

    PubMed Central

    Grzebelus, Dariusz; Lasota, Slawomir; Gambin, Tomasz; Kucherov, Gregory; Gambin, Anna

    2007-01-01

    Background Transposable elements constitute a significant fraction of plant genomes. The PIF/Harbinger superfamily includes DNA transposons (class II elements) carrying terminal inverted repeats and producing a 3 bp target site duplication upon insertion. The presence of an ORF coding for the DDE/DDD transposase, required for transposition, is characteristic for the autonomous PIF/Harbinger-like elements. Based on the above features, PIF/Harbinger-like elements were identified in several plant genomes and divided into several evolutionary lineages. Availability of a significant portion of Medicago truncatula genomic sequence allowed for mining PIF/Harbinger-like elements, starting from a single previously described element MtMaster. Results Twenty two putative autonomous, i.e. carrying an ORF coding for TPase and complete terminal inverted repeats, and 67 non-autonomous PIF/Harbinger-like elements were found in the genome of M. truncatula. They were divided into five families, MtPH-A5, MtPH-A6, MtPH-D,MtPH-E, and MtPH-M, corresponding to three previously identified and two new lineages. The largest families, MtPH-A6 and MtPH-M were further divided into four and three subfamilies, respectively. Non-autonomous elements were usually direct deletion derivatives of the putative autonomous element, however other types of rearrangements, including inversions and nested insertions were also observed. An interesting structural characteristic – the presence of 60 bp tandem repeats – was observed in a group of elements of subfamily MtPH-A6-4. Some families could be related to miniature inverted repeat elements (MITEs). The presence of empty loci (RESites), paralogous to those flanking the identified transposable elements, both autonomous and non-autonomous, as well as the presence of transposon insertion related size polymorphisms, confirmed that some of the mined elements were capable for transposition. Conclusion The population of PIF/Harbinger-like elements in the genome of M. truncatula is diverse. A detailed intra-family comparison of the elements' structure proved that they proliferated in the genome generally following the model of abortive gap repair. However, the presence of tandem repeats facilitated more pronounced rearrangements of the element internal regions. The insertion polymorphism of the MtPH elements and related MITE families in different populations of M. truncatula, if further confirmed experimentally, could be used as a source of molecular markers complementary to other marker systems. PMID:17996080

  20. Transposable elements and insecticide resistance.

    PubMed

    Rostant, Wayne G; Wedell, Nina; Hosken, David J

    2012-01-01

    Transposable elements (TEs) are mobile DNA sequences that are able to copy themselves within a host genome. They were initially characterized as selfish genes because of documented or presumed costs to host fitness, but it has become increasingly clear that not all TEs reduce host fitness. A good example of TEs benefiting hosts is seen with insecticide resistance, where in a number of cases, TE insertions near specific genes confer resistance to these man-made products. This is particularly true of Accord and associated TEs in Drosophila melanogaster and Doc insertions in Drosophila simulans. The first of these insertions also has sexually antagonistic fitness effects in the absence of insecticides, and although the magnitude of this effect depends on the genetic background in which Accord finds itself, this represents an excellent example of intralocus sexual conflict where the precise allele involved is well characterized. We discuss this finding and the role of TEs in insecticide resistance. We also highlight areas for further research, including the need for surveys of the prevalence and fitness consequences of the Doc insertion and how Drosophila can be used as models to investigate resistance in pest species. Copyright © 2012 Elsevier Inc. All rights reserved.

  1. Comparative analysis of complete orthologous centromeres from two subspecies of rice reveals rapid variation of centromere organization and structure.

    PubMed

    Wu, Jianzhong; Fujisawa, Masaki; Tian, Zhixi; Yamagata, Harumi; Kamiya, Kozue; Shibata, Michie; Hosokawa, Satomi; Ito, Yukiyo; Hamada, Masao; Katagiri, Satoshi; Kurita, Kanako; Yamamoto, Mayu; Kikuta, Ari; Machita, Kayo; Karasawa, Wataru; Kanamori, Hiroyuki; Namiki, Nobukazu; Mizuno, Hiroshi; Ma, Jianxin; Sasaki, Takuji; Matsumoto, Takashi

    2009-12-01

    Centromeres are sites for assembly of the chromosomal structures that mediate faithful segregation at mitosis and meiosis. This function is conserved across species, but the DNA components that are involved in kinetochore formation differ greatly, even between closely related species. To shed light on the nature, evolutionary timing and evolutionary dynamics of rice centromeres, we decoded a 2.25-Mb DNA sequence covering the centromeric region of chromosome 8 of an indica rice variety, 'Kasalath' (Kas-Cen8). Analysis of repetitive sequences in Kas-Cen8 led to the identification of 222 long terminal repeat (LTR)-retrotransposon elements and 584 CentO satellite monomers, which account for 59.2% of the region. A comparison of the Kas-Cen8 sequence with that of japonica rice 'Nipponbare' (Nip-Cen8) revealed that about 66.8% of the Kas-Cen8 sequence was collinear with that of Nip-Cen8. Although the 27 putative genes are conserved between the two subspecies, only 55.4% of the total LTR-retrotransposon elements in 'Kasalath' had orthologs in 'Nipponbare', thus reflecting recent proliferation of a considerable number of LTR-retrotransposons since the divergence of two rice subspecies of indica and japonica within Oryza sativa. Comparative analysis of the subfamilies, time of insertion, and organization patterns of inserted LTR-retrotransposons between the two Cen8 regions revealed variations between 'Kasalath' and 'Nipponbare' in the preferential accumulation of CRR elements, and the expansion of CentO satellite repeats within the core domain of Cen8. Together, the results provide insights into the recent proliferation of LTR-retrotransposons, and the rapid expansion of CentO satellite repeats, underlying the dynamic variation and plasticity of plant centromeres.

  2. Insertion Sequence-Caused Large Scale-Rearrangements in the Genome of Escherichia coli

    DTIC Science & Technology

    2016-07-18

    rearrangements in the genome of Escherichia coli Heewook Lee1,2, Thomas G. Doak3,4, Ellen Popodi3, Patricia L. Foster3 and Haixu Tang1,* 1School of...and excisions of IS elements and recombi- nation between homologous IS elements identified in a large collection of Escherichia coli mutation accu...scale rear- rangements arose in the Escherichia coli genome during a long-term evolution experiment in a recent study (8). Com- bining WGSS with

  3. Final Report for LDRD Project 02-ERD-069: Discovering the Unknown Mechanism(s) of Virulence in a BW, Class A Select Agent

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chain, P; Garcia, E

    2003-02-06

    The goal of this proposed effort was to assess the difficulty in identifying and characterizing virulence candidate genes in an organism for which very limited data exists. This was accomplished by first addressing the finishing phase of draft-sequenced F. tularensis genomes and conducting comparative analyses to determine the coding potential of each genome; to discover the differences in genome structure and content, and to identify potential genes whose products may be involved in the F. tularensis virulence process. The project was divided into three parts: (1) Genome finishing: This part involves determining the order and orientation of the consensus sequencesmore » of contigs obtained from Phrap assemblies of random draft genomic sequences. This tedious process consists of linking contig ends using information embedded in each sequence file that relates the sequence to the original cloned insert. Since inserts are sequenced from both ends, we can establish a link between these paired-ends in different contigs and thus order and orient contigs. Since these genomes carry numerous copies of insertion sequences, these repeated elements ''confuse'' the Phrap assembly program. It is thus necessary to break these contigs apart at the repeated sequences and individually join the proper flanking regions using paired-end information, or using results of comparisons against a similar genome. Larger repeated elements such as the small subunit ribosomal RNA operon require verification with PCR. Tandem repeats require manual intervention and typically rely on single nucleotide polymorphisms to be resolved. Remaining gaps require PCR reactions and sequencing. Once the genomes have been ''closed'', low quality regions are addressed by resequencing reactions. (2) Genome analysis: The final consensus sequences are processed by combining the results of three gene modelers: Glimmer, Critica and Generation. The final gene models are submitted to a battery of homology searches and domain prediction programs in order to annotate them (e.g. BLAST, Pfam, TIGRfam, COG, KEGG, InterPro, TMhmm, SignalP). The genome structure is also assessed in terms of G+C content, GC bias (GC skew), and locations of repeated regions (e.g. IS elements) and phage-like genes. (3) Comparative genomics: The results of the various genome analyses are compared between the finished (or almost finished) genomes. Here, we have compared the F. tularensis genomes from the extremely lethal strain Schu4 (subsp. tularensis), the vaccine strain LVS (subsp. holartica), and strain UT01-4992 of the less virulent, opportunistic subsp. novicida. Regions present in the highly virulent strain that are absent from the other less virulent strains may provide insight into what factors are required for the high level of virulence.« less

  4. The LINEs and SINEs of Entamoeba histolytica: comparative analysis and genomic distribution.

    PubMed

    Bakre, Abhijeet A; Rawal, Kamal; Ramaswamy, Ram; Bhattacharya, Alok; Bhattacharya, Sudha

    2005-07-01

    Autonomous non-long terminal repeat retrotransposons are commonly referred to as long interspersed elements (LINEs). Short non-autonomous elements that borrow the LINE machinery are called SINES. The Entamoeba histolytica genome contains three classes of LINEs and SINEs. Together the EhLINEs/SINEs account for about 6% of the genome. The recognizable functional domains in all three EhLINEs included reverse transcriptase and endonuclease. A novel feature was the presence of two types of members-some with a single long ORF (less frequent) and some with two ORFs (more frequent) in both EhLINE1 and 2. The two ORFs were generated by conserved changes leading to stop codon. Computational analysis of the immediate flanking sequences for each element showed that they inserted in AT-rich sequences, with a preponderance of Ts in the upstream site. The elements were very frequently located close to protein-coding genes and other EhLINEs/SINEs. The possible influence of these elements on expression of neighboring genes needs to be determined.

  5. A mobile threat to genome stability: The impact of non-LTR retrotransposons upon the human genome

    PubMed Central

    Konkel, Miriam K.; Batzer, Mark A.

    2010-01-01

    It is now commonly agreed that the human genome is not the stable entity originally presumed. Deletions, duplications, inversions, and insertions are common, and contribute significantly to genomic structural variations (SVs). Their collective impact generates much of the inter-individual genomic diversity observed among humans. Not only do these variations change the structure of the genome; they may also have functional implications, e.g. altered gene expression. Some SVs have been identified as the cause of genetic disorders, including cancer predisposition. Cancer cells are notorious for their genomic instability, and often show genomic rearrangements at the microscopic and submicroscopic level to which transposable elements (TEs) contribute. Here, we review the role of TEs in genome instability, with particular focus on non-LTR retrotransposons. Currently, three non-LTR retrotransposon families – long interspersed element 1 (L1), SVA (short interspersed element (SINE-R), variable number of tandem repeats (VNTR), and Alu), and Alu (a SINE) elements – mobilize in the human genome, and cause genomic instability through both insertion- and post-insertion-based mutagenesis. Due to the abundance and high sequence identity of TEs, they frequently mislead the homologous recombination repair pathway into non-allelic homologous recombination, causing deletions, duplications, and inversions. While less comprehensively studied, non-LTR retrotransposon insertions and TE-mediated rearrangements are probably more common in cancer cells than in healthy tissue. This may be at least partially attributed to the commonly seen global hypomethylation as well as general epigenetic dysfunction of cancer cells. Where possible, we provide examples that impact cancer predisposition and/or development. PMID:20307669

  6. Reading the tea leaves: Dead transposon copies reveal novel host and transposon biology.

    PubMed

    McLaughlin, Richard N

    2018-03-01

    Transposable elements comprise a huge portion of most animal genomes. Unlike many pathogens, these elements leave a mark of their impact via their insertion into host genomes. With proper teasing, these sequences can relay information about the evolutionary history of transposons and their hosts. In a new publication, Larson and colleagues describe a previously unappreciated density of long interspersed element-1 (LINE-1) sequences that have been spliced (LINE-1 and other reverse transcribing elements are necessarily intronless). They provide data to suggest that the retention of these potentially deleterious splice sites in LINE-1 results from the sites' overlap with an important transcription factor binding site. These spliced LINE-1s (i.e., spliced integrated retrotransposed elements [SpiREs]) lose their ability to replicate, suggesting they are evolutionary dead ends. However, the lethality of this splicing could be an efficient means of blocking continued replication of LINE-1. In this way, the record of inactive LINE-1 sequences in the human genome revealed a new, though infrequent, event in the LINE-1 replication cycle and motivates future studies to test whether splicing might be another weapon in the anti-LINE-1 arsenal of host genomes.

  7. A variant Tc4 transposable element in the nematode C. elegans could encode a novel protein.

    PubMed Central

    Li, W; Shaw, J E

    1993-01-01

    A variant C. elegans Tc4 transposable element, Tc4-rh1030, has been sequenced and is 3483 bp long. The Tc4 element that had been analyzed previously is 1605 bp long, consists of two 774-bp nearly perfect inverted terminal repeats connected by a 57-bp loop, and lacks significant open reading frames. In Tc4-rh1030, by comparison, a 2343-bp novel sequence is present in place of a 477-bp segment in one of the inverted repeats. The novel sequence of Tc4-rh1030 is present about five times per haploid genome and is invariably associated with Tc4 elements; we have used the designation Tc4v to denote this variant subfamily of Tc4 elements. Sequence analysis of three cDNA clones suggests that a Tc4v element contains at least five exons that could encode a novel basic protein of 537 amino acid residues. On northern blots, a 1.6-kb Tc4v-specific transcript was detected in the mutator strain TR679 but not in the wild-type strain N2; Tc4 elements are known to transpose in TR679 but appear to be quiescent in N2. We have analyzed transcripts produced by an unc-33 gene that has the Tc4-rh1030 insertional mutation in its transcribed region; all or almost all of the Tc4v sequence is frequently spliced out of the mutant unc-33 transcripts, sometimes by means of non-consensus splice acceptor sites. Images PMID:8382791

  8. Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)

    PubMed Central

    Andreassen, Rune; Lunner, Sigbjørn; Høyheim, Bjørn

    2009-01-01

    Background Sequencing of the Atlantic salmon genome is now being planned by an international research consortium. Full-length sequenced inserts from cDNAs (FLIcs) are an important tool for correct annotation and clustering of the genomic sequence in any species. The large amount of highly similar duplicate sequences caused by the relatively recent genome duplication in the salmonid ancestor represents a particular challenge for the genome project. FLIcs will therefore be an extremely useful resource for the Atlantic salmon sequencing project. In addition to be helpful in order to distinguish between duplicate genome regions and in determining correct gene structures, FLIcs are an important resource for functional genomic studies and for investigation of regulatory elements controlling gene expression. In contrast to the large number of ESTs available, including the ESTs from 23 developmental and tissue specific cDNA libraries contributed by the Salmon Genome Project (SGP), the number of sequences where the full-length of the cDNA insert has been determined has been small. Results High quality full-length insert sequences from 560 pre-smolt white muscle tissue specific cDNAs were generated, accession numbers [GenBank: BT043497 - BT044056]. Five hundred and ten (91%) of the transcripts were annotated using Gene Ontology (GO) terms and 440 of the FLIcs are likely to contain a complete coding sequence (cCDS). The sequence information was used to identify putative paralogs, characterize salmon Kozak motifs, polyadenylation signal variation and to identify motifs likely to be involved in the regulation of particular genes. Finally, conserved 7-mers in the 3'UTRs were identified, of which some were identical to miRNA target sequences. Conclusion This paper describes the first Atlantic salmon FLIcs from a tissue and developmental stage specific cDNA library. We have demonstrated that many FLIcs contained a complete coding sequence (cCDS). This suggests that the remaining cDNA libraries generated by SGP represent a valuable cCDS FLIc source. The conservation of 7-mers in 3'UTRs indicates that these motifs are functionally important. Identity between some of these 7-mers and miRNA target sequences suggests that they are miRNA targets in Salmo salar transcripts as well. PMID:19878547

  9. Sequence of rat alpha- and gamma-casein mRNAs: evolutionary comparison of the calcium-dependent rat casein multigene family.

    PubMed Central

    Hobbs, A A; Rosen, J M

    1982-01-01

    The complete sequences of rat alpha- and gamma-casein mRNAs have been determined. The 1402-nucleotide alpha- and 864-nucleotide gamma-casein mRNAs both encode 15 amino acid signal peptides and mature proteins of 269 and 164 residues, respectively. Considerable homology between the 5' non-coding regions, and the regions encoding the signal peptides and the phosphorylation sites, in these mRNAs as compared to several other rodent casein mRNAs, was observed. Significant homology was also detected between rat alpha- and bovine alpha s1-casein. Comparison of the rodent and bovine sequences suggests that the caseins evolved at about the time of the appearance of the primitive mammals. This may have occurred by intragenic duplication of a nucleotide sequence encoding a primitive phosphorylation site, -(Ser)n-Glu-Glu-, and intergenic duplication resulting in the small casein multigene family. A unique feature of the rat alpha-casein sequence is an insertion in the coding region containing 10 repeated elements of 18 nucleotides each. This insertion appears to have occurred 7-12 million years ago, just prior to the divergence of rat and mouse. Images PMID:6298707

  10. LINE dancing in the human genome: transposable elements and disease.

    PubMed

    Belancio, Victoria P; Deininger, Prescott L; Roy-Engel, Astrid M

    2009-10-27

    Transposable elements (TEs) have been consistently underestimated in their contribution to genetic instability and human disease. TEs can cause human disease by creating insertional mutations in genes, and also contributing to genetic instability through non-allelic homologous recombination and introduction of sequences that evolve into various cis-acting signals that alter gene expression. Other outcomes of TE activity, such as their potential to cause DNA double-strand breaks or to modulate the epigenetic state of chromosomes, are less fully characterized. The currently active human transposable elements are members of the non-LTR retroelement families, LINE-1, Alu (SINE), and SVA. The impact of germline insertional mutagenesis by TEs is well established, whereas the rate of post-insertional TE-mediated germline mutations and all forms of somatic mutations remain less well quantified. The number of human diseases discovered to be associated with non-allelic homologous recombination between TEs, and particularly between Alu elements, is growing at an unprecedented rate. Improvement in the technology for detection of such events, as well as the mounting interest in the research and medical communities in resolving the underlying causes of the human diseases with unknown etiology, explain this increase. Here, we focus on the most recent advances in understanding of the impact of the active human TEs on the stability of the human genome and its relevance to human disease.

  11. Sequence information signal processor for local and global string comparisons

    DOEpatents

    Peterson, John C.; Chow, Edward T.; Waterman, Michael S.; Hunkapillar, Timothy J.

    1997-01-01

    A sequence information signal processing integrated circuit chip designed to perform high speed calculation of a dynamic programming algorithm based upon the algorithm defined by Waterman and Smith. The signal processing chip of the present invention is designed to be a building block of a linear systolic array, the performance of which can be increased by connecting additional sequence information signal processing chips to the array. The chip provides a high speed, low cost linear array processor that can locate highly similar global sequences or segments thereof such as contiguous subsequences from two different DNA or protein sequences. The chip is implemented in a preferred embodiment using CMOS VLSI technology to provide the equivalent of about 400,000 transistors or 100,000 gates. Each chip provides 16 processing elements, and is designed to provide 16 bit, two's compliment operation for maximum score precision of between -32,768 and +32,767. It is designed to provide a comparison between sequences as long as 4,194,304 elements without external software and between sequences of unlimited numbers of elements with the aid of external software. Each sequence can be assigned different deletion and insertion weight functions. Each processor is provided with a similarity measure device which is independently variable. Thus, each processor can contribute to maximum value score calculation using a different similarity measure.

  12. CACTA-superfamily transposable element is inserted in MYB transcription factor gene of soybean line producing variegated seeds.

    PubMed

    Yan, Fan; Di, Shaokang; Takahashi, Ryoji

    2015-08-01

    The R gene of soybean, presumably encoding a MYB transcription factor, controls seed coat color. The gene consists of multiple alleles, R (black), r-m (black spots and (or) concentric streaks on brown seed), and r (brown seed). This study was conducted to determine the structure of the MYB transcription factor gene in a near-isogenic line (NIL) having r-m allele. PCR amplification of a fragment of the candidate gene Glyma.09G235100 generated a fragment of about 1 kb in the soybean cultivar Clark, whereas a fragment of about 14 kb in addition to fragments of 1 and 1.4 kb were produced in L72-2040, a Clark 63 NIL with the r-m allele. Clark 63 is a NIL of Clark with the rxp and Rps1 alleles. A DNA fragment of 13 060 bp was inserted in the intron of Glyma.09G235100 in L72-2040. The fragment had the CACTA motif at both ends, imperfect terminal inverted repeats (TIR), inverse repetition of short sequence motifs close to the 5' and 3' ends, and a duplication of three nucleotides at the site of integration, indicating that it belongs to a CACTA-superfamily transposable element. We designated the element as Tgm11. Overall nucleotide sequence, motifs of TIR, and subterminal repeats were similar to those of Tgm1 and Tgs1, suggesting that these elements comprise a family.

  13. Integration of narrow-host-range vectors from Escherichia coli into the genomes of amino acid-producing corynebacteria after intergeneric conjugation.

    PubMed

    Mateos, L M; Schäfer, A; Kalinowski, J; Martin, J F; Pühler, A

    1996-10-01

    Conjugative transfer of mobilizable derivatives of the Escherichia coli narrow-host-range plasmids pBR322, pBR325, pACYC177, and pACYC184 from E. coli to species of the gram-positive genera Corynebacterium and Brevibacterium resulted in the integration of the plasmids into the genomes of the recipient bacteria. Transconjugants appeared at low frequencies and reproducibly with a delay of 2 to 3 days compared with matings with replicative vectors. Southern analysis of corynebacterial transconjugants and nucleotide sequences from insertion sites revealed that integration occurs at different locations and that different parts of the vector are involved in the process. Integration is not dependent on indigenous insertion sequence elements but results from recombination between very short homologous DNA segments (8 to 12 bp) present in the vector and in the host DNA. In the majority of the cases (90%), integration led to cointegrate formation, and in some cases, deletions or rearrangements occurred during the recombination event. Insertions were found to be quite stable even in the absence of selective pressure.

  14. Integration of narrow-host-range vectors from Escherichia coli into the genomes of amino acid-producing corynebacteria after intergeneric conjugation.

    PubMed Central

    Mateos, L M; Schäfer, A; Kalinowski, J; Martin, J F; Pühler, A

    1996-01-01

    Conjugative transfer of mobilizable derivatives of the Escherichia coli narrow-host-range plasmids pBR322, pBR325, pACYC177, and pACYC184 from E. coli to species of the gram-positive genera Corynebacterium and Brevibacterium resulted in the integration of the plasmids into the genomes of the recipient bacteria. Transconjugants appeared at low frequencies and reproducibly with a delay of 2 to 3 days compared with matings with replicative vectors. Southern analysis of corynebacterial transconjugants and nucleotide sequences from insertion sites revealed that integration occurs at different locations and that different parts of the vector are involved in the process. Integration is not dependent on indigenous insertion sequence elements but results from recombination between very short homologous DNA segments (8 to 12 bp) present in the vector and in the host DNA. In the majority of the cases (90%), integration led to cointegrate formation, and in some cases, deletions or rearrangements occurred during the recombination event. Insertions were found to be quite stable even in the absence of selective pressure. PMID:8824624

  15. Phylogenetic Conflict in Bears Identified by Automated Discovery of Transposable Element Insertions in Low-Coverage Genomes.

    PubMed

    Lammers, Fritjof; Gallus, Susanne; Janke, Axel; Nilsson, Maria A

    2017-10-01

    Phylogenetic reconstruction from transposable elements (TEs) offers an additional perspective to study evolutionary processes. However, detecting phylogenetically informative TE insertions requires tedious experimental work, limiting the power of phylogenetic inference. Here, we analyzed the genomes of seven bear species using high-throughput sequencing data to detect thousands of TE insertions. The newly developed pipeline for TE detection called TeddyPi (TE detection and discovery for Phylogenetic Inference) identified 150,513 high-quality TE insertions in the genomes of ursine and tremarctine bears. By integrating different TE insertion callers and using a stringent filtering approach, the TeddyPi pipeline produced highly reliable TE insertion calls, which were confirmed by extensive in vitro validation experiments. Analysis of single nucleotide substitutions in the flanking regions of the TEs shows that these substitutions correlate with the phylogenetic signal from the TE insertions. Our phylogenomic analyses show that TEs are a major driver of genomic variation in bears and enabled phylogenetic reconstruction of a well-resolved species tree, despite strong signals for incomplete lineage sorting and introgression. The analyses show that the Asiatic black, sun, and sloth bear form a monophyletic clade, in which phylogenetic incongruence originates from incomplete lineage sorting. TeddyPi is open source and can be adapted to various TE and structural variation callers. The pipeline makes it possible to confidently extract thousands of TE insertions even from low-coverage genomes (∼10×) of nonmodel organisms. This opens new possibilities for biologists to study phylogenies and evolutionary processes as well as rates and patterns of (retro-)transposition and structural variation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  16. Talua SINE biology in the genome of the Reticulitermes subterranean termites (Isoptera, Rhinotermitidae).

    PubMed

    Luchetti, Andrea; Mantovani, Barbara

    2009-12-01

    Studies on transposable elements in termites are of interest because their genome is in a permanent condition of inbreeding. In this situation, an increase in transposon copy number should be mainly due to a Muller's ratchet effect, with selection against deleterious insertions playing a major role. Short INterspersed Elements (SINEs) are non-autonomous retrotransposons, known to be stable components of eukaryotic genomes. The SINE Talua, first isolated from Reticulitermes lucifugus (Rhinotermitidae), is the only mobile element described so far in termites. In the present survey, Talua has been found widespread in the Isoptera order. In comparison with other non-termite SINEs, Talua diversity and distribution in the Reticulitermes genome demonstrate that Talua is an ancient component of termite genome and that it is significantly associated with other repeats. In particular, the element is found to be involved with microsatellite motifs either as their generator or because inserted in their nearby. Further, two new SINEs and a putative retrotranscriptase-like sequence were found linked to Talua. Talua's genomic distribution is discussed in the light of the available models on transposable element dynamics within inbred genomes, also taking into account SINE role as drivers of genetic diversity in counteracting inbreeding depression.

  17. The Tc1/mariner transposable element family shapes genetic variation and gene expression in the protist Trichomonas vaginalis

    PubMed Central

    2014-01-01

    Background Trichomonas vaginalis is the most prevalent non-viral sexually transmitted parasite. Although the protist is presumed to reproduce asexually, 60% of its haploid genome contains transposable elements (TEs), known contributors to genome variability. The availability of a draft genome sequence and our collection of >200 global isolates of T. vaginalis facilitate the study and analysis of TE population dynamics and their contribution to genomic variability in this protist. Results We present here a pilot study of a subset of class II Tc1/mariner TEs that belong to the T. vaginalis Tvmar1 family. We report the genetic structure of 19 Tvmar1 loci, their ability to encode a full-length transposase protein, and their insertion frequencies in 94 global isolates from seven regions of the world. While most of the Tvmar1 elements studied exhibited low insertion frequencies, two of the 19 loci (locus 1 and locus 9) show high insertion frequencies of 1.00 and 0.96, respectively. The genetic structuring of the global populations identified by principal component analysis (PCA) of the Tvmar1 loci is in general agreement with published data based on genotyping, showing that Tvmar1 polymorphisms are a robust indicator of T. vaginalis genetic history. Analysis of expression of 22 genes flanking 13 Tvmar1 loci indicated significantly altered expression of six of the genes next to five Tvmar1 insertions, suggesting that the insertions have functional implications for T. vaginalis gene expression. Conclusions Our study is the first in T. vaginalis to describe Tvmar1 population dynamics and its contribution to genetic variability of the parasite. We show that a majority of our studied Tvmar1 insertion loci exist at very low frequencies in the global population, and insertions are variable between geographical isolates. In addition, we observe that low frequency insertion is related to reduced or abolished expression of flanking genes. While low insertion frequencies might be expected, we identified two Tvmar1 insertion loci that are fixed across global populations. This observation indicates that Tvmar1 insertion may have differing impacts and fitness costs in the host genome and may play varying roles in the adaptive evolution of T. vaginalis. PMID:24834134

  18. GATA: A graphic alignment tool for comparative sequenceanalysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nix, David A.; Eisen, Michael B.

    2005-01-01

    Several problems exist with current methods used to align DNA sequences for comparative sequence analysis. Most dynamic programming algorithms assume that conserved sequence elements are collinear. This assumption appears valid when comparing orthologous protein coding sequences. Functional constraints on proteins provide strong selective pressure against sequence inversions, and minimize sequence duplications and feature shuffling. For non-coding sequences this collinearity assumption is often invalid. For example, enhancers contain clusters of transcription factor binding sites that change in number, orientation, and spacing during evolution yet the enhancer retains its activity. Dotplot analysis is often used to estimate non-coding sequence relatedness. Yet dotmore » plots do not actually align sequences and thus cannot account well for base insertions or deletions. Moreover, they lack an adequate statistical framework for comparing sequence relatedness and are limited to pairwise comparisons. Lastly, dot plots and dynamic programming text outputs fail to provide an intuitive means for visualizing DNA alignments.« less

  19. Divergence of Drosophila melanogaster repeatomes in response to a sharp microclimate contrast in Evolution Canyon, Israel

    PubMed Central

    Kim, Young Bun; Oh, Jung Hun; McIver, Lauren J.; Rashkovetsky, Eugenia; Michalak, Katarzyna; Garner, Harold R.; Kang, Lin; Nevo, Eviatar; Korol, Abraham B.; Michalak, Pawel

    2014-01-01

    Repeat sequences, especially mobile elements, make up large portions of most eukaryotic genomes and provide enormous, albeit commonly underappreciated, evolutionary potential. We analyzed repeatomes of Drosophila melanogaster that have been diverging in response to a microclimate contrast in Evolution Canyon (Mount Carmel, Israel), a natural evolutionary laboratory with two abutting slopes at an average distance of only 200 m, which pose a constant ecological challenge to their local biotas. Flies inhabiting the colder and more humid north-facing slope carried about 6% more transposable elements than those from the hot and dry south-facing slope, in parallel to a suite of other genetic and phenotypic differences between the two populations. Nearly 50% of all mobile element insertions were slope unique, with many of them disrupting coding sequences of genes critical for cognition, olfaction, and thermotolerance, consistent with the observed patterns of thermotolerance differences and assortative mating. PMID:25006263

  20. Divergence of Drosophila melanogaster repeatomes in response to a sharp microclimate contrast in Evolution Canyon, Israel.

    PubMed

    Kim, Young Bun; Oh, Jung Hun; McIver, Lauren J; Rashkovetsky, Eugenia; Michalak, Katarzyna; Garner, Harold R; Kang, Lin; Nevo, Eviatar; Korol, Abraham B; Michalak, Pawel

    2014-07-22

    Repeat sequences, especially mobile elements, make up large portions of most eukaryotic genomes and provide enormous, albeit commonly underappreciated, evolutionary potential. We analyzed repeatomes of Drosophila melanogaster that have been diverging in response to a microclimate contrast in Evolution Canyon (Mount Carmel, Israel), a natural evolutionary laboratory with two abutting slopes at an average distance of only 200 m, which pose a constant ecological challenge to their local biotas. Flies inhabiting the colder and more humid north-facing slope carried about 6% more transposable elements than those from the hot and dry south-facing slope, in parallel to a suite of other genetic and phenotypic differences between the two populations. Nearly 50% of all mobile element insertions were slope unique, with many of them disrupting coding sequences of genes critical for cognition, olfaction, and thermotolerance, consistent with the observed patterns of thermotolerance differences and assortative mating.

  1. Using SINEs to probe ancient explosive speciation: "hidden" radiation of African cichlids?

    PubMed

    Terai, Yohey; Takahashi, Kazuhiko; Nishida, Mutsumi; Sato, Tetsu; Okada, Norihiro

    2003-06-01

    Cichlid fishes of the east African Great Lakes represent a paradigm of adaptive radiation. We conducted a phylogenetic analysis of cichlids including pan-African and west African species by using insertion patterns of short interspersed elements (SINEs) at orthologous loci. The monophyly of the east African cichlids was consistently supported by seven independent insertions of SINE sequences that are uniquely shared by these species. In addition, data from four other loci indicated that the genera Tilapia (pan-African) and Steatocranus (west African) are the closest relatives to east African cichlids. However, relationships among Tilapia, Steatocranus, and the east African clade were ambiguous because of incongruencies among topologies suggested by insertion patterns of SINEs at six other loci. One plausible explanation for this phenomenon is incomplete lineage sorting of alleles containing or missing a SINE insertion at these loci during ancestral speciation. Such incomplete sorting may have taken place earlier than 14 MYA, followed by random and stochastic fixation of the alleles in subsequent lineages. These observations prompted us to consider the possibility that cichlid speciation occurred at an accelerated rate during this period when the African Great Lakes did not exist. The SINE method could be useful for detecting ancient exclusive speciation events that tend to remain hidden during conventional sequence analyses because of accumulated point mutations.

  2. Exonization of an Intronic LINE-1 Element Causing Becker Muscular Dystrophy as a Novel Mutational Mechanism in Dystrophin Gene

    PubMed Central

    Gonçalves, Ana; Coelho, Teresa; Melo-Pires, Manuel; Sousa, Mário

    2017-01-01

    A broad mutational spectrum in the dystrophin (DMD) gene, from large deletions/duplications to point mutations, causes Duchenne/Becker muscular dystrophy (D/BMD). Comprehensive genotyping is particularly relevant considering the mutation-centered therapies for dystrophinopathies. We report the genetic characterization of a patient with disease onset at age 13 years, elevated creatine kinase levels and reduced dystrophin labeling, where multiplex-ligation probe amplification (MLPA) and genomic sequencing failed to detect pathogenic variants. Bioinformatic, transcriptomic (real time PCR, RT-PCR), and genomic approaches (Southern blot, long-range PCR, and single molecule real-time sequencing) were used to characterize the mutation. An aberrant transcript was identified, containing a 103-nucleotide insertion between exons 51 and 52, with no similarity with the DMD gene. This corresponded to the partial exonization of a long interspersed nuclear element (LINE-1), disrupting the open reading frame. Further characterization identified a complete LINE-1 (~6 kb with typical hallmarks) deeply inserted in intron 51. Haplotyping and segregation analysis demonstrated that the mutation had a de novo origin. Besides underscoring the importance of mRNA studies in genetically unsolved cases, this is the first report of a disease-causing fully intronic LINE-1 element in DMD, adding to the diversity of mutational events that give rise to D/BMD. PMID:28972564

  3. Recombination rate and the distribution of transposable elements in the Drosophila melanogaster genome.

    PubMed

    Rizzon, Carène; Marais, Gabriel; Gouy, Manolo; Biémont, Christian

    2002-03-01

    We analyzed the distribution of 54 families of transposable elements (TEs; transposons, LTR retrotransposons, and non-LTR retrotransposons) in the chromosomes of Drosophila melanogaster, using data from the sequenced genome. The density of LTR and non-LTR retrotransposons (RNA-based elements) was high in regions with low recombination rates, but there was no clear tendency to parallel the recombination rate. However, the density of transposons (DNA-based elements) was significantly negatively correlated with recombination rate. The accumulation of TEs in regions of reduced recombination rate is compatible with selection acting against TEs, as selection is expected to be weaker in regions with lower recombination. The differences in the relationship between recombination rate and TE density that exist between chromosome arms suggest that TE distribution depends on specific characteristics of the chromosomes (chromatin structure, distribution of other sequences), the TEs themselves (transposition mechanism), and the species (reproductive system, effective population size, etc.), that have differing influences on the effect of natural selection acting against the TE insertions.

  4. Structure and expression of the attacin genes in Hyalophora cecropia.

    PubMed

    Sun, S C; Lindström, I; Lee, J Y; Faye, I

    1991-02-26

    To study the regulation of the immune genes in insects, we have cloned and sequenced the attacin gene locus of the giant silk moth Hyalophora cecropia. The locus contains one acidic and one basic attacin gene as well as two pseudogenes, which are remnants of basic attacin genes. A small insertion element was found within the locus. The two functional attacin genes are transcribed in opposite directions and have two introns inserted at homologous positions. A common sequence, GGGGATTCCT, is found at nucleotide position -48 in the acidic gene and at nucleotide position -58 in the basic gene. Interestingly, this decanucleotide is similar to the consensus of the NF-k B-binding site. Expression studies revealed that both attacins are strongly induced by phorbol 12-myristate 13-acetate, lipopolysaccharide and bacteria. However, only the acidic attacin gene showed a clear response to injury.

  5. A Versatile Transposon-Based Activation Tag Vector System for Functional Genomics in Cereals and Other Monocot Plants1[OA

    PubMed Central

    Qu, Shaohong; Desai, Aparna; Wing, Rod; Sundaresan, Venkatesan

    2008-01-01

    Transposon insertional mutagenesis is an effective alternative to T-DNA mutagenesis when transformation through tissue culture is inefficient as is the case for many crop species. When used as activation tags, transposons can be exploited to generate novel gain-of-function phenotypes without transformation and are of particular value in the study of polyploid plants where gene knockouts will not have phenotypes. We have developed an in cis-activation-tagging Ac-Ds transposon system in which a T-DNA vector carries a Dissociation (Ds) element containing 4× cauliflower mosaic virus enhancers along with the Activator (Ac) transposase gene. Stable Ds insertions were selected using green fluorescent protein and red fluorescent protein genes driven by promoters that are functional in maize (Zea mays) and rice (Oryza sativa). The system has been tested in rice, where 638 stable Ds insertions were selected from an initial set of 26 primary transformants. By analysis of 311 flanking sequences mapped to the rice genome, we could demonstrate the wide distribution of the elements over the rice chromosomes. Enhanced expression of rice genes adjacent to Ds insertions was detected in the insertion lines using semiquantitative reverse transcription-PCR method. The in cis-two-element vector system requires minimal number of primary transformants and eliminates the need for crossing, while the use of fluorescent markers instead of antibiotic or herbicide resistance increases the applicability to other plants and eliminates problems with escapes. Because Ac-Ds has been shown to transpose widely in the plant kingdom, the activation vector system developed in this study should be of utility more generally to other monocots. PMID:17993541

  6. Searching for nuclear export elements in hepatitis D virus RNA.

    PubMed

    Freitas, Natália; Cunha, Celso

    2013-08-12

    To search for the presence of cis elements in hepatitis D virus (HDV) genomic and antigenomic RNA capable of promoting nuclear export. We made use of a well characterized chloramphenicol acetyl-transferase reporter system based on plasmid pDM138. Twenty cDNA fragments corresponding to different HDV genomic and antigenomic RNA sequences were inserted in plasmid pDM138, and used in transfection experiments in Huh7 cells. The relative amounts of HDV RNA in nuclear and cytoplasmic fractions were then determined by real-time polymerase chain reaction and Northern blotting. The secondary structure of the RNA sequences that displayed nuclear export ability was further predicted using a web interface. Finally, the sensitivity to leptomycin B was assessed in order to investigate possible cellular pathways involved in HDV RNA nuclear export. Analysis of genomic RNA sequences did not allow identifying an unequivocal nuclear export element. However, two regions were found to promote the export of reporter mRNAs with efficiency higher than the negative controls albeit lower than the positive control. These regions correspond to nucleotides 266-489 and 584-920, respectively. In addition, when analyzing antigenomic RNA sequences a nuclear export element was found in positions 214-417. Export mediated by the nuclear export element of HDV antigenomic RNA is sensitive to leptomycin B suggesting a possible role of CRM1 in this transport pathway. A cis-acting nuclear export element is present in nucleotides 214-417 of HDV antigenomic RNA.

  7. A mobile threat to genome stability: The impact of non-LTR retrotransposons upon the human genome.

    PubMed

    Konkel, Miriam K; Batzer, Mark A

    2010-08-01

    It is now commonly agreed that the human genome is not the stable entity originally presumed. Deletions, duplications, inversions, and insertions are common, and contribute significantly to genomic structural variations (SVs). Their collective impact generates much of the inter-individual genomic diversity observed among humans. Not only do these variations change the structure of the genome; they may also have functional implications, e.g. altered gene expression. Some SVs have been identified as the cause of genetic disorders, including cancer predisposition. Cancer cells are notorious for their genomic instability, and often show genomic rearrangements at the microscopic and submicroscopic level to which transposable elements (TEs) contribute. Here, we review the role of TEs in genome instability, with particular focus on non-LTR retrotransposons. Currently, three non-LTR retrotransposon families - long interspersed element 1 (L1), SVA (short interspersed element (SINE-R), variable number of tandem repeats (VNTR), and Alu), and Alu (a SINE) elements - mobilize in the human genome, and cause genomic instability through both insertion- and post-insertion-based mutagenesis. Due to the abundance and high sequence identity of TEs, they frequently mislead the homologous recombination repair pathway into non-allelic homologous recombination, causing deletions, duplications, and inversions. While less comprehensively studied, non-LTR retrotransposon insertions and TE-mediated rearrangements are probably more common in cancer cells than in healthy tissue. This may be at least partially attributed to the commonly seen global hypomethylation as well as general epigenetic dysfunction of cancer cells. Where possible, we provide examples that impact cancer predisposition and/or development. Copyright © 2010 Elsevier Ltd. All rights reserved.

  8. The maternal-effect, selfish genetic element Medea is associated with a composite Tc1 transposon.

    PubMed

    Lorenzen, Marcé D; Gnirke, Andreas; Margolis, Jonathan; Garnes, Jeffrey; Campbell, Margie; Stuart, Jeffrey J; Aggarwal, Rajat; Richards, Stephen; Park, Yoonseong; Beeman, Richard W

    2008-07-22

    Maternal-Effect Dominant Embryonic Arrest ("Medea") factors are selfish nuclear elements that combine maternal-lethal and zygotic-rescue activities to gain a postzygotic survival advantage. We show that Medea(1) activity in Tribolium castaneum is associated with a composite Tc1 transposon inserted just downstream of the neurotransmitter reuptake symporter bloated tubules (blot), whose Drosophila ortholog has both maternal and zygotic functions. The 21.5-kb insertion contains defective copies of elongation initiation factor-3, ATP synthase subunit C, and an RNaseD-related gene, as well as a potentially intact copy of a prokaryotic DUF1703 gene. Sequence comparisons suggest that the current distribution of Medea(1) reflects global emanation after a single transpositional event in recent evolutionary time. The Medea system in Tribolium represents an unusual type of intragenomic conflict and could provide a useful vehicle for driving desirable genes into populations.

  9. The maternal-effect, selfish genetic element Medea is associated with a composite Tc1 transposon

    PubMed Central

    Lorenzen, Marcé D.; Gnirke, Andreas; Margolis, Jonathan; Garnes, Jeffrey; Campbell, Margie; Stuart, Jeffrey J.; Aggarwal, Rajat; Richards, Stephen; Park, Yoonseong; Beeman, Richard W.

    2008-01-01

    Maternal-Effect Dominant Embryonic Arrest (“Medea”) factors are selfish nuclear elements that combine maternal-lethal and zygotic-rescue activities to gain a postzygotic survival advantage. We show that Medea1 activity in Tribolium castaneum is associated with a composite Tc1 transposon inserted just downstream of the neurotransmitter reuptake symporter bloated tubules (blot), whose Drosophila ortholog has both maternal and zygotic functions. The 21.5-kb insertion contains defective copies of elongation initiation factor-3, ATP synthase subunit C, and an RNaseD-related gene, as well as a potentially intact copy of a prokaryotic DUF1703 gene. Sequence comparisons suggest that the current distribution of Medea1 reflects global emanation after a single transpositional event in recent evolutionary time. The Medea system in Tribolium represents an unusual type of intragenomic conflict and could provide a useful vehicle for driving desirable genes into populations. PMID:18621706

  10. Continuous Influx of Genetic Material from Host to Virus Populations

    PubMed Central

    Gilbert, Clément; Peccoud, Jean; Chateigner, Aurélien; Moumen, Bouziane

    2016-01-01

    Many genes of large double-stranded DNA viruses have a cellular origin, suggesting that host-to-virus horizontal transfer (HT) of DNA is recurrent. Yet, the frequency of these transfers has never been assessed in viral populations. Here we used ultra-deep DNA sequencing of 21 baculovirus populations extracted from two moth species to show that a large diversity of moth DNA sequences (n = 86) can integrate into viral genomes during the course of a viral infection. The majority of the 86 different moth DNA sequences are transposable elements (TEs, n = 69) belonging to 10 superfamilies of DNA transposons and three superfamilies of retrotransposons. The remaining 17 sequences are moth sequences of unknown nature. In addition to bona fide DNA transposition, we uncover microhomology-mediated recombination as a mechanism explaining integration of moth sequences into viral genomes. Many sequences integrated multiple times at multiple positions along the viral genome. We detected a total of 27,504 insertions of moth sequences in the 21 viral populations and we calculate that on average, 4.8% of viruses harbor at least one moth sequence in these populations. Despite this substantial proportion, no insertion of moth DNA was maintained in any viral population after 10 successive infection cycles. Hence, there is a constant turnover of host DNA inserted into viral genomes each time the virus infects a moth. Finally, we found that at least 21 of the moth TEs integrated into viral genomes underwent repeated horizontal transfers between various insect species, including some lepidopterans susceptible to baculoviruses. Our results identify host DNA influx as a potent source of genetic diversity in viral populations. They also support a role for baculoviruses as vectors of DNA HT between insects, and call for an evaluation of possible gene or TE spread when using viruses as biopesticides or gene delivery vectors. PMID:26829124

  11. Continuous Influx of Genetic Material from Host to Virus Populations.

    PubMed

    Gilbert, Clément; Peccoud, Jean; Chateigner, Aurélien; Moumen, Bouziane; Cordaux, Richard; Herniou, Elisabeth A

    2016-02-01

    Many genes of large double-stranded DNA viruses have a cellular origin, suggesting that host-to-virus horizontal transfer (HT) of DNA is recurrent. Yet, the frequency of these transfers has never been assessed in viral populations. Here we used ultra-deep DNA sequencing of 21 baculovirus populations extracted from two moth species to show that a large diversity of moth DNA sequences (n = 86) can integrate into viral genomes during the course of a viral infection. The majority of the 86 different moth DNA sequences are transposable elements (TEs, n = 69) belonging to 10 superfamilies of DNA transposons and three superfamilies of retrotransposons. The remaining 17 sequences are moth sequences of unknown nature. In addition to bona fide DNA transposition, we uncover microhomology-mediated recombination as a mechanism explaining integration of moth sequences into viral genomes. Many sequences integrated multiple times at multiple positions along the viral genome. We detected a total of 27,504 insertions of moth sequences in the 21 viral populations and we calculate that on average, 4.8% of viruses harbor at least one moth sequence in these populations. Despite this substantial proportion, no insertion of moth DNA was maintained in any viral population after 10 successive infection cycles. Hence, there is a constant turnover of host DNA inserted into viral genomes each time the virus infects a moth. Finally, we found that at least 21 of the moth TEs integrated into viral genomes underwent repeated horizontal transfers between various insect species, including some lepidopterans susceptible to baculoviruses. Our results identify host DNA influx as a potent source of genetic diversity in viral populations. They also support a role for baculoviruses as vectors of DNA HT between insects, and call for an evaluation of possible gene or TE spread when using viruses as biopesticides or gene delivery vectors.

  12. Impact of the excision of an ancient repeat insertion on Rickettsia conorii guanylate kinase activity.

    PubMed

    Abergel, Chantal; Blanc, Guillaume; Monchois, Vincent; Renesto, Patricia; Sigoillot, Cécile; Ogata, Hiroyuki; Raoult, Didier; Claverie, Jean-Michel

    2006-11-01

    The genomic sequencing of Rickettsia conorii revealed a new family of Rickettsia-specific palindromic elements (RPEs) capable of in-frame insertion in preexisting open reading frames (ORFs). Many of these altered ORFs correspond to proteins with well-characterized or essential functions in other microorganisms. Previous experiments indicated that RPE-containing genes are normally transcribed and that no excision of the repeat occurs at the mRNA level. Using mass spectrometry, we now confirmed the retention of the RPE-derived amino acid residues in 4 proteins successfully expressed in Escherichia coli, raising the general question of the consequences of this common insertion event on the fitness of Rickettsia enzymes. The predicted guanylate kinase activity of the R. conorii gmk gene product was measured both on the RPE-containing and RPE-excised recombinant proteins. We show that the 2 proteins are active but exhibit substantial differences in their affinity for adenosine triphosphate, guanosine monophosphate, and catalytic constants. The distribution of the RPEgmk insert among Rickettsia species indicates that the insertion event is ancient and occurred after the divergence of Rickettsia felis and R. conorii but before that of Rickettsia helvetica and R. conorii. We found no evidence that the gmk gene fixed adaptive changes to compensate the RPE peptide insertion. Furthermore, the analysis of the rates of divergence in 23 RPE-containing genes indicates that coding RPE repeats tend to evolve under weak selective constraint, at a rate similar to intergenic noncoding RPE sequences. Altogether, these results suggest that the insertion of RPE-encoded "selfish peptides," although respecting the original fold and activity of the host proteins, might be slightly detrimental to the enzyme efficiency within limits tolerable for slow-growing intracellular parasites such as Rickettsia.

  13. Chemical sensor

    NASA Technical Reports Server (NTRS)

    Rauh, R. David (Inventor)

    1990-01-01

    A sensor for detecting a chemical substance includes an insertion element having a structure which enables insertion of the chemical substance with a resulting change in the bulk electrical characteristics of the insertion element under conditions sufficient to permit effective insertion; the change in the bulk electrical characteristics of the insertion element is detected as an indication of the presence of the chemical substance.

  14. A single gene mutation that increases maize seed weight

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Giroux, M.J.; Shaw, J.; Hannah, L.C.

    1996-06-11

    The maize endosperm-specific gene shrunken2 (Sh2) encodes the large subunit of the heterotetrameric starch synthetic enzyme adenosine diphosphoglucose pyrophosphorylase (AGP; EC 2.7.7.27). Here we exploit an in vivo, site-specific mutagenesis system to create short insertion mutations in a region of the gene known to be involved in the allosteric regulation of AGP. The site-specific mutagen is the transposable element dissociation (Ds). Approximately one-third (8 of 23) of the germinal revertants sequenced restored the wild-type sequence, whereas the remaining revertants contained insertions of 3 or 6 bp. All revertants retained the original reading frame 3 feet to the insertion site andmore » involved the addition of tyrosine and/or serine. Each insertion revertant reduced total AGP activity and the amount of the SH2 protein. The revertant containing additional tyrosine and serine residues increased seed weight 11-18% without increasing or decreasing the percentage of starch. Other insertion revertants lacking an additional serine reduced seed weight. Reduced sensitivity to phosphate, a long-known inhibitor of AGP, was found in the high seed-weight revertant. This alteration is likely universally important since insertion of tyrosine and serine in the potato large subunit of AGP at the comparable position and expression in Escherichia coli also led to a phosphate-insensitive enzyme. These results show that single gene mutations giving rise to increased seed weight, and therefore perhaps yield, are clearly possible in a plant with a long history of intensive and successful breeding efforts. 20 refs., 5 figs., 5 tabs.« less

  15. A bacterial genome in transition - an exceptional enrichment of IS elements but lack of evidence for recent transposition in the symbiont Amoebophilus asiaticus

    PubMed Central

    2011-01-01

    Background Insertion sequence (IS) elements are important mediators of genome plasticity and are widespread among bacterial and archaeal genomes. The 1.88 Mbp genome of the obligate intracellular amoeba symbiont Amoebophilus asiaticus contains an unusually large number of transposase genes (n = 354; 23% of all genes). Results The transposase genes in the A. asiaticus genome can be assigned to 16 different IS elements termed ISCaa1 to ISCaa16, which are represented by 2 to 24 full-length copies, respectively. Despite this high IS element load, the A. asiaticus genome displays a GC skew pattern typical for most bacterial genomes, indicating that no major rearrangements have occurred recently. Additionally, the high sequence divergence of some IS elements, the high number of truncated IS element copies (n = 143), as well as the absence of direct repeats in most IS elements suggest that the IS elements of A. asiaticus are transpositionally inactive. Although we could show transcription of 13 IS elements, we did not find experimental evidence for transpositional activity, corroborating our results from sequence analyses. However, we detected contiguous transcripts between IS elements and their downstream genes at nine loci in the A. asiaticus genome, indicating that some IS elements influence the transcription of downstream genes, some of which might be important for host cell interaction. Conclusions Taken together, the IS elements in the A. asiaticus genome are currently in the process of degradation and largely represent reflections of the evolutionary past of A. asiaticus in which its genome was shaped by their activity. PMID:21943072

  16. Localization of Action of the Is50-Encoded Transposase Protein

    PubMed Central

    Phadnis, Suhas H.; Sasakawa, Chihiro; Berg, Douglas E.

    1986-01-01

    The movement of the bacterial insertion sequence IS50 and of composite elements containing direct terminal repeats of IS50 involves the two ends of IS50, designated O (outside) and I (inside), which are weakly matched in DNA sequence, and an IS50 encoded protein, transposase, which recognizes the O and I ends and acts preferentially in cis. Previous data had suggested that, initially, transposase interacts preferentially with the O end sequence and then, in a second step, with either an O or an I end. To better understand the cis action of transposase and how IS50 ends are selected, we generated a series of composite transposons which contain direct repeats of IS50 elements. In each transposon, one IS50 element encoded transposase (tnp +), and the other contained a null (tnp-) allele. In each of the five sets of composite transposons studied, the transposon for which the tnp+ IS50 element contained its O end was more active than a complementary transposon for which the tnp - IS50 element contained its O end. This pattern of O end use suggests models in which the cis action of transposase and its choice of ends is determined by protein tracking along DNA molecules. PMID:3007274

  17. RNA motif search with data-driven element ordering.

    PubMed

    Rampášek, Ladislav; Jimenez, Randi M; Lupták, Andrej; Vinař, Tomáš; Brejová, Broňa

    2016-05-18

    In this paper, we study the problem of RNA motif search in long genomic sequences. This approach uses a combination of sequence and structure constraints to uncover new distant homologs of known functional RNAs. The problem is NP-hard and is traditionally solved by backtracking algorithms. We have designed a new algorithm for RNA motif search and implemented a new motif search tool RNArobo. The tool enhances the RNAbob descriptor language, allowing insertions in helices, which enables better characterization of ribozymes and aptamers. A typical RNA motif consists of multiple elements and the running time of the algorithm is highly dependent on their ordering. By approaching the element ordering problem in a principled way, we demonstrate more than 100-fold speedup of the search for complex motifs compared to previously published tools. We have developed a new method for RNA motif search that allows for a significant speedup of the search of complex motifs that include pseudoknots. Such speed improvements are crucial at a time when the rate of DNA sequencing outpaces growth in computing. RNArobo is available at http://compbio.fmph.uniba.sk/rnarobo .

  18. Large diversity of the piggyBac-like elements in the genome of Tribolium castaneum

    PubMed Central

    Wang, Jianjun; Du, Yuzhou; Wang, Suzhi; Brown, Sue; Park, Yoonseong

    2011-01-01

    The piggyBac transposable element, originally discovered in the cabbage looper, Trichoplusia ni, has been widely used in insect transgenesis including the red flour beetle Tribolium castaneum. We surveyed piggyBac-like (PLE) sequences in the genome of Tribolium castaneum by homology searches using as queries the diverse PLE sequences that have been described previously. The search yielded a total of 32 piggyBac-like elements (TcPLEs) which were classified into 14 distinct groups. Most of the TcPLEs contain defective functional motifs in that they are lacking inverted terminal repeats or have disrupted open reading frames. Only one single copy of TcPLE1 appears to be intact with imperfect 16 bp inverted terminal repeats flanking an open reading frame encoding a transposase of 571 amino acid residues. Many copies of TcPLEs were found to be inserted into or close to other transposon-like sequences. This large diversity of TcPLEs with generally low copy numbers suggests multiple invasions of the TcPLEs over a long evolutionary time without extensive multiplications or occurrence of rapid loss of TcPLEs copies. PMID:18342253

  19. Interchromosomal recombination in Zea mays.

    PubMed Central

    Hu, W; Timmermans, M C; Messing, J

    1998-01-01

    A new allele of the 27-kD zein locus in maize has been generated by interchromosomal recombination between chromosomes of two different inbred lines. A continuous patch of at least 11,817 bp of inbred W64A, containing the previously characterized Ra allele of the 27-kD zein gene, has been inserted into the genome of A188 by a single crossover. While both junction sequences are conserved, sequences of the two homologs between these junctions differ considerably. W64A contains the 7313-bp-long retrotransposon, Zeon-1. A188 contains a second copy of the 27-kD zein gene and a 2-kb repetitive element. Therefore, recombination results in a 7.3-kb insertion and a 14-kb deletion compared to the original S+A188 allele. If nonpairing sequences are looped out, 206 single base changes, frequently clustered, are present. The structure of this allele may explain how a recently discovered example of somatic recombination occurred in an A188/W64A hybrid. This would indicate that despite these sequence differences, pairing between these alleles could occur early during plant development. Therefore, such a somatically derived chimeric chromosome can also be heritable and give rise to new alleles. PMID:9799274

  20. De novo insertion of an intron into the mammalian sex determining gene, SRY

    PubMed Central

    O’Neill, Rachel J. Waugh; Brennan, Francine E.; Delbridge, Margaret L.; Crozier, Ross H.; Graves, Jennifer A. Marshall

    1998-01-01

    Two theories have been proposed to explain the evolution of introns within eukaryotic genes. The introns early theory, or “exon theory of genes,” proposes that introns are ancient and that recombination within introns provided new exon structure, and thus new genes. The introns late theory, or “insertional theory of introns,” proposes that ancient genes existed as uninterrupted exons and that introns have been introduced during the course of evolution. There is still controversy as to how intron–exon structure evolved and whether the majority of introns are ancient or novel. Although there is extensive evidence in support of the introns early theory, phylogenetic comparisons of several genes indicate recent gain and loss of introns within these genes. However, no example has been shown of a protein coding gene, intronless in its ancestral form, which has acquired an intron in a derived form. The mammalian sex determining gene, SRY, is intronless in all mammals studied to date, as is the gene from which it recently evolved. However, we report here comparisons of genomic and cDNA sequences that now provide evidence of a de novo insertion of an intron into the SRY gene of dasyurid marsupials. This recently (approximately 45 million years ago) inserted sequence is not homologous with known transposable elements. Our data demonstrate that introns may be inserted as spliced units within a developmentally crucial gene without disrupting its function. PMID:9465071

  1. Effects of Single P-Element Insertions on Bristle Number and Viability in Drosophila Melanogaster

    PubMed Central

    Lyman, R. F.; Lawrence, F.; Nuzhdin, S. V.; Mackay, TFC.

    1996-01-01

    Single P-element mutagenesis was used to construct 1094 lines with P[lArB] inserts on all three major chromosomes in an isogenic background previously free of P elements. The effects of insertions on bristle number and on viability were assessed by comparison to 392 control lines. The variance and effects of P-element inserts on bristle number and viability were larger than those inferred from spontaneous mutations. The distributions of effects on bristle number were symmetrical and highly leptokurtic, such that a few inserts with large effects caused most of the increase in variance. The distribution of effects on viability were negatively skewed and platykurtic. On average, the effects of P-element insertions on bristle number were partly recessive and on viability were completely recessive. P-element inserts with large effects on bristle number tended to have reduced viability, but the correlation between the absolute value of the effects on bristle number and on viability was not strong. Fifty P-element inserts tagging quantitative trait loci (QTLs) with large effects on bristle number were mapped cytogenetically. Two P-element-induced scabrous alleles and five extramacrochaetae alleles were generated. Single P-element mutagenesis is a powerful method for identifying QTLs at the level of genetic locus. PMID:8722781

  2. Effects of single P-element insertions on bristle number and viability in Drosophila melanogaster.

    PubMed

    Lyman, R F; Lawrence, F; Nuzhdin, S V; Mackay, T F

    1996-05-01

    Single P-element mutagenesis was used to construct 1094 lines with P[lArB] inserts on all three major chromosomes in an isogenic background previously free of P elements. The effects of insertions on bristle number and on viability were assessed by comparison to 392 control lines. The variance and effects of P-element inserts on bristle number and viability were larger than those inferred from spontaneous mutations. The distributions of effects on bristle number were symmetrical and highly leptokurtic, such that a few inserts with large effects caused most of the increase in variance. The distribution of effects on viability were negatively skewed and platykurtic. On average, the effects of P-element insertions on bristle number were partly recessive and on viability were completely recessive. P-element inserts with large effects on bristle number tended to have reduced viability, but the correlation between the absolute value of the effects on bristle number and on viability was not strong. Fifty P-element inserts tagging quantitative trait loci (QTLs) with large effects on bristle number were mapped cytogenetically. Two P-element-induced scabrous alleles and five extramacrochaetae alleles were generated. Single P-element mutagenesis is a powerful method for identifying QTLs at the level of genetic locus.

  3. Identification of Genomic Insertion and Flanking Sequence of G2-EPSPS and GAT Transgenes in Soybean Using Whole Genome Sequencing Method.

    PubMed

    Guo, Bingfu; Guo, Yong; Hong, Huilong; Qiu, Li-Juan

    2016-01-01

    Molecular characterization of sequence flanking exogenous fragment insertion is essential for safety assessment and labeling of genetically modified organism (GMO). In this study, the T-DNA insertion sites and flanking sequences were identified in two newly developed transgenic glyphosate-tolerant soybeans GE-J16 and ZH10-6 based on whole genome sequencing (WGS) method. More than 22.4 Gb sequence data (∼21 × coverage) for each line was generated on Illumina HiSeq 2500 platform. The junction reads mapped to boundaries of T-DNA and flanking sequences in these two events were identified by comparing all sequencing reads with soybean reference genome and sequence of transgenic vector. The putative insertion loci and flanking sequences were further confirmed by PCR amplification, Sanger sequencing, and co-segregation analysis. All these analyses supported that exogenous T-DNA fragments were integrated in positions of Chr19: 50543767-50543792 and Chr17: 7980527-7980541 in these two transgenic lines. Identification of genomic insertion sites of G2-EPSPS and GAT transgenes will facilitate the utilization of their glyphosate-tolerant traits in soybean breeding program. These results also demonstrated that WGS was a cost-effective and rapid method for identifying sites of T-DNA insertions and flanking sequences in soybean.

  4. The end of the LINE?: lack of recent L1 activity in a group of South American rodents.

    PubMed Central

    Casavant, N C; Scott, L; Cantrell, M A; Wiggins, L E; Baker, R J; Wichman, H A

    2000-01-01

    L1s (LINE-1: Long Interspersed Nuclear Element 1) are present in all mammals examined to date. They occur in both placental mammals and marsupials and thus are thought to have been present in the genome prior to the mammalian radiation. This unusual conservation of a transposable element family for over 100 million years has led to speculation that these elements provide an advantage to the genomes they inhabit. We have recently identified a group of South American rodents, including rice rats (Oryzomys), in which L1s appear to be quiescent or extinct. Several observations support this conclusion. First, genomic Southern blot analysis fails to reveal genus-specific bands in Oryzomys. Second, we were unable to find recently inserted elements. Procedures to enrich for young elements did not yield any with an intact open reading frame for reverse transcriptase; all elements isolated had numerous insertions, deletions, and stop codons. Phylogenetic analysis failed to yield species-specific clusters among the L1 elements isolated, and all Oryzomys sequences had numerous private mutations. Finally, in situ hybridization of L1 to Oryzomys chromosomes failed to reveal the characteristic L1 distribution in Oryzomys with either a homologous or heterologous probe. Thus, Oryzomys is a viable candidate for L1 extinction from a mammalian host. PMID:10747071

  5. Transposable elements are enriched within or in close proximity to xenobiotic-metabolizing cytochrome P450 genes

    PubMed Central

    Chen, Song; Li, Xianchun

    2007-01-01

    Background Transposons, i.e. transposable elements (TEs), are the major internal spontaneous mutation agents for the variability of eukaryotic genomes. To address the general issue of whether transposons mediate genomic changes in environment-adaptation genes, we scanned two alleles per each of the six xenobiotic-metabolizing Helicoverpa zea cytochrome P450 loci, including CYP6B8, CYP6B27, CYP321A1, CYP321A2, CYP9A12v3 and CYP9A14, for the presence of transposon insertions by genome walking and sequence analysis. We also scanned thirteen Drosophila melanogaster P450s genes for TE insertions by in silico mapping and literature search. Results Twelve novel transposons, including LINEs (long interspersed nuclear elements), SINEs (short interspersed nuclear elements), MITEs (miniature inverted-repeat transposable elements), one full-length transib-like transposon, and one full-length Tcl-like DNA transpson, are identified from the alleles of the six H. zea P450 genes. The twelve transposons are inserted into the 5'flanking region, 3'flanking region, exon, or intron of the six environment-adaptation P450 genes. In D. melanogaster, seven out of the eight Drosophila P450s (CYP4E2, CYP6A2, CYP6A8, CYP6A9, CYP6G1, CYP6W1, CYP12A4, CYP12D1) implicated in insecticide resistance are associated with a variety of transposons. By contrast, all the five Drosophila P450s (CYP302A1, CYP306A1, CYP307A1, CYP314A1 and CYP315A1) involved in ecdysone biosynthesis and developmental regulation are free of TE insertions. Conclusion These results indicate that TEs are selectively retained within or in close proximity to xenobiotic-metabolizing P450 genes. PMID:17381843

  6. Albinism due to transposable element insertion in fish.

    PubMed

    Koga, A; Hori, H

    1997-12-01

    The i locus of the medaka fish, Oryzias latipes, is responsible for tyrosinase expression, and several mutant alleles have been identified. The genotype i1/i1 exhibits a complete albino phenotype, having pale orange-red skin and red eyes. This mutant lacks in vivo tyrosinase activity. The genotype i4/i4, on the other hand, shows a quasi-albino phenotype with skin as bright as that of i1/i1 but with red-wine-colored eyes. At the light microscope level, reduced pigmentation is observed both in the skin and eyes of this mutant. The tyrosinase genes for the i1 and the i4 alleles were cloned and sequenced, and compared with that of the wild-type tyrosinase gene. The i1 allele was found to contain a 1.9-kb transposable element in the 1st exon, and the i4 allele was found to contain a 4.7-kb transposable element in the 5th exon. Both i1 and i4 are alleles that were found in a commercial breeding population. The insertion of a transposable element thus appears to constitute a natural cause of mutations that cause albinism in this organism.

  7. The non-LTR retrotransposon R2 in termites (Insecta, Isoptera): characterization and dynamics.

    PubMed

    Ghesini, Silvia; Luchetti, Andrea; Marini, Mario; Mantovani, Barbara

    2011-03-01

    The full-length element of the non-LTR retrotransposon R2 is here characterized in three European isopteran species: the more primitive Kalotermes flavicollis (Kalotermitidae), including two highly divergent mitochondrial lineages, and the more derived Reticulitermes lucifugus and R. urbis (Rhinotermitidae). Partial 3' sequences for R. grassei and R. balkanensis were also analyzed. The essential structural features of R2 elements are conserved in termites. Phylogenetic analysis revealed that termite elements belong to the same clade and that their phylogeny is fully compatible with the phylogeny of their host species. The study of the number and the frequency of R2 insertion variants in four R. urbis colonies suggests a greatly reduced, or completely absent, recent element activity.

  8. Mobile Insertion Cassette Elements Found in Small Non-Transmissible Plasmids in Proteeae May Explain qnrD Mobilization

    PubMed Central

    Guillard, Thomas; Grillon, Antoine; de Champs, Christophe; Cartier, Céline; Madoux, Janick; Berçot, Béatrice; Lebreil, Anne-Laure; Lozniewski, Alain; Riahi, Jacques; Vernet-Garnier, Véronique; Cambau, Emmanuelle

    2014-01-01

    qnrD is a plasmid mediated quinolone resistance gene from unknown origin, recently described in Enterobacteriaceae. It encodes a pentapeptide repeat protein 36–60% different from the other Qnr (A, B, C, S and VC). Since most qnrD-positive strains were described as strains belonging to Proteus or Providencia genera, we hypothesized that qnrD originated in Proteeae before disseminating to other enterobacterial species. We screened 317 strains of Proteeae for qnrD and its genetic support by PCR. For all the seven qnrD-positive strains (4 Proteus mirabilis, 1 Proteus vulgaris and 2 Providencia rettgeri) the gene was carried onto a small non-transmissible plasmid, contrarily to other qnr genes that are usually carried onto large multi-resistant plasmids. Nucleotide sequences of the qnrD-bearing plasmids were 96% identical. Plasmids contained 3 ORFs apart from qnrD and belonged to an undescribed incompatibility group. Only one plasmid, in P. vulgaris, was slightly different with a 1,568-bp insertion between qnrD and its promoter, leading to absence of quinolone resistance. We sought for similar plasmids in 15 reference strains of Proteeae, but which were tested negative for qnrD, and found a 48% identical plasmid (pVERM) in Providencia vermicola. In order to explain how qnrD could have been inserted into such native plasmid, we sought for gene mobilization structures. qnrD was found to be located within a mobile insertion cassette (mic) element which sequences are similar to one mic also found in pVERM. Our conclusions are that (i) the small non-transmissible qnrD-plasmids described here may result from the recombination between an as-yet-unknown progenitor of qnrD and pVERM, (ii) these plasmids are maintained in Proteeae being a qnrD reservoir (iii) the mic element may explain qnrD mobilization from non-transmissible plasmids to mobilizable or conjugative plasmids from other Enterobacteriaceae, (iv) they can recombined with larger multiresistant plasmids conjugated in Proteeae. PMID:24504382

  9. A proposal to rename the hyperthermophile Pyrococcus woesei as Pyrococcus furiosus subsp. woesei.

    PubMed

    Kanoksilapatham, Wirojne; González, Juan M; Maeder, Dennis L; DiRuggiero, Jocelyne; Robb, Frank T

    2004-10-01

    Pyrococcus species are hyperthermophilic members of the order Thermococcales, with optimal growth temperatures approaching 100 degrees C. All species grow heterotrophically and produce H2 or, in the presence of elemental sulfur (S(o)), H2S. Pyrococcus woesei and P. furiosus were isolated from marine sediments at the same Vulcano Island beach site and share many morphological and physiological characteristics. We report here that the rDNA operons of these strains have identical sequences, including their intergenic spacer regions and part of the 23S rRNA. Both species grow rapidly and produce H2 in the presence of 0.1% maltose and 10-100 microM sodium tungstate in S(o)-free medium. However, P. woesei shows more extensive autolysis than P. furiosus in the stationary phase. Pyrococcus furiosus and P. woesei share three closely related families of insertion sequences (ISs). A Southern blot performed with IS probes showed extensive colinearity between the genomes of P. woesei and P. furiosus. Cloning and sequencing of ISs that were in different contexts in P. woesei and P. furiosus revealed that the napA gene in P. woesei is disrupted by a type III IS element, whereas in P. furiosus, this gene is intact. A type I IS element, closely linked to the napA gene, was observed in the same context in both P. furiosus and P. woesei genomes. Our results suggest that the IS elements are implicated in genomic rearrangements and reshuffling in these closely related strains. We propose to rename P. woesei a subspecies of P. furiosus based on their identical rDNA operon sequences, many common IS elements that are shared genomic markers, and the observation that all P. woesei nucleotide sequences deposited in GenBank to date are > 99% identical to P. furiosus sequences.

  10. The Regulatory Properties of Autonomous Subtelomeric P Elements Are Sensitive to a Suppressor of Variegation in Drosophila Melanogaster

    PubMed Central

    Ronsseray, S.; Lehmann, M.; Nouaud, D.; Anxolabehere, D.

    1996-01-01

    Genetic recombination was used in Drosophila melanogaster to isolate P elements, inserted at the telomeres of X chromosomes (cytological site 1A) from natural populations, in a genetic background devoid of other P elements. We show that complete maternally inherited P repression in the germline (P cytotype) can be elicited by only two autonomous P elements at 1A and that a single element at this site has partial regulatory properties. The analysis of the surrounding chromosomal regions of the P elements at 1A shows that in all cases these elements are flanked by Telomeric Associated Sequences, tandemly repetitive noncoding sequences that have properties of heterochromatin. In addition, we show that the regulatory properties of P elements at 1A can be inhibited by some of the mutant alleles of the Su(var)205 gene and by a deficiency of this gene. However, the regulatory properties of reference P strains (Harwich and Texas 007) are not impaired by Su(var)205 mutations. Su(var)205 encodes Heterochromatin Protein 1 (HP1). These results suggest that the HP1 dosage effect on the P element properties is site-dependent and could involve the structure of the chromatin. PMID:8844154

  11. Bigfoot. a new family of MITE elements characterized from the Medicago genus.

    PubMed

    Charrier, B; Foucher, F; Kondorosi, E; d'Aubenton-Carafa, Y; Thermes, C; Kondorosi, A; Ratet, P

    1999-05-01

    We have characterized from the legume plant Medicago a new family of miniature inverted-repeat transposable elements (MITE), called the Bigfoot transposable elements. Two of these insertion elements are present only in a single allele of two different M. sativa genes. Using a PCR strategy we have isolated 19 other Bigfoot elements from the M. sativa and M. truncatula genomes. They differ from the previously characterized MITEs by their sequence, a target site of 9 bp and a partially clustered genomic distribution. In addition, we show that they exhibit a significantly stable secondary structure. These elements may represent up to 0.1% of the genome of the outcrossing Medicago sativa but are present at a reduced copy number in the genome of the autogamous M. truncatula plant, revealing major differences in the genome organization of these two plants.

  12. Structural and sequence diversity of the transposon Galileo in the Drosophila willistoni genome.

    PubMed

    Gonçalves, Juliana W; Valiati, Victor Hugo; Delprat, Alejandra; Valente, Vera L S; Ruiz, Alfredo

    2014-09-13

    Galileo is one of three members of the P superfamily of DNA transposons. It was originally discovered in Drosophila buzzatii, in which three segregating chromosomal inversions were shown to have been generated by ectopic recombination between Galileo copies. Subsequently, Galileo was identified in six of 12 sequenced Drosophila genomes, indicating its widespread distribution within this genus. Galileo is strikingly abundant in Drosophila willistoni, a neotropical species that is highly polymorphic for chromosomal inversions, suggesting a role for this transposon in the evolution of its genome. We carried out a detailed characterization of all Galileo copies present in the D. willistoni genome. A total of 191 copies, including 133 with two terminal inverted repeats (TIRs), were classified according to structure in six groups. The TIRs exhibited remarkable variation in their length and structure compared to the most complete copy. Three copies showed extended TIRs due to internal tandem repeats, the insertion of other transposable elements (TEs), or the incorporation of non-TIR sequences into the TIRs. Phylogenetic analyses of the transposase (TPase)-encoding and TIR segments yielded two divergent clades, which we termed Galileo subfamilies V and W. Target-site duplications (TSDs) in D. willistoni Galileo copies were 7- or 8-bp in length, with the consensus sequence GTATTAC. Analysis of the region around the TSDs revealed a target site motif (TSM) with a 15-bp palindrome that may give rise to a stem-loop secondary structure. There is a remarkable abundance and diversity of Galileo copies in the D. willistoni genome, although no functional copies were found. The TIRs in particular have a dynamic structure and extend in different ways, but their ends (required for transposition) are more conserved than the rest of the element. The D. willistoni genome harbors two Galileo subfamilies (V and W) that diverged ~9 million years ago and may have descended from an ancestral element in the genome. Galileo shows a significant insertion preference for a 15-bp palindromic TSM.

  13. Reconstitutional Mutagenesis of the Maize P Gene by Short-Range Ac Transpositions

    PubMed Central

    Moreno, M. A.; Chen, J.; Greenblatt, I.; Dellaporta, S. L.

    1992-01-01

    The tendency for Ac to transpose over short intervals has been utilized to develop insertional mutagenesis and fine structure genetic mapping strategies in maize. We recovered excisions of Ac from the P gene and insertions into nearby chromosomal sites. These closely linked Ac elements reinserted into the P gene, reconstituting over 250 unstable variegated alleles. Reconstituted alleles condition a variety of variegation patterns that reflect the position and orientation of Ac within the P gene. Molecular mapping and DNA sequence analyses have shown that reinsertion sites are dispersed throughout a 12.3-kb chromosomal region in the promoter, exons and introns of the P gene, but in some regions insertions sites were clustered in a nonrandom fashion. Transposition profiles and target site sequence data obtained from these studies have revealed several features of Ac transposition including its preference for certain target sites. These results clearly demonstrate the tendency of Ac to transpose to nearby sites in both proximal and distal directions from the donor site. With minor modifications, reconstitutional mutagenesis should be applicable to many Ac-induced mutations in maize and in other plant species and can possibly be extended to other eukaryotic transposon systems as well. PMID:1325389

  14. Effects of P Element Insertions on Quantitative Traits in Drosophila Melanogaster

    PubMed Central

    Mackay, TFC.; Lyman, R. F.; Jackson, M. S.

    1992-01-01

    P element mutagenesis was used to construct 94 third chromosome lines of Drosophila melanogaster which contained on average 3.1 stable P element inserts, in an inbred host strain background previously free of P elements. The homozygous and heterozygous effects of the inserts on viability and abdominal and sternopleural bristle number were ascertained by comparing the chromosome lines with inserts to insert-free control lines of the inbred host strain. P elements reduced average homozygous viability by 12.2% per insert and average heterozygous viability by 5.5% per insert, and induced recessive lethal mutations at a rate of 3.8% per insert. Mutational variation for the bristle traits averaged over both sexes was 0.03V(e) per homozygous P insert and 0.003V(e) per heterozygous P insert, where V(e) is the environmental variance. Mutational variation was greater for the sexes considered separately because inserts had large pleiotropic effects on sex dimorphism of bristle characters. The distributions of homozygous effects of inserts on the bristle traits were asymmetrical, with the largest effects in the direction of reducing bristle number; and highly leptokurtic, with most of the increase in variance contributed by a few lines with large effects. The inserts had partially recessive effects on the bristle traits. Insert lines with extreme bristle effects had on average greatly reduced viability. PMID:1311697

  15. MERE1, a low-copy-number copia-type retroelement in Medicago truncatula active during tissue culture.

    PubMed

    Rakocevic, Alexandra; Mondy, Samuel; Tirichine, Leïla; Cosson, Viviane; Brocard, Lysiane; Iantcheva, Anelia; Cayrel, Anne; Devier, Benjamin; Abu El-Heba, Ghada Ahmed; Ratet, Pascal

    2009-11-01

    We have identified an active Medicago truncatula copia-like retroelement called Medicago RetroElement1-1 (MERE1-1) as an insertion in the symbiotic NSP2 gene. MERE1-1 belongs to a low-copy-number family in the sequenced Medicago genome. These copies are highly related, but only three of them have a complete coding region and polymorphism exists between the long terminal repeats of these different copies. This retroelement family is present in all M. truncatula ecotypes tested but also in other legume species like Lotus japonicus. It is active only during tissue culture in both R108 and Jemalong Medicago accessions and inserts preferentially in genes.

  16. Transposition of the maize transposable element Ac in barley (Hordeum vulgare L.).

    PubMed

    Scholz, S; Lörz, H; Lütticke, S

    2001-01-01

    Transposition of the maize autonomous element Ac (Activator) was investigated in barley (Hordeum vulgare L.) with the aim of developing a transposon tagging system for the latter. The Ac element was introduced into meristematic tissue of barley by microprojectile bombardment. Transposon activity was then examined in the resulting transgenic plants. Multiple excision events were detected in leaf tissue of all plant lines. The mobile elements generated empty donor sites with small DNA sequence alterations, similar to those found in maize. Reintegration of Ac at independent genomic loci in somatic tissue was demonstrated by isolation of new element-flanking regions by AIMS-PCR (amplification of insertion-mutagenized sites). In addition, transmission of transposed Ac elements to progeny plants was confirmed. The results indicate that the introduced Ac element is able to transpose in barley. This is a first step towards the establishment of a transposon tagging system in this economically important crop.

  17. Conservative site-specific and single-copy transgenesis in human LINE-1 elements

    PubMed Central

    Vijaya Chandra, Shree Harsha; Makhija, Harshyaa; Peter, Sabrina; Myint Wai, Cho Mar; Li, Jinming; Zhu, Jindong; Ren, Zhonglu; D'Alcontres, Martina Stagno; Siau, Jia Wei; Chee, Sharon; Ghadessy, Farid John; Dröge, Peter

    2016-01-01

    Genome engineering of human cells plays an important role in biotechnology and molecular medicine. In particular, insertions of functional multi-transgene cassettes into suitable endogenous sequences will lead to novel applications. Although several tools have been exploited in this context, safety issues such as cytotoxicity, insertional mutagenesis and off-target cleavage together with limitations in cargo size/expression often compromise utility. Phage λ integrase (Int) is a transgenesis tool that mediates conservative site-specific integration of 48 kb DNA into a safe harbor site of the bacterial genome. Here, we show that an Int variant precisely recombines large episomes into a sequence, termed attH4X, found in 1000 human Long INterspersed Elements-1 (LINE-1). We demonstrate single-copy transgenesis through attH4X-targeting in various cell lines including hESCs, with the flexibility of selecting clones according to transgene performance and downstream applications. This is exemplified with pluripotency reporter cassettes and constitutively expressed payloads that remain functional in LINE1-targeted hESCs and differentiated progenies. Furthermore, LINE-1 targeting does not induce DNA damage-response or chromosomal aberrations, and neither global nor localized endogenous gene expression is substantially affected. Hence, this simple transgene addition tool should become particularly useful for applications that require engineering of the human genome with multi-transgenes. PMID:26673710

  18. Translocation and gross deletion breakpoints in human inherited disease and cancer II: Potential involvement of repetitive sequence elements in secondary structure formation between DNA ends.

    PubMed

    Chuzhanova, Nadia; Abeysinghe, Shaun S; Krawczak, Michael; Cooper, David N

    2003-09-01

    Translocations and gross deletions are responsible for a significant proportion of both cancer and inherited disease. Although such gene rearrangements are nonuniformly distributed in the human genome, the underlying mutational mechanisms remain unclear. We have studied the potential involvement of various types of repetitive sequence elements in the formation of secondary structure intermediates between the single-stranded DNA ends that recombine during rearrangements. Complexity analysis was used to assess the potential of these ends to form secondary structures, the maximum decrease in complexity consequent to a gross rearrangement being used as an indicator of the type of repeat and the specific DNA ends involved. A total of 175 pairs of deletion/translocation breakpoint junction sequences available from the Gross Rearrangement Breakpoint Database [GRaBD; www.uwcm.ac.uk/uwcm/mg/grabd/grabd.html] were analyzed. Potential secondary structure was noted between the 5' flanking sequence of the first breakpoint and the 3' flanking sequence of the second breakpoint in 49% of rearrangements and between the 5' flanking sequence of the second breakpoint and the 3' flanking sequence of the first breakpoint in 36% of rearrangements. Inverted repeats, inversions of inverted repeats, and symmetric elements were found in association with gross rearrangements at approximately the same frequency. However, inverted repeats and inversions of inverted repeats accounted for the vast majority (83%) of deletions plus small insertions, symmetric elements for one-half of all antigen receptor-mediated translocations, while direct repeats appear only to be involved in mediating simple deletions. These findings extend our understanding of illegitimate recombination by highlighting the importance of secondary structure formation between single-stranded DNA ends at breakpoint junctions. Copyright 2003 Wiley-Liss, Inc.

  19. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

    PubMed

    Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

    2010-07-16

    Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana.

  20. The CRISPR Spacer Space Is Dominated by Sequences from Species-Specific Mobilomes.

    PubMed

    Shmakov, Sergey A; Sitnik, Vassilii; Makarova, Kira S; Wolf, Yuri I; Severinov, Konstantin V; Koonin, Eugene V

    2017-09-19

    Clustered regularly interspaced short palindromic repeats and CRISPR-associated protein (CRISPR-Cas) systems store the memory of past encounters with foreign DNA in unique spacers that are inserted between direct repeats in CRISPR arrays. For only a small fraction of the spacers, homologous sequences, called protospacers, are detectable in viral, plasmid, and microbial genomes. The rest of the spacers remain the CRISPR "dark matter." We performed a comprehensive analysis of the spacers from all CRISPR- cas loci identified in bacterial and archaeal genomes, and we found that, depending on the CRISPR-Cas subtype and the prokaryotic phylum, protospacers were detectable for 1% to about 19% of the spacers (~7% global average). Among the detected protospacers, the majority, typically 80 to 90%, originated from viral genomes, including proviruses, and among the rest, the most common source was genes that are integrated into microbial chromosomes but are involved in plasmid conjugation or replication. Thus, almost all spacers with identifiable protospacers target mobile genetic elements (MGE). The GC content, as well as dinucleotide and tetranucleotide compositions, of microbial genomes, their spacer complements, and the cognate viral genomes showed a nearly perfect correlation and were almost identical. Given the near absence of self-targeting spacers, these findings are most compatible with the possibility that the spacers, including the dark matter, are derived almost completely from the species-specific microbial mobilomes. IMPORTANCE The principal function of CRISPR-Cas systems is thought to be protection of bacteria and archaea against viruses and other parasitic genetic elements. The CRISPR defense function is mediated by sequences from parasitic elements, known as spacers, that are inserted into CRISPR arrays and then transcribed and employed as guides to identify and inactivate the cognate parasitic genomes. However, only a small fraction of the CRISPR spacers match any sequences in the current databases, and of these, only a minority correspond to known parasitic elements. We show that nearly all spacers with matches originate from viral or plasmid genomes that are either free or have been integrated into the host genome. We further demonstrate that spacers with no matches have the same properties as those of identifiable origins, strongly suggesting that all spacers originate from mobile elements.

  1. Identification of structural variation in mouse genomes.

    PubMed

    Keane, Thomas M; Wong, Kim; Adams, David J; Flint, Jonathan; Reymond, Alexandre; Yalcin, Binnaz

    2014-01-01

    Structural variation is variation in structure of DNA regions affecting DNA sequence length and/or orientation. It generally includes deletions, insertions, copy-number gains, inversions, and transposable elements. Traditionally, the identification of structural variation in genomes has been challenging. However, with the recent advances in high-throughput DNA sequencing and paired-end mapping (PEM) methods, the ability to identify structural variation and their respective association to human diseases has improved considerably. In this review, we describe our current knowledge of structural variation in the mouse, one of the prime model systems for studying human diseases and mammalian biology. We further present the evolutionary implications of structural variation on transposable elements. We conclude with future directions on the study of structural variation in mouse genomes that will increase our understanding of molecular architecture and functional consequences of structural variation.

  2. Complete genome sequences and comparative genome analysis of Lactobacillus plantarum strain 5-2 isolated from fermented soybean.

    PubMed

    Liu, Chen-Jian; Wang, Rui; Gong, Fu-Ming; Liu, Xiao-Feng; Zheng, Hua-Jun; Luo, Yi-Yong; Li, Xiao-Ran

    2015-12-01

    Lactobacillus plantarum is an important probiotic and is mostly isolated from fermented foods. We sequenced the genome of L. plantarum strain 5-2, which was derived from fermented soybean isolated from Yunnan province, China. The strain was determined to contain 3114 genes. Fourteen complete insertion sequence (IS) elements were found in 5-2 chromosome. There were 24 DNA replication proteins and 76 DNA repair proteins in the 5-2 genome. Consistent with the classification of L. plantarum as a facultative heterofermentative lactobacillus, the 5-2 genome encodes key enzymes required for the EMP (Embden-Meyerhof-Parnas) and phosphoketolase (PK) pathways. Several components of the secretion machinery are found in the 5-2 genome, which was compared with L. plantarum ST-III, JDM1 and WCFS1. Most of the specific proteins in the four genomes appeared to be related to their prophage elements. Copyright © 2015 Elsevier Inc. All rights reserved.

  3. Identifying structural variation in haploid microbial genomes from short-read resequencing data using breseq.

    PubMed

    Barrick, Jeffrey E; Colburn, Geoffrey; Deatherage, Daniel E; Traverse, Charles C; Strand, Matthew D; Borges, Jordan J; Knoester, David B; Reba, Aaron; Meyer, Austin G

    2014-11-29

    Mutations that alter chromosomal structure play critical roles in evolution and disease, including in the origin of new lifestyles and pathogenic traits in microbes. Large-scale rearrangements in genomes are often mediated by recombination events involving new or existing copies of mobile genetic elements, recently duplicated genes, or other repetitive sequences. Most current software programs for predicting structural variation from short-read DNA resequencing data are intended primarily for use on human genomes. They typically disregard information in reads mapping to repeat sequences, and significant post-processing and manual examination of their output is often required to rule out false-positive predictions and precisely describe mutational events. We have implemented an algorithm for identifying structural variation from DNA resequencing data as part of the breseq computational pipeline for predicting mutations in haploid microbial genomes. Our method evaluates the support for new sequence junctions present in a clonal sample from split-read alignments to a reference genome, including matches to repeat sequences. Then, it uses a statistical model of read coverage evenness to accept or reject these predictions. Finally, breseq combines predictions of new junctions and deleted chromosomal regions to output biologically relevant descriptions of mutations and their effects on genes. We demonstrate the performance of breseq on simulated Escherichia coli genomes with deletions generating unique breakpoint sequences, new insertions of mobile genetic elements, and deletions mediated by mobile elements. Then, we reanalyze data from an E. coli K-12 mutation accumulation evolution experiment in which structural variation was not previously identified. Transposon insertions and large-scale chromosomal changes detected by breseq account for ~25% of spontaneous mutations in this strain. In all cases, we find that breseq is able to reliably predict structural variation with modest read-depth coverage of the reference genome (>40-fold). Using breseq to predict structural variation should be useful for studies of microbial epidemiology, experimental evolution, synthetic biology, and genetics when a reference genome for a closely related strain is available. In these cases, breseq can discover mutations that may be responsible for important or unintended changes in genomes that might otherwise go undetected.

  4. Derepression of the Plant Chromovirus LORE1 Induces Germline Transposition in Regenerated Plants

    PubMed Central

    Fukai, Eigo; Umehara, Yosuke; Sato, Shusei; Endo, Makoto; Kouchi, Hiroshi; Hayashi, Makoto; Stougaard, Jens; Hirochika, Hirohiko

    2010-01-01

    Transposable elements represent a large proportion of the eukaryotic genomes. Long Terminal Repeat (LTR) retrotransposons are very abundant and constitute the predominant family of transposable elements in plants. Recent studies have identified chromoviruses to be a widely distributed lineage of Gypsy elements. These elements contain chromodomains in their integrases, which suggests a preference for insertion into heterochromatin. In turn, this preference might have contributed to the patterning of heterochromatin observed in host genomes. Despite their potential importance for our understanding of plant genome dynamics and evolution, the regulatory mechanisms governing the behavior of chromoviruses and their activities remain largely uncharacterized. Here, we report a detailed analysis of the spatio-temporal activity of a plant chromovirus in the endogenous host. We examined LORE1a, a member of the endogenous chromovirus LORE1 family from the model legume Lotus japonicus. We found that this chromovirus is stochastically de-repressed in plant populations regenerated from de-differentiated cells and that LORE1a transposes in the male germline. Bisulfite sequencing of the 5′ LTR and its surrounding region suggests that tissue culture induces a loss of epigenetic silencing of LORE1a. Since LTR promoter activity is pollen specific, as shown by the analysis of transgenic plants containing an LTR::GUS fusion, we conclude that male germline-specific LORE1a transposition in pollen grains is controlled transcriptionally by its own cis-elements. New insertion sites of LORE1a copies were frequently found in genic regions and show no strong insertional preferences. These distinctive novel features of LORE1 indicate that this chromovirus has considerable potential for generating genetic and epigenetic diversity in the host plant population. Our results also define conditions for the use of LORE1a as a genetic tool. PMID:20221264

  5. Massive programmed translational jumping in mitochondria

    PubMed Central

    Lang, B. Franz; Jakubkova, Michaela; Hegedusova, Eva; Daoud, Rachid; Forget, Lise; Brejova, Brona; Vinar, Tomas; Kosa, Peter; Fricova, Dominika; Nebohacova, Martina; Griac, Peter; Tomaska, Lubomir; Burger, Gertraud; Nosek, Jozef

    2014-01-01

    Programmed translational bypassing is a process whereby ribosomes “ignore” a substantial interval of mRNA sequence. Although discovered 25 y ago, the only experimentally confirmed example of this puzzling phenomenon is expression of the bacteriophage T4 gene 60. Bypassing requires translational blockage at a “takeoff codon” immediately upstream of a stop codon followed by a hairpin, which causes peptidyl-tRNA dissociation and reassociation with a matching “landing triplet” 50 nt downstream, where translation resumes. Here, we report 81 translational bypassing elements (byps) in mitochondria of the yeast Magnusiomyces capitatus and demonstrate in three cases, by transcript analysis and proteomics, that byps are retained in mitochondrial mRNAs but not translated. Although mitochondrial byps resemble the bypass sequence in the T4 gene 60, they utilize unused codons instead of stops for translational blockage and have relaxed matching rules for takeoff/landing sites. We detected byp-like sequences also in mtDNAs of several Saccharomycetales, indicating that byps are mobile genetic elements. These byp-like sequences lack bypassing activity and are tolerated when inserted in-frame in variable protein regions. We hypothesize that byp-like elements have the potential to contribute to evolutionary diversification of proteins by adding new domains that allow exploration of new structures and functions. PMID:24711422

  6. Population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection.

    PubMed

    Yu, Qichao; Zhang, Wei; Zhang, Xiaolong; Zeng, Yongli; Wang, Yeming; Wang, Yanhui; Xu, Liqin; Huang, Xiaoyun; Li, Nannan; Zhou, Xinlan; Lu, Jie; Guo, Xiaosen; Li, Guibo; Hou, Yong; Liu, Shiping; Li, Bo

    2017-09-01

    Active retrotransposons play important roles during evolution and continue to shape our genomes today, especially in genetic polymorphisms underlying a diverse set of diseases. However, studies of human retrotransposon insertion polymorphisms (RIPs) based on whole-genome deep sequencing at the population level have not been sufficiently undertaken, despite the obvious need for a thorough characterization of RIPs in the general population. Herein, we present a novel and efficient computational tool called Specific Insertions Detector (SID) for the detection of non-reference RIPs. We demonstrate that SID is suitable for high-depth whole-genome sequencing data using paired-end reads obtained from simulated and real datasets. We construct a comprehensive RIP database using a large population of 90 Han Chinese individuals with a mean ×68 depth per individual. In total, we identify 9342 recent RIPs, and 8433 of these RIPs are novel compared with dbRIP, including 5826 Alu, 2169 long interspersed nuclear element 1 (L1), 383 SVA, and 55 long terminal repeats. Among the 9342 RIPs, 4828 were located in gene regions and 5 were located in protein-coding regions. We demonstrate that RIPs can, in principle, be an informative resource to perform population evolution and phylogenetic analyses. Taking the demographic effects into account, we identify a weak negative selection on SVA and L1 but an approximately neutral selection for Alu elements based on the frequency spectrum of RIPs. SID is a powerful open-source program for the detection of non-reference RIPs. We built a non-reference RIP dataset that greatly enhanced the diversity of RIPs detected in the general population, and it should be invaluable to researchers interested in many aspects of human evolution, genetics, and disease. As a proof of concept, we demonstrate that the RIPs can be used as biomarkers in a similar way as single nucleotide polymorphisms. © The Authors 2017. Published by Oxford University Press.

  7. High-resolution definition of the Vibrio cholerae essential gene set with hidden Markov model–based analyses of transposon-insertion sequencing data

    PubMed Central

    Chao, Michael C.; Pritchard, Justin R.; Zhang, Yanjia J.; Rubin, Eric J.; Livny, Jonathan; Davis, Brigid M.; Waldor, Matthew K.

    2013-01-01

    The coupling of high-density transposon mutagenesis to high-throughput DNA sequencing (transposon-insertion sequencing) enables simultaneous and genome-wide assessment of the contributions of individual loci to bacterial growth and survival. We have refined analysis of transposon-insertion sequencing data by normalizing for the effect of DNA replication on sequencing output and using a hidden Markov model (HMM)-based filter to exploit heretofore unappreciated information inherent in all transposon-insertion sequencing data sets. The HMM can smooth variations in read abundance and thereby reduce the effects of read noise, as well as permit fine scale mapping that is independent of genomic annotation and enable classification of loci into several functional categories (e.g. essential, domain essential or ‘sick’). We generated a high-resolution map of genomic loci (encompassing both intra- and intergenic sequences) that are required or beneficial for in vitro growth of the cholera pathogen, Vibrio cholerae. This work uncovered new metabolic and physiologic requirements for V. cholerae survival, and by combining transposon-insertion sequencing and transcriptomic data sets, we also identified several novel noncoding RNA species that contribute to V. cholerae growth. Our findings suggest that HMM-based approaches will enhance extraction of biological meaning from transposon-insertion sequencing genomic data. PMID:23901011

  8. Negative effect of the 5'-untranslated leader sequence on Ac transposon promoter expression.

    PubMed

    Scortecci, K C; Raina, R; Fedoroff, N V; Van Sluys, M A

    1999-08-01

    Transposable elements are used in heterologous plant hosts to clone genes by insertional mutagenesis. The Activator (Ac) transposable element has been cloned from maize, and introduced into a variety of plants. However, differences in regulation and transposition frequency have been observed between different host plants. The cause of this variability is still unknown. To better understand the activity of the Ac element, we analyzed the Ac promoter region and its 5'-untranslated leader sequence (5' UTL). Transient assays in tobacco NT1 suspension cells showed that the Ac promoter is a weak promoter and its activity was localized by deletion analyses. The data presented here indicate that the core of the Ac promoter is contained within 153 bp fragment upstream to transcription start sites. An important inhibitory effect (80%) due to the presence of the 5' UTL was found on the expression of LUC reporter gene. Here we demonstrate that the presence of the 5' UTL in the constructs reduces the expression driven by either strong or weak promoters.

  9. Chromodomains direct integration of retrotransposons to heterochromatin

    PubMed Central

    Gao, Xiang; Hou, Yi; Ebina, Hirotaka; Levin, Henry L.; Voytas, Daniel F.

    2008-01-01

    The enrichment of mobile genetic elements in heterochromatin may be due, in part, to targeted integration. The chromoviruses are Ty3/gypsy retrotransposons with chromodomains at their integrase C termini. Chromodomains are logical determinants for targeting to heterochromatin, because the chromodomain of heterochromatin protein 1 (HP1) typically recognizes histone H3 K9 methylation, an epigenetic mark characteristic of heterochromatin. We describe three groups of chromoviruses based on amino acid sequence relationships of their integrase C termini. Genome sequence analysis indicates that representative chromoviruses from each group are enriched in gene-poor regions of the genome relative to other retrotransposons, and when fused to fluorescent marker proteins, the chromodomains target proteins to specific subnuclear foci coincident with heterochromatin. The chromodomain of the fungal element, MAGGY, interacts with histone H3 dimethyl- and trimethyl-K9, and when the MAGGY chromodomain is fused to integrase of the Schizosaccharomyces pombe Tf1 retrotransposon, new Tf1 insertions are directed to sites of H3 K9 methylation. Repetitive sequences such as transposable elements trigger the RNAi pathway resulting in their epigenetic modification. Our results suggest a dynamic interplay between retrotransposons and heterochromatin, wherein mobile elements recognize heterochromatin at the time of integration and then perpetuate the heterochromatic mark by triggering epigenetic modification. PMID:18256242

  10. A highly polymorphic insertion in the Y-chromosome amelogenin gene can be used for evolutionary biology, population genetics and sexing in Cetacea and Artiodactyla

    PubMed Central

    Macé, Matthias; Crouau-Roy, Brigitte

    2008-01-01

    Background The early radiation of the Cetartiodactyla is complex, and unambiguous molecular characters are needed to clarify the positions of hippotamuses, camels and pigs relative to the remaining taxa (Cetacea and Ruminantia). There is also a need for informative genealogic markers for Y-chromosome population genetics as well as a sexing method applicable to all species from this group. We therefore studied the sequence variation of a partial sequence of the evolutionary conserved amelogenin gene to assess its potential use in each of these fields. Results and discussion We report a large interstitial insertion in the Y amelogenin locus in most of the Cetartiodactyla lineages (cetaceans and ruminants). This sex-linked size polymorphism is the result of a 460–465 bp inserted element in intron 4 of the amelogenin gene of Ruminants and Cetaceans. Therefore, this polymorphism can easily be used in a sexing assay for these species. When taking into account this shared character in addition to nucleotide sequence, gene genealogy follows sex-chromosome divergence in Cetartiodactyla whereas it is more congruent with zoological history when ignoring these characters. This could be related to a loss of homology between chromosomal copies given the old age of the insertion. The 1 kbp Amel-Y amplified fragment is also characterized by high nucleotide diversity (64 polymorphic sites spanning over 1 kbp in seven haplotypes) which is greater than for other Y-chromosome sequence markers studied so far but less than the mitochondrial control region. Conclusion The gender-dependent polymorphism we have identified is relevant not only for phylogenic inference within the Cetartiodactyla but also for Y-chromosome based population genetics and gender determination in cetaceans and ruminants. One single protocol can therefore be used for studies in population and evolutionary genetics, reproductive biotechnologies, and forensic science. PMID:18925953

  11. Templated sequence insertion polymorphisms in the human genome

    NASA Astrophysics Data System (ADS)

    Onozawa, Masahiro; Aplan, Peter

    2016-11-01

    Templated Sequence Insertion Polymorphism (TSIP) is a recently described form of polymorphism recognized in the human genome, in which a sequence that is templated from a distant genomic region is inserted into the genome, seemingly at random. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; Class 1 TSIPs show features of insertions that are mediated via the LINE-1 ORF2 protein, including 1) target-site duplication (TSD), 2) polyadenylation 10-30 nucleotides downstream of a “cryptic” polyadenylation signal, and 3) preference for insertion at a 5’-TTTT/A-3’ sequence. In contrast, class 2 TSIPs show features consistent with repair of a DNA double-strand break via insertion of a DNA “patch” that is derived from a distant genomic region. Survey of a large number of normal human volunteers demonstrates that most individuals have 25-30 TSIPs, and that these TSIPs track with specific geographic regions. Similar to other forms of human polymorphism, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases.

  12. Different Genetic Elements Carrying the tet(W) Gene in Two Human Clinical Isolates of Streptococcus suis▿ †

    PubMed Central

    Palmieri, Claudio; Princivalli, Maria Stella; Brenciani, Andrea; Varaldo, Pietro E.; Facinelli, Bruna

    2011-01-01

    The genetic support for tet(W), an emerging tetracycline resistance determinant, was studied in two strains of Streptococcus suis, SsCA and SsUD, both isolated in Italy from patients with meningitis. Two completely different tet(W)-carrying genetic elements, sharing only a tet(W)-containing segment barely larger than the gene, were found in the two strains. The one from strain SsCA was nontransferable, and aside from an erm(B)-containing insertion, it closely resembled a genomic island recently described in an S. suis Chinese human isolate in sequence, organization, and chromosomal location. The tet(W)-carrying genetic element from strain SsUD was transferable (at a low frequency) and, though apparently noninducible following mitomycin C treatment, displayed a typical phage organization and was named ΦSsUD.1. Its full sequence was determined (60,711 bp), the highest BLASTN score being Streptococcus pyogenes Φm46.1. ΦSsUD.1 exhibited a unique combination of antibiotic and heavy metal resistance genes. Besides tet(W), it contained a MAS (macrolide-aminoglycoside-streptothricin) fragment with an erm(B) gene having a deleted leader peptide and a cadC/cadA cadmium efflux cassette. The MAS fragment closely resembled the one recently described in pneumococcal transposons Tn6003 and Tn1545. These resistance genes found in the ΦSsUD.1 phage scaffold differed from, but were in the same position as, cargo genes carried by other streptococcal phages. The chromosome integration site of ΦSsUD.1 was at the 3′ end of a conserved tRNA uracil methyltransferase (rum) gene. This site, known to be an insertional hot spot for mobile elements in S. pyogenes, might play a similar role in S. suis. PMID:21115784

  13. Different genetic elements carrying the tet(W) gene in two human clinical isolates of Streptococcus suis.

    PubMed

    Palmieri, Claudio; Princivalli, Maria Stella; Brenciani, Andrea; Varaldo, Pietro E; Facinelli, Bruna

    2011-02-01

    The genetic support for tet(W), an emerging tetracycline resistance determinant, was studied in two strains of Streptococcus suis, SsCA and SsUD, both isolated in Italy from patients with meningitis. Two completely different tet(W)-carrying genetic elements, sharing only a tet(W)-containing segment barely larger than the gene, were found in the two strains. The one from strain SsCA was nontransferable, and aside from an erm(B)-containing insertion, it closely resembled a genomic island recently described in an S. suis Chinese human isolate in sequence, organization, and chromosomal location. The tet(W)-carrying genetic element from strain SsUD was transferable (at a low frequency) and, though apparently noninducible following mitomycin C treatment, displayed a typical phage organization and was named ΦSsUD.1. Its full sequence was determined (60,711 bp), the highest BLASTN score being Streptococcus pyogenes Φm46.1. ΦSsUD.1 exhibited a unique combination of antibiotic and heavy metal resistance genes. Besides tet(W), it contained a MAS (macrolide-aminoglycoside-streptothricin) fragment with an erm(B) gene having a deleted leader peptide and a cadC/cadA cadmium efflux cassette. The MAS fragment closely resembled the one recently described in pneumococcal transposons Tn6003 and Tn1545. These resistance genes found in the ΦSsUD.1 phage scaffold differed from, but were in the same position as, cargo genes carried by other streptococcal phages. The chromosome integration site of ΦSsUD.1 was at the 3' end of a conserved tRNA uracil methyltransferase (rum) gene. This site, known to be an insertional hot spot for mobile elements in S. pyogenes, might play a similar role in S. suis.

  14. Landscape of Insertion Polymorphisms in the Human Genome

    PubMed Central

    Onozawa, Masahiro; Goldberg, Liat; Aplan, Peter D.

    2015-01-01

    Nucleotide substitutions, small (<50 bp) insertions or deletions (indels), and large (>50 bp) deletions are well-known causes of genetic variation within the human genome. We recently reported a previously unrecognized form of polymorphic insertions, termed templated sequence insertion polymorphism (TSIP), in which the inserted sequence was templated from a distant genomic region, and was inserted in the genome through reverse transcription of an RNA intermediate. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; class 1 TSIPs show target site duplication, polyadenylation, and preference for insertion at a 5′-TTTT/A-3′ sequence, suggesting a LINE-1 based insertion mechanism, whereas class 2 TSIPs show features consistent with repair of a DNA double strand break by nonhomologous end joining. To gain a more complete picture of TSIPs throughout the human population, we evaluated whole-genome sequence from 52 individuals, and identified 171 TSIPs. Most individuals had 25–30 TSIPs, and common (present in >20% of individuals) TSIPs were found in individuals throughout the world, whereas rare TSIPs tended to cluster in specific geographic regions. The number of rare TSIPs was greater than the number of common TSIPs, suggesting that TSIP generation is an ongoing process. Intriguingly, mitochondrial sequences were a frequent template for class 2 insertions, used more commonly than any nuclear chromosome. Similar to single nucleotide polymorphisms and indels, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases, and can be useful in tracking historical migration of populations. PMID:25745018

  15. Global mapping of transposon location.

    PubMed

    Gabriel, Abram; Dapprich, Johannes; Kunkel, Mark; Gresham, David; Pratt, Stephen C; Dunham, Maitreya J

    2006-12-15

    Transposable genetic elements are ubiquitous, yet their presence or absence at any given position within a genome can vary between individual cells, tissues, or strains. Transposable elements have profound impacts on host genomes by altering gene expression, assisting in genomic rearrangements, causing insertional mutations, and serving as sources of phenotypic variation. Characterizing a genome's full complement of transposons requires whole genome sequencing, precluding simple studies of the impact of transposition on interindividual variation. Here, we describe a global mapping approach for identifying transposon locations in any genome, using a combination of transposon-specific DNA extraction and microarray-based comparative hybridization analysis. We use this approach to map the repertoire of endogenous transposons in different laboratory strains of Saccharomyces cerevisiae and demonstrate that transposons are a source of extensive genomic variation. We also apply this method to mapping bacterial transposon insertion sites in a yeast genomic library. This unique whole genome view of transposon location will facilitate our exploration of transposon dynamics, as well as defining bases for individual differences and adaptive potential.

  16. Genome-wide analysis of short interspersed nuclear elements SINES revealed high sequence conservation, gene association and retrotranspositional activity in wheat

    PubMed Central

    Ben-David, Smadar; Yaakov, Beery; Kashkush, Khalil

    2013-01-01

    Short interspersed nuclear elements (SINEs) are non-autonomous non-LTR retroelements that are present in most eukaryotic species. While SINEs have been intensively investigated in humans and other animal systems, they are poorly studied in plants, especially in wheat (Triticum aestivum). We used quantitative PCR of various wheat species to determine the copy number of a wheat SINE family, termed Au SINE, combined with computer-assisted analyses of the publicly available 454 pyrosequencing database of T. aestivum. In addition, we utilized site-specific PCR on 57 Au SINE insertions, transposon methylation display and transposon display on newly formed wheat polyploids to assess retrotranspositional activity, epigenetic status and genetic rearrangements in Au SINE, respectively. We retrieved 3706 different insertions of Au SINE from the 454 pyrosequencing database of T. aestivum, and found that most of the elements are inserted in A/T-rich regions, while approximately 38% of the insertions are associated with transcribed regions, including known wheat genes. We observed typical retrotransposition of Au SINE in the second generation of a newly formed wheat allohexaploid, and massive hypermethylation in CCGG sites surrounding Au SINE in the third generation. Finally, we observed huge differences in the copy numbers in diploid Triticum and Aegilops species, and a significant increase in the copy numbers in natural wheat polyploids, but no significant increase in the copy number of Au SINE in the first four generations for two of three newly formed allopolyploid species used in this study. Our data indicate that SINEs may play a prominent role in the genomic evolution of wheat through stress-induced activation. PMID:23855320

  17. Genome-wide analysis of short interspersed nuclear elements SINES revealed high sequence conservation, gene association and retrotranspositional activity in wheat.

    PubMed

    Ben-David, Smadar; Yaakov, Beery; Kashkush, Khalil

    2013-10-01

    Short interspersed nuclear elements (SINEs) are non-autonomous non-LTR retroelements that are present in most eukaryotic species. While SINEs have been intensively investigated in humans and other animal systems, they are poorly studied in plants, especially in wheat (Triticum aestivum). We used quantitative PCR of various wheat species to determine the copy number of a wheat SINE family, termed Au SINE, combined with computer-assisted analyses of the publicly available 454 pyrosequencing database of T. aestivum. In addition, we utilized site-specific PCR on 57 Au SINE insertions, transposon methylation display and transposon display on newly formed wheat polyploids to assess retrotranspositional activity, epigenetic status and genetic rearrangements in Au SINE, respectively. We retrieved 3706 different insertions of Au SINE from the 454 pyrosequencing database of T. aestivum, and found that most of the elements are inserted in A/T-rich regions, while approximately 38% of the insertions are associated with transcribed regions, including known wheat genes. We observed typical retrotransposition of Au SINE in the second generation of a newly formed wheat allohexaploid, and massive hypermethylation in CCGG sites surrounding Au SINE in the third generation. Finally, we observed huge differences in the copy numbers in diploid Triticum and Aegilops species, and a significant increase in the copy numbers in natural wheat polyploids, but no significant increase in the copy number of Au SINE in the first four generations for two of three newly formed allopolyploid species used in this study. Our data indicate that SINEs may play a prominent role in the genomic evolution of wheat through stress-induced activation. © 2013 Ben-Gurion University The Plant Journal © 2013 John Wiley & Sons Ltd.

  18. Contribution of transposable elements in the plant's genome.

    PubMed

    Sahebi, Mahbod; Hanafi, Mohamed M; van Wijnen, Andre J; Rice, David; Rafii, M Y; Azizi, Parisa; Osman, Mohamad; Taheri, Sima; Bakar, Mohd Faizal Abu; Isa, Mohd Noor Mat; Noor, Yusuf Muhammad

    2018-07-30

    Plants maintain extensive growth flexibility under different environmental conditions, allowing them to continuously and rapidly adapt to alterations in their environment. A large portion of many plant genomes consists of transposable elements (TEs) that create new genetic variations within plant species. Different types of mutations may be created by TEs in plants. Many TEs can avoid the host's defense mechanisms and survive alterations in transposition activity, internal sequence and target site. Thus, plant genomes are expected to utilize a variety of mechanisms to tolerate TEs that are near or within genes. TEs affect the expression of not only nearby genes but also unlinked inserted genes. TEs can create new promoters, leading to novel expression patterns or alternative coding regions to generate alternate transcripts in plant species. TEs can also provide novel cis-acting regulatory elements that act as enhancers or inserts within original enhancers that are required for transcription. Thus, the regulation of plant gene expression is strongly managed by the insertion of TEs into nearby genes. TEs can also lead to chromatin modifications and thereby affect gene expression in plants. TEs are able to generate new genes and modify existing gene structures by duplicating, mobilizing and recombining gene fragments. They can also facilitate cellular functions by sharing their transposase-coding regions. Hence, TE insertions can not only act as simple mutagens but can also alter the elementary functions of the plant genome. Here, we review recent discoveries concerning the contribution of TEs to gene expression in plant genomes and discuss the different mechanisms by which TEs can affect plant gene expression and reduce host defense mechanisms. Copyright © 2018 Elsevier B.V. All rights reserved.

  19. Characterization of GM events by insert knowledge adapted re-sequencing approaches

    PubMed Central

    Yang, Litao; Wang, Congmao; Holst-Jensen, Arne; Morisset, Dany; Lin, Yongjun; Zhang, Dabing

    2013-01-01

    Detection methods and data from molecular characterization of genetically modified (GM) events are needed by stakeholders of public risk assessors and regulators. Generally, the molecular characteristics of GM events are incomprehensively revealed by current approaches and biased towards detecting transformation vector derived sequences. GM events are classified based on available knowledge of the sequences of vectors and inserts (insert knowledge). Herein we present three insert knowledge-adapted approaches for characterization GM events (TT51-1 and T1c-19 rice as examples) based on paired-end re-sequencing with the advantages of comprehensiveness, accuracy, and automation. The comprehensive molecular characteristics of two rice events were revealed with additional unintended insertions comparing with the results from PCR and Southern blotting. Comprehensive transgene characterization of TT51-1 and T1c-19 is shown to be independent of a priori knowledge of the insert and vector sequences employing the developed approaches. This provides an opportunity to identify and characterize also unknown GM events. PMID:24088728

  20. Characterization of GM events by insert knowledge adapted re-sequencing approaches.

    PubMed

    Yang, Litao; Wang, Congmao; Holst-Jensen, Arne; Morisset, Dany; Lin, Yongjun; Zhang, Dabing

    2013-10-03

    Detection methods and data from molecular characterization of genetically modified (GM) events are needed by stakeholders of public risk assessors and regulators. Generally, the molecular characteristics of GM events are incomprehensively revealed by current approaches and biased towards detecting transformation vector derived sequences. GM events are classified based on available knowledge of the sequences of vectors and inserts (insert knowledge). Herein we present three insert knowledge-adapted approaches for characterization GM events (TT51-1 and T1c-19 rice as examples) based on paired-end re-sequencing with the advantages of comprehensiveness, accuracy, and automation. The comprehensive molecular characteristics of two rice events were revealed with additional unintended insertions comparing with the results from PCR and Southern blotting. Comprehensive transgene characterization of TT51-1 and T1c-19 is shown to be independent of a priori knowledge of the insert and vector sequences employing the developed approaches. This provides an opportunity to identify and characterize also unknown GM events.

  1. Repair of DNA double-strand breaks by templated nucleotide sequence insertions derived from distant regions of the genome.

    PubMed

    Onozawa, Masahiro; Zhang, Zhenhua; Kim, Yoo Jung; Goldberg, Liat; Varga, Tamas; Bergsagel, P Leif; Kuehl, W Michael; Aplan, Peter D

    2014-05-27

    We used the I-SceI endonuclease to produce DNA double-strand breaks (DSBs) and observed that a fraction of these DSBs were repaired by insertion of sequences, which we termed "templated sequence insertions" (TSIs), derived from distant regions of the genome. These TSIs were derived from genic, retrotransposon, or telomere sequences and were not deleted from the donor site in the genome, leading to the hypothesis that they were derived from reverse-transcribed RNA. Cotransfection of RNA and an I-SceI expression vector demonstrated insertion of RNA-derived sequences at the DNA-DSB site, and TSIs were suppressed by reverse-transcriptase inhibitors. Both observations support the hypothesis that TSIs were derived from RNA templates. In addition, similar insertions were detected at sites of DNA DSBs induced by transcription activator-like effector nuclease proteins. Whole-genome sequencing of myeloma cell lines revealed additional TSIs, demonstrating that repair of DNA DSBs via insertion was not restricted to experimentally produced DNA DSBs. Analysis of publicly available databases revealed that many of these TSIs are polymorphic in the human genome. Taken together, these results indicate that insertional events should be considered as alternatives to gross chromosomal rearrangements in the interpretation of whole-genome sequence data and that this mutagenic form of DNA repair may play a role in genetic disease, exon shuffling, and mammalian evolution.

  2. Polycistronic lentiviral vector for "hit and run" reprogramming of adult skin fibroblasts to induced pluripotent stem cells.

    PubMed

    Chang, Chia-Wei; Lai, Yi-Shin; Pawlik, Kevin M; Liu, Kaimao; Sun, Chiao-Wang; Li, Chao; Schoeb, Trenton R; Townes, Tim M

    2009-05-01

    We report the derivation of induced pluripotent stem (iPS) cells from adult skin fibroblasts using a single, polycistronic lentiviral vector encoding the reprogramming factors Oct4, Sox2, and Klf4. Porcine teschovirus-1 2A sequences that trigger ribosome skipping were inserted between human cDNAs for these factors, and the polycistron was subcloned downstream of the elongation factor 1 alpha promoter in a self-inactivating (SIN) lentiviral vector containing a loxP site in the truncated 3' long terminal repeat (LTR). Adult skin fibroblasts from a humanized mouse model of sickle cell disease were transduced with this single lentiviral vector, and iPS cell colonies were picked within 30 days. These cells expressed endogenous Oct4, Sox2, Nanog, alkaline phosphatase, stage-specific embryonic antigen-1, and other markers of pluripotency. The iPS cells produced teratomas containing tissue derived from all three germ layers after injection into immunocompromised mice and formed high-level chimeras after injection into murine blastocysts. iPS cell lines with as few as three lentiviral insertions were obtained. Expression of Cre recombinase in these iPS cells resulted in deletion of the lentiviral vector, and sequencing of insertion sites demonstrated that remnant 291-bp SIN LTRs containing a single loxP site did not interrupt coding sequences, promoters, or known regulatory elements. These results suggest that a single, polycistronic "hit and run" vector can safely and effectively reprogram adult dermal fibroblasts into iPS cells.

  3. Germline transformation of the butterfly Bicyclus anynana.

    PubMed

    Marcus, Jeffrey M; Ramos, Diane M; Monteiro, Antónia

    2004-08-07

    Ecological and evolutionary theory has frequently been inspired by the diversity of colour patterns on the wings of butterflies. More recently, these varied patterns have also become model systems for studying the evolution of developmental mechanisms. A technique that will facilitate our understanding of butterfly colour-pattern development is germline transformation. Germline transformation permits functional tests of candidate gene products and of cis-regulatory regions, and provides a means of generating new colour-pattern mutants by insertional mutagenesis. We report the successful transformation of the African satyrid butterfly Bicyclus anynana with two different transposable element vectors, Hermes and piggyBac, each carrying EGFP coding sequences driven by the 3XP3 synthetic enhancer that drives gene expression in the eyes. Candidate lines identified by screening for EGFP in adult eyes were later confirmed by PCR amplification of a fragment of the EGFP coding sequence from genomic DNA. Flanking DNA surrounding the insertions was amplified by inverse PCR and sequenced. Transformation rates were 5% for piggyBac and 10.2% for Hermes. Ultimately, the new data generated by these techniques may permit an integrated understanding of the developmental genetics of colour-pattern formation and of the ecological and evolutionary processes in which these patterns play a role.

  4. Human structural variation: mechanisms of chromosome rearrangements

    PubMed Central

    Weckselblatt, Brooke; Rudd, M. Katharine

    2015-01-01

    Chromosome structural variation (SV) is a normal part of variation in the human genome, but some classes of SV can cause neurodevelopmental disorders. Analysis of the DNA sequence at SV breakpoints can reveal mutational mechanisms and risk factors for chromosome rearrangement. Large-scale SV breakpoint studies have become possible recently owing to advances in next-generation sequencing (NGS) including whole-genome sequencing (WGS). These findings have shed light on complex forms of SV such as triplications, inverted duplications, insertional translocations, and chromothripsis. Sequence-level breakpoint data resolve SV structure and determine how genes are disrupted, fused, and/or misregulated by breakpoints. Recent improvements in breakpoint sequencing have also revealed non-allelic homologous recombination (NAHR) between paralogous long interspersed nuclear element (LINE) or human endogenous retrovirus (HERV) repeats as a cause of deletions, duplications, and translocations. This review covers the genomic organization of simple and complex constitutional SVs, as well as the molecular mechanisms of their formation. PMID:26209074

  5. Sequence of pNL194, a 79.3-Kilobase IncN Plasmid Carrying the blaVIM-1 Metallo-β-Lactamase Gene in Klebsiella pneumoniae▿

    PubMed Central

    Miriagou, V.; Papagiannitsis, C. C.; Kotsakis, S. D.; Loli, A.; Tzelepi, E.; Legakis, N. J.; Tzouvelekis, L. S.

    2010-01-01

    The nucleotide sequence of pNL194, a VIM-1-encoding plasmid, is described in this study. pNL194 (79,307 bp) comprised an IncN-characteristic segment (38,940 bp) and a mosaic structure (40,367 bp) including blaVIM-1, aacA7, aadA1, aadA2, dfrA1, dfrA12, aphA1, strA, strB, and sul1. Tn1000 or Tn5501 insertion within fipA probably facilitated recruitment of additional mobile elements carrying resistance genes. PMID:20660690

  6. Fission yeast retrotransposon Tf1 integration is targeted to 5' ends of open reading frames.

    PubMed

    Behrens, R; Hayles, J; Nurse, P

    2000-12-01

    Target site selection of transposable elements is usually not random but involves some specificity for a DNA sequence or a DNA binding host factor. We have investigated the target site selection of the long terminal repeat-containing retrotransposon Tf1 from the fission yeast Schizosaccharomyces pombe. By monitoring induced transposition events we found that Tf1 integration sites were distributed throughout the genome. Mapping these insertions revealed that Tf1 did not integrate into open reading frames, but occurred preferentially in longer intergenic regions with integration biased towards a region 100-420 bp upstream of the translation start site. Northern blot analysis showed that transcription of genes adjacent to Tf1 insertions was not significantly changed.

  7. Fission yeast retrotransposon Tf1 integration is targeted to 5′ ends of open reading frames

    PubMed Central

    Behrens, Ralf; Hayles, Jacky; Nurse, Paul

    2000-01-01

    Target site selection of transposable elements is usually not random but involves some specificity for a DNA sequence or a DNA binding host factor. We have investigated the target site selection of the long terminal repeat-containing retrotransposon Tf1 from the fission yeast Schizosaccharomyces pombe. By monitoring induced transposition events we found that Tf1 integration sites were distributed throughout the genome. Mapping these insertions revealed that Tf1 did not integrate into open reading frames, but occurred preferentially in longer intergenic regions with integration biased towards a region 100–420 bp upstream of the translation start site. Northern blot analysis showed that transcription of genes adjacent to Tf1 insertions was not significantly changed. PMID:11095681

  8. Molecular characterization, genomic distribution and evolutionary dynamics of Short INterspersed Elements in the termite genome.

    PubMed

    Luchetti, Andrea; Mantovani, Barbara

    2011-02-01

    Short INterspersed Elements (SINEs) in invertebrates, and especially in animal inbred genomes such that of termites, are poorly known; in this paper we characterize three new SINE families (Talub, Taluc and Talud) through the analyses of 341 sequences, either isolated from the Reticulitermes lucifugus genome or drawn from EST Genbank collection. We further add new data to the only isopteran element known so far, Talua. These SINEs are tRNA-derived elements, with an average length ranging from 258 to 372 bp. The tails are made up by poly(A) or microsatellite motifs. Their copy number varies from 7.9 × 10(3) to 10(5) copies, well within the range observed for other metazoan genomes. Species distribution, age and target site duplication analysis indicate Talud as the oldest, possibly inactive SINE originated before the onset of Isoptera (~150 Myr ago). Taluc underwent to substantial sequence changes throughout the evolution of termites and data suggest it was silenced and then re-activated in the R. lucifugus lineage. Moreover, Taluc shares a conserved sequence block with other unrelated SINEs, as observed for some vertebrate and cephalopod elements. The study of genomic environment showed that insertions are mainly surrounded by microsatellites and other SINEs, indicating a biased accumulation within non-coding regions. The evolutionary dynamics of Talu~ elements is explained through selective mechanisms acting in an inbred genome; in this respect, the study of termites' SINEs activity may provide an interesting framework to address the (co)evolution of mobile elements and the host genome.

  9. Prediction of the translocon-mediated membrane insertion free energies of protein sequences.

    PubMed

    Park, Yungki; Helms, Volkhard

    2008-05-15

    Helical membrane proteins (HMPs) play crucial roles in a variety of cellular processes. Unlike water-soluble proteins, HMPs need not only to fold but also get inserted into the membrane to be fully functional. This process of membrane insertion is mediated by the translocon complex. Thus, it is of great interest to develop computational methods for predicting the translocon-mediated membrane insertion free energies of protein sequences. We have developed Membrane Insertion (MINS), a novel sequence-based computational method for predicting the membrane insertion free energies of protein sequences. A benchmark test gives a correlation coefficient of 0.74 between predicted and observed free energies for 357 known cases, which corresponds to a mean unsigned error of 0.41 kcal/mol. These results are significantly better than those obtained by traditional hydropathy analysis. Moreover, the ability of MINS to reasonably predict membrane insertion free energies of protein sequences allows for effective identification of transmembrane (TM) segments. Subsequently, MINS was applied to predict the membrane insertion free energies of 316 TM segments found in known structures. An in-depth analysis of the predicted free energies reveals a number of interesting findings about the biogenesis and structural stability of HMPs. A web server for MINS is available at http://service.bioinformatik.uni-saarland.de/mins

  10. Short- and Long-term Evolutionary Dynamics of Bacterial Insertion Sequences: Insights from Wolbachia Endosymbionts

    PubMed Central

    Cerveau, Nicolas; Leclercq, Sébastien; Leroy, Elodie; Bouchon, Didier; Cordaux, Richard

    2011-01-01

    Transposable elements (TE) are one of the major driving forces of genome evolution, raising the question of the long-term dynamics underlying their evolutionary success. Long-term TE evolution can readily be reconstructed in eukaryotes, thanks to many degraded copies constituting genomic fossil records of past TE proliferations. By contrast, bacterial genomes usually experience high sequence turnover and short TE retention times, thereby obscuring ancient TE evolutionary patterns. We found that Wolbachia bacterial genomes contain 52–171 insertion sequence (IS) TEs. IS account for 11% of Wolbachia wRi, which is one of the highest IS genomic coverage reported in prokaryotes to date. We show that many IS groups are currently expanding in various Wolbachia genomes and that IS horizontal transfers are frequent among strains, which can explain the apparent synchronicity of these IS proliferations. Remarkably, >70% of Wolbachia IS are nonfunctional. They constitute an unusual bacterial IS genomic fossil record providing direct empirical evidence for a long-term IS evolutionary dynamics following successive periods of intense transpositional activity. Our results show that comprehensive IS annotations have the potential to provide new insights into prokaryote TE evolution and, more generally, prokaryote genome evolution. Indeed, the identification of an important IS genomic fossil record in Wolbachia demonstrates that IS elements are not always of recent origin, contrary to the conventional view of TE evolution in prokaryote genomes. Our results also raise the question whether the abundance of IS fossils is specific to Wolbachia or it may be a general, albeit overlooked, feature of prokaryote genomes. PMID:21940637

  11. Short- and long-term evolutionary dynamics of bacterial insertion sequences: insights from Wolbachia endosymbionts.

    PubMed

    Cerveau, Nicolas; Leclercq, Sébastien; Leroy, Elodie; Bouchon, Didier; Cordaux, Richard

    2011-01-01

    Transposable elements (TE) are one of the major driving forces of genome evolution, raising the question of the long-term dynamics underlying their evolutionary success. Long-term TE evolution can readily be reconstructed in eukaryotes, thanks to many degraded copies constituting genomic fossil records of past TE proliferations. By contrast, bacterial genomes usually experience high sequence turnover and short TE retention times, thereby obscuring ancient TE evolutionary patterns. We found that Wolbachia bacterial genomes contain 52-171 insertion sequence (IS) TEs. IS account for 11% of Wolbachia wRi, which is one of the highest IS genomic coverage reported in prokaryotes to date. We show that many IS groups are currently expanding in various Wolbachia genomes and that IS horizontal transfers are frequent among strains, which can explain the apparent synchronicity of these IS proliferations. Remarkably, >70% of Wolbachia IS are nonfunctional. They constitute an unusual bacterial IS genomic fossil record providing direct empirical evidence for a long-term IS evolutionary dynamics following successive periods of intense transpositional activity. Our results show that comprehensive IS annotations have the potential to provide new insights into prokaryote TE evolution and, more generally, prokaryote genome evolution. Indeed, the identification of an important IS genomic fossil record in Wolbachia demonstrates that IS elements are not always of recent origin, contrary to the conventional view of TE evolution in prokaryote genomes. Our results also raise the question whether the abundance of IS fossils is specific to Wolbachia or it may be a general, albeit overlooked, feature of prokaryote genomes.

  12. Reprogramming somatic cells into iPS cells activates LINE-1 retroelement mobility

    PubMed Central

    Wissing, Silke; Muñoz-Lopez, Martin; Macia, Angela; Yang, Zhiyuan; Montano, Mauricio; Collins, William; Garcia-Perez, Jose Luis; Moran, John V.; Greene, Warner C.

    2012-01-01

    Long interspersed element-1 (LINE-1 or L1) retrotransposons account for nearly 17% of human genomic DNA and represent a major evolutionary force that has reshaped the structure and function of the human genome. However, questions remain concerning both the frequency and the developmental timing of L1 retrotransposition in vivo and whether the mobility of these retroelements commonly results in insertional and post-insertional mechanisms of genomic injury. Cells exhibiting high rates of L1 retrotransposition might be especially at risk for such injury. We assessed L1 mRNA expression and L1 retrotransposition in two biologically relevant cell types, human embryonic stem cells (hESCs) and induced pluripotent stem cells (iPSCs), as well as in control parental human dermal fibroblasts (HDFs). Full-length L1 mRNA and the L1 open reading frame 1-encoded protein (ORF1p) were readily detected in hESCs and iPSCs, but not in HDFs. Sequencing analysis proved the expression of human-specific L1 element mRNAs in iPSCs. Bisulfite sequencing revealed that the increased L1 expression observed in iPSCs correlates with an overall decrease in CpG methylation in the L1 promoter region. Finally, retrotransposition of an engineered human L1 element was ∼10-fold more efficient in iPSCs than in parental HDFs. These findings indicate that somatic cell reprogramming is associated with marked increases in L1 expression and perhaps increases in endogenous L1 retrotransposition, which could potentially impact the genomic integrity of the resultant iPSCs. PMID:21989055

  13. Identification by Subtractive Hybridization of a Novel Insertion Sequence Specific for Virulent Strains of Porphyromonas gingivalis

    PubMed Central

    Sawada, Koichi; Kokeguchi, Susumu; Hongyo, Hiroshi; Sawada, Satoko; Miyamoto, Manabu; Maeda, Hiroshi; Nishimura, Fusanori; Takashiba, Shogo; Murayama, Yoji

    1999-01-01

    Subtractive hybridization was employed to isolate specific genes from virulent Porphyromonas gingivalis strains that are possibly related to abscess formation. The genomic DNA from the virulent strain P. gingivalis W83 was subtracted with DNA from the avirulent strain ATCC 33277. Three clones unique to strain W83 were isolated and sequenced. The cloned DNA fragments were 885, 369, and 132 bp and had slight homology with only Bacillus stearothermophilus IS5377, which is a putative transposase. The regions flanking the cloned DNA fragments were isolated and sequenced, and the gene structure around the clones was revealed. These three clones were located side-by-side in a gene reported as an outer membrane protein. The three clones interrupt the open reading frame of the outer membrane protein gene. This inserted DNA, consisting of three isolated clones, was designated IS1598, which was 1,396 bp (i.e., a 1,158-bp open reading frame) in length and was flanked by 16-bp terminal inverted repeats and a 9-bp duplicated target sequence. IS1598 was detected in P. gingivalis W83, W50, and FDC 381 by Southern hybridization. All three P. gingivalis strains have been shown to possess abscess-forming ability in animal models. However, IS1598 was not detected in avirulent strains of P. gingivalis, including ATCC 33277. The IS1598 may interrupt the synthesis of the outer membrane protein, resulting in changes in the structure of the bacterial outer membrane. The IS1598 isolated in this study is a novel insertion element which might be a specific marker for virulent P. gingivalis strains. PMID:10531208

  14. Endogenous Retroviruses: With Us and Against Us

    NASA Astrophysics Data System (ADS)

    Meyer, Thomas J.; Rosenkrantz, Jimi L.; Carbone, Lucia; Chavez, Shawn L.

    2017-04-01

    Mammalian genomes are scattered with thousands of copies of endogenous retroviruses (ERVs), mobile genetic elements that are relics of ancient retroviral infections. After inserting copies into the germ line of a host, most ERVs accumulate mutations that prevent the normal assembly of infectious viral particles, becoming trapped in host genomes and unable to leave to infect other cells. While most copies of ERVs are inactive, some are transcribed and encode the proteins needed to generate new insertions at novel loci. In some cases, old copies are removed via recombination and other mechanisms. This creates a shifting landscape of ERV copies within host genomes. New insertions can disrupt normal expression of nearby genes via directly inserting into key regulatory elements or by containing regulatory motifs within their sequences. Further, the transcriptional silencing of ERVs via epigenetic modification may result in changes to the epigenetic regulation of adjacent genes. In these ways, ERVs can be potent sources of regulatory disruption as well as genetic innovation. Here, we provide a brief review of the association between ERVs and gene expression, especially as observed in pre-implantation development and placentation. Moreover, we will describe the roles ERVs may play in somatic tissues, mostly in the context of human disease, including cancer, neurodegenerative disorders, and schizophrenia. Lastly, we discuss the recent discovery that some ERVs may have been pressed into the service of their host genomes to aid in the innate immune response to exogenous viral infections.

  15. An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster: the Adh region.

    PubMed Central

    Ashburner, M; Misra, S; Roote, J; Lewis, S E; Blazej, R; Davis, T; Doyle, C; Galle, R; George, R; Harris, N; Hartzell, G; Harvey, D; Hong, L; Houston, K; Hoskins, R; Johnson, G; Martin, C; Moshrefi, A; Palazzolo, M; Reese, M G; Spradling, A; Tsang, G; Wan, K; Whitelaw, K; Celniker, S

    1999-01-01

    A contiguous sequence of nearly 3 Mb from the genome of Drosophila melanogaster has been sequenced from a series of overlapping P1 and BAC clones. This region covers 69 chromosome polytene bands on chromosome arm 2L, including the genetically well-characterized "Adh region." A computational analysis of the sequence predicts 218 protein-coding genes, 11 tRNAs, and 17 transposable element sequences. At least 38 of the protein-coding genes are arranged in clusters of from 2 to 6 closely related genes, suggesting extensive tandem duplication. The gene density is one protein-coding gene every 13 kb; the transposable element density is one element every 171 kb. Of 73 genes in this region identified by genetic analysis, 49 have been located on the sequence; P-element insertions have been mapped to 43 genes. Ninety-five (44%) of the known and predicted genes match a Drosophila EST, and 144 (66%) have clear similarities to proteins in other organisms. Genes known to have mutant phenotypes are more likely to be represented in cDNA libraries, and far more likely to have products similar to proteins of other organisms, than are genes with no known mutant phenotype. Over 650 chromosome aberration breakpoints map to this chromosome region, and their nonrandom distribution on the genetic map reflects variation in gene spacing on the DNA. This is the first large-scale analysis of the genome of D. melanogaster at the sequence level. In addition to the direct results obtained, this analysis has allowed us to develop and test methods that will be needed to interpret the complete sequence of the genome of this species.Before beginning a Hunt, it is wise to ask someone what you are looking for before you begin looking for it. Milne 1926 PMID:10471707

  16. Cell type-specific termination of transcription by transposable element sequences.

    PubMed

    Conley, Andrew B; Jordan, I King

    2012-09-30

    Transposable elements (TEs) encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS) genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3' UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are evolutionarily young. The extent of transcription termination by TEs seen here, along with the preference for sense-oriented TE insertions to provide TTS, is consistent with the observed antisense orientation bias of human TEs.

  17. Genomic sequences of murine gamma B- and gamma C-crystallin-encoding genes: promoter analysis and complete evolutionary pattern of mouse, rat and human gamma-crystallins.

    PubMed

    Graw, J; Liebstein, A; Pietrowski, D; Schmitt-John, T; Werner, T

    1993-12-22

    The murine genes, gamma B-cry and gamma C-cry, encoding the gamma B- and gamma C-crystallins, were isolated from a genomic DNA library. The complete nucleotide (nt) sequences of both genes were determined from 661 and 711 bp, respectively, upstream from the first exon to the corresponding polyadenylation sites, comprising more than 2650 and 2890 bp, respectively. The new sequences were compared to the partial cDNA sequences available for the murine gamma B-cry and gamma C-cry, as well as to the corresponding genomic sequences from rat and man, at both the nt and predicted amino acid (aa) sequence levels. In the gamma B-cry promoter region, a canonical CCAAT-box, a TATA-box, putative NF-I and C/EBP sites were detected. An R-repeat is inserted 366 bp upstream from the transcription start point. In contrast, the gamma C-cry promoter does not contain a CCAAT-box, but some other putative binding sites for transcription factors (AP-2, UBP-1, LBP-1) were located by computer analysis. The promoter regions of all six gamma-cry from mouse, rat and human, except human psi gamma F-cry, were analyzed for common sequence elements. A complex sequence element of about 70-80 bp was found in the proximal promoter, which contains a gamma-cry-specific and almost invariant sequence (crygpel) of 14 nt, and ends with the also invariant TATA-box. Within the complex sequence element, a minimum of three further features specific for the gamma A-, gamma B- and gamma D/E/F-cry genes can be defined, at least two of which were recently shown to be functional. In addition to these four sequence elements, a subtype-specific structure of inverted repeats with different-sized spacers can be deduced from the multiple sequence alignment. A phylogenetic analysis based on the promoter region, as well as the complete exon 3 of all gamma-cry from mouse, rat and man, suggests separation of only five gamma-cry subtypes (gamma A-, gamma B-, gamma C-, gamma D- and gamma E/F-cry) prior to species separation.

  18. The nonamer UUAUUUAUU is the key AU-rich sequence motif that mediates mRNA degradation.

    PubMed Central

    Zubiaga, A M; Belasco, J G; Greenberg, M E

    1995-01-01

    Labile mRNAs that encode cytokine and immediate-early gene products often contain AU-rich sequences within their 3' untranslated region (UTR). These AU-rich sequences appear to be key determinants of the short half-lives of these mRNAs, although the sequence features of these elements and the mechanism by which they target mRNAs for rapid decay have not been fully defined. We have examined the features of AU-rich elements (AREs) that are crucial for their function as determinants of mRNA instability in mammalian cells by testing the ability of various mutant c-fos AREs and synthetic AREs to direct rapid mRNA deadenylation and decay when inserted within the 3' UTR of the normally stable beta-globin mRNA. Evidence is presented that the pentamer AUUUA, which previously was suggested to be the minimal determinant of instability present in mammalian AREs, cannot direct rapid mRNA deadenylation and decay. Instead, the nonomer UUAUUUAUU is the elemental AU-rich sequence motif that destabilizes mRNA. Removal of one uridine residue from either end of the nonamer (UUAUUUAU or UAUUUAUU) results in a decrease of potency of the element, while removal of a uridine residue from both ends of the nonamer (UAUUUAU) eliminates detectable destabilizing activity. The inclusion of an additional uridine residue at both ends of the nonamer (UUUAUUUAUUU) does not further increase the efficacy of the element. Taken together, these findings suggest that the nonamer UUAUUUAUU is the minimal AU-rich motif that effectively destabilizes mRNA. Additional ARE potency is achieved by combining multiple copies of this nonamer in a single mRNA 3' UTR. Furthermore, analysis of poly(A) shortening rates for ARE-containing mRNAs reveals that the UUAUUUAUU sequence also accelerates mRNA deadenylation and suggests that the UUAUUUAUU motif targets mRNA for rapid deadenylation as an early step in the mRNA decay process. PMID:7891716

  19. Human population-specific gene expression and transcriptional network modification with polymorphic transposable elements

    PubMed Central

    Wang, Lu; Mariño-Ramírez, Leonardo

    2017-01-01

    Abstract Transposable element (TE) derived sequences are known to contribute to the regulation of the human genome. The majority of known TE-derived regulatory sequences correspond to relatively ancient insertions, which are fixed across human populations. The extent to which human genetic variation caused by recent TE activity leads to regulatory polymorphisms among populations has yet to be thoroughly explored. In this study, we searched for associations between polymorphic TE (polyTE) loci and human gene expression levels using an expression quantitative trait loci (eQTL) approach. We compared locus-specific polyTE insertion genotypes to B cell gene expression levels among 445 individuals from 5 human populations. Numerous human polyTE loci correspond to both cis and trans eQTL, and their regulatory effects are directly related to cell type-specific function in the immune system. PolyTE loci are associated with differences in expression between European and African population groups, and a single polyTE loci is indirectly associated with the expression of numerous genes via the regulation of the B cell-specific transcription factor PAX5. The polyTE-gene expression associations we found indicate that human TE genetic variation can have important phenotypic consequences. Our results reveal that TE-eQTL are involved in population-specific gene regulation as well as transcriptional network modification. PMID:27998931

  20. A cDNA from a mouse pancreatic beta cell encoding a putative transcription factor of the insulin gene.

    PubMed Central

    Walker, M D; Park, C W; Rosen, A; Aronheim, A

    1990-01-01

    Cell specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to 5-10 fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. Images PMID:2181401

  1. Recombinations in Staphylococcal Cassette Chromosome mec Elements Compromise the Molecular Detection of Methicillin Resistance in Staphylococcus aureus

    PubMed Central

    Hill-Cawthorne, Grant A.; Hudson, Lyndsey O.; El Ghany, Moataz Fouad Abd; Piepenburg, Olaf; Nair, Mridul; Dodgson, Andrew; Forrest, Matthew S.

    2014-01-01

    Clinical laboratories are increasingly using molecular tests for methicillin-resistant Staphylococcus aureus (MRSA) screening. However, primers have to be targeted to a variable chromosomal region, the staphylococcal cassette chromosome mec (SCCmec). We initially screened 726 MRSA isolates from a single UK hospital trust by recombinase polymerase amplification (RPA), a novel, isothermal alternative to PCR. Undetected isolates were further characterised using multilocus sequence, spa typing and whole genome sequencing. 96% of our tested phenotypically MRSA isolates contained one of the six orfX-SCCmec junctions our RPA test and commercially available molecular tests target. However 30 isolates could not be detected. Sequencing of 24 of these isolates demonstrated recombinations within the SCCmec element with novel insertions that interfered with the RPA, preventing identification as MRSA. This result suggests that clinical laboratories cannot rely solely upon molecular assays to reliably detect all methicillin-resistance. The presence of significant recombinations in the SCCmec element, where the majority of assays target their primers, suggests that there will continue to be isolates that escape identification. We caution that dependence on amplification-based molecular assays will continue to result in failure to diagnose a small proportion (∼4%) of MRSA isolates, unless the true level of SCCmec natural diversity is determined by whole genome sequencing of a large collection of MRSA isolates. PMID:24972080

  2. Biased insert for installing data transmission components in downhole drilling pipe

    DOEpatents

    Hall, David R [Provo, UT; Briscoe, Michael A [Lehi, UT; Garner, Kory K [Payson, UT; Wilde, Tyson J [Spanish Fork, UT

    2007-04-10

    An apparatus for installing data transmission hardware in downhole tools includes an insert insertable into the box end or pin end of drill tool, such as a section of drill pipe. The insert typically includes a mount portion and a slide portion. A data transmission element is mounted in the slide portion of the insert. A biasing element is installed between the mount portion and the slide portion and is configured to create a bias between the slide portion and the mount portion. This biasing element is configured to compensate for varying tolerances encountered in different types of downhole tools. In selected embodiments, the biasing element is an elastomeric material, a spring, compressed gas, or a combination thereof.

  3. An active ac/ds transposon system for activation tagging in tomato cultivar m82 using clonal propagation.

    PubMed

    Carter, Jared D; Pereira, Andy; Dickerman, Allan W; Veilleux, Richard E

    2013-05-01

    Tomato (Solanum lycopersicum) is a model organism for Solanaceae in both molecular and agronomic research. This project utilized Agrobacterium tumefaciens transformation and the transposon-tagging construct Activator (Ac)/Dissociator (Ds)-ATag-Bar_gosGFP to produce activation-tagged and knockout mutants in the processing tomato cultivar M82. The construct carried hygromycin resistance (hyg), green fluorescent protein (GFP), and the transposase (TPase) of maize (Zea mays) Activator major transcript X054214.1 on the stable Ac element, along with a 35S enhancer tetramer and glufosinate herbicide resistance (BAR) on the mobile Ds-ATag element. An in vitro propagation strategy was used to produce a population of 25 T0 plants from a single transformed plant regenerated in tissue culture. A T1 population of 11,000 selfed and cv M82 backcrossed progeny was produced from the functional T0 line. This population was screened using glufosinate herbicide, hygromycin leaf painting, and multiplex polymerase chain reaction (PCR). Insertion sites of transposed Ds-ATag elements were identified through thermal asymmetric interlaced PCR, and resulting product sequences were aligned to the recently published tomato genome. A population of 509 independent, Ds-only transposant lines spanning all 12 tomato chromosomes has been developed. Insertion site analysis demonstrated that more than 80% of these lines harbored Ds insertions conducive to activation tagging. The capacity of the Ds-ATag element to alter transcription was verified by quantitative real-time reverse transcription-PCR in two mutant lines. The transposon-tagged lines have been immortalized in seed stocks and can be accessed through an online database, providing a unique resource for tomato breeding and analysis of gene function in the background of a commercial tomato cultivar.

  4. Dissemination of Novel Antimicrobial Resistance Mechanisms through the Insertion Sequence Mediated Spread of Metabolic Genes

    PubMed Central

    Furi, Leonardo; Haigh, Richard; Al Jabri, Zaaima J. H.; Morrissey, Ian; Ou, Hong-Yu; León-Sampedro, Ricardo; Martinez, Jose L.; Coque, Teresa M.; Oggioni, Marco R.

    2016-01-01

    The widely used biocide triclosan selectively targets FabI, the NADH-dependent trans-2-enoyl-acyl carrier protein (ACP) reductase, which is also an important target for the development of narrow spectrum antibiotics. The analysis of triclosan resistant Staphylococcus aureus isolates had previously shown that in about half of the strains, the mechanism of triclosan resistance consists on the heterologous duplication of the triclosan target gene due to the acquisition of an additional fabI allele derived from Staphylococcus haemolyticus (sh-fabI). In the current work, the genomic sequencing of 10 of these strains allowed the characterization of two novel composite transposons TnSha1 and TnSha2 involved in the spread of sh-fabI. TnSha1 harbors one copy of IS1272, whereas TnSha2 is a 11.7 kb plasmid carrying TnSha1 present either as plasmid or in an integrated form generally flanked by two IS1272 elements. The target and mechanism of integration for IS1272 and TnSha1 are novel and include targeting of DNA secondary structures, generation of blunt-end deletions of the stem-loop and absence of target duplication. Database analyses showed widespread occurrence of these two elements in chromosomes and plasmids, with TnSha1 mainly in S. aureus and with TnSha2 mainly in S. haemolyticus and S. epidermidis. The acquisition of resistance by means of an insertion sequence-based mobilization and consequent duplication of drug-target metabolic genes, as observed here for sh-fabI, is highly reminiscent of the situation with the ileS2 gene conferring mupirocin resistance, and the dfrA and dfrG genes conferring trimethoprim resistance both of which are mobilized by IS257. These three examples, which show similar mechanisms and levels of spread of metabolic genes linked to IS elements, highlight the importance of this genetic strategy for recruitment and rapid distribution of novel resistance mechanisms in staphylococci. PMID:27446047

  5. Genome-based insights into the resistome and mobilome of multidrug-resistant Aeromonas sp. ARM81 isolated from wastewater.

    PubMed

    Adamczuk, Marcin; Dziewit, Lukasz

    2017-01-01

    The draft genome of multidrug-resistant Aeromonas sp. ARM81 isolated from a wastewater treatment plant in Warsaw (Poland) was obtained. Sequence analysis revealed multiple genes conferring resistance to aminoglycosides, β-lactams or tetracycline. Three different β-lactamase genes were identified, including an extended-spectrum β-lactamase gene bla PER-1 . The antibiotic susceptibility was experimentally tested. Genome sequencing also allowed us to investigate the plasmidome and transposable mobilome of ARM81. Four plasmids, of which two carry phenotypic modules (i.e., genes encoding a zinc transporter ZitB and a putative glucosyltransferase), and 28 putative transposase genes were identified. The mobility of three insertion sequences (isoforms of previously identified elements ISAs12, ISKpn9 and ISAs26) was confirmed using trap plasmids.

  6. Retroelements (LINEs and SINEs) in vole genomes: differential distribution in the constitutive heterochromatin.

    PubMed

    Acosta, M J; Marchal, J A; Fernández-Espartero, C H; Bullejos, M; Sánchez, A

    2008-01-01

    The chromosomal distribution of mobile genetic elements is scarcely known in Arvicolinae species, but could be of relevance to understand the origin and complex evolution of the sex chromosome heterochromatin. In this work we cloned two retrotransposon sequences, L1 and SINE-B1, from the genome of Chionomys nivalis and investigated their chromosomal distribution on several arvicoline species. Our results demonstrate first that both retroelements are the most abundant repeated DNA sequences in the genome of these species. L1 elements, in most species, are highly accumulated in the sex chromosomes compared to the autosomes. This favoured L1 insertion could have played an important role in the origin of the enlarged heterochromatic blocks existing in the sex chromosomes of some Microtus species. Also, we propose that L1 accumulation on the X heterochromatin could have been the consequence of different, independent and rapid amplification processes acting in each species. SINE elements, however, were completely lacking from the constitutive heterochromatin, either in autosomes or in the heterochromatic blocks of sex chromosomes. These data could indicate that some SINE elements are incompatible with the formation of heterochromatic complexes and hence are necessarily missing from the constitutive heterochromatin.

  7. High-efficiency transformation of Pichia stipitis based on its URA3 gene and a homologous autonomous replication sequence, ARS2.

    PubMed Central

    Yang, V W; Marks, J A; Davis, B P; Jeffries, T W

    1994-01-01

    This paper describes the first high-efficiency transformation system for the xylose-fermenting yeast Pichia stipitis. The system includes integrating and autonomously replicating plasmids based on the gene for orotidine-5'-phosphate decarboxylase (URA3) and an autonomous replicating sequence (ARS) element (ARS2) isolated from P. stipitis CBS 6054. Ura- auxotrophs were obtained by selecting for resistance to 5-fluoroorotic acid and were identified as ura3 mutants by transformation with P. stipitis URA3. P. stipitis URA3 was cloned by its homology to Saccharomyces cerevisiae URA3, with which it is 69% identical in the coding region. P. stipitis ARS elements were cloned functionally through plasmid rescue. These sequences confer autonomous replication when cloned into vectors bearing the P. stipitis URA3 gene. P. stipitis ARS2 has features similar to those of the consensus ARS of S. cerevisiae and other ARS elements. Circular plasmids bearing the P. stipitis URA3 gene with various amounts of flanking sequences produced 600 to 8,600 Ura+ transformants per micrograms of DNA by electroporation. Most transformants obtained with circular vectors arose without integration of vector sequences. One vector yielded 5,200 to 12,500 Ura+ transformants per micrograms of DNA after it was linearized at various restriction enzyme sites within the P. stipitis URA3 insert. Transformants arising from linearized vectors produced stable integrants, and integration events were site specific for the genomic ura3 in 20% of the transformants examined. Plasmids bearing the P. stipitis URA3 gene and ARS2 element produced more than 30,000 transformants per micrograms of plasmid DNA. Autonomously replicating plasmids were stable for at least 50 generations in selection medium and were present at an average of 10 copies per nucleus. Images PMID:7811063

  8. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana)

    PubMed Central

    2010-01-01

    Background Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. Conclusions A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana. PMID:20637079

  9. Dynamic evolution at pericentromeres.

    PubMed

    Hall, Anne E; Kettler, Gregory C; Preuss, Daphne

    2006-03-01

    Pericentromeres are exceptional genomic regions: in animals they contain extensive segmental duplications implicated in gene creation, and in plants they sustain rearrangements and insertions uncommon in euchromatin. To examine the mechanisms and patterns of plant pericentromere evolution, we compared pericentromere sequence from four Brassicaceae species separated by <15 million years (Myr). This flowering plant family is ideal for studying relationships between genome reorganization and pericentromere evolution-its members have undergone recent polyploidization and hybridization, with close relatives changing in genome size and chromosome number. Through sequence and hybridization analyses, we examined regions from Arabidopsis arenosa, Capsella rubella, and Olimarabidopsis pumila that are homologous to Arabidopsis thaliana pericentromeres (peri-CENs) III and V, and used FISH to demonstrate they have been maintained near centromere satellite arrays in each species. Sequence analysis revealed a set of highly conserved genes, yet we discovered substantial differences in intergenic length and species-specific changes in sequence content and gene density. We discovered that A. thaliana has undergone recent, significant expansions within its pericentromeres, in some cases measuring hundreds of kilobases; these findings are in marked contrast to euchromatic segments in these species that exhibit only minor length changes. While plant pericentromeres do contain some duplications, we did not find evidence of extensive segmental duplications, as has been documented in primates. Our data support a model in which plant pericentromeres may experience selective pressures distinct from euchromatin, tolerating rapid, dynamic changes in structure and sequence content, including large insertions of mobile elements, 5S rDNA arrays and pseudogenes.

  10. MOSAIK: a hash-based algorithm for accurate next-generation sequencing short-read mapping.

    PubMed

    Lee, Wan-Ping; Stromberg, Michael P; Ward, Alistair; Stewart, Chip; Garrison, Erik P; Marth, Gabor T

    2014-01-01

    MOSAIK is a stable, sensitive and open-source program for mapping second and third-generation sequencing reads to a reference genome. Uniquely among current mapping tools, MOSAIK can align reads generated by all the major sequencing technologies, including Illumina, Applied Biosystems SOLiD, Roche 454, Ion Torrent and Pacific BioSciences SMRT. Indeed, MOSAIK was the only aligner to provide consistent mappings for all the generated data (sequencing technologies, low-coverage and exome) in the 1000 Genomes Project. To provide highly accurate alignments, MOSAIK employs a hash clustering strategy coupled with the Smith-Waterman algorithm. This method is well-suited to capture mismatches as well as short insertions and deletions. To support the growing interest in larger structural variant (SV) discovery, MOSAIK provides explicit support for handling known-sequence SVs, e.g. mobile element insertions (MEIs) as well as generating outputs tailored to aid in SV discovery. All variant discovery benefits from an accurate description of the read placement confidence. To this end, MOSAIK uses a neural-network based training scheme to provide well-calibrated mapping quality scores, demonstrated by a correlation coefficient between MOSAIK assigned and actual mapping qualities greater than 0.98. In order to ensure that studies of any genome are supported, a training pipeline is provided to ensure optimal mapping quality scores for the genome under investigation. MOSAIK is multi-threaded, open source, and incorporated into our command and pipeline launcher system GKNO (http://gkno.me).

  11. MOSAIK: A Hash-Based Algorithm for Accurate Next-Generation Sequencing Short-Read Mapping

    PubMed Central

    Lee, Wan-Ping; Stromberg, Michael P.; Ward, Alistair; Stewart, Chip; Garrison, Erik P.; Marth, Gabor T.

    2014-01-01

    MOSAIK is a stable, sensitive and open-source program for mapping second and third-generation sequencing reads to a reference genome. Uniquely among current mapping tools, MOSAIK can align reads generated by all the major sequencing technologies, including Illumina, Applied Biosystems SOLiD, Roche 454, Ion Torrent and Pacific BioSciences SMRT. Indeed, MOSAIK was the only aligner to provide consistent mappings for all the generated data (sequencing technologies, low-coverage and exome) in the 1000 Genomes Project. To provide highly accurate alignments, MOSAIK employs a hash clustering strategy coupled with the Smith-Waterman algorithm. This method is well-suited to capture mismatches as well as short insertions and deletions. To support the growing interest in larger structural variant (SV) discovery, MOSAIK provides explicit support for handling known-sequence SVs, e.g. mobile element insertions (MEIs) as well as generating outputs tailored to aid in SV discovery. All variant discovery benefits from an accurate description of the read placement confidence. To this end, MOSAIK uses a neural-network based training scheme to provide well-calibrated mapping quality scores, demonstrated by a correlation coefficient between MOSAIK assigned and actual mapping qualities greater than 0.98. In order to ensure that studies of any genome are supported, a training pipeline is provided to ensure optimal mapping quality scores for the genome under investigation. MOSAIK is multi-threaded, open source, and incorporated into our command and pipeline launcher system GKNO (http://gkno.me). PMID:24599324

  12. Molecular characterization of the short interspersed repetitive element SIRE in the six discrete typing units (DTUs) of Trypanosoma cruzi.

    PubMed

    Pavia, Paula X; Thomas, M Carmen; López, Manuel C; Puerta, Concepción J

    2012-10-01

    Repetitive sequences constitute an important proportion of the Trypanosoma cruzi genome; hence, they have been used as molecular markers and as amplification targets to identify the parasite presence via PCR. In this study, a molecular characterization of the SIRE repetitive element was performed in the six discrete typing units (DTUs) of T. cruzi. The results evidenced that this element, located in multiple chromosomes, was interspersed in the genome of all DTUs of the parasite. The presence of several motifs implicated in element insertion, duplication, and functionality suggests that SIRE could be an active element in the parasite genome. Of interest, there were SIRE specific Alu I fragments that allowed to discriminate DTU I from the others DTUs. Moreover, an UPGMA phenetic tree constructed from fragment sharing Southern blot data showed that T. cruzi I isolates conform a cluster separated from the T. cruzi II-VI isolates. When the relative number of SIRE copies was determined, a variation from 105 to 2,000 copies per haploid genome was observed among the different isolates without kept a DTU-relationship. In all, these findings suggest that SIRE sequence is a good target for parasite DNA amplification. Copyright © 2012 Elsevier Inc. All rights reserved.

  13. ALF: a strategy for identification of unauthorized GMOs in complex mixtures by a GW-NGS method and dedicated bioinformatics analysis.

    PubMed

    Košir, Alexandra Bogožalec; Arulandhu, Alfred J; Voorhuijzen, Marleen M; Xiao, Hongmei; Hagelaar, Rico; Staats, Martijn; Costessi, Adalberto; Žel, Jana; Kok, Esther J; Dijk, Jeroen P van

    2017-10-26

    The majority of feed products in industrialised countries contains materials derived from genetically modified organisms (GMOs). In parallel, the number of reports of unauthorised GMOs (UGMOs) is gradually increasing. There is a lack of specific detection methods for UGMOs, due to the absence of detailed sequence information and reference materials. In this research, an adapted genome walking approach was developed, called ALF: Amplification of Linearly-enriched Fragments. Coupling of ALF to NGS aims for simultaneous detection and identification of all GMOs, including UGMOs, in one sample, in a single analysis. The ALF approach was assessed on a mixture made of DNA extracts from four reference materials, in an uneven distribution, mimicking a real life situation. The complete insert and genomic flanking regions were known for three of the included GMO events, while for MON15985 only partial sequence information was available. Combined with a known organisation of elements, this GMO served as a model for a UGMO. We successfully identified sequences matching with this organisation of elements serving as proof of principle for ALF as new UGMO detection strategy. Additionally, this study provides a first outline of an automated, web-based analysis pipeline for identification of UGMOs containing known GM elements.

  14. Re-sequencing transgenic plants revealed rearrangements at T-DNA inserts, and integration of a short T-DNA fragment, but no increase of small mutations elsewhere.

    PubMed

    Schouten, Henk J; Vande Geest, Henri; Papadimitriou, Sofia; Bemer, Marian; Schaart, Jan G; Smulders, Marinus J M; Perez, Gabino Sanchez; Schijlen, Elio

    2017-03-01

    Transformation resulted in deletions and translocations at T-DNA inserts, but not in genome-wide small mutations. A tiny T-DNA splinter was detected that probably would remain undetected by conventional techniques. We investigated to which extent Agrobacterium tumefaciens-mediated transformation is mutagenic, on top of inserting T-DNA. To prevent mutations due to in vitro propagation, we applied floral dip transformation of Arabidopsis thaliana. We re-sequenced the genomes of five primary transformants, and compared these to genomic sequences derived from a pool of four wild-type plants. By genome-wide comparisons, we identified ten small mutations in the genomes of the five transgenic plants, not correlated to the positions or number of T-DNA inserts. This mutation frequency is within the range of spontaneous mutations occurring during seed propagation in A. thaliana, as determined earlier. In addition, we detected small as well as large deletions specifically at the T-DNA insert sites. Furthermore, we detected partial T-DNA inserts, one of these a tiny 50-bp fragment originating from a central part of the T-DNA construct used, inserted into the plant genome without flanking other T-DNA. Because of its small size, we named this fragment a T-DNA splinter. As far as we know this is the first report of such a small T-DNA fragment insert in absence of any T-DNA border sequence. Finally, we found evidence for translocations from other chromosomes, flanking T-DNA inserts. In this study, we showed that next-generation sequencing (NGS) is a highly sensitive approach to detect T-DNA inserts in transgenic plants.

  15. Molecular biology. Mothers setting boundaries.

    PubMed

    Thorvaldsen, J L; Bartolomei, M S

    2000-06-23

    Certain genes are only expressed at one allele, a phenomenon called imprinting. Although it is well established that one allele of certain imprinted genes is silenced through methylation, this does not appear to be the case for all imprinted genes. In a thoughtful Perspective, Thorvaldsen and Bartolomei discuss new findings showing that insertion of insulator elements (boundary regions) between the promoter of a gene and its enhancer (a sequence that boosts gene expression) may be another way in which genes are silenced during imprinting.

  16. Using PATIMDB to Create Bacterial Transposon Insertion Mutant Libraries

    PubMed Central

    Urbach, Jonathan M.; Wei, Tao; Liberati, Nicole; Grenfell-Lee, Daniel; Villanueva, Jacinto; Wu, Gang; Ausubel, Frederick M.

    2015-01-01

    PATIMDB is a software package for facilitating the generation of transposon mutant insertion libraries. The software has two main functions: process tracking and automated sequence analysis. The process tracking function specifically includes recording the status and fates of multiwell plates and samples in various stages of library construction. Automated sequence analysis refers specifically to the pipeline of sequence analysis starting with ABI files from a sequencing facility and ending with insertion location identifications. The protocols in this unit describe installation and use of PATIMDB software. PMID:19343706

  17. L1-associated genomic regions are deleted in somatic cells of the healthy human brain.

    PubMed

    Erwin, Jennifer A; Paquola, Apuã C M; Singer, Tatjana; Gallina, Iryna; Novotny, Mark; Quayle, Carolina; Bedrosian, Tracy A; Alves, Francisco I A; Butcher, Cheyenne R; Herdy, Joseph R; Sarkar, Anindita; Lasken, Roger S; Muotri, Alysson R; Gage, Fred H

    2016-12-01

    The healthy human brain is a mosaic of varied genomes. Long interspersed element-1 (LINE-1 or L1) retrotransposition is known to create mosaicism by inserting L1 sequences into new locations of somatic cell genomes. Using a machine learning-based, single-cell sequencing approach, we discovered that somatic L1-associated variants (SLAVs) are composed of two classes: L1 retrotransposition insertions and retrotransposition-independent L1-associated variants. We demonstrate that a subset of SLAVs comprises somatic deletions generated by L1 endonuclease cutting activity. Retrotransposition-independent rearrangements in inherited L1s resulted in the deletion of proximal genomic regions. These rearrangements were resolved by microhomology-mediated repair, which suggests that L1-associated genomic regions are hotspots for somatic copy number variants in the brain and therefore a heritable genetic contributor to somatic mosaicism. We demonstrate that SLAVs are present in crucial neural genes, such as DLG2 (also called PSD93), and affect 44-63% of cells of the cells in the healthy brain.

  18. Retroposon analysis of major cetacean lineages: The monophyly of toothed whales and the paraphyly of river dolphins

    PubMed Central

    Nikaido, Masato; Matsuno, Fumio; Hamilton, Healy; Brownell, Robert L.; Cao, Ying; Ding, Wang; Zuoyan, Zhu; Shedlock, Andrew M.; Fordyce, R. Ewan; Hasegawa, Masami; Okada, Norihiro

    2001-01-01

    SINE (short interspersed element) insertion analysis elucidates contentious aspects in the phylogeny of toothed whales and dolphins (Odontoceti), especially river dolphins. Here, we characterize 25 informative SINEs inserted into unique genomic loci during evolution of odontocetes to construct a cladogram, and determine a total of 2.8 kb per taxon of the flanking sequences of these SINE loci to estimate divergence times among lineages. We demonstrate that: (i) Odontocetes are monophyletic; (ii) Ganges River dolphins, beaked whales, and ocean dolphins diverged (in this order) after sperm whales; (iii) three other river dolphin taxa, namely the Amazon, La Plata, and Yangtze river dolphins, form a monophyletic group with Yangtze River dolphins being the most basal; and (iv) the rapid radiation of extant cetacean lineages occurred some 28–33 million years B.P., in strong accord with the fossil record. The combination of SINE and flanking sequence analysis suggests a topology and set of divergence times for odontocete relationships, offering alternative explanations for several long-standing problems in cetacean evolution. PMID:11416211

  19. A Truncated AdeS Kinase Protein Generated by ISAba1 Insertion Correlates with Tigecycline Resistance in Acinetobacter baumannii

    PubMed Central

    Sun, Jun-Ren; Perng, Cherng-Lih; Chan, Ming-Chin; Morita, Yuji; Lin, Jung-Chung; Su, Chih-Mao; Wang, Wei-Yao; Chang, Tein-Yao; Chiueh, Tzong-Shi

    2012-01-01

    Over-expression of AdeABC efflux pump stimulated continuously by the mutated AdeRS two component system has been found to result in antimicrobial resistance, even tigecycline (TGC) resistance, in multidrug-resistant Acinetobacter baumannii (MRAB). Although the insertion sequence, ISAba1, contributes to one of the AdeRS mutations, the detail mechanism remains unclear. In the present study we collected 130 TGC-resistant isolates from 317 carbapenem resistant MRAB (MRAB-C) isolates, and 38 of them were characterized with ISAba1 insertion in the adeS gene. The relationship between the expression of AdeABC efflux pump and TGC resistant was verified indirectly by successfully reducing TGC resistance with NMP, an efflux pump inhibitor. Further analysis showed that the remaining gene following the ISAba1 insertion was still transcribed to generate a truncated AdeS protein by the Pout promoter on ISAba1 instead of frame shift or pre-termination. Through introducing a series of recombinant adeRS constructs into a adeRS knockout strain, we demonstrated the truncated AdeS protein was constitutively produced and stimulating the expression of AdeABC efflux pump via interaction with AdeR. Our findings suggest a mechanism of antimicrobial resistance induced by an aberrant cytoplasmic sensor derived from an insertion element. PMID:23166700

  20. Comparative Genomics of the Listeria monocytogenes ST204 Subgroup

    PubMed Central

    Fox, Edward M.; Allnutt, Theodore; Bradbury, Mark I.; Fanning, Séamus; Chandry, P. Scott

    2016-01-01

    The ST204 subgroup of Listeria monocytogenes is among the most frequently isolated in Australia from a range of environmental niches. In this study we provide a comparative genomics analysis of food and food environment isolates from geographically diverse sources. Analysis of the ST204 genomes showed a highly conserved core genome with the majority of variation seen in mobile genetic elements such as plasmids, transposons and phage insertions. Most strains (13/15) harbored plasmids, which although varying in size contained highly conserved sequences. Interestingly 4 isolates contained a conserved plasmid of 91,396 bp. The strains examined were isolated over a period of 12 years and from different geographic locations suggesting plasmids are an important component of the genetic repertoire of this subgroup and may provide a range of stress tolerance mechanisms. In addition to this 4 phage insertion sites and 2 transposons were identified among isolates, including a novel transposon. These genetic elements were highly conserved across isolates that harbored them, and also contained a range of genetic markers linked to stress tolerance and virulence. The maintenance of conserved mobile genetic elements in the ST204 population suggests these elements may contribute to the diverse range of niches colonized by ST204 isolates. Environmental stress selection may contribute to maintaining these genetic features, which in turn may be co-selecting for virulence markers relevant to clinical infection with ST204 isolates. PMID:28066377

  1. Comparative Genomics of the Listeria monocytogenes ST204 Subgroup.

    PubMed

    Fox, Edward M; Allnutt, Theodore; Bradbury, Mark I; Fanning, Séamus; Chandry, P Scott

    2016-01-01

    The ST204 subgroup of Listeria monocytogenes is among the most frequently isolated in Australia from a range of environmental niches. In this study we provide a comparative genomics analysis of food and food environment isolates from geographically diverse sources. Analysis of the ST204 genomes showed a highly conserved core genome with the majority of variation seen in mobile genetic elements such as plasmids, transposons and phage insertions. Most strains (13/15) harbored plasmids, which although varying in size contained highly conserved sequences. Interestingly 4 isolates contained a conserved plasmid of 91,396 bp. The strains examined were isolated over a period of 12 years and from different geographic locations suggesting plasmids are an important component of the genetic repertoire of this subgroup and may provide a range of stress tolerance mechanisms. In addition to this 4 phage insertion sites and 2 transposons were identified among isolates, including a novel transposon. These genetic elements were highly conserved across isolates that harbored them, and also contained a range of genetic markers linked to stress tolerance and virulence. The maintenance of conserved mobile genetic elements in the ST204 population suggests these elements may contribute to the diverse range of niches colonized by ST204 isolates. Environmental stress selection may contribute to maintaining these genetic features, which in turn may be co-selecting for virulence markers relevant to clinical infection with ST204 isolates.

  2. Whale phylogeny and rapid radiation events revealed using novel retroposed elements and their flanking sequences.

    PubMed

    Chen, Zhuo; Xu, Shixia; Zhou, Kaiya; Yang, Guang

    2011-10-27

    A diversity of hypotheses have been proposed based on both morphological and molecular data to reveal phylogenetic relationships within the order Cetacea (dolphins, porpoises, and whales), and great progress has been made in the past two decades. However, there is still some controversy concerning relationships among certain cetacean taxa such as river dolphins and delphinoid species, which needs to be further addressed with more markers in an effort to address unresolved portions of the phylogeny. An analysis of additional SINE insertions and SINE-flanking sequences supported the monophyly of the order Cetacea as well as Odontocete, Delphinoidea (Delphinidae + Phocoenidae + Mondontidae), and Delphinidae. A sister relationship between Delphinidae and Phocoenidae + Mondontidae was supported, and members of classical river dolphins and the genera Tursiops and Stenella were found to be paraphyletic. Estimates of divergence times revealed rapid divergences of basal Odontocete lineages in the Oligocene and Early Miocene, and a recent rapid diversification of Delphinidae in the Middle-Late Miocene and Pliocene within a narrow time frame. Several novel SINEs were found to differentiate Delphinidae from the other two families (Monodontidae and Phocoenidae), whereas the sister grouping of the latter two families with exclusion of Delphinidae was further revealed using the SINE-flanking sequences. Interestingly, some anomalous PCR amplification patterns of SINE insertions were detected, which can be explained as the result of potential ancestral SINE polymorphisms and incomplete lineage sorting. Although a few loci were potentially anomalous, this study demonstrated that the SINE-based approach is a powerful tool in phylogenetic studies. Identifying additional SINE elements that resolve the relationships in the superfamily Delphinoidea and family Delphinidae will be important steps forward in completely resolving cetacean phylogenetic relationships in the future.

  3. Characterization of the UGA-recoding and SECIS-binding activities of SECIS-binding protein 2.

    PubMed

    Bubenik, Jodi L; Miniard, Angela C; Driscoll, Donna M

    2014-01-01

    Selenium, a micronutrient, is primarily incorporated into human physiology as selenocysteine (Sec). The 25 Sec-containing proteins in humans are known as selenoproteins. Their synthesis depends on the translational recoding of the UGA stop codon to allow Sec insertion. This requires a stem-loop structure in the 3' untranslated region of eukaryotic mRNAs known as the Selenocysteine Insertion Sequence (SECIS). The SECIS is recognized by SECIS-binding protein 2 (SBP2) and this RNA:protein interaction is essential for UGA recoding to occur. Genetic mutations cause SBP2 deficiency in humans, resulting in a broad set of symptoms due to differential effects on individual selenoproteins. Progress on understanding the different phenotypes requires developing robust tools to investigate SBP2 structure and function. In this study we demonstrate that SBP2 protein produced by in vitro translation discriminates among SECIS elements in a competitive UGA recoding assay and has a much higher specific activity than bacterially expressed protein. We also show that a purified recombinant protein encompassing amino acids 517-777 of SBP2 binds to SECIS elements with high affinity and selectivity. The affinity of the SBP2:SECIS interaction correlated with the ability of a SECIS to compete for UGA recoding activity in vitro. The identification of a 250 amino acid sequence that mediates specific, selective SECIS-binding will facilitate future structural studies of the SBP2:SECIS complex. Finally, we identify an evolutionarily conserved core cysteine signature in SBP2 sequences from the vertebrate lineage. Mutation of multiple, but not single, cysteines impaired SECIS-binding but did not affect protein localization in cells.

  4. Whale phylogeny and rapid radiation events revealed using novel retroposed elements and their flanking sequences

    PubMed Central

    2011-01-01

    Background A diversity of hypotheses have been proposed based on both morphological and molecular data to reveal phylogenetic relationships within the order Cetacea (dolphins, porpoises, and whales), and great progress has been made in the past two decades. However, there is still some controversy concerning relationships among certain cetacean taxa such as river dolphins and delphinoid species, which needs to be further addressed with more markers in an effort to address unresolved portions of the phylogeny. Results An analysis of additional SINE insertions and SINE-flanking sequences supported the monophyly of the order Cetacea as well as Odontocete, Delphinoidea (Delphinidae + Phocoenidae + Mondontidae), and Delphinidae. A sister relationship between Delphinidae and Phocoenidae + Mondontidae was supported, and members of classical river dolphins and the genera Tursiops and Stenella were found to be paraphyletic. Estimates of divergence times revealed rapid divergences of basal Odontocete lineages in the Oligocene and Early Miocene, and a recent rapid diversification of Delphinidae in the Middle-Late Miocene and Pliocene within a narrow time frame. Conclusions Several novel SINEs were found to differentiate Delphinidae from the other two families (Monodontidae and Phocoenidae), whereas the sister grouping of the latter two families with exclusion of Delphinidae was further revealed using the SINE-flanking sequences. Interestingly, some anomalous PCR amplification patterns of SINE insertions were detected, which can be explained as the result of potential ancestral SINE polymorphisms and incomplete lineage sorting. Although a few loci were potentially anomalous, this study demonstrated that the SINE-based approach is a powerful tool in phylogenetic studies. Identifying additional SINE elements that resolve the relationships in the superfamily Delphinoidea and family Delphinidae will be important steps forward in completely resolving cetacean phylogenetic relationships in the future. PMID:22029548

  5. Enhancing the GABI-Kat Arabidopsis thaliana T-DNA Insertion Mutant Database by Incorporating Araport11 Annotation.

    PubMed

    Kleinboelting, Nils; Huep, Gunnar; Weisshaar, Bernd

    2017-01-01

    SimpleSearch provides access to a database containing information about T-DNA insertion lines of the GABI-Kat collection of Arabidopsis thaliana mutants. These mutants are an important tool for reverse genetics, and GABI-Kat is the second largest collection of such T-DNA insertion mutants. Insertion sites were deduced from flanking sequence tags (FSTs), and the database contains information about mutant plant lines as well as insertion alleles. Here, we describe improvements within the interface (available at http://www.gabi-kat.de/db/genehits.php) and with regard to the database content that have been realized in the last five years. These improvements include the integration of the Araport11 genome sequence annotation data containing the recently updated A. thaliana structural gene descriptions, an updated visualization component that displays groups of insertions with very similar insertion positions, mapped confirmation sequences, and primers. The visualization component provides a quick way to identify insertions of interest, and access to improved data about the exact structure of confirmed insertion alleles. In addition, the database content has been extended by incorporating additional insertion alleles that were detected during the confirmation process, as well as by adding new FSTs that have been produced during continued efforts to complement gaps in FST availability. Finally, the current database content regarding predicted and confirmed insertion alleles as well as primer sequences has been made available as downloadable flat files. © The Author 2016. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists.

  6. Method of modifying a volume mesh using sheet insertion

    DOEpatents

    Borden, Michael J [Albuquerque, NM; Shepherd, Jason F [Albuquerque, NM

    2006-08-29

    A method and machine-readable medium provide a technique to modify a hexahedral finite element volume mesh using dual generation and sheet insertion. After generating a dual of a volume stack (mesh), a predetermined algorithm may be followed to modify (refine) the volume mesh of hexahedral elements. The predetermined algorithm may include the steps of locating a sheet of hexahedral mesh elements, determining a plurality of hexahedral elements within the sheet to refine, shrinking the plurality of elements, and inserting a new sheet of hexahedral elements adjacently to modify the volume mesh. Additionally, another predetermined algorithm using mesh cutting may be followed to modify a volume mesh.

  7. “Agrolistic” transformation of plant cells: Integration of T-strands generated in planta

    PubMed Central

    Hansen, Geneviève; Chilton, Mary-Dell

    1996-01-01

    We describe a novel plant transformation technique, termed “agrolistic,” that combines the advantages of the Agrobacterium transformation system with the high efficiency of biolistic DNA delivery. Agrolistic transformation allows integration of the gene of interest without undesired vector sequence. The virulence genes virD1 and virD2 from Agrobacterium tumefaciens that are required in bacteria for excision of T-strands from the tumor-inducing plasmid were placed under the control of the CaMV35S promoter and codelivered with a target plasmid containing border sequences flanking the gene of interest. Transient expression assays in tobacco and in maize cells indicated that vir gene products caused strand-specific nicking in planta at the right border sequence, similar to VirD1/VirD2-catalyzed T-strand excision observed in Agrobacterium. Agrolistically transformed tobacco calli were obtained after codelivery of virD1 and virD2 genes together with a selectable marker flanked by border sequences. Some inserts exhibited right junctions with plant DNA that corresponded precisely to the sequence expected for T-DNA (portion of the tumor-inducing plasmid that is transferred to plant cells) insertion events. We designate these as “agrolistic” inserts, as distinguished from “biolistic” inserts. Both types of inserts were found in some transformed lines. The frequency of agrolistic inserts was 20% that of biolistic inserts. PMID:8962167

  8. Evolution of hypervirulence by a MRSA clone through acquisition of a transposable element

    PubMed Central

    Benson, Meredith A.; Ohneck, Elizabeth A.; Ryan, Chanelle; Alonzo, Francis; Smith, Hannah; Narechania, Apurva; Kolokotronis, Sergios-Orestis; Satola, Sarah W.; Uhlemann, Anne-Catrin; Sebra, Robert; Deikus, Gintaras; Shopsin, Bo; Planet, Paul J.; Torres, Victor J.

    2014-01-01

    SUMMARY Staphylococcus aureus has evolved as a pathogen that causes a range of diseases in humans. There are two dominant modes of evolution thought to explain most of the virulence differences between strains. First, virulence genes may be acquired from other organisms. Second, mutations may cause changes in the regulation and expression of genes. Here we describe an evolutionary event in which transposition of an IS element has a direct impact on virulence gene regulation resulting in hypervirulence. Whole genome analysis of a methicillin-resistant S. aureus (MRSA) strain USA500 revealed acquisition of a transposable element (IS256) that is absent from close relatives of this strain. Of the multiple copies of IS256 found in the USA500 genome, one was inserted in the promoter sequence of repressor of toxins (Rot), a master transcriptional regulator responsible for the expression of virulence factors in S. aureus. We show that insertion into the rot promoter by IS256 results in the derepression of cytotoxin expression and increased virulence. Taken together, this work provides new insight into evolutionary strategies by which S. aureus is able to modify its virulence properties and demonstrates a novel mechanism by which horizontal gene transfer directly impacts virulence through altering toxin regulation. PMID:24962815

  9. The structure of the coding and 5'-flanking region of the type 1 iodothyronine deiodinase (dio1) gene is normal in a patient with suspected congenital dio1 deficiency.

    PubMed

    Toyoda, N; Kleinhaus, N; Larsen, P R

    1996-06-01

    We analyzed the exon-intron structure of the human type 1 deiodinase gene (dio1) and compared it with that of a patient with suspected congenital type 1 deiodinase (D1) deficiency. The hdio1 gene is identical in exon-intron arrangement to the mouse gene, with coding sequences and a selenocysteine insertion sequence (SECIS) element contained in four exons. There were no mutations in the sequences of exons 1-4 of the patient's genomic DNA. Functional studies by transient expression techniques showed no difference in basal promoter activity or T3 responsiveness between the patient's and the normal dio1 gene. A structural abnormality in the dio1 gene is not a likely explanation for this patient's D1-deficient phenotype.

  10. Giardia telomeric sequence d(TAGGG)4 forms two intramolecular G-quadruplexes in K+ solution: effect of loop length and sequence on the folding topology.

    PubMed

    Hu, Lanying; Lim, Kah Wai; Bouaziz, Serge; Phan, Anh Tuân

    2009-11-25

    Recently, it has been shown that in K(+) solution the human telomeric sequence d[TAGGG(TTAGGG)(3)] forms a (3 + 1) intramolecular G-quadruplex, while the Bombyx mori telomeric sequence d[TAGG(TTAGG)(3)], which differs from the human counterpart only by one G deletion in each repeat, forms a chair-type intramolecular G-quadruplex, indicating an effect of G-tract length on the folding topology of G-quadruplexes. To explore the effect of loop length and sequence on the folding topology of G-quadruplexes, here we examine the structure of the four-repeat Giardia telomeric sequence d[TAGGG(TAGGG)(3)], which differs from the human counterpart only by one T deletion within the non-G linker in each repeat. We show by NMR that this sequence forms two different intramolecular G-quadruplexes in K(+) solution. The first one is a novel basket-type antiparallel-stranded G-quadruplex containing two G-tetrads, a G x (A-G) triad, and two A x T base pairs; the three loops are consecutively edgewise-diagonal-edgewise. The second one is a propeller-type parallel-stranded G-quadruplex involving three G-tetrads; the three loops are all double-chain-reversal. Recurrence of several structural elements in the observed structures suggests a "cut and paste" principle for the design and prediction of G-quadruplex topologies, for which different elements could be extracted from one G-quadruplex and inserted into another.

  11. A novel adaptive needle insertion sequencing for robotic, single needle MR-guided high-dose-rate prostate brachytherapy

    NASA Astrophysics Data System (ADS)

    Borot de Battisti, M.; de Senneville, B. Denis; Hautvast, G.; Binnekamp, D.; Lagendijk, J. J. W.; Maenhout, M.; Moerland, M. A.

    2017-05-01

    MR-guided high-dose-rate (HDR) brachytherapy has gained increasing interest as a treatment for patients with localized prostate cancer because of the superior value of MRI for tumor and surrounding tissues localization. To enable needle insertion into the prostate with the patient in the MR bore, a single needle MR-compatible robotic system involving needle-by-needle dose delivery has been developed at our institution. Throughout the intervention, dose delivery may be impaired by: (1) sub-optimal needle positioning caused by e.g. needle bending, (2) intra-operative internal organ motion such as prostate rotations or swelling, or intra-procedural rectum or bladder filling. This may result in failure to reach clinical constraints. To assess the first aforementioned challenge, a recent study from our research group demonstrated that the deposited dose may be greatly improved by real-time adaptive planning with feedback on the actual needle positioning. However, the needle insertion sequence is left to the doctor and therefore, this may result in sub-optimal dose delivery. In this manuscript, a new method is proposed to determine and update automatically the needle insertion sequence. This strategy is based on the determination of the most sensitive needle track. The sensitivity of a needle track is defined as its impact on the dose distribution in case of sub-optimal positioning. A stochastic criterion is thus presented to determine each needle track sensitivity based on needle insertion simulations. To assess the proposed sequencing strategy, HDR prostate brachytherapy was simulated on 11 patients with varying number of needle insertions. Sub-optimal needle positioning was simulated at each insertion (modeled by typical random angulation errors). In 91% of the scenarios, the dose distribution improved when the needle was inserted into the most compared to the least sensitive needle track. The computation time for sequencing was less than 6 s per needle track. The proposed needle insertion sequencing can therefore assist in delivering an optimal dose in HDR prostate brachytherapy.

  12. A Comprehensive Analysis of In Vitro and In Vivo Genetic Fitness of Pseudomonas aeruginosa Using High-Throughput Sequencing of Transposon Libraries

    PubMed Central

    Aschard, Hugues; Cattoir, Vincent; Yoder-Himes, Deborah; Lory, Stephen; Pier, Gerald B.

    2013-01-01

    High-throughput sequencing of transposon (Tn) libraries created within entire genomes identifies and quantifies the contribution of individual genes and operons to the fitness of organisms in different environments. We used insertion-sequencing (INSeq) to analyze the contribution to fitness of all non-essential genes in the chromosome of Pseudomonas aeruginosa strain PA14 based on a library of ∼300,000 individual Tn insertions. In vitro growth in LB provided a baseline for comparison with the survival of the Tn insertion strains following 6 days of colonization of the murine gastrointestinal tract as well as a comparison with Tn-inserts subsequently able to systemically disseminate to the spleen following induction of neutropenia. Sequencing was performed following DNA extraction from the recovered bacteria, digestion with the MmeI restriction enzyme that hydrolyzes DNA 16 bp away from the end of the Tn insert, and fractionation into oligonucleotides of 1,200–1,500 bp that were prepared for high-throughput sequencing. Changes in frequency of Tn inserts into the P. aeruginosa genome were used to quantify in vivo fitness resulting from loss of a gene. 636 genes had <10 sequencing reads in LB, thus defined as unable to grow in this medium. During in vivo infection there were major losses of strains with Tn inserts in almost all known virulence factors, as well as respiration, energy utilization, ion pumps, nutritional genes and prophages. Many new candidates for virulence factors were also identified. There were consistent changes in the recovery of Tn inserts in genes within most operons and Tn insertions into some genes enhanced in vivo fitness. Strikingly, 90% of the non-essential genes were required for in vivo survival following systemic dissemination during neutropenia. These experiments resulted in the identification of the P. aeruginosa strain PA14 genes necessary for optimal survival in the mucosal and systemic environments of a mammalian host. PMID:24039572

  13. Detection of Sleeping Beauty transposition in the genome of host cells by non-radioactive Southern blot analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aravalli, Rajagopal N., E-mail: aravalli@umn.edu; Park, Chang W.; Steer, Clifford J., E-mail: steer001@umn.edu

    The Sleeping Beauty transposon (SB-Tn) system is being used widely as a DNA vector for the delivery of therapeutic transgenes, as well as a tool for the insertional mutagenesis in animal models. In order to accurately assess the insertional potential and properties related to the integration of SB it is essential to determine the copy number of SB-Tn in the host genome. Recently developed SB100X transposase has demonstrated an integration rate that was much higher than the original SB10 and that of other versions of hyperactive SB transposases, such as HSB3 or HSB17. In this study, we have constructed amore » series of SB vectors carrying either a DsRed or a human β-globin transgene that was encompassed by cHS4 insulator elements, and containing the SB100X transposase gene outside the SB-Tn unit within the same vector in cis configuration. These SB-Tn constructs were introduced into the K-562 erythroid cell line, and their presence in the genomes of host cells was analyzed by Southern blot analysis using non-radioactive probes. Many copies of SB-Tn insertions were detected in host cells regardless of transgene sequences or the presence of cHS4 insulator elements. Interestingly, the size difference of 2.4 kb between insulated SB and non-insulated controls did not reflect the proportional difference in copy numbers of inserted SB-Tns. We then attempted methylation-sensitive Southern blots to assess the potential influence of cHS4 insulator elements on the epigenetic modification of SB-Tn. Our results indicated that SB100X was able to integrate at multiple sites with the number of SB-Tn copies larger than 6 kb in size. In addition, the non-radioactive Southern blot protocols developed here will be useful to detect integrated SB-Tn copies in any mammalian cell type.« less

  14. T-lex2: genotyping, frequency estimation and re-annotation of transposable elements using single or pooled next-generation sequencing data.

    PubMed

    Fiston-Lavier, Anna-Sophie; Barrón, Maite G; Petrov, Dmitri A; González, Josefa

    2015-02-27

    Transposable elements (TEs) constitute the most active, diverse and ancient component in a broad range of genomes. Complete understanding of genome function and evolution cannot be achieved without a thorough understanding of TE impact and biology. However, in-depth analysis of TEs still represents a challenge due to the repetitive nature of these genomic entities. In this work, we present a broadly applicable and flexible tool: T-lex2. T-lex2 is the only available software that allows routine, automatic and accurate genotyping of individual TE insertions and estimation of their population frequencies both using individual strain and pooled next-generation sequencing data. Furthermore, T-lex2 also assesses the quality of the calls allowing the identification of miss-annotated TEs and providing the necessary information to re-annotate them. The flexible and customizable design of T-lex2 allows running it in any genome and for any type of TE insertion. Here, we tested the fidelity of T-lex2 using the fly and human genomes. Overall, T-lex2 represents a significant improvement in our ability to analyze the contribution of TEs to genome function and evolution as well as learning about the biology of TEs. T-lex2 is freely available online at http://sourceforge.net/projects/tlex. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Use of the multipurpose transposon Tn KPK2 for the mutational analysis of chromosomal regions upstream and downstream of the sipF gene in Bradyrhizobium japonicum.

    PubMed

    Müller, P

    2004-04-01

    The DNA regions upstream and downstream of the Bradyrhizobium japonicum gene sipF were cloned by in vivo techniques and subsequently sequenced. In order to study the function of the predicted genes, a new transposon for in vitro mutagenesis, Tn KPK2, was constructed. This mutagenesis system has a number of advantages over other transposons. Tn KPK2 itself has no transposase gene, making transposition events stable. Extremely short inverted repeats minimize the length of the transposable element and facilitate the determination of the nucleotide sequence of the flanking regions. Since the transposable element carries a promoterless ' phoA reporter gene, the appearance of functional PhoA fusion proteins indicates that Tn KPK2 has inserted in a gene encoding a periplasmic or secreted protein. Although such events are extremely rare, because the transposon has to insert in-frame, in the correct orientation, and at an appropriate location in the target molecule, a direct screening procedure on agar indicator plates permits the identification of candidate clones from large numbers of colonies. In this study, Tn KPK2 was used for the construction of various symbiotic mutants of B. japonicum. One of the mutant strains, A2-10, which is defective in a gene encoding a protein that comigrates with bacterioferritin ( bcpB), was found to induce the formation of small and ineffective nodules.

  16. The insertional history of an active family of L1 retrotransposons in humans.

    PubMed

    Boissinot, Stéphane; Entezam, Ali; Young, Lynn; Munson, Peter J; Furano, Anthony V

    2004-07-01

    As humans contain a currently active L1 (LINE-1) non-LTR retrotransposon family (Ta-1), the human genome database likely provides only a partial picture of Ta-1-generated diversity. We used a non-biased method to clone Ta-1 retrotransposon-containing loci from representatives of four ethnic populations. We obtained 277 distinct Ta-1 loci and identified an additional 67 loci in the human genome database. This collection represents approximately 90% of the Ta-1 population in the individuals examined and is thus more representative of the insertional history of Ta-1 than the human genome database, which lacked approximately 40% of our cloned Ta-1 elements. As both polymorphic and fixed Ta-1 elements are as abundant in the GC-poor genomic regions as in ancestral L1 elements, the enrichment of L1 elements in GC-poor areas is likely due to insertional bias rather than selection. Although the chromosomal distribution of Ta-1 inserts is generally a function of chromosomal length and gene density, chromosome 4 significantly deviates from this pattern and has been much more hospitable to Ta-1 insertions than any other chromosome. Also, the intra-chromosomal distribution of Ta-1 elements is not uniform. Ta-1 elements tend to cluster, and the maximal gaps between Ta-1 inserts are larger than would be expected from a model of uniform random insertion. Copyright 2004 Cold Spring Harbor Laboratory Press ISSN

  17. Comparative evolution history of SINEs in Arabidopsis thaliana and Brassica oleracea: evidence for a high rate of SINE loss.

    PubMed

    Lenoir, A; Pélissier, T; Bousquet-Antonelli, C; Deragon, J M

    2005-01-01

    Brassica oleracea and Arabidopsis thaliana belong to the Brassicaceae(Cruciferae) family and diverged 16 to 19 million years ago. Although the genome size of B. oleracea (approximately 600 million base pairs) is more than four times that of A. thaliana (approximately 130 million base pairs), their gene content is believed to be very similar with more than 85% sequence identity in the coding region. Therefore, this important difference in genome size is likely to reflect a different rate of non-coding DNA accumulation. Transposable elements (TEs) constitute a major fraction of non-coding DNA in plant species. A different rate in TE accumulation between two closely related species can result in significant genome size variations in a short evolutionary period. Short interspersed elements (SINEs) are non-autonomous retroposons that have invaded the genome of most eukaryote species. Several SINE families are present in B. oleracea and A. thaliana and we found that two of them (called RathE1 and RathE2) are present in both species. In this study, the tempo of evolution of RathE1 and RathE2 SINE families in both species was compared. We observed that most B. oleracea RathE2 SINEs are "young" (close to the consensus sequence) and abundant while elements from this family are more degenerated and much less abundant in A. thaliana. However, the situation is different for the RathE1 SINE family for which the youngest elements are found in A. thaliana. Surprisingly, no SINE was found to occupy the same (orthologous) genomic locus in both species suggesting that either these SINE families were not amplified at a significant rate in the common ancestor of the two species or that older elements were lost and only the recent (lineage-specific) insertions remain. To test this latter hypothesis, loci containing a recently inserted SINE in the A. thaliana col-0 ecotype were selected and characterized in several other A. thaliana ecotypes. In addition to the expected SINE containing allele and the pre-integrative allele (i.e. the "empty" allele), we observed in the different ecotypes, alleles with truncated portions of the SINE (up to the complete loss of the element) and of the immediate genomic flanking sequences. The absence of SINEs in orthologous positions between B. oleracea and A. thaliana and the presence in recently diverged A. thaliana ecotypes of alleles containing severely truncated SINEs suggest a very high rate of SINE loss in these species.

  18. Active Site Sharing and Subterminal Hairpin Recognition in a New Class of DNA Transposases

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ronning, Donald R.; Guynet, Catherine; Ton-Hoang, Bao

    2010-07-20

    Many bacteria harbor simple transposable elements termed insertion sequences (IS). In Helicobacter pylori, the chimeric IS605 family elements are particularly interesting due to their proximity to genes encoding gastric epithelial invasion factors. Protein sequences of IS605 transposases do not bear the hallmarks of other well-characterized transposases. We have solved the crystal structure of full-length transposase (TnpA) of a representative member, ISHp608. Structurally, TnpA does not resemble any characterized transposase; rather, it is related to rolling circle replication (RCR) proteins. Consistent with RCR, Mg{sup 2+} and a conserved tyrosine, Tyr127, are essential for DNA nicking and the formation of a covalentmore » intermediate between TnpA and DNA. TnpA is dimeric, contains two shared active sites, and binds two DNA stem loops representing the conserved inverted repeats near each end of ISHp608. The cocrystal structure with stem-loop DNA illustrates how this family of transposases specifically recognizes and pairs ends, necessary steps during transposition.« less

  19. Exponential Megapriming PCR (EMP) Cloning—Seamless DNA Insertion into Any Target Plasmid without Sequence Constraints

    PubMed Central

    Ulrich, Alexander; Andersen, Kasper R.; Schwartz, Thomas U.

    2012-01-01

    We present a fast, reliable and inexpensive restriction-free cloning method for seamless DNA insertion into any plasmid without sequence limitation. Exponential megapriming PCR (EMP) cloning requires two consecutive PCR steps and can be carried out in one day. We show that EMP cloning has a higher efficiency than restriction-free (RF) cloning, especially for long inserts above 2.5 kb. EMP further enables simultaneous cloning of multiple inserts. PMID:23300917

  20. Exponential megapriming PCR (EMP) cloning--seamless DNA insertion into any target plasmid without sequence constraints.

    PubMed

    Ulrich, Alexander; Andersen, Kasper R; Schwartz, Thomas U

    2012-01-01

    We present a fast, reliable and inexpensive restriction-free cloning method for seamless DNA insertion into any plasmid without sequence limitation. Exponential megapriming PCR (EMP) cloning requires two consecutive PCR steps and can be carried out in one day. We show that EMP cloning has a higher efficiency than restriction-free (RF) cloning, especially for long inserts above 2.5 kb. EMP further enables simultaneous cloning of multiple inserts.

  1. Identifying transposon insertions and their effects from RNA-sequencing data.

    PubMed

    de Ruiter, Julian R; Kas, Sjors M; Schut, Eva; Adams, David J; Koudijs, Marco J; Wessels, Lodewyk F A; Jonkers, Jos

    2017-07-07

    Insertional mutagenesis using engineered transposons is a potent forward genetic screening technique used to identify cancer genes in mouse model systems. In the analysis of these screens, transposon insertion sites are typically identified by targeted DNA-sequencing and subsequently assigned to predicted target genes using heuristics. As such, these approaches provide no direct evidence that insertions actually affect their predicted targets or how transcripts of these genes are affected. To address this, we developed IM-Fusion, an approach that identifies insertion sites from gene-transposon fusions in standard single- and paired-end RNA-sequencing data. We demonstrate IM-Fusion on two separate transposon screens of 123 mammary tumors and 20 B-cell acute lymphoblastic leukemias, respectively. We show that IM-Fusion accurately identifies transposon insertions and their true target genes. Furthermore, by combining the identified insertion sites with expression quantification, we show that we can determine the effect of a transposon insertion on its target gene(s) and prioritize insertions that have a significant effect on expression. We expect that IM-Fusion will significantly enhance the accuracy of cancer gene discovery in forward genetic screens and provide initial insight into the biological effects of insertions on candidate cancer genes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. A rapid and cost-effective method for sequencing pooled cDNA clones by using a combination of transposon insertion and Gateway technology.

    PubMed

    Morozumi, Takeya; Toki, Daisuke; Eguchi-Ogawa, Tomoko; Uenishi, Hirohide

    2011-09-01

    Large-scale cDNA-sequencing projects require an efficient strategy for mass sequencing. Here we describe a method for sequencing pooled cDNA clones using a combination of transposon insertion and Gateway technology. Our method reduces the number of shotgun clones that are unsuitable for reconstruction of cDNA sequences, and has the advantage of reducing the total costs of the sequencing project.

  3. Exploitation of the diverse insertion sequence element content of dairy Lactobacillus helveticus starters as a rapid method to identify different strains.

    PubMed

    Kaleta, Pawel; Callanan, Michael J; O'Callaghan, John; Fitzgerald, Gerald F; Beresford, Thomas P; Ross, R Paul

    2009-10-01

    The species Lactobacillus helveticus is a commonly used thermophilic starter and/or adjunct culture for Swiss and Cheddar cheese manufacture. Its use is normally associated with flavour improvement which is known to be associated with culture traits such as rapid autolysis and high proteolytic activity. The genome of the commercial strain, DPC4571, was recently sequenced and found to have an abundance of IS sequences in terms of both abundance (213 intact) and diversity (21 types). Given this unique diversity for a lactic acid bacterium, we investigated whether PCR-based IS fingerprinting could be used as a discriminatory tool to distinguish between different strains of Lb. helveticus. A set of ten primers targeting five of the most numerous groups (ISL1201, ISLhe65, ISLhe2, ISLhe15 and ISL2) of IS elements was designed. Multiplex-PCR with all primers resulted in 1-12 discreet amplicons for each strain tested. The resultant fingerprints (in the 0.5 kb-3 kb range) were found to be strain specific and reproducible. This approach thus provides a valuable method to distinguish between Lb. helveticus strains while giving some indication of the relative abundance of IS sequences in each strain.

  4. High-Throughput Analysis of T-DNA Location and Structure Using Sequence Capture.

    PubMed

    Inagaki, Soichi; Henry, Isabelle M; Lieberman, Meric C; Comai, Luca

    2015-01-01

    Agrobacterium-mediated transformation of plants with T-DNA is used both to introduce transgenes and for mutagenesis. Conventional approaches used to identify the genomic location and the structure of the inserted T-DNA are laborious and high-throughput methods using next-generation sequencing are being developed to address these problems. Here, we present a cost-effective approach that uses sequence capture targeted to the T-DNA borders to select genomic DNA fragments containing T-DNA-genome junctions, followed by Illumina sequencing to determine the location and junction structure of T-DNA insertions. Multiple probes can be mixed so that transgenic lines transformed with different T-DNA types can be processed simultaneously, using a simple, index-based pooling approach. We also developed a simple bioinformatic tool to find sequence read pairs that span the junction between the genome and T-DNA or any foreign DNA. We analyzed 29 transgenic lines of Arabidopsis thaliana, each containing inserts from 4 different T-DNA vectors. We determined the location of T-DNA insertions in 22 lines, 4 of which carried multiple insertion sites. Additionally, our analysis uncovered a high frequency of unconventional and complex T-DNA insertions, highlighting the needs for high-throughput methods for T-DNA localization and structural characterization. Transgene insertion events have to be fully characterized prior to use as commercial products. Our method greatly facilitates the first step of this characterization of transgenic plants by providing an efficient screen for the selection of promising lines.

  5. New insights into Acinetobacter baumannii pathogenesis revealed by high-density pyrosequencing and transposon mutagenesis.

    PubMed

    Smith, Michael G; Gianoulis, Tara A; Pukatzki, Stefan; Mekalanos, John J; Ornston, L Nicholas; Gerstein, Mark; Snyder, Michael

    2007-03-01

    Acinetobacter baumannii has emerged as an important and problematic human pathogen as it is the causative agent of several types of infections including pneumonia, meningitis, septicemia, and urinary tract infections. We explored the pathogenic content of this harmful pathogen using a combination of DNA sequencing and insertional mutagenesis. The genome of this organism was sequenced using a strategy involving high-density pyrosequencing, a novel, rapid method of high-throughput sequencing. Excluding the rDNA repeats, the assembled genome is 3,976,746 base pairs (bp) and has 3830 ORFs. A significant fraction of ORFs (17.2%) are located in 28 putative alien islands, indicating that the genome has acquired a large amount of foreign DNA. Consistent with its role in pathogenesis, a remarkable number of the islands (16) contain genes implicated in virulence, indicating the organism devotes a considerable portion of its genes to pathogenesis. The largest island contains elements homologous to the Legionella/Coxiella Type IV secretion apparatus. Type IV secretion systems have been demonstrated to be important for virulence in other organisms and thus are likely to help mediate pathogenesis of A. baumannii. Insertional mutagenesis generated avirulent isolates of A. baumannii and verified that six of the islands contain virulence genes, including two novel islands containing genes that lacked homology with others in the databases. The DNA sequencing approach described in this study allows the rapid elucidation of the DNA sequence of any microbe and, when combined with genetic screens, can identify many novel genes important for microbial pathogenesis.

  6. Reduced Mutation Rate and Increased Transformability of Transposon-Free Acinetobacter baylyi ADP1-ISx

    PubMed Central

    Suárez, Gabriel A.; Renda, Brian A.; Dasgupta, Aurko

    2017-01-01

    ABSTRACT The genomes of most bacteria contain mobile DNA elements that can contribute to undesirable genetic instability in engineered cells. In particular, transposable insertion sequence (IS) elements can rapidly inactivate genes that are important for a designed function. We deleted all six copies of IS1236 from the genome of the naturally transformable bacterium Acinetobacter baylyi ADP1. The natural competence of ADP1 made it possible to rapidly repair deleterious point mutations that arose during strain construction. In the resulting ADP1-ISx strain, the rates of mutations inactivating a reporter gene were reduced by 7- to 21-fold. This reduction was higher than expected from the incidence of new IS1236 insertions found during a 300-day mutation accumulation experiment with wild-type ADP1 that was used to estimate spontaneous mutation rates in the strain. The extra improvement appears to be due in part to eliminating large deletions caused by IS1236 activity, as the point mutation rate was unchanged in ADP1-ISx. Deletion of an error-prone polymerase (dinP) and a DNA damage response regulator (umuDAb [the umuD gene of A. baylyi]) from the ADP1-ISx genome did not further reduce mutation rates. Surprisingly, ADP1-ISx exhibited increased transformability. This improvement may be due to less autolysis and aggregation of the engineered cells than of the wild type. Thus, deleting IS elements from the ADP1 genome led to a greater than expected increase in evolutionary reliability and unexpectedly enhanced other key strain properties, as has been observed for other clean-genome bacterial strains. ADP1-ISx is an improved chassis for metabolic engineering and other applications. IMPORTANCE Acinetobacter baylyi ADP1 has been proposed as a next-generation bacterial host for synthetic biology and genome engineering due to its ability to efficiently take up DNA from its environment during normal growth. We deleted transposable elements that are capable of copying themselves, inserting into other genes, and thereby inactivating them from the ADP1 genome. The resulting “clean-genome” ADP1-ISx strain exhibited larger reductions in the rates of inactivating mutations than expected from spontaneous mutation rates measured via whole-genome sequencing of lineages evolved under relaxed selection. Surprisingly, we also found that IS element activity reduces transformability and is a major cause of cell aggregation and death in wild-type ADP1 grown under normal laboratory conditions. More generally, our results demonstrate that domesticating a bacterial genome by removing mobile DNA elements that have accumulated during evolution in the wild can have unanticipated benefits. PMID:28667117

  7. Reduced Mutation Rate and Increased Transformability of Transposon-Free Acinetobacter baylyi ADP1-ISx.

    PubMed

    Suárez, Gabriel A; Renda, Brian A; Dasgupta, Aurko; Barrick, Jeffrey E

    2017-09-01

    The genomes of most bacteria contain mobile DNA elements that can contribute to undesirable genetic instability in engineered cells. In particular, transposable insertion sequence (IS) elements can rapidly inactivate genes that are important for a designed function. We deleted all six copies of IS 1236 from the genome of the naturally transformable bacterium Acinetobacter baylyi ADP1. The natural competence of ADP1 made it possible to rapidly repair deleterious point mutations that arose during strain construction. In the resulting ADP1-ISx strain, the rates of mutations inactivating a reporter gene were reduced by 7- to 21-fold. This reduction was higher than expected from the incidence of new IS 1236 insertions found during a 300-day mutation accumulation experiment with wild-type ADP1 that was used to estimate spontaneous mutation rates in the strain. The extra improvement appears to be due in part to eliminating large deletions caused by IS 1236 activity, as the point mutation rate was unchanged in ADP1-ISx. Deletion of an error-prone polymerase ( dinP ) and a DNA damage response regulator ( umuD Ab [the umuD gene of A. baylyi ]) from the ADP1-ISx genome did not further reduce mutation rates. Surprisingly, ADP1-ISx exhibited increased transformability. This improvement may be due to less autolysis and aggregation of the engineered cells than of the wild type. Thus, deleting IS elements from the ADP1 genome led to a greater than expected increase in evolutionary reliability and unexpectedly enhanced other key strain properties, as has been observed for other clean-genome bacterial strains. ADP1-ISx is an improved chassis for metabolic engineering and other applications. IMPORTANCE Acinetobacter baylyi ADP1 has been proposed as a next-generation bacterial host for synthetic biology and genome engineering due to its ability to efficiently take up DNA from its environment during normal growth. We deleted transposable elements that are capable of copying themselves, inserting into other genes, and thereby inactivating them from the ADP1 genome. The resulting "clean-genome" ADP1-ISx strain exhibited larger reductions in the rates of inactivating mutations than expected from spontaneous mutation rates measured via whole-genome sequencing of lineages evolved under relaxed selection. Surprisingly, we also found that IS element activity reduces transformability and is a major cause of cell aggregation and death in wild-type ADP1 grown under normal laboratory conditions. More generally, our results demonstrate that domesticating a bacterial genome by removing mobile DNA elements that have accumulated during evolution in the wild can have unanticipated benefits. Copyright © 2017 American Society for Microbiology.

  8. A new molecular evolution model for limited insertion independent of substitution.

    PubMed

    Lèbre, Sophie; Michel, Christian J

    2013-10-01

    We recently introduced a new molecular evolution model called the IDIS model for Insertion Deletion Independent of Substitution [13,14]. In the IDIS model, the three independent processes of substitution, insertion and deletion of residues have constant rates. In order to control the genome expansion during evolution, we generalize here the IDIS model by introducing an insertion rate which decreases when the sequence grows and tends to 0 for a maximum sequence length nmax. This new model, called LIIS for Limited Insertion Independent of Substitution, defines a matrix differential equation satisfied by a vector P(t) describing the sequence content in each residue at evolution time t. An analytical solution is obtained for any diagonalizable substitution matrix M. Thus, the LIIS model gives an expression of the sequence content vector P(t) in each residue under evolution time t as a function of the eigenvalues and the eigenvectors of matrix M, the residue insertion rate vector R, the total insertion rate r, the initial and maximum sequence lengths n0 and nmax, respectively, and the sequence content vector P(t0) at initial time t0. The derivation of the analytical solution is much more technical, compared to the IDIS model, as it involves Gauss hypergeometric functions. Several propositions of the LIIS model are derived: proof that the IDIS model is a particular case of the LIIS model when the maximum sequence length nmax tends to infinity, fixed point, time scale, time step and time inversion. Using a relation between the sequence length l and the evolution time t, an expression of the LIIS model as a function of the sequence length l=n(t) is obtained. Formulas for 'insertion only', i.e. when the substitution rates are all equal to 0, are derived at evolution time t and sequence length l. Analytical solutions of the LIIS model are explicitly derived, as a function of either evolution time t or sequence length l, for two classical substitution matrices: the 3-parameter symmetric substitution matrix [12] (LIIS-SYM3) and the HKY asymmetric substitution matrix[9] (LIIS-HKY). An evaluation of the LIIS model (precisely, LIIS-HKY) based on four statistical analyses of the GC content in complete genomes of four prokaryotic taxonomic groups, namely Chlamydiae, Crenarchaeota, Spirochaetes and Thermotogae, shows the expected improvement from the theory of the LIIS model compared to the IDIS model. Copyright © 2013 Elsevier Inc. All rights reserved.

  9. Impact of Insertion Sequences and Recombination on the Population Structure of Staphylococcus haemolyticus.

    PubMed

    Bouchami, Ons; de Lencastre, Herminia; Miragaia, Maria

    2016-01-01

    Staphylococcus haemolyticus is one of the most common pathogens associated with medical-device related infections, but its molecular epidemiology is poorly explored. In the current study, we aimed to better understand the genetic mechanisms contributing to S. haemolyticus diversity in the hospital environment and their impact on the population structure and clinical relevant phenotypic traits. The analysis of a representative S. haemolyticus collection by multilocus sequence typing (MLST) has identified a single highly prevalent and diverse genetic lineage of nosocomial S. haemolyticus clonal complex (CC) 29 accounting for 91% of the collection of isolates disseminated worldwide. The examination of the sequence changes at MLST loci during clonal diversification showed that recombination had a higher impact than mutation in shaping the S. haemolyticus population. Also, we ascertained that another mechanism contributing significantly to clonal diversification and adaptation was mediated by insertion sequence (IS) elements. We found that all nosocomial S. haemolyticus, belonging to different STs, were rich in IS1272 copies, as determined by Southern hybridization of macrorestriction patterns. In particular, we observed that the chromosome of a S. haemolyticus strain within CC29 was highly unstable during serial growth in vitro which paralleled with IS1272 transposition events and changes in clinically relevant phenotypic traits namely, mannitol fermentation, susceptibility to beta-lactams, biofilm formation and hemolysis. Our results suggest that recombination and IS transposition might be a strategy of adaptation, evolution and pathogenicity of the major S. haemolyticus prevalent lineage in the hospital environment.

  10. Impact of Insertion Sequences and Recombination on the Population Structure of Staphylococcus haemolyticus

    PubMed Central

    Bouchami, Ons; de Lencastre, Herminia; Miragaia, Maria

    2016-01-01

    Staphylococcus haemolyticus is one of the most common pathogens associated with medical-device related infections, but its molecular epidemiology is poorly explored. In the current study, we aimed to better understand the genetic mechanisms contributing to S. haemolyticus diversity in the hospital environment and their impact on the population structure and clinical relevant phenotypic traits. The analysis of a representative S. haemolyticus collection by multilocus sequence typing (MLST) has identified a single highly prevalent and diverse genetic lineage of nosocomial S. haemolyticus clonal complex (CC) 29 accounting for 91% of the collection of isolates disseminated worldwide. The examination of the sequence changes at MLST loci during clonal diversification showed that recombination had a higher impact than mutation in shaping the S. haemolyticus population. Also, we ascertained that another mechanism contributing significantly to clonal diversification and adaptation was mediated by insertion sequence (IS) elements. We found that all nosocomial S. haemolyticus, belonging to different STs, were rich in IS1272 copies, as determined by Southern hybridization of macrorestriction patterns. In particular, we observed that the chromosome of a S. haemolyticus strain within CC29 was highly unstable during serial growth in vitro which paralleled with IS1272 transposition events and changes in clinically relevant phenotypic traits namely, mannitol fermentation, susceptibility to beta-lactams, biofilm formation and hemolysis. Our results suggest that recombination and IS transposition might be a strategy of adaptation, evolution and pathogenicity of the major S. haemolyticus prevalent lineage in the hospital environment. PMID:27249649

  11. Natural mutagenesis of human genomes by endogenous retrotransposons.

    PubMed

    Iskow, Rebecca C; McCabe, Michael T; Mills, Ryan E; Torene, Spencer; Pittard, W Stephen; Neuwald, Andrew F; Van Meir, Erwin G; Vertino, Paula M; Devine, Scott E

    2010-06-25

    Two abundant classes of mobile elements, namely Alu and L1 elements, continue to generate new retrotransposon insertions in human genomes. Estimates suggest that these elements have generated millions of new germline insertions in individual human genomes worldwide. Unfortunately, current technologies are not capable of detecting most of these young insertions, and the true extent of germline mutagenesis by endogenous human retrotransposons has been difficult to examine. Here, we describe technologies for detecting these young retrotransposon insertions and demonstrate that such insertions indeed are abundant in human populations. We also found that new somatic L1 insertions occur at high frequencies in human lung cancer genomes. Genome-wide analysis suggests that altered DNA methylation may be responsible for the high levels of L1 mobilization observed in these tumors. Our data indicate that transposon-mediated mutagenesis is extensive in human genomes and is likely to have a major impact on human biology and diseases.

  12. A Deep-Coverage Tomato BAC Library and Prospects Toward Development of an STC Framework for Genome Sequencing

    PubMed Central

    Budiman, Muhammad A.; Mao, Long; Wood, Todd C.; Wing, Rod A.

    2000-01-01

    Recently a new strategy using BAC end sequences as sequence-tagged connectors (STCs) was proposed for whole-genome sequencing projects. In this study, we present the construction and detailed characterization of a 15.0 haploid genome equivalent BAC library for the cultivated tomato, Lycopersicon esculentum cv. Heinz 1706. The library contains 129,024 clones with an average insert size of 117.5 kb and a chloroplast content of 1.11%. BAC end sequences from 1490 ends were generated and analyzed as a preliminary evaluation for using this library to develop an STC framework to sequence the tomato genome. A total of 1205 BAC end sequences (80.9%) were obtained, with an average length of 360 high-quality bases, and were searched against the GenBank database. Using a cutoff expectation value of <10−6, and combining the results from BLASTN, BLASTX, and TBLASTX searches, 24.3% of the BAC end sequences were similar to known sequences, of which almost half (48.7%) share sequence similarities to retrotransposons and 7% to known genes. Some of the transposable element sequences were the first reported in tomato, such as sequences similar to maize transposon Activator (Ac) ORF and tobacco pararetrovirus-like sequences. Interestingly, there were no BAC end sequences similar to the highly repeated TGRI and TGRII elements. However, the majority (70.3%) of STCs did not share significant sequence similarities to any sequences in GenBank at either the DNA or predicted protein levels, indicating that a large portion of the tomato genome is still unknown. Our data demonstrate that this BAC library is suitable for developing an STC database to sequence the tomato genome. The advantages of developing an STC framework for whole-genome sequencing of tomato are discussed. [The BAC end sequences described in this paper have been deposited in the GenBank data library under accession nos. AQ367111–AQ368361.] PMID:10645957

  13. Tungsten wire/FeCrAlY matrix turbine blade fabrication study

    NASA Technical Reports Server (NTRS)

    Melnyk, P.; Fleck, J. N.

    1979-01-01

    The objective was to establish a viable FRS monotape technology base to fabricate a complex, advanced turbine blade. All elements of monotape fabrication were addressed. A new process for incorporation of the matrix, including bi-alloy matrices, was developed. Bonding, cleaning, cutting, sizing, and forming parameters were established. These monotapes were then used to fabricate a 48 ply solid JT9D-7F 1st stage turbine blade. Core technology was then developed and first a 12 ply and then a 7 ply shell hollow airfoil was fabricated. As the fabrication technology advanced, additional airfoils incorporated further elements of sophistication, by introducing in sequence bonded root blocks, cross-plying, bi-metallic matrix, tip cap, trailing edge slots, and impingement inserts.

  14. The genomic substrate for adaptive radiation in African cichlid fish.

    PubMed

    Brawand, David; Wagner, Catherine E; Li, Yang I; Malinsky, Milan; Keller, Irene; Fan, Shaohua; Simakov, Oleg; Ng, Alvin Y; Lim, Zhi Wei; Bezault, Etienne; Turner-Maier, Jason; Johnson, Jeremy; Alcazar, Rosa; Noh, Hyun Ji; Russell, Pamela; Aken, Bronwen; Alföldi, Jessica; Amemiya, Chris; Azzouzi, Naoual; Baroiller, Jean-François; Barloy-Hubler, Frederique; Berlin, Aaron; Bloomquist, Ryan; Carleton, Karen L; Conte, Matthew A; D'Cotta, Helena; Eshel, Orly; Gaffney, Leslie; Galibert, Francis; Gante, Hugo F; Gnerre, Sante; Greuter, Lucie; Guyon, Richard; Haddad, Natalie S; Haerty, Wilfried; Harris, Rayna M; Hofmann, Hans A; Hourlier, Thibaut; Hulata, Gideon; Jaffe, David B; Lara, Marcia; Lee, Alison P; MacCallum, Iain; Mwaiko, Salome; Nikaido, Masato; Nishihara, Hidenori; Ozouf-Costaz, Catherine; Penman, David J; Przybylski, Dariusz; Rakotomanga, Michaelle; Renn, Suzy C P; Ribeiro, Filipe J; Ron, Micha; Salzburger, Walter; Sanchez-Pulido, Luis; Santos, M Emilia; Searle, Steve; Sharpe, Ted; Swofford, Ross; Tan, Frederick J; Williams, Louise; Young, Sarah; Yin, Shuangye; Okada, Norihiro; Kocher, Thomas D; Miska, Eric A; Lander, Eric S; Venkatesh, Byrappa; Fernald, Russell D; Meyer, Axel; Ponting, Chris P; Streelman, J Todd; Lindblad-Toh, Kerstin; Seehausen, Ole; Di Palma, Federica

    2014-09-18

    Cichlid fishes are famous for large, diverse and replicated adaptive radiations in the Great Lakes of East Africa. To understand the molecular mechanisms underlying cichlid phenotypic diversity, we sequenced the genomes and transcriptomes of five lineages of African cichlids: the Nile tilapia (Oreochromis niloticus), an ancestral lineage with low diversity; and four members of the East African lineage: Neolamprologus brichardi/pulcher (older radiation, Lake Tanganyika), Metriaclima zebra (recent radiation, Lake Malawi), Pundamilia nyererei (very recent radiation, Lake Victoria), and Astatotilapia burtoni (riverine species around Lake Tanganyika). We found an excess of gene duplications in the East African lineage compared to tilapia and other teleosts, an abundance of non-coding element divergence, accelerated coding sequence evolution, expression divergence associated with transposable element insertions, and regulation by novel microRNAs. In addition, we analysed sequence data from sixty individuals representing six closely related species from Lake Victoria, and show genome-wide diversifying selection on coding and regulatory variants, some of which were recruited from ancient polymorphisms. We conclude that a number of molecular mechanisms shaped East African cichlid genomes, and that amassing of standing variation during periods of relaxed purifying selection may have been important in facilitating subsequent evolutionary diversification.

  15. The genomic substrate for adaptive radiation in African cichlid fish

    PubMed Central

    Malinsky, Milan; Keller, Irene; Fan, Shaohua; Simakov, Oleg; Ng, Alvin Y.; Lim, Zhi Wei; Bezault, Etienne; Turner-Maier, Jason; Johnson, Jeremy; Alcazar, Rosa; Noh, Hyun Ji; Russell, Pamela; Aken, Bronwen; Alföldi, Jessica; Amemiya, Chris; Azzouzi, Naoual; Baroiller, Jean-François; Barloy-Hubler, Frederique; Berlin, Aaron; Bloomquist, Ryan; Carleton, Karen L.; Conte, Matthew A.; D'Cotta, Helena; Eshel, Orly; Gaffney, Leslie; Galibert, Francis; Gante, Hugo F.; Gnerre, Sante; Greuter, Lucie; Guyon, Richard; Haddad, Natalie S.; Haerty, Wilfried; Harris, Rayna M.; Hofmann, Hans A.; Hourlier, Thibaut; Hulata, Gideon; Jaffe, David B.; Lara, Marcia; Lee, Alison P.; MacCallum, Iain; Mwaiko, Salome; Nikaido, Masato; Nishihara, Hidenori; Ozouf-Costaz, Catherine; Penman, David J.; Przybylski, Dariusz; Rakotomanga, Michaelle; Renn, Suzy C. P.; Ribeiro, Filipe J.; Ron, Micha; Salzburger, Walter; Sanchez-Pulido, Luis; Santos, M. Emilia; Searle, Steve; Sharpe, Ted; Swofford, Ross; Tan, Frederick J.; Williams, Louise; Young, Sarah; Yin, Shuangye; Okada, Norihiro; Kocher, Thomas D.; Miska, Eric A.; Lander, Eric S.; Venkatesh, Byrappa; Fernald, Russell D.; Meyer, Axel; Ponting, Chris P.; Streelman, J. Todd; Lindblad-Toh, Kerstin; Seehausen, Ole; Di Palma, Federica

    2015-01-01

    Cichlid fishes are famous for large, diverse and replicated adaptive radiations in the Great Lakes of East Africa. To understand the molecular mechanisms underlying cichlid phenotypic diversity, we sequenced the genomes and transcriptomes of five lineages of African cichlids: the Nile tilapia (Oreochromis niloticus), an ancestral lineage with low diversity; and four members of the East African lineage: Neolamprologus brichardi/pulcher (older radiation, Lake Tanganyika), Metriaclima zebra (recent radiation, Lake Malawi), Pundamilia nyererei (very recent radiation, Lake Victoria), and Astatotilapia burtoni (riverine species around Lake Tanganyika). We found an excess of gene duplications in the East African lineage compared to tilapia and other teleosts, an abundance of non-coding element divergence, accelerated coding sequence evolution, expression divergence associated with transposable element insertions, and regulation by novel microRNAs. In addition, we analysed sequence data from sixty individuals representing six closely related species from Lake Victoria, and show genome-wide diversifying selection on coding and regulatory variants, some of which were recruited from ancient polymorphisms. We conclude that a number of molecular mechanisms shaped East African cichlid genomes, and that amassing of standing variation during periods of relaxed purifying selection may have been important in facilitating subsequent evolutionary diversification. PMID:25186727

  16. Alu Sb2 subfamily is present in all higher primates but was most succesfully amplified in humans

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Richer, C.; Zietkiewicz, E.; Labuda, D.

    Alu repeats can be classified into subfamilies which amplified in primate genomes at different evolutionary time periods. A young Alu subfamily, Sb2, with a characteristic 7-nucleotide duplication at position 256, has been described in seven human loci. An Sb2 insertion found near the HD gene was unique to two HD families, indicating that Sb2 was still retropositionally active. Here, we have shown that the Sb2 insertion in the CHOL locus was similarly rare, being absent in 120 individuals of Caucasian, Oriental and Black origin. In contrast, Sb2 inserts in five other loci were found fixed (non-polymorphic), based on measurements inmore » the same population sample, but absent from orthologous positions in higher apes. This suggest that Sb2 repeats spread relatively early in the human lineage following divergence from other primates and that these elements may be human-specific. By quantitative PCR, we investigated the presence of Sb2 sequences in different primate DNA, using one PCR primer anchored at the 5{prime} Alu-end and the other complementary to the duplicated Sb2-specific segment. With an Sb2-containing plasmid as a standard, we estimated the number of Sb2 repeats at 1500-1800 copies per human haploid equivalent; corresponding numbers in chimpanzee and gorilla were almost two orders of magnitude lower, while the signal observed in orangutan and gibbon DNAs was consistent with the presence of a single copy. The analysis of 22 human, 11 chimpanzee and 10 gorilla sequences indicates that the Alu Sb2 dispersed independently in these three primate lineages; gorilla consensus differs from the human Sb2 sequence by one position, while all chimpanzee repeats have their linker expanded by up to eight A-residues. Should they be thus considered as separate subfamilies? It is possible that sequence modifications with respect to the human consensus are responsible for poor retroposition of Sb2 in apes.« less

  17. Visual attention distracter insertion for improved EEG rapid serial visual presentation (RSVP) target stimuli detection

    NASA Astrophysics Data System (ADS)

    Khosla, Deepak; Huber, David J.; Martin, Kevin

    2017-05-01

    This paper† describes a technique in which we improve upon the prior performance of the Rapid Serial Visual Presentation (RSVP) EEG paradigm for image classification though the insertion of visual attention distracters and overall sequence reordering based upon the expected ratio of rare to common "events" in the environment and operational context. Inserting distracter images maintains the ratio of common events to rare events at an ideal level, maximizing the rare event detection via P300 EEG response to the RSVP stimuli. The method has two steps: first, we compute the optimal number of distracters needed for an RSVP stimuli based on the desired sequence length and expected number of targets and insert the distracters into the RSVP sequence, and then we reorder the RSVP sequence to maximize P300 detection. We show that by reducing the ratio of target events to nontarget events using this method, we can allow RSVP sequences with more targets without sacrificing area under the ROC curve (azimuth).

  18. Designing deep sequencing experiments: detecting structural variation and estimating transcript abundance.

    PubMed

    Bashir, Ali; Bansal, Vikas; Bafna, Vineet

    2010-06-18

    Massively parallel DNA sequencing technologies have enabled the sequencing of several individual human genomes. These technologies are also being used in novel ways for mRNA expression profiling, genome-wide discovery of transcription-factor binding sites, small RNA discovery, etc. The multitude of sequencing platforms, each with their unique characteristics, pose a number of design challenges, regarding the technology to be used and the depth of sequencing required for a particular sequencing application. Here we describe a number of analytical and empirical results to address design questions for two applications: detection of structural variations from paired-end sequencing and estimating mRNA transcript abundance. For structural variation, our results provide explicit trade-offs between the detection and resolution of rearrangement breakpoints, and the optimal mix of paired-read insert lengths. Specifically, we prove that optimal detection and resolution of breakpoints is achieved using a mix of exactly two insert library lengths. Furthermore, we derive explicit formulae to determine these insert length combinations, enabling a 15% improvement in breakpoint detection at the same experimental cost. On empirical short read data, these predictions show good concordance with Illumina 200 bp and 2 Kbp insert length libraries. For transcriptome sequencing, we determine the sequencing depth needed to detect rare transcripts from a small pilot study. With only 1 Million reads, we derive corrections that enable almost perfect prediction of the underlying expression probability distribution, and use this to predict the sequencing depth required to detect low expressed genes with greater than 95% probability. Together, our results form a generic framework for many design considerations related to high-throughput sequencing. We provide software tools http://bix.ucsd.edu/projects/NGS-DesignTools to derive platform independent guidelines for designing sequencing experiments (amount of sequencing, choice of insert length, mix of libraries) for novel applications of next generation sequencing.

  19. Identification of a recently active Prunus-specific non-autonomous Mutator element with considerable genome shaping force.

    PubMed

    Halász, Júlia; Kodad, Ossama; Hegedűs, Attila

    2014-07-01

    Miniature inverted-repeat transposable elements (MITEs) are known to contribute to the evolution of plants, but only limited information is available for MITEs in the Prunus genome. We identified a MITE that has been named Falling Stones, FaSt. All structural features (349-bp size, 82-bp terminal inverted repeats and 9-bp target site duplications) are consistent with this MITE being a putative member of the Mutator transposase superfamily. FaSt showed a preferential accumulation in the short AT-rich segments of the euchromatin region of the peach genome. DNA sequencing and pollination experiments have been performed to confirm that the nested insertion of FaSt into the S-haplotype-specific F-box gene of apricot resulted in the breakdown of self-incompatibility (SI). A bioinformatics-based survey of the known Rosaceae and other genomes and a newly designed polymerase chain reaction (PCR) assay verified the Prunoideae-specific occurrence of FaSt elements. Phylogenetic analysis suggested a recent activity of FaSt in the Prunus genome. The occurrence of a nested insertion in the apricot genome further supports the recent activity of FaSt in response to abiotic stress conditions. This study reports on a presumably active non-autonomous Mutator element in Prunus that exhibits a major indirect genome shaping force through inducing loss-of-function mutation in the SI locus. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.

  20. In vivo blunt-end cloning through CRISPR/Cas9-facilitated non-homologous end-joining

    PubMed Central

    Geisinger, Jonathan M.; Turan, Sören; Hernandez, Sophia; Spector, Laura P.; Calos, Michele P.

    2016-01-01

    The CRISPR/Cas9 system facilitates precise DNA modifications by generating RNA-guided blunt-ended double-strand breaks. We demonstrate that guide RNA pairs generate deletions that are repaired with a high level of precision by non-homologous end-joining in mammalian cells. We present a method called knock-in blunt ligation for exploiting these breaks to insert exogenous PCR-generated sequences in a homology-independent manner without loss of additional nucleotides. This method is useful for making precise additions to the genome such as insertions of marker gene cassettes or functional elements, without the need for homology arms. We successfully utilized this method in human and mouse cells to insert fluorescent protein cassettes into various loci, with efficiencies up to 36% in HEK293 cells without selection. We also created versions of Cas9 fused to the FKBP12-L106P destabilization domain in an effort to improve Cas9 performance. Our in vivo blunt-end cloning method and destabilization-domain-fused Cas9 variant increase the repertoire of precision genome engineering approaches. PMID:26762978

  1. Insertional engineering of chromosomes with Sleeping Beauty transposition: an overview.

    PubMed

    Grabundzija, Ivana; Izsvák, Zsuzsanna; Ivics, Zoltán

    2011-01-01

    Novel genetic tools and mutagenesis strategies based on the Sleeping Beauty (SB) transposable element are currently under development with a vision to link primary DNA sequence information to gene functions in vertebrate models. By virtue of its inherent capacity to insert into DNA, the SB transposon can be developed into powerful tools for chromosomal manipulations. Mutagenesis screens based on SB have numerous advantages including high throughput and easy identification of mutated alleles. Forward genetic approaches based on insertional mutagenesis by engineered SB transposons have the advantage of providing insight into genetic networks and pathways based on phenotype. Indeed, the SB transposon has become a highly instrumental tool to induce tumors in experimental animals in a tissue-specific -manner with the aim of uncovering the genetic basis of diverse cancers. Here, we describe a battery of mutagenic cassettes that can be applied in conjunction with SB transposon vectors to mutagenize genes, and highlight versatile experimental strategies for the generation of engineered chromosomes for loss-of-function as well as gain-of-function mutagenesis for functional gene annotation in vertebrate models.

  2. Prevalence of Ambler class A β-lactamases and ampC expression in cephalosporin-resistant isolates of Acinetobacter baumannii.

    PubMed

    Rezaee, Mohammad Ahangarzadeh; Pajand, Omid; Nahaei, Mohammad Reza; Mahdian, Reza; Aghazadeh, Mohammad; Ghojazadeh, Morteza; Hojabri, Zoya

    2013-07-01

    We examined the prevalence of various cephalosporins' resistance mechanisms in Acinetobacter baumannii clinical isolates. Phenotypic and molecular detection of Ambler classes A, B and D β-lactamases was performed on 75 isolates. Clonal relatedness was defined using Repetitive Extragenic Palindromic PCR. PCR mapping was used to examine the linkage of insertion sequences and the ampC gene, and ampC expression was analyzed by TaqMan reverse transcriptase-PCR. Twenty-six (37%) isolates carried at least one of the blaPER-1 or blaTEM-1. Sixty-nine (98.5%) out of 70 cephalosporin-resistant isolates had insertions upstream of the ampC gene, of which 48 (69%) and 6 (8%) were identified as ISAba1and ISAba125, respectively. Higher level of expression was obtained in resistant isolates lacking ISAba1/ampC combination in comparison with that in positive ones. The ability to up-regulate the expression of ampC gene in association with different insertion elements has become an important factor in A. baumannii resistance to cephalosporins. Copyright © 2013 Elsevier Inc. All rights reserved.

  3. Permanent Neonatal Diabetes Caused by Creation of an Ectopic Splice Site within the INS Gene

    PubMed Central

    Gastaldo, Elena; Harries, Lorna W.; Rubio-Cabezas, Oscar; Castaño, Luis

    2012-01-01

    Background The aim of this study was to characterize the genetic etiology in a patient who presented with permanent neonatal diabetes at 2 months of age. Methodology/Principal Findings Regulatory elements and coding exons 2 and 3 of the INS gene were amplified and sequenced from genomic and complementary DNA samples. A novel heterozygous INS mutation within the terminal intron of the gene was identified in the proband and her affected father. This mutation introduces an ectopic splice site leading to the insertion of 29 nucleotides from the intronic sequence into the mature mRNA, which results in a longer and abnormal transcript. Conclusions/Significance This study highlights the importance of routinely sequencing the exon-intron boundaries and the need to carry out additional studies to confirm the pathogenicity of any identified intronic genetic variants. PMID:22235272

  4. Separation of endogenous viral elements from infectious Penaeus stylirostris densovirus using recombinase polymerase amplification.

    PubMed

    Jaroenram, Wansadaj; Owens, Leigh

    2014-01-01

    Non-infectious Penaeus stylirostris densovirus (PstDV)-related sequences in the shrimp genome cause false positive results with current PCR protocols. Here, we examined and mapped PstDV insertion profile in the genome of Australian Penaeus monodon. A DNA sequence which is likely to represent infectious PstDV was also identified and used as a target sequence for recombinase polymerase amplification (RPA)-based approach, developed for specifically detecting PstDV. The RPA protocol at 37 °C for 30 min showed no cross-reaction with other shrimp viruses, and was 10 times more sensitive than the 309F/R PCR protocol currently recommended by the World Organization for Animal Health (OIE) for PstDV diagnosis. These features, together with the simplicity of the protocol, requiring only a heating block for the reaction, offer opportunities for rapid and efficient detection of PstDV. Copyright © 2014 Elsevier Ltd. All rights reserved.

  5. Many P-Element Insertions Affect Wing Shape in Drosophila melanogaster

    PubMed Central

    Weber, Kenneth; Johnson, Nancy; Champlin, David; Patty, April

    2005-01-01

    A screen of random, autosomal, homozygous-viable P-element insertions in D. melanogaster found small effects on wing shape in 11 of 50 lines. The effects were due to single insertions and remained stable and significant for over 5 years, in repeated, high-resolution measurements. All 11 insertions were within or near protein-coding transcription units, none of which were previously known to affect wing shape. Many sites in the genome can affect wing shape. PMID:15545659

  6. Many P-element insertions affect wing shape in Drosophila melanogaster.

    PubMed

    Weber, Kenneth; Johnson, Nancy; Champlin, David; Patty, April

    2005-03-01

    A screen of random, autosomal, homozygous-viable P-element insertions in D. melanogaster found small effects on wing shape in 11 of 50 lines. The effects were due to single insertions and remained stable and significant for over 5 years, in repeated, high-resolution measurements. All 11 insertions were within or near protein-coding transcription units, none of which were previously known to affect wing shape. Many sites in the genome can affect wing shape.

  7. Altered Viral Replication and Cell Responses by Inserting MicroRNA Recognition Element into PB1 in Pandemic Influenza A Virus (H1N1) 2009

    PubMed Central

    Shen, Xiaoyue; Sun, Wenkui; Shi, Yi; Xing, Zheng; Su, Xin

    2015-01-01

    Objective. MicroRNAs (miRNAs) are endogenous noncoding RNAs that spatiotemporally modulate mRNAs in a posttranscriptional manner. Engineering mutant viruses by inserting cell-specific miRNA recognition element (MRE) into viral genome may alter viral infectivity and host responses in vital tissues and organs infected with pandemic influenza A virus (H1N1) 2009 (H1N1pdm). Methods. In this study, we employed reverse genetics approach to generate a recombinant H1N1pdm with a cell-specific miRNA target sequence inserted into its PB1 genomic segment to investigate whether miRNAs are able to suppress H1N1pdm replication. We inserted an MRE of microRNA-let-7b (miR-let-7b) into the open reading frame of PB1 to test the feasibility of creating a cell-restricted H1N1pdm virus since let-7b is abundant in human bronchial epithelial cells. Results. miR-let-7b is rich in human bronchial epithelial cells (HBE). Incorporation of the miR-let-7b-MRE confers upon the recombinant H1N1pdm virus susceptibility to miR-let-7b targeting, suggesting that the H1N1pdm and influenza A viruses can be engineered to exert the desired replication restrictive effect and decrease infectivity in vital tissues and organs. Conclusions. This approach provides an additional layer of biosafety and thus has great potential for the application in the rational development of safer and more effective influenza viral vaccines. PMID:25788763

  8. Identification and characterization of a selenoprotein family containing a diselenide bond in a redox motif

    PubMed Central

    Shchedrina, Valentina A.; Novoselov, Sergey V.; Malinouski, Mikalai Yu.; Gladyshev, Vadim N.

    2007-01-01

    Selenocysteine (Sec, U) insertion into proteins is directed by translational recoding of specific UGA codons located upstream of a stem-loop structure known as Sec insertion sequence (SECIS) element. Selenoproteins with known functions are oxidoreductases containing a single redox-active Sec in their active sites. In this work, we identified a family of selenoproteins, designated SelL, containing two Sec separated by two other residues to form a UxxU motif. SelL proteins show an unusual occurrence, being present in diverse aquatic organisms, including fish, invertebrates, and marine bacteria. Both eukaryotic and bacterial SelL genes use single SECIS elements for insertion of two Sec. In eukaryotes, the SECIS is located in the 3′ UTR, whereas the bacterial SelL SECIS is within a coding region and positioned at a distance that supports the insertion of either of the two Sec or both of these residues. SelL proteins possess a thioredoxin-like fold wherein the UxxU motif corresponds to the catalytic CxxC motif in thioredoxins, suggesting a redox function of SelL proteins. Distantly related SelL-like proteins were also identified in a variety of organisms that had either one or both Sec replaced with Cys. Danio rerio SelL, transiently expressed in mammalian cells, incorporated two Sec and localized to the cytosol. In these cells, it occurred in an oxidized form and was not reducible by DTT. In a bacterial expression system, we directly demonstrated the formation of a diselenide bond between the two Sec, establishing it as the first diselenide bond found in a natural protein. PMID:17715293

  9. Interplay of a non-conjugative integrative element and a conjugative plasmid in the spread of antibiotic resistance via suicidal plasmid transfer from an aquaculture Vibrio isolate.

    PubMed

    Nonaka, Lisa; Yamamoto, Tatsuya; Maruyama, Fumito; Hirose, Yuu; Onishi, Yuki; Kobayashi, Takeshi; Suzuki, Satoru; Nomura, Nobuhiko; Masuda, Michiaki; Yano, Hirokazu

    2018-01-01

    The capture of antimicrobial resistance genes (ARGs) by mobile genetic elements (MGEs) plays a critical role in resistance acquisition for human-associated bacteria. Although aquaculture environments are recognized as important reservoirs of ARGs, intra- and intercellular mobility of MGEs discovered in marine organisms is poorly characterized. Here, we show a new pattern of interspecies ARGs transfer involving a 'non-conjugative' integrative element. To identify active MGEs in a Vibrio ponticus isolate, we conducted whole-genome sequencing of a transconjugant obtained by mating between Escherichia coli and Vibrio ponticus. This revealed integration of a plasmid (designated pSEA1) into the chromosome, consisting of a self-transmissible plasmid backbone of the MOBH group, ARGs, and a 13.8-kb integrative element Tn6283. Molecular genetics analysis suggested a two-step gene transfer model. First, Tn6283 integrates into the recipient chromosome during suicidal plasmid transfer, followed by homologous recombination between the Tn6283 copy in the chromosome and that in the newly transferred pSEA1. Tn6283 is unusual among integrative elements in that it apparently does not encode transfer function and its excision barely generates unoccupied donor sites. Thus, its movement is analogous to the transposition of insertion sequences rather than to that of canonical integrative and conjugative elements. Overall, this study reveals the presence of a previously unrecognized type of MGE in a marine organism, highlighting diversity in the mode of interspecies gene transfer.

  10. Recent Amplification of the Kangaroo Endogenous Retrovirus, KERV, Limited to the Centromere▿

    PubMed Central

    Ferreri, Gianni C.; Brown, Judith D.; Obergfell, Craig; Jue, Nathaniel; Finn, Caitlin E.; O'Neill, Michael J.; O'Neill, Rachel J.

    2011-01-01

    Mammalian retrotransposons, transposable elements that are processed through an RNA intermediate, are categorized as short interspersed elements (SINEs), long interspersed elements (LINEs), and long terminal repeat (LTR) retroelements, which include endogenous retroviruses. The ability of transposable elements to autonomously amplify led to their initial characterization as selfish or junk DNA; however, it is now known that they may acquire specific cellular functions in a genome and are implicated in host defense mechanisms as well as in genome evolution. Interactions between classes of transposable elements may exert a markedly different and potentially more significant effect on a genome than interactions between members of a single class of transposable elements. We examined the genomic structure and evolution of the kangaroo endogenous retrovirus (KERV) in the marsupial genus Macropus. The complete proviral structure of the kangaroo endogenous retrovirus, phylogenetic relationship among relative retroviruses, and expression of this virus in both Macropus rufogriseus and M. eugenii are presented for the first time. In addition, we show the relative copy number and distribution of the kangaroo endogenous retrovirus in the Macropus genus. Our data indicate that amplification of the kangaroo endogenous retrovirus occurred in a lineage-specific fashion, is restricted to the centromeres, and is not correlated with LINE depletion. Finally, analysis of KERV long terminal repeat sequences using massively parallel sequencing indicates that the recent amplification in M. rufogriseus is likely due to duplications and concerted evolution rather than a high number of independent insertion events. PMID:21389136

  11. High-throughput analysis of T-DNA location and structure using sequence capture

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Inagaki, Soichi; Henry, Isabelle M.; Lieberman, Meric C.

    Agrobacterium-mediated transformation of plants with T-DNA is used both to introduce transgenes and for mutagenesis. Conventional approaches used to identify the genomic location and the structure of the inserted T-DNA are laborious and high-throughput methods using next-generation sequencing are being developed to address these problems. Here, we present a cost-effective approach that uses sequence capture targeted to the T-DNA borders to select genomic DNA fragments containing T-DNA—genome junctions, followed by Illumina sequencing to determine the location and junction structure of T-DNA insertions. Multiple probes can be mixed so that transgenic lines transformed with different T-DNA types can be processed simultaneously,more » using a simple, index-based pooling approach. We also developed a simple bioinformatic tool to find sequence read pairs that span the junction between the genome and T-DNA or any foreign DNA. We analyzed 29 transgenic lines of Arabidopsis thaliana, each containing inserts from 4 different T-DNA vectors. We determined the location of T-DNA insertions in 22 lines, 4 of which carried multiple insertion sites. Additionally, our analysis uncovered a high frequency of unconventional and complex T-DNA insertions, highlighting the needs for high-throughput methods for T-DNA localization and structural characterization. Transgene insertion events have to be fully characterized prior to use as commercial products. As a result, our method greatly facilitates the first step of this characterization of transgenic plants by providing an efficient screen for the selection of promising lines.« less

  12. High-throughput analysis of T-DNA location and structure using sequence capture

    DOE PAGES

    Inagaki, Soichi; Henry, Isabelle M.; Lieberman, Meric C.; ...

    2015-10-07

    Agrobacterium-mediated transformation of plants with T-DNA is used both to introduce transgenes and for mutagenesis. Conventional approaches used to identify the genomic location and the structure of the inserted T-DNA are laborious and high-throughput methods using next-generation sequencing are being developed to address these problems. Here, we present a cost-effective approach that uses sequence capture targeted to the T-DNA borders to select genomic DNA fragments containing T-DNA—genome junctions, followed by Illumina sequencing to determine the location and junction structure of T-DNA insertions. Multiple probes can be mixed so that transgenic lines transformed with different T-DNA types can be processed simultaneously,more » using a simple, index-based pooling approach. We also developed a simple bioinformatic tool to find sequence read pairs that span the junction between the genome and T-DNA or any foreign DNA. We analyzed 29 transgenic lines of Arabidopsis thaliana, each containing inserts from 4 different T-DNA vectors. We determined the location of T-DNA insertions in 22 lines, 4 of which carried multiple insertion sites. Additionally, our analysis uncovered a high frequency of unconventional and complex T-DNA insertions, highlighting the needs for high-throughput methods for T-DNA localization and structural characterization. Transgene insertion events have to be fully characterized prior to use as commercial products. As a result, our method greatly facilitates the first step of this characterization of transgenic plants by providing an efficient screen for the selection of promising lines.« less

  13. Early Strains of Multidrug-Resistant Salmonella enterica Serovar Kentucky Sequence Type 198 from Southeast Asia Harbor Salmonella Genomic Island 1-J Variants with a Novel Insertion Sequence

    PubMed Central

    Le Hello, Simon; Weill, François-Xavier; Guibert, Véronique; Praud, Karine; Cloeckaert, Axel

    2012-01-01

    Salmonella genomic island 1 (SGI1) is a 43-kb integrative mobilizable element that harbors a great diversity of multidrug resistance gene clusters described in numerous Salmonella enterica serovars and also in Proteus mirabilis. The majority of SGI1 variants contain an In104-derivative complex class 1 integron inserted between resolvase gene res and open reading frame (ORF) S044 in SGI1. Recently, the international spread of ciprofloxacin-resistant S. enterica serovar Kentucky sequence type 198 (ST198) containing SGI1-K variants has been reported. A retrospective study was undertaken to characterize ST198 S. Kentucky strains isolated before the spread of the epidemic ST198-SGI1-K population in Africa and the Middle East. Here, we characterized 12 ST198 S. Kentucky strains isolated between 1969 and 1999, mainly from humans returning from Southeast Asia (n = 10 strains) or Israel (n = 1 strain) or from meat in Egypt (n = 1 strain). All these ST198 S. Kentucky strains did not belong to the XbaI pulsotype X1 associated with the African epidemic clone but to pulsotype X2. SGI1-J subgroup variants containing different complex integrons with a partial transposition module and inserted within ORF S023 of SGI1 were detected in six strains. The SGI1-J4 variant containing a partially deleted class 1 integron and thus showing a narrow resistance phenotype to sulfonamides was identified in two epidemiologically unrelated strains from Indonesia. The four remaining strains harbored a novel SGI1-J variant, named SGI1-J6, which contained aadA2, floR2, tetR(G)-tetA(G), and sul1 resistance genes within its complex integron. Moreover, in all these S. Kentucky isolates, a novel insertion sequence related to the IS630 family and named ISSen5 was found inserted upstream of the SGI1 complex integron in ORF S023. Thus, two subpopulations of S. Kentucky ST198 independently and exclusively acquired the SGI1 during the 1980s and 1990s. Unlike the ST198-X1 African epidemic subpopulation, the ST198-X2 subpopulation mainly from Asia harbors variants of the SGI1-J subgroup that are encountered mainly in the Far East, as previously described for S. enterica serovars Emek and Virchow. PMID:22802251

  14. Early strains of multidrug-resistant Salmonella enterica serovar Kentucky sequence type 198 from Southeast Asia harbor Salmonella genomic island 1-J variants with a novel insertion sequence.

    PubMed

    Le Hello, Simon; Weill, François-Xavier; Guibert, Véronique; Praud, Karine; Cloeckaert, Axel; Doublet, Benoît

    2012-10-01

    Salmonella genomic island 1 (SGI1) is a 43-kb integrative mobilizable element that harbors a great diversity of multidrug resistance gene clusters described in numerous Salmonella enterica serovars and also in Proteus mirabilis. The majority of SGI1 variants contain an In104-derivative complex class 1 integron inserted between resolvase gene res and open reading frame (ORF) S044 in SGI1. Recently, the international spread of ciprofloxacin-resistant S. enterica serovar Kentucky sequence type 198 (ST198) containing SGI1-K variants has been reported. A retrospective study was undertaken to characterize ST198 S. Kentucky strains isolated before the spread of the epidemic ST198-SGI1-K population in Africa and the Middle East. Here, we characterized 12 ST198 S. Kentucky strains isolated between 1969 and 1999, mainly from humans returning from Southeast Asia (n = 10 strains) or Israel (n = 1 strain) or from meat in Egypt (n = 1 strain). All these ST198 S. Kentucky strains did not belong to the XbaI pulsotype X1 associated with the African epidemic clone but to pulsotype X2. SGI1-J subgroup variants containing different complex integrons with a partial transposition module and inserted within ORF S023 of SGI1 were detected in six strains. The SGI1-J4 variant containing a partially deleted class 1 integron and thus showing a narrow resistance phenotype to sulfonamides was identified in two epidemiologically unrelated strains from Indonesia. The four remaining strains harbored a novel SGI1-J variant, named SGI1-J6, which contained aadA2, floR2, tetR(G)-tetA(G), and sul1 resistance genes within its complex integron. Moreover, in all these S. Kentucky isolates, a novel insertion sequence related to the IS630 family and named ISSen5 was found inserted upstream of the SGI1 complex integron in ORF S023. Thus, two subpopulations of S. Kentucky ST198 independently and exclusively acquired the SGI1 during the 1980s and 1990s. Unlike the ST198-X1 African epidemic subpopulation, the ST198-X2 subpopulation mainly from Asia harbors variants of the SGI1-J subgroup that are encountered mainly in the Far East, as previously described for S. enterica serovars Emek and Virchow.

  15. Begin at the beginning: A BAC-end view of the passion fruit (Passiflora) genome.

    PubMed

    Santos, Anselmo Azevedo; Penha, Helen Alves; Bellec, Arnaud; Munhoz, Carla de Freitas; Pedrosa-Harand, Andrea; Bergès, Hélène; Vieira, Maria Lucia Carneiro

    2014-09-26

    The passion fruit (Passiflora edulis) is a tropical crop of economic importance both for juice production and consumption as fresh fruit. The juice is also used in concentrate blends that are consumed worldwide. However, very little is known about the genome of the species. Therefore, improving our understanding of passion fruit genomics is essential and to some degree a pre-requisite if its genetic resources are to be used more efficiently. In this study, we have constructed a large-insert BAC library and provided the first view on the structure and content of the passion fruit genome, using BAC-end sequence (BES) data as a major resource. The library consisted of 82,944 clones and its levels of organellar DNA were very low. The library represents six haploid genome equivalents, and the average insert size was 108 kb. To check its utility for gene isolation, successful macroarray screening experiments were carried out with probes complementary to eight Passiflora gene sequences available in public databases. BACs harbouring those genes were used in fluorescent in situ hybridizations and unique signals were detected for four BACs in three chromosomes (n=9). Then, we explored 10,000 BES and we identified reads likely to contain repetitive mobile elements (19.6% of all BES), simple sequence repeats and putative proteins, and to estimate the GC content (~42%) of the reads. Around 9.6% of all BES were found to have high levels of similarity to plant genes and ontological terms were assigned to more than half of the sequences analysed (940). The vast majority of the top-hits made by our sequences were to Populus trichocarpa (24.8% of the total occurrences), Theobroma cacao (21.6%), Ricinus communis (14.3%), Vitis vinifera (6.5%) and Prunus persica (3.8%). We generated the first large-insert library for a member of Passifloraceae. This BAC library provides a new resource for genetic and genomic studies, as well as it represents a valuable tool for future whole genome study. Remarkably, a number of BAC-end pair sequences could be mapped to intervals of the sequenced Arabidopsis thaliana, V. vinifera and P. trichocarpa chromosomes, and putative collinear microsyntenic regions were identified.

  16. A bioinformatics approach for identifying transgene insertion sites using whole genome sequencing data.

    PubMed

    Park, Doori; Park, Su-Hyun; Ban, Yong Wook; Kim, Youn Shic; Park, Kyoung-Cheul; Kim, Nam-Soo; Kim, Ju-Kon; Choi, Ik-Young

    2017-08-15

    Genetically modified crops (GM crops) have been developed to improve the agricultural traits of modern crop cultivars. Safety assessments of GM crops are of paramount importance in research at developmental stages and before releasing transgenic plants into the marketplace. Sequencing technology is developing rapidly, with higher output and labor efficiencies, and will eventually replace existing methods for the molecular characterization of genetically modified organisms. To detect the transgenic insertion locations in the three GM rice gnomes, Illumina sequencing reads are mapped and classified to the rice genome and plasmid sequence. The both mapped reads are classified to characterize the junction site between plant and transgene sequence by sequence alignment. Herein, we present a next generation sequencing (NGS)-based molecular characterization method, using transgenic rice plants SNU-Bt9-5, SNU-Bt9-30, and SNU-Bt9-109. Specifically, using bioinformatics tools, we detected the precise insertion locations and copy numbers of transfer DNA, genetic rearrangements, and the absence of backbone sequences, which were equivalent to results obtained from Southern blot analyses. NGS methods have been suggested as an effective means of characterizing and detecting transgenic insertion locations in genomes. Our results demonstrate the use of a combination of NGS technology and bioinformatics approaches that offers cost- and time-effective methods for assessing the safety of transgenic plants.

  17. Genome Sequence of the Fleming Strain of Micrococcus luteus, a Simple Free-Living Actinobacterium▿ †‡

    PubMed Central

    Young, Michael; Artsatbanov, Vladislav; Beller, Harry R.; Chandra, Govind; Chater, Keith F.; Dover, Lynn G.; Goh, Ee-Been; Kahan, Tamar; Kaprelyants, Arseny S.; Kyrpides, Nikos; Lapidus, Alla; Lowry, Stephen R.; Lykidis, Athanasios; Mahillon, Jacques; Markowitz, Victor; Mavromatis, Konstantinos; Mukamolova, Galina V.; Oren, Aharon; Rokem, J. Stefan; Smith, Margaret C. M.; Young, Danielle I.; Greenblatt, Charles L.

    2010-01-01

    Micrococcus luteus (NCTC2665, “Fleming strain”) has one of the smallest genomes of free-living actinobacteria sequenced to date, comprising a single circular chromosome of 2,501,097 bp (G+C content, 73%) predicted to encode 2,403 proteins. The genome shows extensive synteny with that of the closely related organism, Kocuria rhizophila, from which it was taxonomically separated relatively recently. Despite its small size, the genome harbors 73 insertion sequence (IS) elements, almost all of which are closely related to elements found in other actinobacteria. An IS element is inserted into the rrs gene of one of only two rrn operons found in M. luteus. The genome encodes only four sigma factors and 14 response regulators, a finding indicative of adaptation to a rather strict ecological niche (mammalian skin). The high sensitivity of M. luteus to β-lactam antibiotics may result from the presence of a reduced set of penicillin-binding proteins and the absence of a wblC gene, which plays an important role in the antibiotic resistance in other actinobacteria. Consistent with the restricted range of compounds it can use as a sole source of carbon for energy and growth, M. luteus has a minimal complement of genes concerned with carbohydrate transport and metabolism and its inability to utilize glucose as a sole carbon source may be due to the apparent absence of a gene encoding glucokinase. Uniquely among characterized bacteria, M. luteus appears to be able to metabolize glycogen only via trehalose and to make trehalose only via glycogen. It has very few genes associated with secondary metabolism. In contrast to most other actinobacteria, M. luteus encodes only one resuscitation-promoting factor (Rpf) required for emergence from dormancy, and its complement of other dormancy-related proteins is also much reduced. M. luteus is capable of long-chain alkene biosynthesis, which is of interest for advanced biofuel production; a three-gene cluster essential for this metabolism has been identified in the genome. PMID:19948807

  18. Target sites for the transposition of rat long interspersed repeated DNA elements (LINEs) are not random.

    PubMed Central

    Furano, A V; Somerville, C C; Tsichlis, P N; D'Ambrosio, E

    1986-01-01

    The long interspersed repeated DNA family of rats (LINE or L1Rn family) contains about 40,000 6.7-kilobase (kb) long members (1). LINE members may be currently mobile since their presence or absence causes allelic variation at three single copy loci (2, 3): insulin 1, Moloney leukemia virus integration 2 (Mlvi-2) (4), and immunoglobulin heavy chain (Igh). To characterize target sites for LINE insertion, we compared the DNA sequences of the unoccupied Mlvi-2 target site, its LINE-containing allele, and several other LINE-containing sites. Although not homologous overall, the target sites share three characteristics: First, depending on the site, they are from 68% to 86% (A+T) compared to 58% (A+T) for total rat DNA (5). Depending on the site, a 7- to 15-bp target site sequence becomes duplicated and flanks the inserted LINE member. The second is a version (0 or 1 mismatch) of the hexanucleotide, TACTCA, which is also present in the LINE member, in a highly conserved region located just before the A-rich right end of the LINE member. The third is a stretch of alternating purine/pyrimidine (PQ). The A-rich right ends of different LINE members vary in length and composition, and the sequence of a particularly long one suggests that it contains the A-rich target site from a previous transposition. PMID:3012480

  19. Characteristics of the nuclear (18S, 5.8S, 28S and 5S) and mitochondrial (12S and 16S) rRNA genes of Apis mellifera (Insecta: Hymenoptera): structure, organization, and retrotransposable elements

    PubMed Central

    Gillespie, J J; Johnston, J S; Cannone, J J; Gutell, R R

    2006-01-01

    As an accompanying manuscript to the release of the honey bee genome, we report the entire sequence of the nuclear (18S, 5.8S, 28S and 5S) and mitochondrial (12S and 16S) ribosomal RNA (rRNA)-encoding gene sequences (rDNA) and related internally and externally transcribed spacer regions of Apis mellifera (Insecta: Hymenoptera: Apocrita). Additionally, we predict secondary structures for the mature rRNA molecules based on comparative sequence analyses with other arthropod taxa and reference to recently published crystal structures of the ribosome. In general, the structures of honey bee rRNAs are in agreement with previously predicted rRNA models from other arthropods in core regions of the rRNA, with little additional expansion in non-conserved regions. Our multiple sequence alignments are made available on several public databases and provide a preliminary establishment of a global structural model of all rRNAs from the insects. Additionally, we provide conserved stretches of sequences flanking the rDNA cistrons that comprise the externally transcribed spacer regions (ETS) and part of the intergenic spacer region (IGS), including several repetitive motifs. Finally, we report the occurrence of retrotransposition in the nuclear large subunit rDNA, as R2 elements are present in the usual insertion points found in other arthropods. Interestingly, functional R1 elements usually present in the genomes of insects were not detected in the honey bee rRNA genes. The reverse transcriptase products of the R2 elements are deduced from their putative open reading frames and structurally aligned with those from another hymenopteran insect, the jewel wasp Nasonia (Pteromalidae). Stretches of conserved amino acids shared between Apis and Nasonia are illustrated and serve as potential sites for primer design, as target amplicons within these R2 elements may serve as novel phylogenetic markers for Hymenoptera. Given the impending completion of the sequencing of the Nasonia genome, we expect our report eventually to shed light on the evolution of the hymenopteran genome within higher insects, particularly regarding the relative maintenance of conserved rDNA genes, related variable spacer regions and retrotransposable elements. PMID:17069639

  20. Differential effects of simple repeating DNA sequences on gene expression from the SV40 early promoter.

    PubMed

    Amirhaeri, S; Wohlrab, F; Wells, R D

    1995-02-17

    The influence of simple repeat sequences, cloned into different positions relative to the SV40 early promoter/enhancer, on the transient expression of the chloramphenicol acetyltransferase (CAT) gene was investigated. Insertion of (G)29.(C)29 in either orientation into the 5'-untranslated region of the CAT gene reduced expression in CV-1 cells 50-100 fold when compared with controls with random sequence inserts. Analysis of CAT-specific mRNA levels demonstrated that the effect was due to a reduction of CAT mRNA production rather than to posttranscriptional events. In contrast, insertion of the same insert in either orientation upstream of the promoter-enhancer or downstream of the gene stimulated gene expression 2-3-fold. These effects could be reversed by cotransfection of a competitor plasmid carrying (G)25.(C)25 sequences. The results suggest that a G.C-binding transcription factor modulates gene expression in this system and that promoter strength can be regulated by providing protein-binding sites in trans. Although constructs containing longer tracts of alternating (C-G), (T-G), or (A-T) sequences inhibited CAT expression when inserted in the 5'-untranslated region of the CAT gene, the amount of CAT mRNA was unaffected. Hence, these inhibitions must be due to posttranscriptional events, presumably at the level of translation. These effects of microsatellite sequences on gene expression are discussed with respect to recent data on related simple repeat sequences which cause several human genetic diseases.

  1. Evidence That Intergenic Spacer Repeats of Drosophila Melanogaster Rrna Genes Function as X-Y Pairing Sites in Male Meiosis, and a General Model for Achiasmatic Pairing

    PubMed Central

    McKee, B. D.; Habera, L.; Vrana, J. A.

    1992-01-01

    In Drosophila melanogaster males, X-Y meiotic chromosome pairing is mediated by the nucleolus organizers (NOs) which are located in the X heterochromatin (Xh) and near the Y centromere. Deficiencies for Xh disrupt X-Y meiotic pairing and cause high frequencies of X-Y nondisjunction. Insertion of cloned rRNA genes on an Xh(-) chromosome partially restores normal X-Y pairing and disjunction. To map the sequences within an inserted, X-linked rRNA gene responsible for stimulating X-Y pairing, partial deletions were generated by P element-mediated destabilization of the insert. Complete deletions of the rRNA transcription unit did not interfere with the ability to stimulate X-Y pairing as long as most of the intergenic spacer (IGS) remained. Within groups of deletions that lacked the entire transcription unit and differed only in length of residual IGS material, pairing ability was proportional to the dose of 240-bp intergenic spacer repeats. Deletions of the complete rRNA transcription unit or of the 28S sequences alone blocked nucleolus formation, as determined by binding of an antinucleolar antibody, yet did not interfere with pairing ability, suggesting that X-Y pairing may not be mechanistically related to nucleolus formation. A model for achiasmatic pairing in Drosophila males based upon the combined action of topoisomerase I and a strand transferase is proposed. PMID:1330825

  2. Influence of flanking sequences on presentation efficiency of a CD8+ cytotoxic T-cell epitope delivered by parvovirus-like particles.

    PubMed

    Rueda, P; Morón, G; Sarraseca, J; Leclerc, C; Casal, J I

    2004-03-01

    We have previously developed an antigen-delivery system based on hybrid recombinant porcine parvovirus-like particles (PPV-VLPs) formed by the self-assembly of the VP2 protein of PPV carrying a foreign epitope at its N terminus. In this study, different constructs were made containing a CD8(+) T-cell epitope of chicken ovalbumin (OVA) to analyse the influence of the sequence inserted into VP2 on the correct processing of VLPs by antigen-presenting cells. We analysed the presentation of the OVA epitope inserted without flanking sequences or with either different natural flanking sequences or with the natural flanking sequences of a CD8(+) T-cell epitope from the lymphocytic choriomeningitis virus nucleoprotein, and as a dimer with or without linker sequences. All constructs were studied in terms of level of expression, assembly of VLPs and ability to deliver the inserted epitope into the MHC I pathway. The presentation of the OVA epitope was considerably improved by insertion of short natural flanking sequences, which indicated the relevance of the flanking sequences on the processing of PPV-VLPs. Only PPV-VLPs carrying two copies of the OVA epitope linked by two glycines were able to be properly processed, suggesting that the introduction of flexible residues between the two consecutive OVA epitopes may be necessary for the correct presentation of these dimers by PPV-VLPs. These results provide information to improve the insertion of epitopes into PPV-VLPs to facilitate their processing and presentation by MHC class I molecules.

  3. Gene structure of CYP3A4, an adult-specific form of cytochrome P450 in human livers, and its transcriptional control.

    PubMed

    Hashimoto, H; Toide, K; Kitamura, R; Fujita, M; Tagawa, S; Itoh, S; Kamataki, T

    1993-12-01

    CYP3 A4 is the adult-specific form of cytochrome P450 in human livers [Komori, M., Nishio, K., Kitada, M., Shiramatsu, K., Muroya, K., Soma, M., Nagashima, K. & Kamataki, T. (1990) Biochemistry 29, 4430-4433]. The sequences of three genomic clones for CYP3A4 were analyzed for all exons, exon-intron junctions and the 5'-flanking region from the major transcription site to nucleotide position -1105, and compared with those of the CYP3A7 gene, a fetal-specific form of cytochrome P450 in humans. The results showed that the identity of 5'-flanking sequences between CYP3A4 and CYP3A7 genes was 91%, and that each 5'-flanking region had characteristic sequences termed as NFSE (P450NF-specific element) and HFLaSE (P450HFLa specific element), respectively. A basic transcription element (BTE) also lay in the 5'-flanking region of the CYP3A4 gene as seen in many CYP genes [Yanagida, A., Sogawa, K., Yasumoto, K. & Fujii-Kuriyama, Y. (1990) Mol. Cell. Biol. 10, 1470-1475]. The BTE binding factor (BTEB) was present in both adult and fetal human livers. To examine the transcriptional activity of the CYP3A4 gene, DNA fragments in the 5'-flanking region of the gene were inserted in front of the simian virus 40 promoter and the chloramphenicol acetyltransferase structural gene, and the constructs were transfected in HepG2 cells. The analysis of the chloramphenicol acetyltransferase activity indicated that (a) specific element(s) which could bind with a factor(s) in livers was present in the 5'-flanking region of the CYP3A4 gene to show the transcriptional activity.

  4. A genomic landscape of mitochondrial DNA insertions in the pig nuclear genome provides evolutionary signatures of interspecies admixture.

    PubMed

    Schiavo, Giuseppina; Hoffmann, Orsolya Ivett; Ribani, Anisa; Utzeri, Valerio Joe; Ghionda, Marco Ciro; Bertolini, Francesca; Geraci, Claudia; Bovo, Samuele; Fontanesi, Luca

    2017-10-01

    Nuclear DNA sequences of mitochondrial origin (numts) are derived by insertion of mitochondrial DNA (mtDNA), into the nuclear genome. In this study, we provide, for the first time, a genome picture of numts inserted in the pig nuclear genome. The Sus scrofa reference nuclear genome (Sscrofa10.2) was aligned with circularized and consensus mtDNA sequences using LAST software. A total of 430 numt sequences that may represent 246 different numt integration events (57 numt regions determined by at least two numt sequences and 189 singletons) were identified, covering about 0.0078% of the nuclear genome. Numt integration events were correlated (0.99) to the chromosome length. The longest numt sequence (about 11 kbp) was located on SSC2. Six numts were sequenced and PCR amplified in pigs of European commercial and local pig breeds, of the Chinese Meishan breed and in European wild boars. Three of them were polymorphic for the presence or absence of the insertion. Surprisingly, the estimated age of insertion of two of the three polymorphic numts was more ancient than that of the speciation time of the Sus scrofa, supporting that these polymorphic sites were originated from interspecies admixture that contributed to shape the pig genome. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  5. Germline transformation of the western corn rootworm, Diabrotica virgifera virgifera.

    PubMed

    Chu, F; Klobasa, W; Wu, P; Pinzi, S; Grubbs, N; Gorski, S; Cardoza, Y; Lorenzen, M D

    2017-08-01

    The western corn rootworm (WCR), a major pest of maize, is notorious for rapidly adapting biochemically, behaviourally and developmentally to a variety of control methods. Despite much effort, the genetic basis of WCR adaptation remains a mystery. Since transformation-based applications such as transposon tagging and enhancer trapping have facilitated genetic dissection of model species such as Drosophila melanogaster, we developed a germline-transformation system for WCR in an effort to gain a greater understanding of the basic biology of this economically important insect. Here we report the use of a fluorescent-marked Minos element to create transgenic WCR. We demonstrate that the transgenic strains express both an eye-specific fluorescent marker and piggyBac transposase. We identified insertion-site junction sequences via inverse PCR and assessed insertion copy number using digital droplet PCR (ddPCR). Interestingly, most WCR identified as transgenic via visual screening for DsRed fluorescence proved to carry multiple Minos insertions when tested via ddPCR. A total of eight unique insertion strains were created by outcrossing the initial transgenic strains to nontransgenic WCR mates. Establishing transgenic technologies for this beetle is the first step towards bringing a wide range of transformation-based tools to bear on understanding WCR biology. © 2017 The Royal Entomological Society.

  6. The B73 maize genome: complexity, diversity, and dynamics.

    PubMed

    Schnable, Patrick S; Ware, Doreen; Fulton, Robert S; Stein, Joshua C; Wei, Fusheng; Pasternak, Shiran; Liang, Chengzhi; Zhang, Jianwei; Fulton, Lucinda; Graves, Tina A; Minx, Patrick; Reily, Amy Denise; Courtney, Laura; Kruchowski, Scott S; Tomlinson, Chad; Strong, Cindy; Delehaunty, Kim; Fronick, Catrina; Courtney, Bill; Rock, Susan M; Belter, Eddie; Du, Feiyu; Kim, Kyung; Abbott, Rachel M; Cotton, Marc; Levy, Andy; Marchetto, Pamela; Ochoa, Kerri; Jackson, Stephanie M; Gillam, Barbara; Chen, Weizu; Yan, Le; Higginbotham, Jamey; Cardenas, Marco; Waligorski, Jason; Applebaum, Elizabeth; Phelps, Lindsey; Falcone, Jason; Kanchi, Krishna; Thane, Thynn; Scimone, Adam; Thane, Nay; Henke, Jessica; Wang, Tom; Ruppert, Jessica; Shah, Neha; Rotter, Kelsi; Hodges, Jennifer; Ingenthron, Elizabeth; Cordes, Matt; Kohlberg, Sara; Sgro, Jennifer; Delgado, Brandon; Mead, Kelly; Chinwalla, Asif; Leonard, Shawn; Crouse, Kevin; Collura, Kristi; Kudrna, Dave; Currie, Jennifer; He, Ruifeng; Angelova, Angelina; Rajasekar, Shanmugam; Mueller, Teri; Lomeli, Rene; Scara, Gabriel; Ko, Ara; Delaney, Krista; Wissotski, Marina; Lopez, Georgina; Campos, David; Braidotti, Michele; Ashley, Elizabeth; Golser, Wolfgang; Kim, HyeRan; Lee, Seunghee; Lin, Jinke; Dujmic, Zeljko; Kim, Woojin; Talag, Jayson; Zuccolo, Andrea; Fan, Chuanzhu; Sebastian, Aswathy; Kramer, Melissa; Spiegel, Lori; Nascimento, Lidia; Zutavern, Theresa; Miller, Beth; Ambroise, Claude; Muller, Stephanie; Spooner, Will; Narechania, Apurva; Ren, Liya; Wei, Sharon; Kumari, Sunita; Faga, Ben; Levy, Michael J; McMahan, Linda; Van Buren, Peter; Vaughn, Matthew W; Ying, Kai; Yeh, Cheng-Ting; Emrich, Scott J; Jia, Yi; Kalyanaraman, Ananth; Hsia, An-Ping; Barbazuk, W Brad; Baucom, Regina S; Brutnell, Thomas P; Carpita, Nicholas C; Chaparro, Cristian; Chia, Jer-Ming; Deragon, Jean-Marc; Estill, James C; Fu, Yan; Jeddeloh, Jeffrey A; Han, Yujun; Lee, Hyeran; Li, Pinghua; Lisch, Damon R; Liu, Sanzhen; Liu, Zhijie; Nagel, Dawn Holligan; McCann, Maureen C; SanMiguel, Phillip; Myers, Alan M; Nettleton, Dan; Nguyen, John; Penning, Bryan W; Ponnala, Lalit; Schneider, Kevin L; Schwartz, David C; Sharma, Anupma; Soderlund, Carol; Springer, Nathan M; Sun, Qi; Wang, Hao; Waterman, Michael; Westerman, Richard; Wolfgruber, Thomas K; Yang, Lixing; Yu, Yeisoo; Zhang, Lifang; Zhou, Shiguo; Zhu, Qihui; Bennetzen, Jeffrey L; Dawe, R Kelly; Jiang, Jiming; Jiang, Ning; Presting, Gernot G; Wessler, Susan R; Aluru, Srinivas; Martienssen, Robert A; Clifton, Sandra W; McCombie, W Richard; Wing, Rod A; Wilson, Richard K

    2009-11-20

    We report an improved draft nucleotide sequence of the 2.3-gigabase genome of maize, an important crop plant and model for biological research. Over 32,000 genes were predicted, of which 99.8% were placed on reference chromosomes. Nearly 85% of the genome is composed of hundreds of families of transposable elements, dispersed nonuniformly across the genome. These were responsible for the capture and amplification of numerous gene fragments and affect the composition, sizes, and positions of centromeres. We also report on the correlation of methylation-poor regions with Mu transposon insertions and recombination, and copy number variants with insertions and/or deletions, as well as how uneven gene losses between duplicated regions were involved in returning an ancient allotetraploid to a genetically diploid state. These analyses inform and set the stage for further investigations to improve our understanding of the domestication and agricultural improvements of maize.

  7. Regulation and Adaptive Evolution of Lactose Operon Expression in Lactobacillus delbrueckii

    PubMed Central

    Lapierre, Luciane; Mollet, Beat; Germond, Jacques-Edouard

    2002-01-01

    Lactobacillus delbrueckii subsp. bulgaricus and L. delbrueckii subsp. lactis are both used in the dairy industry as homofermentative lactic acid bacteria in the production of fermented milk products. After selective pressure for the fast fermentation of milk in the manufacture of yogurts, L. delbrueckii subsp. bulgaricus loses its ability to regulate lac operon expression. A series of mutations led to the constitutive expression of the lac genes. A complex of insertion sequence (IS) elements (ISL4 inside ISL5), inserted at the border of the lac promoter, induced the loss of the palindromic structure of one of the operators likely involved in the binding of regulatory factors. A lac repressor gene was discovered downstream of the β-galactosidase gene of L. delbrueckii subsp. lactis and was shown to be inactivated by several mutations in L. delbrueckii subsp. bulgaricus. Regulatory mechanisms of the lac gene expression of L. delbrueckii subsp. bulgaricus and L. delbrueckii subsp. lactis were compared by heterologous expression in Lactococcus lactis of the two lac promoters in front of a reporter gene (β-glucuronidase) in the presence or absence of the lac repressor gene. Insertion of the complex of IS elements in the lac promoter of L. delbrueckii subsp. bulgaricus increased the promoter's activity but did not prevent repressor binding; rather, it increased the affinity of the repressor for the promoter. Inactivation of the lac repressor by mutations was then necessary to induce the constitutive expression of the lac genes in L. delbrueckii subsp. bulgaricus. PMID:11807052

  8. Silencing Effect of Hominoid Highly Conserved Noncoding Sequences on Embryonic Brain Development

    PubMed Central

    Mahmoudi Saber, Morteza

    2017-01-01

    Abstract Superfamily Hominoidea, which consists of Hominidae (humans and great apes) and Hylobatidae (gibbons), is well-known for sharing human-like characteristics, however, the genomic origins of these shared unique phenotypes have mainly remained elusive. To decipher the underlying genomic basis of Hominoidea-restricted phenotypes, we identified and characterized Hominoidea-restricted highly conserved noncoding sequences (HCNSs) that are a class of potential regulatory elements which may be involved in evolution of lineage-specific phenotypes. We discovered 679 such HCNSs from human, chimpanzee, gorilla, orangutan and gibbon genomes. These HCNSs were demonstrated to be under purifying selection but with lineage-restricted characteristics different from old CNSs. A significant proportion of their ancestral sequences had accelerated rates of nucleotide substitutions, insertions and deletions during the evolution of common ancestor of Hominoidea, suggesting the intervention of positive Darwinian selection for creating those HCNSs. In contrary to enhancer elements and similar to silencer sequences, these Hominoidea-restricted HCNSs are located in close proximity of transcription start sites. Their target genes are enriched in the nervous system, development and transcription, and they tend to be remotely located from the nearest coding gene. Chip-seq signals and gene expression patterns suggest that Hominoidea-restricted HCNSs are likely to be functional regulatory elements by imposing silencing effects on their target genes in a tissue-restricted manner during fetal brain development. These HCNSs, emerged through adaptive evolution and conserved through purifying selection, represent a set of promising targets for future functional studies of the evolution of Hominoidea-restricted phenotypes. PMID:28633494

  9. Male Germline Control of Transposable Elements1

    PubMed Central

    Bao, Jianqiang; Yan, Wei

    2012-01-01

    ABSTRACT Repetitive sequences, especially transposon-derived interspersed repetitive elements, account for a large fraction of the genome in most eukaryotes. Despite the repetitive nature, these transposable elements display quantitative and qualitative differences even among species of the same lineage. Although transposable elements contribute greatly as a driving force to the biological diversity during evolution, they can induce embryonic lethality and genetic disorders as a result of insertional mutagenesis and genomic rearrangement. Temporary relaxation of the epigenetic control of retrotransposons during early germline development opens a risky window that can allow retrotransposons to escape from host constraints and to propagate abundantly in the host genome. Because germline mutations caused by retrotransposon activation are heritable and thus can be deleterious to the offspring, an adaptive strategy has evolved in host cells, especially in the germline. In this review, we will attempt to summarize general defense mechanisms deployed by the eukaryotic genome, with an emphasis on pathways utilized by the male germline to confer retrotransposon silencing. PMID:22357546

  10. Identification of the protease cleavage sites in a reconstituted Gag polyprotein of an HERV-K(HML-2) element

    PubMed Central

    2011-01-01

    Background The human genome harbors several largely preserved HERV-K(HML-2) elements. Although this retroviral family comes closest of all known HERVs to producing replication competent virions, mutations acquired during their chromosomal residence have rendered them incapable of expressing infectious particles. This also holds true for the HERV-K113 element that has conserved open reading frames (ORFs) for all its proteins in addition to a functional LTR promoter. Uncertainty concerning the localization and impact of post-insertional mutations has greatly hampered the functional characterization of these ancient retroviruses and their proteins. However, analogous to other betaretroviruses, it is known that HERV-K(HML-2) virions undergo a maturation process during or shortly after release from the host cell. During this process, the subdomains of the Gag polyproteins are released by proteolytic cleavage, although the nature of the mature HERV-K(HML-2) Gag proteins and the exact position of the cleavage sites have until now remained unknown. Results By aligning the amino acid sequences encoded by the gag-pro-pol ORFs of HERV-K113 with the corresponding segments from 10 other well-preserved human specific elements we identified non-synonymous post-insertional mutations that have occurred in this region of the provirus. Reversion of these mutations and a partial codon optimization facilitated the large-scale production of maturation-competent HERV-K113 virus-like particles (VLPs). The Gag subdomains of purified mature VLPs were separated by reversed-phase high-pressure liquid chromatography and initially characterized using specific antibodies. Cleavage sites were identified by mass spectrometry and N-terminal sequencing and confirmed by mutagenesis. Our results indicate that the gag gene product Pr74Gag of HERV-K(HML-2) is processed to yield p15-MA (matrix), SP1 (spacer peptide of 14 amino acids), p15, p27-CA (capsid), p10-NC (nucleocapsid) and two C-terminally encoded glutamine- and proline-rich peptides, QP1 and QP2, spanning 23 and 19 amino acids, respectively. Conclusions Expression of reconstituted sequences of original HERV elements is an important tool for studying fundamental aspects of the biology of these ancient viruses. The analysis of HERV-K(HML-2) Gag processing and the nature of the mature Gag proteins presented here will facilitate further studies of the discrete functions of these proteins and of their potential impact on the human host. PMID:21554716

  11. Identification of the protease cleavage sites in a reconstituted Gag polyprotein of an HERV-K(HML-2) element.

    PubMed

    George, Maja; Schwecke, Torsten; Beimforde, Nadine; Hohn, Oliver; Chudak, Claudia; Zimmermann, Anja; Kurth, Reinhard; Naumann, Dieter; Bannert, Norbert

    2011-05-09

    The human genome harbors several largely preserved HERV-K(HML-2) elements. Although this retroviral family comes closest of all known HERVs to producing replication competent virions, mutations acquired during their chromosomal residence have rendered them incapable of expressing infectious particles. This also holds true for the HERV-K113 element that has conserved open reading frames (ORFs) for all its proteins in addition to a functional LTR promoter. Uncertainty concerning the localization and impact of post-insertional mutations has greatly hampered the functional characterization of these ancient retroviruses and their proteins. However, analogous to other betaretroviruses, it is known that HERV-K(HML-2) virions undergo a maturation process during or shortly after release from the host cell. During this process, the subdomains of the Gag polyproteins are released by proteolytic cleavage, although the nature of the mature HERV-K(HML-2) Gag proteins and the exact position of the cleavage sites have until now remained unknown. By aligning the amino acid sequences encoded by the gag-pro-pol ORFs of HERV-K113 with the corresponding segments from 10 other well-preserved human specific elements we identified non-synonymous post-insertional mutations that have occurred in this region of the provirus. Reversion of these mutations and a partial codon optimization facilitated the large-scale production of maturation-competent HERV-K113 virus-like particles (VLPs). The Gag subdomains of purified mature VLPs were separated by reversed-phase high-pressure liquid chromatography and initially characterized using specific antibodies. Cleavage sites were identified by mass spectrometry and N-terminal sequencing and confirmed by mutagenesis. Our results indicate that the gag gene product Pr74Gag of HERV-K(HML-2) is processed to yield p15-MA (matrix), SP1 (spacer peptide of 14 amino acids), p15, p27-CA (capsid), p10-NC (nucleocapsid) and two C-terminally encoded glutamine- and proline-rich peptides, QP1 and QP2, spanning 23 and 19 amino acids, respectively. Expression of reconstituted sequences of original HERV elements is an important tool for studying fundamental aspects of the biology of these ancient viruses. The analysis of HERV-K(HML-2) Gag processing and the nature of the mature Gag proteins presented here will facilitate further studies of the discrete functions of these proteins and of their potential impact on the human host.

  12. High-Resolution Functional Mapping of the Venezuelan Equine Encephalitis Virus Genome by Insertional Mutagenesis and Massively Parallel Sequencing

    DTIC Science & Technology

    2010-10-14

    High-Resolution Functional Mapping of the Venezuelan Equine Encephalitis Virus Genome by Insertional Mutagenesis and Massively Parallel Sequencing...Venezuelan equine encephalitis virus (VEEV) genome. We initially used a capillary electrophoresis method to gain insight into the role of the VEEV...Smith JM, Schmaljohn CS (2010) High-Resolution Functional Mapping of the Venezuelan Equine Encephalitis Virus Genome by Insertional Mutagenesis and

  13. GABI-Kat SimpleSearch: new features of the Arabidopsis thaliana T-DNA mutant database.

    PubMed

    Kleinboelting, Nils; Huep, Gunnar; Kloetgen, Andreas; Viehoever, Prisca; Weisshaar, Bernd

    2012-01-01

    T-DNA insertion mutants are very valuable for reverse genetics in Arabidopsis thaliana. Several projects have generated large sequence-indexed collections of T-DNA insertion lines, of which GABI-Kat is the second largest resource worldwide. User access to the collection and its Flanking Sequence Tags (FSTs) is provided by the front end SimpleSearch (http://www.GABI-Kat.de). Several significant improvements have been implemented recently. The database now relies on the TAIRv10 genome sequence and annotation dataset. All FSTs have been newly mapped using an optimized procedure that leads to improved accuracy of insertion site predictions. A fraction of the collection with weak FST yield was re-analysed by generating new FSTs. Along with newly found predictions for older sequences about 20,000 new FSTs were included in the database. Information about groups of FSTs pointing to the same insertion site that is found in several lines but is real only in a single line are included, and many problematic FST-to-line links have been corrected using new wet-lab data. SimpleSearch currently contains data from ~71,000 lines with predicted insertions covering 62.5% of the 27,206 nuclear protein coding genes, and offers insertion allele-specific data from 9545 confirmed lines that are available from the Nottingham Arabidopsis Stock Centre.

  14. Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome.

    PubMed

    González, Leonardo Galindo; Deyholos, Michael K

    2012-11-21

    Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the Copia LTR superfamily are inserted close to coding regions, we suggest Copia elements have a greater influence on recent flax genome evolution while Gypsy elements have become residual and highly mutated.

  15. Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome

    PubMed Central

    2012-01-01

    Background Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. Results Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. Conclusions The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the Copia LTR superfamily are inserted close to coding regions, we suggest Copia elements have a greater influence on recent flax genome evolution while Gypsy elements have become residual and highly mutated. PMID:23171245

  16. Initial sequence and comparative analysis of the cat genome

    PubMed Central

    Pontius, Joan U.; Mullikin, James C.; Smith, Douglas R.; Lindblad-Toh, Kerstin; Gnerre, Sante; Clamp, Michele; Chang, Jean; Stephens, Robert; Neelam, Beena; Volfovsky, Natalia; Schäffer, Alejandro A.; Agarwala, Richa; Narfström, Kristina; Murphy, William J.; Giger, Urs; Roca, Alfred L.; Antunes, Agostinho; Menotti-Raymond, Marilyn; Yuhki, Naoya; Pecon-Slattery, Jill; Johnson, Warren E.; Bourque, Guillaume; Tesler, Glenn; O’Brien, Stephen J.

    2007-01-01

    The genome sequence (1.9-fold coverage) of an inbred Abyssinian domestic cat was assembled, mapped, and annotated with a comparative approach that involved cross-reference to annotated genome assemblies of six mammals (human, chimpanzee, mouse, rat, dog, and cow). The results resolved chromosomal positions for 663,480 contigs, 20,285 putative feline gene orthologs, and 133,499 conserved sequence blocks (CSBs). Additional annotated features include repetitive elements, endogenous retroviral sequences, nuclear mitochondrial (numt) sequences, micro-RNAs, and evolutionary breakpoints that suggest historic balancing of translocation and inversion incidences in distinct mammalian lineages. Large numbers of single nucleotide polymorphisms (SNPs), deletion insertion polymorphisms (DIPs), and short tandem repeats (STRs), suitable for linkage or association studies were characterized in the context of long stretches of chromosome homozygosity. In spite of the light coverage capturing ∼65% of euchromatin sequence from the cat genome, these comparative insights shed new light on the tempo and mode of gene/genome evolution in mammals, promise several research applications for the cat, and also illustrate that a comparative approach using more deeply covered mammals provides an informative, preliminary annotation of a light (1.9-fold) coverage mammal genome sequence. PMID:17975172

  17. Comparative genotyping of Clostridium thermocellum strains isolated from biogas plants: genetic markers and characterization of cellulolytic potential.

    PubMed

    Koeck, Daniela E; Zverlov, Vladimir V; Liebl, Wolfgang; Schwarz, Wolfgang H

    2014-07-01

    Clostridium thermocellum is among the most prevalent of known anaerobic cellulolytic bacteria. In this study, genetic and phenotypic variations among C. thermocellum strains isolated from different biogas plants were determined and different genotyping methods were evaluated on these isolates. At least two C. thermocellum strains were isolated independently from each of nine different biogas plants via enrichment on cellulose. Various DNA-based genotyping methods such as ribotyping, RAPD (Random Amplified Polymorphic DNA) and VNTR (Variable Number of Tandem Repeats) were applied to these isolates. One novel approach - the amplification of unknown target sequences between copies of a previously discovered Random Inserted Mobile Element (RIME) - was also tested. The genotyping method with the highest discriminatory power was found to be the amplification of the sequences between the insertion elements, where isolates from each biogas plant yielded a different band pattern. Cellulolytic potentials, optimal growth conditions and substrate spectra of all isolates were characterized to help identify phenotypic variations. Irrespective of the genotyping method used, the isolates from each individual biogas plant always exhibited identical patterns. This is suggestive of a single C. thermocellum strain exhibiting dominance in each biogas plant. The genotypic groups reflect the results of the physiological characterization of the isolates like substrate diversity and cellulase activity. Conversely, strains isolated across a range of biogas plants differed in their genotyping results and physiological properties. Both strains isolated from one biogas plant had the best specific cellulose-degrading properties and might therefore achieve superior substrate utilization yields in biogas fermenters. Copyright © 2014 Elsevier GmbH. All rights reserved.

  18. The Mouse Genomes Project: a repository of inbred laboratory mouse strain genomes.

    PubMed

    Adams, David J; Doran, Anthony G; Lilue, Jingtao; Keane, Thomas M

    2015-10-01

    The Mouse Genomes Project was initiated in 2009 with the goal of using next-generation sequencing technologies to catalogue molecular variation in the common laboratory mouse strains, and a selected set of wild-derived inbred strains. The initial sequencing and survey of sequence variation in 17 inbred strains was completed in 2011 and included comprehensive catalogue of single nucleotide polymorphisms, short insertion/deletions, larger structural variants including their fine scale architecture and landscape of transposable element variation, and genomic sites subject to post-transcriptional alteration of RNA. From this beginning, the resource has expanded significantly to include 36 fully sequenced inbred laboratory mouse strains, a refined and updated data processing pipeline, and new variation querying and data visualisation tools which are available on the project's website ( http://www.sanger.ac.uk/resources/mouse/genomes/ ). The focus of the project is now the completion of de novo assembled chromosome sequences and strain-specific gene structures for the core strains. We discuss how the assembled chromosomes will power comparative analysis, data access tools and future directions of mouse genetics.

  19. Novel insertion mutation of ABCB1 gene in an ivermectin-sensitive Border Collie.

    PubMed

    Han, Jae-Ik; Son, Hyoung-Won; Park, Seung-Cheol; Na, Ki-Jeong

    2010-12-01

    P-glycoprotein (P-gp) is encoded by the ABCB1 gene and acts as an efflux pump for xenobiotics. In the Border Collie, a nonsense mutation caused by a 4-base pair deletion in the ABCB1 gene is associated with a premature stop to P-gp synthesis. In this study, we examined the full-length coding sequence of the ABCB1 gene in an ivermectin-sensitive Border Collie that lacked the aforementioned deletion mutation. The sequence was compared to the corresponding sequences of a wild-type Beagle and seven ivermectin-tolerant family members of the Border Collie. When compared to the wild-type Beagle sequence, that of the ivermectin-sensitive Border Collie was found to have one insertion mutation and eight single nucleotide polymorphisms (SNPs) in the coding sequence of the ABCB1 gene. While the eight SNPs were also found in the family members' sequences, the insertion mutation was found only in the ivermectin-sensitive dog. These results suggest the possibility that the SNPs are species-specific features of the ABCB1 gene in Border Collies, and that the insertion mutation may be related to ivermectin intolerance.

  20. Novel insertion mutation of ABCB1 gene in an ivermectin-sensitive Border Collie

    PubMed Central

    Han, Jae-Ik; Son, Hyoung-Won; Park, Seung-Cheol

    2010-01-01

    P-glycoprotein (P-gp) is encoded by the ABCB1 gene and acts as an efflux pump for xenobiotics. In the Border Collie, a nonsense mutation caused by a 4-base pair deletion in the ABCB1 gene is associated with a premature stop to P-gp synthesis. In this study, we examined the full-length coding sequence of the ABCB1 gene in an ivermectin-sensitive Border Collie that lacked the aforementioned deletion mutation. The sequence was compared to the corresponding sequences of a wild-type Beagle and seven ivermectin-tolerant family members of the Border Collie. When compared to the wild-type Beagle sequence, that of the ivermectin-sensitive Border Collie was found to have one insertion mutation and eight single nucleotide polymorphisms (SNPs) in the coding sequence of the ABCB1 gene. While the eight SNPs were also found in the family members' sequences, the insertion mutation was found only in the ivermectin-sensitive dog. These results suggest the possibility that the SNPs are species-specific features of the ABCB1 gene in Border Collies, and that the insertion mutation may be related to ivermectin intolerance. PMID:21113104

  1. High-coverage sequencing and annotated assembly of the genome of the Australian dragon lizard Pogona vitticeps.

    PubMed

    Georges, Arthur; Li, Qiye; Lian, Jinmin; O'Meally, Denis; Deakin, Janine; Wang, Zongji; Zhang, Pei; Fujita, Matthew; Patel, Hardip R; Holleley, Clare E; Zhou, Yang; Zhang, Xiuwen; Matsubara, Kazumi; Waters, Paul; Graves, Jennifer A Marshall; Sarre, Stephen D; Zhang, Guojie

    2015-01-01

    The lizards of the family Agamidae are one of the most prominent elements of the Australian reptile fauna. Here, we present a genomic resource built on the basis of a wild-caught male ZZ central bearded dragon Pogona vitticeps. The genomic sequence for P. vitticeps, generated on the Illumina HiSeq 2000 platform, comprised 317 Gbp (179X raw read depth) from 13 insert libraries ranging from 250 bp to 40 kbp. After filtering for low-quality and duplicated reads, 146 Gbp of data (83X) was available for assembly. Exceptionally high levels of heterozygosity (0.85 % of single nucleotide polymorphisms plus sequence insertions or deletions) complicated assembly; nevertheless, 96.4 % of reads mapped back to the assembled scaffolds, indicating that the assembly included most of the sequenced genome. Length of the assembly was 1.8 Gbp in 545,310 scaffolds (69,852 longer than 300 bp), the longest being 14.68 Mbp. N50 was 2.29 Mbp. Genes were annotated on the basis of de novo prediction, similarity to the green anole Anolis carolinensis, Gallus gallus and Homo sapiens proteins, and P. vitticeps transcriptome sequence assemblies, to yield 19,406 protein-coding genes in the assembly, 63 % of which had intact open reading frames. Our assembly captured 99 % (246 of 248) of core CEGMA genes, with 93 % (231) being complete. The quality of the P. vitticeps assembly is comparable or superior to that of other published squamate genomes, and the annotated P. vitticeps genome can be accessed through a genome browser available at https://genomics.canberra.edu.au.

  2. Whole Genome Sequencing Identifies a 78 kb Insertion from Chromosome 8 as the Cause of Charcot-Marie-Tooth Neuropathy CMTX3

    PubMed Central

    Brewer, Megan H.; Chaudhry, Rabia; Qi, Jessica; Kidambi, Aditi; Drew, Alexander P.; Ryan, Monique M.; Subramanian, Gopinath M.; Young, Helen K.; Zuchner, Stephan; Reddel, Stephen W.; Nicholson, Garth A.; Kennerson, Marina L.

    2016-01-01

    With the advent of whole exome sequencing, cases where no pathogenic coding mutations can be found are increasingly being observed in many diseases. In two large, distantly-related families that mapped to the Charcot-Marie-Tooth neuropathy CMTX3 locus at chromosome Xq26.3-q27.3, all coding mutations were excluded. Using whole genome sequencing we found a large DNA interchromosomal insertion within the CMTX3 locus. The 78 kb insertion originates from chromosome 8q24.3, segregates fully with the disease in the two families, and is absent from the general population as well as 627 neurologically normal chromosomes from in-house controls. Large insertions into chromosome Xq27.1 are known to cause a range of diseases and this is the first neuropathy phenotype caused by an interchromosomal insertion at this locus. The CMTX3 insertion represents an understudied pathogenic structural variation mechanism for inherited peripheral neuropathies. Our finding highlights the importance of considering all structural variation types when studying unsolved inherited peripheral neuropathy cases with no pathogenic coding mutations. PMID:27438001

  3. Mobility and generation of mosaic non-autonomous transposons by Tn3-derived inverted-repeat miniature elements (TIMEs).

    PubMed

    Szuplewska, Magdalena; Ludwiczak, Marta; Lyzwa, Katarzyna; Czarnecki, Jakub; Bartosik, Dariusz

    2014-01-01

    Functional transposable elements (TEs) of several Pseudomonas spp. strains isolated from black shale ore of Lubin mine and from post-flotation tailings of Zelazny Most in Poland, were identified using a positive selection trap plasmid strategy. This approach led to the capture and characterization of (i) 13 insertion sequences from 5 IS families (IS3, IS5, ISL3, IS30 and IS1380), (ii) isoforms of two Tn3-family transposons--Tn5563a and Tn4662a (the latter contains a toxin-antitoxin system), as well as (iii) non-autonomous TEs of diverse structure, ranging in size from 262 to 3892 bp. The non-autonomous elements transposed into AT-rich DNA regions and generated 5- or 6-bp sequence duplications at the target site of transposition. Although these TEs lack a transposase gene, they contain homologous 38-bp-long terminal inverted repeat sequences (IRs), highly conserved in Tn5563a and many other Tn3-family transposons. The simplest elements of this type, designated TIMEs (Tn3 family-derived Inverted-repeat Miniature Elements) (262 bp), were identified within two natural plasmids (pZM1P1 and pLM8P2) of Pseudomonas spp. It was demonstrated that TIMEs are able to mobilize segments of plasmid DNA for transposition, which results in the generation of more complex non-autonomous elements, resembling IS-driven composite transposons in structure. Such transposon-like elements may contain different functional genetic modules in their core regions, including plasmid replication systems. Another non-autonomous element "captured" with a trap plasmid was a TIME derivative containing a predicted resolvase gene and a res site typical for many Tn3-family transposons. The identification of a portable site-specific recombination system is another intriguing example confirming the important role of non-autonomous TEs of the TIME family in shuffling genetic information in bacterial genomes. Transposition of such mosaic elements may have a significant impact on diversity and evolution, not only of transposons and plasmids, but also of other types of mobile genetic elements.

  4. Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization.

    PubMed

    Seibt, Kathrin M; Wenke, Torsten; Muders, Katja; Truberg, Bernd; Schmidt, Thomas

    2016-05-01

    Short interspersed nuclear elements (SINEs) are highly abundant non-autonomous retrotransposons that are widespread in plants. They are short in size, non-coding, show high sequence diversity, and are therefore mostly not or not correctly annotated in plant genome sequences. Hence, comparative studies on genomic SINE populations are rare. To explore the structural organization and impact of SINEs, we comparatively investigated the genome sequences of the Solanaceae species potato (Solanum tuberosum), tomato (Solanum lycopersicum), wild tomato (Solanum pennellii), and two pepper cultivars (Capsicum annuum). Based on 8.5 Gbp sequence data, we annotated 82 983 SINE copies belonging to 10 families and subfamilies on a base pair level. Solanaceae SINEs are dispersed over all chromosomes with enrichments in distal regions. Depending on the genome assemblies and gene predictions, 30% of all SINE copies are associated with genes, particularly frequent in introns and untranslated regions (UTRs). The close association with genes is family specific. More than 10% of all genes annotated in the Solanaceae species investigated contain at least one SINE insertion, and we found genes harbouring up to 16 SINE copies. We demonstrate the involvement of SINEs in gene and genome evolution including the donation of splice sites, start and stop codons and exons to genes, enlargement of introns and UTRs, generation of tandem-like duplications and transduction of adjacent sequence regions. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  5. Sequence-Level Mechanisms of Human Epigenome Evolution

    PubMed Central

    Prendergast, James G.D.; Chambers, Emily V.; Semple, Colin A.M.

    2014-01-01

    DNA methylation and chromatin states play key roles in development and disease. However, the extent of recent evolutionary divergence in the human epigenome and the influential factors that have shaped it are poorly understood. To determine the links between genome sequence and human epigenome evolution, we examined the divergence of DNA methylation and chromatin states following segmental duplication events in the human lineage. Chromatin and DNA methylation states were found to have been generally well conserved following a duplication event, with the evolution of the epigenome largely uncoupled from the total number of genetic changes in the surrounding DNA sequence. However, the epigenome at tissue-specific, distal regulatory regions was observed to be unusually prone to diverge following duplication, with particular sequence differences, altering known sequence motifs, found to be associated with divergence in patterns of DNA methylation and chromatin. Alu elements were found to have played a particularly prominent role in shaping human epigenome evolution, and we show that human-specific AluY insertion events are strongly linked to the evolution of the DNA methylation landscape and gene expression levels, including at key neurological genes in the human brain. Studying paralogous regions within the same sample enables the study of the links between genome and epigenome evolution while controlling for biological and technical variation. We show DNA methylation and chromatin divergence between duplicated regions are linked to the divergence of particular genetic motifs, with Alu elements having played a disproportionate role in the evolution of the epigenome in the human lineage. PMID:24966180

  6. A small and efficient dimerization/packaging signal of rat VL30 RNA and its use in murine leukemia virus-VL30-derived vectors for gene transfer.

    PubMed

    Torrent, C; Gabus, C; Darlix, J L

    1994-02-01

    Retroviral genomes consist of two identical RNA molecules associated at their 5' ends by the dimer linkage structure located in the packaging element (Psi or E) necessary for RNA dimerization in vitro and packaging in vivo. In murine leukemia virus (MLV)-derived vectors designed for gene transfer, the Psi + sequence of 600 nucleotides directs the packaging of recombinant RNAs into MLV virions produced by helper cells. By using in vitro RNA dimerization as a screening system, a sequence of rat VL30 RNA located next to the 5' end of the Harvey mouse sarcoma virus genome and as small as 67 nucleotides was found to form stable dimeric RNA. In addition, a purine-rich sequence located at the 5' end of this VL30 RNA seems to be critical for RNA dimerization. When this VL30 element was extended by 107 nucleotides at its 3' end and inserted into an MLV-derived vector lacking MLV Psi +, it directed the efficient encapsidation of recombinant RNAs into MLV virions. Because this VL30 packaging signal is smaller and more efficient in packaging recombinant RNAs than the MLV Psi + and does not contain gag or glyco-gag coding sequences, its use in MLV-derived vectors should render even more unlikely recombinations which could generate replication-competent viruses. Therefore, utilization of the rat VL30 packaging sequence should improve the biological safety of MLV vectors for human gene transfer.

  7. Retroposition of the AFC family of SINEs (short interspersed repetitive elements) before and during the adaptive radiation of cichlid fishes in Lake Malawi and related inferences about phylogeny.

    PubMed

    Takahashi, K; Nishida, M; Yuma, M; Okada, N

    2001-01-01

    Lake Malawi is home to more than 450 species of endemic cichlids, which provide a spectacular example of adaptive radiation. To clarify the phylogenetic relationships among these fish, we examined the presence and absence of SINEs (short interspersed repetitive elements) at orthologous loci. We identified six loci at which a SINE sequence had apparently been specifically inserted by retroposition in the common ancestor of all the investigated species of endemic cichlids in Lake Malawi. At another locus, unique sharing of a SINE sequence was evident among all the investigated species of endemic non-Mbuna cichlids with the exception of Rhamphochromis sp. The relationships were in good agreement with those deduced in previous studies with various different markers, demonstrating that the SINE method is useful for the elucidation of phylogenetic relationships among cichlids in Lake Malawi. We also characterized a locus that exhibited transspecies polymorphism with respect to the presence or absence of the SINE sequence among non-Mbuna species. This result suggests that incomplete lineage sorting and/or interspecific hybridization might have occurred or be occurring among the species in this group, which might potentially cause misinterpretation of phylogenetic data, in particular when a single-locus marker, such as a sequence in the mitochondrial DNA, is used for analysis.

  8. Comparative Analysis of the First Complete Enterococcus faecium Genome

    PubMed Central

    Lam, Margaret M. C.; Seemann, Torsten; Bulach, Dieter M.; Gladman, Simon L.; Chen, Honglei; Haring, Volker; Moore, Robert J.; Ballard, Susan; Grayson, M. Lindsay; Johnson, Paul D. R.; Howden, Benjamin P.

    2012-01-01

    Vancomycin-resistant enterococci (VRE) are one of the leading causes of nosocomial infections in health care facilities around the globe. In particular, infections caused by vancomycin-resistant Enterococcus faecium are becoming increasingly common. Comparative and functional genomic studies of E. faecium isolates have so far been limited owing to the lack of a fully assembled E. faecium genome sequence. Here we address this issue and report the complete 3.0-Mb genome sequence of the multilocus sequence type 17 vancomycin-resistant Enterococcus faecium strain Aus0004, isolated from the bloodstream of a patient in Melbourne, Australia, in 1998. The genome comprises a 2.9-Mb circular chromosome and three circular plasmids. The chromosome harbors putative E. faecium virulence factors such as enterococcal surface protein, hemolysin, and collagen-binding adhesin. Aus0004 has a very large accessory genome (38%) that includes three prophage and two genomic islands absent among 22 other E. faecium genomes. One of the prophage was present as inverted 50-kb repeats that appear to have facilitated a 683-kb chromosomal inversion across the replication terminus, resulting in a striking replichore imbalance. Other distinctive features include 76 insertion sequence elements and a single chromosomal copy of Tn1549 containing the vanB vancomycin resistance element. A complete E. faecium genome will be a useful resource to assist our understanding of this emerging nosocomial pathogen. PMID:22366422

  9. Adjacent DNA sequences modulate Sox9 transcriptional activation at paired Sox sites in three chondrocyte-specific enhancer elements

    PubMed Central

    Bridgewater, Laura C.; Walker, Marlan D.; Miller, Gwen C.; Ellison, Trevor A.; Holsinger, L. Daniel; Potter, Jennifer L.; Jackson, Todd L.; Chen, Reuben K.; Winkel, Vicki L.; Zhang, Zhaoping; McKinney, Sandra; de Crombrugghe, Benoit

    2003-01-01

    Expression of the type XI collagen gene Col11a2 is directed to cartilage by at least three chondrocyte-specific enhancer elements, two in the 5′ region and one in the first intron of the gene. The three enhancers each contain two heptameric sites with homology to the Sox protein-binding consensus sequence. The two sites are separated by 3 or 4 bp and arranged in opposite orientation to each other. Targeted mutational analyses of these three enhancers showed that in the intronic enhancer, as in the other two enhancers, both Sox sites in a pair are essential for enhancer activity. The transcription factor Sox9 binds as a dimer at the paired sites, and the introduction of insertion mutations between the sites demonstrated that physical interactions between the adjacently bound proteins are essential for enhancer activity. Additional mutational analyses demonstrated that although Sox9 binding at the paired Sox sites is necessary for enhancer activity, it alone is not sufficient. Adjacent DNA sequences in each enhancer are also required, and mutation of those sequences can eliminate enhancer activity without preventing Sox9 binding. The data suggest a new model in which adjacently bound proteins affect the DNA bend angle produced by Sox9, which in turn determines whether an active transcriptional enhancer complex is assembled. PMID:12595563

  10. Sequence Composition and Gene Content of the Short Arm of Rye (Secale cereale) Chromosome 1

    PubMed Central

    Fluch, Silvia; Kopecky, Dieter; Burg, Kornel; Šimková, Hana; Taudien, Stefan; Petzold, Andreas; Kubaláková, Marie; Platzer, Matthias; Berenyi, Maria; Krainer, Siegfried; Doležel, Jaroslav; Lelley, Tamas

    2012-01-01

    Background The purpose of the study is to elucidate the sequence composition of the short arm of rye chromosome 1 (Secale cereale) with special focus on its gene content, because this portion of the rye genome is an integrated part of several hundreds of bread wheat varieties worldwide. Methodology/Principal Findings Multiple Displacement Amplification of 1RS DNA, obtained from flow sorted 1RS chromosomes, using 1RS ditelosomic wheat-rye addition line, and subsequent Roche 454FLX sequencing of this DNA yielded 195,313,589 bp sequence information. This quantity of sequence information resulted in 0.43× sequence coverage of the 1RS chromosome arm, permitting the identification of genes with estimated probability of 95%. A detailed analysis revealed that more than 5% of the 1RS sequence consisted of gene space, identifying at least 3,121 gene loci representing 1,882 different gene functions. Repetitive elements comprised about 72% of the 1RS sequence, Gypsy/Sabrina (13.3%) being the most abundant. More than four thousand simple sequence repeat (SSR) sites mostly located in gene related sequence reads were identified for possible marker development. The existence of chloroplast insertions in 1RS has been verified by identifying chimeric chloroplast-genomic sequence reads. Synteny analysis of 1RS to the full genomes of Oryza sativa and Brachypodium distachyon revealed that about half of the genes of 1RS correspond to the distal end of the short arm of rice chromosome 5 and the proximal region of the long arm of Brachypodium distachyon chromosome 2. Comparison of the gene content of 1RS to 1HS barley chromosome arm revealed high conservation of genes related to chromosome 5 of rice. Conclusions The present study revealed the gene content and potential gene functions on this chromosome arm and demonstrated numerous sequence elements like SSRs and gene-related sequences, which can be utilised for future research as well as in breeding of wheat and rye. PMID:22328922

  11. Chromosome arm-specific BAC end sequences permit comparative analysis of homoeologous chromosomes and genomes of polyploid wheat

    PubMed Central

    2012-01-01

    Background Bread wheat, one of the world’s staple food crops, has the largest, highly repetitive and polyploid genome among the cereal crops. The wheat genome holds the key to crop genetic improvement against challenges such as climate change, environmental degradation, and water scarcity. To unravel the complex wheat genome, the International Wheat Genome Sequencing Consortium (IWGSC) is pursuing a chromosome- and chromosome arm-based approach to physical mapping and sequencing. Here we report on the use of a BAC library made from flow-sorted telosomic chromosome 3A short arm (t3AS) for marker development and analysis of sequence composition and comparative evolution of homoeologous genomes of hexaploid wheat. Results The end-sequencing of 9,984 random BACs from a chromosome arm 3AS-specific library (TaaCsp3AShA) generated 11,014,359 bp of high quality sequence from 17,591 BAC-ends with an average length of 626 bp. The sequence represents 3.2% of t3AS with an average DNA sequence read every 19 kb. Overall, 79% of the sequence consisted of repetitive elements, 1.38% as coding regions (estimated 2,850 genes) and another 19% of unknown origin. Comparative sequence analysis suggested that 70-77% of the genes present in both 3A and 3B were syntenic with model species. Among the transposable elements, gypsy/sabrina (12.4%) was the most abundant repeat and was significantly more frequent in 3A compared to homoeologous chromosome 3B. Twenty novel repetitive sequences were also identified using de novo repeat identification. BESs were screened to identify simple sequence repeats (SSR) and transposable element junctions. A total of 1,057 SSRs were identified with a density of one per 10.4 kb, and 7,928 junctions between transposable elements (TE) and other sequences were identified with a density of one per 1.39 kb. With the objective of enhancing the marker density of chromosome 3AS, oligonucleotide primers were successfully designed from 758 SSRs and 695 Insertion Site Based Polymorphisms (ISBPs). Of the 96 ISBP primer pairs tested, 28 (29%) were 3A-specific and compared to 17 (18%) for 96 SSRs. Conclusion This work reports on the use of wheat chromosome arm 3AS-specific BAC library for the targeted generation of sequence data from a particular region of the huge genome of wheat. A large quantity of sequences were generated from the A genome of hexaploid wheat for comparative genome analysis with homoeologous B and D genomes and other model grass genomes. Hundreds of molecular markers were developed from the 3AS arm-specific sequences; these and other sequences will be useful in gene discovery and physical mapping. PMID:22559868

  12. CRISPR regulation of intraspecies diversification by limiting IS transposition and intercellular recombination.

    PubMed

    Watanabe, Takayasu; Nozawa, Takashi; Aikawa, Chihiro; Amano, Atsuo; Maruyama, Fumito; Nakagawa, Ichiro

    2013-01-01

    Mobile genetic elements (MGEs) and genetic rearrangement are considered as major driving forces of bacterial diversification. Previous comparative genome analysis of Porphyromonas gingivalis, a pathogen related to periodontitis, implied such an important relationship. As a counterpart system to MGEs, clustered regularly interspaced short palindromic repeats (CRISPRs) in bacteria may be useful for genetic typing. We found that CRISPR typing could be a reasonable alternative to conventional methods for characterizing phylogenetic relationships among 60 highly diverse P. gingivalis isolates. Examination of genetic recombination along with multilocus sequence typing suggests the importance of such events between different isolates. MGEs appear to be strategically located at the breakpoint gaps of complicated genome rearrangements. Of these MGEs, insertion sequences (ISs) were found most frequently. CRISPR analysis identified 2,150 spacers that were clustered into 1,187 unique ones. Most of these spacers exhibited no significant nucleotide similarity to known sequences (97.6%: 1,158/1,187). Surprisingly, CRISPR spacers exhibiting high nucleotide similarity to regions of P. gingivalis genomes including ISs were predominant. The proportion of such spacers to all the unique spacers (1.6%: 19/1,187) was the highest among previous studies, suggesting novel functions for these CRISPRs. These results indicate that P. gingivalis is a bacterium with high intraspecies diversity caused by frequent insertion sequence (IS) transposition, whereas both the introduction of foreign DNA, primarily from other P. gingivalis cells, and IS transposition are limited by CRISPR interference. It is suggested that P. gingivalis CRISPRs could be an important source for understanding the role of CRISPRs in the development of bacterial diversity.

  13. Quantitative analysis of bristle number in Drosophila mutants identifies genes involved in neural development

    NASA Technical Reports Server (NTRS)

    Norga, Koenraad K.; Gurganus, Marjorie C.; Dilda, Christy L.; Yamamoto, Akihiko; Lyman, Richard F.; Patel, Prajal H.; Rubin, Gerald M.; Hoskins, Roger A.; Mackay, Trudy F.; Bellen, Hugo J.

    2003-01-01

    BACKGROUND: The identification of the function of all genes that contribute to specific biological processes and complex traits is one of the major challenges in the postgenomic era. One approach is to employ forward genetic screens in genetically tractable model organisms. In Drosophila melanogaster, P element-mediated insertional mutagenesis is a versatile tool for the dissection of molecular pathways, and there is an ongoing effort to tag every gene with a P element insertion. However, the vast majority of P element insertion lines are viable and fertile as homozygotes and do not exhibit obvious phenotypic defects, perhaps because of the tendency for P elements to insert 5' of transcription units. Quantitative genetic analysis of subtle effects of P element mutations that have been induced in an isogenic background may be a highly efficient method for functional genome annotation. RESULTS: Here, we have tested the efficacy of this strategy by assessing the extent to which screening for quantitative effects of P elements on sensory bristle number can identify genes affecting neural development. We find that such quantitative screens uncover an unusually large number of genes that are known to function in neural development, as well as genes with yet uncharacterized effects on neural development, and novel loci. CONCLUSIONS: Our findings establish the use of quantitative trait analysis for functional genome annotation through forward genetics. Similar analyses of quantitative effects of P element insertions will facilitate our understanding of the genes affecting many other complex traits in Drosophila.

  14. Preparation of high temperature gas-cooled reactor fuel element

    DOEpatents

    Bradley, Ronnie A.; Sease, John D.

    1976-01-01

    This invention relates to a method for the preparation of high temperature gas-cooled reactor (HTGR) fuel elements wherein uncarbonized fuel rods are inserted in appropriate channels of an HTGR fuel element block and the entire block is inserted in an autoclave for in situ carbonization under high pressure. The method is particularly applicable to remote handling techniques.

  15. Parainfluenza virus chimeric mini-replicons indicate a novel regulatory element in the leader promoter.

    PubMed

    Matsumoto, Yusuke; Ohta, Keisuke; Goto, Hideo; Nishio, Machiko

    2016-07-01

    Gene expression of paramyxoviruses is regulated by genome-encoded cis-acting elements; however, whether all the required elements for viral growth have been identified is not clear. Using a mini-replicon system, it has been shown that human parainfluenza virus type 2 (hPIV2) polymerase can recognize the promoter elements of parainfluenza virus type 5 (PIV5), but reporter activity is lower in this case. We constructed a series of luciferase-encoding chimeric PIV2/5 mini-genomes that are basically hPIV2, but whose leader (le), mRNA start signal and trailer sequence are partially replaced with those of PIV5. Studies of the chimeric PIV2/5 mini-replicons demonstrated that replacement of hPIV2 le with PIV5 le results in remarkably weak luciferase expression. Further mutagenesis identified the responsible region as positions 25-30 of the PIV5 le. Using recombinant hPIV2, the impact of this region on viral life cycles was assessed. Insertion of the mutation at this region facilitated viral growth, genomic replication and mRNA transcription at the early stage of infection, which elicited severe cell damage. In contrast, at the late infection stage it caused a reduction in viral transcription. Here, we identify a novel cis-acting element in the internal region of an le sequence that is involved in the regulation of polymerase, and which contributes to maintaining a balance between viral growth and cytotoxicity.

  16. TU-H-CAMPUS-JeP3-05: Adaptive Determination of Needle Sequence HDR Prostate Brachytherapy with Divergent Needle-By-Needle Delivery

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Borot de Battisti, M; Maenhout, M; Lagendijk, J J W

    Purpose: To develop a new method which adaptively determines the optimal needle insertion sequence for HDR prostate brachytherapy involving divergent needle-by-needle dose delivery by e.g. a robotic device. A needle insertion sequence is calculated at the beginning of the intervention and updated after each needle insertion with feedback on needle positioning errors. Methods: Needle positioning errors and anatomy changes may occur during HDR brachytherapy which can lead to errors in the delivered dose. A novel strategy was developed to calculate and update the needle sequence and the dose plan after each needle insertion with feedback on needle positioning errors. Themore » dose plan optimization was performed by numerical simulations. The proposed needle sequence determination optimizes the final dose distribution based on the dose coverage impact of each needle. This impact is predicted stochastically by needle insertion simulations. HDR procedures were simulated with varying number of needle insertions (4 to 12) using 11 patient MR data-sets with PTV, prostate, urethra, bladder and rectum delineated. Needle positioning errors were modeled by random normally distributed angulation errors (standard deviation of 3 mm at the needle’s tip). The final dose parameters were compared in the situations where the needle with the largest vs. the smallest dose coverage impact was selected at each insertion. Results: Over all scenarios, the percentage of clinically acceptable final dose distribution improved when the needle selected had the largest dose coverage impact (91%) compared to the smallest (88%). The differences were larger for few (4 to 6) needle insertions (maximum difference scenario: 79% vs. 60%). The computation time of the needle sequence optimization was below 60s. Conclusion: A new adaptive needle sequence determination for HDR prostate brachytherapy was developed. Coupled to adaptive planning, the selection of the needle with the largest dose coverage impact increases chances of reaching the clinical constraints. M. Borot de Battisti is funded by Philips Medical Systems Nederland B.V.; M. Moerland is principal investigator on a contract funded by Philips Medical Systems Nederland B.V.; G. Hautvast and D. Binnekamp are fulltime employees of Philips Medical Systems Nederland B.V.« less

  17. Innate Immune Complexity in the Purple Sea Urchin: Diversity of the Sp185/333 System

    PubMed Central

    Smith, L. Courtney

    2012-01-01

    The California purple sea urchin, Strongylocentrotus purpuratus, is a long-lived echinoderm with a complex and sophisticated innate immune system. There are several large gene families that function in immunity in this species including the Sp185/333 gene family that has ∼50 (±10) members. The family shows intriguing sequence diversity and encodes a broad array of diverse yet similar proteins. The genes have two exons of which the second encodes the mature protein and has repeats and blocks of sequence called elements. Mosaics of element patterns plus single nucleotide polymorphisms-based variants of the elements result in significant sequence diversity among the genes yet maintains similar structure among the members of the family. Sequence of a bacterial artificial chromosome insert shows a cluster of six, tightly linked Sp185/333 genes that are flanked by GA microsatellites. The sequences between the GA microsatellites in which the Sp185/333 genes and flanking regions are located, are much more similar to each other than are the sequences outside the microsatellites suggesting processes such as gene conversion, recombination, or duplication. However, close linkage does not correspond with greater sequence similarity compared to randomly cloned and sequenced genes that are unlikely to be linked. There are three segmental duplications that are bounded by GAT microsatellites and include three almost identical genes plus flanking regions. RNA editing is detectible throughout the mRNAs based on comparisons to the genes, which, in combination with putative post-translational modifications to the proteins, results in broad arrays of Sp185/333 proteins that differ among individuals. The mature proteins have an N-terminal glycine-rich region, a central RGD motif, and a C-terminal histidine-rich region. The Sp185/333 proteins are localized to the cell surface and are found within vesicles in subsets of polygonal and small phagocytes. The coelomocyte proteome shows full-length and truncated proteins, including some with missense sequence. Current results suggest that both native Sp185/333 proteins and a recombinant protein bind bacteria and are likely important in sea urchin innate immunity. PMID:22566951

  18. In vivo insertion pool sequencing identifies virulence factors in a complex fungal–host interaction

    PubMed Central

    Uhse, Simon; Pflug, Florian G.; Stirnberg, Alexandra; Ehrlinger, Klaus; von Haeseler, Arndt

    2018-01-01

    Large-scale insertional mutagenesis screens can be powerful genome-wide tools if they are streamlined with efficient downstream analysis, which is a serious bottleneck in complex biological systems. A major impediment to the success of next-generation sequencing (NGS)-based screens for virulence factors is that the genetic material of pathogens is often underrepresented within the eukaryotic host, making detection extremely challenging. We therefore established insertion Pool-Sequencing (iPool-Seq) on maize infected with the biotrophic fungus U. maydis. iPool-Seq features tagmentation, unique molecular barcodes, and affinity purification of pathogen insertion mutant DNA from in vivo-infected tissues. In a proof of concept using iPool-Seq, we identified 28 virulence factors, including 23 that were previously uncharacterized, from an initial pool of 195 candidate effector mutants. Because of its sensitivity and quantitative nature, iPool-Seq can be applied to any insertional mutagenesis library and is especially suitable for genetically complex setups like pooled infections of eukaryotic hosts. PMID:29684023

  19. SimulaTE: simulating complex landscapes of transposable elements of populations.

    PubMed

    Kofler, Robert

    2018-04-15

    Estimating the abundance of transposable elements (TEs) in populations (or tissues) promises to answer many open research questions. However, progress is hampered by the lack of concordance between different approaches for TE identification and thus potentially unreliable results. To address this problem, we developed SimulaTE a tool that generates TE landscapes for populations using a newly developed domain specific language (DSL). The simple syntax of our DSL allows for easily building even complex TE landscapes that have, for example, nested, truncated and highly diverged TE insertions. Reads may be simulated for the populations using different sequencing technologies (PacBio, Illumina paired-ends) and strategies (sequencing individuals and pooled populations). The comparison between the expected (i.e. simulated) and the observed results will guide researchers in finding the most suitable approach for a particular research question. SimulaTE is implemented in Python and available at https://sourceforge.net/projects/simulates/. Manual https://sourceforge.net/p/simulates/wiki/Home/#manual; Test data and tutorials https://sourceforge.net/p/simulates/wiki/Home/#walkthrough; Validation https://sourceforge.net/p/simulates/wiki/Home/#validation. robert.kofler@vetmeduni.ac.at.

  20. Enhanced expression of EGFP gene in CHSE-214 cells by an ARS element from mud loach (Misgurnus mizolepis).

    PubMed

    Kim, Moo-Sang; Lim, Hak-Seob; Ahn, Sang Jung; Jeong, Yong-Kee; Kim, Chul Geun; Lee, Hyung Ho

    2007-11-01

    The origins of replication are associated with nuclear matrices or are found in close proximity to matrix attachment regions (MARs). In this report, fish MARs were cloned into an autonomously replicating sequence (ARS) cloning vector and were screened for ARS elements in Saccharomyces cerevisiae. Sixteen clones were isolated that were able to grow on the selective plates. In particular, an ARS905 that shows high efficiency among them was selected for this study. Southern hybridization indicated the autonomous replication of the transformation vector containing the ARS905 element. DNA sequences analysis showed that the ARS905 contained two ARS consensus sequences as well as MAR motifs, such as AT tracts, ORI patterns, and ATC tracts. In vitro matrix binding analysis, major matrix binding activity and ARS function coincided in a subfragment of the ARS905. To analyze the effects of ARS905 on expression of a reporter gene, an ARS905(E1158) with ARS activity was inserted into pBaEGFP(+) containing mud loach beta-actin promoter, EGFP as a reporter gene, and SV40 poly(A) signal. The pBaEGFP(+)-ARS905(E1158) was transfected into a fish cell line, CHSE-214. The intensity of EGFP transfected cells was a 7-fold of the control at 11days post-transfection. These results indicate that ARS905 enhances the expression of the EGFP gene and that it should be as a component of expression vectors in further fish biotechnological studies.

  1. Time- and Cost-Efficient Identification of T-DNA Insertion Sites through Targeted Genomic Sequencing

    PubMed Central

    Lepage, Étienne; Zampini, Éric; Boyle, Brian; Brisson, Normand

    2013-01-01

    Forward genetic screens enable the unbiased identification of genes involved in biological processes. In Arabidopsis, several mutant collections are publicly available, which greatly facilitates such practice. Most of these collections were generated by agrotransformation of a T-DNA at random sites in the plant genome. However, precise mapping of T-DNA insertion sites in mutants isolated from such screens is a laborious and time-consuming task. Here we report a simple, low-cost and time efficient approach to precisely map T-DNA insertions simultaneously in many different mutants. By combining sequence capture, next-generation sequencing and 2D-PCR pooling, we developed a new method that allowed the rapid localization of T-DNA insertion sites in 55 out of 64 mutant plants isolated in a screen for gyrase inhibition hypersensitivity. PMID:23951038

  2. Sorting genomes by reciprocal translocations, insertions, and deletions.

    PubMed

    Qi, Xingqin; Li, Guojun; Li, Shuguang; Xu, Ying

    2010-01-01

    The problem of sorting by reciprocal translocations (abbreviated as SBT) arises from the field of comparative genomics, which is to find a shortest sequence of reciprocal translocations that transforms one genome Pi into another genome Gamma, with the restriction that Pi and Gamma contain the same genes. SBT has been proved to be polynomial-time solvable, and several polynomial algorithms have been developed. In this paper, we show how to extend Bergeron's SBT algorithm to include insertions and deletions, allowing to compare genomes containing different genes. In particular, if the gene set of Pi is a subset (or superset, respectively) of the gene set of Gamma, we present an approximation algorithm for transforming Pi into Gamma by reciprocal translocations and deletions (insertions, respectively), providing a sorting sequence with length at most OPT + 2, where OPT is the minimum number of translocations and deletions (insertions, respectively) needed to transform Pi into Gamma; if Pi and Gamma have different genes but not containing each other, we give a heuristic to transform Pi into Gamma by a shortest sequence of reciprocal translocations, insertions, and deletions, with bounds for the length of the sorting sequence it outputs. At a conceptual level, there is some similarity between our algorithm and the algorithm developed by El Mabrouk which is used to sort two chromosomes with different gene contents by reversals, insertions, and deletions.

  3. Mutations That Stimulate flhDC Expression in Escherichia coli K-12.

    PubMed

    Fahrner, Karen A; Berg, Howard C

    2015-10-01

    Motility is a beneficial attribute that enables cells to access and explore new environments and to escape detrimental ones. The organelle of motility in Escherichia coli is the flagellum, and its production is initiated by the activating transcription factors FlhD and FlhC. The expression of these factors by the flhDC operon is highly regulated and influenced by environmental conditions. The flhDC promoter is recognized by σ(70) and is dependent on the transcriptional activator cyclic AMP (cAMP)-cAMP receptor protein complex (cAMP-CRP). A number of K-12 strains exhibit limited motility due to low expression levels of flhDC. We report here a large number of mutations that stimulate flhDC expression in such strains. They include single nucleotide changes in the -10 element of the promoter, in the promoter spacer, and in the cAMP-CRP binding region. In addition, we show that insertion sequence (IS) elements or a kanamycin gene located hundreds of base pairs upstream of the promoter can effectively enhance transcription, suggesting that the topology of a large upstream region plays a significant role in the regulation of flhDC expression. None of the mutations eliminated the requirement for cAMP-CRP for activation. However, several mutations allowed expression in the absence of the nucleoid organizing protein, H-NS, which is normally required for flhDC expression. The flhDC operon of Escherichia coli encodes transcription factors that initiate flagellar synthesis, an energetically costly process that is highly regulated. Few deregulating mutations have been reported thus far. This paper describes new single nucleotide mutations that stimulate flhDC expression, including a number that map to the promoter spacer region. In addition, this work shows that insertion sequence elements or a kanamycin gene located far upstream from the promoter or repressor binding sites also stimulate transcription, indicating a role of regional topology in the regulation of flhDC expression. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  4. Isolation of a promoter region in mouse cytochrome P450 3A (Cyp3A16) gene and its transcriptional control.

    PubMed

    Itoh, S; Abe, Y; Kubo, A; Okuda, M; Shimoji, M; Nakayama, K; Kamataki, T

    1997-02-07

    An 11.5 kb fragment of the mouse Cyp3a16 gene containing the 5' flanking region was isolated from the lambda DASHII mouse genomic library. A part of the 5' flanking region and the first exon of Cyp3a16 gene were sequenced. S1 mapping analysis showed the presence of two transcriptional initiation sites. The first exon was completely identical to Cyp3a16 cDNA. The identity of 5' flanking sequences between Cyp3a16 and Cyp3a11 genes was about 69%. A typical TATA box and a basic transcription element (BTE) were found as seen with other CYP3A genes from various animal species Moreover, some putative transcriptional regulatory elements were also found in addition to the sequence motif seen for the formation of Z-type DNA. To examine the transcriptional activity of Cyp3a11 gene, DNA fragments in the 5'-flanking region of the gene were inserted front of the luciferase structural gene, and the constructs were transfected in primary hepatocytes. The analysis of the luciferase activity indicated that the region between -146 and -56 was necessary for the transcription of CYP3a16 gene.

  5. Elevated Rate of Fixation of Endogenous Retroviral Elements in Haplorhini TRIM5 and TRIM22 Genomic Sequences: Impact on Transcriptional Regulation

    PubMed Central

    Diehl, William E.; Johnson, Welkin E.; Hunter, Eric

    2013-01-01

    All genes in the TRIM6/TRIM34/TRIM5/TRIM22 locus are type I interferon inducible, with TRIM5 and TRIM22 possessing antiviral properties. Evolutionary studies involving the TRIM6/34/5/22 locus have predominantly focused on the coding sequence of the genes, finding that TRIM5 and TRIM22 have undergone high rates of both non-synonymous nucleotide replacements and in-frame insertions and deletions. We sought to understand if divergent evolutionary pressures on TRIM6/34/5/22 coding regions have selected for modifications in the non-coding regions of these genes and explore whether such non-coding changes may influence the biological function of these genes. The transcribed genomic regions, including the introns, of TRIM6, TRIM34, TRIM5, and TRIM22 from ten Haplorhini primates and one prosimian species were analyzed for transposable element content. In Haplorhini species, TRIM5 displayed an exaggerated interspecies variability, predominantly resulting from changes in the composition of transposable elements in the large first and fourth introns. Multiple lineage-specific endogenous retroviral long terminal repeats (LTRs) were identified in the first intron of TRIM5 and TRIM22. In the prosimian genome, we identified a duplication of TRIM5 with a concomitant loss of TRIM22. The transposable element content of the prosimian TRIM5 genes appears to largely represent the shared Haplorhini/prosimian ancestral state for this gene. Furthermore, we demonstrated that one such differentially fixed LTR provides for species-specific transcriptional regulation of TRIM22 in response to p53 activation. Our results identify a previously unrecognized source of species-specific variation in the antiviral TRIM genes, which can lead to alterations in their transcriptional regulation. These observations suggest that there has existed long-term pressure for exaptation of retroviral LTRs in the non-coding regions of these genes. This likely resulted from serial viral challenges and provided a mechanism for rapid alteration of transcriptional regulation. To our knowledge, this represents the first report of persistent evolutionary pressure for the capture of retroviral LTR insertions. PMID:23516500

  6. Absence of mutation at the 5'-upstream promoter region of the TPM4 gene from cardiac mutant axolotl (Ambystoma mexicanum).

    PubMed

    Denz, Christopher R; Zhang, Chi; Jia, Pingping; Du, Jianfeng; Huang, Xupei; Dube, Syamalima; Thomas, Anish; Poiesz, Bernard J; Dube, Dipak K

    2011-09-01

    Tropomyosins are a family of actin-binding proteins that show cell-specific diversity by a combination of multiple genes and alternative RNA splicing. Of the 4 different tropomyosin genes, TPM4 plays a pivotal role in myofibrillogenesis as well as cardiac contractility in amphibians. In this study, we amplified and sequenced the upstream regulatory region of the TPM4 gene from both normal and mutant axolotl hearts. To identify the cis-elements that are essential for the expression of the TPM4, we created various deletion mutants of the TPM4 promoter DNA, inserted the deleted segments into PGL3 vector, and performed promoter-reporter assay using luciferase as the reporter gene. Comparison of sequences of the promoter region of the TPM4 gene from normal and mutant axolotl revealed no mutations in the promoter sequence of the mutant TPM4 gene. CArG box elements that are generally involved in controlling the expression of several other muscle-specific gene promoters were not found in the upstream regulatory region of the TPM4 gene. In deletion experiments, loss of activity of the reporter gene was noted upon deletion which was then restored upon further deletion suggesting the presence of both positive and negative cis-elements in the upstream regulatory region of the TPM4 gene. We believe that this is the first axolotl promoter that has ever been cloned and studied with clear evidence that it functions in mammalian cell lines. Although striated muscle-specific cis-acting elements are absent from the promoter region of TPM4 gene, our results suggest the presence of positive and negative cis-elements in the promoter region, which in conjunction with positive and negative trans-elements may be involved in regulating the expression of TPM4 gene in a tissue-specific manner.

  7. Structure and transcriptional impact of divergent repetitive elements inserted within Phanerochaete chrysosporium strain RP-78 genes

    Treesearch

    Luis F. Larrondo; Paulo Canessa; Rafael Vicuna; Philip Stewart; Amber Vanden Wymelenberg; Dan Cullen

    2007-01-01

    We describe the structure, organization, and transcriptional impact of repetitive elements within the lignin-degrading basidiomycete, Phanerochaete chrysosporium. Searches of the P. chrysosporium genome revealed five copies of pce1, a 1,750-nt non-autonomous, class II element. Alleles encoding a putative glucosyltransferase and a cytochrome P450 harbor pce insertions...

  8. Crystal Structure of the Extracellular Cholinesterase-Like Domain from Neuroligin-2

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Koehnke,J.; Jin, X.; Budreck, E.

    Neuroligins (NLs) are catalytically inactive members of a family of cholinesterase-like transmembrane proteins that mediate cell adhesion at neuronal synapses. Postsynaptic neuroligins engage in Ca2+-dependent transsynaptic interactions via their extracellular cholinesterase domain with presynaptic neurexins (NRXs). These interactions may be regulated by two short splice insertions (termed A and B) in the NL cholinesterase domain. Here, we present the 3.3- Angstroms crystal structure of the ectodomain from NL2 containing splice insertion A (NL2A). The overall structure of NL2A resembles that of cholinesterases, but several structural features are unique to the NL proteins. First, structural elements surrounding the esterase active-site regionmore » differ significantly between active esterases and NL2A. On the opposite surface of the NL2A molecule, the positions of the A and B splice insertions identify a candidate NRX interaction site of the NL protein. Finally, sequence comparisons of NL isoforms allow for mapping the location of residues of previously identified mutations in NL3 and NL4 found in patients with autism spectrum disorders. Overall, the NL2 structure promises to provide a valuable model for dissecting NL isoform- and synapse-specific functions.« less

  9. Crystal structure of the extracellular cholinesterase-like domain from neuroligin-2

    PubMed Central

    Koehnke, Jesko; Jin, Xiangshu; Budreck, Elaine C.; Posy, Shoshana; Scheiffele, Peter; Honig, Barry; Shapiro, Lawrence

    2008-01-01

    Neuroligins (NLs) are catalytically inactive members of a family of cholinesterase-like transmembrane proteins that mediate cell adhesion at neuronal synapses. Postsynaptic neuroligins engage in Ca2+-dependent transsynaptic interactions via their extracellular cholinesterase domain with presynaptic neurexins (NRXs). These interactions may be regulated by two short splice insertions (termed A and B) in the NL cholinesterase domain. Here, we present the 3.3-Å crystal structure of the ectodomain from NL2 containing splice insertion A (NL2A). The overall structure of NL2A resembles that of cholinesterases, but several structural features are unique to the NL proteins. First, structural elements surrounding the esterase active-site region differ significantly between active esterases and NL2A. On the opposite surface of the NL2A molecule, the positions of the A and B splice insertions identify a candidate NRX interaction site of the NL protein. Finally, sequence comparisons of NL isoforms allow for mapping the location of residues of previously identified mutations in NL3 and NL4 found in patients with autism spectrum disorders. Overall, the NL2 structure promises to provide a valuable model for dissecting NL isoform- and synapse-specific functions. PMID:18250328

  10. A physical map of a BAC clone contig covering the entire autosome insertion between ovine MHC Class IIa and IIb

    PubMed Central

    2012-01-01

    Background The ovine Major Histocompatibility Complex (MHC) harbors genes involved in overall resistance/susceptibility of the host to infectious diseases. Compared to human and mouse, the ovine MHC is interrupted by a large piece of autosome insertion via a hypothetical chromosome inversion that constitutes ~25% of ovine chromosome 20. The evolutionary consequence of such an inversion and an insertion (inversion/insertion) in relation to MHC function remains unknown. We previously constructed a BAC clone physical map for the ovine MHC exclusive of the insertion region. Here we report the construction of a high-density physical map covering the autosome insertion in order to address the question of what the inversion/insertion had to do with ruminants during the MHC evolution. Results A total of 119 pairs of comparative bovine oligo primers were utilized to screen an ovine BAC library for positive clones and the orders and overlapping relationships of the identified clones were determined by DNA fingerprinting, BAC-end sequencing, and sequence-specific PCR. A total of 368 positive BAC clones were identified and 108 of the effective clones were ordered into an overlapping BAC contig to cover the consensus region between ovine MHC class IIa and IIb. Therefore, a continuous physical map covering the entire ovine autosome inversion/insertion region was successfully constructed. The map confirmed the bovine sequence assembly for the same homologous region. The DNA sequences of 185 BAC-ends have been deposited into NCBI database with the access numbers HR309252 through HR309068, corresponding to dbGSS ID 30164010 through 30163826. Conclusions We have constructed a high-density BAC clone physical map for the ovine autosome inversion/insertion between the MHC class IIa and IIb. The entire ovine MHC region is now fully covered by a continuous BAC clone contig. The physical map we generated will facilitate MHC functional studies in the ovine, as well as the comparative MHC evolution in ruminants. PMID:22897909

  11. The expanding universe of transposon technologies for gene and cell engineering.

    PubMed

    Ivics, Zoltán; Izsvák, Zsuzsanna

    2010-12-07

    Transposable elements can be viewed as natural DNA transfer vehicles that, similar to integrating viruses, are capable of efficient genomic insertion. The mobility of class II transposable elements (DNA transposons) can be controlled by conditionally providing the transposase component of the transposition reaction. Thus, a DNA of interest (be it a fluorescent marker, a small hairpin (sh)RNA expression cassette, a mutagenic gene trap or a therapeutic gene construct) cloned between the inverted repeat sequences of a transposon-based vector can be used for stable genomic insertion in a regulated and highly efficient manner. This methodological paradigm opened up a number of avenues for genome manipulations in vertebrates, including transgenesis for the generation of transgenic cells in tissue culture, the production of germline transgenic animals for basic and applied research, forward genetic screens for functional gene annotation in model species, and therapy of genetic disorders in humans. Sleeping Beauty (SB) was the first transposon shown to be capable of gene transfer in vertebrate cells, and recent results confirm that SB supports a full spectrum of genetic engineering including transgenesis, insertional mutagenesis, and therapeutic somatic gene transfer both ex vivo and in vivo. The first clinical application of the SB system will help to validate both the safety and efficacy of this approach. In this review, we describe the major transposon systems currently available (with special emphasis on SB), discuss the various parameters and considerations pertinent to their experimental use, and highlight the state of the art in transposon technology in diverse genetic applications.

  12. Method for introducing unidirectional nested deletions

    DOEpatents

    Dunn, John J.; Quesada, Mark A.; Randesi, Matthew

    2001-01-01

    Disclosed is a method for the introduction of unidirectional deletions in a cloned DNA segment in the context of a cloning vector which contains an f1 endonuclease recognition sequence adjacent to the insertion site of the DNA segment. Also disclosed is a method for producing single-stranded DNA probes utilizing the same cloning vector. An optimal vector, PZIP is described. Methods for introducing unidirectional deletions into a terminal location of a cloned DNA sequence which is inserted into the vector of the present invention are also disclosed. These methods are useful for introducing deletions into either or both ends of a cloned DNA insert, for high throughput sequencing of any DNA of interest.

  13. Triple helix purification and sequencing

    DOEpatents

    Wang, Renfeng; Smith, Lloyd M.; Tong, Xinchun E.

    1995-01-01

    Disclosed herein are methods, kits, and equipment for purifying single stranded circular DNA and then using the DNA for DNA sequencing purposes. Templates are provided with an insert having a hybridization region. An elongated oligonucleotide has two regions that are complementary to the insert and the oligo is bound to a magnetic anchor. The oligo hybridizes to the insert on two sides to form a stable triple helix complex. The anchor can then be used to drag the template out of solution using a magnet. The system can purify sequencing templates, and if desired the triple helix complex can be opened up to a double helix so that the oligonucleotide will act as a primer for further DNA synthesis.

  14. Triple helix purification and sequencing

    DOEpatents

    Wang, R.; Smith, L.M.; Tong, X.E.

    1995-03-28

    Disclosed herein are methods, kits, and equipment for purifying single stranded circular DNA and then using the DNA for DNA sequencing purposes. Templates are provided with an insert having a hybridization region. An elongated oligonucleotide has two regions that are complementary to the insert and the oligo is bound to a magnetic anchor. The oligo hybridizes to the insert on two sides to form a stable triple helix complex. The anchor can then be used to drag the template out of solution using a magnet. The system can purify sequencing templates, and if desired the triple helix complex can be opened up to a double helix so that the oligonucleotide will act as a primer for further DNA synthesis. 4 figures.

  15. Insertion of an SVA-E retrotransposon into the CASP8 gene is associated with protection against prostate cancer

    PubMed Central

    Stacey, Simon N.; Kehr, Birte; Gudmundsson, Julius; Zink, Florian; Jonasdottir, Aslaug; Gudjonsson, Sigurjon A.; Sigurdsson, Asgeir; Halldorsson, Bjarni V.; Agnarsson, Bjarni A.; Benediktsdottir, Kristrun R.; Aben, Katja K.H.; Vermeulen, Sita H.; Cremers, Ruben G.; Panadero, Angeles; Helfand, Brian T.; Cooper, Phillip R.; Donovan, Jenny L.; Hamdy, Freddie C.; Jinga, Viorel; Okamoto, Ichiro; Jonasson, Jon G.; Tryggvadottir, Laufey; Johannsdottir, Hrefna; Kristinsdottir, Anna M.; Masson, Gisli; Magnusson, Olafur T.; Iordache, Paul D.; Helgason, Agnar; Helgason, Hannes; Sulem, Patrick; Gudbjartsson, Daniel F.; Kong, Augustine; Jonsson, Eirikur; Barkardottir, Rosa B.; Einarsson, Gudmundur V.; Rafnar, Thorunn; Thorsteinsdottir, Unnur; Mates, Ioan N.; Neal, David E.; Catalona, William J.; Mayordomo, José I.; Kiemeney, Lambertus A.; Thorleifsson, Gudmar; Stefansson, Kari

    2016-01-01

    Transcriptional and splicing anomalies have been observed in intron 8 of the CASP8 gene (encoding procaspase-8) in association with cutaneous basal-cell carcinoma (BCC) and linked to a germline SNP rs700635. Here, we show that the rs700635[C] allele, which is associated with increased risk of BCC and breast cancer, is protective against prostate cancer [odds ratio (OR) = 0.91, P = 1.0 × 10−6]. rs700635[C] is also associated with failures to correctly splice out CASP8 intron 8 in breast and prostate tumours and in corresponding normal tissues. Investigation of rs700635[C] carriers revealed that they have a human-specific short interspersed element-variable number of tandem repeat-Alu (SINE-VNTR-Alu), subfamily-E retrotransposon (SVA-E) inserted into CASP8 intron 8. The SVA-E shows evidence of prior activity, because it has transduced some CASP8 sequences during subsequent retrotransposition events. Whole-genome sequence (WGS) data were used to tag the SVA-E with a surrogate SNP rs1035142[T] (r2 = 0.999), which showed associations with both the splicing anomalies (P = 6.5 × 10−32) and with protection against prostate cancer (OR = 0.91, P = 3.8 × 10−7). PMID:26740556

  16. Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana.

    PubMed

    Lin, X; Kaul, S; Rounsley, S; Shea, T P; Benito, M I; Town, C D; Fujii, C Y; Mason, T; Bowman, C L; Barnstead, M; Feldblyum, T V; Buell, C R; Ketchum, K A; Lee, J; Ronning, C M; Koo, H L; Moffat, K S; Cronin, L A; Shen, M; Pai, G; Van Aken, S; Umayam, L; Tallon, L J; Gill, J E; Adams, M D; Carrera, A J; Creasy, T H; Goodman, H M; Somerville, C R; Copenhaver, G P; Preuss, D; Nierman, W C; White, O; Eisen, J A; Salzberg, S L; Fraser, C M; Venter, J C

    1999-12-16

    Arabidopsis thaliana (Arabidopsis) is unique among plant model organisms in having a small genome (130-140 Mb), excellent physical and genetic maps, and little repetitive DNA. Here we report the sequence of chromosome 2 from the Columbia ecotype in two gap-free assemblies (contigs) of 3.6 and 16 megabases (Mb). The latter represents the longest published stretch of uninterrupted DNA sequence assembled from any organism to date. Chromosome 2 represents 15% of the genome and encodes 4,037 genes, 49% of which have no predicted function. Roughly 250 tandem gene duplications were found in addition to large-scale duplications of about 0.5 and 4.5 Mb between chromosomes 2 and 1 and between chromosomes 2 and 4, respectively. Sequencing of nearly 2 Mb within the genetically defined centromere revealed a low density of recognizable genes, and a high density and diverse range of vestigial and presumably inactive mobile elements. More unexpected is what appears to be a recent insertion of a continuous stretch of 75% of the mitochondrial genome into chromosome 2.

  17. Typing and comparative genome analysis of Brucella melitensis isolated from Lebanon.

    PubMed

    Abou Zaki, Natalia; Salloum, Tamara; Osman, Marwan; Rafei, Rayane; Hamze, Monzer; Tokajian, Sima

    2017-10-16

    Brucella melitensis is the main causative agent of the zoonotic disease brucellosis. This study aimed at typing and characterizing genetic variation in 33 Brucella isolates recovered from patients in Lebanon. Bruce-ladder multiplex PCR and PCR-RFLP of omp31, omp2a and omp2b were performed. Sixteen representative isolates were chosen for draft-genome sequencing and analyzed to determine variations in virulence, resistance, genomic islands, prophages and insertion sequences. Comparative whole-genome single nucleotide polymorphism analysis was also performed. The isolates were confirmed to be B. melitensis. Genome analysis revealed multiple virulence determinants and efflux pumps. Genome comparisons and single nucleotide polymorphisms divided the isolates based on geographical distribution but revealed high levels of similarity between the strains. Sequence divergence in B. melitensis was mainly due to lateral gene transfer of mobile elements. This is the first report of an in-depth genomic characterization of B. melitensis in Lebanon. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  18. CRISPR/Cas9 for genome editing: progress, implications and challenges.

    PubMed

    Zhang, Feng; Wen, Yan; Guo, Xiong

    2014-09-15

    Clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) protein 9 system provides a robust and multiplexable genome editing tool, enabling researchers to precisely manipulate specific genomic elements, and facilitating the elucidation of target gene function in biology and diseases. CRISPR/Cas9 comprises of a nonspecific Cas9 nuclease and a set of programmable sequence-specific CRISPR RNA (crRNA), which can guide Cas9 to cleave DNA and generate double-strand breaks at target sites. Subsequent cellular DNA repair process leads to desired insertions, deletions or substitutions at target sites. The specificity of CRISPR/Cas9-mediated DNA cleavage requires target sequences matching crRNA and a protospacer adjacent motif locating at downstream of target sequences. Here, we review the molecular mechanism, applications and challenges of CRISPR/Cas9-mediated genome editing and clinical therapeutic potential of CRISPR/Cas9 in future. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  19. A Data Type for Efficient Representation of Other Data Types

    NASA Technical Reports Server (NTRS)

    James, Mark

    2008-01-01

    A self-organizing, monomorphic data type denoted a sequence has been conceived to address certain concerns that arise in programming parallel computers. A sequence in the present sense can be regarded abstractly as a vector, set, bag, queue, or other construct. Heretofore, in programming a parallel computer, it has been necessary for the programmer to state explicitly, at the outset, what parts of the program and the underlying data structures must be represented in parallel form. Not only is this requirement not optimal from the perspective of implementation; it entails an additional requirement that the programmer have intimate understanding of the underlying parallel structure. The present sequence data type overcomes both the implementation and parallel structure obstacles. In so doing, the sequence data type provides unified means by which the programmer can represent a data structure for natural and automatic decomposition to a parallel computing architecture. Sequences exhibit the behavioral and structural characteristics of vectors, but the underlying representations are automatically synthesized from combinations of programmers advice and execution use metrics. Sequences can vary bidirectionally between sparseness and density, making them excellent choices for many kinds of algorithms. The novelty and benefit of this behavior lies in the fact that it can relieve programmers of the details of implementations. The creation of a sequence enables decoupling of a conceptual representation from an implementation. The underlying representation of a sequence is a hybrid of representations composed of vectors, linked lists, connected blocks, and hash tables. The internal structure of a sequence can automatically change from time to time on the basis of how it is being used. Those portions of a sequence where elements have not been added or removed can be as efficient as vectors. As elements are inserted and removed in a given portion, then different methods are utilized to provide both an access and memory strategy that is optimized for that portion and the use to which it is put.

  20. Precise Maps of RNA Polymerase Reveal How Promoters Direct Initiation and Pausing

    PubMed Central

    Kwak, Hojoong; Fuda, Nicholas J.; Core, Leighton J.; Lis, John T.

    2014-01-01

    Transcription regulation occurs frequently through promoter-associated pausing of RNA polymerase II (Pol II). We developed a Precision nuclear Run-On and sequencing assay (PRO-seq) to map the genome-wide distribution of transcriptionally-engaged Pol II at base-pair resolution. Pol II accumulates immediately downstream of promoters, at intron-exon junctions that are efficiently used for splicing, and over 3' poly-adenylation sites. Focused analyses of promoters reveal that pausing is not fixed relative to initiation sites nor is it specified directly by the position of a particular core promoter element or the first nucleosome. Core promoter elements function beyond initiation, and when optimally positioned they act collectively to dictate the position and strength of pausing. We test this ‘Complex Interaction’ model with insertional mutagenesis of the Drosophila Hsp70 core promoter. PMID:23430654

  1. Construction of a nurse shark (Ginglymostoma cirratum) bacterial artificial chromosome (BAC) library and a preliminary genome survey.

    PubMed

    Luo, Meizhong; Kim, Hyeran; Kudrna, Dave; Sisneros, Nicholas B; Lee, So-Jeong; Mueller, Christopher; Collura, Kristi; Zuccolo, Andrea; Buckingham, E Bryan; Grim, Suzanne M; Yanagiya, Kazuyo; Inoko, Hidetoshi; Shiina, Takashi; Flajnik, Martin F; Wing, Rod A; Ohta, Yuko

    2006-05-03

    Sharks are members of the taxonomic class Chondrichthyes, the oldest living jawed vertebrates. Genomic studies of this group, in comparison to representative species in other vertebrate taxa, will allow us to theorize about the fundamental genetic, developmental, and functional characteristics in the common ancestor of all jawed vertebrates. In order to obtain mapping and sequencing data for comparative genomics, we constructed a bacterial artificial chromosome (BAC) library for the nurse shark, Ginglymostoma cirratum. The BAC library consists of 313,344 clones with an average insert size of 144 kb, covering ~4.5 x 1010 bp and thus providing an 11-fold coverage of the haploid genome. BAC end sequence analyses revealed, in addition to LINEs and SINEs commonly found in other animal and plant genomes, two new groups of nurse shark-specific repetitive elements, NSRE1 and NSRE2 that seem to be major components of the nurse shark genome. Screening the library with single-copy or multi-copy gene probes showed 6-28 primary positive clones per probe of which 50-90% were true positives, demonstrating that the BAC library is representative of the different regions of the nurse shark genome. Furthermore, some BAC clones contained multiple genes, making physical mapping feasible. We have constructed a deep-coverage, high-quality, large insert, and publicly available BAC library for a cartilaginous fish. It will be very useful to the scientific community interested in shark genomic structure, comparative genomics, and functional studies. We found two new groups of repetitive elements specific to the nurse shark genome, which may contribute to the architecture and evolution of the nurse shark genome.

  2. [Isolation and function of genes regulating aphB expression in Vibrio cholerae].

    PubMed

    Chen, Haili; Zhu, Zhaoqin; Zhong, Zengtao; Zhu, Jun; Kan, Biao

    2012-02-04

    We identified genes that regulate the expression of aphB, the gene encoding a key virulence regulator in Vibrio cholerae O1 E1 Tor C6706(-). We constructed a transposon library in V. cholerae C6706 strain containing a P(aphB)-luxCDABE and P(aphB)-lacZ transcriptional reporter plasmids. Using a chemiluminescence imager system, we rapidly detected aphB promoter expression level at a large scale. We then sequenced the transposon insertion sites by arbitrary PCR and sequencing analysis. We obtained two candidate mutants T1 and T2 which displayed reduced aphB expression from approximately 40,000 transposon insertion mutants. Sequencing analysis shows that Tn inserted in vc1585 reading frame in the T1 mutant and Tn inserted in the end of coding sequence of vc1602 in the T2 mutant. By using a genetic screen, we identified two potential genes that may involve in regulation of the expression of the key virulence regulator AphB. This study sheds light on our further investigation to fully understand V. cholerae virulence gene regulatory cascades.

  3. Removal of a putative inhibitory element reduces the calcium-dependent calmodulin activation of neuronal nitric-oxide synthase.

    PubMed

    Montgomery, H J; Romanov, V; Guillemette, J G

    2000-02-18

    Neuronal nitric-oxide synthase (NOS) and endothelial NOS are constitutive NOS isoforms that are activated by binding calmodulin in response to elevated intracellular calcium. In contrast, the inducible NOS isoform binds calmodulin at low basal levels of calcium in resting cells. Primary sequence comparisons show that each constitutive NOS isozyme contains a polypeptide segment within its reductase domain, which is absent in the inducible NOS enzyme. To study a possible link between the presence of these additional polypeptide segments in constitutive NOS enzymes and their calcium-dependent calmodulin activation, three deletion mutants were created. The putative inhibitory insert was removed from the FMN binding regions of the neuronal NOS holoenzyme and from two truncated neuronal NOS reductase enzymes in which the calmodulin binding region was either included or deleted. All three mutant enzymes showed reduced incorporation of FMN and required reconstitution with exogenous FMN for activity. The combined removal of both the calmodulin binding domain and the putative inhibitory insert did not result in a calmodulin-independent neuronal NOS reductase. Thus, although the putative inhibitory element has an effect on the calcium-dependent calmodulin activation of neuronal NOS, it does not have the properties of the typical autoinhibitory domain found in calmodulin-activated enzymes.

  4. CRISPR/Cas9 cleavages in budding yeast reveal templated insertions and strand-specific insertion/deletion profiles.

    PubMed

    Lemos, Brenda R; Kaplan, Adam C; Bae, Ji Eun; Ferrazzoli, Alexander E; Kuo, James; Anand, Ranjith P; Waterman, David P; Haber, James E

    2018-02-27

    Harnessing CRISPR-Cas9 technology provides an unprecedented ability to modify genomic loci via DNA double-strand break (DSB) induction and repair. We analyzed nonhomologous end-joining (NHEJ) repair induced by Cas9 in budding yeast and found that the orientation of binding of Cas9 and its guide RNA (gRNA) profoundly influences the pattern of insertion/deletions (indels) at the site of cleavage. A common indel created by Cas9 is a 1-bp (+1) insertion that appears to result from Cas9 creating a 1-nt 5' overhang that is filled in by a DNA polymerase and ligated. The origin of +1 insertions was investigated by using two gRNAs with PAM sequences located on opposite DNA strands but designed to cleave the same sequence. These templated +1 insertions are dependent on the X-family DNA polymerase, Pol4. Deleting Pol4 also eliminated +2 and +3 insertions, which are biased toward homonucleotide insertions. Using inverted PAM sequences, we also found significant differences in overall NHEJ efficiency and repair profiles, suggesting that the binding of the Cas9:gRNA complex influences subsequent NHEJ processing. As with events induced by the site-specific HO endonuclease, CRISPR-Cas9-mediated NHEJ repair depends on the Ku heterodimer and DNA ligase 4. Cas9 events are highly dependent on the Mre11-Rad50-Xrs2 complex, independent of Mre11's nuclease activity. Inspection of the outcomes of a large number of Cas9 cleavage events in mammalian cells reveals a similar templated origin of +1 insertions in human cells, but also a significant frequency of similarly templated +2 insertions.

  5. Ebbie: automated analysis and storage of small RNA cloning data using a dynamic web server

    PubMed Central

    Ebhardt, H Alexander; Wiese, Kay C; Unrau, Peter J

    2006-01-01

    Background DNA sequencing is used ubiquitously: from deciphering genomes[1] to determining the primary sequence of small RNAs (smRNAs) [2-5]. The cloning of smRNAs is currently the most conventional method to determine the actual sequence of these important regulators of gene expression. Typical smRNA cloning projects involve the sequencing of hundreds to thousands of smRNA clones that are delimited at their 5' and 3' ends by fixed sequence regions. These primers result from the biochemical protocol used to isolate and convert the smRNA into clonable PCR products. Recently we completed a smRNA cloning project involving tobacco plants, where analysis was required for ~700 smRNA sequences[6]. Finding no easily accessible research tool to enter and analyze smRNA sequences we developed Ebbie to assist us with our study. Results Ebbie is a semi-automated smRNA cloning data processing algorithm, which initially searches for any substring within a DNA sequencing text file, which is flanked by two constant strings. The substring, also termed smRNA or insert, is stored in a MySQL and BlastN database. These inserts are then compared using BlastN to locally installed databases allowing the rapid comparison of the insert to both the growing smRNA database and to other static sequence databases. Our laboratory used Ebbie to analyze scores of DNA sequencing data originating from an smRNA cloning project[6]. Through its built-in instant analysis of all inserts using BlastN, we were able to quickly identify 33 groups of smRNAs from ~700 database entries. This clustering allowed the easy identification of novel and highly expressed clusters of smRNAs. Ebbie is available under GNU GPL and currently implemented on Conclusion Ebbie was designed for medium sized smRNA cloning projects with about 1,000 database entries [6-8].Ebbie can be used for any type of sequence analysis where two constant primer regions flank a sequence of interest. The reliable storage of inserts, and their annotation in a MySQL database, BlastN[9] comparison of new inserts to dynamic and static databases make it a powerful new tool in any laboratory using DNA sequencing. Ebbie also prevents manual mistakes during the excision process and speeds up annotation and data-entry. Once the server is installed locally, its access can be restricted to protect sensitive new DNA sequencing data. Ebbie was primarily designed for smRNA cloning projects, but can be applied to a variety of RNA and DNA cloning projects[2,3,10,11]. PMID:16584563

  6. Efforts to deregulate Rainbow papaya in Japan: Molecular Characterization of Transgene and Vector Inserts

    USDA-ARS?s Scientific Manuscript database

    Transformation plasmid-derived insert number and insert site sequence in 55-1 line papaya derivatives Rainbow and SunUp was determined as part of a larger petition to allow its import into Japan (Suzuki, et al., 2007, 2008). Three insertions were detected by Southern analysis and their correspondin...

  7. BrassicaTED - a public database for utilization of miniature transposable elements in Brassica species.

    PubMed

    Murukarthick, Jayakodi; Sampath, Perumal; Lee, Sang Choon; Choi, Beom-Soon; Senthil, Natesan; Liu, Shengyi; Yang, Tae-Jin

    2014-06-20

    MITE, TRIM and SINEs are miniature form transposable elements (mTEs) that are ubiquitous and dispersed throughout entire plant genomes. Tens of thousands of members cause insertion polymorphism at both the inter- and intra- species level. Therefore, mTEs are valuable targets and resources for development of markers that can be utilized for breeding, genetic diversity and genome evolution studies. Taking advantage of the completely sequenced genomes of Brassica rapa and B. oleracea, characterization of mTEs and building a curated database are prerequisite to extending their utilization for genomics and applied fields in Brassica crops. We have developed BrassicaTED as a unique web portal containing detailed characterization information for mTEs of Brassica species. At present, BrassicaTED has datasets for 41 mTE families, including 5894 and 6026 members from 20 MITE families, 1393 and 1639 members from 5 TRIM families, 1270 and 2364 members from 16 SINE families in B. rapa and B. oleracea, respectively. BrassicaTED offers different sections to browse structural and positional characteristics for every mTE family. In addition, we have added data on 289 MITE insertion polymorphisms from a survey of seven Brassica relatives. Genes with internal mTE insertions are shown with detailed gene annotation and microarray-based comparative gene expression data in comparison with their paralogs in the triplicated B. rapa genome. This database also includes a novel tool, K BLAST (Karyotype BLAST), for clear visualization of the locations for each member in the B. rapa and B. oleracea pseudo-genome sequences. BrassicaTED is a newly developed database of information regarding the characteristics and potential utility of mTEs including MITE, TRIM and SINEs in B. rapa and B. oleracea. The database will promote the development of desirable mTE-based markers, which can be utilized for genomics and breeding in Brassica species. BrassicaTED will be a valuable repository for scientists and breeders, promoting efficient research on Brassica species. BrassicaTED can be accessed at http://im-crop.snu.ac.kr/BrassicaTED/index.php.

  8. BAC end sequencing of Pacific white shrimp Litopenaeus vannamei: a glimpse into the genome of Penaeid shrimp

    NASA Astrophysics Data System (ADS)

    Zhao, Cui; Zhang, Xiaojun; Liu, Chengzhang; Huan, Pin; Li, Fuhua; Xiang, Jianhai; Huang, Chao

    2012-05-01

    Little is known about the genome of Pacific white shrimp ( Litopenaeus vannamei). To address this, we conducted BAC (bacterial artificial chromosome) end sequencing of L. vannamei. We selected and sequenced 7 812 BAC clones from the BAC library LvHE from the two ends of the inserts by Sanger sequencing. After trimming and quality filtering, 11 279 BAC end sequences (BESs) including 4 609 pairedends BESs were obtained. The total length of the BESs was 4 340 753 bp, representing 0.18% of the L. vannamei haploid genome. The lengths of the BESs ranged from 100 bp to 660 bp with an average length of 385 bp. Analysis of the BESs indicated that the L. vannamei genome is AT-rich and that the primary repeats patterns were simple sequence repeats (SSRs) and low complexity sequences. Dinucleotide and hexanucleotide repeats were the most common SSR types in the BESs. The most abundant transposable element was gypsy, which may contribute to the generation of the large genome size of L. vannamei. We successfully annotated 4 519 BESs by BLAST searching, including genes involved in immunity and sex determination. Our results provide an important resource for functional gene studies, map construction and integration, and complete genome assembly for this species.

  9. Distribution of Classical and Nonclassical Virulence Genes in Enterotoxigenic Escherichia coli Isolates from Chilean Children and tRNA Gene Screening for Putative Insertion Sites for Genomic Islands▿†

    PubMed Central

    Del Canto, Felipe; Valenzuela, Patricio; Cantero, Lidia; Bronstein, Jonathan; Blanco, Jesús E.; Blanco, Jorge; Prado, Valeria; Levine, Myron; Nataro, James; Sommerfelt, Halvor; Vidal, Roberto

    2011-01-01

    Enterotoxigenic Escherichia coli (ETEC) is an important cause of diarrhea. Three adhesins (Tia, TibA, EtpA), an iron acquisition system (Irp1, Irp2, and FyuA), a GTPase (LeoA), and an autotransporter (EatA) are ETEC virulence-related proteins that, in contrast to the classical virulence factors (enterotoxins and fimbrial colonization factors) have not heretofore been targets in characterizing isolates from epidemiological studies. Here, we determined the occurrence of these nonclassical virulence genes in 103 ETEC isolates from Chilean children with diarrhea and described their association with O serogroups and classical virulence determinants. Because tia, leoA, irp2, and fyuA are harbored by pathogenicity islands inserted into the selC and asnT tRNA genes (tDNAs), we analyzed the regions flanking these loci. Ten additional tDNAs were also screened to identify hot spots for genetic insertions. Associations between the most frequent serogroups and classical colonization factor (CF)-toxin profiles included O6/LT-STh/CS1-CS3-CS21 (i.e., O6 serogroup, heat-labile [LT] and human heat-stable [STh] enterotoxins, and CFs CS1, -3 and -21), O6/LT-STh/CS2-CS3-CS21, and O104-O127/STh/CFAI-CS21. The eatA and etpA genes were detected in more than 70% of the collection, including diverse serogroups and virulence profiles. Sixteen percent of the ETEC strains were negative for classical and nonclassical adhesins, suggesting the presence of unknown determinants of adhesion. The leuX, thrW, and asnT tDNAs were disrupted in more than 65% of strains, suggesting they are hot spots for the insertion of mobile elements. Sequences similar to integrase genes were identified next to the thrW, asnT, pheV, and selC tDNAs. We propose that the eatA and etpA genes should be included in characterizations of ETEC isolates in future epidemiological studies to determine their prevalence in other geographical regions. Sequencing of tDNA-associated genetic insertions might identify new ETEC virulence determinants. PMID:21775541

  10. A small and efficient dimerization/packaging signal of rat VL30 RNA and its use in murine leukemia virus-VL30-derived vectors for gene transfer.

    PubMed Central

    Torrent, C; Gabus, C; Darlix, J L

    1994-01-01

    Retroviral genomes consist of two identical RNA molecules associated at their 5' ends by the dimer linkage structure located in the packaging element (Psi or E) necessary for RNA dimerization in vitro and packaging in vivo. In murine leukemia virus (MLV)-derived vectors designed for gene transfer, the Psi + sequence of 600 nucleotides directs the packaging of recombinant RNAs into MLV virions produced by helper cells. By using in vitro RNA dimerization as a screening system, a sequence of rat VL30 RNA located next to the 5' end of the Harvey mouse sarcoma virus genome and as small as 67 nucleotides was found to form stable dimeric RNA. In addition, a purine-rich sequence located at the 5' end of this VL30 RNA seems to be critical for RNA dimerization. When this VL30 element was extended by 107 nucleotides at its 3' end and inserted into an MLV-derived vector lacking MLV Psi +, it directed the efficient encapsidation of recombinant RNAs into MLV virions. Because this VL30 packaging signal is smaller and more efficient in packaging recombinant RNAs than the MLV Psi + and does not contain gag or glyco-gag coding sequences, its use in MLV-derived vectors should render even more unlikely recombinations which could generate replication-competent viruses. Therefore, utilization of the rat VL30 packaging sequence should improve the biological safety of MLV vectors for human gene transfer. Images PMID:8289369

  11. The impact of transposable elements in environmental adaptation.

    PubMed

    Casacuberta, Elena; González, Josefa

    2013-03-01

    Transposable elements (TEs) play an important role in the responsive capacity of their hosts in the face of environmental challenges. The variety of mechanisms by which TEs influence the capacity of adaptation of the host is as large as the variety of TEs and host genomes. For example, TEs might directly affect the function of individual genes, provide a mechanism for rapidly acquiring new genetic material and disseminate regulatory elements that can lead to the creation of stress-inducible regulatory networks. In this review, we summarize recent examples that are part of an increasing body of evidence suggesting a significant role of TEs in the host response to an ever-changing environment, both in prokaryote and in eukaryote organisms. We argue that in the near future, the increasing availability of genome sequences and the development of new tools to discover and analyse TE insertions will further show the relevant role of TEs in environmental adaptation. © 2013 Blackwell Publishing Ltd.

  12. Incidence of genome structure, DNA asymmetry, and cell physiology on T-DNA integration in chromosomes of the phytopathogenic fungus Leptosphaeria maculans.

    PubMed

    Bourras, Salim; Meyer, Michel; Grandaubert, Jonathan; Lapalu, Nicolas; Fudal, Isabelle; Linglin, Juliette; Ollivier, Benedicte; Blaise, Françoise; Balesdent, Marie-Hélène; Rouxel, Thierry

    2012-08-01

    The ever-increasing generation of sequence data is accompanied by unsatisfactory functional annotation, and complex genomes, such as those of plants and filamentous fungi, show a large number of genes with no predicted or known function. For functional annotation of unknown or hypothetical genes, the production of collections of mutants using Agrobacterium tumefaciens-mediated transformation (ATMT) associated with genotyping and phenotyping has gained wide acceptance. ATMT is also widely used to identify pathogenicity determinants in pathogenic fungi. A systematic analysis of T-DNA borders was performed in an ATMT-mutagenized collection of the phytopathogenic fungus Leptosphaeria maculans to evaluate the features of T-DNA integration in its particular transposable element-rich compartmentalized genome. A total of 318 T-DNA tags were recovered and analyzed for biases in chromosome and genic compartments, existence of CG/AT skews at the insertion site, and occurrence of microhomologies between the T-DNA left border (LB) and the target sequence. Functional annotation of targeted genes was done using the Gene Ontology annotation. The T-DNA integration mainly targeted gene-rich, transcriptionally active regions, and it favored biological processes consistent with the physiological status of a germinating spore. T-DNA integration was strongly biased toward regulatory regions, and mainly promoters. Consistent with the T-DNA intranuclear-targeting model, the density of T-DNA insertion correlated with CG skew near the transcription initiation site. The existence of microhomologies between promoter sequences and the T-DNA LB flanking sequence was also consistent with T-DNA integration to host DNA mediated by homologous recombination based on the microhomology-mediated end-joining pathway.

  13. ISC, a Novel Group of Bacterial and Archaeal DNA Transposons That Encode Cas9 Homologs

    PubMed Central

    Kapitonov, Vladimir V.; Makarova, Kira S.

    2015-01-01

    ABSTRACT Bacterial genomes encode numerous homologs of Cas9, the effector protein of the type II CRISPR-Cas systems. The homology region includes the arginine-rich helix and the HNH nuclease domain that is inserted into the RuvC-like nuclease domain. These genes, however, are not linked to cas genes or CRISPR. Here, we show that Cas9 homologs represent a distinct group of nonautonomous transposons, which we denote ISC (insertion sequences Cas9-like). We identify many diverse families of full-length ISC transposons and demonstrate that their terminal sequences (particularly 3′ termini) are similar to those of IS605 superfamily transposons that are mobilized by the Y1 tyrosine transposase encoded by the TnpA gene and often also encode the TnpB protein containing the RuvC-like endonuclease domain. The terminal regions of the ISC and IS605 transposons contain palindromic structures that are likely recognized by the Y1 transposase. The transposons from these two groups are inserted either exactly in the middle or upstream of specific 4-bp target sites, without target site duplication. We also identify autonomous ISC transposons that encode TnpA-like Y1 transposases. Thus, the nonautonomous ISC transposons could be mobilized in trans either by Y1 transposases of other, autonomous ISC transposons or by Y1 transposases of the more abundant IS605 transposons. These findings imply an evolutionary scenario in which the ISC transposons evolved from IS605 family transposons, possibly via insertion of a mobile group II intron encoding the HNH domain, and Cas9 subsequently evolved via immobilization of an ISC transposon. IMPORTANCE Cas9 endonucleases, the effectors of type II CRISPR-Cas systems, represent the new generation of genome-engineering tools. Here, we describe in detail a novel family of transposable elements that encode the likely ancestors of Cas9 and outline the evolutionary scenario connecting different varieties of these transposons and Cas9. PMID:26712934

  14. Indel PDB: a database of structural insertions and deletions derived from sequence alignments of closely related proteins.

    PubMed

    Hsing, Michael; Cherkasov, Artem

    2008-06-25

    Insertions and deletions (indels) represent a common type of sequence variations, which are less studied and pose many important biological questions. Recent research has shown that the presence of sizable indels in protein sequences may be indicative of protein essentiality and their role in protein interaction networks. Examples of utilization of indels for structure-based drug design have also been recently demonstrated. Nonetheless many structural and functional characteristics of indels remain less researched or unknown. We have created a web-based resource, Indel PDB, representing a structural database of insertions/deletions identified from the sequence alignments of highly similar proteins found in the Protein Data Bank (PDB). Indel PDB utilized large amounts of available structural information to characterize 1-, 2- and 3-dimensional features of indel sites. Indel PDB contains 117,266 non-redundant indel sites extracted from 11,294 indel-containing proteins. Unlike loop databases, Indel PDB features more indel sequences with secondary structures including alpha-helices and beta-sheets in addition to loops. The insertion fragments have been characterized by their sequences, lengths, locations, secondary structure composition, solvent accessibility, protein domain association and three dimensional structures. By utilizing the data available in Indel PDB, we have studied and presented here several sequence and structural features of indels. We anticipate that Indel PDB will not only enable future functional studies of indels, but will also assist protein modeling efforts and identification of indel-directed drug binding sites.

  15. Rapid Mitochondrial Genome Evolution through Invasion of Mobile Elements in Two Closely Related Species of Arbuscular Mycorrhizal Fungi

    PubMed Central

    Beaudet, Denis; Nadimi, Maryam; Iffis, Bachir; Hijri, Mohamed

    2013-01-01

    Arbuscular mycorrhizal fungi (AMF) are common and important plant symbionts. They have coenocytic hyphae and form multinucleated spores. The nuclear genome of AMF is polymorphic and its organization is not well understood, which makes the development of reliable molecular markers challenging. In stark contrast, their mitochondrial genome (mtDNA) is homogeneous. To assess the intra- and inter-specific mitochondrial variability in closely related Glomus species, we performed 454 sequencing on total genomic DNA of Glomus sp. isolate DAOM-229456 and we compared its mtDNA with two G. irregulare isolates. We found that the mtDNA of Glomus sp. is homogeneous, identical in gene order and, with respect to the sequences of coding regions, almost identical to G. irregulare. However, certain genomic regions vary substantially, due to insertions/deletions of elements such as introns, mitochondrial plasmid-like DNA polymerase genes and mobile open reading frames. We found no evidence of mitochondrial or cytoplasmic plasmids in Glomus species, and mobile ORFs in Glomus are responsible for the formation of four gene hybrids in atp6, atp9, cox2, and nad3, which are most probably the result of horizontal gene transfer and are expressed at the mRNA level. We found evidence for substantial sequence variation in defined regions of mtDNA, even among closely related isolates with otherwise identical coding gene sequences. This variation makes it possible to design reliable intra- and inter-specific markers. PMID:23637766

  16. Rapid mitochondrial genome evolution through invasion of mobile elements in two closely related species of arbuscular mycorrhizal fungi.

    PubMed

    Beaudet, Denis; Nadimi, Maryam; Iffis, Bachir; Hijri, Mohamed

    2013-01-01

    Arbuscular mycorrhizal fungi (AMF) are common and important plant symbionts. They have coenocytic hyphae and form multinucleated spores. The nuclear genome of AMF is polymorphic and its organization is not well understood, which makes the development of reliable molecular markers challenging. In stark contrast, their mitochondrial genome (mtDNA) is homogeneous. To assess the intra- and inter-specific mitochondrial variability in closely related Glomus species, we performed 454 sequencing on total genomic DNA of Glomus sp. isolate DAOM-229456 and we compared its mtDNA with two G. irregulare isolates. We found that the mtDNA of Glomus sp. is homogeneous, identical in gene order and, with respect to the sequences of coding regions, almost identical to G. irregulare. However, certain genomic regions vary substantially, due to insertions/deletions of elements such as introns, mitochondrial plasmid-like DNA polymerase genes and mobile open reading frames. We found no evidence of mitochondrial or cytoplasmic plasmids in Glomus species, and mobile ORFs in Glomus are responsible for the formation of four gene hybrids in atp6, atp9, cox2, and nad3, which are most probably the result of horizontal gene transfer and are expressed at the mRNA level. We found evidence for substantial sequence variation in defined regions of mtDNA, even among closely related isolates with otherwise identical coding gene sequences. This variation makes it possible to design reliable intra- and inter-specific markers.

  17. Complete Genomic Structure of the Bloom-forming Toxic Cyanobacterium Microcystis aeruginosa NIES-843

    PubMed Central

    Kaneko, Takakazu; Nakajima, Nobuyoshi; Okamoto, Shinobu; Suzuki, Iwane; Tanabe, Yuuhiko; Tamaoki, Masanori; Nakamura, Yasukazu; Kasai, Fumie; Watanabe, Akiko; Kawashima, Kumiko; Kishida, Yoshie; Ono, Akiko; Shimizu, Yoshimi; Takahashi, Chika; Minami, Chiharu; Fujishiro, Tsunakazu; Kohara, Mitsuyo; Katoh, Midori; Nakazaki, Naomi; Nakayama, Shinobu; Yamada, Manabu; Tabata, Satoshi; Watanabe, Makoto M.

    2007-01-01

    Abstract The nucleotide sequence of the complete genome of a cyanobacterium, Microcystis aeruginosa NIES-843, was determined. The genome of M. aeruginosa is a single, circular chromosome of 5 842 795 base pairs (bp) in length, with an average GC content of 42.3%. The chromosome comprises 6312 putative protein-encoding genes, two sets of rRNA genes, 42 tRNA genes representing 41 tRNA species, and genes for tmRNA, the B subunit of RNase P, SRP RNA, and 6Sa RNA. Forty-five percent of the putative protein-encoding sequences showed sequence similarity to genes of known function, 32% were similar to hypothetical genes, and the remaining 23% had no apparent similarity to reported genes. A total of 688 kb of the genome, equivalent to 11.8% of the entire genome, were composed of both insertion sequences and miniature inverted-repeat transposable elements. This is indicative of a plasticity of the M. aeruginosa genome, through a mechanism that involves homologous recombination mediated by repetitive DNA elements. In addition to known gene clusters related to the synthesis of microcystin and cyanopeptolin, novel gene clusters that may be involved in the synthesis and modification of toxic small polypeptides were identified. Compared with other cyanobacteria, a relatively small number of genes for two component systems and a large number of genes for restriction-modification systems were notable characteristics of the M. aeruginosa genome. PMID:18192279

  18. SINE transcription by RNA polymerase III is suppressed by histone methylation but not by DNA methylation

    PubMed Central

    Varshney, Dhaval; Vavrova-Anderson, Jana; Oler, Andrew J.; Cowling, Victoria H.; Cairns, Bradley R.; White, Robert J.

    2015-01-01

    Short interspersed nuclear elements (SINEs), such as Alu, spread by retrotransposition, which requires their transcripts to be copied into DNA and then inserted into new chromosomal sites. This can lead to genetic damage through insertional mutagenesis and chromosomal rearrangements between non-allelic SINEs at distinct loci. SINE DNA is heavily methylated and this was thought to suppress its accessibility and transcription, thereby protecting against retrotransposition. Here we provide several lines of evidence that methylated SINE DNA is occupied by RNA polymerase III, including the use of high-throughput bisulphite sequencing of ChIP DNA. We find that loss of DNA methylation has little effect on accessibility of SINEs to transcription machinery or their expression in vivo. In contrast, a histone methyltransferase inhibitor selectively promotes SINE expression and occupancy by RNA polymerase III. The data suggest that methylation of histones rather than DNA plays a dominant role in suppressing SINE transcription. PMID:25798578

  19. Retrotransposon Tf1 is targeted to pol II promoters by transcription activators

    PubMed Central

    Leem, Young-Eun; Ripmaster, Tracy; Kelly, Felice; Ebina, Hirotaka; Heincelman, Marc; Zhang, Ke; Grewal, Shiv I. S.; Hoffman, Charles S.; Levin, Henry L.

    2008-01-01

    SUMMARY The LTR-retrotransposon Tf1 preserves the coding capacity of its host Schizosaccharomyces pombe by integrating upstream of open reading frames (ORFs). To determine which features of the target sites were recognized by the transposon, we introduced plasmids containing candidate insertion sites into S. pombe and mapped the positions of integration. We found that Tf1 was targeted specifically to the promoters of pol II transcribed genes. A detailed analysis of integration in plasmids that contained either ade6 or fbp1 revealed insertions occurred in the promoters at positions where transcription factors bound. Further experiments revealed that the activator Atf1p and its binding site were required for directing integration to the promoter of fbp1. An interaction between Tf1 integrase and Atf1p was observed indicating that integration at fbp1 was mediated by the activator bound to its promoter. Surprisingly we found Tf1 contained sequences that activated transcription and these substituted for elements of the ade6 promoter disrupted by integration. PMID:18406330

  20. Retrotransposon Tf1 is targeted to Pol II promoters by transcription activators.

    PubMed

    Leem, Young-Eun; Ripmaster, Tracy L; Kelly, Felice D; Ebina, Hirotaka; Heincelman, Marc E; Zhang, Ke; Grewal, Shiv I S; Hoffman, Charles S; Levin, Henry L

    2008-04-11

    The LTR-retrotransposon Tf1 preserves the coding capacity of its host Schizosaccharomyces pombe by integrating upstream of open reading frames (ORFs). To determine which features of the target sites were recognized by the transposon, we introduced plasmids containing candidate insertion sites into S. pombe and mapped the positions of integration. We found that Tf1 was targeted specifically to the promoters of Pol II-transcribed genes. A detailed analysis of integration in plasmids that contained either ade6 or fbp1 revealed insertions occurred in the promoters at positions where transcription factors bound. Further experiments revealed that the activator Atf1p and its binding site were required for directing integration to the promoter of fbp1. An interaction between Tf1 integrase and Atf1p was observed, indicating that integration at fbp1 was mediated by the activator bound to its promoter. Surprisingly, we found Tf1 contained sequences that activated transcription, and these substituted for elements of the ade6 promoter disrupted by integration.

  1. Homing at an extragenic locus mediated by VDE (PI-SceI) in Saccharomyces cerevisiae.

    PubMed

    Nogami, Satoru; Fukuda, Tomoyuki; Nagai, Yuri; Yabe, Shizu; Sugiura, Masako; Mizutani, Ryuta; Satow, Yoshinori; Anraku, Yasuhiro; Ohya, Yoshikazu

    2002-06-30

    PI-SceI (VDE), a homing endonuclease with protein splicing activity, is a genomic parasite in the VMA1 gene of Saccharomyces cerevisiae. In a heterozygous diploid of the VDE-less VMA1 allele and a VDE-containing VMA1 allele, VDE specifically cleaves its recognition sequence (VRS) in the VDE-less VMA1 allele at meiosis, followed by 'homing', i.e. a conversion to a VDE-containing allele. We found that upon VDE expression, homing of a marker gene at an extragenic locus occurs only when a 45 bp element containing the VRS is inserted at its allelic site, while mutants of VDE with no endonuclease activity lack authentic extragenic homing activity. Thus, both the VRS and VDE are required for homing. Insertion of the VRS in a homozygous diploid significantly lowered the spore germination ability, indicating that a template for gene repair at its allelic locus is essential for efficient homing and survival of yeast cells. Copyright 2002 John Wiley & Sons, Ltd.

  2. Detection of active transposable elements in Arabidopsis thaliana using Oxford Nanopore Sequencing technology.

    PubMed

    Debladis, Emilie; Llauro, Christel; Carpentier, Marie-Christine; Mirouze, Marie; Panaud, Olivier

    2017-07-17

    Transposables elements (TEs) contribute to both structural and functional dynamics of most eukaryotic genomes. Because of their propensity to densely populate plant and animal genomes, the precise estimation of the impact of transposition on genomic diversity has been considered as one of the main challenges of today's genomics. The recent development of NGS (next generation sequencing) technologies has open new perspectives in population genomics by providing new methods for high throughput detection of Transposable Elements-associated Structural Variants (TEASV). However, these have relied on Illumina platform that generates short reads (up to 350 nucleotides). This limitation in size of sequence reads can cause high false discovery rate (FDR) and therefore limit the power of detection of TEASVs, especially in the case of large, complex genomes. The newest sequencing technologies, such as Oxford Nanopore Technologies (ONT) can generate kilobases-long reads thus representing a promising tool for TEASV detection in plant and animals. We present the results of a pilot experiment for TEASV detection on the model plant species Arabidopsis thaliana using ONT sequencing and show that it can be used efficiently to detect TE movements. We generated a ~0.8X genome coverage of a met1-derived epigenetic recombinant inbred line (epiRIL) using a MinIon device with R7 chemistry. We were able to detect nine new copies of the LTR-retrotransposon Evadé (EVD). We also evidenced the activity of the DNA transposon CACTA, CAC1. Even at a low sequence coverage (0.8X), ONT sequencing allowed us to reliably detect several TE insertions in Arabidopsis thaliana genome. The long read length allowed a precise and un-ambiguous mapping of the structural variations caused by the activity of TEs. This suggests that the trade-off between read length and genome coverage for TEASV detection may be in favor of the former. Should the technology be further improved both in terms of lower error rate and operation costs, it could be efficiently used in diversity studies at population level.

  3. Insertion sequence 1515 in the ply gene of a type 1 clinical isolate of Streptococcus pneumoniae abolishes pneumolysin expression.

    PubMed

    Garnier, Fabien; Janapatla, Rajendra Prasad; Charpentier, Emmanuelle; Masson, Geoffrey; Grélaud, Carole; Stach, Jean François; Denis, François; Ploy, Marie-Cécile

    2007-07-01

    A serotype 1 Streptococcus pneumoniae strain isolated by blood culture from a woman with pneumonia was found to harbor insertion sequence (IS) 1515 in the pneumolysin gene, abolishing pneumolysin expression. To our knowledge, this is the first report of an IS in the pneumolysin gene of S. pneumoniae.

  4. Methods and compositions for controlling gene expression by RNA processing

    DOEpatents

    Doudna, Jennifer A.; Qi, Lei S.; Haurwitz, Rachel E.; Arkin, Adam P.

    2017-08-29

    The present disclosure provides nucleic acids encoding an RNA recognition sequence positioned proximal to an insertion site for the insertion of a sequence of interest; and host cells genetically modified with the nucleic acids. The present disclosure also provides methods of modifying the activity of a target RNA, and kits and compositions for carrying out the methods.

  5. Linear and exponential TAIL-PCR: a method for efficient and quick amplification of flanking sequences adjacent to Tn5 transposon insertion sites.

    PubMed

    Jia, Xianbo; Lin, Xinjian; Chen, Jichen

    2017-11-02

    Current genome walking methods are very time consuming, and many produce non-specific amplification products. To amplify the flanking sequences that are adjacent to Tn5 transposon insertion sites in Serratia marcescens FZSF02, we developed a genome walking method based on TAIL-PCR. This PCR method added a 20-cycle linear amplification step before the exponential amplification step to increase the concentration of the target sequences. Products of the linear amplification and the exponential amplification were diluted 100-fold to decrease the concentration of the templates that cause non-specific amplification. Fast DNA polymerase with a high extension speed was used in this method, and an amplification program was used to rapidly amplify long specific sequences. With this linear and exponential TAIL-PCR (LETAIL-PCR), we successfully obtained products larger than 2 kb from Tn5 transposon insertion mutant strains within 3 h. This method can be widely used in genome walking studies to amplify unknown sequences that are adjacent to known sequences.

  6. The wheat cytochrome oxidase subunit II gene has an intron insert and three radical amino acid changes relative to maize

    PubMed Central

    Bonen, Linda; Boer, Poppo H.; Gray, Michael W.

    1984-01-01

    We have determined the sequence of the wheat mitochondrial gene for cytochrome oxidase subunit II (COII) and find that its derived protein sequence differs from that of maize at only three amino acid positions. Unexpectedly, all three replacements are non-conservative ones. The wheat COII gene has a highly-conserved intron at the same position as in maize, but the wheat intron is 1.5 times longer because of an insert relative to its maize counterpart. Hybridization analysis of mitochondrial DNA from rye, pea, broad bean and cucumber indicates strong sequence conservation of COII coding sequences among all these higher plants. However, only rye and maize mitochondrial DNA show homology with wheat COII intron sequences and rye alone with intron-insert sequences. We find that a sequence identical to the region of the 5' exon corresponding to the transmembrane domain of the COII protein is present at a second genomic location in wheat mitochondria. These variations in COII gene structure and size, as well as the presence of repeated COII sequences, illustrate at the DNA sequence level, factors which contribute to higher plant mitochondrial DNA diversity and complexity. ImagesFig. 3.Fig. 4.Fig. 5. PMID:16453565

  7. Alu expression in human cell lines and their retrotranspositional potential.

    PubMed

    Oler, Andrew J; Traina-Dorge, Stephen; Derbes, Rebecca S; Canella, Donatella; Cairns, Brad R; Roy-Engel, Astrid M

    2012-06-20

    The vast majority of the 1.1 million Alu elements are retrotranspositionally inactive, where only a few loci referred to as 'source elements' can generate new Alu insertions. The first step in identifying the active Alu sources is to determine the loci transcribed by RNA polymerase III (pol III). Previous genome-wide analyses from normal and transformed cell lines identified multiple Alu loci occupied by pol III factors, making them candidate source elements. Analysis of the data from these genome-wide studies determined that the majority of pol III-bound Alus belonged to the older subfamilies Alu S and Alu J, which varied between cell lines from 62.5% to 98.7% of the identified loci. The pol III-bound Alus were further scored for estimated retrotransposition potential (ERP) based on the absence or presence of selected sequence features associated with Alu retrotransposition capability. Our analyses indicate that most of the pol III-bound Alu loci candidates identified lack the sequence characteristics important for retrotransposition. These data suggest that Alu expression likely varies by cell type, growth conditions and transformation state. This variation could extend to where the same cell lines in different laboratories present different Alu expression patterns. The vast majority of Alu loci potentially transcribed by RNA pol III lack important sequence features for retrotransposition and the majority of potentially active Alu loci in the genome (scored high ERP) belong to young Alu subfamilies. Our observations suggest that in an in vivo scenario, the contribution of Alu activity on somatic genetic damage may significantly vary between individuals and tissues.

  8. Evolutionary modes of emergence of short interspersed nuclear element (SINE) families in grasses.

    PubMed

    Kögler, Anja; Schmidt, Thomas; Wenke, Torsten

    2017-11-01

    Short interspersed nuclear elements (SINEs) are non-autonomous transposable elements which are propagated by retrotransposition and constitute an inherent part of the genome of most eukaryotic species. Knowledge of heterogeneous and highly abundant SINEs is crucial for de novo (or improvement of) annotation of whole genome sequences. We scanned Poaceae genome sequences of six important cereals (Oryza sativa, Triticum aestivum, Hordeum vulgare, Panicum virgatum, Sorghum bicolor, Zea mays) and Brachypodium distachyon to examine the diversity and evolution of SINE populations. We comparatively analyzed the structural features, distribution, evolutionary relation and abundance of 32 SINE families and subfamilies within grasses, comprising 11 052 individual copies. The investigation of activity profiles within the Poaceae provides insights into their species-specific diversification and amplification. We found that Poaceae SINEs (PoaS) fall into two length categories: simple SINEs of up to 180 bp and dimeric SINEs larger than 240 bp. Detailed analysis at the nucleotide level revealed that multimerization of related and unrelated SINE copies is an important evolutionary mechanism of SINE formation. We conclude that PoaS families diversify by massive reshuffling between SINE families, likely caused by insertion of truncated copies, and provide a model for this evolutionary scenario. Twenty-eight of 32 PoaS families and subfamilies show significant conservation, in particular either in the 5' or 3' regions, across Poaceae species and share large sequence stretches with one or more other PoaS families. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  9. MR-compatibility assessment of MADPET4: a study of interferences between an SiPM-based PET insert and a 7 T MRI system.

    PubMed

    Omidvari, Negar; Topping, Geoffrey; Cabello, Jorge; Paul, Stephan; Schwaiger, Markus; Ziegler, Sibylle I

    2018-05-01

    Compromises in the design of a positron emission tomography (PET) insert for a magnetic resonance imaging (MRI) system should minimize the deterioration of image quality in both modalities, particularly when simultaneous demanding acquisitions are performed. In this work, the advantages of using individually read-out crystals with high-gain silicon photomultipliers (SiPMs) were studied with a small animal PET insert for a 7 T MRI system, in which the SiPM charge was transferred to outside the MRI scanner using coaxial cables. The interferences between the two systems were studied with three radio-frequency (RF) coil configurations. The effects of PET on the static magnetic field, flip angle distribution, RF noise, and image quality of various MRI sequences (gradient echo, spin echo, and echo planar imaging (EPI) at 1 H frequency, and chemical shift imaging at 13 C frequency) were investigated. The effects of fast-switching gradient fields and RF pulses on PET count rate were studied, while the PET insert and the readout electronics were not shielded. Operating the insert inside a 1 H volume coil, used for RF transmission and reception, limited the MRI to T1-weighted imaging, due to coil detuning and RF attenuation, and resulted in significant PET count loss. Using a surface receive coil allowed all tested MR sequences to be used with the insert, with 45-59% signal-to-noise ratio (SNR) degradation, compared to without PET. With a 1 H/ 13 C volume coil inside the insert and shielded by a copper tube, the SNR degradation was limited to 23-30% with all tested sequences. The insert did not introduce any discernible distortions into images of two tested EPI sequences. Use of truncated sinc shaped RF excitation pulses and gradient field switching had negligible effects on PET count rate. However, PET count rate was substantially affected by high-power RF block pulses and temperature variations due to high gradient duty cycles.

  10. A specific insertion of a solo-LTR characterizes the Y-chromosome of Bryonia dioica (Cucurbitaceae).

    PubMed

    Oyama, Ryan K; Silber, Martina V; Renner, Susanne S

    2010-06-14

    Relatively few species of flowering plants are dioecious and even fewer are known to have sex chromosomes. Current theory posits that homomorphic sex chromosomes, such as found in Bryonia dioica (Cucurbitaceae), offer insight into the early stages in the evolution of sex chromosomes from autosomes. Little is known about these early steps, but an accumulation of transposable element sequences has been observed on the Y-chromosomes of some species with heteromorphic sex chromosomes. Recombination, by which transposable elements are removed, is suppressed on at least part of the emerging Y-chromosome, and this may explain the correlation between the emergence of sex chromosomes and transposable element enrichment. We sequenced 2321 bp of the Y-chromosome in Bryonia dioica that flank a male-linked marker, BdY1, reported previously. Within this region, which should be suppressed for recombination, we observed a solo-LTR nested in a Copia-like transposable element. We also found other, presumably paralogous, solo-LTRs in a consensus sequence of the underlying Copia-like transposable element. Given that solo-LTRs arise via recombination events, it is noteworthy that we find one in a genomic region where recombination should be suppressed. Although the solo-LTR could have arisen before recombination was suppressed, creating the male-linked marker BdY1, our previous study on B. dioica suggested that BdY1 may not lie in the recombination-suppressed region of the Y-chromosome in all populations. Presence of a solo-LTR near BdY1 therefore fits with the observed correlation between retrotransposon accumulation and the suppression of recombination early in the evolution of sex chromosomes. These findings further suggest that the homomorphic sex chromosomes of B. dioica, the first organism for which genetic XY sex-determination was inferred, are evolutionarily young and offer reference information for comparative studies of other plant sex chromosomes.

  11. Endogenous avian leukosis viral loci in the Red Jungle Fowl genome assembly.

    PubMed

    Benkel, Bernhard; Rutherford, Katherine

    2014-12-01

    The current build (galGal4) of the genome of the ancestor of the modern chicken, the Red Jungle Fowl, contains a single endogenous avian leukosis viral element (ALVE) on chromosome 1 (designated RSV-LTR; family ERVK). The assembly shows the ALVE provirus juxtaposed with a member of a second family of avian endogenous retroviruses (designated GGERV20; family ERVL); however, the status of the 3' end of the ALVE element as well as its flanking region remain unclear due to a gap in the reference genome sequence. In this study, we filled the gap in the assembly using a combination of long-range PCR (LR-PCR) and a short contig present in the unassembled portion of the reference genome database. Our results demonstrate that the ALVE element (ALVE-JFevB) is inserted into the putative envelope region of a GGERV20 element, roughly 1 kbp from its 3' end, and that ALVE-JFevB is complete, and depending on its expression status, potentially capable of directing the production of virus. Moreover, the unassembled portion of the genome database contains junction fragments for a second, previously characterized endogenous proviral element, ALVE-6. ©2014 Poultry Science Association Inc.

  12. Development of PET/MRI with insertable PET for simultaneous PET and MR imaging of human brain.

    PubMed

    Jung, Jin Ho; Choi, Yong; Jung, Jiwoong; Kim, Sangsu; Lim, Hyun Keong; Im, Ki Chun; Oh, Chang Hyun; Park, Hyun-wook; Kim, Kyung Min; Kim, Jong Guk

    2015-05-01

    The purpose of this study was to develop a dual-modality positron emission tomography (PET)/magnetic resonance imaging (MRI) with insertable PET for simultaneous PET and MR imaging of the human brain. The PET detector block was composed of a 4 × 4 matrix of detector modules, each consisting of a 4 × 4 array LYSO coupled to a 4 × 4 Geiger-mode avalanche photodiode (GAPD) array. The PET insert consisted of 18 detector blocks, circularly mounted on a custom-made plastic base to form a ring with an inner diameter of 390 mm and axial length of 60 mm. The PET gantry was shielded with gold-plated conductive fabric tapes with a thickness of 0.1 mm. The charge signals of PET detector transferred via 4 m long flat cables were fed into the position decoder circuit. The flat cables were shielded with a mesh-type aluminum sheet with a thickness of 0.24 mm. The position decoder circuit and field programmable gate array-embedded DAQ modules were enclosed in an aluminum box with a thickness of 10 mm and located at the rear of the MR bore inside the MRI room. A 3-T human MRI system with a Larmor frequency of 123.7 MHz and inner bore diameter of 60 cm was used as the PET/MRI hybrid system. A custom-made radio frequency (RF) coil with an inner diameter of 25 cm was fabricated. The PET was positioned between gradient and the RF coils. PET performance was measured outside and inside the MRI scanner using echo planar imaging, spin echo, turbo spin echo, and gradient echo sequences. MRI performance was also evaluated with and without the PET insert. The stability of the newly developed PET insert was evaluated and simultaneous PET and MR images of a brain phantom were acquired. No significant degradation of the PET performance caused by MR was observed when the PET was operated using various MR imaging sequences. The signal-to-noise ratio of MR images was slightly degraded due to the PET insert installed inside the MR bore while the homogeneity was maintained. The change of gain of the 256 GAPD/scintillator elements of a detector block was <3% for 60 min, and simultaneous PET and MR images of a brain phantom were successfully acquired. Experimental results indicate that a compact and lightweight PET insert for hybrid PET/MRI can be developed using GAPD arrays and charge signal transmission method proposed in this study without significant interference.

  13. Understanding the differences between genome sequences of Escherichia coli B strains REL606 and BL21(DE3), and comparison of the closely related E. coli B and K-12 genomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Studier, F.W.; Daegelen, P.; Lenski, R. E.

    2009-12-01

    Each difference between the genome sequences of Escherichia coli B strains REL606 and BL21(DE3) can be interpreted in light of known laboratory manipulations plus a gene conversion between ribosomal RNA operons. Two treatments with 1-methyl-3-nitro-1-nitrosoguanidine in the REL606 lineage produced at least 93 single-base-pair mutations ({approx} 90% GC-to-AT transitions) and 3 single-base-pair GC deletions. Two UV treatments in the BL21(DE3) lineage produced only 4 single-base-pair mutations but 16 large deletions. P1 transductions from K-12 into the two B lineages produced 317 single-base-pair differences and 9 insertions or deletions, reflecting differences between B DNA in BL21(DE3) and integrated restriction fragments ofmore » K-12 DNA inherited by REL606. Two sites showed selective enrichment of spontaneous mutations. No unselected spontaneous single-base-pair mutations were evident. The genome sequences revealed that a progenitor of REL606 had been misidentified, explaining initially perplexing differences. Limited sequencing of other B strains defined characteristic properties of B and allowed assembly of the inferred genome of the ancestral B of Delbrueck and Luria. Comparison of the B and K-12 genomes shows that more than half of the 3793 proteins of their basic genomes are predicted to be identical, although {approx} 310 appear to be functional in either B or K-12 but not in both. The ancestral basic genome appears to have had {approx} 4039 coding sequences occupying {approx} 4.0 Mbp. Repeated horizontal transfer from diverged Escherichia coli genomes and homologous recombination may explain the observed variable distribution of single-base-pair differences. Fifteen sites are occupied by phage-related elements, but only six by comparable elements at the same site. More than 50 sites are occupied by IS elements in both B and K, 16 in common, and likely founding IS elements are identified. A signature of widespread cryptic phage P4-type mobile elements was identified. Complex deletions (dense clusters of small deletions and substitutions) apparently removed nonessential genes from {approx} 30 sites in the basic genomes.« less

  14. Placing three-dimensional isoparametric elements into NASTRAN. [alterations in matrix assembly to simplify generation of higher order elements

    NASA Technical Reports Server (NTRS)

    Newman, M. B.; Filstrup, A. W.

    1973-01-01

    Linear (8 node), parabolic (20 node), cubic (32 node) and mixed (some edges linear, some parabolic and some cubic) have been inserted into NASTRAN, level 15.1. First the dummy element feature was used to check out the stiffness matrix generation routines for the linear element in NASTRAN. Then, the necessary modules of NASTRAN were modified to include the new family of elements. The matrix assembly was changed so that the stiffness matrix of each isoparametric element is only generated once as the time to generate these higher order elements tends to be much longer than the other elements in NASTRAN. This paper presents some of the experiences and difficulties of inserting a new element or family of elements into NASTRAN.

  15. The Tgm9-induced indexed insertional mutant collection to conduct community-based reverse genetics studies in soybean

    USDA-ARS?s Scientific Manuscript database

    Until now, functional analyses of soybean genes have been very arduous because of the lack of a rapid transformation procedure. Recently identified the active endogenous type II transposable element, Tgm9, excises from insertion sites and restores wild-type phenotypes. Thus, this element provides a ...

  16. A High-Throughput Arabidopsis Reverse Genetics System

    PubMed Central

    Sessions, Allen; Burke, Ellen; Presting, Gernot; Aux, George; McElver, John; Patton, David; Dietrich, Bob; Ho, Patrick; Bacwaden, Johana; Ko, Cynthia; Clarke, Joseph D.; Cotton, David; Bullis, David; Snell, Jennifer; Miguel, Trini; Hutchison, Don; Kimmerly, Bill; Mitzel, Theresa; Katagiri, Fumiaki; Glazebrook, Jane; Law, Marc; Goff, Stephen A.

    2002-01-01

    A collection of Arabidopsis lines with T-DNA insertions in known sites was generated to increase the efficiency of functional genomics. A high-throughput modified thermal asymetric interlaced (TAIL)-PCR protocol was developed and used to amplify DNA fragments flanking the T-DNA left borders from ∼100,000 transformed lines. A total of 85,108 TAIL-PCR products from 52,964 T-DNA lines were sequenced and compared with the Arabidopsis genome to determine the positions of T-DNAs in each line. Predicted T-DNA insertion sites, when mapped, showed a bias against predicted coding sequences. Predicted insertion mutations in genes of interest can be identified using Arabidopsis Gene Index name searches or by BLAST (Basic Local Alignment Search Tool) search. Insertions can be confirmed by simple PCR assays on individual lines. Predicted insertions were confirmed in 257 of 340 lines tested (76%). This resource has been named SAIL (Syngenta Arabidopsis Insertion Library) and is available to the scientific community at www.tmri.org. PMID:12468722

  17. A Bioinformatics Approach for Detecting Repetitive Nested Motifs using Pattern Matching.

    PubMed

    Romero, José R; Carballido, Jessica A; Garbus, Ingrid; Echenique, Viviana C; Ponzoni, Ignacio

    2016-01-01

    The identification of nested motifs in genomic sequences is a complex computational problem. The detection of these patterns is important to allow the discovery of transposable element (TE) insertions, incomplete reverse transcripts, deletions, and/or mutations. In this study, a de novo strategy for detecting patterns that represent nested motifs was designed based on exhaustive searches for pairs of motifs and combinatorial pattern analysis. These patterns can be grouped into three categories, motifs within other motifs, motifs flanked by other motifs, and motifs of large size. The methodology used in this study, applied to genomic sequences from the plant species Aegilops tauschii and Oryza sativa , revealed that it is possible to identify putative nested TEs by detecting these three types of patterns. The results were validated through BLAST alignments, which revealed the efficacy and usefulness of the new method, which is called Mamushka.

  18. Genomic analyses of primitive, wild and cultivated citrus provide insights into asexual reproduction.

    PubMed

    Wang, Xia; Xu, Yuantao; Zhang, Siqi; Cao, Li; Huang, Yue; Cheng, Junfeng; Wu, Guizhi; Tian, Shilin; Chen, Chunli; Liu, Yan; Yu, Huiwen; Yang, Xiaoming; Lan, Hong; Wang, Nan; Wang, Lun; Xu, Jidi; Jiang, Xiaolin; Xie, Zongzhou; Tan, Meilian; Larkin, Robert M; Chen, Ling-Ling; Ma, Bin-Guang; Ruan, Yijun; Deng, Xiuxin; Xu, Qiang

    2017-05-01

    The emergence of apomixis-the transition from sexual to asexual reproduction-is a prominent feature of modern citrus. Here we de novo sequenced and comprehensively studied the genomes of four representative citrus species. Additionally, we sequenced 100 accessions of primitive, wild and cultivated citrus. Comparative population analysis suggested that genomic regions harboring energy- and reproduction-associated genes are probably under selection in cultivated citrus. We also narrowed the genetic locus responsible for citrus polyembryony, a form of apomixis, to an 80-kb region containing 11 candidate genes. One of these, CitRWP, is expressed at higher levels in ovules of polyembryonic cultivars. We found a miniature inverted-repeat transposable element insertion in the promoter region of CitRWP that cosegregated with polyembryony. This study provides new insights into citrus apomixis and constitutes a promising resource for the mining of agriculturally important genes.

  19. A high-quality human reference panel reveals the complexity and distribution of genomic structural variants.

    PubMed

    Hehir-Kwa, Jayne Y; Marschall, Tobias; Kloosterman, Wigard P; Francioli, Laurent C; Baaijens, Jasmijn A; Dijkstra, Louis J; Abdellaoui, Abdel; Koval, Vyacheslav; Thung, Djie Tjwan; Wardenaar, René; Renkens, Ivo; Coe, Bradley P; Deelen, Patrick; de Ligt, Joep; Lameijer, Eric-Wubbo; van Dijk, Freerk; Hormozdiari, Fereydoun; Uitterlinden, André G; van Duijn, Cornelia M; Eichler, Evan E; de Bakker, Paul I W; Swertz, Morris A; Wijmenga, Cisca; van Ommen, Gert-Jan B; Slagboom, P Eline; Boomsma, Dorret I; Schönhuth, Alexander; Ye, Kai; Guryev, Victor

    2016-10-06

    Structural variation (SV) represents a major source of differences between individual human genomes and has been linked to disease phenotypes. However, the majority of studies provide neither a global view of the full spectrum of these variants nor integrate them into reference panels of genetic variation. Here, we analyse whole genome sequencing data of 769 individuals from 250 Dutch families, and provide a haplotype-resolved map of 1.9 million genome variants across 9 different variant classes, including novel forms of complex indels, and retrotransposition-mediated insertions of mobile elements and processed RNAs. A large proportion are previously under reported variants sized between 21 and 100 bp. We detect 4 megabases of novel sequence, encoding 11 new transcripts. Finally, we show 191 known, trait-associated SNPs to be in strong linkage disequilibrium with SVs and demonstrate that our panel facilitates accurate imputation of SVs in unrelated individuals.

  20. Vector modifications to eliminate transposase expression following piggyBac-mediated transgenesis

    PubMed Central

    Chakraborty, Syandan; Ji, HaYeun; Chen, Jack; Gersbach, Charles A.; Leong, Kam W.

    2014-01-01

    Transgene insertion plays an important role in gene therapy and in biological studies. Transposon-based systems that integrate transgenes by transposase-catalyzed “cut-and-paste” mechanism have emerged as an attractive system for transgenesis. Hyperactive piggyBac transposon is particularly promising due to its ability to integrate large transgenes with high efficiency. However, prolonged expression of transposase can become a potential source of genotoxic effects due to uncontrolled transposition of the integrated transgene from one chromosomal locus to another. In this study we propose a vector design to decrease post-transposition expression of transposase and to eliminate the cells that have residual transposase expression. We design a single plasmid construct that combines the transposase and the transpositioning transgene element to share a single polyA sequence for termination. Consequently, the separation of the transposase element from the polyA sequence after transposition leads to its deactivation. We also co-express Herpes Simplex Virus thymidine kinase (HSV-tk) with the transposase. Therefore, cells having residual transposase expression can be eliminated by the administration of ganciclovir. We demonstrate the utility of this combination transposon system by integrating and expressing a model therapeutic gene, human coagulation Factor IX, in HEK293T cells. PMID:25492703

  1. Improving prokaryotic transposable elements identification using a combination of de novo and profile HMM methods.

    PubMed

    Kamoun, Choumouss; Payen, Thibaut; Hua-Van, Aurélie; Filée, Jonathan

    2013-10-11

    Insertion Sequences (ISs) and their non-autonomous derivatives (MITEs) are important components of prokaryotic genomes inducing duplication, deletion, rearrangement or lateral gene transfers. Although ISs and MITEs are relatively simple and basic genetic elements, their detection remains a difficult task due to their remarkable sequence diversity. With the advent of high-throughput genome and metagenome sequencing technologies, the development of fast, reliable and sensitive methods of ISs and MITEs detection become an important challenge. So far, almost all studies dealing with prokaryotic transposons have used classical BLAST-based detection methods against reference libraries. Here we introduce alternative methods of detection either taking advantages of the structural properties of the elements (de novo methods) or using an additional library-based method using profile HMM searches. In this study, we have developed three different work flows dedicated to ISs and MITEs detection: the first two use de novo methods detecting either repeated sequences or presence of Inverted Repeats; the third one use 28 in-house transposase alignment profiles with HMM search methods. We have compared the respective performances of each method using a reference dataset of 30 archaeal and 30 bacterial genomes in addition to simulated and real metagenomes. Compared to a BLAST-based method using ISFinder as library, de novo methods significantly improve ISs and MITEs detection. For example, in the 30 archaeal genomes, we discovered 30 new elements (+20%) in addition to the 141 multi-copies elements already detected by the BLAST approach. Many of the new elements correspond to ISs belonging to unknown or highly divergent families. The total number of MITEs has even doubled with the discovery of elements displaying very limited sequence similarities with their respective autonomous partners (mainly in the Inverted Repeats of the elements). Concerning metagenomes, with the exception of short reads data (<300 bp) for which both techniques seem equally limited, profile HMM searches considerably ameliorate the detection of transposase encoding genes (up to +50%) generating low level of false positives compare to BLAST-based methods. Compared to classical BLAST-based methods, the sensitivity of de novo and profile HMM methods developed in this study allow a better and more reliable detection of transposons in prokaryotic genomes and metagenomes. We believed that future studies implying ISs and MITEs identification in genomic data should combine at least one de novo and one library-based method, with optimal results obtained by running the two de novo methods in addition to a library-based search. For metagenomic data, profile HMM search should be favored, a BLAST-based step is only useful to the final annotation into groups and families.

  2. Identification of a peroxisome proliferator-responsive element upstream of the gene encoding rat peroxisomal enoyl-CoA hydratase/3-hydroxyacyl-CoA dehydrogenase.

    PubMed Central

    Zhang, B; Marcus, S L; Sajjadi, F G; Alvares, K; Reddy, J K; Subramani, S; Rachubinski, R A; Capone, J P

    1992-01-01

    Ciprofibrate, a hypolipidemic drug that acts as a peroxisome proliferator, induces the transcription of genes encoding peroxisomal beta-oxidation enzymes. To identify cis-acting promoter elements involved in this induction, 5.8 kilobase pairs of promoter sequence from the gene encoding rat peroxisomal enoyl-CoA hydratase/3-hydroxyacyl-CoA dehydrogenase (EC 4.2.1.17/EC 1.1.1.35) was inserted upstream of a luciferase reporter gene. Transfection of this expression vector into rat hepatoma H4IIEC3 cells in the presence of ciprofibrate resulted in a 5- to 10-fold, cell type-specific increase in luciferase activity as compared to cells transfected in the absence of drug. A peroxisome proliferator-responsive element (PPRE) was localized to a 196-nucleotide region centered at position -2943 from the transcription start site. This PPRE conferred ciprofibrate responsiveness on a heterologous promoter and functioned independently of orientation or position. Gel retardation analysis with nuclear extracts demonstrated that ciprofibrate-treated or untreated H4IIEC3 cells, but not HeLa cells or monkey kidney cells, contained sequence-specific DNA binding factors that interact with the PPRE. These results have implications for understanding the mechanisms of coordinated transcriptional induction of genes encoding peroxisomal proteins by hypolipidemic agents and other peroxisome proliferators. Images PMID:1502166

  3. Anaerobically Grown Escherichia coli Has an Enhanced Mutation Rate and Distinct Mutational Spectra

    PubMed Central

    Shewaramani, Sonal; Finn, Thomas J.; Kassen, Rees; Rainey, Paul B.

    2017-01-01

    Oxidative stress is a major cause of mutation but little is known about how growth in the absence of oxygen impacts the rate and spectrum of mutations. We employed long-term mutation accumulation experiments to directly measure the rates and spectra of spontaneous mutation events in Escherichia coli populations propagated under aerobic and anaerobic conditions. To detect mutations, whole genome sequencing was coupled with methods of analysis sufficient to identify a broad range of mutational classes, including structural variants (SVs) generated by movement of repetitive elements. The anaerobically grown populations displayed a mutation rate nearly twice that of the aerobic populations, showed distinct asymmetric mutational strand biases, and greater insertion element activity. Consistent with mutation rate and spectra observations, genes for transposition and recombination repair associated with SVs were up-regulated during anaerobic growth. Together, these results define differences in mutational spectra affecting the evolution of facultative anaerobes. PMID:28103245

  4. Brucella abortus Strain 2308 Wisconsin Genome: Importance of the Definition of Reference Strains

    PubMed Central

    Suárez-Esquivel, Marcela; Ruiz-Villalobos, Nazareth; Castillo-Zeledón, Amanda; Jiménez-Rojas, César; Roop II, R. Martin; Comerci, Diego J.; Barquero-Calvo, Elías; Chacón-Díaz, Carlos; Caswell, Clayton C.; Baker, Kate S.; Chaves-Olarte, Esteban; Thomson, Nicholas R.; Moreno, Edgardo; Letesson, Jean J.; De Bolle, Xavier; Guzmán-Verri, Caterina

    2016-01-01

    Brucellosis is a bacterial infectious disease affecting a wide range of mammals and a neglected zoonosis caused by species of the genetically homogenous genus Brucella. As in most studies on bacterial diseases, research in brucellosis is carried out by using reference strains as canonical models to understand the mechanisms underlying host pathogen interactions. We performed whole genome sequencing analysis of the reference strain B. abortus 2308 routinely used in our laboratory, including manual curated annotation accessible as an editable version through a link at https://en.wikipedia.org/wiki/Brucella#Genomics. Comparison of this genome with two publically available 2308 genomes showed significant differences, particularly indels related to insertional elements, suggesting variability related to the transposition of these elements within the same strain. Considering the outcome of high resolution genomic techniques in the bacteriology field, the conventional concept of strain definition needs to be revised. PMID:27746773

  5. Contribution of type W human endogenous retroviruses to the human genome: characterization of HERV-W proviral insertions and processed pseudogenes.

    PubMed

    Grandi, Nicole; Cadeddu, Marta; Blomberg, Jonas; Tramontano, Enzo

    2016-09-09

    Human endogenous retroviruses (HERVs) are ancient sequences integrated in the germ line cells and vertically transmitted through the offspring constituting about 8 % of our genome. In time, HERVs accumulated mutations that compromised their coding capacity. A prominent exception is HERV-W locus 7q21.2, producing a functional Env protein (Syncytin-1) coopted for placental syncytiotrophoblast formation. While expression of HERV-W sequences has been investigated for their correlation to disease, an exhaustive description of the group composition and characteristics is still not available and current HERV-W group information derive from studies published a few years ago that, of course, used the rough assemblies of the human genome available at that time. This hampers the comparison and correlation with current human genome assemblies. In the present work we identified and described in detail the distribution and genetic composition of 213 HERV-W elements. The bioinformatics analysis led to the characterization of several previously unreported features and provided a phylogenetic classification of two main subgroups with different age and structural characteristics. New facts on HERV-W genomic context of insertion and co-localization with sequences putatively involved in disease development are also reported. The present work is a detailed overview of the HERV-W contribution to the human genome and provides a robust genetic background useful to clarify HERV-W role in pathologies with poorly understood etiology, representing, to our knowledge, the most complete and exhaustive HERV-W dataset up to date.

  6. Structural analyses of the CRISPR protein Csc2 reveal the RNA-binding interface of the type I-D Cas7 family.

    PubMed

    Hrle, Ajla; Maier, Lisa-Katharina; Sharma, Kundan; Ebert, Judith; Basquin, Claire; Urlaub, Henning; Marchfelder, Anita; Conti, Elena

    2014-01-01

    Upon pathogen invasion, bacteria and archaea activate an RNA-interference-like mechanism termed CRISPR (clustered regularly interspaced short palindromic repeats). A large family of Cas (CRISPR-associated) proteins mediates the different stages of this sophisticated immune response. Bioinformatic studies have classified the Cas proteins into families, according to their sequences and respective functions. These range from the insertion of the foreign genetic elements into the host genome to the activation of the interference machinery as well as target degradation upon attack. Cas7 family proteins are central to the type I and type III interference machineries as they constitute the backbone of the large interference complexes. Here we report the crystal structure of Thermofilum pendens Csc2, a Cas7 family protein of type I-D. We found that Csc2 forms a core RRM-like domain, flanked by three peripheral insertion domains: a lid domain, a Zinc-binding domain and a helical domain. Comparison with other Cas7 family proteins reveals a set of similar structural features both in the core and in the peripheral domains, despite the absence of significant sequence similarity. T. pendens Csc2 binds single-stranded RNA in vitro in a sequence-independent manner. Using a crosslinking - mass-spectrometry approach, we mapped the RNA-binding surface to a positively charged surface patch on T. pendens Csc2. Thus our analysis of the key structural and functional features of T. pendens Csc2 highlights recurring themes and evolutionary relationships in type I and type III Cas proteins.

  7. SINE Retrotransposition: Evaluation of Alu Activity and Recovery of De Novo Inserts.

    PubMed

    Ade, Catherine; Roy-Engel, Astrid M

    2016-01-01

    Mobile element activity is of great interest due to its impact on genomes. However, the types of mobile elements that inhabit any given genome are remarkably varied. Among the different varieties of mobile elements, the Short Interspersed Elements (SINEs) populate many genomes, including many mammalian species. Although SINEs are parasites of Long Interspersed Elements (LINEs), SINEs have been highly successful in both the primate and rodent genomes. When comparing copy numbers in mammals, SINEs have been vastly more successful than other nonautonomous elements, such as the retropseudogenes and SVA. Interestingly, in the human genome the copy number of Alu (a primate SINE) outnumbers LINE-1 (L1) copies 2 to 1. Estimates suggest that the retrotransposition rate for Alu is tenfold higher than LINE-1 with about 1 insert in every twenty births. Furthermore, Alu-induced mutagenesis is responsible for the majority of the documented instances of human retroelement insertion-induced disease. However, little is known on what contributes to these observed differences between SINEs and LINEs. The development of an assay to monitor SINE retrotransposition in culture has become an important tool for the elucidation of some of these differences. In this chapter, we present details of the SINE retrotransposition assay and the recovery of de novo inserts. We also focus on the nuances that are unique to the SINE assay.

  8. ``Dressing'' lines and vertices in calculations of matrix elements with the coupled-cluster method and determination of Cs atomic properties

    NASA Astrophysics Data System (ADS)

    Derevianko, Andrei; Porsev, Sergey G.

    2005-03-01

    We consider evaluation of matrix elements with the coupled-cluster method. Such calculations formally involve infinite number of terms and we devise a method of partial summation (dressing) of the resulting series. Our formalism is built upon an expansion of the product C†C of cluster amplitudes C into a sum of n -body insertions. We consider two types of insertions: particle (hole) line insertion and two-particle (two-hole) random-phase-approximation-like insertion. We demonstrate how to “dress” these insertions and formulate iterative equations. We illustrate the dressing equations in the case when the cluster operator is truncated at single and double excitations. Using univalent systems as an example, we upgrade coupled-cluster diagrams for matrix elements with the dressed insertions and highlight a relation to pertinent fourth-order diagrams. We illustrate our formalism with relativistic calculations of the hyperfine constant A(6s) and the 6s1/2-6p1/2 electric-dipole transition amplitude for the Cs atom. Finally, we augment the truncated coupled-cluster calculations with otherwise omitted fourth order diagrams. The resulting analysis for Cs is complete through the fourth order of many-body perturbation theory and reveals an important role of triple and disconnected quadruple excitations.

  9. Expression of Wheat High Molecular Weight Glutenin Subunit 1Bx Is Affected by Large Insertions and Deletions Located in the Upstream Flanking Sequences

    PubMed Central

    Hao, Chenyang; Tang, Saijun; Zhang, Xueyong; Li, Tian

    2014-01-01

    To better understand the transcriptional regulation of high molecular weight glutenin subunit (HMW-GS) expression, we isolated four Glu-1Bx promoters from six wheat cultivars exhibiting diverse protein expression levels. The activities of the diverse Glu-1Bx promoters were tested and compared with β-glucuronidase (GUS) reporter fusions. Although all the full-length Glu-1Bx promoters showed endosperm-specific activities, the strongest GUS activity was observed with the 1Bx7OE promoter in both transient expression assays and stable transgenic rice lines. A 43 bp insertion in the 1Bx7OE promoter, which is absent in the 1Bx7 promoter, led to enhanced expression. Analysis of promoter deletion constructs confirmed that a 185 bp MITE (miniature inverted-repeat transposable element) in the 1Bx14 promoter had a weak positive effect on Glu-1Bx expression, and a 54 bp deletion in the 1Bx13 promoter reduced endosperm-specific activity. To investigate the effect of the 43 bp insertion in the 1Bx7OE promoter, a functional marker was developed to screen 505 Chinese varieties and 160 European varieties, and only 1Bx7-type varieties harboring the 43 bp insertion in their promoters showed similar overexpression patterns. Hence, the 1Bx7OE promoter should be important tool in crop genetic engineering as well as in molecular assisted breeding. PMID:25133580

  10. Selenium. Role of the Essential Metalloid in Health

    PubMed Central

    Kurokawa, Suguru; Berry, Marla J.

    2015-01-01

    Selenium is an essential micronutrient in mammals, but is also recognized as toxic in excess. It is a non-metal with properties that are intermediate between the chalcogen elements sulfur and tellurium. Selenium exerts its biological functions through selenoproteins. Selenoproteins contain selenium in the form of the 21st amino acid, selenocysteine (Sec), which is an analog of cysteine with the sulfur-containing side chain replaced by a Se-containing side chain. Sec is encoded by the codon UGA, which is one of three termination codons for mRNA translation in non-selenoprotein genes. Recognition of the UGA codon as a Sec insertion site instead of stop requires a Sec insertion sequence (SECIS) element in selenoprotein mRNAs and a unique selenocysteyl-tRNA, both of which are recognized by specialized protein factors. Unlike the 20 standard amino acids, Sec is biosynthesized from serine on its tRNA. Twenty-five selenoproteins are encoded in the human genome. Most of the selenoprotein genes were discovered by bioinformatics approaches, searching for SECIS elements downstream of in-frame UGA codons. Sec has been described as having stronger nucleophilic and electrophilic properties than cysteine, and Sec is present in the catalytic site of all selenoenzymes. Most selenoproteins, whose functions are known, are involved in redox systems and signaling pathways. However, several selenoproteins are not well characterized in terms of their function. The selenium field has grown dramatically in the last few decades, and research on selenium biology is providing extensive new information regarding its importance for human health. PMID:24470102

  11. Transposable element dynamics and PIWI regulation impacts lncRNA and gene expression diversity in Drosophila ovarian cell cultures.

    PubMed

    Sytnikova, Yuliya A; Rahman, Reazur; Chirn, Gung-Wei; Clark, Josef P; Lau, Nelson C

    2014-12-01

    Piwi proteins and Piwi-interacting RNAs (piRNAs) repress transposable elements (TEs) from mobilizing in gonadal cells. To determine the spectrum of piRNA-regulated targets that may extend beyond TEs, we conducted a genome-wide survey for transcripts associated with PIWI and for transcripts affected by PIWI knockdown in Drosophila ovarian somatic sheet (OSS) cells, a follicle cell line expressing the Piwi pathway. Despite the immense sequence diversity among OSS cell piRNAs, our analysis indicates that TE transcripts are the major transcripts associated with and directly regulated by PIWI. However, several coding genes were indirectly regulated by PIWI via an adjacent de novo TE insertion that generated a nascent TE transcript. Interestingly, we noticed that PIWI-regulated genes in OSS cells greatly differed from genes affected in a related follicle cell culture, ovarian somatic cells (OSCs). Therefore, we characterized the distinct genomic TE insertions across four OSS and OSC lines and discovered dynamic TE landscapes in gonadal cultures that were defined by a subset of active TEs. Particular de novo TEs appeared to stimulate the expression of novel candidate long noncoding RNAs (lncRNAs) in a cell lineage-specific manner, and some of these TE-associated lncRNAs were associated with PIWI and overlapped PIWI-regulated genes. Our analyses of OSCs and OSS cells demonstrate that despite having a Piwi pathway to suppress endogenous mobile elements, gonadal cell TE landscapes can still dramatically change and create transcriptome diversity. © 2014 Sytnikova et al.; Published by Cold Spring Harbor Laboratory Press.

  12. Utilization of next generation sequencing for analyzing transgenic insertions in plum

    USDA-ARS?s Scientific Manuscript database

    When utilizing transgenic plants, it is useful to know how many copies of the genes were inserted and the locations of these insertions in the genome. This information can provide important insights for the interpretation of transgene expression and the resulting phenotype. Traditionally, these qu...

  13. Low-pass sequencing for microbial comparative genomics

    PubMed Central

    Goo, Young Ah; Roach, Jared; Glusman, Gustavo; Baliga, Nitin S; Deutsch, Kerry; Pan, Min; Kennedy, Sean; DasSarma, Shiladitya; Victor Ng, Wailap; Hood, Leroy

    2004-01-01

    Background We studied four extremely halophilic archaea by low-pass shotgun sequencing: (1) the metabolically versatile Haloarcula marismortui; (2) the non-pigmented Natrialba asiatica; (3) the psychrophile Halorubrum lacusprofundi and (4) the Dead Sea isolate Halobaculum gomorrense. Approximately one thousand single pass genomic sequences per genome were obtained. The data were analyzed by comparative genomic analyses using the completed Halobacterium sp. NRC-1 genome as a reference. Low-pass shotgun sequencing is a simple, inexpensive, and rapid approach that can readily be performed on any cultured microbe. Results As expected, the four archaeal halophiles analyzed exhibit both bacterial and eukaryotic characteristics as well as uniquely archaeal traits. All five halophiles exhibit greater than sixty percent GC content and low isoelectric points (pI) for their predicted proteins. Multiple insertion sequence (IS) elements, often involved in genome rearrangements, were identified in H. lacusprofundi and H. marismortui. The core biological functions that govern cellular and genetic mechanisms of H. sp. NRC-1 appear to be conserved in these four other halophiles. Multiple TATA box binding protein (TBP) and transcription factor IIB (TFB) homologs were identified from most of the four shotgunned halophiles. The reconstructed molecular tree of all five halophiles shows a large divergence between these species, but with the closest relationship being between H. sp. NRC-1 and H. lacusprofundi. Conclusion Despite the diverse habitats of these species, all five halophiles share (1) high GC content and (2) low protein isoelectric points, which are characteristics associated with environmental exposure to UV radiation and hypersalinity, respectively. Identification of multiple IS elements in the genome of H. lacusprofundi and H. marismortui suggest that genome structure and dynamic genome reorganization might be similar to that previously observed in the IS-element rich genome of H. sp. NRC-1. Identification of multiple TBP and TFB homologs in these four halophiles are consistent with the hypothesis that different types of complex transcriptional regulation may occur through multiple TBP-TFB combinations in response to rapidly changing environmental conditions. Low-pass shotgun sequence analyses of genomes permit extensive and diverse analyses, and should be generally useful for comparative microbial genomics. PMID:14718067

  14. “One code to find them all”: a perl tool to conveniently parse RepeatMasker output files

    PubMed Central

    2014-01-01

    Background Of the different bioinformatic methods used to recover transposable elements (TEs) in genome sequences, one of the most commonly used procedures is the homology-based method proposed by the RepeatMasker program. RepeatMasker generates several output files, including the .out file, which provides annotations for all detected repeats in a query sequence. However, a remaining challenge consists of identifying the different copies of TEs that correspond to the identified hits. This step is essential for any evolutionary/comparative analysis of the different copies within a family. Different possibilities can lead to multiple hits corresponding to a unique copy of an element, such as the presence of large deletions/insertions or undetermined bases, and distinct consensus corresponding to a single full-length sequence (like for long terminal repeat (LTR)-retrotransposons). These possibilities must be taken into account to determine the exact number of TE copies. Results We have developed a perl tool that parses the RepeatMasker .out file to better determine the number and positions of TE copies in the query sequence, in addition to computing quantitative information for the different families. To determine the accuracy of the program, we tested it on several RepeatMasker .out files corresponding to two organisms (Drosophila melanogaster and Homo sapiens) for which the TE content has already been largely described and which present great differences in genome size, TE content, and TE families. Conclusions Our tool provides access to detailed information concerning the TE content in a genome at the family level from the .out file of RepeatMasker. This information includes the exact position and orientation of each copy, its proportion in the query sequence, and its quality compared to the reference element. In addition, our tool allows a user to directly retrieve the sequence of each copy and obtain the same detailed information at the family level when a local library with incomplete TE class/subclass information was used with RepeatMasker. We hope that this tool will be helpful for people working on the distribution and evolution of TEs within genomes.

  15. Characterization of (CA)n microsatellite repeats from large-insert clones.

    PubMed

    Litt, M; Browne, D

    2001-05-01

    The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit determination of sequences flanking the microsatellites. When cosmids or large-insert phage clones are used as primary sources of (CA)n repeat markers, they have traditionally been subcloned into plasmid vectors such as pUC18 or M13 mp 18/19 cloning vectors to obtain fragments of suitable size for DNA sequencing. This unit presents an alternative approach whereby a set of degenerate sequencing primers that anneal directly to (CA)n microsatellites can be used to determine sequences that are inaccessible with vector-derived primers. Because the primers anneal to the repeat and not to the vector, they can be used with subclones containing inserts of several kilobases and should, in theory, always give sequence in the regions directly flanking the repeat. Degeneracy at the 3 end of each of these primers prevents elongation of primers that have annealed out-of-register. The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit.

  16. Allexiviruses may have acquired inserted sequences between the CP and CRP genes to change the translation reinitiation strategy of CRP.

    PubMed

    Yoshida, Naoto; Shimura, Hanako; Masuta, Chikara

    2018-06-01

    Allexiviruses are economically important garlic viruses that are involved in garlic mosaic diseases. In this study, we characterized the allexivirus cysteine-rich protein (CRP) gene located just downstream of the coat protein (CP) gene in the viral genome. We determined the nucleotide sequences of the CP and CRP genes from numerous allexivirus isolates and performed a phylogenetic analysis. According to the resulting phylogenetic tree, we found that allexiviruses were clearly divided into two major groups (group I and group II) based on the sequences of the CP and CRP genes. In addition, the allexiviruses in group II had distinct sequences just before the CRP gene, while group I isolates did not. The inserted sequence between the CP and CRP genes was partially complementary to garlic 18S rRNA. Using a potato virus X vector, we showed that the CRPs affected viral accumulation and symptom induction in Nicotiana benthamiana, suggesting that the allexivirus CRP is a pathogenicity determinant. We assume that the inserted sequences before the CRP gene may have been generated during viral evolution to alter the termination-reinitiation mechanism for coupled translation of CP and CRP.

  17. Cryo-EM near-atomic structure of a dsRNA fungal virus shows ancient structural motifs preserved in the dsRNA viral lineage

    PubMed Central

    Luque, Daniel; Gómez-Blanco, Josué; Garriga, Damiá; Brilot, Axel F.; González, José M.; Havens, Wendy M.; Carrascosa, José L.; Trus, Benes L.; Verdaguer, Nuria; Ghabrial, Said A.; Castón, José R.

    2014-01-01

    Viruses evolve so rapidly that sequence-based comparison is not suitable for detecting relatedness among distant viruses. Structure-based comparisons suggest that evolution led to a small number of viral classes or lineages that can be grouped by capsid protein (CP) folds. Here, we report that the CP structure of the fungal dsRNA Penicillium chrysogenum virus (PcV) shows the progenitor fold of the dsRNA virus lineage and suggests a relationship between lineages. Cryo-EM structure at near-atomic resolution showed that the 982-aa PcV CP is formed by a repeated α-helical core, indicative of gene duplication despite lack of sequence similarity between the two halves. Superimposition of secondary structure elements identified a single “hotspot” at which variation is introduced by insertion of peptide segments. Structural comparison of PcV and other distantly related dsRNA viruses detected preferential insertion sites at which the complexity of the conserved α-helical core, made up of ancestral structural motifs that have acted as a skeleton, might have increased, leading to evolution of the highly varied current structures. Analyses of structural motifs only apparent after systematic structural comparisons indicated that the hallmark fold preserved in the dsRNA virus lineage shares a long (spinal) α-helix tangential to the capsid surface with the head-tailed phage and herpesvirus viral lineage. PMID:24821769

  18. Generation and analysis of a barcode-tagged insertion mutant library in the fission yeast Schizosaccharomyces pombe.

    PubMed

    Chen, Bo-Ruei; Hale, Devin C; Ciolek, Peter J; Runge, Kurt W

    2012-05-03

    Barcodes are unique DNA sequence tags that can be used to specifically label individual mutants. The barcode-tagged open reading frame (ORF) haploid deletion mutant collections in the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe allow for high-throughput mutant phenotyping because the relative growth of mutants in a population can be determined by monitoring the proportions of their associated barcodes. While these mutant collections have greatly facilitated genome-wide studies, mutations in essential genes are not present, and the roles of these genes are not as easily studied. To further support genome-scale research in S. pombe, we generated a barcode-tagged fission yeast insertion mutant library that has the potential of generating viable mutations in both essential and non-essential genes and can be easily analyzed using standard molecular biological techniques. An insertion vector containing a selectable ura4+ marker and a random barcode was used to generate a collection of 10,000 fission yeast insertion mutants stored individually in 384-well plates and as six pools of mixed mutants. Individual barcodes are flanked by Sfi I recognition sites and can be oligomerized in a unique orientation to facilitate barcode sequencing. Independent genetic screens on a subset of mutants suggest that this library contains a diverse collection of single insertion mutations. We present several approaches to determine insertion sites. This collection of S. pombe barcode-tagged insertion mutants is well-suited for genome-wide studies. Because insertion mutations may eliminate, reduce or alter the function of essential and non-essential genes, this library will contain strains with a wide range of phenotypes that can be assayed by their associated barcodes. The design of the barcodes in this library allows for barcode sequencing using next generation or standard benchtop cloning approaches.

  19. Studies on transposable elements in yeast. I. ROAM mutations causing increased expression of yeast genes: their activation by signals directed toward conjugation functions and their formation by insertion of Tyl repetitive elements

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Errede, B.; Cardillo, T.S.; Wever, G.

    1981-01-01

    Mechanisms available to eukaryotic organisms for the coordinate regulation of gene expression are being examined by genetic and biochemical characterization of an unusual mutation, CYC7-H2, which causes over-production of iso-2-cytochrome c in the yeast Saccharomyces cerevisiae. The CYC7-H2 mutation causes overproduction in haploid strains but only a 1- to 40-fold overproduction in MATa/MAT..cap alpha.. diploid strains. This regulation of overproduction has been characterized as a response to signals controlling conjugation in yeast. Furthermore, the abnormal controlling region has been identified as an insertion of a transposable and reiterated Ty1 element adjacent to the structural gene. Therefore, we suggest that Ty1more » elements or portions of Ty1 elements occur adjacent to some of the genes required for conjugation and that they normally function to control expression of this process. The suggested role of reiterated sequences may represent a general mechanism of coordinate regulation in eukaryotes. The CYC7-H2 mutation is closely related to other regulatory mutations occurring at the cargA, cargB and DUR1,2 loci. Similar to the CYC7-H2 mutation, the mutations designated cargA/sup +/O/sup h/, cargB/sup +/O/sup h/, and durO/sup h/ cause constitutive production of their respective gene products at much lower levels of MATa/MAT..cap alpha.. diploid strains than in the corresponding haploid strains. A consistent relationship between conjugation competence and the level of overproduction in all four mutants has been established. Observations characterizing the regulation of overproduction in the CYC7-H2 mutant are presented with the additional and parallel observations for the O/sup h/ mutants. Together these results provide a demonstration of the specificity and equivalence of regulatory control exhibited by ROAM mutants.« less

  20. The Influence of Primary and Secondary DNA Structure in Deletion and Duplication between Direct Repeats in Escherichia Coli

    PubMed Central

    Trinh, T. Q.; Sinden, R. R.

    1993-01-01

    We describe a system to measure the frequency of both deletions and duplications between direct repeats. Short 17- and 18-bp palindromic and nonpalindromic DNA sequences were cloned into the EcoRI site within the chloramphenicol acetyltransferase gene of plasmids pBR325 and pJT7. This creates an insert between direct repeated EcoRI sites and results in a chloramphenicol-sensitive phenotype. Selection for chloramphenicol resistance was utilized to select chloramphenicol resistant revertants that included those with precise deletion of the insert from plasmid pBR325 and duplication of the insert in plasmid pJT7. The frequency of deletion or duplication varied more than 500-fold depending on the sequence of the short sequence inserted into the EcoRI site. For the nonpalindromic inserts, multiple internal direct repeats and the length of the direct repeats appear to influence the frequency of deletion. Certain palindromic DNA sequences with the potential to form DNA hairpin structures that might stabilize the misalignment of direct repeats had a high frequency of deletion. Other DNA sequences with the potential to form structures that might destabilize misalignment of direct repeats had a very low frequency of deletion. Duplication mutations occurred at the highest frequency when the DNA between the direct repeats contained no direct or inverted repeats. The presence of inverted repeats dramatically reduced the frequency of duplications. The results support the slippage-misalignment model, suggesting that misalignment occurring during DNA replication leads to deletion and duplication mutations. The results also support the idea that the formation of DNA secondary structures during DNA replication can facilitate and direct specific mutagenic events. PMID:8325478

  1. Recombining overlapping BACs into a single larger BAC.

    PubMed

    Kotzamanis, George; Huxley, Clare

    2004-01-06

    BAC clones containing entire mammalian genes including all the transcribed region and long range controlling elements are very useful for functional analysis. Sequenced BACs are available for most of the human and mouse genomes and in many cases these contain intact genes. However, large genes often span more than one BAC, and single BACs covering the entire region of interest are not available. Here we describe a system for linking two or more overlapping BACs into a single clone by homologous recombination. The method was used to link a 61-kb insert carrying the final 5 exons of the human CFTR gene onto a 160-kb BAC carrying the first 22 exons. Two rounds of homologous recombination were carried out in the EL350 strain of bacteria which can be induced for the Red genes. In the first round, the inserts of the two overlapping BACs were subcloned into modified BAC vectors using homologous recombination. In the second round, the BAC to be added was linearised with the very rare-cutting enzyme I-PpoI and electroporated into recombination efficient EL350 bacteria carrying the other BAC. Recombined BACs were identified by antibiotic selection and PCR screening and 10% of clones contained the correctly recombined 220-kb BAC. The system can be used to link the inserts from any overlapping BAC or PAC clones. The original orientation of the inserts is not important and desired regions of the inserts can be selected. The size limit for the fragments recombined may be larger than the 61 kb used here and multiple BACs in a contig could be combined by alternating use of the two pBACLink vectors. This system should be of use to many investigators wishing to carry out functional analysis on large mammalian genes which are not available in single BAC clones.

  2. Escherichia coli O-Antigen Gene Clusters of Serogroups O62, O68, O131, O140, O142, and O163: DNA Sequences and Similarity between O62 and O68, and PCR-Based Serogrouping

    PubMed Central

    Liu, Yanhong; Yan, Xianghe; DebRoy, Chitrita; Fratamico, Pina M.; Needleman, David S.; Li, Robert W.; Wang, Wei; Losada, Liliana; Brinkac, Lauren; Radune, Diana; Toro, Magaly; Hegde, Narasimha; Meng, Jianghong

    2015-01-01

    The DNA sequence of the O-antigen gene clusters of Escherichia coli serogroups O62, O68, O131, O140, O142, and O163 was determined, and primers based on the wzx (O-antigen flippase) and/or wzy (O-antigen polymerase) genes within the O-antigen gene clusters were designed and used in PCR assays to identify each serogroup. Specificity was tested with E. coli reference strains, field isolates belonging to the target serogroups, and non-E. coli bacteria. The PCR assays were highly specific for the respective serogroups; however, the PCR assay targeting the O62 wzx gene reacted positively with strains belonging to E. coli O68, which was determined by serotyping. Analysis of the O-antigen gene cluster sequences of serogroups O62 and O68 reference strains showed that they were 94% identical at the nucleotide level, although O62 contained an insertion sequence (IS) element located between the rmlA and rmlC genes within the O-antigen gene cluster. A PCR assay targeting the rmlA and rmlC genes flanking the IS element was used to differentiate O62 and O68 serogroups. The PCR assays developed in this study can be used for the detection and identification of E. coli O62/O68, O131, O140, O142, and O163 strains isolated from different sources. PMID:25664526

  3. Sequence variability of Campylobacter temperate bacteriophages

    PubMed Central

    Clark, Clifford G; Ng, Lai-King

    2008-01-01

    Background Prophages integrated within the chromosomes of Campylobacter jejuni isolates have been demonstrated very recently. Prior work with Campylobacter temperate bacteriophages, as well as evidence from prophages in other enteric bacteria, suggests these prophages might have a role in the biology and virulence of the organism. However, very little is known about the genetic variability of Campylobacter prophages which, if present, could lead to differential phenotypes in isolates carrying the phages versus those that do not. As a first step in the characterization of C. jejuni prophages, we investigated the distribution of prophage DNA within a C. jejuni population assessed the DNA and protein sequence variability within a subset of the putative prophages found. Results Southern blotting of C. jejuni DNA using probes from genes within the three putative prophages of the C. jejuni sequenced strain RM 1221 demonstrated the presence of at least one prophage gene in a large proportion (27/35) of isolates tested. Of these, 15 were positive for 5 or more of the 7 Campylobacter Mu-like phage 1 (CMLP 1, also designated Campylobacter jejuni integrated element 1, or CJIE 1) genes tested. Twelve of these putative prophages were chosen for further analysis. DNA sequencing of a 9,000 to 11,000 nucleotide region of each prophage demonstrated a close homology with CMLP 1 in both gene order and nucleotide sequence. Structural and sequence variability, including short insertions, deletions, and allele replacements, were found within the prophage genomes, some of which would alter the protein products of the ORFs involved. No insertions of novel genes were detected within the sequenced regions. The 12 prophages and RM 1221 had a % G+C very similar to C. jejuni sequenced strains, as well as promoter regions characteristic of C. jejuni. None of the putative prophages were successfully induced and propagated, so it is not known if they were functional or if they represented remnant prophage DNA in the bacterial chromosomes. Conclusion These putative prophages form a family of phages with conserved sequences, and appear to be adapted to Campylobacter. There was evidence for recombination among groups of prophages, suggesting that the prophages had a mosaic structure. In many of these properties, the Mu-like CMLP 1 homologs characterized in this study resemble temperate bacteriophages of enteric bacteria that are responsible for contributions to virulence and host adaptation. PMID:18366706

  4. Dynamic analysis of a needle insertion for soft materials: Arbitrary Lagrangian-Eulerian-based three-dimensional finite element analysis.

    PubMed

    Yamaguchi, Satoshi; Tsutsui, Kihei; Satake, Koji; Morikawa, Shigehiro; Shirai, Yoshiaki; Tanaka, Hiromi T

    2014-10-01

    Our goal was to develop a three-dimensional finite element model that enables dynamic analysis of needle insertion for soft materials. To demonstrate large deformation and fracture, we used the arbitrary Lagrangian-Eulerian (ALE) method for fluid analysis. We performed ALE-based finite element analysis for 3% agar gel and three types of copper needle with bevel tips. To evaluate simulation results, we compared the needle deflection and insertion force with corresponding experimental results acquired with a uniaxial manipulator. We studied the shear stress distribution of agar gel on various time scales. For 30°, 45°, and 60°, differences in deflections of each needle between both sets of results were 2.424, 2.981, and 3.737mm, respectively. For the insertion force, there was no significant difference for mismatching area error (p<0.05) between simulation and experimental results. Our results have the potential to be a stepping stone to develop pre-operative surgical planning to estimate an optimal needle insertion path for MR image-guided microwave coagulation therapy and for analyzing large deformation and fracture in biological tissues. Copyright © 2014 Elsevier Ltd. All rights reserved.

  5. A practical approach to screen for authorised and unauthorised genetically modified plants.

    PubMed

    Waiblinger, Hans-Ulrich; Grohmann, Lutz; Mankertz, Joachim; Engelbert, Dirk; Pietsch, Klaus

    2010-03-01

    In routine analysis, screening methods based on real-time PCR are most commonly used for the detection of genetically modified (GM) plant material in food and feed. In this paper, it is shown that the combination of five DNA target sequences can be used as a universal screening approach for at least 81 GM plant events authorised or unauthorised for placing on the market and described in publicly available databases. Except for maize event LY038, soybean events DP-305423 and BPS-CV127-9 and cotton event 281-24-236 x 3006-210-23, at least one of the five genetic elements has been inserted in these GM plants and is targeted by this screening approach. For the detection of these sequences, fully validated real-time PCR methods have been selected. A screening table is presented that describes the presence or absence of the target sequences for most of the listed GM plants. These data have been verified either theoretically according to available databases or experimentally using available reference materials. The screening table will be updated regularly by a network of German enforcement laboratories.

  6. The genomic landscape shaped by selection on transposable elements across 18 mouse strains.

    PubMed

    Nellåker, Christoffer; Keane, Thomas M; Yalcin, Binnaz; Wong, Kim; Agam, Avigail; Belgard, T Grant; Flint, Jonathan; Adams, David J; Frankel, Wayne N; Ponting, Chris P

    2012-06-15

    Transposable element (TE)-derived sequence dominates the landscape of mammalian genomes and can modulate gene function by dysregulating transcription and translation. Our current knowledge of TEs in laboratory mouse strains is limited primarily to those present in the C57BL/6J reference genome, with most mouse TEs being drawn from three distinct classes, namely short interspersed nuclear elements (SINEs), long interspersed nuclear elements (LINEs) and the endogenous retrovirus (ERV) superfamily. Despite their high prevalence, the different genomic and gene properties controlling whether TEs are preferentially purged from, or are retained by, genetic drift or positive selection in mammalian genomes remain poorly defined. Using whole genome sequencing data from 13 classical laboratory and 4 wild-derived mouse inbred strains, we developed a comprehensive catalogue of 103,798 polymorphic TE variants. We employ this extensive data set to characterize TE variants across the Mus lineage, and to infer neutral and selective processes that have acted over 2 million years. Our results indicate that the majority of TE variants are introduced though the male germline and that only a minority of TE variants exert detectable changes in gene expression. However, among genes with differential expression across the strains there are twice as many TE variants identified as being putative causal variants as expected. Most TE variants that cause gene expression changes appear to be purged rapidly by purifying selection. Our findings demonstrate that past TE insertions have often been highly deleterious, and help to prioritize TE variants according to their likely contribution to gene expression or phenotype variation.

  7. iPBS: a universal method for DNA fingerprinting and retrotransposon isolation.

    PubMed

    Kalendar, Ruslan; Antonius, Kristiina; Smýkal, Petr; Schulman, Alan H

    2010-11-01

    Molecular markers are essential in plant and animal breeding and biodiversity applications, in human forensics, and for map-based cloning of genes. The long terminal repeat (LTR) retrotransposons are well suited as molecular markers. As dispersed and ubiquitous transposable elements, their "copy and paste" life cycle of replicative transposition leads to new genome insertions without excision of the original element. Both the overall structure of retrotransposons and the domains responsible for the various phases of their replication are highly conserved in all eukaryotes. Nevertheless, up to a year has been required to develop a retrotransposon marker system in a new species, involving cloning and sequencing steps as well as the development of custom primers. Here, we describe a novel PCR-based method useful both as a marker system in its own right and for the rapid isolation of retrotransposon termini and full-length elements, making it ideal for "orphan crops" and other species with underdeveloped marker systems. The method, iPBS amplification, is based on the virtually universal presence of a tRNA complement as a reverse transcriptase primer binding site (PBS) in LTR retrotransposons. The method differs from earlier retrotransposon isolation methods because it is applicable not only to endogenous retroviruses and retroviruses, but also to both Gypsy and Copia LTR retrotransposons, as well as to non-autonomous LARD and TRIM elements, throughout the plant kingdom and to animals. Furthermore, the inter-PBS amplification technique as such has proved to be a powerful DNA fingerprinting technology without the need for prior sequence knowledge.

  8. MR-compatibility assessment of MADPET4: a study of interferences between an SiPM-based PET insert and a 7 T MRI system

    NASA Astrophysics Data System (ADS)

    Omidvari, Negar; Topping, Geoffrey; Cabello, Jorge; Paul, Stephan; Schwaiger, Markus; Ziegler, Sibylle I.

    2018-05-01

    Compromises in the design of a positron emission tomography (PET) insert for a magnetic resonance imaging (MRI) system should minimize the deterioration of image quality in both modalities, particularly when simultaneous demanding acquisitions are performed. In this work, the advantages of using individually read-out crystals with high-gain silicon photomultipliers (SiPMs) were studied with a small animal PET insert for a 7 T MRI system, in which the SiPM charge was transferred to outside the MRI scanner using coaxial cables. The interferences between the two systems were studied with three radio-frequency (RF) coil configurations. The effects of PET on the static magnetic field, flip angle distribution, RF noise, and image quality of various MRI sequences (gradient echo, spin echo, and echo planar imaging (EPI) at 1H frequency, and chemical shift imaging at 13C frequency) were investigated. The effects of fast-switching gradient fields and RF pulses on PET count rate were studied, while the PET insert and the readout electronics were not shielded. Operating the insert inside a 1H volume coil, used for RF transmission and reception, limited the MRI to T1-weighted imaging, due to coil detuning and RF attenuation, and resulted in significant PET count loss. Using a surface receive coil allowed all tested MR sequences to be used with the insert, with 45–59% signal-to-noise ratio (SNR) degradation, compared to without PET. With a 1H/13C volume coil inside the insert and shielded by a copper tube, the SNR degradation was limited to 23–30% with all tested sequences. The insert did not introduce any discernible distortions into images of two tested EPI sequences. Use of truncated sinc shaped RF excitation pulses and gradient field switching had negligible effects on PET count rate. However, PET count rate was substantially affected by high-power RF block pulses and temperature variations due to high gradient duty cycles.

  9. Birth and death of genes linked to chromosomal inversion

    PubMed Central

    Furuta, Yoshikazu; Kawai, Mikihiko; Yahara, Koji; Takahashi, Noriko; Handa, Naofumi; Tsuru, Takeshi; Oshima, Kenshiro; Yoshida, Masaru; Azuma, Takeshi; Hattori, Masahira; Uchiyama, Ikuo; Kobayashi, Ichizo

    2011-01-01

    The birth and death of genes is central to adaptive evolution, yet the underlying genome dynamics remain elusive. The availability of closely related complete genome sequences helps to follow changes in gene contents and clarify their relationship to overall genome organization. Helicobacter pylori, bacteria in our stomach, are known for their extreme genome plasticity through mutation and recombination and will make a good target for such an analysis. In comparing their complete genome sequences, we found that gain and loss of genes (loci) for outer membrane proteins, which mediate host interaction, occurred at breakpoints of chromosomal inversions. Sequence comparison there revealed a unique mechanism of DNA duplication: DNA duplication associated with inversion. In this process, a DNA segment at one chromosomal locus is copied and inserted, in an inverted orientation, into a distant locus on the same chromosome, while the entire region between these two loci is also inverted. Recognition of this and three more inversion modes, which occur through reciprocal recombination between long or short sequence similarity or adjacent to a mobile element, allowed reconstruction of synteny evolution through inversion events in this species. These results will guide the interpretation of extensive DNA sequencing results for understanding long- and short-term genome evolution in various organisms and in cancer cells. PMID:21212362

  10. Rapid molecular sexing of three-spined sticklebacks, Gasterosteus aculeatus L., based on large Y-chromosomal insertions.

    PubMed

    Bakker, Theo C M; Giger, Thomas; Frommen, Joachim G; Largiadèr, Carlo R

    2017-08-01

    There is a need for rapid and reliable molecular sexing of three-spined sticklebacks, Gasterosteus aculeatus, the supermodel species for evolutionary biology. A DNA region at the 5' end of the sex-linked microsatellite Gac4202 was sequenced for the X chromosome of six females and the Y chromosome of five males from three populations. The Y chromosome contained two large insertions, which did not recombine with the phenotype of sex in a cross of 322 individuals. Genetic variation (SNPs and indels) within the insertions was smaller than on flanking DNA sequences. Three molecular PCR-based sex tests were developed, in which the first, the second or both insertions were covered. In five European populations (from DE, CH, NL, GB) of three-spined sticklebacks, tests with both insertions combined showed two clearly separated bands on agarose minigels in males and one band in females. The tests with the separate insertions gave similar results. Thus, the new molecular sexing method gave rapid and reliable results for sexing three-spined sticklebacks and is an improvement and/or alternative to existing methods.

  11. Recombination, rearrangement, reshuffling, and divergence in a centromeric region of rice.

    PubMed

    Ma, Jianxin; Bennetzen, Jeffrey L

    2006-01-10

    Centromeres have many unusual biological properties, including kinetochore attachment and severe repression of local meiotic recombination. These properties are partly an outcome, partly a cause, of unusual DNA structure in the centromeric region. Although several plant and animal genomes have been sequenced, most centromere sequences have not been completed or analyzed in depth. To shed light on the unique organization, variability, and evolution of centromeric DNA, detailed analysis of a 1.97-Mb sequence that includes centromere 8 (CEN8) of japonica rice was undertaken. Thirty-three long-terminal repeat (LTR)-retrotransposon families (including 11 previously unknown) were identified in the CEN8 region, totaling 245 elements and fragments that account for 67% of the region. The ratio of solo LTRs to intact elements in the CEN8 region is approximately 0.9:1, compared with approximately 2.2:1 in noncentromeric regions of rice. However, the ratio of solo LTRs to intact elements in the core of the CEN8 region ( approximately 2.5:1) is higher than in any other region investigated in rice, suggesting a hotspot for unequal recombination. Comparison of the CEN8 region of japonica and its orthologous segments from indica rice indicated that approximately 15% of the intact retrotransposons and solo LTRs were inserted into CEN8 after the divergence of japonica and indica from a common ancestor, compared with approximately 50% for previously studied euchromatic regions. Frequent DNA rearrangements were observed in the CEN8 region, including a 212-kb subregion that was found to be composed of three rearranged tandem repeats. Phylogenetic analysis also revealed recent segmental duplication and extensive rearrangement and reshuffling of the CentO satellite repeats.

  12. Import of honeybee prepromelittin into the endoplasmic reticulum: structural basis for independence of SRP and docking protein.

    PubMed Central

    Müller, G; Zimmermann, R

    1987-01-01

    Honeybee prepromelittin is correctly processed and imported by dog pancreas microsomes. Insertion of prepromelittin into microsomal membranes, as assayed by signal sequence removal, does not depend on signal recognition particle (SRP) and docking protein. We addressed the question as to how prepromelittin bypasses the SRP/docking protein system. Hybrid proteins between prepromelittin, or carboxy-terminally truncated derivatives, and the cytoplasmic protein dihydrofolate reductase from mouse were constructed. These hybrid proteins were analysed for membrane insertion and sequestration into microsomes. The results suggest the following: (i) The signal sequence of prepromelittin is capable of interacting with the SRP/docking protein system, but this interaction is not mandatory for membrane insertion; this is related to the small size of prepromelittin. (ii) In prepromelittin a cluster of negatively charged amino acids must be balanced by a cluster of positively charged amino acids in order to allow membrane insertion. (iii) In general, a signal sequence can be sufficient to mediate membrane insertion independently of SRP and docking protein in the case of short precursor proteins; however, the presence and distribution of charged amino acids within the mature part of these precursors can play distinct roles. Images Fig. 3. Fig. 4. Fig. 5. Fig. 6. Fig. 7. Fig. 8. Fig. 9. PMID:2820722

  13. Diversity, distribution and dynamics of full-length Copia and Gypsy LTR retroelements in Solanum lycopersicum.

    PubMed

    Paz, Rosalía Cristina; Kozaczek, Melisa Eliana; Rosli, Hernán Guillermo; Andino, Natalia Pilar; Sanchez-Puerta, Maria Virginia

    2017-10-01

    Transposable elements are the most abundant components of plant genomes and can dramatically induce genetic changes and impact genome evolution. In the recently sequenced genome of tomato (Solanum lycopersicum), the estimated fraction of elements corresponding to retrotransposons is nearly 62%. Given that tomato is one of the most important vegetable crop cultivated and consumed worldwide, understanding retrotransposon dynamics can provide insight into its evolution and domestication processes. In this study, we performed a genome-wide in silico search of full-length LTR retroelements in the tomato nuclear genome and annotated 736 full-length Gypsy and Copia retroelements. The dispersion level across the 12 chromosomes, the diversity and tissue-specific expression of those elements were estimated. Phylogenetic analysis based on the retrotranscriptase region revealed the presence of 12 major lineages of LTR retroelements in the tomato genome. We identified 97 families, of which 77 and 20 belong to the superfamilies Copia and Gypsy, respectively. Each retroelement family was characterized according to their element size, relative frequencies and insertion time. These analyses represent a valuable resource for comparative genomics within the Solanaceae, transposon-tagging and for the design of cultivar-specific molecular markers in tomato.

  14. The unique genomic landscape surrounding the EPSPS gene in glyphosate resistant Amaranthus palmeri: a repetitive path to resistance.

    PubMed

    Molin, William T; Wright, Alice A; Lawton-Rauh, Amy; Saski, Christopher A

    2017-01-17

    The expanding number and global distributions of herbicide resistant weedy species threaten food, fuel, fiber and bioproduct sustainability and agroecosystem longevity. Amongst the most competitive weeds, Amaranthus palmeri S. Wats has rapidly evolved resistance to glyphosate primarily through massive amplification and insertion of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene across the genome. Increased EPSPS gene copy numbers results in higher titers of the EPSPS enzyme, the target of glyphosate, and confers resistance to glyphosate treatment. To understand the genomic unit and mechanism of EPSPS gene copy number proliferation, we developed and used a bacterial artificial chromosome (BAC) library from a highly resistant biotype to sequence the local genomic landscape flanking the EPSPS gene. By sequencing overlapping BACs, a 297 kb sequence was generated, hereafter referred to as the "EPSPS cassette." This region included several putative genes, dense clusters of tandem and inverted repeats, putative helitron and autonomous replication sequences, and regulatory elements. Whole genome shotgun sequencing (WGS) of two biotypes exhibiting high and no resistance to glyphosate was performed to compare genomic representation across the EPSPS cassette. Mapping of sequences for both biotypes to the reference EPSPS cassette revealed significant differences in upstream and downstream sequences relative to EPSPS with regard to both repetitive units and coding content between these biotypes. The differences in sequence may have resulted from a compounded-building mechanism such as repetitive transpositional events. The association of putative helitron sequences with the cassette suggests a possible amplification and distribution mechanism. Flow cytometry revealed that the EPSPS cassette added measurable genomic content. The adoption of glyphosate resistant cropping systems in major crops such as corn, soybean, cotton and canola coupled with excessive use of glyphosate herbicide has led to evolved glyphosate resistance in several important weeds. In Amaranthus palmeri, the amplification of the EPSPS cassette, characterized by a complex array of repetitive elements and putative helitron sequences, suggests an adaptive structural genomic mechanism that drives amplification and distribution around the genome. The added genomic content not found in glyphosate sensitive plants may be driving evolution through genome expansion.

  15. Characterization of a cfr-Carrying Plasmid from Porcine Escherichia coli That Closely Resembles Plasmid pEA3 from the Plant Pathogen Erwinia amylovora.

    PubMed

    Zhang, Rongmin; Sun, Bin; Wang, Yang; Lei, Lei; Schwarz, Stefan; Wu, Congming

    2016-01-01

    The multiresistance gene cfr was found in two porcine Escherichia coli isolates, one harboring it on the conjugative 33,885-bp plasmid pFSEC-01, the other harboring it in the chromosomal DNA. Sequence analysis of pFSEC-01 revealed that a 6,769-bp fragment containing the cfr gene bracketed by two IS26 elements was inserted into a plasmid closely related to pEA3 from the plant pathogen Erwinia amylovora, suggesting that pFSEC-01 may be transferred between different bacterial genera of both animal and plant origin. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  16. Tn5253 family integrative and conjugative elements carrying mef(I) and catQ determinants in Streptococcus pneumoniae and Streptococcus pyogenes.

    PubMed

    Mingoia, Marina; Morici, Eleonora; Morroni, Gianluca; Giovanetti, Eleonora; Del Grosso, Maria; Pantosti, Annalisa; Varaldo, Pietro E

    2014-10-01

    The linkage between the macrolide efflux gene mef(I) and the chloramphenicol inactivation gene catQ was first described in Streptococcus pneumoniae (strain Spn529), where the two genes are located in a module designated IQ element. Subsequently, two different defective IQ elements were detected in Streptococcus pyogenes (strains Spy029 and Spy005). The genetic elements carrying the three IQ elements were characterized, and all were found to be Tn5253 family integrative and conjugative elements (ICEs). The ICE from S. pneumoniae (ICESpn529IQ) was sequenced, whereas the ICEs from S. pyogenes (ICESpy029IQ and ICESpy005IQ, the first Tn5253-like ICEs reported in this species) were characterized by PCR mapping, partial sequencing, and restriction analysis. ICESpn529IQ and ICESpy029IQ were found to share the intSp 23FST81 integrase gene and an identical Tn916 fragment, whereas ICESpy005IQ has int5252 and lacks Tn916. All three ICEs were found to lack the linearized pC194 plasmid that is usually associated with Tn5253-like ICEs, and all displayed a single copy of a toxin-antitoxin operon that is typically contained in the direct repeats flanking the excisable pC194 region when this region is present. Two different insertion sites of the IQ elements were detected, one in ICESpn529IQ and ICESpy029IQ, and another in ICESpy005IQ. The chromosomal integration of the three ICEs was site specific, depending on the integrase (intSp 23FST81 or int5252). Only ICESpy005IQ was excised in circular form and transferred by conjugation. By transformation, mef(I) and catQ were cotransferred at a high frequency from S. pyogenes Spy005 and at very low frequencies from S. pneumoniae Spn529 and S. pyogenes Spy029. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  17. Horizontal gene transfer and mobile genetic elements in marine systems.

    PubMed

    Sobecky, Patricia A; Hazen, Tracy H

    2009-01-01

    The pool of mobile genetic elements (MGE) in microbial communities consists of viruses, plasmids, and associated elements (insertion sequences, transposons, and integrons) that are either self-transmissible or use mobile plasmids and viruses as vehicles for their dissemination. This mobilome facilitates the horizontal transfer of genes that promote the evolution and adaptation of microbial communities. Efforts to characterize MGEs from microbial populations resident in a variety of ecological habitats have revealed a surprisingly novel and seemingly untapped biodiversity. To better understand the impact of horizontal gene transfer (HGT), as well as the agents that promote HGT in marine ecosystems and to determine whether or not environmental parameters can effect the composition and structure of the mobilome in marine microbial communities, information on the distribution, diversity, and ecological traits of the marine mobilome is presented. In this chapter we discuss recent insights gained from different methodological approaches used to characterize the biodiversity and ecology of MGE in marine environments and their contributions to HGT. In addition, we present case studies that highlight specific HGT examples in coastal, open-ocean, and deep-sea marine ecosystems.

  18. Metagenomic exploration reveals a marked change in the river resistome and mobilome after treated wastewater discharges.

    PubMed

    Lekunberri, Itziar; Balcázar, José Luis; Borrego, Carles M

    2018-03-01

    Mobile genetic elements (MGEs) are key agents in the spread of antibiotic resistance genes (ARGs) across environments. Here we used metagenomics to compare the river resistome (collection of all ARGs) and mobilome (e.g., integrases, transposases, integron integrases and insertion sequence common region "ISCR" elements) between samples collected upstream (n = 6) and downstream (n = 6) of an urban wastewater treatment plant (UWWTP). In comparison to upstream metagenomes, downstream metagenomes showed a drastic increase in the abundance of ARGs, as well as markers of MGEs, particularly integron integrases and ISCR elements. These changes were accompanied by a concomitant prevalence of 16S rRNA gene signatures of bacteria affiliated to families encompassing well-known human and animal pathogens. Our results confirm that chronic discharges of treated wastewater severely impact the river resistome affecting not only the abundance and diversity of ARGs but also their potential spread by enriching the river mobilome in a wide variety of MGEs. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. MiMIC: a highly versatile transposon insertion resource for engineering Drosophila melanogaster genes

    PubMed Central

    Venken, Koen J. T.; Schulze, Karen L.; Haelterman, Nele A.; Pan, Hongling; He, Yuchun; Evans-Holm, Martha; Carlson, Joseph W.; Levis, Robert W.; Spradling, Allan C.; Hoskins, Roger A.; Bellen, Hugo J.

    2011-01-01

    We demonstrate the versatility of a collection of insertions of the transposon Minos mediated integration cassette (MiMIC), in Drosophila melanogaster. MiMIC contains a gene-trap cassette and the yellow+ marker flanked by two inverted bacteriophage ΦC31 attP sites. MiMIC integrates almost at random in the genome to create sites for DNA manipulation. The attP sites allow the replacement of the intervening sequence of the transposon with any other sequence through recombinase mediated cassette exchange (RMCE). We can revert insertions that function as gene traps and cause mutant phenotypes to wild type by RMCE and modify insertions to control GAL4 or QF overexpression systems or perform lineage analysis using the Flp system. Insertions within coding introns can be exchanged with protein-tag cassettes to create fusion proteins to follow protein expression and perform biochemical experiments. The applications of MiMIC vastly extend the Drosophila melanogaster toolkit. PMID:21985007

  20. Active role of a human genomic insert in replication of a yeast artificial chromosome.

    PubMed

    van Brabant, A J; Fangman, W L; Brewer, B J

    1999-06-01

    Yeast artificial chromosomes (YACs) are a common tool for cloning eukaryotic DNA. The manner by which large pieces of foreign DNA are assimilated by yeast cells into a functional chromosome is poorly understood, as is the reason why some of them are stably maintained and some are not. We examined the replication of a stable YAC containing a 240-kb insert of DNA from the human T-cell receptor beta locus. The human insert contains multiple sites that serve as origins of replication. The activity of these origins appears to require the yeast ARS consensus sequence and, as with yeast origins, additional flanking sequences. In addition, the origins in the human insert exhibit a spacing, a range of activation efficiencies, and a variation in times of activation during S phase similar to those found for normal yeast chromosomes. We propose that an appropriate combination of replication origin density, activation times, and initiation efficiencies is necessary for the successful maintenance of YAC inserts.

Top