Sample records for intronic control sequences

  1. Interactions between the promoter and first intron are involved in transcriptional control of alpha 1(I) collagen gene expression.

    PubMed Central

    Bornstein, P; McKay, J; Liska, D J; Apone, S; Devarayalu, S

    1988-01-01

    The first intron of the human collagen alpha 1(I) gene contains several positively and negatively acting elements. We have studied the transcription of collagen-human growth hormone fusion genes, containing deletions and rearrangements of collagen intronic sequences, by transient transfection of chick tendon fibroblasts and NIH 3T3 cells. In chick tendon fibroblasts, but not in 3T3 cells, inversion of intronic sequences containing a previously studied 274-base-pair segment, A274, resulted in markedly reduced human growth hormone mRNA levels as determined by an RNase protection assay. This inhibitory effect was largely alleviated when deletions were introduced in the collagen promoter of plasmids containing negatively oriented intronic sequences. Evidence for interaction of the promoter with the intronic segment, A274, was obtained by gel mobility shift assays. We suggest that promoter-intron interactions, mediated by DNA-binding proteins, regulate collagen gene transcription. Inversion of intronic segments containing critical interactive elements might then lead to an altered geometry and reduced activity of a transcriptional complex in those cells with sufficiently high levels of appropriate transcription factors. We further suggest that the deleted promoter segment plays a key role in directing DNA interactions involved in transcriptional control. Images PMID:3211130

  2. Spliced RNA of woodchuck hepatitis virus.

    PubMed

    Ogston, C W; Razman, D G

    1992-07-01

    Polymerase chain reaction was used to investigate RNA splicing in liver of woodchucks infected with woodchuck hepatitis virus (WHV). Two spliced species were detected, and the splice junctions were sequenced. The larger spliced RNA has an intron of 1300 nucleotides, and the smaller spliced sequence shows an additional downstream intron of 1104 nucleotides. We did not detect singly spliced sequences from which the smaller intron alone was removed. Control experiments showed that spliced sequences are present in both RNA and DNA in infected liver, showing that the viral reverse transcriptase can use spliced RNA as template. Spliced sequences were detected also in virion DNA prepared from serum. The upstream intron produces a reading frame that fuses the core to the polymerase polypeptide, while the downstream intron causes an inframe deletion in the polymerase open reading frame. Whereas the splicing patterns in WHV are superficially similar to those reported recently in hepatitis B virus, we detected no obvious homology in the coding capacity of spliced RNAs from these two viruses.

  3. Nucleotide sequence of the ribosomal RNA gene of Physarum polycephalum: intron 2 and its flanking regions of the 26S rRNA gene.

    PubMed Central

    Nomiyama, H; Kuhara, S; Kukita, T; Otsuka, T; Sakaki, Y

    1981-01-01

    The 26S ribosomal RNA gene of Physarum polycephalum is interrupted by two introns, and we have previously determined the sequence of one of them (intron 1) (Nomiyama et al. Proc.Natl.Acad.Sci.USA 78, 1376-1380, 1981). In this study we sequenced the second intron (intron 2) of about 0.5 kb length and its flanking regions, and found that one nucleotide at each junction is identical in intron 1 and intron 2, though the junction regions share no other sequence homology. Comparison of the flanking exon sequences to E. coli 23S rRNA sequences shows that conserved sequences are interspersed with tracts having little homology. In particular, the region encompassing the intron 2 interruption site is highly conserved. The E. coli ribosomal protein L1 binding region is also conserved. Images PMID:6171776

  4. A pipeline of programs for collecting and analyzing group II intron retroelement sequences from GenBank

    PubMed Central

    2013-01-01

    Background Accurate and complete identification of mobile elements is a challenging task in the current era of sequencing, given their large numbers and frequent truncations. Group II intron retroelements, which consist of a ribozyme and an intron-encoded protein (IEP), are usually identified in bacterial genomes through their IEP; however, the RNA component that defines the intron boundaries is often difficult to identify because of a lack of strong sequence conservation corresponding to the RNA structure. Compounding the problem of boundary definition is the fact that a majority of group II intron copies in bacteria are truncated. Results Here we present a pipeline of 11 programs that collect and analyze group II intron sequences from GenBank. The pipeline begins with a BLAST search of GenBank using a set of representative group II IEPs as queries. Subsequent steps download the corresponding genomic sequences and flanks, filter out non-group II introns, assign introns to phylogenetic subclasses, filter out incomplete and/or non-functional introns, and assign IEP sequences and RNA boundaries to the full-length introns. In the final step, the redundancy in the data set is reduced by grouping introns into sets of ≥95% identity, with one example sequence chosen to be the representative. Conclusions These programs should be useful for comprehensive identification of group II introns in sequence databases as data continue to rapidly accumulate. PMID:24359548

  5. Group I introns are widespread in archaea.

    PubMed

    Nawrocki, Eric P; Jones, Thomas A; Eddy, Sean R

    2018-05-18

    Group I catalytic introns have been found in bacterial, viral, organellar, and some eukaryotic genomes, but not in archaea. All known archaeal introns are bulge-helix-bulge (BHB) introns, with the exception of a few group II introns. It has been proposed that BHB introns arose from extinct group I intron ancestors, much like eukaryotic spliceosomal introns are thought to have descended from group II introns. However, group I introns have little sequence conservation, making them difficult to detect with standard sequence similarity searches. Taking advantage of recent improvements in a computational homology search method that accounts for both conserved sequence and RNA secondary structure, we have identified 39 group I introns in a wide range of archaeal phyla, including examples of group I introns and BHB introns in the same host gene.

  6. HFE gene polymorphism defined by sequence-based typing of the Brazilian population and a standardized nomenclature for HFE allele sequences.

    PubMed

    Campos, W N; Massaro, J D; Martinelli, A L C; Halliwell, J A; Marsh, S G E; Mendes-Junior, C T; Donadi, E A

    2017-10-01

    The HFE molecule controls iron uptake from gut, and defects in the molecule have been associated with iron overload, particularly in hereditary hemochromatosis. The HFE gene including both coding and boundary intronic regions were sequenced in 304 Brazilian individuals, encompassing healthy individuals and patients exhibiting hereditary or acquired iron overload. Six sites of variation were detected: (1) H63D C>G in exon 2, (2) IVS2 (+4) T>C in intron 2, (3) a C>G transversion in intron 3, (4) C282Y G>A in exon 4, (5) IVS4 (-44) T>C in intron 4, and (6) a new guanine deletion (G>del) in intron 5, which were used for haplotype inference. Nine HFE alleles were detected and six of these were officially named on the basis of the HLA Nomenclature, defined by the World Health Organization (WHO) Nomenclature Committee for Factors of the HLA System, and published via the IPD-IMGT/HLA website. Four alleles, HFE*001, *002, *003, and *004 exhibited variation within their exon sequences. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  7. Bioinformatics analysis of plant orthologous introns: identification of an intronic tRNA-like sequence.

    PubMed

    Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab; Khandekar, Sushant; Crawford, Erin; Zirbel, Craig L; Leisner, Scott; Prakash, Ashwin; Fedorova, Larisa; Fedorov, Alexei

    2014-09-10

    Orthologous introns have identical positions relative to the coding sequence in orthologous genes of different species. By analyzing the complete genomes of five plants we generated a database of 40,512 orthologous intron groups of dicotyledonous plants, 28,519 orthologous intron groups of angiosperms, and 15,726 of land plants (moss and angiosperms). Multiple sequence alignments of each orthologous intron group were obtained using the Mafft algorithm. The number of conserved regions in plant introns appeared to be hundreds of times fewer than that in mammals or vertebrates. Approximately three quarters of conserved intronic regions among angiosperms and dicots, in particular, correspond to alternatively-spliced exonic sequences. We registered only a handful of conserved intronic ncRNAs of flowering plants. However, the most evolutionarily conserved intronic region, which is ubiquitous for all plants examined in this study, including moss, possessed multiple structural features of tRNAs, which caused us to classify it as a putative tRNA-like ncRNA. Intronic sequences encoding tRNA-like structures are not unique to plants. Bioinformatics examination of the presence of tRNA inside introns revealed an unusually long-term association of four glycine tRNAs inside the Vac14 gene of fish, amniotes, and mammals. Copyright © 2014 Elsevier B.V. All rights reserved.

  8. Mitochondrial genes in the colourless alga Prototheca wickerhamii resemble plant genes in their exons but fungal genes in their introns.

    PubMed Central

    Wolff, G; Burger, G; Lang, B F; Kück, U

    1993-01-01

    The mitochondrial DNA from the colourless alga Prototheca wickerhamii contains two mosaic genes as was revealed from complete sequencing of the circular extranuclear genome. The genes for the large subunit of the ribosomal RNA (LSUrRNA) as well as for subunit I of the cytochrome oxidase (coxI) carry two and three intronic sequences respectively. On the basis of their canonical nucleotide sequences they can be classified as group I introns. Phylogenetic comparisons of the coxI protein sequences allow us to conclude that the P.wickerhamii mtDNA is much closer related to higher plant mtDNAs than to those of the chlorophyte alga C.reinhardtii. The comparison of the intron sequences revealed several unusual features: (1) The P.wickerhamii introns are structurally related to mitochondrial introns from various ascomycetous fungi. (2) Phylogenetic analyses indicate a close relationship between fungal and algal intronic sequences. (3) The P. wickerhamii introns are located at positions within the structural genes which can be considered as preferred intron insertion sites in homologous mitochondrial genes from fungi or liverwort. In all cases, the sequences adjacent to the insertion sites are very well conserved over large evolutionary distances. Our finding of highly similar introns in fungi and algae is consistent with the idea that introns have already been present in the bacterial ancestors of present day mitochondria and evolved concomitantly with the organelles. PMID:7680126

  9. Remarkable sequence conservation of the last intron in the PKD1 gene.

    PubMed

    Rodova, Marianna; Islam, M Rafiq; Peterson, Kenneth R; Calvet, James P

    2003-10-01

    The last intron of the PKD1 gene (intron 45) was found to have exceptionally high sequence conservation across four mammalian species: human, mouse, rat, and dog. This conservation did not extend to the comparable intron in pufferfish. Pairwise comparisons for intron 45 showed 91% identity (human vs. dog) to 100% identity (mouse vs. rat) for an average for all four species of 94% identity. In contrast, introns 43 and 44 of the PKD1 gene had average pairwise identities of 57% and 54%, and exons 43, 44, and 45 and the coding region of exon 46 had average pairwise identities of 80%, 84%, 82%, and 80%. Intron 45 is 90 to 95 bp in length, with the major region of sequence divergence being in a central 4-bp to 9-bp variable region. RNA secondary structure analysis of intron 45 predicts a branching stem-loop structure in which the central variable region lies in one loop and the putative branch point sequence lies in another loop, suggesting that the intron adopts a specific stem-loop structure that may be important for its removal. Although intron 45 appears to conform to the class of small, G-triplet-containing introns that are spliced by a mechanism utilizing intron definition, its high sequence conservation may be a reflection of constraints imposed by a unique mechanism that coordinates splicing of this last PKD1 intron with polyadenylation.

  10. Introns: The Functional Benefits of Introns in Genomes.

    PubMed

    Jo, Bong-Seok; Choi, Sun Shim

    2015-12-01

    The intron has been a big biological mystery since it was first discovered in several aspects. First, all of the completely sequenced eukaryotes harbor introns in the genomic structure, whereas no prokaryotes identified so far carry introns. Second, the amount of total introns varies in different species. Third, the length and number of introns vary in different genes, even within the same species genome. Fourth, all introns are copied into RNAs by transcription and DNAs by replication processes, but intron sequences do not participate in protein-coding sequences. The existence of introns in the genome should be a burden to some cells, because cells have to consume a great deal of energy to copy and excise them exactly at the correct positions with the help of complicated spliceosomal machineries. The existence throughout the long evolutionary history is explained, only if selective advantages of carrying introns are assumed to be given to cells to overcome the negative effect of introns. In that regard, we summarize previous research about the functional roles or benefits of introns. Additionally, several other studies strongly suggesting that introns should not be junk will be introduced.

  11. Molecular analysis of the split cox1 gene from the Basidiomycota Agrocybe aegerita: relationship of its introns with homologous Ascomycota introns and divergence levels from common ancestral copies.

    PubMed

    Gonzalez, P; Barroso, G; Labarère, J

    1998-10-05

    The Basidiomycota Agrocybe aegerita (Aa) mitochondrial cox1 gene (6790 nucleotides), encoding a protein of 527aa (58377Da), is split by four large subgroup IB introns possessing site-specific endonucleases assumed to be involved in intron mobility. When compared to other fungal COX1 proteins, the Aa protein is closely related to the COX1 one of the Basidiomycota Schizophyllum commune (Sc). This clade reveals a relationship with the studied Ascomycota ones, with the exception of Schizosaccharomyces pombe (Sp) which ranges in an out-group position compared with both higher fungi divisions. When comparison is extended to other kingdoms, fungal COX1 sequences are found to be more related to algae and plant ones (more than 57.5% aa similarity) than to animal sequences (53.6% aa similarity), contrasting with the previously established close relationship between fungi and animals, based on comparisons of nuclear genes. The four Aa cox1 introns are homologous to Ascomycota or algae cox1 introns sharing the same location within the exonic sequences. The percentages of identity of the intronic nucleotide sequences suggest a possible acquisition by lateral transfers of ancestral copies or of their derived sequences. These identities extend over the whole intronic sequences, arguing in favor of a transfer of the complete intron rather than a transfer limited to the encoded ORF. The intron i4 shares 74% of identity, at the nucleotidic level, with the Podospora anserina (Pa) intron i14, and up to 90.5% of aa similarity between the encoded proteins, i.e. the highest values reported to date between introns of two phylogenetically distant species. This low divergence argues for a recent lateral transfer between the two species. On the contrary, the low sequence identities (below 36%) observed between Aa i1 and the homologous Sp i1 or Prototheca wickeramii (Pw) i1 suggest a long evolution time after the separation of these sequences. The introns i2 and i3 possessed intermediate percentages of identity with their homologous Ascomycota introns. This is the first report of the complete nucleotide sequence and molecular organization of a mitochondrial cox1 gene of any member of the Basidiomycota division.

  12. Splicing-Related Features of Introns Serve to Propel Evolution

    PubMed Central

    Luo, Yuping; Li, Chun; Gong, Xi; Wang, Yanlu; Zhang, Kunshan; Cui, Yaru; Sun, Yi Eve; Li, Siguang

    2013-01-01

    The role of spliceosomal intronic structures played in evolution has only begun to be elucidated. Comparative genomic analyses of fungal snoRNA sequences, which are often contained within introns and/or exons, revealed that about one-third of snoRNA-associated introns in three major snoRNA gene clusters manifested polymorphisms, likely resulting from intron loss and gain events during fungi evolution. Genomic deletions can clearly be observed as one mechanism underlying intron and exon loss, as well as generation of complex introns where several introns lie in juxtaposition without intercalating exons. Strikingly, by tracking conserved snoRNAs in introns, we found that some introns had moved from one position to another by excision from donor sites and insertion into target sties elsewhere in the genome without needing transposon structures. This study revealed the origin of many newly gained introns. Moreover, our analyses suggested that intron-containing sequences were more prone to sustainable structural changes than DNA sequences without introns due to intron's ability to jump within the genome via unknown mechanisms. We propose that splicing-related structural features of introns serve as an additional motor to propel evolution. PMID:23516505

  13. Colonization of heterochromatic genes by transposable elements in Drosophila.

    PubMed

    Dimitri, Patrizio; Junakovic, Nikolaj; Arcà, Bruno

    2003-04-01

    As a further step toward understanding transposable element-host genome interactions, we investigated the molecular anatomy of introns from five heterochromatic and 22 euchromatic protein-coding genes of Drosophila melanogaster. A total of 79 kb of intronic sequences from heterochromatic genes and 355 kb of intronic sequences from euchromatic genes have been used in Blast searches against Drosophila transposable elements (TEs). The results show that TE-homologous sequences belonging to 19 different families represent about 50% of intronic DNA from heterochromatic genes. In contrast, only 0.1% of the euchromatic intron DNA exhibits homology to known TEs. Intraspecific and interspecific size polymorphisms of introns were found, which are likely to be associated with changes in TE-related sequences. Together, the enrichment in TEs and the apparent dynamic state of heterochromatic introns suggest that TEs contribute significantly to the evolution of genes located in heterochromatin.

  14. Evolution of Mhc-DRB introns: implications for the origin of primates.

    PubMed

    Kupfermann, H; Satta, Y; Takahata, N; Tichy, H; Klein, J

    1999-06-01

    Introns are generally believed to evolve too rapidly and too erratically to be of much use in phylogenetic reconstructions. Few phylogenetically informative intron sequences are available, however, to ascertain the validity of this supposition. In the present study the supposition was tested on the example of the mammalian class II major histocompatibility complex (Mhc) genes of the DRB family. Since the Mhc genes evolve under balancing selection and are believed to recombine or rearrange frequently, the evolution of their introns could be expected to be particularly rapid and subject to scrambling. Sequences of intron 4 and 5 DRB genes were obtained from polymerase chain reaction-amplified fragments of genomic DNA from representatives of six eutherian orders-Primates, Scandentia, Chiroptera, Dermoptera, Lagomorpha, and Insectivora. Although short stretches of the introns have indeed proved to be unalignable, the bulk of the intron sequences from all six orders, spanning >85 million years (my) of evolution, could be aligned and used in a study of the tempo and mode of intron evolution. The analysis has revealed the Mhc introns to evolve at a rate similar to that of other genes and of synonymous sites of non-Mhc genes. No evidence of homogenization or large-scale scrambling of the intron sequences could be found. The Mhc introns apparently evolve largely by point mutations and insertions/deletions. The phylogenetic signals contained in the intron sequences could be used to identify Scandentia as the sister group of Primates, to support the existence of the Archonta superorder, and to confirm the monophyly of the Chiroptera.

  15. An intronic open reading frame was released from one of group II introns in the mitochondrial genome of the haptophyte Chrysochromulina sp. NIES-1333

    PubMed Central

    Nishimura, Yuki; Kamikawa, Ryoma; Hashimoto, Tetsuo; Inagaki, Yuji

    2014-01-01

    Mitochondrial (mt) genome sequences, which often bear introns, have been sampled from phylogenetically diverse eukaryotes. Thus, we can anticipate novel insights into intron evolution from previously unstudied mt genomes. We here investigated the origins and evolution of three introns in the mt genome of the haptophyte Chrysochromulina sp. NIES-1333, which was sequenced completely in this study. All the three introns were characterized as group II, on the basis of predicted secondary structure, and the conserved sequence motifs at the 5′ and 3′ termini. Our comparative studies on diverse mt genomes prompt us to propose that the Chrysochromulina mt genome laterally acquired the introns from mt genomes in distantly related eukaryotes. Many group II introns harbor intronic open reading frames for the proteins (intron-encoded proteins or IEPs), which likely facilitate the splicing of their host introns. However, we propose that a “free-standing,” IEP-like protein, which is not encoded within any introns in the Chrysochromulina mt genome, is involved in the splicing of the first cox1 intron that lacks any open reading frames. PMID:25054084

  16. Sequence Variation of the tRNALeu Intron as a Marker for Genetic Diversity and Specificity of Symbiotic Cyanobacteria in Some Lichens

    PubMed Central

    Paulsrud, Per; Lindblad, Peter

    1998-01-01

    We examined the genetic diversity of Nostoc symbionts in some lichens by using the tRNALeu (UAA) intron as a genetic marker. The nucleotide sequence was analyzed in the context of the secondary structure of the transcribed intron. Cyanobacterial tRNALeu (UAA) introns were specifically amplified from freshly collected lichen samples without previous DNA extraction. The lichen species used in the present study were Nephroma arcticum, Peltigera aphthosa, P. membranacea, and P. canina. Introns with different sizes around 300 bp were consistently obtained. Multiple clones from single PCRs were screened by using their single-stranded conformational polymorphism pattern, and the nucleotide sequence was determined. No evidence for sample heterogenity was found. This implies that the symbiont in situ is not a diverse community of cyanobionts but, rather, one Nostoc strain. Furthermore, each lichen thallus contained only one intron type, indicating that each thallus is colonized only once or that there is a high degree of specificity. The same cyanobacterial intron sequence was also found in samples of one lichen species from different localities. In a phylogenetic analysis, the cyanobacterial lichen sequences grouped together with the sequences from two free-living Nostoc strains. The size differences in the intron were due to insertions and deletions in highly variable regions. The sequence data were used in discussions concerning specificity and biology of the lichen symbiosis. It is concluded that the tRNALeu (UAA) intron can be of great value when examining cyanobacterial diversity. PMID:9435083

  17. Hypervariable and highly divergent intron-exon organizations in the chordate Oikopleura dioica.

    PubMed

    Edvardsen, Rolf B; Lerat, Emmanuelle; Maeland, Anne Dorthea; Flåt, Mette; Tewari, Rita; Jensen, Marit F; Lehrach, Hans; Reinhardt, Richard; Seo, Hee-Chan; Chourrout, Daniel

    2004-10-01

    Oikopleura dioica is a pelagic tunicate with a very small genome and a very short life cycle. In order to investigate the intron-exon organizations in Oikopleura, we have isolated and characterized ribosomal protein EF-1alpha, Hox, and alpha-tubulin genes. Their intron positions have been compared with those of the same genes from various invertebrates and vertebrates, including four species with entirely sequenced genomes. Oikopleura genes, like Caenorhabditis genes, have introns at a large number of nonconserved positions, which must originate from late insertions or intron sliding of ancient insertions. Both species exhibit hypervariable intron-exon organization within their alpha-tubulin gene family. This is due to localization of most nonconserved intron positions in single members of this gene family. The hypervariability and divergence of intron positions in Oikopleura and Caenorhabditis may be related to the predominance of short introns, the processing of which is not very dependent upon the exonic environment compared to large introns. Also, both species have an undermethylated genome, and the control of methylation-induced point mutations imposes a control on exon size, at least in vertebrate genes. That introns placed at such variable positions in Oikopleura or C. elegans may serve a specific purpose is not easy to infer from our current knowledge and hypotheses on intron functions. We propose that new introns are retained in species with very short life cycles, because illegitimate exchanges including gene conversion are repressed. We also speculate that introns placed at gene-specific positions may contribute to suppressing these exchanges and thereby favor their own persistence.

  18. Mechanism for DNA transposons to generate introns on genomic scales

    PubMed Central

    Huff, Jason T.; Zilberman, Daniel; Roy, Scott W.

    2017-01-01

    Discovered four decades ago, the existence of introns was one of the most unexpected findings in molecular biology1. Introns are sequences interrupting genes that must be removed as part of mRNA production. Genome sequencing projects have documented that most eukaryotic genes contain at least one and frequently many introns2,3. Comparison of these genomes reveals a history of long evolutionary periods with little intron gain punctuated by episodes of rapid, extensive gain2,3. However, no detailed mechanism for such episodic intron generation has been empirically supported on a sufficient scale, despite several proposals4–8. Here we show how short non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from gene sequence duplicated upon transposon insertion, allowing perfect splicing out of RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between preexisting nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases2 and prevalence of nucleosome-sized exons9 observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism plausibly accounting for episodes of rapid, extensive intron gain during eukaryotic evolution2,3. PMID:27760113

  19. Malonyl CoA decarboxylase deficiency: C to T transition in intron 2 of the MCD gene.

    PubMed

    Surendran, S; Sacksteder, K A; Gould, S J; Coldwell, J G; Rady, P L; Tyring, S K; Matalon, R

    2001-09-15

    Malonyl CoA decarboxylase (MCD) is an enzyme involved in the metabolism of fatty acids synthesis. Based on reports of MCD deficiency, this enzyme is particular important in muscle and brain metabolism. Mutations in the MCD gene result in a deficiency of MCD activity, that lead to psychomotor retardation, cardiomyopathy and neonatal death. To date however, only a few patients have been reported with defects in MCD. We report here studies of a patient with MCD deficiency, who presented with hypotonia, cardiomyopathy and psychomotor retardation. DNA sequencing of MCD revealed a homozygous intronic mutation, specifically a -5 C to T transition near the acceptor site for exon 3. RT-PCR amplification of exons 2 and 3 revealed that although mRNA from a normal control sample yielded one major DNA band, the mutant mRNA sample resulted in two distinct DNA fragments. Sequencing of the patient's two RT-PCR products revealed that the larger molecular weight fragments contained exons 2 and 3 as well as the intervening intronic sequence. The smaller size band from the patient contained the properly spliced exons, similar to the normal control. Western blotting analysis of the expressed protein showed only a faint band in the patient sample in contrast to a robust band in the control. In addition, the enzyme activity of the mutant protein was lower than that of the control protein. The data indicate that homozygous mutation in intron 2 disrupt normal splicing of the gene, leading to lower expression of the MCD protein and MCD deficiency. Copyright 2001 Wiley-Liss, Inc.

  20. Tissue- and case-specific retention of intron 40 in mature dystrophin mRNA.

    PubMed

    Nishida, Atsushi; Minegishi, Maki; Takeuchi, Atsuko; Niba, Emma Tabe Eko; Awano, Hiroyuki; Lee, Tomoko; Iijima, Kazumoto; Takeshima, Yasuhiro; Matsuo, Masafumi

    2015-06-01

    The dystrophin gene, which is mutated in Duchenne muscular dystrophy (DMD), comprises 79 exons that show multiple alternative splicing events. Intron retention, a type of alternative splicing, may control gene expression. We examined intron retention in dystrophin introns by reverse-transcription PCR from skeletal muscle, focusing on the nine shortest (all <1000 bp), because these are more likely to be retained. Only one, intron 40, was retained in mRNA; sequencing revealed insertion of a complete intron 40 (851 nt) between exons 40 and 41. The intron 40 retention product accounted for 1.2% of the total product but had a premature stop codon at the fifth intronic codon. Intron 40 retention was most strongly observed in the kidney (36.6%) and was not obtained from the fetal liver, lung, spleen or placenta. This indicated that intron retention is a tissue-specific event whose level varies among tissues. In two DMD patients, intron 40 retention was observed in one patient but not in the other. Examination of splicing regulatory factors revealed that intron 40 had the highest guanine-cytosine content of all examined introns in a 30-nt segment at its 3' end. Further studies are needed to clarify the biological role of intron 40-retained dystrophin mRNA.

  1. Genomic structure of two ras family genes in the slime mold Physarum polycephalum.

    PubMed

    Trzcińska-Danielewicz, Joanna; Kozlowski, Piotr; Gierdal, Katarzyna; Wiejak, Jolanta; Jagielski, Adam; Toczko, Kazimierz; Fronk, Jan

    2002-08-01

    Genomic structure of two Physarum polycephalum ras family genes, Ppras2 and Pprap1, has been determined, including the upstream region of the latter. The genes are interrupted by three and four introns, respectively. The first intron of Ppras2 has the same location within the coding sequence as the first intron in another ras homolog from this organism, Ppras1 [Trzcińska-Danielewicz, J., Kozlowski, P., and Toczko, K. (1996). "Cloning and genomic sequence of the Physarum polycephalum Ppras1 gene, a homologue of the ras protooncogene", Gene 169, pp. 143-144]. All introns, ranging from 53 to ca. 460 base pairs, have the canonical 5' and 3' ends, are greatly enriched in pyrimidines in the coding strand and have frequent pyrimidines-only tracts. These latter features seem to be responsible for the difficulties in cloning and sequencing of parts of these genes. Short sequences shared with P. polycephalum transposon-like repeats are common in the introns, indicating a possible role of transposition in intron evolution. In all three ras family genes phase zero introns are located mostly between sequences coding for regular protein secondary structure elements.

  2. Hidden genetic history of the Japanese sand dollar Peronella (Echinoidea: Laganidae) revealed by nuclear intron sequences.

    PubMed

    Endo, Megumi; Hirose, Mamiko; Honda, Masanao; Koga, Hiroyuki; Morino, Yoshiaki; Kiyomoto, Masato; Wada, Hiroshi

    2018-06-15

    The marine environment around Japan experienced significant changes during the Cenozoic Era. In this study, we report findings suggesting that this dynamic history left behind traces in the genome of the Japanese sand dollar species Peronella japonica and P. rubra. Although mitochondrial Cytochrome C Oxidase I sequences did not indicate fragmentation of the current local populations of P. japonica around Japan, two different types of intron sequence were found in the Alx1 locus. We inferred that past fragmentation of the populations account for the presence of two types of nuclear sequences as alleles in the Alx1 intron of P. japonica. It is likely that the split populations have intermixed in recent times; hence, we did not detect polymorphisms in the sequences reflecting the current localization of the species. In addition, we found two allelic sequences of theAlx1 intron in the sister species P. rubra. The divergence times of the two types of Alx1 intron sequences were estimated at approximately 14.9 and 4.0 million years ago for P. japonica and P. rubra, respectively. Our study indicates that information from the intron sequences of nuclear genes can enhance our understanding of past genetic events in organisms. Copyright © 2018 Elsevier B.V. All rights reserved.

  3. Comparative Analysis of Vertebrate Dystrophin Loci Indicate Intron Gigantism as a Common Feature

    PubMed Central

    Pozzoli, Uberto; Elgar, Greg; Cagliani, Rachele; Riva, Laura; Comi, Giacomo P.; Bresolin, Nereo; Bardoni, Alessandra; Sironi, Manuela

    2003-01-01

    The human DMD gene is the largest known to date, spanning > 2000 kb on the X chromosome. The gene size is mainly accounted for by huge intronic regions. We sequenced 190 kb of Fugu rubripes (pufferfish) genomic DNA corresponding to the complete dystrophin gene (FrDMD) and provide the first report of gene structure and sequence comparison among dystrophin genomic sequences from different vertebrate organisms. Almost all intron positions and phases are conserved between FrDMD and its mammalian counterparts, and the predicted protein product of the Fugu gene displays 55% identity and 71% similarity to human dystrophin. In analogy to the human gene, FrDMD presents several-fold longer than average intronic regions. Analysis of intron sequences of the human and murine genes revealed that they are extremely conserved in size and that a similar fraction of total intron length is represented by repetitive elements; moreover, our data indicate that intron expansion through repeat accumulation in the two orthologs is the result of independent insertional events. The hypothesis that intron length might be functionally relevant to the DMD gene regulation is proposed and substantiated by the finding that dystrophin intron gigantism is common to the three vertebrate genes. [Supplemental material is available online at www.genome.org.] PMID:12727896

  4. Mollusk genes encoding lysine tRNA (UUU) contain introns.

    PubMed

    Matsuo, M; Abe, Y; Saruta, Y; Okada, N

    1995-11-20

    New intron-containing genes encoding tRNAs were discovered when genomic DNA isolated from various animal species was amplified by the polymerase chain reaction (PCR) with primers based on sequences of rabbit tRNA(Lys). From sequencing analysis of the products of PCR, we found that introns are present in several genes encoding tRNA(Lys) in mollusks, such as Loligo bleekeri (squid) and Octopus vulgaris (octopus). These introns were specific to genes encoding tRNA(Lys)(CUU) and were not present in genes encoding tRNA(Lys)(CUU). In addition, the sequences of the introns were different from one another. To confirm the results of our initial experiments, we isolated and sequenced genes encoding tRNA(Lys)(CUU) and tRNA(Lys)(UUU). The gene for tRNA(Lys)(UUU) from squid contained an intron, whose sequence was the same as that identified by PCR, and the gene formed a cluster with a corresponding pseudogene. Several DNA regions of 2.1 kb containing this cluster appeared to be tandemly arrayed in the squid genome. By contrast, the gene encoding tRNA(Lys)(CUU) did not contain an intron, as shown also by PCR. The tRNA(Lys)(UUU) that corresponded to the analyzed gene was isolated and characterized. The present study provides the first example of an intron-containing gene encoding a tRNA in mollusks and suggests the universality of introns in such genes in higher eukaryotes.

  5. Euglena gracilis chloroplast DNA: analysis of a 1.6 kb intron of the psb C gene containing an open reading frame of 458 codons.

    PubMed

    Montandon, P E; Vasserot, A; Stutz, E

    1986-01-01

    We retrieved a 1.6 kbp intron separating two exons of the psb C gene which codes for the 44 kDa reaction center protein of photosystem II. This intron is 3 to 4 times the size of all previously sequenced Euglena gracilis chloroplast introns. It contains an open reading frame of 458 codons potentially coding for a basic protein of 54 kDa of yet unknown function. The intron boundaries follow consensus sequences established for chloroplast introns related to class II and nuclear pre-mRNA introns. Its 3'-terminal segment has structural features similar to class II mitochondrial introns with an invariant base A as possible branch point for lariat formation.

  6. Putative cross-kingdom horizontal gene transfer in sponge (Porifera) mitochondria.

    PubMed

    Rot, Chagai; Goldfarb, Itay; Ilan, Micha; Huchon, Dorothée

    2006-09-14

    The mitochondrial genome of Metazoa is usually a compact molecule without introns. Exceptions to this rule have been reported only in corals and sea anemones (Cnidaria), in which group I introns have been discovered in the cox1 and nad5 genes. Here we show several lines of evidence demonstrating that introns can also be found in the mitochondria of sponges (Porifera). A 2,349 bp fragment of the mitochondrial cox1 gene was sequenced from the sponge Tetilla sp. (Spirophorida). This fragment suggests the presence of a 1143 bp intron. Similar to all the cnidarian mitochondrial introns, the putative intron has group I intron characteristics. The intron is present in the cox1 gene and encodes a putative homing endonuclease. In order to establish the distribution of this intron in sponges, the cox1 gene was sequenced from several representatives of the demosponge diversity. The intron was found only in the sponge order Spirophorida. A phylogenetic analysis of the COI protein sequence and of the intron open reading frame suggests that the intron may have been transmitted horizontally from a fungus donor. Little is known about sponge-associated fungi, although in the last few years the latter have been frequently isolated from sponges. We suggest that the horizontal gene transfer of a mitochondrial intron was facilitated by a symbiotic relationship between fungus and sponge. Ecological relationships are known to have implications at the genomic level. Here, an ecological relationship between sponge and fungus is suggested based on the genomic analysis.

  7. Determinism and randomness in the evolution of introns and sine inserts in mouse and human mitochondrial solute carrier and cytokine receptor genes.

    PubMed

    Cianciulli, Antonia; Calvello, Rosa; Panaro, Maria A

    2015-04-01

    In the homologous genes studied, the exons and introns alternated in the same order in mouse and human. We studied, in both species: corresponding short segments of introns, whole corresponding introns and complete homologous genes. We considered the total number of nucleotides and the number and orientation of the SINE inserts. Comparisons of mouse and human data series showed that at the level of individual relatively short segments of intronic sequences the stochastic variability prevails in the local structuring, but at higher levels of organization a deterministic component emerges, conserved in mouse and human during the divergent evolution, despite the ample re-editing of the intronic sequences and the fact that processes such as SINE spread had taken place in an independent way in the two species. Intron conservation is negatively correlated with the SINE occupancy, suggesting that virus inserts interfere with the conservation of the sequences inherited from the common ancestor. Copyright © 2015 Elsevier Ltd. All rights reserved.

  8. The utility of DNA sequences of an intron from the beta-fibrinogen gene in phylogenetic analysis of woodpeckers (Aves: Picidae).

    PubMed

    Prychitko, T M; Moore, W S

    1997-10-01

    Estimating phylogenies from DNA sequence data has become the major methodology of molecular phylogenetics. To date, molecular phylogenetics of the vertebrates has been very dependent on mtDNA, but studies involving mtDNA are limited because the several genes comprising the mt-genome are inherited as a single linkage group. The only apparent solution to this problem is to sequence additional genes, each representing a distinct linkage group, so that the resultant gene trees provide independent estimates of the species tree. There exists the need to find novel gene sequences which contain enough phylogenetic information to resolve relationships between closely related species. A possible source is the nuclear-encoded introns, because they evolve more rapidly than exons. We designed primers to amplify and sequence the 7 intron from the beta-fibrinogen gene for a recently evolved group, the woodpeckers. We sequenced the entire intron for 10 specimens representing five species. Nucleotide substitutions are randomly distributed along the length of the intron, suggesting selective neutrality. A preliminary analysis indicates that the phylogenetic signal in the intron is as strong as that in the mitochondrial encoded cytochrome b (cyt b) gene. The topology of the beta-fibrinogen tree is identical to that of the cyt b tree. This analysis demonstrates the ability of the 7 intron of beta-fibrinogen to provide well resolved, independent gene trees for recently evolved groups and establishes it as a source of sequences to be used in other phylogenetic studies. Copyright 1997 Academic Press

  9. The wheat cytochrome oxidase subunit II gene has an intron insert and three radical amino acid changes relative to maize

    PubMed Central

    Bonen, Linda; Boer, Poppo H.; Gray, Michael W.

    1984-01-01

    We have determined the sequence of the wheat mitochondrial gene for cytochrome oxidase subunit II (COII) and find that its derived protein sequence differs from that of maize at only three amino acid positions. Unexpectedly, all three replacements are non-conservative ones. The wheat COII gene has a highly-conserved intron at the same position as in maize, but the wheat intron is 1.5 times longer because of an insert relative to its maize counterpart. Hybridization analysis of mitochondrial DNA from rye, pea, broad bean and cucumber indicates strong sequence conservation of COII coding sequences among all these higher plants. However, only rye and maize mitochondrial DNA show homology with wheat COII intron sequences and rye alone with intron-insert sequences. We find that a sequence identical to the region of the 5' exon corresponding to the transmembrane domain of the COII protein is present at a second genomic location in wheat mitochondria. These variations in COII gene structure and size, as well as the presence of repeated COII sequences, illustrate at the DNA sequence level, factors which contribute to higher plant mitochondrial DNA diversity and complexity. ImagesFig. 3.Fig. 4.Fig. 5. PMID:16453565

  10. Fungal origin by horizontal transfer of a plant mitochondrial group I intron in the chimeric CoxI gene of Peperomia.

    PubMed

    Vaughn, J C; Mason, M T; Sper-Whitis, G L; Kuhlman, P; Palmer, J D

    1995-11-01

    We present phylogenetic evidence that a group I intron in an angiosperm mitochondrial gene arose recently by horizontal transfer from a fungal donor species. A 1,716-bp fragment of the mitochondrial coxI gene from the angiosperm Peperomia polybotrya was amplified via the polymerase chain reaction and sequenced. Comparison to other coxI genes revealed a 966-bp group I intron, which, based on homology with the related yeast coxI intron aI4, potentially encodes a 279-amino-acid site-specific DNA endonuclease. This intron, which is believed to function as a ribozyme during its own splicing, is not present in any of 19 coxI genes examined from other diverse vascular plant species. Phylogenetic analysis of intron origin was carried out using three different tree-generating algorithms, and on a variety of nucleotide and amino acid data sets from the intron and its flanking exon sequences. These analyses show that the Peperomia coxI gene intron and exon sequences are of fundamentally different evolutionary origin. The Peperomia intron is more closely related to several fungal mitochondrial introns, two of which are located at identical positions in coxI, than to identically located coxI introns from the land plant Marchantia and the green alga Prototheca. Conversely, the exon sequence of this gene is, as expected, most closely related to other angiosperm coxI genes. These results, together with evidence suggestive of co-conversion of exonic markers immediately flanking the intron insertion site, lead us to conclude that the Peperomia coxI intron probably arose by horizontal transfer from a fungal donor, using the double-strand-break repair pathway. The donor species may have been one of the symbiotic mycorrhizal fungi that live in close obligate association with most plants.

  11. Putative cross-kingdom horizontal gene transfer in sponge (Porifera) mitochondria

    PubMed Central

    Rot, Chagai; Goldfarb, Itay; Ilan, Micha; Huchon, Dorothée

    2006-01-01

    Background The mitochondrial genome of Metazoa is usually a compact molecule without introns. Exceptions to this rule have been reported only in corals and sea anemones (Cnidaria), in which group I introns have been discovered in the cox1 and nad5 genes. Here we show several lines of evidence demonstrating that introns can also be found in the mitochondria of sponges (Porifera). Results A 2,349 bp fragment of the mitochondrial cox1 gene was sequenced from the sponge Tetilla sp. (Spirophorida). This fragment suggests the presence of a 1143 bp intron. Similar to all the cnidarian mitochondrial introns, the putative intron has group I intron characteristics. The intron is present in the cox1 gene and encodes a putative homing endonuclease. In order to establish the distribution of this intron in sponges, the cox1 gene was sequenced from several representatives of the demosponge diversity. The intron was found only in the sponge order Spirophorida. A phylogenetic analysis of the COI protein sequence and of the intron open reading frame suggests that the intron may have been transmitted horizontally from a fungus donor. Conclusion Little is known about sponge-associated fungi, although in the last few years the latter have been frequently isolated from sponges. We suggest that the horizontal gene transfer of a mitochondrial intron was facilitated by a symbiotic relationship between fungus and sponge. Ecological relationships are known to have implications at the genomic level. Here, an ecological relationship between sponge and fungus is suggested based on the genomic analysis. PMID:16972986

  12. Developmental expression of a regulatory gene is programmed at the level of splicing.

    PubMed Central

    Chou, T B; Zachar, Z; Bingham, P M

    1987-01-01

    We report sequence and transcript structures for a 6191-base chromosomal segment containing the presumptive regulatory gene from Drosophila, suppressor-of-white-apricot [su(wa)]. Our results indicate that su(wa) expression is controlled by regulating occurrence of specific splices. Seven introns are removed from the su(wa) primary transcript during precellular blastoderm development. The sequence of this mature RNA indicates that it is a conventional messenger RNA. In contrast, after cellular blastoderm the first two of these introns cease to be efficiently removed. The mature RNAs resulting from this failure to remove the first two introns have structures quite unexpected of mRNAs. We propose that postcellular blastoderm su(wa) expression is repressed by preventing splices necessary to produce a functional mRNA. Implications and mechanisms are discussed. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. PMID:2832151

  13. Evolution of EF-hand calcium-modulated proteins. IV. Exon shuffling did not determine the domain compositions of EF-hand proteins

    NASA Technical Reports Server (NTRS)

    Kretsinger, R. H.; Nakayama, S.

    1993-01-01

    In the previous three reports in this series we demonstrated that the EF-hand family of proteins evolved by a complex pattern of gene duplication, transposition, and splicing. The dendrograms based on exon sequences are nearly identical to those based on protein sequences for troponin C, the essential light chain myosin, the regulatory light chain, and calpain. This validates both the computational methods and the dendrograms for these subfamilies. The proposal of congruence for calmodulin, troponin C, essential light chain, and regulatory light chain was confirmed. There are, however, significant differences in the calmodulin dendrograms computed from DNA and from protein sequences. In this study we find that introns are distributed throughout the EF-hand domain and the interdomain regions. Further, dendrograms based on intron type and distribution bear little resemblance to those based on protein or on DNA sequences. We conclude that introns are inserted, and probably deleted, with relatively high frequency. Further, in the EF-hand family exons do not correspond to structural domains and exon shuffling played little if any role in the evolution of this widely distributed homolog family. Calmodulin has had a turbulent evolution. Its dendrograms based on protein sequence, exon sequence, 3'-tail sequence, intron sequences, and intron positions all show significant differences.

  14. Influence of intron length on interaction characters between post-spliced intron and its CDS in ribosomal protein genes

    NASA Astrophysics Data System (ADS)

    Zhao, Xiaoqing; Li, Hong; Bao, Tonglaga; Ying, Zhiqiang

    2012-09-01

    Many experiment evidences showed that sequence structures of introns and intron loss/gain can influence gene expression, but current mechanisms did not refer to the functions of post-spliced introns directly. We propose that postspliced introns play their functions in gene expression by interacting with their mRNA sequences and the interaction is characterized by the matched segments between introns and their CDS. In this study, we investigated the interaction characters with length series by improved Smith-Waterman local alignment software for the ribosomal protein genes in C. elegans and D. melanogaster. Our results showed that RF values of five intron groups are significantly high in the central non-conserved region and very low in 5'-end and 3'-end splicing region. It is interesting that the number of the optimal matched regions gradually increases with intron length. Distributions of the optimal matched regions are different for five intron groups. Our study revealed that there are more interaction regions between longer introns and their CDS than shorter, and it provides a positive pattern for regulating the gene expression.

  15. Molecular gene organisation and secondary structure of the mitochondrial large subunit ribosomal RNA from the cultivated Basidiomycota Agrocybe aegerita: a 13 kb gene possessing six unusual nucleotide extensions and eight introns.

    PubMed

    Gonzalez, P; Barroso, G; Labarère, J

    1999-04-01

    The complete gene sequence and secondary structure of the mitochondrial LSU rRNA from the cultivated Basidiomycota Agrocybe aegerita was derived by chromosome walking. The A.aegerita LSU rRNA gene (13 526 nt) represents, to date, the longest described, due to the highest number of introns (eight) and the occurrence of six long nucleotidic extensions. Seven introns belong to group I, while the intronic sequence i5 constitutes the first typical group II intron reported in a fungal mitochondrial LSU rDNA. As with most fungal LSU rDNA introns reported to date, four introns (i5-i8) are distributed in domain V associated with the peptidyl-transferase activity. One intron (i1) is located in domain I, and three (i2-i4) in domain II. The introns i2-i8 possess homologies with other fungal, algal or protozoan introns located at the same position in LSU rDNAs. One of them (i6) is located at the same insertion site as most Ascomycota or algae LSU introns, suggesting a possible inheritance from a common ancestor. On the contrary, intron i1 is located at a so-far unreported insertion site. Among the six unusual nucleotide extensions, five are located in domain I and one in domain V. This is the first report of a mitochondrial LSU rRNA gene sequence and secondary structure for the whole Basidiomycota division.

  16. Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting

    PubMed Central

    Piazza, Carol Lyn; Smith, Dorie

    2018-01-01

    Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis, inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. PMID:29905149

  17. Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting.

    PubMed

    Qu, Guosheng; Piazza, Carol Lyn; Smith, Dorie; Belfort, Marlene

    2018-06-15

    Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis , inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. © 2018, Qu et al.

  18. A 5′ Noncoding Exon Containing Engineered Intron Enhances Transgene Expression from Recombinant AAV Vectors in vivo

    PubMed Central

    Lu, Jiamiao; Williams, James A.; Luke, Jeremy; Zhang, Feijie; Chu, Kirk; Kay, Mark A.

    2017-01-01

    We previously developed a mini-intronic plasmid (MIP) expression system in which the essential bacterial elements for plasmid replication and selection are placed within an engineered intron contained within a universal 5′ UTR noncoding exon. Like minicircle DNA plasmids (devoid of bacterial backbone sequences), MIP plasmids overcome transcriptional silencing of the transgene. However, in addition MIP plasmids increase transgene expression by 2 and often >10 times higher than minicircle vectors in vivo and in vitro. Based on these findings, we examined the effects of the MIP intronic sequences in a recombinant adeno-associated virus (AAV) vector system. Recombinant AAV vectors containing an intron with a bacterial replication origin and bacterial selectable marker increased transgene expression by 40 to 100 times in vivo when compared with conventional AAV vectors. Therefore, inclusion of this noncoding exon/intron sequence upstream of the coding region can substantially enhance AAV-mediated gene expression in vivo. PMID:27903072

  19. The low information content of Neurospora splicing signals: implications for RNA splicing and intron origin.

    PubMed

    Collins, Richard A; Stajich, Jason E; Field, Deborah J; Olive, Joan E; DeAbreu, Diane M

    2015-05-01

    When we expressed a small (0.9 kb) nonprotein-coding transcript derived from the mitochondrial VS plasmid in the nucleus of Neurospora we found that it was efficiently spliced at one or more of eight 5' splice sites and ten 3' splice sites, which are present apparently by chance in the sequence. Further experimental and bioinformatic analyses of other mitochondrial plasmids, random sequences, and natural nuclear genes in Neurospora and other fungi indicate that fungal spliceosomes recognize a wide range of 5' splice site and branchpoint sequences and predict introns to be present at high frequency in random sequence. In contrast, analysis of intronless fungal nuclear genes indicates that branchpoint, 5' splice site and 3' splice site consensus sequences are underrepresented compared with random sequences. This underrepresentation of splicing signals is sufficient to deplete the nuclear genome of splice sites at locations that do not comprise biologically relevant introns. Thus, the splicing machinery can recognize a wide range of splicing signal sequences, but splicing still occurs with great accuracy, not because the splicing machinery distinguishes correct from incorrect introns, but because incorrect introns are substantially depleted from the genome. © 2015 Collins et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  20. Polymorphism in Mitochondrial Group I Introns among Cryptococcus neoformans and Cryptococcus gattii Genotypes and Its Association with Drug Susceptibility.

    PubMed

    Gomes, Felipe E E S; Arantes, Thales D; Fernandes, José A L; Ferreira, Leonardo C; Romero, Héctor; Bosco, Sandra M G; Oliveira, Maria T B; Del Negro, Gilda M B; Theodoro, Raquel C

    2018-01-01

    Cryptococcosis, one of the most important systemic mycosis in the world, is caused by different genotypes of Cryptococcus neoformans and Cryptococcus gattii , which differ in their ecology, epidemiology, and antifungal susceptibility. Therefore, the search for new molecular markers for genotyping, pathogenicity and drug susceptibility is necessary. Group I introns fulfill the requisites for such task because (i) they are polymorphic sequences; (ii) their self-splicing is inhibited by some drugs; and (iii) their correct splicing under parasitic conditions is indispensable for pathogen survival. Here, we investigated the presence of group I introns in the mitochondrial LSU rRNA gene in 77 Cryptococcus isolates and its possible relation to drug susceptibility. Sequencing revealed two new introns in the LSU rRNA gene. All the introns showed high sequence similarity to other mitochondrial introns from distinct fungi, supporting the hypothesis of an ancient non-allelic invasion. Intron presence was statistically associated with those genotypes reported to be less pathogenic ( p < 0.001). Further virulence assays are needed to confirm this finding. In addition, in vitro antifungal tests indicated that the presence of LSU rRNA introns may influence the minimum inhibitory concentration (MIC) of amphotericin B and 5-fluorocytosine. These findings point to group I introns in the mitochondrial genome of Cryptococcus as potential molecular markers for antifungal resistance, as well as therapeutic targets.

  1. Genome-wide identification of conserved intronic non-coding sequences using a Bayesian segmentation approach.

    PubMed

    Algama, Manjula; Tasker, Edward; Williams, Caitlin; Parslow, Adam C; Bryson-Richardson, Robert J; Keith, Jonathan M

    2017-03-27

    Computational identification of non-coding RNAs (ncRNAs) is a challenging problem. We describe a genome-wide analysis using Bayesian segmentation to identify intronic elements highly conserved between three evolutionarily distant vertebrate species: human, mouse and zebrafish. We investigate the extent to which these elements include ncRNAs (or conserved domains of ncRNAs) and regulatory sequences. We identified 655 deeply conserved intronic sequences in a genome-wide analysis. We also performed a pathway-focussed analysis on genes involved in muscle development, detecting 27 intronic elements, of which 22 were not detected in the genome-wide analysis. At least 87% of the genome-wide and 70% of the pathway-focussed elements have existing annotations indicative of conserved RNA secondary structure. The expression of 26 of the pathway-focused elements was examined using RT-PCR, providing confirmation that they include expressed ncRNAs. Consistent with previous studies, these elements are significantly over-represented in the introns of transcription factors. This study demonstrates a novel, highly effective, Bayesian approach to identifying conserved non-coding sequences. Our results complement previous findings that these sequences are enriched in transcription factors. However, in contrast to previous studies which suggest the majority of conserved sequences are regulatory factor binding sites, the majority of conserved sequences identified using our approach contain evidence of conserved RNA secondary structures, and our laboratory results suggest most are expressed. Functional roles at DNA and RNA levels are not mutually exclusive, and many of our elements possess evidence of both. Moreover, ncRNAs play roles in transcriptional and post-transcriptional regulation, and this may contribute to the over-representation of these elements in introns of transcription factors. We attribute the higher sensitivity of the pathway-focussed analysis compared to the genome-wide analysis to improved alignment quality, suggesting that enhanced genomic alignments may reveal many more conserved intronic sequences.

  2. a Simple Symmetric Algorithm Using a Likeness with Introns Behavior in RNA Sequences

    NASA Astrophysics Data System (ADS)

    Regoli, Massimo

    2009-02-01

    The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences has some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algoritnm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.

  3. A common class of transcripts with 5'-intron depletion, distinct early coding sequence features, and N1-methyladenosine modification.

    PubMed

    Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P

    2017-03-01

    Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5 ' proximal- i ntron- m inus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N 1 -methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N 1 -methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  4. An intron within the 16S ribosomal RNA gene of the archaeon Pyrobaculum aerophilum

    NASA Technical Reports Server (NTRS)

    Burggraf, S.; Larsen, N.; Woese, C. R.; Stetter, K. O.

    1993-01-01

    The 16S rRNA genes of Pyrobaculum aerophilum and Pyrobaculum islandicum were amplified by the polymerase chain reaction, and the resulting products were sequenced directly. The two organisms are closely related by this measure (over 98% similar). However, they differ in that the (lone) 16S rRNA gene of Pyrobaculum aerophilum contains a 713-bp intron not seen in the corresponding gene of Pyrobaculum islandicum. To our knowledge, this is the only intron so far reported in the small subunit rRNA gene of a prokaryote. Upon excision the intron is circularized. A secondary structure model of the intron-containing rRNA suggests a splicing mechanism of the same type as that invoked for the tRNA introns of the Archaea and Eucarya and 23S rRNAs of the Archaea. The intron contains an open reading frame whose protein translation shows no certain homology with any known protein sequence.

  5. A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals.

    PubMed

    Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M

    2017-01-01

    Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.

  6. Nucleotide sequence of the L1 ribosomal protein gene of Xenopus laevis: remarkable sequence homology among introns.

    PubMed Central

    Loreni, F; Ruberti, I; Bozzoni, I; Pierandrei-Amaldi, P; Amaldi, F

    1985-01-01

    Ribosomal protein L1 is encoded by two genes in Xenopus laevis. The comparison of two cDNA sequences shows that the two L1 gene copies (L1a and L1b) have diverged in many silent sites and very few substitution sites; moreover a small duplication occurred at the very end of the coding region of the L1b gene which thus codes for a product five amino acids longer than that coded by L1a. Quantitatively the divergence between the two L1 genes confirms that a whole genome duplication took place in Xenopus laevis approximately 30 million years ago. A genomic fragment containing one of the two L1 gene copies (L1a), with its nine introns and flanking regions, has been completely sequenced. The 5' end of this gene has been mapped within a 20-pyridimine stretch as already found for other vertebrate ribosomal protein genes. Four of the nine introns have a 60-nucleotide sequence with 80% homology; within this region some boxes, one of which is 16 nucleotides long, are 100% homologous among the four introns. This feature of L1a gene introns is interesting since we have previously shown that the activity of this gene is regulated at a post-transcriptional level and it involves the block of the normal splicing of some intron sequences. Images Fig. 3. Fig. 5. PMID:3841512

  7. Patterns and rates of intron divergence between humans and chimpanzees

    PubMed Central

    Gazave, Elodie; Marqués-Bonet, Tomàs; Fernando, Olga; Charlesworth, Brian; Navarro, Arcadi

    2007-01-01

    Background Introns, which constitute the largest fraction of eukaryotic genes and which had been considered to be neutral sequences, are increasingly acknowledged as having important functions. Several studies have investigated levels of evolutionary constraint along introns and across classes of introns of different length and location within genes. However, thus far these studies have yielded contradictory results. Results We present the first analysis of human-chimpanzee intron divergence, in which differences in the number of substitutions per intronic site (Ki) can be interpreted as the footprint of different intensities and directions of the pressures of natural selection. Our main findings are as follows: there was a strong positive correlation between intron length and divergence; there was a strong negative correlation between intron length and GC content; and divergence rates vary along introns and depending on their ordinal position within genes (for instance, first introns are more GC rich, longer and more divergent, and divergence is lower at the 3' and 5' ends of all types of introns). Conclusion We show that the higher divergence of first introns is related to their larger size. Also, the lower divergence of short introns suggests that they may harbor a relatively greater proportion of regulatory elements than long introns. Moreover, our results are consistent with the presence of functionally relevant sequences near the 5' and 3' ends of introns. Finally, our findings suggest that other parts of introns may also be under selective constraints. PMID:17309804

  8. Evaluation of the mechanisms of intron loss and gain in the social amoebae Dictyostelium.

    PubMed

    Ma, Ming-Yue; Che, Xun-Ru; Porceddu, Andrea; Niu, Deng-Ke

    2015-12-18

    Spliceosomal introns are a common feature of eukaryotic genomes. To approach a comprehensive understanding of intron evolution on Earth, studies should look beyond repeatedly studied groups such as animals, plants, and fungi. The slime mold Dictyostelium belongs to a supergroup of eukaryotes not covered in previous studies. We found 441 precise intron losses in Dictyostelium discoideum and 202 precise intron losses in Dictyostelium purpureum. Consistent with these observations, Dictyostelium discoideum was found to have significantly more copies of reverse transcriptase genes than Dictyostelium purpureum. We also found that the lost introns are significantly further from the 5' end of genes than the conserved introns. Adjacent introns were prone to be lost simultaneously in Dictyostelium discoideum. In both Dictyostelium species, the exonic sequences flanking lost introns were found to have a significantly higher GC content than those flanking conserved introns. Together, these observations support a reverse-transcription model of intron loss in which intron losses were caused by gene conversion between genomic DNA and cDNA reverse transcribed from mature mRNA. We also identified two imprecise intron losses in Dictyostelium discoideum that may have resulted from genomic deletions. Ninety-eight putative intron gains were also observed. Consistent with previous studies of other lineages, the source sequences were found in only a small number of cases, with only two instances of intron gain identified in Dictyostelium discoideum. Although they diverged very early from animals and fungi, Dictyostelium species have similar mechanisms of intron loss.

  9. A comparative genomics strategy for targeted discovery of single-nucleotide polymorphisms and conserved-noncoding sequences in orphan crops.

    PubMed

    Feltus, F A; Singh, H P; Lohithaswa, H C; Schulze, S R; Silva, T D; Paterson, A H

    2006-04-01

    Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species.

  10. A Comparative Genomics Strategy for Targeted Discovery of Single-Nucleotide Polymorphisms and Conserved-Noncoding Sequences in Orphan Crops1[W

    PubMed Central

    Feltus, F.A.; Singh, H.P.; Lohithaswa, H.C.; Schulze, S.R.; Silva, T.D.; Paterson, A.H.

    2006-01-01

    Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species. PMID:16607031

  11. Development of single-copy nuclear intron markers for species-level phylogenetics: Case study with Paullinieae (Sapindaceae).

    PubMed

    Chery, Joyce G; Sass, Chodon; Specht, Chelsea D

    2017-09-01

    We developed a bioinformatic pipeline that leverages a publicly available genome and published transcriptomes to design primers in conserved coding sequences flanking targeted introns of single-copy nuclear loci. Paullinieae (Sapindaceae) is used to demonstrate the pipeline. Transcriptome reads phylogenetically closer to the lineage of interest are aligned to the closest genome. Single-nucleotide polymorphisms are called, generating a "pseudoreference" closer to the lineage of interest. Several filters are applied to meet the criteria of single-copy nuclear loci with introns of a desired size. Primers are designed in conserved coding sequences flanking introns. Using this pipeline, we developed nine single-copy nuclear intron markers for Paullinieae. This pipeline is highly flexible and can be used for any group with available genomic and transcriptomic resources. This pipeline led to the development of nine variable markers for phylogenetic study without generating sequence data de novo.

  12. Tobacco chloroplast tRNALys(UUU) gene contains a 2.5-kilobase-pair intron: An open reading frame and a conserved boundary sequence in the intron

    PubMed Central

    Sugita, Mamoru; Shinozaki, Kazuo; Sugiura, Masahiro

    1985-01-01

    The nucleotide sequence of a tRNALys(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNAGly(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long. Images PMID:16593561

  13. Tobacco chloroplast tRNA(UUU) gene contains a 2.5-kilobase-pair intron: An open reading frame and a conserved boundary sequence in the intron.

    PubMed

    Sugita, M; Shinozaki, K; Sugiura, M

    1985-06-01

    The nucleotide sequence of a tRNA(Lys)(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNA(Gly)(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long.

  14. The gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis contains a group I intron.

    PubMed Central

    De Wachter, R; Neefs, J M; Goris, A; Van de Peer, Y

    1992-01-01

    The nucleotide sequence of the gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis was determined. It revealed the presence of a group I intron with a length of 411 nucleotides. This is the third occurrence of such an intron discovered in a small subunit rRNA gene encoded by a eukaryotic nuclear genome. The other two occurrences are in Pneumocystis carinii, a fungus of uncertain taxonomic status, and Ankistrodesmus stipitatus, a green alga. The nucleotides of the conserved core structure of 101 group I intron sequences present in different genes and genome types were aligned and their evolutionary relatedness was examined. This revealed a cluster including all group I introns hitherto found in eukaryotic nuclear genes coding for small and large subunit rRNAs. A secondary structure model was designed for the area of the Ustilago maydis small ribosomal subunit RNA precursor where the intron is situated. It shows that the internal guide sequence pairing with the intron boundaries fits between two helices of the small subunit rRNA, and that minimal rearrangement of base pairs suffices to achieve the definitive secondary structure of the 18S rRNA upon splicing. PMID:1561081

  15. The intron 1 of HPV 16 has a suboptimal branch point at a guanosine.

    PubMed

    De la Rosa-Rios, Marco Antonio; Martínez-Salazar, Martha; Martínez-Garcia, Martha; González-Bonilla, César; Villegas-Sepúlveda, Nicolás

    2006-06-01

    The branch point sequence (BPS) of intron 1 of the HPV-16 was determined via RT-PCR in a cell free system, using lariat intermediates obtained by in vitro splicing reactions. We used synthetic E6/E7 transcripts and HeLa nuclear protein extracts to obtain the splicing intermediates. Then, a divergent oligonucleotide primer set, pairing on the lariat RNA that encompassed the 2'-5' phosphodiester bond formed between the 5' end of the intron and the BPS, was used for cDNA synthesis and PCR amplification. Subsequent RT-PCR assays revealed four splicing intermediates, made up of a major intermediary corresponding to the BPS and four cryptic branched sequences. Only intermediates bound at the 5' end of the intron are probably the authentic branch point sequence, and all of them branch at guanosine 328 instead of the typical adenosine. Unusually, the BPS of intron 1 of HPV-16 is a suboptimal sequence (AGUGAGU) that differs from the eukaryotic consensus BPS, which correlates with the splicing profile observed for early transcripts of HPV-16 in tumors and tumor derived cell lines. The implications of this unusual branch point sequence for splicing of the HPV-16 pre-mRNA are discussed.

  16. [Identifying and sequence analysis of HLA-B*2736].

    PubMed

    Li, Zhen; Zou, Hong-Yan; Shao, Chao-Peng; Tang, Si; Wang, Da-Ming; Cheng, Liang-Hong

    2007-11-01

    An unknown HLA-B allele which was similar to HLA-B*270401 was detected by FLOW-SSOPCR-SSP and heterozygous sequence-based typing (SBT) in Chinese Han individual. Its anomalous patterns suggested the possible presence of new allele. Amplifying exon 2-5(include intron 2-4) of the HLA-B*27 allele separately by using allele-specific primers and sequencing in both directions. Identifying the difference between the novel B*27 allele and B*270401. The sequence of novel B*27 from exon 2 to partial exon 5 is 1 815 bp. There are 10 nt changes from B*270401 in exon 3-4, at nt634where A-->C(codon130 AGC-->CGC, 130 S-->R); nt670 where A-->T (codon142 ACC-->TCC, 142 T-->S); nt683 where G-->T (codon146 TGG-->TTG, 146 W-->L); nt698 where A-->T (codon151 GAG-->GTG, 151 E-->V); nt774 where G-->C (codon176 GAG-->GAC, 176 E-->D); nt776 where C-->A (codon177 ACG-->AAG, 177 T-->K); nt781 where C-->G (codon179 CAG-->GAG, 179Q-->E); nt789 where G-->T (codon181 GCG-->GCT) resulting no coding change; nt1438 where C-->T (codon206 GGC-->GGT) resulting no coding change; nt1449 where G-->C (codon210 GGG-->GCG, 210G-->A). In IMGT/HLA database, only three alleles (B*270502/2706/2732) have sequences of introns. The same sequence in intron 2 showed homology between the novel HLA-B*27 allele and B*2706, but their homology could not be supported in intron 3-4. Comparing the sequence of the novel B*27 allele in intron 3 and 4 with B*27 group, it showed there are three mutations at nt106 C-->G, nt179 G-->A, nt536 G-->A and one deletion at nt168 in intron 3 and one mutations at nt82 T-->C in intron 4, but the sequence of the novel B*27 allele in intron 3 and 4 was all the same to B*070201. The sequence was submitted to Gen-Bank and the accession number was DQ915176. The allele has been confirmed as an extension of B*2736 by the WHO Nomenclature committee in November 2006.

  17. [Analysis of chloroplast rpS16 intron sequences in Lemnaceae].

    PubMed

    Martirosian, E V; Ryzhova, N N; Kochieva, E Z; Skriabin, K G

    2009-01-01

    Chloroplast rpS16 gene intron sequences were determined and characterized for twenty-five Lemnaceae accessions representing nine duckweed species. For each Lemnaceae species nucleotide substitutions and for Lemna minor, Lemna aequinoctialis, Wolffia arrhiza different indels were detected. Most of indels were found for Wolffia arrhiza and Lemna aequinoctialis. The analyses of intraspecific polymorphism resulted in identification of several gaplotypes in L. gibba and L. trisulca. Lemnaceae phylogenetic relationship based on rpS16 intron variability data has revealed significant differences between L. aequinoctialis and other Lemna species. Genetic distance values corroborated competence of Landoltia punctata separations from Spirodela into an independent generic taxon. The acceptability of rpS16 intron sequences for phylogenetic studies in Lemnaceae was shown.

  18. cisprimertool: software to implement a comparative genomics strategy for the development of conserved intron scanning (CIS) markers.

    PubMed

    Jayashree, B; Jagadeesh, V T; Hoisington, D

    2008-05-01

    The availability of complete, annotated genomic sequence information in model organisms is a rich resource that can be extended to understudied orphan crops through comparative genomic approaches. We report here a software tool (cisprimertool) for the identification of conserved intron scanning regions using expressed sequence tag alignments to a completely sequenced model crop genome. The method used is based on earlier studies reporting the assessment of conserved intron scanning primers (called CISP) within relatively conserved exons located near exon-intron boundaries from onion, banana, sorghum and pearl millet alignments with rice. The tool is freely available to academic users at http://www.icrisat.org/gt-bt/CISPTool.htm. © 2007 ICRISAT.

  19. Molecular characterization of beta-tubulin from Phakopsora pachyrhizi, the causal agent of Asian soybean rust

    PubMed Central

    2010-01-01

    β-tubulins are structural components of microtubules and the targets of benzimidazole fungicides used to control many diseases of agricultural importance. Intron polymorphisms in the intron-rich genes of these proteins have been used in phylogeographic investigations of phytopathogenic fungi. In this work, we sequenced 2764 nucleotides of the β-tubulin gene (Pp tubB) in samples of Phakopsora pachyrhizi collected from seven soybean fields in Brazil. Pp tubB contained an open reading frame of 1341 nucleotides, including nine exons and eight introns. Exon length varied from 14 to 880 nucleotides, whereas intron length varied from 76 to 102 nucleotides. The presence of only four polymorphic sites limited the usefulness of Pp tubB for phylogeographic studies in P. pachyrhizi. The gene structures of Pp tubB and orthologous β-tubulin genes of Melampsora lini and Uromyces viciae-fabae were highly conserved. The amino acid substitutions in β-tubulin proteins associated with the onset of benzimidazole resistance in model organisms, especially at His 6 , Glu 198 and Phe 200 , were absent from the predicted sequence of the P. pachyrhizi β-tubulin protein. PMID:21637494

  20. Contribution of Mobile Group II Introns to Sinorhizobium meliloti Genome Evolution.

    PubMed

    Toro, Nicolás; Martínez-Abarca, Francisco; Molina-Sánchez, María D; García-Rodríguez, Fernando M; Nisa-Martínez, Rafael

    2018-01-01

    Mobile group II introns are ribozymes and retroelements that probably originate from bacteria. Sinorhizobium meliloti , the nitrogen-fixing endosymbiont of legumes of genus Medicago , harbors a large number of these retroelements. One of these elements, RmInt1, has been particularly successful at colonizing this multipartite genome. Many studies have improved our understanding of RmInt1 and phylogenetically related group II introns, their mobility mechanisms, spread and dynamics within S. meliloti and closely related species. Although RmInt1 conserves the ancient retroelement behavior, its evolutionary history suggests that this group II intron has played a role in the short- and long-term evolution of the S. meliloti genome. We will discuss its proposed role in genome evolution by controlling the spread and coexistence of potentially harmful mobile genetic elements, by ectopic transposition to different genetic loci as a source of early genomic variation and by generating sequence variation after a very slow degradation process, through intron remnants that may have continued to evolve, contributing to bacterial speciation.

  1. Contribution of Mobile Group II Introns to Sinorhizobium meliloti Genome Evolution

    PubMed Central

    Toro, Nicolás; Martínez-Abarca, Francisco; Molina-Sánchez, María D.; García-Rodríguez, Fernando M.; Nisa-Martínez, Rafael

    2018-01-01

    Mobile group II introns are ribozymes and retroelements that probably originate from bacteria. Sinorhizobium meliloti, the nitrogen-fixing endosymbiont of legumes of genus Medicago, harbors a large number of these retroelements. One of these elements, RmInt1, has been particularly successful at colonizing this multipartite genome. Many studies have improved our understanding of RmInt1 and phylogenetically related group II introns, their mobility mechanisms, spread and dynamics within S. meliloti and closely related species. Although RmInt1 conserves the ancient retroelement behavior, its evolutionary history suggests that this group II intron has played a role in the short- and long-term evolution of the S. meliloti genome. We will discuss its proposed role in genome evolution by controlling the spread and coexistence of potentially harmful mobile genetic elements, by ectopic transposition to different genetic loci as a source of early genomic variation and by generating sequence variation after a very slow degradation process, through intron remnants that may have continued to evolve, contributing to bacterial speciation. PMID:29670598

  2. Analysis of Claviceps africana and C. sorghi from India using AFLPs, EF-1alpha gene intron 4, and beta-tubulin gene intron 3.

    PubMed

    Tooley, Paul W; Bandyopadhyay, Ranajit; Carras, Marie M; Pazoutová, Sylvie

    2006-04-01

    Isolates of Claviceps causing ergot on sorghum in India were analysed by AFLP analysis, and by analysis of DNA sequences of the EF-1alpha gene intron 4 and beta-tubulin gene intron 3 region. Of 89 isolates assayed from six states in India, four were determined to be C. sorghi, and the rest C. africana. A relatively low level of genetic diversity was observed within the Indian C. africana population. No evidence of genetic exchange between C. africana and C. sorghi was observed in either AFLP or DNA sequence analysis. Phylogenetic analysis was conducted using DNA sequences from 14 different Claviceps species. A multigene phylogeny based on the EF-1alpha gene intron 4, the beta-tubulin gene intron 3 region, and rDNA showed that C. sorghi grouped most closely with C. gigantea and C. africana. Although the Claviceps species we analysed were closely related, they colonize hosts that are taxonomically very distinct suggesting that there is no direct coevolution of Claviceps with its hosts.

  3. Thermostable group II intron reverse transcriptase fusion proteins and their use in cDNA synthesis and next-generation RNA sequencing.

    PubMed

    Mohr, Sabine; Ghanem, Eman; Smith, Whitney; Sheeter, Dennis; Qin, Yidan; King, Olga; Polioudakis, Damon; Iyer, Vishwanath R; Hunicke-Smith, Scott; Swamy, Sajani; Kuersten, Scott; Lambowitz, Alan M

    2013-07-01

    Mobile group II introns encode reverse transcriptases (RTs) that function in intron mobility ("retrohoming") by a process that requires reverse transcription of a highly structured, 2-2.5-kb intron RNA with high processivity and fidelity. Although the latter properties are potentially useful for applications in cDNA synthesis and next-generation RNA sequencing (RNA-seq), group II intron RTs have been difficult to purify free of the intron RNA, and their utility as research tools has not been investigated systematically. Here, we developed general methods for the high-level expression and purification of group II intron-encoded RTs as fusion proteins with a rigidly linked, noncleavable solubility tag, and we applied them to group II intron RTs from bacterial thermophiles. We thus obtained thermostable group II intron RT fusion proteins that have higher processivity, fidelity, and thermostability than retroviral RTs, synthesize cDNAs at temperatures up to 81°C, and have significant advantages for qRT-PCR, capillary electrophoresis for RNA-structure mapping, and next-generation RNA sequencing. Further, we find that group II intron RTs differ from the retroviral enzymes in template switching with minimal base-pairing to the 3' ends of new RNA templates, making it possible to efficiently and seamlessly link adaptors containing PCR-primer binding sites to cDNA ends without an RNA ligase step. This novel template-switching activity enables facile and less biased cloning of nonpolyadenylated RNAs, such as miRNAs or protein-bound RNA fragments. Our findings demonstrate novel biochemical activities and inherent advantages of group II intron RTs for research, biotechnological, and diagnostic methods, with potentially wide applications.

  4. Intron loss from the NADH dehydrogenase subunit 4 gene of lettuce mitochondrial DNA: evidence for homologous recombination of a cDNA intermediate.

    PubMed

    Geiss, K T; Abbas, G M; Makaroff, C A

    1994-04-01

    The mitochondrial gene coding for subunit 4 of the NADH dehydrogenase complex I (nad4) has been isolated and characterized from lettuce, Lactuca sativa. Analysis of nad4 genes in a number of plants by Southern hybridization had previously suggested that the intron content varied between species. Characterization of the lettuce gene confirms this observation. Lettuce nad4 contains two exons and one group IIA intron, whereas previously sequenced nad4 genes from turnip and wheat contain three group IIA introns. Northern analysis identified a transcript of 1600 nucleotides, which represents the mature nad4 mRNA and a primary transcript of 3200 nucleotides. Sequence analysis of lettuce and turnip nad4 cDNAs was used to confirm the intron/exon border sequences and to examine RNA editing patterns. Editing is observed at the 5' and 3' ends of the lettuce transcript, but is absent from sequences that correspond to exons two, three and the 5' end of exon four in turnip and wheat. In contrast, turnip transcripts are highly edited in this region, suggesting that homologous recombination of an edited and spliced cDNA intermediate was involved in the loss of introns two and three from an ancestral lettuce nad4 gene.

  5. De novo insertion of an intron into the mammalian sex determining gene, SRY

    PubMed Central

    O’Neill, Rachel J. Waugh; Brennan, Francine E.; Delbridge, Margaret L.; Crozier, Ross H.; Graves, Jennifer A. Marshall

    1998-01-01

    Two theories have been proposed to explain the evolution of introns within eukaryotic genes. The introns early theory, or “exon theory of genes,” proposes that introns are ancient and that recombination within introns provided new exon structure, and thus new genes. The introns late theory, or “insertional theory of introns,” proposes that ancient genes existed as uninterrupted exons and that introns have been introduced during the course of evolution. There is still controversy as to how intron–exon structure evolved and whether the majority of introns are ancient or novel. Although there is extensive evidence in support of the introns early theory, phylogenetic comparisons of several genes indicate recent gain and loss of introns within these genes. However, no example has been shown of a protein coding gene, intronless in its ancestral form, which has acquired an intron in a derived form. The mammalian sex determining gene, SRY, is intronless in all mammals studied to date, as is the gene from which it recently evolved. However, we report here comparisons of genomic and cDNA sequences that now provide evidence of a de novo insertion of an intron into the SRY gene of dasyurid marsupials. This recently (approximately 45 million years ago) inserted sequence is not homologous with known transposable elements. Our data demonstrate that introns may be inserted as spliced units within a developmentally crucial gene without disrupting its function. PMID:9465071

  6. Tissue- and Time-Specific Expression of Otherwise Identical tRNA Genes

    PubMed Central

    Adir, Idan; Dahan, Orna; Broday, Limor; Pilpel, Yitzhak; Rechavi, Oded

    2016-01-01

    Codon usage bias affects protein translation because tRNAs that recognize synonymous codons differ in their abundance. Although the current dogma states that tRNA expression is exclusively regulated by intrinsic control elements (A- and B-box sequences), we revealed, using a reporter that monitors the levels of individual tRNA genes in Caenorhabditis elegans, that eight tryptophan tRNA genes, 100% identical in sequence, are expressed in different tissues and change their expression dynamically. Furthermore, the expression levels of the sup-7 tRNA gene at day 6 were found to predict the animal’s lifespan. We discovered that the expression of tRNAs that reside within introns of protein-coding genes is affected by the host gene’s promoter. Pairing between specific Pol II genes and the tRNAs that are contained in their introns is most likely adaptive, since a genome-wide analysis revealed that the presence of specific intronic tRNAs within specific orthologous genes is conserved across Caenorhabditis species. PMID:27560950

  7. Identification and analysis of multigene families by comparison of exon fingerprints.

    PubMed

    Brown, N P; Whittaker, A J; Newell, W R; Rawlings, C J; Beck, S

    1995-06-02

    Gene families are often recognised by sequence homology using similarity searching to find relationships, however, genomic sequence data provides gene architectural information not used by conventional search methods. In particular, intron positions and phases are expected to be relatively conserved features, because mis-splicing and reading frame shifts should be selected against. A fast search technique capable of detecting possible weak sequence homologies apparent at the intron/exon level of gene organization is presented for comparing spliceosomal genes and gene fragments. FINEX compares strings of exons delimited by intron/exon boundary positions and intron phases (exon fingerprint) using a global dynamic programming algorithm with a combined intron phase identity and exon size dissimilarity score. Exon fingerprints are typically two orders of magnitude smaller than their nucleic acid sequence counterparts giving rise to fast search times: a ranked search against a library of 6755 fingerprints for a typical three exon fingerprint completes in under 30 seconds on an ordinary workstation, while a worst case largest fingerprint of 52 exons completes in just over one minute. The short "sequence" length of exon fingerprints in comparisons is compensated for by the large exon alphabet compounded of intron phase types and a wide range of exon sizes, the latter contributing the most information to alignments. FINEX performs better in some searches than conventional methods, finding matches with similar exon organization, but low sequence homology. A search using a human serum albumin finds all members of the multigene family in the FINEX database at the top of the search ranking, despite very low amino acid percentage identities between family members. The method should complement conventional sequence searching and alignment techniques, offering a means of identifying otherwise hard to detect homologies where genomic data are available.

  8. Quaking and PTB control overlapping splicing regulatory networks during muscle cell differentiation

    PubMed Central

    Hall, Megan P.; Nagel, Roland J.; Fagg, W. Samuel; Shiue, Lily; Cline, Melissa S.; Perriman, Rhonda J.; Donohue, John Paul; Ares, Manuel

    2013-01-01

    Alternative splicing contributes to muscle development, but a complete set of muscle-splicing factors and their combinatorial interactions are unknown. Previous work identified ACUAA (“STAR” motif) as an enriched intron sequence near muscle-specific alternative exons such as Capzb exon 9. Mass spectrometry of myoblast proteins selected by the Capzb exon 9 intron via RNA affinity chromatography identifies Quaking (QK), a protein known to regulate mRNA function through ACUAA motifs in 3′ UTRs. We find that QK promotes inclusion of Capzb exon 9 in opposition to repression by polypyrimidine tract-binding protein (PTB). QK depletion alters inclusion of 406 cassette exons whose adjacent intron sequences are also enriched in ACUAA motifs. During differentiation of myoblasts to myotubes, QK levels increase two- to threefold, suggesting a mechanism for QK-responsive exon regulation. Combined analysis of the PTB- and QK-splicing regulatory networks during myogenesis suggests that 39% of regulated exons are under the control of one or both of these splicing factors. This work provides the first evidence that QK is a global regulator of splicing during muscle development in vertebrates and shows how overlapping splicing regulatory networks contribute to gene expression programs during differentiation. PMID:23525800

  9. Phylogenetics and Gene Structure Dynamics of Polygalacturonase Genes in Aspergillus and Neurospora crassa

    PubMed Central

    Hong, Jin-Sung; Ryu, Ki-Hyun; Kwon, Soon-Jae; Kim, Jin-Won; Kim, Kwang-Soo; Park, Kyong-Cheul

    2013-01-01

    Polygalacturonase (PG) gene is a typical gene family present in eukaryotes. Forty-nine PGs were mined from the genomes of Neurospora crassa and five Aspergillus species. The PGs were classified into 3 clades such as clade 1 for rhamno-PGs, clade 2 for exo-PGs and clade 3 for exo- and endo-PGs, which were further grouped into 13 sub-clades based on the polypeptide sequence similarity. In gene structure analysis, a total of 124 introns were present in 44 genes and five genes lacked introns to give an average of 2.5 introns per gene. Intron phase distribution was 64.5% for phase 0, 21.8% for phase 1, and 13.7% for phase 2, respectively. The introns varied in their sequences and their lengths ranged from 20 bp to 424 bp with an average of 65.9 bp, which is approximately half the size of introns in other fungal genes. There were 29 homologous intron blocks and 26 of those were sub-clade specific. Intron losses were counted in 18 introns in which no obvious phase preference for intron loss was observed. Eighteen introns were placed at novel positions, which is considerably higher than those of plant PGs. In an evolutionary sense both intron loss and gain must have taken place for shaping the current PGs in these fungi. Together with the small intron size, low conservation of homologous intron blocks and higher number of novel introns, PGs of fungal species seem to have recently undergone highly dynamic evolution. PMID:25288950

  10. Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication.

    PubMed

    Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

    2016-06-04

    Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.

  11. Identification of Genetic Elements Associated with EPSPS Gene Amplification

    PubMed Central

    Gaines, Todd A.; Wright, Alice A.; Molin, William T.; Lorentz, Lothar; Riggins, Chance W.; Tranel, Patrick J.; Beffa, Roland; Westra, Philip; Powles, Stephen B.

    2013-01-01

    Weed populations can have high genetic plasticity and rapid responses to environmental selection pressures. For example, 100-fold amplification of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene evolved in the weed species Amaranthus palmeri to confer resistance to glyphosate, the world’s most important herbicide. However, the gene amplification mechanism is unknown. We sequenced the EPSPS gene and genomic regions flanking EPSPS loci in A. palmeri, and searched for mobile genetic elements or repetitive sequences. The EPSPS gene was 10,229 bp, containing 8 exons and 7 introns. The gene amplification likely proceeded through a DNA-mediated mechanism, as introns exist in the amplified gene copies and the entire amplified sequence is at least 30 kb in length. Our data support the presence of two EPSPS loci in susceptible (S) A. palmeri, and that only one of these was amplified in glyphosate-resistant (R) A. palmeri. The EPSPS gene amplification event likely occurred recently, as no sequence polymorphisms were found within introns of amplified EPSPS copies from R individuals. Sequences with homology to miniature inverted-repeat transposable elements (MITEs) were identified next to EPSPS gene copies only in R individuals. Additionally, a putative Activator (Ac) transposase and a repetitive sequence region were associated with amplified EPSPS genes. The mechanism controlling this DNA-mediated amplification remains unknown. Further investigation is necessary to determine if the gene amplification may have proceeded via DNA transposon-mediated replication, and/or unequal recombination between different genomic regions resulting in replication of the EPSPS gene. PMID:23762434

  12. Deep intronic GPR143 mutation in a Japanese family with ocular albinism

    PubMed Central

    Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei

    2015-01-01

    Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease. PMID:26061757

  13. Deep intronic GPR143 mutation in a Japanese family with ocular albinism.

    PubMed

    Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei

    2015-06-10

    Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease.

  14. A Detailed History of Intron-rich Eukaryotic Ancestors Inferred from a Global Survey of 100 Complete Genomes

    PubMed Central

    Csuros, Miklos; Rogozin, Igor B.; Koonin, Eugene V.

    2011-01-01

    Protein-coding genes in eukaryotes are interrupted by introns, but intron densities widely differ between eukaryotic lineages. Vertebrates, some invertebrates and green plants have intron-rich genes, with 6–7 introns per kilobase of coding sequence, whereas most of the other eukaryotes have intron-poor genes. We reconstructed the history of intron gain and loss using a probabilistic Markov model (Markov Chain Monte Carlo, MCMC) on 245 orthologous genes from 99 genomes representing the three of the five supergroups of eukaryotes for which multiple genome sequences are available. Intron-rich ancestors are confidently reconstructed for each major group, with 53 to 74% of the human intron density inferred with 95% confidence for the Last Eukaryotic Common Ancestor (LECA). The results of the MCMC reconstruction are compared with the reconstructions obtained using Maximum Likelihood (ML) and Dollo parsimony methods. An excellent agreement between the MCMC and ML inferences is demonstrated whereas Dollo parsimony introduces a noticeable bias in the estimations, typically yielding lower ancestral intron densities than MCMC and ML. Evolution of eukaryotic genes was dominated by intron loss, with substantial gain only at the bases of several major branches including plants and animals. The highest intron density, 120 to 130% of the human value, is inferred for the last common ancestor of animals. The reconstruction shows that the entire line of descent from LECA to mammals was intron-rich, a state conducive to the evolution of alternative splicing. PMID:21935348

  15. Mitochondrial Intronic Open Reading Frames in Podospora: Mobility and Consecutive Exonic Sequence Variations

    PubMed Central

    Sellem, C. H.; d'Aubenton-Carafa, Y.; Rossignol, M.; Belcour, L.

    1996-01-01

    The mitochondrial genome of 23 wild-type strains belonging to three different species of the filamentous fungus Podospora was examined. Among the 15 optional sequences identified are two intronic reading frames, nad1-i4-orf1 and cox1-i7-orf2. We show that the presence of these sequences was strictly correlated with tightly clustered nucleotide substitutions in the adjacent exon. This correlation applies to the presence or absence of closely related open reading frames (ORFs), found at the same genetic locations, in all the Pyrenomycete genera examined. The recent gain of these optional ORFs in the evolution of the genus Podospora probably account for such sequence differences. In the homoplasmic progeny from heteroplasmons constructed between Podospora strains differing by the presence of these optional ORFs, nad1-i4-orf1 and cox1-i7-orf2 appeared highly invasive. Sequence comparisons in the nad1-i4 intron of various strains of the Pyrenomycete family led us to propose a scenario of its evolution that includes several events of loss and gain of intronic ORFs. These results strongly reinforce the idea that group I intronic ORFs are mobile elements and that their transfer, and comcomitant modification of the adjacent exon, could participate in the modular evolution of mitochondrial genomes. PMID:8725226

  16. Mitochondrial intronic open reading frames in Podospora: mobility and consecutive exonic sequence variations.

    PubMed

    Sellem, C H; d'Aubenton-Carafa, Y; Rossignol, M; Belcour, L

    1996-06-01

    The mitochondrial genome of 23 wild-type strains belonging to three different species of the filamentous fungus Podospora was examined. Among the 15 optional sequences identified are two intronic reading frames, nad1-i4-orf1 and cox1-i7-orf2. We show that the presence of these sequences was strictly correlated with tightly clustered nucleotide substitutions in the adjacent exon. This correlation applies to the presence or absence of closely related open reading frames (ORFs), found at the same genetic locations, in all the Pyrenomycete genera examined. The recent gain of these optional ORFs in the evolution of the genus Podospora probably account for such sequence differences. In the homoplasmic progeny from heteroplasmons constructed between Podospora strains differing by the presence of these optional ORFs, nad1-i4-orf1 and cox1-i7-orf2 appeared highly invasive. Sequence comparisons in the nad1-i4 intron of various strains of the Pyrenomycete family led us to propose a scenario of its evolution that includes several events of loss and gain of intronic ORFs. These results strongly reinforce the idea that group 1 intronic ORFs are mobile elements and that their transfer, and concomitant modification of the adjacent exon, could participate in the modular evolution of mitochondrial genomes.

  17. Accurate, simple, and inexpensive assays to diagnose F8 gene inversion mutations in hemophilia A patients and carriers.

    PubMed

    Dutta, Debargh; Gunasekera, Devi; Ragni, Margaret V; Pratt, Kathleen P

    2016-12-27

    The most frequent mutations resulting in hemophilia A are an intron 22 or intron 1 gene inversion, which together cause ∼50% of severe hemophilia A cases. We report a simple and accurate RNA-based assay to detect these mutations in patients and heterozygous carriers. The assays do not require specialized equipment or expensive reagents; therefore, they may provide useful and economic protocols that could be standardized for central laboratory testing. RNA is purified from a blood sample, and reverse transcription nested polymerase chain reaction (RT-NPCR) reactions amplify DNA fragments with the F8 sequence spanning the exon 22 to 23 splice site (intron 22 inversion test) or the exon 1 to 2 splice site (intron 1 inversion test). These sequences will be amplified only from F8 RNA without an intron 22 or intron 1 inversion mutation, respectively. Additional RT-NPCR reactions are then carried out to amplify the inverted sequences extending from F8 exon 19 to the first in-frame stop codon within intron 22 or a chimeric transcript containing F8 exon 1 and the VBP1 gene. These latter 2 products are produced only by individuals with an intron 22 or intron 1 inversion mutation, respectively. The intron 22 inversion mutations may be further classified (eg, as type 1 or type 2, reflecting the specific homologous recombination sites) by the standard DNA-based "inverse-shifting" PCR assay if desired. Efficient Bcl I and T4 DNA ligase enzymes that cleave and ligate DNA in minutes were used, which is a substantial improvement over previous protocols that required overnight incubations. These protocols can accurately detect F8 inversion mutations via same-day testing of patient samples.

  18. Sequencing of mitochondrial genomes of nine Aspergillus and Penicillium species identifies mobile introns and accessory genes as main sources of genome size variability.

    PubMed

    Joardar, Vinita; Abrams, Natalie F; Hostetler, Jessica; Paukstelis, Paul J; Pakala, Suchitra; Pakala, Suman B; Zafar, Nikhat; Abolude, Olukemi O; Payne, Gary; Andrianopoulos, Alex; Denning, David W; Nierman, William C

    2012-12-12

    The genera Aspergillus and Penicillium include some of the most beneficial as well as the most harmful fungal species such as the penicillin-producer Penicillium chrysogenum and the human pathogen Aspergillus fumigatus, respectively. Their mitochondrial genomic sequences may hold vital clues into the mechanisms of their evolution, population genetics, and biology, yet only a handful of these genomes have been fully sequenced and annotated. Here we report the complete sequence and annotation of the mitochondrial genomes of six Aspergillus and three Penicillium species: A. fumigatus, A. clavatus, A. oryzae, A. flavus, Neosartorya fischeri (A. fischerianus), A. terreus, P. chrysogenum, P. marneffei, and Talaromyces stipitatus (P. stipitatum). The accompanying comparative analysis of these and related publicly available mitochondrial genomes reveals wide variation in size (25-36 Kb) among these closely related fungi. The sources of genome expansion include group I introns and accessory genes encoding putative homing endonucleases, DNA and RNA polymerases (presumed to be of plasmid origin) and hypothetical proteins. The two smallest sequenced genomes (A. terreus and P. chrysogenum) do not contain introns in protein-coding genes, whereas the largest genome (T. stipitatus), contains a total of eleven introns. All of the sequenced genomes have a group I intron in the large ribosomal subunit RNA gene, suggesting that this intron is fixed in these species. Subsequent analysis of several A. fumigatus strains showed low intraspecies variation. This study also includes a phylogenetic analysis based on 14 concatenated core mitochondrial proteins. The phylogenetic tree has a different topology from published multilocus trees, highlighting the challenges still facing the Aspergillus systematics. The study expands the genomic resources available to fungal biologists by providing mitochondrial genomes with consistent annotations for future genetic, evolutionary and population studies. Despite the conservation of the core genes, the mitochondrial genomes of Aspergillus and Penicillium species examined here exhibit significant amount of interspecies variation. Most of this variation can be attributed to accessory genes and mobile introns, presumably acquired by horizontal gene transfer of mitochondrial plasmids and intron homing.

  19. Effective suppression of dengue virus using a novel group-I intron that induces apoptotic cell death upon infection through conditional expression of the Bax C-terminal domain.

    PubMed

    Carter, James R; Keith, James H; Fraser, Tresa S; Dawson, James L; Kucharski, Cheryl A; Horne, Kate M; Higgs, Stephen; Fraser, Malcolm J

    2014-06-13

    Approximately 100 million confirmed infections and 20,000 deaths are caused by Dengue virus (DENV) outbreaks annually. Global warming and rapid dispersal have resulted in DENV epidemics in formally non-endemic regions. Currently no consistently effective preventive measures for DENV exist, prompting development of transgenic and paratransgenic vector control approaches. Production of transgenic mosquitoes refractory for virus infection and/or transmission is contingent upon defining antiviral genes that have low probability for allowing escape mutations, and are equally effective against multiple serotypes. Previously we demonstrated the effectiveness of an anti-viral group I intron targeting U143 of the DENV genome in mediating trans-splicing and expression of a marker gene with the capsid coding domain. In this report we examine the effectiveness of coupling expression of ΔN Bax to trans-splicing U143 intron activity as a means of suppressing DENV infection of mosquito cells. Targeting the conserved DENV circularization sequence (CS) by U143 intron trans-splicing activity appends a 3' exon RNA encoding ΔN Bax to the capsid coding region of the genomic RNA, resulting in a chimeric protein that induces premature cell death upon infection. TCID50-IFA analyses demonstrate an enhancement of DENV suppression for all DENV serotypes tested over the identical group I intron coupled with the non-apoptotic inducing firefly luciferase as the 3' exon. These cumulative results confirm the increased effectiveness of this αDENV-U143-ΔN Bax group I intron as a sequence specific antiviral that should be useful for suppression of DENV in transgenic mosquitoes. Annexin V staining, caspase 3 assays, and DNA ladder observations confirm DCA-ΔN Bax fusion protein expression induces apoptotic cell death. This report confirms the relative effectiveness of an anti-DENV group I intron coupled to an apoptosis-inducing ΔN Bax 3' exon that trans-splices conserved sequences of the 5' CS region of all DENV serotypes and induces apoptotic cell death upon infection. Our results confirm coupling the targeted ribozyme capabilities of the group I intron with the generation of an apoptosis-inducing transcript increases the effectiveness of infection suppression, improving the prospects of this unique approach as a means of inducing transgenic refractoriness in mosquitoes for all serotypes of this important disease.

  20. Proliferation of group II introns in the chloroplast genome of the green alga Oedocladium carolinianum (Chlorophyceae).

    PubMed

    Brouard, Jean-Simon; Turmel, Monique; Otis, Christian; Lemieux, Claude

    2016-01-01

    The chloroplast genome sustained extensive changes in architecture during the evolution of the Chlorophyceae, a morphologically and ecologically diverse class of green algae belonging to the Chlorophyta; however, the forces driving these changes are poorly understood. The five orders recognized in the Chlorophyceae form two major clades: the CS clade consisting of the Chlamydomonadales and Sphaeropleales, and the OCC clade consisting of the Oedogoniales, Chaetophorales, and Chaetopeltidales. In the OCC clade, considerable variations in chloroplast DNA (cpDNA) structure, size, gene order, and intron content have been observed. The large inverted repeat (IR), an ancestral feature characteristic of most green plants, is present in Oedogonium cardiacum (Oedogoniales) but is lacking in the examined members of the Chaetophorales and Chaetopeltidales. Remarkably, the Oedogonium 35.5-kb IR houses genes that were putatively acquired through horizontal DNA transfer. To better understand the dynamics of chloroplast genome evolution in the Oedogoniales, we analyzed the cpDNA of a second representative of this order, Oedocladium carolinianum . The Oedocladium cpDNA was sequenced and annotated. The evolutionary distances separating Oedocladium and Oedogonium cpDNAs and two other pairs of chlorophycean cpDNAs were estimated using a 61-gene data set. Phylogenetic analysis of an alignment of group IIA introns from members of the OCC clade was performed. Secondary structures and insertion sites of oedogonialean group IIA introns were analyzed. The 204,438-bp Oedocladium genome is 7.9 kb larger than the Oedogonium genome, but its repertoire of conserved genes is remarkably similar and gene order differs by only one reversal. Although the 23.7-kb IR is missing the putative foreign genes found in Oedogonium , it contains sequences coding for a putative phage or bacterial DNA primase and a hypothetical protein. Intergenic sequences are 1.5-fold longer and dispersed repeats are more abundant, but a smaller fraction of the Oedocladium genome is occupied by introns. Six additional group II introns are present, five of which lack ORFs and carry highly similar sequences to that of the ORF-less IIA intron shared with Oedogonium . Secondary structure analysis of the group IIA introns disclosed marked differences in the exon-binding sites; however, each intron showed perfect or nearly perfect base pairing interactions with its target site. Our results suggest that chloroplast genes rearrange more slowly in the Oedogoniales than in the Chaetophorales and raise questions as to what was the nature of the foreign coding sequences in the IR of the common ancestor of the Oedogoniales. They provide the first evidence for intragenomic proliferation of group IIA introns in the Viridiplantae, revealing that intron spread in the Oedocladium lineage likely occurred by retrohoming after sequence divergence of the exon-binding sites.

  1. Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia.

    PubMed

    Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W

    1996-02-15

    Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U).

  2. Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia.

    PubMed Central

    Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W

    1996-01-01

    Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U). PMID:8604302

  3. HybPiper: Extracting coding sequence and introns for phylogenetics from high-throughput sequencing reads using target enrichment1

    PubMed Central

    Johnson, Matthew G.; Gardner, Elliot M.; Liu, Yang; Medina, Rafael; Goffinet, Bernard; Shaw, A. Jonathan; Zerega, Nyree J. C.; Wickett, Norman J.

    2016-01-01

    Premise of the study: Using sequence data generated via target enrichment for phylogenetics requires reassembly of high-throughput sequence reads into loci, presenting a number of bioinformatics challenges. We developed HybPiper as a user-friendly platform for assembly of gene regions, extraction of exon and intron sequences, and identification of paralogous gene copies. We test HybPiper using baits designed to target 333 phylogenetic markers and 125 genes of functional significance in Artocarpus (Moraceae). Methods and Results: HybPiper implements parallel execution of sequence assembly in three phases: read mapping, contig assembly, and target sequence extraction. The pipeline was able to recover nearly complete gene sequences for all genes in 22 species of Artocarpus. HybPiper also recovered more than 500 bp of nontargeted intron sequence in over half of the phylogenetic markers and identified paralogous gene copies in Artocarpus. Conclusions: HybPiper was designed for Linux and Mac OS X and is freely available at https://github.com/mossmatters/HybPiper. PMID:27437175

  4. The complete plastid genome sequence of Eustrephus latifolius (Asparagaceae: Lomandroideae).

    PubMed

    Kim, Hyoung Tae; Kim, Jung Sung; Kim, Joo-Hwan

    2016-01-01

    The complete chloroplast (cp) genome sequence of Eustrephus latifolius was firstly determined in subfamily Lomandriodeae of family Asparagaceae. It was 159,736 bp and contained a large single copy region (82,403 bp) and a small single copy region (13,607 bp) which were separated by two inverted repeat regions (31,863 bp). In total, 132 genes were identified and they were consisted of 83 coding genes, 8 rRNA genes, 38 tRNA genes, 3 pseudogenes. rpl23 and clpP were pseudogenes due to sequence deletions. Among 23 genes containing introns, rps12 and ycf3 contained two introns and the rest had just one intron. The intact ycf68 was identified within an intron of trnI-GAU. The amino acid sequence was almost identical with Phoenix dactylifera in Aracales. Ycf1 of E. latifolius was completely located in IR. It was similar to cp genome structure of Lemna minor, Spirodela polyrhiza, Wolffiella lingulata, Wolffia australiana in Alismatales.

  5. Phylogenetic inferences of Nepenthes species in Peninsular Malaysia revealed by chloroplast (trnL intron) and nuclear (ITS) DNA sequences.

    PubMed

    Bunawan, Hamidun; Yen, Choong Chee; Yaakop, Salmah; Noor, Normah Mohd

    2017-01-26

    The chloroplastic trnL intron and the nuclear internal transcribed spacer (ITS) region were sequenced for 11 Nepenthes species recorded in Peninsular Malaysia to examine their phylogenetic relationship and to evaluate the usage of trnL intron and ITS sequences for phylogenetic reconstruction of this genus. Phylogeny reconstruction was carried out using neighbor-joining, maximum parsimony and Bayesian analyses. All the trees revealed two major clusters, a lowland group consisting of N. ampullaria, N. mirabilis, N. gracilis and N. rafflesiana, and another containing both intermediately distributed species (N. albomarginata and N. benstonei) and four highland species (N. sanguinea, N. macfarlanei, N. ramispina and N. alba). The trnL intron and ITS sequences proved to provide phylogenetic informative characters for deriving a phylogeny of Nepenthes species in Peninsular Malaysia. To our knowledge, this is the first molecular phylogenetic study of Nepenthes species occurring along an altitudinal gradient in Peninsular Malaysia.

  6. In vitro mapping of Myotonic Dystrophy (DM) gene promoter

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Storbeck, C.J.; Sabourin, L.; Baird, S.

    1994-09-01

    The Myotonic Dystrophy Kinase (DMK) gene has been cloned and shared homology to serine/threonine protein kinases. Overexpression of this gene in stably transfected mouse myoblasts has been shown to inhibit fusion into myotubes while myoblasts stably transfected with an antisense construct show increased fusion potential. These experiments, along with data showing that the DM gene is highly expressed in muscle have highlighted the possibility of DMK being involved in myogenesis. The promoter region of the DM gene lacks a consensus TATA box and CAAT box, but harbours numerous transcription binding sites. Clones containing extended 5{prime} upstream sequences (UPS) of DMKmore » only weakly drive the reporter gene chloramphenicol acetyl transferase (CAT) when transfected into C2C12 mouse myoblasts. However, four E-boxes are present in the first intron of the DM gene and transient assays show increased expression of the CAT gene when the first intron is present downstream of these 5{prime} UPS in an orientation dependent manner. Comparison between mouse and human sequence reveals that the regions in the first intron where the E-boxes are located are highly conserved. The mapping of the promoter and the importance of the first intron in the control of DMK expression will be presented.« less

  7. An RNAi-enhanced Logic Circuit for Cancer Specific Detection and Destruction

    DTIC Science & Technology

    2010-07-01

    Bcl-2 family: mBax (Mus musculus), hBax ( Homo sapiens ), and its mutant hBax-S184A [4]. A plasmid containing the tested gene was transfected into HEK...the far-red fluorescent protein mKate to express the Gata3 mStaple. Intron- feature sequences – donor site, branch point, poly- pyrimidine tract, and...intron-exon junction. Among the donor and acceptor sequences found in literature our intron features were chosen according SplicePort [5], an

  8. Discovering weighted patterns in intron sequences using self-adaptive harmony search and back-propagation algorithms.

    PubMed

    Huang, Yin-Fu; Wang, Chia-Ming; Liou, Sing-Wu

    2013-01-01

    A hybrid self-adaptive harmony search and back-propagation mining system was proposed to discover weighted patterns in human intron sequences. By testing the weights under a lazy nearest neighbor classifier, the numerical results revealed the significance of these weighted patterns. Comparing these weighted patterns with the popular intron consensus model, it is clear that the discovered weighted patterns make originally the ambiguous 5SS and 3SS header patterns more specific and concrete.

  9. Discovering Weighted Patterns in Intron Sequences Using Self-Adaptive Harmony Search and Back-Propagation Algorithms

    PubMed Central

    Wang, Chia-Ming; Liou, Sing-Wu

    2013-01-01

    A hybrid self-adaptive harmony search and back-propagation mining system was proposed to discover weighted patterns in human intron sequences. By testing the weights under a lazy nearest neighbor classifier, the numerical results revealed the significance of these weighted patterns. Comparing these weighted patterns with the popular intron consensus model, it is clear that the discovered weighted patterns make originally the ambiguous 5SS and 3SS header patterns more specific and concrete. PMID:23737711

  10. Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication

    PubMed Central

    Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

    2016-01-01

    Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to “Gopoong” and “K-1” were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information. PMID:27271615

  11. Insertion of a self-splicing intron into the mtDNA of atriploblastic animal

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Valles, Y.; Halanych, K.; Boore, J.L.

    2006-04-14

    Nephtys longosetosa is a carnivorous polychaete worm that lives in the intertidal and subtidal zones with worldwide distribution (pleijel&rouse2001). Its mitochondrial genome has the characteristics typical of most metazoans: 37 genes; circular molecule; almost no intergenic sequence; and no significant gene rearrangements when compared to other annelid mtDNAs (booremoritz19981995). Ubiquitous features as small intergenic regions and lack of introns suggested that metazoan mtDNAs are under strong selective pressures to reduce their genome size allowing for faster replication requirements (booremoritz19981995Lynch2005). Yet, in 1996 two type I introns were found in the mtDNA of the basal metazoan Metridium senile (FigureX). Breaking amore » long-standing rule (absence of introns in metazoan mtDNA), this finding was later supported by the further presence of group I introns in other cnidarians. Interestingly, only the class Anthozoa within cnidarians seems to harbor such introns. Although several hundreds of triploblastic metazoan mtDNAs have been sequenced, this study is the first evidence of mitochondrial introns in triploblastic metazoans. The cox1 gene of N. longosetosa has an intron of almost 2 kbs in length. This finding represents as well the first instance of a group II intron (anthozoans harbor group I introns) in all metazoan lineages. Opposite trends are observed within plants, fungi and protist mtDNAs, where introns (both group I and II) and other non-coding sequences are widespread. Plant, fungal and protist mtDNA structure and organization differ enormously from that of metazoan mtDNA. Both, plant and fungal mtDNA are dynamic molecules that undergo high rates of recombination, contain long intergenic spacer regions and harbor both group I and group II introns. However, as metazoans they have a conserved gene content. Protists, on the other hand have a striking variation of gene content and introns that account for the genome size variation. In contrast to this mtDNA structure and organization diversity, current genome level studies point to a monophyletic origin of the mitochondria (REFS), raising questions such as: what are the pressures at work shaping the evolution of the mitochondrial genome at 'higher' levels? What drives the absence of introns and other non-coding spacers in metazoan mtDNA? What characteristics must have an intron to be maintained in an environment where 'extra chromosomes' are usually selected against?« less

  12. [Identification and phylogenetic application of unique nucleotide sequence of nad7 intron2 in Rhodiola (Crassulaceae) species].

    PubMed

    Deng, Ke-Jun; Yang, Zu-Jun; Liu, Cheng; Zhao, Wei; Liu, Chang; Feng, Juan; Ren, Zheng-Long

    2007-03-01

    Genetic characterization of 9 populations of Rhodiola crenulata, R. fastigiata and R. sachalinensis (Crassulaceae) species from Sichuan and Jilin Provinces of China, was investigated using the conserved primer of nad7 intron 2. All PCR products about 800 bp long were shorter than other Crassulaceae plants, which were used as molecular markers to identify the Rhodiola species. The sequence of the products indicated that total exon of 53 bp and intron of 738 bp exhibit only 9 nucleotide variations. Blasting the nad7 sequences to GenBank and the phylogenetic analysis showed that the sequence of Rhodiola species was clusted independently, and the length was smaller than all the registered sequences of higher plants. The result suggests that the Rhiodola species had a unique sequence in this gene region, which might be related to the special growth condition.

  13. Mitochondrial intronic open reading frames in Podospora: Mobility and consecutive exonic sequence variations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sellem, C.H.; Rossignol, M.; Belcour, L.

    1996-06-01

    The mitochondrial genome of 23 wild-type strains belonging to three different species of the filamentous fungus Podospora was examined. Among the 15 optical sequences identified are two intronic reading frames, nad1-i4-orf1 and cox1-i7-orf2. We show that the presence of these sequences was strictly correlated with tightly clustered nucleotide substitutions in the adjacent exon. This correlation applies to the presence or absence of closely related open reading frames (ORFs), found at the same genetic locations, in all the Pyrenomycete genera examined. The recent gain of these optional ORFs in the evolution of the genus Podospora probably account for such sequence differences.more » In the homoplasmic progeny from heteroplasmons constructed between Podospora strains differing by the presence of these optional ORFs, nad1-i4-orf1 and cox1-i7-orf2 appeared highly invasive. Sequence comparisons in the nad1-i4 intron of various strains of the Pyrenomycete family led us to propose a scenario of its evolution that includes several events of loss and gain of intronic ORFs. These results strongly reinforce the idea that group I intronic ORFs are mobile elements and that their transfer, and comcomitant modification of the adjacent exon, could participate in the modular evolution of mitochondrial genomes. 46 refs., 5 figs., 2 tabs.« less

  14. Phylogenetic Distribution of Intron Positions in Alpha-Amylase Genes of Bilateria Suggests Numerous Gains and Losses

    PubMed Central

    Da Lage, Jean-Luc; Maczkowiak, Frédérique; Cariou, Marie-Louise

    2011-01-01

    Most eukaryotes have at least some genes interrupted by introns. While it is well accepted that introns were already present at moderate density in the last eukaryote common ancestor, the conspicuous diversity of intron density among genomes suggests a complex evolutionary history, with marked differences between phyla. The question of the rates of intron gains and loss in the course of evolution and factors influencing them remains controversial. We have investigated a single gene family, alpha-amylase, in 55 species covering a variety of animal phyla. Comparison of intron positions across phyla suggests a complex history, with a likely ancestral intronless gene undergoing frequent intron loss and gain, leading to extant intron/exon structures that are highly variable, even among species from the same phylum. Because introns are known to play no regulatory role in this gene and there is no alternative splicing, the structural differences may be interpreted more easily: intron positions, sizes, losses or gains may be more likely related to factors linked to splicing mechanisms and requirements, and to recognition of introns and exons, or to more extrinsic factors, such as life cycle and population size. We have shown that intron losses outnumbered gains in recent periods, but that “resets” of intron positions occurred at the origin of several phyla, including vertebrates. Rates of gain and loss appear to be positively correlated. No phase preference was found. We also found evidence for parallel gains and for intron sliding. Presence of introns at given positions was correlated to a strong protosplice consensus sequence AG/G, which was much weaker in the absence of intron. In contrast, recent intron insertions were not associated with a specific sequence. In animal Amy genes, population size and generation time seem to have played only minor roles in shaping gene structures. PMID:21611157

  15. Novel BRCA1 mutations and more frequent intron-20 alteration found among 236 women from Western Poland.

    PubMed

    Sobczak, K; Kozłowski, P; Napierała, M; Czarny, J; Woźniak, M; Kapuścińska, M; Lośko, M; Koziczak, M; Jasińska, A; Powierska, J; Braczkowski, R; Breborowicz, J; Godlewski, D; Mackiewicz, A; Krzyzosiak, W

    1997-10-09

    Three different novel BRCA1 mutations, five independent cases of the same 12 bp insertion-duplication in intron-20 and two novel rare BRCA1 sequence variants were identified among 122 Polish women with positive, in most cases moderate family history of breast and/or ovarian cancer, 80 controls and 34 unselected breast cancer tissue specimens. All mutations and variants were germline. The 4153 delA frameshift mutation, the Tyr105Cys missense mutation and two cases of the alteration in intron-20 were found in the group of healthy women with positive family history. Two other cases of the intronic insertion were found in unselected controls. Their carriers had no family history of breast or ovarian cancer but other cancers occurred in their families. The 1782 Trp/STOP nonsense mutation and one case of the insertion in intron-20 were first found in tissue specimens of breast cancer patient and breast/ovarian cancer patient, respectively. Their carriers also had no family history of breast or ovarian cancer. The distribution of the insertion in intron-20 in analysed groups and results of RT-PCR experiments suggest a less prominent role for this variant considered earlier a splicing mutation. This study shows also, that more population-oriented research is needed, involving women with less profound or even no family history of breast and ovarian cancer, to better understand the role and significance of different BRCA1 variants and mutations.

  16. Permanent Neonatal Diabetes Caused by Creation of an Ectopic Splice Site within the INS Gene

    PubMed Central

    Gastaldo, Elena; Harries, Lorna W.; Rubio-Cabezas, Oscar; Castaño, Luis

    2012-01-01

    Background The aim of this study was to characterize the genetic etiology in a patient who presented with permanent neonatal diabetes at 2 months of age. Methodology/Principal Findings Regulatory elements and coding exons 2 and 3 of the INS gene were amplified and sequenced from genomic and complementary DNA samples. A novel heterozygous INS mutation within the terminal intron of the gene was identified in the proband and her affected father. This mutation introduces an ectopic splice site leading to the insertion of 29 nucleotides from the intronic sequence into the mature mRNA, which results in a longer and abnormal transcript. Conclusions/Significance This study highlights the importance of routinely sequencing the exon-intron boundaries and the need to carry out additional studies to confirm the pathogenicity of any identified intronic genetic variants. PMID:22235272

  17. Pea chloroplast tRNA(Lys) (UUU) gene: transcription and analysis of an intron-containing gene.

    PubMed

    Boyer, S K; Mullet, J E

    1988-07-01

    The pea chloroplast trnK gene which encodes tRNA(Lys) (UUU) was sequenced. TrnK is located 210 bp upstream from the promoter of psbA and immediately downstream from the 3'-end of rbcL. The gene is transcribed from the same DNA strand as psbA and rbcL. A 2447 bp intron with class II features is located in the trnK anticodon loop. The intron contains a 506 amino acid open reading frame which could encode an RNA maturase. The primary transcript of trnK is 2.9 kb long; its 5'-end was identified as a site of transcription initiation by in vitro transcription experiments. The 5'-terminus is adjacent to DNA sequences previously identified as transcription promoter elements. The most abundant trnK transcript is 2.5 kb long with termini corresponding to the 5' and 3' ends of the trnK exons. Intron specific RNAs were not detected. This suggests that RNA processing which produces tRNA(Lys) leads to rapid degradation of intron sequences.

  18. Fractal landscape analysis of DNA walks

    NASA Technical Reports Server (NTRS)

    Peng, C. K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Sciortino, F.; Simons, M.; Stanley, H. E.

    1992-01-01

    By mapping nucleotide sequences onto a "DNA walk", we uncovered remarkably long-range power law correlations [Nature 356 (1992) 168] that imply a new scale invariant property of DNA. We found such long-range correlations in intron-containing genes and in non-transcribed regulatory DNA sequences, but not in cDNA sequences or intron-less genes. In this paper, we present more explicit evidences to support our findings.

  19. [Applylication of new type combined fragments: nrDNA ITS+ nad 1-intron 2 for identification of Dendrobium species of Fengdous].

    PubMed

    Geng, Li-xia; Zheng, Rui; Ren, Jie; Niu, Zhi-tao; Sun, Yu-long; Xue, Qing-yun; Liu, Wei; Ding, Xiao-yu

    2015-08-01

    In this study, 17 kinds of Dendrobium species of Fengdous including 39 individuals were collected from 4 provinces. Mitochondrial gene sequences co I, nad 5, nad 1-intron 2 and chloroplast gene sequences rbcL, matK amd psbA-trnH were amplified from these materials, as well as nrDNA ITS. Furthermore, suitable sequences for identification of Dendrobium species of Fengdous were screened by K-2-P and P-distance. The results showed that during the mentioned 7 sequences, nrDNA ITS, nad 1-intron 2 and psbA-trnH which had a high degree of variability could be used to identify Dendrobium species of Fengdous. However, single fragment could not be used to distinguish D. moniliforme and D. huoshanense. Moreover, compared to other combined fragments, new type combined fragments nrDNA ITS+nad 1-intron 2 was more effective in identifying the original plants of Dendrobium species and could be used to identify D. huoshanense and D. moniliforme. Besides, according to the UPGMA tree constructed with nrDNA ITS+nad 1-intron 2, 3 inspected Dendrobium plants were identified as D. huoshanense, D. moniliforme and D. officinale, respectively. This study identified Dendrobium species of Fengdous by combined fragments nrDNA ITS+nad 1-intron 2 for the first time, which provided a more effective basis for identification of Dendrobium species. And this study will be helpful for regulating the market of Fengdous.

  20. Novel methodologies for spectral classification of exon and intron sequences

    NASA Astrophysics Data System (ADS)

    Kwan, Hon Keung; Kwan, Benjamin Y. M.; Kwan, Jennifer Y. Y.

    2012-12-01

    Digital processing of a nucleotide sequence requires it to be mapped to a numerical sequence in which the choice of nucleotide to numeric mapping affects how well its biological properties can be preserved and reflected from nucleotide domain to numerical domain. Digital spectral analysis of nucleotide sequences unfolds a period-3 power spectral value which is more prominent in an exon sequence as compared to that of an intron sequence. The success of a period-3 based exon and intron classification depends on the choice of a threshold value. The main purposes of this article are to introduce novel codes for 1-sequence numerical representations for spectral analysis and compare them to existing codes to determine appropriate representation, and to introduce novel thresholding methods for more accurate period-3 based exon and intron classification of an unknown sequence. The main findings of this study are summarized as follows: Among sixteen 1-sequence numerical representations, the K-Quaternary Code I offers an attractive performance. A windowed 1-sequence numerical representation (with window length of 9, 15, and 24 bases) offers a possible speed gain over non-windowed 4-sequence Voss representation which increases as sequence length increases. A winner threshold value (chosen from the best among two defined threshold values and one other threshold value) offers a top precision for classifying an unknown sequence of specified fixed lengths. An interpolated winner threshold value applicable to an unknown and arbitrary length sequence can be estimated from the winner threshold values of fixed length sequences with a comparable performance. In general, precision increases as sequence length increases. The study contributes an effective spectral analysis of nucleotide sequences to better reveal embedded properties, and has potential applications in improved genome annotation.

  1. Intronic splicing mutations in PTCH1 cause Gorlin syndrome.

    PubMed

    Bholah, Zaynab; Smith, Miriam J; Byers, Helen J; Miles, Emma K; Evans, D Gareth; Newman, William G

    2014-09-01

    Gorlin syndrome is an autosomal dominant disorder characterized by multiple early-onset basal cell carcinoma, odontogenic keratocysts and skeletal abnormalities. It is caused by heterozygous mutations in the tumour suppressor PTCH1. Routine clinical genetic testing, by Sanger sequencing and multiplex ligation-dependent probe amplification (MLPA) to confirm a clinical diagnosis of Gorlin syndrome, identifies a mutation in 60-90 % of cases. We undertook RNA analysis on lymphocytes from ten individuals diagnosed with Gorlin syndrome, but without known PTCH1 mutations by exonic sequencing or MLPA. Two altered PTCH1 transcripts were identified. Genomic DNA sequence analysis identified an intron 7 mutation c.1068-10T>A, which created a strong cryptic splice acceptor site, leading to an intronic insertion of eight bases; this is predicted to create a frameshift p.(His358Alafs*12). Secondly, a deep intronic mutation c.2561-2057A>G caused an inframe insertion of 78 intronic bases in the cDNA transcript, leading to a premature stop codon p.(Gly854fs*3). The mutations are predicted to cause loss of function of PTCH1, consistent with its tumour suppressor function. The findings indicate the importance of RNA analysis to detect intronic mutations in PTCH1 not identified by routine screening techniques.

  2. Evolutionary and biogeographical implications of degraded LAGLIDADG endonuclease functionality and group I intron occurrence in stony corals (Scleractinia) and mushroom corals (Corallimorpharia).

    PubMed

    Celis, Juan Sebastián; Edgell, David R; Stelbrink, Björn; Wibberg, Daniel; Hauffe, Torsten; Blom, Jochen; Kalinowski, Jörn; Wilke, Thomas

    2017-01-01

    Group I introns and homing endonuclease genes (HEGs) are mobile genetic elements, capable of invading target sequences in intron-less genomes. LAGLIDADG HEGs are the largest family of endonucleases, playing a key role in the mobility of group I introns in a process known as 'homing'. Group I introns and HEGs are rare in metazoans, and can be mainly found inserted in the COXI gene of some sponges and cnidarians, including stony corals (Scleractinia) and mushroom corals (Corallimorpharia). Vertical and horizontal intron transfer mechanisms have been proposed as explanations for intron occurrence in cnidarians. However, the central role of LAGLIDADG motifs in intron mobility mechanisms remains poorly understood. To resolve questions regarding the evolutionary origin and distribution of group I introns and HEGs in Scleractinia and Corallimorpharia, we examined intron/HEGs sequences within a comprehensive phylogenetic framework. Analyses of LAGLIDADG motif conservation showed a high degree of degradation in complex Scleractinia and Corallimorpharia. Moreover, the two motifs lack the respective acidic residues necessary for metal-ion binding and catalysis, potentially impairing horizontal intron mobility. In contrast, both motifs are highly conserved within robust Scleractinia, indicating a fully functional endonuclease capable of promoting horizontal intron transference. A higher rate of non-synonymous substitutions (Ka) detected in the HEGs of complex Scleractinia and Corallimorpharia suggests degradation of the HEG, whereas lower Ka rates in robust Scleractinia are consistent with a scenario of purifying selection. Molecular-clock analyses and ancestral inference of intron type indicated an earlier intron insertion in complex Scleractinia and Corallimorpharia in comparison to robust Scleractinia. These findings suggest that the lack of horizontal intron transfers in the former two groups is related to an age-dependent degradation of the endonuclease activity. Moreover, they also explain the peculiar geographical patterns of introns in stony and mushroom corals.

  3. DIP1 modulates stem cell homeostasis in Drosophila through regulation of sisR-1.

    PubMed

    Wong, Jing Ting; Akhbar, Farzanah; Ng, Amanda Yunn Ee; Tay, Mandy Li-Ian; Loi, Gladys Jing En; Pek, Jun Wei

    2017-10-02

    Stable intronic sequence RNAs (sisRNAs) are by-products of splicing and regulate gene expression. How sisRNAs are regulated is unclear. Here we report that a double-stranded RNA binding protein, Disco-interacting protein 1 (DIP1) regulates sisRNAs in Drosophila. DIP1 negatively regulates the abundance of sisR-1 and INE-1 sisRNAs. Fine-tuning of sisR-1 by DIP1 is important to maintain female germline stem cell homeostasis by modulating germline stem cell differentiation and niche adhesion. Drosophila DIP1 localizes to a nuclear body (satellite body) and associates with the fourth chromosome, which contains a very high density of INE-1 transposable element sequences that are processed into sisRNAs. DIP1 presumably acts outside the satellite bodies to regulate sisR-1, which is not on the fourth chromosome. Thus, our study identifies DIP1 as a sisRNA regulatory protein that controls germline stem cell self-renewal in Drosophila.Stable intronic sequence RNAs (sisRNAs) are by-products of splicing from introns with roles in embryonic development in Drosophila. Here, the authors show that the RNA binding protein DIP1 regulates sisRNAs in Drosophila, which is necessary for germline stem cell homeostasis.

  4. Origin and evolution of spliceosomal introns

    PubMed Central

    2012-01-01

    Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section. PMID:22507701

  5. Intermediate introns in nuclear genes of euglenids - are they a distinct type?

    PubMed

    Milanowski, Rafał; Gumińska, Natalia; Karnkowska, Anna; Ishikawa, Takao; Zakryś, Bożena

    2016-02-29

    Nuclear genes of euglenids contain two major types of introns: conventional spliceosomal and nonconventional introns. The latter are characterized by variable non-canonical borders, RNA secondary structure that brings intron ends together, and an unknown mechanism of removal. Some researchers also distinguish intermediate introns, which combine features of both types. They form a stable RNA secondary structure and are classified into two subtypes depending on whether they contain one (intermediate/nonconventional subtype) or both (conventional/intermediate subtype) canonical spliceosomal borders. However, it has been also postulated that most introns classified as intermediate could simply be special cases of conventional or nonconventional introns. Sequences of tubB, hsp90 and gapC genes from six strains of Euglena agilis were obtained. They contain four, six, and two or three introns, respectively (the third intron in the gapC gene is unique for just one strain). Conventional introns were present at three positions: two in the tubB gene (at one position conventional/intermediate introns were also found) and one in the gapC gene. Nonconventional introns are present at ten positions: two in the tubB gene (at one position intermediate/nonconventional introns were also found), six in hsp90 (at four positions intermediate/nonconventional introns were also found), and two in the gapC gene. Sequence and RNA secondary structure analyses of nonconventional introns confirmed that their most strongly conserved elements are base pairing nucleotides at positions +4, +5 and +6/ -8, -7 and -6 (in most introns CAG/CTG nucleotides were observed). It was also confirmed that the presence of the 5' GT/C end in intermediate/nonconventional introns is not the result of kinship with conventional introns, but is due to evolutionary pressure to preserve the purine at the 5' end. However, an example of a nonconventional intron with GC-AG ends was shown, suggesting the possibility of intron type conversion between nonconventional and conventional. Furthermore, an analysis of conventional introns revealed that the ability to form a stable RNA secondary structure by some introns is probably not a result of their relationship with nonconventional introns. It was also shown that acquisition of new nonconventional introns is an ongoing process and can be observed at the level of a single species. In the recently acquired intron in the gapC gene an extended direct repeats at the intron-exon junctions are present, suggesting that double-strand break repair process could be the source of new nonconventional introns.

  6. Functional comparison of three transformer gene introns regulating conditional female lethality

    USDA-ARS?s Scientific Manuscript database

    The trasformer gene plays a critical role in the sex determination pathways of many insects. We cloned two transformer gene introns from Anastrepha suspensa, the Caribbean fruit fly. These introns have sequences that putatively have a role in sex-specific splicing patterns that affect sex determinat...

  7. The Reverse Transcriptase/RNA Maturase Protein MatR Is Required for the Splicing of Various Group II Introns in Brassicaceae Mitochondria

    PubMed Central

    Sultan, Laure D.; Grewe, Felix; Rolle, Katarzyna; Abudraham, Sivan; Shevtsov, Sofia; Klipcan, Liron; Barciszewski, Jan; Dietrich, André

    2016-01-01

    Group II introns are large catalytic RNAs that are ancestrally related to nuclear spliceosomal introns. Sequences corresponding to group II RNAs are found in many prokaryotes and are particularly prevalent within plants organellar genomes. Proteins encoded within the introns themselves (maturases) facilitate the splicing of their own host pre-RNAs. Mitochondrial introns in plants have diverged considerably in sequence and have lost their maturases. In angiosperms, only a single maturase has been retained in the mitochondrial DNA: the matR gene found within NADH dehydrogenase 1 (nad1) intron 4. Its conservation across land plants and RNA editing events, which restore conserved amino acids, indicates that matR encodes a functional protein. However, the biological role of MatR remains unclear. Here, we performed an in vivo investigation of the roles of MatR in Brassicaceae. Directed knockdown of matR expression via synthetically designed ribozymes altered the processing of various introns, including nad1 i4. Pull-down experiments further indicated that MatR is associated with nad1 i4 and several other intron-containing pre-mRNAs. MatR may thus represent an intermediate link in the gradual evolutionary transition from the intron-specific maturases in bacteria into their versatile spliceosomal descendants in the nucleus. The similarity between maturases and the core spliceosomal Prp8 protein further supports this intriguing theory. PMID:27760804

  8. Regulatory elements of the floral homeotic gene AGAMOUS identified by phylogenetic footprinting and shadowing.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hong, R. L., Hamaguchi, L., Busch, M. A., and Weigel, D.

    2003-06-01

    OAK-B135 In Arabidopsis thaliana, cis-regulatory sequences of the floral homeotic gene AGAMOUS (AG) are located in the second intron. This 3 kb intron contains binding sites for two direct activators of AG, LEAFY (LFY) and WUSCHEL (WUS), along with other putative regulatory elements. We have used phylogenetic footprinting and the related technique of phylogenetic shadowing to identify putative cis-regulatory elements in this intron. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested six of these motifs and found that they are all functionally importantmore » for activity of AG regulatory sequences in A. thaliana. Although there is little obvious sequence similarity outside the Brassicaceae, the intron from cucumber AG has at least partial activity in A. thaliana. Our studies underscore the value of the comparative approach as a tool that complements gene-by-gene promoter dissection, but also highlight that sequence-based studies alone are insufficient for a complete identification of cis-regulatory sites.« less

  9. Processing of Archaebacterial Intron-Containing tRNA Gene Transcripts.

    DTIC Science & Technology

    1987-07-31

    1{ 1. Project Goals: A. To determine the mechanism of tRNA intron processing in the halophilic archaebacteria. B. Characterize and compare the...enzyme(s) responsible for the removal of 5’-flanking sequences from halophilic and sulfur-dependent tRNA gene transcripts. C. Examine the structure and...distribution of tRNA introns in the halophilic archaebacteria. 2. Accomplishments: A. Intron processing mechanism We have succeeded in our primary

  10. Pre-Mrna Introns as a Model for Cryptographic Algorithm:. Theory and Experiments

    NASA Astrophysics Data System (ADS)

    Regoli, Massimo

    2010-01-01

    The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. In particular the RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.

  11. Intron Definition Is Required for Excision of the Minute Virus of Mice Small Intron and Definition of the Upstream Exon

    PubMed Central

    Haut, Donald D.; Pintel, D. J.

    1998-01-01

    Alternative splicing of pre-mRNAs plays a critical role in maximizing the coding capacity of the small parvovirus genome. The small-intron region of minute virus of mice (MVM) pre-mRNAs undergoes an unusual pattern of overlapping alternative splicing—using two donors (D1 and D2) and two acceptors (A1 and A2) within a region of 120 nucleotides—that determines the steady-state ratios of the various viral mRNAs. In this report, we show that the determinants that govern excision of the small intron are complex and are also required for efficient definition of the upstream exon. For the MVM small intron in its natural context, the two donors appear to compete for the splicing machinery: the position of D1 favors its usage, while the primary sequence of D2 must be more like the consensus sequence than is D1 to be used efficiently. We have genetically defined the branch points that are used for generation of the major and minor spliced forms and show that recognition of components of the small-intron acceptors is likely to be the dominant determinant in alternative small-intron excision. We have also identified a G-rich intronic enhancer sequence within the small intron that is essential for splicing of the minor form (D2 to A2) but not the major form (D1 to A1) of MVM mRNAs and is required for efficient definition of the upstream NS2-specific exon. In its natural context, the small intron appears to be excised by a mechanism consistent with intron definition. When the MVM small intron is expanded, various parameters of its excision are altered, indicating that critical cis-acting signals are context dependent. Relative use of the donors and acceptors is altered, and the upstream NS2-specific exon is no longer efficiently defined. The fact that definition of the upstream NS2-specific exon can be achieved by the MVM small intron in its natural context, but not when it is expanded, suggests that the multiple determinants that govern definition and excision of the small intron are required, in concert, for upstream exon definition. Our data are consistent with a model in which alternative splicing of the MVM P4-generated pre-mRNAs is governed by a hybrid of intron- and exon-defining mechanisms. PMID:9499034

  12. Virtual Genome Walking across the 32 Gb Ambystoma mexicanum genome; assembling gene models and intronic sequence.

    PubMed

    Evans, Teri; Johnson, Andrew D; Loose, Matthew

    2018-01-12

    Large repeat rich genomes present challenges for assembly using short read technologies. The 32 Gb axolotl genome is estimated to contain ~19 Gb of repetitive DNA making an assembly from short reads alone effectively impossible. Indeed, this model species has been sequenced to 20× coverage but the reads could not be conventionally assembled. Using an alternative strategy, we have assembled subsets of these reads into scaffolds describing over 19,000 gene models. We call this method Virtual Genome Walking as it locally assembles whole genome reads based on a reference transcriptome, identifying exons and iteratively extending them into surrounding genomic sequence. These assemblies are then linked and refined to generate gene models including upstream and downstream genomic, and intronic, sequence. Our assemblies are validated by comparison with previously published axolotl bacterial artificial chromosome (BAC) sequences. Our analyses of axolotl intron length, intron-exon structure, repeat content and synteny provide novel insights into the genic structure of this model species. This resource will enable new experimental approaches in axolotl, such as ChIP-Seq and CRISPR and aid in future whole genome sequencing efforts. The assembled sequences and annotations presented here are freely available for download from https://tinyurl.com/y8gydc6n . The software pipeline is available from https://github.com/LooseLab/iterassemble .

  13. COL1A1 transgene expression in stably transfected osteoblastic cells. Relative contributions of first intron, 3'-flanking sequences, and sequences derived from the body of the human COL1A1 minigene

    NASA Technical Reports Server (NTRS)

    Breault, D. T.; Lichtler, A. C.; Rowe, D. W.

    1997-01-01

    Collagen reporter gene constructs have be used to identify cell-specific sequences needed for transcriptional activation. The elements required for endogenous levels of COL1A1 expression, however, have not been elucidated. The human COL1A1 minigene is expressed at high levels and likely harbors sequence elements required for endogenous levels of activity. Using stably transfected osteoblastic Py1a cells, we studied a series of constructs (pOBColCAT) designed to characterize further the elements required for high level of expression. pOBColCAT, which contains the COL1A1 first intron, was expressed at 50-100-fold higher levels than ColCAT 3.6, which lacks the first intron. This difference is best explained by improved mRNA processing rather than a transcriptional effect. Furthermore, variation in activity observed with the intron deletion constructs is best explained by altered mRNA splicing. Two major regions of the human COL1A1 minigene, the 3'-flanking sequences and the minigene body, were introduced into pOBColCAT to assess both transcriptional enhancing activity and the effect on mRNA stability. Analysis of the minigene body, which includes the first five exons and introns fused with the terminal six introns and exons, revealed an orientation-independent 5-fold increase in CAT activity. In contrast the 3'-flanking sequences gave rise to a modest 61% increase in CAT activity. Neither region increased the mRNA half-life of the parent construct, suggesting that CAT-specific mRNA instability elements may serve as dominant negative regulators of stability. This study suggests that other sites within the body of the COL1A1 minigene are important for high expression, e.g. during periods of rapid extracellular matrix production.

  14. Phylogenomic Resolution of the Phylogeny of Laurasiatherian Mammals: Exploring Phylogenetic Signals within Coding and Noncoding Sequences.

    PubMed

    Chen, Meng-Yun; Liang, Dan; Zhang, Peng

    2017-08-01

    The interordinal relationships of Laurasiatherian mammals are currently one of the most controversial questions in mammalian phylogenetics. Previous studies mainly relied on coding sequences (CDS) and seldom used noncoding sequences. Here, by data mining public genome data, we compiled an intron data set of 3,638 genes (all introns from a protein-coding gene are considered as a gene) (19,055,073 bp) and a CDS data set of 10,259 genes (20,994,285 bp), covering all major lineages of Laurasiatheria (except Pholidota). We found that the intron data contained stronger and more congruent phylogenetic signals than the CDS data. In agreement with this observation, concatenation and species-tree analyses of the intron data set yielded well-resolved and identical phylogenies, whereas the CDS data set produced weakly supported and incongruent results. Further analyses showed that the phylogeny inferred from the intron data is highly robust to data subsampling and change in outgroup, but the CDS data produced unstable results under the same conditions. Interestingly, gene tree statistical results showed that the most frequently observed gene tree topologies for the CDS and intron data are identical, suggesting that the major phylogenetic signal within the CDS data is actually congruent with that within the intron data. Our final result of Laurasiatheria phylogeny is (Eulipotyphla,((Chiroptera, Perissodactyla),(Carnivora, Cetartiodactyla))), favoring a close relationship between Chiroptera and Perissodactyla. Our study 1) provides a well-supported phylogenetic framework for Laurasiatheria, representing a step towards ending the long-standing "hard" polytomy and 2) argues that intron within genome data is a promising data resource for resolving rapid radiation events across the tree of life. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  15. A KCNH2 branch point mutation causing aberrant splicing contributes to an explanation of genotype-negative long QT syndrome.

    PubMed

    Crotti, Lia; Lewandowska, Marzena A; Schwartz, Peter J; Insolia, Roberto; Pedrazzini, Matteo; Bussani, Erica; Dagradi, Federica; George, Alfred L; Pagani, Franco

    2009-02-01

    Genetic screening of long QT syndrome (LQTS) fails to identify disease-causing mutations in about 30% of patients. So far, molecular screening has focused mainly on coding sequence mutations or on substitutions at canonical splice sites. The purpose of this study was to explore the possibility that intronic variants not at canonical splice sites might affect splicing regulatory elements, lead to aberrant transcripts, and cause LQTS. Molecular screening was performed through DHPLC and sequence analysis. The role of the intronic mutation identified was assessed with a hybrid minigene splicing assay. A three-generation LQTS family was investigated. Molecular screening failed to identify an obvious disease-causing mutation in the coding sequences of the major LQTS genes but revealed an intronic A-to-G substitution in KCNH2 (IVS9-28A/G) cosegregating with the clinical phenotype in family members. In vitro analysis proved that the mutation disrupts the acceptor splice site definition by affecting the branch point (BP) sequence and promoting intron retention. We further demonstrated a tight functional relationship between the BP and the polypyrimidine tract, whose weakness is responsible for the pathological effect of the IVS9-28A/G mutation. We identified a novel BP mutation in KCNH2 that disrupts the intron 9 acceptor splice site definition and causes LQT2. The present finding demonstrates that intronic mutations affecting pre-mRNA processing may contribute to the failure of traditional molecular screening in identifying disease-causing mutations in LQTS subjects and offers a rationale strategy for the reduction of genotype-negative cases.

  16. Base pairing between the 3' exon and an internal guide sequence increases 3' splice site specificity in the Tetrahymena self-splicing rRNA intron.

    PubMed Central

    Suh, E R; Waring, R B

    1990-01-01

    It has been proposed that recognition of the 3' splice site in many group I introns involves base pairing between the start of the 3' exon and a region of the intron known as the internal guide sequence (R. W. Davies, R. B. Waring, J. Ray, T. A. Brown, and C. Scazzocchio, Nature [London] 300:719-724, 1982). We have examined this hypothesis, using the self-splicing rRNA intron from Tetrahymena thermophila. Mutations in the 3' exon that weaken this proposed pairing increased use of a downstream cryptic 3' splice site. Compensatory mutations in the guide sequence that restore this pairing resulted in even stronger selection of the normal 3' splice site. These changes in 3' splice site usage were more pronounced in the background of a mutation (414A) which resulted in an adenine instead of a guanine being the last base of the intron. These results show that the proposed pairing (P10) plays an important role in ensuring that cryptic 3' splice sites are selected against. Surprisingly, the 414A mutation alone did not result in activation of the cryptic 3' splice site. Images PMID:2342465

  17. Contrasting population structure from nuclear intron sequences and mtDNA of humpback whales.

    PubMed

    Palumbi, S R; Baker, C S

    1994-05-01

    Powerful analyses of population structure require information from multiple genetic loci. To help develop a molecular toolbox for obtaining this information, we have designed universal oligonucleotide primers that span conserved intron-exon junctions in a wide variety of animal phyla. We test the utility of exon-primed, intron-crossing amplifications by analyzing the variability of actin intron sequences from humpback, blue, and bowhead whales and comparing the results with mitochondrial DNA (mtDNA) haplotype data. Humpback actin introns fall into two major clades that exist in different frequencies in different oceanic populations. It is surprising that Hawaii and California populations, which are very distinct in mtDNAs, are similar in actin intron alleles. This discrepancy between mtDNA and nuclear DNA results may be due either to differences in genetic drift in mitochondrial and nuclear genes or to preferential movement of males, which do not transmit mtDNA to offspring, between separate breeding grounds. Opposing mtDNA and nuclear DNA results can help clarify otherwise hidden patterns of structure in natural populations.

  18. Nucleotide sequence of the COX1 gene in Kluyveromyces lactis mitochondrial DNA: evidence for recent horizontal transfer of a group II intron.

    PubMed

    Hardy, C M; Clark-Walker, G D

    1991-07-01

    The cytochrome oxidase subunit 1 gene (COX1) in K. lactis K8 mtDNA spans 8,826 bp and contains five exons (termed E1-E5) totalling 1,602 bp that show 88% nucleotide base matching and 91% amino acid homology to the equivalent gene in S. cerevisiae. The four introns (termed K1 cox1.1-1.4) contain open reading frames encoding proteins of 786, 333, 319 and 395 amino acids respectively that potentially encode maturase enzymes. The first intron belongs to group II whereas the remaining three are group I type B. Introns K1 cox1.1, 1.3, and 1.4 are found at identical locations to introns Sc cox1.2, 1.5 a, and 1.5 b respectively from S. cerevisiae. Horizontal transfer of an intron between recent progenitors of K. lactis and S. cerevisiae is suggested by the observation that K1 cox1.1 and Sc cox1.2 show 96% base matching. Sequence comparisons between K1 cox1.3/Sc cox1.5 a and K1 cox1.4/Sc cox1.5 b suggest that these introns are likely to have been present in the ancestral COX1 gene of these yeasts. Intron K1 cox1.2 is not found in S. cerevisiae and appears at an unique location in K. lactis. A feature of the DNA sequences of the group I introns K1 cox1.2, 1.3, and 1.4 is the presence of 11 GC-rich clusters inserted into both coding and noncoding regions. Immediately downstream of the COX1 gene is the ATPase subunit 8 gene (A8) that shows 82.6% base matching to its counterpart in S. cerevisiae mtDNA.

  19. Molecular and bioinformatical characterization of a novel superfamily of cysteine-rich peptides from arthropods.

    PubMed

    Zeng, Xian-Chun; Nie, Yao; Luo, Xuesong; Wu, Shifen; Shi, Wanxia; Zhang, Lei; Liu, Yichen; Cao, Hanjun; Yang, Ye; Zhou, Jianping

    2013-03-01

    The full-length cDNA sequences of two novel cysteine-rich peptides (referred to as HsVx1 and MmKTx1) were obtained from scorpions. The two peptides represent a novel class of cysteine-rich peptides with a unique cysteine pattern. The genomic sequence of HsVx1 is composed of three exons interrupted by two introns that are localized in the mature peptide encoding region and inserted in phase 1 and phase 2, respectively. Such a genomic organization markedly differs from those of other peptides from scorpions described previously. Genome-wide search for the orthologs of HsVx1 identified 59 novel cysteine-rich peptides from arthropods. These peptides share a consistent cysteine pattern with HsVx1. Genomic comparison revealed extensive intron length differences and intronic number and position polymorphisms among the genes of these peptides. Further analysis identified 30 cases of intron sliding, 1 case of intron gain and 22 cases of intron loss occurred with the genes of the HsVx1 and HsVx1-like peptides. It is interesting to see that three HsVx1-like peptides XP_001658928, XP_001658929 and XP_001658930 were derived from a single gene (XP gene): the former two were generated from alternative splicing; the third one was encoded by a DNA region in the reverse complementary strand of the third intron of the XP gene. These findings strongly suggest that the genes of these cysteine-rich peptides were evolved by intron sliding, intron gain/loss, gene recombination and alternative splicing events in response to selective forces without changing their cysteine pattern. The evolution of these genes is dominated by intron sliding and intron loss. Copyright © 2012 Elsevier Inc. All rights reserved.

  20. The Reverse Transcriptase/RNA Maturase Protein MatR Is Required for the Splicing of Various Group II Introns in Brassicaceae Mitochondria.

    PubMed

    Sultan, Laure D; Mileshina, Daria; Grewe, Felix; Rolle, Katarzyna; Abudraham, Sivan; Głodowicz, Paweł; Niazi, Adnan Khan; Keren, Ido; Shevtsov, Sofia; Klipcan, Liron; Barciszewski, Jan; Mower, Jeffrey P; Dietrich, André; Ostersetzer-Biran, Oren

    2016-11-01

    Group II introns are large catalytic RNAs that are ancestrally related to nuclear spliceosomal introns. Sequences corresponding to group II RNAs are found in many prokaryotes and are particularly prevalent within plants organellar genomes. Proteins encoded within the introns themselves (maturases) facilitate the splicing of their own host pre-RNAs. Mitochondrial introns in plants have diverged considerably in sequence and have lost their maturases. In angiosperms, only a single maturase has been retained in the mitochondrial DNA: the matR gene found within NADH dehydrogenase 1 (nad1) intron 4. Its conservation across land plants and RNA editing events, which restore conserved amino acids, indicates that matR encodes a functional protein. However, the biological role of MatR remains unclear. Here, we performed an in vivo investigation of the roles of MatR in Brassicaceae. Directed knockdown of matR expression via synthetically designed ribozymes altered the processing of various introns, including nad1 i4. Pull-down experiments further indicated that MatR is associated with nad1 i4 and several other intron-containing pre-mRNAs. MatR may thus represent an intermediate link in the gradual evolutionary transition from the intron-specific maturases in bacteria into their versatile spliceosomal descendants in the nucleus. The similarity between maturases and the core spliceosomal Prp8 protein further supports this intriguing theory. © 2016 American Society of Plant Biologists. All rights reserved.

  1. The alternative oxidase family of Vitis vinifera reveals an attractive model to study the importance of genomic design.

    PubMed

    Costa, José Hélio; de Melo, Dirce Fernandes; Gouveia, Zélia; Cardoso, Hélia Guerra; Peixe, Augusto; Arnholdt-Schmitt, Birgit

    2009-12-01

    'Genomic design' refers to the structural organization of gene sequences. Recently, the role of intron sequences for gene regulation is being better understood. Further, introns possess high rates of polymorphism that are considered as the major source for speciation. In molecular breeding, the length of gene-specific introns is recognized as a tool to discriminate genotypes with diverse traits of agronomic interest. 'Economy selection' and 'time-economy selection' have been proposed as models for explaining why highly expressed genes typically contain small introns. However, in contrast to these theories, plant-specific selection reveals that highly expressed genes contain introns that are large. In the presented research, 'wet'Aox gene identification from grapevine is advanced by a bioinformatics approach to study the species-specific organization of Aox gene structures in relation to available expressed sequence tag (EST) data. Two Aox1 and one Aox2 gene sequences have been identified in Vitis vinifera using grapevine cultivars from Portugal and Germany. Searching the complete genome sequence data of two grapevine cultivars confirmed that V. vinifera alternative oxidase (Aox) is encoded by a small multigene family composed of Aox1a, Aox1b and Aox2. An analysis of EST distribution revealed high expression of the VvAox2 gene. A relationship between the atypical long primary transcript of VvAox2 (in comparison to other plant Aox genes) and its expression level is suggested. V. vinifera Aox genes contain four exons interrupted by three introns except for Aox1a which contains an additional intron in the 3'-UTR. The lengths of primary Aox transcripts were estimated for each gene in two V. vinifera varieties: PN40024 and Pinot Noir. In both varieties, Aox1a and Aox1b contained small introns that corresponded to primary transcript lengths ranging from 1501 to 1810 bp. The Aox2 of PN40024 (12 329 bp) was longer than that from Pinot Noir (7279 bp) because of selection against a transposable-element insertion that is 5028 bp in size. An EST database basic local alignment search tool (BLAST) search of GenBank revealed the following ESTs percentages for each gene: Aox1a (26.2%), Aox1b (11.9%) and Aox2 (61.9%). Aox1a was expressed in fruits and roots, Aox1b expression was confined to flowers and Aox2 was ubiquitously expressed. These data for V. vinifera show that atypically long Aox intron lengths are related to high levels of gene expression. Furthermore, it is shown for the first time that two grapevine cultivars can be distinguished by Aox intron length polymorphism.

  2. Selfish DNA: homing endonucleases find a home.

    PubMed

    Edgell, David R

    2009-02-10

    Self-splicing group I introns come in two flavours - those with a homing endonuclease to promote mobility of the intron, and those without an endonuclease. How homing endonucleases and self-splicing introns associate to form a composite selfish genetic element is a question of long-standing interest. Recent work has revealed that a shared characteristic of both introns and endonucleases, the targeting of conserved sequences, may provide the impetus for the evolution of composite mobile genetic elements.

  3. Cis-regulatory underpinnings of human GLI3 expression in embryonic craniofacial structures and internal organs.

    PubMed

    Abbasi, Amir A; Minhas, Rashid; Schmidt, Ansgar; Koch, Sabine; Grzeschik, Karl-Heinz

    2013-10-01

    The zinc finger transcription factor Gli3 is an important mediator of Sonic hedgehog (Shh) signaling. During early embryonic development Gli3 participates in patterning and growth of the central nervous system, face, skeleton, limb, tooth and gut. Precise regulation of the temporal and spatial expression of Gli3 is crucial for the proper specification of these structures in mammals and other vertebrates. Previously we reported a set of human intronic cis-regulators controlling almost the entire known repertoire of endogenous Gli3 expression in mouse neural tube and limbs. However, the genetic underpinning of GLI3 expression in other embryonic domains such as craniofacial structures and internal organs remain elusive. Here we demonstrate in a transgenic mice assay the potential of a subset of human/fish conserved non-coding sequences (CNEs) residing within GLI3 intronic intervals to induce reporter gene expression at known regions of endogenous Gli3 transcription in embryonic domains other than central nervous system (CNS) and limbs. Highly specific reporter expression was observed in craniofacial structures, eye, gut, and genitourinary system. Moreover, the comparison of expression patterns directed by these intronic cis-acting regulatory elements in mouse and zebrafish embryos suggests that in accordance with sequence conservation, the target site specificity of a subset of these elements remains preserved among these two lineages. Taken together with our recent investigations, it is proposed here that during vertebrate evolution the Gli3 expression control acquired multiple, independently acting, intronic enhancers for spatiotemporal patterning of CNS, limbs, craniofacial structures and internal organs. © 2013 The Authors Development, Growth & Differentiation © 2013 Japanese Society of Developmental Biologists.

  4. Introduction of a novel 18S rDNA gene arrangement along with distinct ITS region in the saline water microalga Dunaliella

    PubMed Central

    2010-01-01

    Comparison of 18S rDNA gene sequences is a very promising method for identification and classification of living organisms. Molecular identification and discrimination of different Dunaliella species were carried out based on the size of 18S rDNA gene and, number and position of introns in the gene. Three types of 18S rDNA structure have already been reported: the gene with a size of ~1770 bp lacking any intron, with a size of ~2170 bp consisting one intron near 5' terminus, and with a size of ~2570 bp harbouring two introns near 5' and 3' termini. Hereby, we report a new 18S rDNA gene arrangement in terms of intron localization and nucleotide sequence in a Dunaliella isolated from Iranian salt lakes (ABRIINW-M1/2). PCR amplification with genus-specific primers resulted in production of a ~2170 bp DNA band, which is similar to that of D. salina 18S rDNA gene containing only one intron near 5' terminus. Whilst, sequence composition of the gene revealed the lack of any intron near 5' terminus in our isolate. Furthermore, another alteration was observed due to the presence of a 440 bp DNA fragment near 3' terminus. Accordingly, 18S rDNA gene of the isolate is clearly different from those of D. salina and any other Dunaliella species reported so far. Moreover, analysis of ITS region sequence showed the diversity of this region compared to the previously reported species. 18S rDNA and ITS sequences of our isolate were submitted with accesion numbers of EU678868 and EU927373 in NCBI database, respectively. The optimum growth rate of this isolate occured at the salinity level of 1 M NaCl. The maximum carotenoid content under stress condition of intense light (400 μmol photon m-2 s-1), high salinity (4 M NaCl) and deficiency of nitrate and phosphate nutritions reached to 240 ng/cell after 15 days. PMID:20377865

  5. Complete plastid genome sequence of the chickpea (Cicer arietinum) and the phylogenetic distribution of rps12 and clpP intron losses among legumes (Leguminosae)

    PubMed Central

    Jansen, Robert K.; Wojciechowski, Martin F.; Sanniyasi, Elumalai; Lee, Seung-Bum; Daniell, Henry

    2008-01-01

    Chickpea (Cicer arietinum, Leguminosae), an important grain legume, is widely used for food and fodder throughout the world. We sequenced the complete plastid genome of chickpea, which is 125,319 bp in size, and contains only one copy of the inverted repeat (IR). The genome encodes 108 genes, including 4 rRNAs, 29 tRNAs, and 75 proteins. The genes rps16, infA, and ycf4 are absent in the chickpea plastid genome, and ndhB has an internal stop codon in the 5′exon, similar to other legumes. Two genes have lost their introns, one in the 3′exon of the transpliced gene rps12, and the one between exons 1 and 2 of clpP; this represents the first documented case of the loss of introns from both of these genes in the same plastid genome. An extensive phylogenetic survey of these intron losses was performed on 302 taxa across legumes and the related family Polygalaceae. The clpP intron has been lost exclusively in taxa from the temperate “IR-lacking clade” (IRLC), whereas the rps12 intron has been lost in most members of the IRLC (with the exception of Wisteria, Callerya, Afgekia, and certain species of Millettia, which represent the earliest diverging lineages of this clade), and in the tribe Desmodieae, which is closely related to the tribes Phaseoleae and Psoraleeae. Data provided here suggest that the loss of the rps12 intron occurred after the loss of the IR. The two new genomic changes identified in the present study provide additional support of the monophyly of the IR-loss clade, and resolution of the pattern of the earliest-branching lineages in this clade. The availability of the complete chickpea plastid genome sequence also provides valuable information on intergenic spacer regions among legumes and endogenous regulatory sequences for plastid genetic engineering. PMID:18638561

  6. Identification of human short introns

    PubMed Central

    Abebrese, Emmanuel L.; Arnold, Zachary R.; Armstrong, Katharine; Burns, Lindsay; Day, R. Thomas; Hsu, Daniel G.; Jarrell, Katherine; Luo, Yi; Mugayo, Daphine

    2017-01-01

    Canonical pre-mRNA splicing requires snRNPs and associated splicing factors to excise conserved intronic sequences, with a minimum intron length required for efficient splicing. Non-canonical splicing–intron excision without the spliceosome–has been documented; most notably, some tRNAs and the XBP1 mRNA contain short introns that are not removed by the spliceosome. There have been some efforts to identify additional short introns, but little is known about how many short introns are processed from mRNAs. Here, we report an approach to identify RNA short introns from RNA-Seq data, discriminating against small genomic deletions. We identify hundreds of short introns conserved among multiple human cell lines. These short introns are often alternatively spliced and are found in a variety of RNAs–both mRNAs and lncRNAs. Short intron splicing efficiency is increased by secondary structure, and we detect both canonical and non-canonical short introns. In many cases, splicing of these short introns from mRNAs is predicted to alter the reading frame and change protein output. Our findings imply that standard gene prediction models which often assume a lower limit for intron size fail to predict short introns effectively. We conclude that short introns are abundant in the human transcriptome, and short intron splicing represents an added layer to mRNA regulation. PMID:28520720

  7. Biotechnological applications of mobile group II introns and their reverse transcriptases: gene targeting, RNA-seq, and non-coding RNA analysis.

    PubMed

    Enyeart, Peter J; Mohr, Georg; Ellington, Andrew D; Lambowitz, Alan M

    2014-01-13

    Mobile group II introns are bacterial retrotransposons that combine the activities of an autocatalytic intron RNA (a ribozyme) and an intron-encoded reverse transcriptase to insert site-specifically into DNA. They recognize DNA target sites largely by base pairing of sequences within the intron RNA and achieve high DNA target specificity by using the ribozyme active site to couple correct base pairing to RNA-catalyzed intron integration. Algorithms have been developed to program the DNA target site specificity of several mobile group II introns, allowing them to be made into 'targetrons.' Targetrons function for gene targeting in a wide variety of bacteria and typically integrate at efficiencies high enough to be screened easily by colony PCR, without the need for selectable markers. Targetrons have found wide application in microbiological research, enabling gene targeting and genetic engineering of bacteria that had been intractable to other methods. Recently, a thermostable targetron has been developed for use in bacterial thermophiles, and new methods have been developed for using targetrons to position recombinase recognition sites, enabling large-scale genome-editing operations, such as deletions, inversions, insertions, and 'cut-and-pastes' (that is, translocation of large DNA segments), in a wide range of bacteria at high efficiency. Using targetrons in eukaryotes presents challenges due to the difficulties of nuclear localization and sub-optimal magnesium concentrations, although supplementation with magnesium can increase integration efficiency, and directed evolution is being employed to overcome these barriers. Finally, spurred by new methods for expressing group II intron reverse transcriptases that yield large amounts of highly active protein, thermostable group II intron reverse transcriptases from bacterial thermophiles are being used as research tools for a variety of applications, including qRT-PCR and next-generation RNA sequencing (RNA-seq). The high processivity and fidelity of group II intron reverse transcriptases along with their novel template-switching activity, which can directly link RNA-seq adaptor sequences to cDNAs during reverse transcription, open new approaches for RNA-seq and the identification and profiling of non-coding RNAs, with potentially wide applications in research and biotechnology.

  8. Towards barcode markers in Fungi: an intron map of Ascomycota mitochondria.

    PubMed

    Santamaria, Monica; Vicario, Saverio; Pappadà, Graziano; Scioscia, Gaetano; Scazzocchio, Claudio; Saccone, Cecilia

    2009-06-16

    A standardized and cost-effective molecular identification system is now an urgent need for Fungi owing to their wide involvement in human life quality. In particular the potential use of mitochondrial DNA species markers has been taken in account. Unfortunately, a serious difficulty in the PCR and bioinformatic surveys is due to the presence of mobile introns in almost all the fungal mitochondrial genes. The aim of this work is to verify the incidence of this phenomenon in Ascomycota, testing, at the same time, a new bioinformatic tool for extracting and managing sequence databases annotations, in order to identify the mitochondrial gene regions where introns are missing so as to propose them as species markers. The general trend towards a large occurrence of introns in the mitochondrial genome of Fungi has been confirmed in Ascomycota by an extensive bioinformatic analysis, performed on all the entries concerning 11 mitochondrial protein coding genes and 2 mitochondrial rRNA (ribosomal RNA) specifying genes, belonging to this phylum, available in public nucleotide sequence databases. A new query approach has been developed to retrieve effectively introns information included in these entries. After comparing the new query-based approach with a blast-based procedure, with the aim of designing a faithful Ascomycota mitochondrial intron map, the first method appeared clearly the most accurate. Within this map, despite the large pervasiveness of introns, it is possible to distinguish specific regions comprised in several genes, including the full NADH dehydrogenase subunit 6 (ND6) gene, which could be considered as barcode candidates for Ascomycota due to their paucity of introns and to their length, above 400 bp, comparable to the lower end size of the length range of barcodes successfully used in animals. The development of the new query system described here would answer the pressing requirement to improve drastically the bioinformatics support to the DNA Barcode Initiative. The large scale investigation of Ascomycota mitochondrial introns performed through this tool, allowing to exclude the introns-rich sequences from the barcode candidates exploration, could be the first step towards a mitochondrial barcoding strategy for these organisms, similar to the standard approach employed in metazoans.

  9. Bio—Cryptography: A Possible Coding Role for RNA Redundancy

    NASA Astrophysics Data System (ADS)

    Regoli, M.

    2009-03-01

    The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions," are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behavior in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.

  10. Multiple recent horizontal transfers of the cox1 intron in Solanaceae and extended co-conversion of flanking exons

    PubMed Central

    2011-01-01

    Background The most frequent case of horizontal transfer in plants involves a group I intron in the mitochondrial gene cox1, which has been acquired via some 80 separate plant-to-plant transfer events among 833 diverse angiosperms examined. This homing intron encodes an endonuclease thought to promote the intron's promiscuous behavior. A promising experimental approach to study endonuclease activity and intron transmission involves somatic cell hybridization, which in plants leads to mitochondrial fusion and genome recombination. However, the cox1 intron has not yet been found in the ideal group for plant somatic genetics - the Solanaceae. We therefore undertook an extensive survey of this family to find members with the intron and to learn more about the evolutionary history of this exceptionally mobile genetic element. Results Although 409 of the 426 species of Solanaceae examined lack the cox1 intron, it is uniformly present in three phylogenetically disjunct clades. Despite strong overall incongruence of cox1 intron phylogeny with angiosperm phylogeny, two of these clades possess nearly identical intron sequences and are monophyletic in intron phylogeny. These two clades, and possibly the third also, contain a co-conversion tract (CCT) downstream of the intron that is extended relative to all previously recognized CCTs in angiosperm cox1. Re-examination of all published cox1 genes uncovered additional cases of extended co-conversion and identified a rare case of putative intron loss, accompanied by full retention of the CCT. Conclusions We infer that the cox1 intron was separately and recently acquired by at least three different lineages of Solanaceae. The striking identity of the intron and CCT from two of these lineages suggests that one of these three intron captures may have occurred by a within-family transfer event. This is consistent with previous evidence that horizontal transfer in plants is biased towards phylogenetically local events. The discovery of extended co-conversion suggests that other cox1 conversions may be longer than realized but obscured by the exceptional conservation of plant mitochondrial sequences. Our findings provide further support for the rampant-transfer model of cox1 intron evolution and recommend the Solanaceae as a model system for the experimental analysis of cox1 intron transfer in plants. PMID:21943226

  11. Horizontal transfer and gene conversion as an important driving force in shaping the landscape of mitochondrial introns.

    PubMed

    Wu, Baojun; Hao, Weilong

    2014-04-16

    Group I introns are highly dynamic and mobile, featuring extensive presence-absence variation and widespread horizontal transfer. Group I introns can invade intron-lacking alleles via intron homing powered by their own encoded homing endonuclease gene (HEG) after horizontal transfer or via reverse splicing through an RNA intermediate. After successful invasion, the intron and HEG are subject to degeneration and sequential loss. It remains unclear whether these mechanisms can fully address the high dynamics and mobility of group I introns. Here, we found that HEGs undergo a fast gain-and-loss turnover comparable with introns in the yeast mitochondrial 21S-rRNA gene, which is unexpected, as the intron and HEG are generally believed to move together as a unit. We further observed extensively mosaic sequences in both the introns and HEGs, and evidence of gene conversion between HEG-containing and HEG-lacking introns. Our findings suggest horizontal transfer and gene conversion can accelerate HEG/intron degeneration and loss, or rescue and propagate HEG/introns, and ultimately result in high HEG/intron turnover rate. Given that up to 25% of the yeast mitochondrial genome is composed of introns and most mitochondrial introns are group I introns, horizontal transfer and gene conversion could have served as an important mechanism in introducing mitochondrial intron diversity, promoting intron mobility and consequently shaping mitochondrial genome architecture.

  12. Identification of a deep intronic mutation in the COL6A2 gene by a novel custom oligonucleotide CGH array designed to explore allelic and genetic heterogeneity in collagen VI-related myopathies

    PubMed Central

    2010-01-01

    Background Molecular characterization of collagen-VI related myopathies currently relies on standard sequencing, which yields a detection rate approximating 75-79% in Ullrich congenital muscular dystrophy (UCMD) and 60-65% in Bethlem myopathy (BM) patients as PCR-based techniques tend to miss gross genomic rearrangements as well as copy number variations (CNVs) in both the coding sequence and intronic regions. Methods We have designed a custom oligonucleotide CGH array in order to investigate the presence of CNVs in the coding and non-coding regions of COL6A1, A2, A3, A5 and A6 genes and a group of genes functionally related to collagen VI. A cohort of 12 patients with UCMD/BM negative at sequencing analysis and 2 subjects carrying a single COL6 mutation whose clinical phenotype was not explicable by inheritance were selected and the occurrence of allelic and genetic heterogeneity explored. Results A deletion within intron 1A of the COL6A2 gene, occurring in compound heterozygosity with a small deletion in exon 28, previously detected by routine sequencing, was identified in a BM patient. RNA studies showed monoallelic transcription of the COL6A2 gene, thus elucidating the functional effect of the intronic deletion. No pathogenic mutations were identified in the remaining analyzed patients, either within COL6A genes, or in genes functionally related to collagen VI. Conclusions Our custom CGH array may represent a useful complementary diagnostic tool, especially in recessive forms of the disease, when only one mutant allele is detected by standard sequencing. The intronic deletion we identified represents the first example of a pure intronic mutation in COL6A genes. PMID:20302629

  13. A mixed group II/group III twintron in the Euglena gracilis chloroplast ribosomal protein S3 gene: evidence for intron insertion during gene evolution.

    PubMed Central

    Copertino, D W; Christopher, D A; Hallick, R B

    1991-01-01

    The splicing of a 409 nucleotide intron from the Euglena gracilis chloroplast ribosomal protein S3 gene (rps3) was examined by cDNA cloning and sequencing, and northern hybridization. Based on the characterization of a partially spliced pre-mRNA, the intron was characterized as a 'mixed' twintron, composed of a 311 nucleotide group II intron internal to a 98 nucleotide group III intron. Twintron excision is via a 2-step sequential splicing pathway, with removal of the internal group II intron preceding excision of the external group III intron. Based on secondary structural analysis of the twintron, we propose that group III introns may represent highly degenerate versions of group II introns. The existence of twintrons is interpreted as evidence that group II introns were inserted during the evolution of Euglena chloroplast genes from a common ancestor with eubacteria, archaebacteria, cyanobacteria, and other chloroplasts. Images PMID:1721702

  14. Another heritage from the RNA world: self-excision of intron sequence from nuclear pre-tRNAs.

    PubMed

    Weber, U; Beier, H; Gross, H J

    1996-06-15

    The intervening sequences of nuclear tRNA precursors are known to be excised by tRNA splicing endonuclease. We show here that a T7 transcript corresponding to a pre-tRNA(Tyr) from Arabidopsis thaliana has a highly specific activity for autolytic intron excision. Self-cleavage occurs precisely at the authentic 3'-splice site and at the phosphodiester bond one nucleotide downstream of the authentic 5'-splice site. The reaction results in fragments with 2',3'-cyclic phosphate and 5'-OH termini. It is resistant to proteinase K and/or SDS treatment and is not inhibited by added tRNA. The self-cleavage depends on Mg2+ and is stimulated by spermine and Triton X-100. A set of sequence variants at the cleavage sites has been analysed for autolytic intron excision and, in parallel, for enzymatic in vitro splicing in wheat germ S23 extract. Single-stranded loops are a prerequisite for both reactions. Self-cleavage not only occurs at pyrimidine-A but also at U-U bonds. Since intron self-excision is only about five times slower than the enzymatic intron excision in a wheat germ S23 extract, we propose that the splicing endonuclease may function by improving the preciseness and efficiency of an inherent pre-tRNA self-cleavage activity.

  15. The paradox of MHC-DRB exon/intron evolution: alpha-helix and beta-sheet encoding regions diverge while hypervariable intronic simple repeats coevolve with beta-sheet codons.

    PubMed

    Schwaiger, F W; Weyers, E; Epplen, C; Brün, J; Ruff, G; Crawford, A; Epplen, J T

    1993-09-01

    Twenty-one different caprine and 13 ovine MHC-DRB exon 2 sequences were determined including part of the adjacent introns containing simple repetitive (gt)n(ga)m elements. The positions for highly polymorphic DRB amino acids vary slightly among ungulates and other mammals. From man and mouse to ungulates the basic (gt)n(ga)m structure is fixed in evolution for 7 x 10(7) years whereas ample variations exist in the tandem (gt)n and (ga)m dinucleotides and especially their "degenerated" derivatives. Phylogenetic trees for the alpha-helices and beta-pleated sheets of the ungulate DRB sequences suggest different evolutionary histories. In hoofed animals as well as in humans DRB beta-sheet encoding sequences and adjacent intronic repeats can be assembled into virtually identical groups suggesting coevolution of noncoding as well as coding DNA. In contrast alpha-helices and C-terminal parts of the first DRB domain evolve distinctly. In the absence of a defined mechanism causing specific, site-directed mutations, double-recombination or gene-conversion-like events would readily explain this fact. The role of the intronic simple (gt)n(ga)m repeat is discussed with respect to these genetic exchange mechanisms during evolution.

  16. Myostatin-2 gene structure and polymorphism of the promoter and first intron in the marine fish Sparus aurata: evidence for DNA duplications and/or translocations.

    PubMed

    Nadjar-Boger, Elisabeth; Funkenstein, Bruria

    2011-02-01

    Myostatin (MSTN) is a member of the transforming growth factor-ß superfamily that functions as a negative regulator of skeletal muscle development and growth in mammals. Fish express at least two genes for MSTN: MSTN-1 and MSTN-2. To date, MSTN-2 promoters have been cloned only from salmonids and zebrafish. Here we described the cloning and sequence analysis of MSTN-2 gene and its 5' flanking region in the marine fish Sparus aurata (saMSTN-2). We demonstrate the existence of three alleles of the promoter and three alleles of the first intron. Sequence comparison of the promoter region in the three alleles revealed that although the sequences of the first 1050 bp upstream of the translation start site are almost identical in the three alleles, a substantial sequence divergence is seen further upstream. Careful sequence analysis of the region upstream of the first 1050 bp in the three alleles identified several elements that appear to be repeated in some or all sequences, at different positions. This suggests that the promoter region of saMSTN-2 has been subjected to various chromosomal rearrangements during the course of evolution, reflecting either insertion or deletion events. Screening of several genomic DNA collections indicated differences in allele frequency, with allele 'b' being the most abundant, followed by allele 'c', whereas allele 'a' is relatively rare. Sequence analysis of saMSTN-2 gene also revealed polymorphism in the first intron, identifying three alleles. The length difference in alleles '1R' and '2R' of the first intron is due to the presence of one or two copies of a repeated block of approximately 150 bp, located at the 5' end of the first intron. The third allele, '4R', has an additional insertion of 323 bp located 116 bp upstream of the 3' end of the first intron. Analysis of several DNA collections showed that the '2R' allele is the most common, followed by the '4R' allele, whereas the '1R' allele is relatively rare. Progeny analysis of a full-sib family showed a Mendelian mode of inheritance of the two genetic loci. No clear association was found between the two genetic markers and growth rate. These results show for the first time a substantial degree of polymorphism in both the promoter and first intron of MSTN-2 gene in a perciform fish species which points to chromosomal rearrangements that took place during evolution.

  17. Screening of Variations in CD22 Gene in Children with B-Precursor Acute Lymphoblastic Leukemia.

    PubMed

    Aslar Oner, Deniz; Akin, Dilara Fatma; Sipahi, Kadir; Mumcuoglu, Mine; Ezer, Ustun; Kürekci, A Emin; Akar, Nejat

    2016-09-01

    CD22 is expressed on the surface of B-cell lineage cells from the early progenitor stage of pro-B cell until terminal differentiation to mature B cells. It plays a role in signal transduction and as a regulator of B-cell receptor signaling in B-cell development. We aimed to screen exons 9-14 of the CD22 gene, which is a mutational hot spot region in B-precursor acute lymphoblastic leukemia (pre-B ALL) patients, to find possible genetic variants that could play role in the pathogenesis of pre-B ALL in Turkish children. This study included 109 Turkish children with pre-B ALL who were diagnosed at Losante Hospital for Children with Leukemia. Genomic DNA was extracted from both peripheral blood and bone marrow leukocytes. Gene amplification was performed with PCR, and all samples were screened for the variants by single strand conformation polymorphism. Samples showing band shifts were sequenced on an automated sequencer. In our patient group a total of 9 variants were identified in the CD22 gene by sequencing: a novel variant in intron 10 (T2199G); a missense variant in exon 12; 5 intronic variants between exon 12 and intron 13; a novel intronic variant (C2424T); and a synonymous in exon 13. Thirteen of 109 children (11.9%) carried the T2199G novel intronic variant located in intron 10, and 17 of 109 children (15.6%) carried the C2424T novel intronic variant. Novel variants in the CD22 gene in children with pre-B ALL in Turkey that are not present, in the Human Gene Mutation Database or NCBI SNP database, were found.

  18. The Mitochondrial Genome of Chara vulgaris: Insights into the Mitochondrial DNA Architecture of the Last Common Ancestor of Green Algae and Land PlantsW⃞

    PubMed Central

    Turmel, Monique; Otis, Christian; Lemieux, Claude

    2003-01-01

    Mitochondrial DNA (mtDNA) has undergone radical changes during the evolution of green plants, yet little is known about the dynamics of mtDNA evolution in this phylum. Land plant mtDNAs differ from the few green algal mtDNAs that have been analyzed to date by their expanded size, long spacers, and diversity of introns. We have determined the mtDNA sequence of Chara vulgaris (Charophyceae), a green alga belonging to the charophycean order (Charales) that is thought to be the most closely related alga to land plants. This 67,737-bp mtDNA sequence, displaying 68 conserved genes and 27 introns, was compared with those of three angiosperms, the bryophyte Marchantia polymorpha, the charophycean alga Chaetosphaeridium globosum (Coleochaetales), and the green alga Mesostigma viride. Despite important differences in size and intron composition, Chara mtDNA strikingly resembles Marchantia mtDNA; for instance, all except 9 of 68 conserved genes lie within blocks of colinear sequences. Overall, our genome comparisons and phylogenetic analyses provide unequivocal support for a sister-group relationship between the Charales and the land plants. Only four introns in land plant mtDNAs appear to have been inherited vertically from a charalean algar ancestor. We infer that the common ancestor of green algae and land plants harbored a tightly packed, gene-rich, and relatively intron-poor mitochondrial genome. The group II introns in this ancestral genome appear to have spread to new mtDNA sites during the evolution of bryophytes and charalean green algae, accounting for part of the intron diversity found in Chara and land plant mitochondria. PMID:12897260

  19. The chloroplast and mitochondrial genome sequences of the charophyte Chaetosphaeridium globosum: Insights into the timing of the events that restructured organelle DNAs within the green algal lineage that led to land plants

    PubMed Central

    Turmel, Monique; Otis, Christian; Lemieux, Claude

    2002-01-01

    The land plants and their immediate green algal ancestors, the charophytes, form the Streptophyta. There is evidence that both the chloroplast DNA (cpDNA) and mitochondrial DNA (mtDNA) underwent substantial changes in their architecture (intron insertions, gene losses, scrambling in gene order, and genome expansion in the case of mtDNA) during the evolution of streptophytes; however, because no charophyte organelle DNAs have been sequenced completely thus far, the suite of events that shaped streptophyte organelle genomes remains largely unknown. Here, we have determined the complete cpDNA (131,183 bp) and mtDNA (56,574 bp) sequences of the charophyte Chaetosphaeridium globosum (Coleochaetales). At the levels of gene content (124 genes), intron composition (18 introns), and gene order, Chaetosphaeridium cpDNA is remarkably similar to land-plant cpDNAs, implying that most of the features characteristic of land-plant lineages were gained during the evolution of charophytes. Although the gene content of Chaetosphaeridium mtDNA (67 genes) closely resembles that of the bryophyte Marchantia polymorpha (69 genes), this charophyte mtDNA differs substantially from its land-plant relatives at the levels of size, intron composition (11 introns), and gene order. Our finding that it shares only one intron with its land-plant counterparts supports the idea that the vast majority of mitochondrial introns in land plants appeared after the emergence of these organisms. Our results also suggest that the events accounting for the spacious intergenic spacers found in land-plant mtDNAs took place late during the evolution of charophytes or coincided with the transition from charophytes to land plants. PMID:12161560

  20. Exon–intron organization of genes in the slime mold Physarum polycephalum

    PubMed Central

    Trzcinska-Danielewicz, Joanna; Fronk, Jan

    2000-01-01

    The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon–intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon–intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon–intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3′-ends. PMID:10982858

  1. SURVEY AND SUMMARY: exon-intron organization of genes in the slime mold Physarum polycephalum.

    PubMed

    Trzcinska-Danielewicz, J; Fronk, J

    2000-09-15

    The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon-intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon-intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon-intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3'-ends.

  2. Quantitation of normal CFTR mRNA in CF patients with splice-site mutations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhou, Z.; Olsen, J.C.; Silverman, L.M.

    Previously we identified two mutations in introns of the CFTR gene associated with partially active splice sites and unusual clinical phenotypes. One mutation in intron 19 (3849+10 kb C to T) is common in CF patients with normal sweat chloride values; an 84 bp sequence from intron 19, which contains a stop codon, is inserted between exon 19 and exon 20 in most nasal CFTR transcripts. The other mutation in intron 14B (2789+5 G to A) is associated with elevated sweat chloride levels, but mild pulmonary disease; exon 14B (38 bp) is spliced out of most nasal CFTR transcipts. Themore » remaining CFTR cDNA sequences, other than the 84 bp insertion of exon 14B deletion, are identical to the published sequence. To correlate genotype and phenotype, we used quantitative RT-PCR to determine the levels of normally-spliced CFTR mRNA in nasal epithelia from these patients. CFTR cDNA was amplified (25 cycles) by using primers specific for normally-spliced species, {gamma}-actin cDNA was amplified as a standard.« less

  3. Analysis of the Mitochondrial Genome in Hypomyces aurantius Reveals a Novel Twintron Complex in Fungi.

    PubMed

    Deng, Youjin; Zhang, Qihui; Ming, Ray; Lin, Longji; Lin, Xiangzhi; Lin, Yiying; Li, Xiao; Xie, Baogui; Wen, Zhiqiang

    2016-06-30

    Hypomyces aurantius is a mycoparasite that causes cobweb disease, a most serious disease of cultivated mushrooms. Intra-species identification is vital for disease control, however the lack of genomic data makes development of molecular markers challenging. Small size, high copy number, and high mutation rate of fungal mitochondrial genome makes it a good candidate for intra and inter species differentiation. In this study, the mitochondrial genome of H. H.a0001 was determined from genomic DNA using Illumina sequencing. The roughly 72 kb genome shows all major features found in other Hypocreales: 14 common protein genes, large and small subunit rRNAs genes and 27 tRNAs genes. Gene arrangement comparison showed conserved gene orders in Hypocreales mitochondria are relatively conserved, with the exception of Acremonium chrysogenum and Acremonium implicatum. Mitochondrial genome comparison also revealed that intron length primarily contributes to mitogenome size variation. Seventeen introns were detected in six conserved genes: five in cox1, four in rnl, three in cob, two each in atp6 and cox3, and one in cox2. Four introns were found to contain two introns or open reading frames: cox3-i2 is a twintron containing two group IA type introns; cox2-i1 is a group IB intron encoding two homing endonucleases; and cox1-i4 and cox1-i3 both contain two open reading frame (ORFs). Analyses combining secondary intronic structures, insertion sites, and similarities of homing endonuclease genes reveal two group IA introns arranged side by side within cox3-i2. Mitochondrial data for H. aurantius provides the basis for further studies relating to population genetics and species identification.

  4. Analysis of the Mitochondrial Genome in Hypomyces aurantius Reveals a Novel Twintron Complex in Fungi

    PubMed Central

    Deng, Youjin; Zhang, Qihui; Ming, Ray; Lin, Longji; Lin, Xiangzhi; Lin, Yiying; Li, Xiao; Xie, Baogui; Wen, Zhiqiang

    2016-01-01

    Hypomyces aurantius is a mycoparasite that causes cobweb disease, a most serious disease of cultivated mushrooms. Intra-species identification is vital for disease control, however the lack of genomic data makes development of molecular markers challenging. Small size, high copy number, and high mutation rate of fungal mitochondrial genome makes it a good candidate for intra and inter species differentiation. In this study, the mitochondrial genome of H. H.a0001 was determined from genomic DNA using Illumina sequencing. The roughly 72 kb genome shows all major features found in other Hypocreales: 14 common protein genes, large and small subunit rRNAs genes and 27 tRNAs genes. Gene arrangement comparison showed conserved gene orders in Hypocreales mitochondria are relatively conserved, with the exception of Acremonium chrysogenum and Acremonium implicatum. Mitochondrial genome comparison also revealed that intron length primarily contributes to mitogenome size variation. Seventeen introns were detected in six conserved genes: five in cox1, four in rnl, three in cob, two each in atp6 and cox3, and one in cox2. Four introns were found to contain two introns or open reading frames: cox3-i2 is a twintron containing two group IA type introns; cox2-i1 is a group IB intron encoding two homing endonucleases; and cox1-i4 and cox1-i3 both contain two open reading frame (ORFs). Analyses combining secondary intronic structures, insertion sites, and similarities of homing endonuclease genes reveal two group IA introns arranged side by side within cox3-i2. Mitochondrial data for H. aurantius provides the basis for further studies relating to population genetics and species identification. PMID:27376282

  5. The human myelin oligodendrocyte glycoprotein (MOG) gene: Complete nucleotide sequence and structural characterization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Paule Roth, M.; Malfroy, L.; Offer, C.

    1995-07-20

    Human myelin oligodendrocyte glycoprotein (MOG), a myelin component of the central nervous system, is a candidate target antigen for autoimmune-mediated demyelination. We have isolated and sequenced part of a cosmid clone that contains the entire human MOG gene. The primary nuclear transcript, extending from the putative start of transcription to the site of poly(A) addition, is 15,561 nucleotides in length. The human MOG gene contains 8 exons, separated by 7 introns; canonical intron/exon boundary sites are observed at each junction. The introns vary in size from 242 to 6484 bp and contain numerous repetitive DNA elements, including 14 Alu sequencesmore » within 3 introns. Another Alu element is located in the 3{prime}-untranslated region of the gene. Alu sequences were classified with respect to subfamily assignment. Seven hundred sixty-three nucleotides 5{prime} of the transcription start and 1214 nucleotides 3{prime} of the poly(A) addition sites were also sequenced. The 5{prime}-flanking region revealed the presence of several consensus sequences that could be relevant in the transcription of the MOG gene, in particular binding sites in common with other myelin gene promoters. Two polymorphic intragenic dinucleotide (CA){sub n} and tetranucleotide (TAAA){sub n} repeats were identified and may provide genetic marker tools for association and linkage studies. 50 refs., 3 figs., 3 tabs.« less

  6. Unusual Intron Conservation near Tissue-Regulated Exons Found by Splicing Microarrays

    PubMed Central

    Sugnet, Charles W; Srinivasan, Karpagam; Clark, Tyson A; O'Brien, Georgeann; Cline, Melissa S; Wang, Hui; Williams, Alan; Kulp, David; Blume, John E; Haussler, David; Ares, Manuel

    2006-01-01

    Alternative splicing contributes to both gene regulation and protein diversity. To discover broad relationships between regulation of alternative splicing and sequence conservation, we applied a systems approach, using oligonucleotide microarrays designed to capture splicing information across the mouse genome. In a set of 22 adult tissues, we observe differential expression of RNA containing at least two alternative splice junctions for about 40% of the 6,216 alternative events we could detect. Statistical comparisons identify 171 cassette exons whose inclusion or skipping is different in brain relative to other tissues and another 28 exons whose splicing is different in muscle. A subset of these exons is associated with unusual blocks of intron sequence whose conservation in vertebrates rivals that of protein-coding exons. By focusing on sets of exons with similar regulatory patterns, we have identified new sequence motifs implicated in brain and muscle splicing regulation. Of note is a motif that is strikingly similar to the branchpoint consensus but is located downstream of the 5′ splice site of exons included in muscle. Analysis of three paralogous membrane-associated guanylate kinase genes reveals that each contains a paralogous tissue-regulated exon with a similar tissue inclusion pattern. While the intron sequences flanking these exons remain highly conserved among mammalian orthologs, the paralogous flanking intron sequences have diverged considerably, suggesting unusually complex evolution of the regulation of alternative splicing in multigene families. PMID:16424921

  7. The in vivo use of alternate 3'-splice sites in group I introns.

    PubMed

    Sellem, C H; Belcour, L

    1994-04-11

    Alternative splicing of group I introns has been postulated as a possible mechanism that would ensure the translation of proteins encoded into intronic open reading frames, discontinuous with the upstream exon and lacking an initiation signal. Alternate splice sites were previously depicted according to secondary structures of several group I introns. We present here strong evidence that, in the case of Podospora anserina nad 1-i4 and cox1-i7 mitochondrial introns, alternative splicing events do occur in vivo. Indeed, by PCR experiments we have detected molecules whose sequence is precisely that expected if the predicted alternate 3'-splice sites were used.

  8. Group I intron-mediated trans-splicing in mitochondria of Gigaspora rosea and a robust phylogenetic affiliation of arbuscular mycorrhizal fungi with Mortierellales.

    PubMed

    Nadimi, Maryam; Beaudet, Denis; Forget, Lise; Hijri, Mohamed; Lang, B Franz

    2012-09-01

    Gigaspora rosea is a member of the arbuscular mycorrhizal fungi (AMF; Glomeromycota) and a distant relative of Glomus species that are beneficial to plant growth. To allow for a better understanding of Glomeromycota, we have sequenced the mitochondrial DNA of G. rosea. A comparison with Glomus mitochondrial genomes reveals that Glomeromycota undergo insertion and loss of mitochondrial plasmid-related sequences and exhibit considerable variation in introns. The gene order between the two species is almost completely reshuffled. Furthermore, Gigaspora has fragmented cox1 and rns genes, and an unorthodox initiator tRNA that is tailored to decoding frequent UUG initiation codons. For the fragmented cox1 gene, we provide evidence that its RNA is joined via group I-mediated trans-splicing, whereas rns RNA remains in pieces. According to our model, the two cox1 precursor RNA pieces are brought together by flanking cox1 exon sequences that form a group I intron structure, potentially in conjunction with the nad5 intron 3 sequence. Finally, we present analyses that address the controversial phylogenetic association of Glomeromycota within fungi. According to our results, Glomeromycota are not a separate group of paraphyletic zygomycetes but branch together with Mortierellales, potentially also Harpellales.

  9. Isolation and identification of gene-specific microRNAs.

    PubMed

    Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao

    2006-01-01

    Prediction of microRNA (miRNA) candidates using computer programming has identified hundreds and hundreds of genomic hairpin sequences, of which, the functions remain to be determined. Because direct transfection of hairpin-like miRNA precursors (pre)-miRNAs in mammalian cells is not always sufficient to trigger effective RNA-induced gene-silencing complex (RISC) assembly, a key step for RNA interference (RNAi)-related gene silencing, we developed an intronic miRNA-expressing system to overcome this problem, and successfully increased the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. By insertion of a hairpin-like pre-miRNA structure into the intron region of a gene, this intronic miRNA biogenesis system has been found to depend on a coupled interaction of nascent precursor messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA was transcribed by RNA type II polymerases, coexpressed with a primary gene transcript, and excised out of its encoding gene transcript by intracellular RNA splicing and processing mechanisms. Currently, some ribonuclease III endonucleases have been found to be involved in the processing of spliced introns and probably facilitating the intronic miRNA maturation. Using this miRNA-expressing system, we have shown for the first time that the intron-derived miRNAs were able to induce strong RNAi effects in not only human and mouse cells but also zebrafish, chicken embryos, and adult mice. Based on the strand complementarity between the designed miRNA and its target gene sequence, we have also developed a miRNA isolation protocol to purify and identify the mature miRNAs generated by the intronic miRNA-expressing system. Several intronic miRNA identities and structures are currently confirmed to be active in vitro and in vivo. According to this proof- of-principle method, we now have the knowledge to design pre-miRNA inserts that are more efficient and effective for the intronic miRNA-expressing system.

  10. Isolation and identification of gene-specific microRNAs.

    PubMed

    Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao

    2013-01-01

    Computer programming has identified hundreds of genomic hairpin sequences, many with functions remain to be determined. Because direct transfection of hairpin-like miRNA precursors (pre)-miRNAs in mammalian cells is not always sufficient to trigger effective RNA-induced gene silencing complex (RISC) assembly, a key step for RNA interference (RNAi)-related gene silencing, we developed an intronic miRNA-expressing system to overcome this problem by inserting a hairpin-like pre-miRNA structure into the intron region of a gene and successfully increased the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. This intronic miRNA biogenesis has been found to depend on a coupled interaction of nascent precursor messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA was transcribed by RNA type II polymerases, coexpressed with a primary gene transcript, and excised out of its encoding gene transcript by intracellular RNA splicing and processing mechanisms. Currently, some ribonuclease III endonucleases have been found to be involved in the processing of spliced introns and probably facilitating the intronic miRNA maturation. Using this miRNA generation system, we have shown for the first time that the intron-derived miRNAs were able to induce strong RNAi effects in not only human and mouse cells but also zebrafishes, chicken embryos, and adult mice. We have also developed an miRNA isolation protocol, based on the complementarity between the designed miRNA and its target gene sequence, to purify and identify the mature miRNAs generated by the intronic miRNA-expressing system. Several intronic miRNA identities and structures are currently confirmed to be active in vitro and in vivo. According to this proven-of-principle method, we now have full knowledge to design pre-miRNA inserts that are more efficient and effective for the intronic miRNA-expressing systems.

  11. Human Ro60 (SSA2) genomic organization and sequence alterations, examined in cutaneous lupus erythematosus.

    PubMed

    Millard, T P; Ashton, G H S; Kondeatis, E; Vaughan, R W; Hughes, G R V; Khamashta, M A; Hawk, J L M; McGregor, J M; McGrath, J A

    2002-02-01

    The Ro 60 kDa protein (Ro60 or SSA2) is the major component of the Ro ribonucleoprotein (Ro RNP) complex, to which an immune response is a specific feature of several autoimmune diseases. The genomic organization and any sequence variation within the DNA encoding Ro60 are unknown. To characterize the Ro60 gene structure and to assess whether any sequence alterations might be associated with serum anti-Ro antibody in subacute cutaneous lupus erythematosus (SCLE), thus potentially providing new insight into disease pathogenesis. The cDNA sequence for Ro60 was obtained from the NCBI database and used for a BLAST search for a clone containing the entire genomic sequence. The intron-exon borders were confirmed by designing intronic primer pairs to flank each exon, which were then used to amplify genomic DNA for automated sequencing from 36 caucasian patients with SCLE (anti-Ro positive) and 49 with discoid LE (DLE, anti-Ro negative), in addition to 36 healthy caucasian controls. Heteroduplex analysis of polymerase chain reaction (PCR) products from patients and controls spanning all Ro60 exons (1-8) revealed a common bandshift in the PCR products spanning exon 7. Sequencing of the corresponding PCR products demonstrated an A > G substitution at nucleotide position 1318-7, within the consensus acceptor splice site of exon 7 (GenBank XM001901). The allele frequencies were major allele A (0.71) and minor allele G (0.29) in 72 control chromosomes, with no significant differences found between SCLE patients, DLE patients and controls. The genomic organization of the DNA encoding the Ro60 protein is described, including a common polymorphism within the consensus acceptor splice site of exon 7. Our delineation of a strategy for the genomic amplification of Ro60 forms a basis for further examination of the pathological functions of the Ro RNP in autoimmune disease.

  12. The complete chloroplast DNA sequence of Eleutherococcus senticosus (Araliaceae); comparative evolutionary analyses with other three asterids.

    PubMed

    Yi, Dong-Keun; Lee, Hae-Lim; Sun, Byung-Yun; Chung, Mi Yoon; Kim, Ki-Joong

    2012-05-01

    This study reports the complete chloroplast (cp) DNA sequence of Eleutherococcus senticosus (GenBank: JN 637765), an endangered endemic species. The genome is 156,768 bp in length, and contains a pair of inverted repeat (IR) regions of 25,930 bp each, a large single copy (LSC) region of 86,755 bp and a small single copy (SSC) region of 18,153 bp. The structural organization, gene and intron contents, gene order, AT content, codon usage, and transcription units of the E. senticosus chloroplast genome are similar to that of typical land plant cp DNA. We aligned and analyzed the sequences of 86 coding genes, 19 introns and 113 intergenic spacers (IGS) in three different taxonomic hierarchies; Eleutherococcus vs. Panax, Eleutherococcus vs. Daucus, and Eleutherococcus vs. Nicotiana. The distribution of indels, the number of polymorphic sites and nucleotide diversity indicate that positional constraint is more important than functional constraint for the evolution of cp genome sequences in Asterids. For example, the intron sequences in the LSC region exhibited base substitution rates 5-11-times higher than that of the IR regions, while the intron sequences in the SSC region evolved 7-14-times faster than those in the IR region. Furthermore, the Ka/Ks ratio of the gene coding sequences supports a stronger evolutionary constraint in the IR region than in the LSC or SSC regions. Therefore, our data suggest that selective sweeps by base collection mechanisms more frequently eliminate polymorphisms in the IR region than in other regions. Chloroplast genome regions that have high levels of base substitutions also show higher incidences of indels. Thirty-five simple sequence repeat (SSR) loci were identified in the Eleutherococcus chloroplast genome. Of these, 27 are homopolymers, while six are di-polymers and two are tri-polymers. In addition to the SSR loci, we also identified 18 medium size repeat units ranging from 22 to 79 bp, 11 of which are distributed in the IGS or intron regions. These medium size repeats may contribute to developing a cp genome-specific gene introduction vector because the region may use for specific recombination sites.

  13. Organellar maturases: A window into the evolution of the spliceosome.

    PubMed

    Schmitz-Linneweber, Christian; Lampe, Marie-Kristin; Sultan, Laure D; Ostersetzer-Biran, Oren

    2015-09-01

    During the evolution of eukaryotic genomes, many genes have been interrupted by intervening sequences (introns) that must be removed post-transcriptionally from RNA precursors to form mRNAs ready for translation. The origin of nuclear introns is still under debate, but one hypothesis is that the spliceosome and the intron-exon structure of genes have evolved from bacterial-type group II introns that invaded the eukaryotic genomes. The group II introns were most likely introduced into the eukaryotic genome from an α-proteobacterial predecessor of mitochondria early during the endosymbiosis event. These self-splicing and mobile introns spread through the eukaryotic genome and later degenerated. Pieces of introns became part of the general splicing machinery we know today as the spliceosome. In addition, group II introns likely brought intron maturases with them to the nucleus. Maturases are found in most bacterial introns, where they act as highly specific splicing factors for group II introns. In the spliceosome, the core protein Prp8 shows homology to group II intron-encoded maturases. While maturases are entirely intron specific, their descendant of the spliceosomal machinery, the Prp8 protein, is an extremely versatile splicing factor with multiple interacting proteins and RNAs. How could such a general player in spliceosomal splicing evolve from the monospecific bacterial maturases? Analysis of the organellar splicing machinery in plants may give clues on the evolution of nuclear splicing. Plants encode various proteins which are closely related to bacterial maturases. The organellar genomes contain one maturase each, named MatK in chloroplasts and MatR in mitochondria. In addition, several maturase genes have been found in the nucleus as well, which are acting on mitochondrial pre-RNAs. All plant maturases show sequence deviation from their progenitor bacterial maturases, and interestingly are all acting on multiple organellar group II intron targets. Moreover, they seem to function in the splicing of group II introns together with a number of additional nuclear-encoded splicing factors, possibly acting as an organellar proto-spliceosome. Together, this makes them interesting models for the early evolution of nuclear spliceosomal splicing. In this review, we summarize recent advances in our understanding of the role of plant maturases and their accessory factors in plants. This article is part of a Special Issue entitled: Chloroplast Biogenesis. Copyright © 2015 Elsevier B.V. All rights reserved.

  14. High-throughput sequencing of the entire genomic regions of CCM1/KRIT1, CCM2 and CCM3/PDCD10 to search for pathogenic deep-intronic splice mutations in cerebral cavernous malformations.

    PubMed

    Rath, Matthias; Jenssen, Sönke E; Schwefel, Konrad; Spiegler, Stefanie; Kleimeier, Dana; Sperling, Christian; Kaderali, Lars; Felbor, Ute

    2017-09-01

    Cerebral cavernous malformations (CCM) are vascular lesions of the central nervous system that can cause headaches, seizures and hemorrhagic stroke. Disease-associated mutations have been identified in three genes: CCM1/KRIT1, CCM2 and CCM3/PDCD10. The precise proportion of deep-intronic variants in these genes and their clinical relevance is yet unknown. Here, a long-range PCR (LR-PCR) approach for target enrichment of the entire genomic regions of the three genes was combined with next generation sequencing (NGS) to screen for coding and non-coding variants. NGS detected all six CCM1/KRIT1, two CCM2 and four CCM3/PDCD10 mutations that had previously been identified by Sanger sequencing. Two of the pathogenic variants presented here are novel. Additionally, 20 stringently selected CCM index cases that had remained mutation-negative after conventional sequencing and exclusion of copy number variations were screened for deep-intronic mutations. The combination of bioinformatics filtering and transcript analyses did not reveal any deep-intronic splice mutations in these cases. Our results demonstrate that target enrichment by LR-PCR combined with NGS can be used for a comprehensive analysis of the entire genomic regions of the CCM genes in a research context. However, its clinical utility is limited as deep-intronic splice mutations in CCM1/KRIT1, CCM2 and CCM3/PDCD10 seem to be rather rare. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  15. Germline EMSY sequence alterations in hereditary breast cancer and ovarian cancer families.

    PubMed

    Määttä, Kirsi M; Nurminen, Riikka; Kankuri-Tammilehto, Minna; Kallioniemi, Anne; Laasanen, Satu-Leena; Schleutker, Johanna

    2017-07-24

    BRCA1 and BRCA2 mutations explain approximately one-fifth of the inherited susceptibility in high-risk Finnish hereditary breast and ovarian cancer (HBOC) families. EMSY is located in the breast cancer-associated chromosomal region 11q13. The EMSY gene encodes a BRCA2-interacting protein that has been implicated in DNA damage repair and genomic instability. We analysed the role of germline EMSY variation in breast/ovarian cancer predisposition. The present study describes the first EMSY screening in patients with high familial risk for this disease. Index individuals from 71 high-risk, BRCA1/2-negative HBOC families were screened for germline EMSY sequence alterations in protein coding regions and exon-intron boundaries using Sanger sequencing and TaqMan assays. The identified variants were further screened in 36 Finnish HBOC patients and 904 controls. Moreover, one novel intronic deletion was screened in a cohort of 404 breast cancer patients unselected for family history. Haplotype block structure and the association of haplotypes with breast/ovarian cancer were analysed using Haploview. The functionality of the identified variants was predicted using Haploreg, RegulomeDB, Human Splicing Finder, and Pathogenic-or-Not-Pipeline 2. Altogether, 12 germline EMSY variants were observed. Two alterations were located in the coding region, five alterations were intronic, and five alterations were located in the 3'untranslated region (UTR). Variant frequencies did not significantly differ between cases and controls. The novel variant, c.2709 + 122delT, was detected in 1 out of 107 (0.9%) breast cancer patients, and the carrier showed a bilateral form of the disease. The deletion was absent in 897 controls (OR = 25.28; P = 0.1) and in 404 breast cancer patients unselected for family history. No haplotype was identified to increase the risk of breast/ovarian cancer. Functional analyses suggested that variants, particularly in the 3'UTR, were located within regulatory elements. The novel deletion was predicted to affect splicing regulatory elements. These results suggest that the identified EMSY variants are likely neutral at the population level. However, these variants may contribute to breast/ovarian cancer risk in single families. Additional analyses are warranted for rare novel intronic deletions and the 3'UTR variants predicted to have functional roles.

  16. Remarkable interkingdom conservation of intron positions and massive, lineage-specific intron loss and gain in eukaryotic evolution.

    PubMed

    Rogozin, Igor B; Wolf, Yuri I; Sorokin, Alexander V; Mirkin, Boris G; Koonin, Eugene V

    2003-09-02

    Sequencing of eukaryotic genomes allows one to address major evolutionary problems, such as the evolution of gene structure. We compared the intron positions in 684 orthologous gene sets from 8 complete genomes of animals, plants, fungi, and protists and constructed parsimonious scenarios of evolution of the exon-intron structure for the respective genes. Approximately one-third of the introns in the malaria parasite Plasmodium falciparum are shared with at least one crown group eukaryote; this number indicates that these introns have been conserved through >1.5 billion years of evolution that separate Plasmodium from the crown group. Paradoxically, humans share many more introns with the plant Arabidopsis thaliana than with the fly or nematode. The inferred evolutionary scenario holds that the common ancestor of Plasmodium and the crown group and, especially, the common ancestor of animals, plants, and fungi had numerous introns. Most of these ancestral introns, which are retained in the genomes of vertebrates and plants, have been lost in fungi, nematodes, arthropods, and probably Plasmodium. In addition, numerous introns have been inserted into vertebrate and plant genes, whereas, in other lineages, intron gain was much less prominent.

  17. ExDom: an integrated database for comparative analysis of the exon–intron structures of protein domains in eukaryotes

    PubMed Central

    Bhasi, Ashwini; Philip, Philge; Manikandan, Vinu; Senapathy, Periannan

    2009-01-01

    We have developed ExDom, a unique database for the comparative analysis of the exon–intron structures of 96 680 protein domains from seven eukaryotic organisms (Homo sapiens, Mus musculus, Bos taurus, Rattus norvegicus, Danio rerio, Gallus gallus and Arabidopsis thaliana). ExDom provides integrated access to exon-domain data through a sophisticated web interface which has the following analytical capabilities: (i) intergenomic and intragenomic comparative analysis of exon–intron structure of domains; (ii) color-coded graphical display of the domain architecture of proteins correlated with their corresponding exon-intron structures; (iii) graphical analysis of multiple sequence alignments of amino acid and coding nucleotide sequences of homologous protein domains from seven organisms; (iv) comparative graphical display of exon distributions within the tertiary structures of protein domains; and (v) visualization of exon–intron structures of alternative transcripts of a gene correlated to variations in the domain architecture of corresponding protein isoforms. These novel analytical features are highly suited for detailed investigations on the exon–intron structure of domains and make ExDom a powerful tool for exploring several key questions concerning the function, origin and evolution of genes and proteins. ExDom database is freely accessible at: http://66.170.16.154/ExDom/. PMID:18984624

  18. Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences.

    PubMed

    Bergman, C M; Kreitman, M

    2001-08-01

    Comparative genomic approaches to gene and cis-regulatory prediction are based on the principle that differential DNA sequence conservation reflects variation in functional constraint. Using this principle, we analyze noncoding sequence conservation in Drosophila for 40 loci with known or suspected cis-regulatory function encompassing >100 kb of DNA. We estimate the fraction of noncoding DNA conserved in both intergenic and intronic regions and describe the length distribution of ungapped conserved noncoding blocks. On average, 22%-26% of noncoding sequences surveyed are conserved in Drosophila, with median block length approximately 19 bp. We show that point substitution in conserved noncoding blocks exhibits transition bias as well as lineage effects in base composition, and occurs more than an order of magnitude more frequently than insertion/deletion (indel) substitution. Overall, patterns of noncoding DNA structure and evolution differ remarkably little between intergenic and intronic conserved blocks, suggesting that the effects of transcription per se contribute minimally to the constraints operating on these sequences. The results of this study have implications for the development of alignment and prediction algorithms specific to noncoding DNA, as well as for models of cis-regulatory DNA sequence evolution.

  19. An RNAi-Enhanced Logic Circuit for Cancer Specific Detection and Destruction

    DTIC Science & Technology

    2013-02-01

    monomeric protein secreted by Corynebacterium diphtheriae, and pro-apoptotic members of Bcl-2 family: mBax (Mus musculus), hBax ( Homo sapiens ), and its...Gata3 mStaple. Intron- feature sequences – donor site, branch point, poly- pyrimidine tract, and acceptor site – were selected based on previously...sequences found in literature our intron features were chosen according SplicePort [4], an online analyzer that detects the likelihood of splicing to

  20. Cooperative action of multiple cis-acting elements is required for N-myc expression in branchial arches: specific contribution of GATA3.

    PubMed

    Potvin, Eric; Beuret, Laurent; Cadrin-Girard, Jean-François; Carter, Marcelle; Roy, Sophie; Tremblay, Michel; Charron, Jean

    2010-11-01

    The precise expression of the N-myc proto-oncogene is essential for normal mammalian development, whereas altered N-myc gene regulation is known to be a determinant factor in tumor formation. Using transgenic mouse embryos, we show that N-myc sequences from kb -8.7 to kb +7.2 are sufficient to reproduce the N-myc embryonic expression profile in developing branchial arches and limb buds. These sequences encompass several regulatory elements dispersed throughout the N-myc locus, including an upstream limb bud enhancer, a downstream somite enhancer, a branchial arch enhancer in the second intron, and a negative regulatory element in the first intron. N-myc expression in the limb buds is under the dominant control of the limb bud enhancer. The expression in the branchial arches necessitates the interplay of three regulatory domains. The branchial arch enhancer cooperates with the somite enhancer region to prevent an inhibitory activity contained in the first intron. The characterization of the branchial arch enhancer has revealed a specific role of the transcription factor GATA3 in the regulation of N-myc expression. Together, these data demonstrate that correct N-myc developmental expression is achieved via cooperation of multiple positive and negative regulatory elements.

  1. [Detection of factor VIII intron 1 inversion in severe haemophilia A].

    PubMed

    Liang, Yan; Yan, Zhen-yu; Yan, Mei; Hua, Bao-lai; Xiao, Bai; Zhao, Yong-qiang; Liu, Jing-zhong

    2009-06-01

    Screening the intron 1 inversion of factor VIII (FVIII) in the population of severe haemophilia A(HA) in China and performing carrier detection and prenatal diagnosis. Using LD-PCR to detect intron 22 inversions and multiple-PCR within two tubes to intron 1 inversions in severe HA patients. Carrier detection and prenatal diagnosis were performed in affected families. Linkage analysis and DNA sequencing were used to verify these tests. One hundred and eighteen patients were seven diagnosed as intron 22 inversions and 7 were intron 1 inversions out of 247 severe HA patients. The prevalence of the intron 1 inversion in Chinese severe haemophilia A patients was 2.8% (7/247). Six women from family A and 2 from family B were diagnosed as carriers. One fetus from family A was affected fetus. Intron 1 inversion could be detected directly by multiple-PCR within two tubes. This method made the strategy more perfective in carrier and prenatal diagnosis of haemophilia A.

  2. Splicing of a group II intron involved in the conjugative transfer of pRS01 in lactococci.

    PubMed

    Mills, D A; McKay, L L; Dunny, G M

    1996-06-01

    Analysis of a region involved in the conjugative transfer of the lactococcal conjugative element pRS01 has revealed a bacteria] group II intron. Splicing of this lactococcal intron (designated Ll.ltrB) in vivo resulted in the ligation of two exon messages (ltrBE1 and ltrBE2) which encoded a putative conjugative relaxase essential for the transfer of pRS01. Like many group II introns, the Ll.ltrB intron possessed an open reading frame (ltrA) with homology to reverse transcriptases. Remarkably, sequence analysis of ltrA suggested a greater similarity to open reading frames encoded by eukaryotic mitochondrial group II introns than to those identified to date from other bacteria. Several insertional mutations within ltrA resulted in plasmids exhibiting a conjugative transfer-deficient phenotype. These results provide the first direct evidence for splicing of a prokaryotic group II intron in vivo and suggest that conjugative transfer is a mechanism for group II intron dissemination in bacteria.

  3. Transposition of an intron in yeast mitochondria requires a protein encoded by that intron.

    PubMed

    Macreadie, I G; Scott, R M; Zinn, A R; Butow, R A

    1985-06-01

    The optional 1143 bp intron in the yeast mitochondrial 21S rRNA gene (omega +) is nearly quantitatively inserted in genetic crosses into 21S rRNA alleles that lack it (omega -). The intron contains an open reading frame that can encode a protein of 235 amino acids, but no function has been ascribed to this sequence. We previously found an in vivo double-strand break in omega - DNA at or close to the intron insertion site only in zygotes of omega + X omega - crosses that appears with the same kinetics as intron insertion. We now show that mutations in the intron open reading frame that would alter the translation product simultaneously inhibit nonreciprocal omega recombination and the in vivo double-strand break in omega - DNA. These results provide evidence that the open reading frame encodes a protein required for intron transposition and support the role of the double-strand break in the process.

  4. Intron open reading frames as mobile elements and evolution of a group I intron.

    PubMed

    Sellem, C H; Belcour, L

    1997-05-01

    Group I introns are proposed to have become mobile following the acquisition of open reading frames (ORFs) that encode highly specific DNA endonucleases. This proposal implies that intron ORFs could behave as autonomously mobile entities. This was supported by abundant circumstantial evidence but no experiment of ORF transfer from an ORF-containing intron to its ORF-less counterpart has been described. In this paper we present such experiments, which demonstrate the efficient mobility of the mitochondrial nad1-i4-orf1 between two Podospora strains. The homing of this mobile ORF was accompanied by a bidirectional co-conversion that did not systematically involve the whole intron sequence. Orf1 acquisition would be the most recent step in the evolution of the nad1-i4 intron, which has resulted in many strains of Podospora having an intron with two ORFs (biorfic) and four splicing pathways. We show that two of the splicing events that operate in this biorfic intron, as evidenced by PCR experiments, are generated by a 5'-alternative splice site, which is most probably a remnant of the monoorfic ancestral form of the intron. We propose a sequential evolution model that is consistent with the four organizations of the corresponding nad1 locus that we found among various species of the Pyrenomycete family; these organizations consist of no intron, an intron alone, a monoorfic intron, and a biorfic intron.

  5. Recurrent Loss of Specific Introns during Angiosperm Evolution

    PubMed Central

    Wang, Hao; Devos, Katrien M.; Bennetzen, Jeffrey L.

    2014-01-01

    Numerous instances of presence/absence variations for introns have been documented in eukaryotes, and some cases of recurrent loss of the same intron have been suggested. However, there has been no comprehensive or phylogenetically deep analysis of recurrent intron loss. Of 883 cases of intron presence/absence variation that we detected in five sequenced grass genomes, 93 were confirmed as recurrent losses and the rest could be explained by single losses (652) or single gains (118). No case of recurrent intron gain was observed. Deep phylogenetic analysis often indicated that apparent intron gains were actually numerous independent losses of the same intron. Recurrent loss exhibited extreme non-randomness, in that some introns were removed independently in many lineages. The two larger genomes, maize and sorghum, were found to have a higher rate of both recurrent loss and overall loss and/or gain than foxtail millet, rice or Brachypodium. Adjacent introns and small introns were found to be preferentially lost. Intron loss genes exhibited a high frequency of germ line or early embryogenesis expression. In addition, flanking exon A+T-richness and intron TG/CG ratios were higher in retained introns. This last result suggests that epigenetic status, as evidenced by a loss of methylated CG dinucleotides, may play a role in the process of intron loss. This study provides the first comprehensive analysis of recurrent intron loss, makes a series of novel findings on the patterns of recurrent intron loss during the evolution of the grass family, and provides insight into the molecular mechanism(s) underlying intron loss. PMID:25474210

  6. Short intronic repeat sequences facilitate circular RNA production

    PubMed Central

    Liang, Dongming

    2014-01-01

    Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217

  7. Isolation and Identification of Gene-Specific MicroRNAs.

    PubMed

    Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao

    2018-01-01

    Computer programming has identified hundreds of genomic hairpin sequences, many with functions yet to be determined. Because transfection of hairpin-like microRNA precursors (pre-miRNAs) into mammalian cells is not always sufficient to trigger RNA-induced gene silencing complex (RISC) assembly, a key step for inducing RNA interference (RNAi)-related gene silencing, we have developed an intronic miRNA expression system to overcome this problem by inserting a hairpin-like pre-miRNA structure into the intron region of a gene, and hence successfully increase the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. This intronic miRNA biogenesis mechanism has been found to depend on a coupled interaction of nascent messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA so obtained is transcribed by type-II RNA polymerases, coexpressed within a primary gene transcript, and then excised out of the gene transcript by intracellular RNA splicing and processing machineries. After that, ribonuclease III (RNaseIII) endonucleases further process the spliced introns into mature miRNAs. Using this intronic miRNA expression system, we have shown for the first time that the intron-derived miRNAs are able to elicit strong RNAi effects in not only human and mouse cells in vitro but also in zebrafishes, chicken embryos, and adult mice in vivo. We have also developed a miRNA isolation protocol, based on the complementarity between the designed miRNA and its targeted gene sequence, to purify and identify the mature miRNAs generated. As a result, several intronic miRNA identities and structures have been confirmed. According to this proof-of-principle methodology, we now have full knowledge to design various intronic pre-miRNA inserts that are more efficient and effective for inducing specific gene silencing effects in vitro and in vivo.

  8. Distinguishing between weedy Amaranthus species based on intron one sequences from the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS)gene

    USDA-ARS?s Scientific Manuscript database

    Hybridization between Amaranthus species and the potential for herbicide resistance to be transferred by hybridization are of growing concern in the weed science community. It is important to confirm suspect hybrid populations early to develop an effective control strategy. With this in mind, a PC...

  9. Geographical origin of Leucobryum boninense Sull. & Lesq. (Leucobryaceae, Musci) endemic to the Bonin Islands, Japan

    PubMed Central

    Oguri, Emiko; Yamaguchi, Tomio; Tsubota, Hiromi; Deguchi, Hironori; Murakami, Noriaki

    2013-01-01

    Leucobryum boninense is endemic to the Bonin Islands, Japan, and its related species are widely distributed in Asia and the Pacific. We aimed to clarify the phylogenetic relationships among Leucobryum species and infer the origin of L. boninense. We also describe the utility of the chloroplast trnK intron including matK for resolving the phylogenetic relationships among Leucobryum species, as phylogenetic analyses using trnK intron and/or matK have not been performed well in bryophytes to date. Fifty samples containing 15 species of Leucobryum from Asia and the Pacific were examined for six chloroplast DNA regions including rbcL, rps4, partial 5′ trnK intron, matK, partial 3′ trnK intron, and trnL-F intergenic spacer plus one nuclear DNA region including ITS. A molecular phylogenetic tree showed that L. boninense made a clade with L. scabrum from Japan, Taiwan and, Hong Kong; L. javense which is widely distributed in East and Southeast Asia, and L. pachyphyllum and L. seemannii restricted to the Hawaii Islands, as well as with L. scaberulum from the Ryukyus, Japan, Taiwan, and southeastern China. Leucobryum boninense from various islands of the Bonin Islands made a monophylic group that was closely related to L. scabrum and L. javense from Japan. Therefore, L. boninense may have evolved from L. scabrum from Japan, Taiwan, or Hong Kong, or L. javense from Japan. We also described the utility of trnK intron including matK. A percentage of the parsimony-informative characters in trnK intron sequence data (5.8%) was significantly higher than that from other chloroplast regions, rbcL (2.4%) and rps4 (3.2%) sequence data. Nucleotide sequence data of the trnK intron including matK are more informative than other chloroplast DNA regions for identifying the phylogenetic relationships among Leucobryum species. PMID:23610621

  10. Geographical origin of Leucobryum boninense Sull. & Lesq. (Leucobryaceae, Musci) endemic to the Bonin Islands, Japan.

    PubMed

    Oguri, Emiko; Yamaguchi, Tomio; Tsubota, Hiromi; Deguchi, Hironori; Murakami, Noriaki

    2013-04-01

    Leucobryum boninense is endemic to the Bonin Islands, Japan, and its related species are widely distributed in Asia and the Pacific. We aimed to clarify the phylogenetic relationships among Leucobryum species and infer the origin of L. boninense. We also describe the utility of the chloroplast trnK intron including matK for resolving the phylogenetic relationships among Leucobryum species, as phylogenetic analyses using trnK intron and/or matK have not been performed well in bryophytes to date. Fifty samples containing 15 species of Leucobryum from Asia and the Pacific were examined for six chloroplast DNA regions including rbcL, rps4, partial 5' trnK intron, matK, partial 3' trnK intron, and trnL-F intergenic spacer plus one nuclear DNA region including ITS. A molecular phylogenetic tree showed that L. boninense made a clade with L. scabrum from Japan, Taiwan and, Hong Kong; L. javense which is widely distributed in East and Southeast Asia, and L. pachyphyllum and L. seemannii restricted to the Hawaii Islands, as well as with L. scaberulum from the Ryukyus, Japan, Taiwan, and southeastern China. Leucobryum boninense from various islands of the Bonin Islands made a monophylic group that was closely related to L. scabrum and L. javense from Japan. Therefore, L. boninense may have evolved from L. scabrum from Japan, Taiwan, or Hong Kong, or L. javense from Japan. We also described the utility of trnK intron including matK. A percentage of the parsimony-informative characters in trnK intron sequence data (5.8%) was significantly higher than that from other chloroplast regions, rbcL (2.4%) and rps4 (3.2%) sequence data. Nucleotide sequence data of the trnK intron including matK are more informative than other chloroplast DNA regions for identifying the phylogenetic relationships among Leucobryum species.

  11. [Frequency of intron 1 inversion of factor VIII gene in Chinese hemophilia A patients with case report of a female patient with heterozygous intron 1 inversion].

    PubMed

    Yan, Zhen-yu; Liang, Yan; Yan, Mei; Fan, Lian-kai; Xiao, Bai; Hua, Bao-lai; Liu, Jing-zhong; Zhao, Yong-qiang

    2008-10-21

    To investigate the frequency of intron 1 inversion (inv1) in FVIII gene in Chinese hemophilia A (HA) patients and to investigate the mechanism of pathogenesis. Peripheral blood samples were collected from 158 unrelated HA patients, aged 20 (1 - 73), including one female HA patient, aged 5, and several family members of a patient positive in inv1. One-stage method was used to assay the FVIII activity (FVIII:C). Long distance PCR and multiple PCR in duplex reactions were used to screen for the intron 22 inversion (inv22) and inv1 of the FVIII coding gene (F8). The F8 coding sequence was amplified with PCR and sequenced with an automatic sequencer. Two unrelated patients (pedigrees) were detected as inv1 positive with a positive rate of 1.26%. A rare female HA patient with inv1 was also discovered in a positive family (3 HA cases were found in this family and regarded as one case in calculating the total detection rate). The full length of FVIII was sequenced, and no other mutation was detected. There frequency of FVIII inv1 is low in Chinese HA patients compared with other populations. Female HA patients are heterozygous for FVIII inv1 and that may be resulted from nonrandom inactivation of X chromosome.

  12. Molecular Phylogenetic Analysis of Archaeal Intron-Containing Genes Coding for rRNA Obtained from a Deep-Subsurface Geothermal Water Pool

    PubMed Central

    Takai, Ken; Horikoshi, Koki

    1999-01-01

    Molecular phylogenetic analysis of a naturally occurring microbial community in a deep-subsurface geothermal environment indicated that the phylogenetic diversity of the microbial population in the environment was extremely limited and that only hyperthermophilic archaeal members closely related to Pyrobaculum were present. All archaeal ribosomal DNA sequences contained intron-like sequences, some of which had open reading frames with repeated homing-endonuclease motifs. The sequence similarity analysis and the phylogenetic analysis of these homing endonucleases suggested the possible phylogenetic relationship among archaeal rRNA-encoded homing endonucleases. PMID:10584021

  13. New encoded single-indicator sequences based on physico-chemical parameters for efficient exon identification.

    PubMed

    Meher, J K; Meher, P K; Dash, G N; Raval, M K

    2012-01-01

    The first step in gene identification problem based on genomic signal processing is to convert character strings into numerical sequences. These numerical sequences are then analysed spectrally or using digital filtering techniques for the period-3 peaks, which are present in exons (coding areas) and absent in introns (non-coding areas). In this paper, we have shown that single-indicator sequences can be generated by encoding schemes based on physico-chemical properties. Two new methods are proposed for generating single-indicator sequences based on hydration energy and dipole moments. The proposed methods produce high peak at exon locations and effectively suppress false exons (intron regions having greater peak than exon regions) resulting in high discriminating factor, sensitivity and specificity.

  14. Developing a set of strong intronic promoters for robust metabolic engineering in oleaginous Rhodotorula (Rhodosporidium) yeast species.

    PubMed

    Liu, Yanbin; Yap, Sihui Amy; Koh, Chong Mei John; Ji, Lianghui

    2016-11-25

    Red yeast species in the Rhodotorula/Rhodosporidium genus are outstanding producers of triacylglyceride and cell biomass. Metabolic engineering is expected to further enhance the productivity and versatility of these hosts for the production of biobased chemicals and fuels. Promoters with strong activity during oil-accumulation stage are critical tools for metabolic engineering of these oleaginous yeasts. The upstream DNA sequences of 6 genes involved in lipid biosynthesis or accumulation in Rhodotorula toruloides were studied by luciferase reporter assay. The promoter of perilipin/lipid droplet protein 1 gene (LDP1) displayed much stronger activity (4-11 folds) than that of glyceraldehyde-3-phosphate dehydrogenase gene (GPD1), one of the strongest promoters known in yeasts. Depending on the stage of cultivation, promoter of acetyl-CoA carboxylase gene (ACC1) and fatty acid synthase β subunit gene (FAS1) exhibited intermediate strength, displaying 50-160 and 20-90% levels of GPD1 promoter, respectively. Interestingly, introns significantly modulated promoter strength at high frequency. The incorporation of intron 1 and 2 of LDP1 (LDP1in promoter) enhanced its promoter activity by 1.6-3.0 folds. Similarly, the strength of ACC1 promoter was enhanced by 1.5-3.2 folds if containing intron 1. The intron 1 sequences of ACL1 and FAS1 also played significant regulatory roles. When driven by the intronic promoters of ACC1 and LDP1 (ACC1in and LDP1in promoter, respectively), the reporter gene expression were up-regulated by nitrogen starvation, independent of de novo oil biosynthesis and accumulation. As a proof of principle, overexpression of the endogenous acyl-CoA-dependent diacylglycerol acyltransferase 1 gene (DGA1) by LDP1in promoter was significantly more efficient than GPD1 promoter in enhancing lipid accumulation. Intronic sequences play an important role in regulating gene expression in R. toruloides. Three intronic promoters, LDP1in, ACC1in and FAS1in, are excellent promoters for metabolic engineering in the oleaginous and carotenogenic yeast, R. toruloides.

  15. Ferritin gene organization: differences between plants and animals suggest possible kingdom-specific selective constraints.

    PubMed

    Proudhon, D; Wei, J; Briat, J; Theil, E C

    1996-03-01

    Ferritin, a protein widespread in nature, concentrates iron approximately 10(11)-10(12)-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. In plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may exist to maintain a particular intron/exon pattern within ferritin genes. In the case of plants, where ferritin gene intron placement is unrelated to triplet codons or protein structure, and where ferritin is targeted to the plastid, the selection pressure on gene organization may relate to RNA function and plastid/nuclear signaling.

  16. Comparative analysis of the 5{prime} genomic and promoter regions between the mouse (Hdh) and human Huntington disease (HD) gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kalchman, M.; Lin, B.; Nasir, J.

    1994-09-01

    The mouse homologue of the Huntington disease gene (Hdh) has recently been cloned and mapped to a region of synteny with the human, on mouse chromosome 5. The two genes share a high degree of both coding (90% amino acid) and nucleotide (86.2%) identity. We have subsequently performed a detailed comparison of the genomic organization of the 5{prime} region of the two genes encompassing the promoter region and first five exons of both the human and mouse genes. The comparative sequence analysis of the promoter region between HD and Hdh reveals two highly conserved regions. One region (-56 to -118)more » (+1 is the ATG start codon), shared 84% nucleotide identity and another region (-130 to -206) had 81% nucleotide identity. Nine putative Sp1 sites appear in the human promoter region contrasted with only 3 in a similar region in the mouse. Furthermore, 17 and 20 base pair direct repeats present in the HD 5{prime} region are absent in the similar Hdh region. Although both the mouse and human intron/exon boundaries conform to the GT/AG rule, the intron sizes between HD and Hdh are markedly different. The first four introns in Hdh are 15, 7, 5 and 0.5 kb compared to sizes of 10, 15, 7 and 0.5 kb, respectively. Comparison between the mouse and human intronic sequences immediately adjacent to the first five exons (excluding exon 1) reveals only about 46 to 50% identity within the first 60 bp of intronic sequence. Furthermore, we have identified novel polymorphic di-, tri- and tetra-nucleotide repeats in Hdh introns of various mouse strains that are not present in the human. For example, polymorphic CT repeats are present in introns 2 and 4 of Hdh and a novel mouse 56 AAG trinucleotide repeat (interrupted by an AAGG) is also located within intron 2. This information concerning the promoter and genomic organization of both HD and Hdh is critical for designing appropriate gene targetting vectors for studying the normal function of the HD and Hdh genes in model systems.« less

  17. Characterization of a marsupial sperm protamine gene and its transcripts from the North American opossum (Didelphis marsupialis).

    PubMed

    Winkfein, R J; Nishikawa, S; Connor, W; Dixon, G H

    1993-07-01

    A synthetic oligonucleotide primer, designed from marsupial protamine protein-sequence data [Balhorn, R., Corzett, M., Matrimas, J. A., Cummins, J. & Faden, B. (1989) Analysis of protamines isolated from two marsupials, the ring-tailed wallaby and gray short-tailed opossum, J. Cell. Biol. 107] was used to amplify, via the polymerase chain reaction, protamine sequences from a North American opossum (Didelphis marsupialis) cDNA. Using the amplified sequences as probes, several protamine cDNA clones were isolated. The protein sequence, predicted from the cDNA sequences, consisted of 57 amino acids, contained a large number of arginine residues and exhibited the sequence ARYR at its amino terminus, which is conserved in avian and most eutherian mammal protamines. Like the true protamines of trout and chicken, the opossum protamine lacked cysteine residues, distinguishing it from placental mammalian protamine 1 (P1 or stable) protamines. Examination of the protamine gene, isolated by polymerase-chain-reaction amplification of genomic DNA, revealed the presence of an intron dividing the protamine-coding region, a common characteristic of all mammalian P1 genes. In addition, extensive sequence identity in the 5' and 3' flanking regions between mouse and opossum sequences classify the marsupial protamine as being closely related to placental mammal P1. Protamine transcripts, in both birds and mammals, are present in two size classes, differing by the length of their poly(A) tails (either short or long). Examination of opossum protamine transcripts by Northern hybridization revealed four distinct mRNA species in the total RNA fraction, two of which were enriched in the poly(A)-rich fraction. Northern-blot analysis, using an intron-specific probe, revealed the presence of intron sequences in two of the four protamine transcripts. If expressed, the corresponding protein from intron-containing transcripts would differ from spliced transcripts by length (49 versus 57 amino acids) and would contain a cysteine residue.

  18. Ovarian Tumors related to Intronic Mutations in DICER1: A Report from the International Ovarian and Testicular Stromal Tumor Registry

    PubMed Central

    Schultz, Kris Ann; Harris, Anne; Messinger, Yoav; Sencer, Susan; Baldinger, Shari; Dehner, Louis P.; Hill, D. Ashley

    2015-01-01

    Germline DICER1 mutations have been described in individuals with pleuropulmonary blastoma (PPB), ovarian Sertoli-Leydig cell tumor (SLCT), sarcomas, multinodular goiter, thyroid carcinoma, cystic nephroma and other neoplastic conditions. Early results from the International Ovarian and Testicular Stromal Tumor Registry show germline DICER1 mutations in 48% of girls and women with SLCT. In this report, a young woman presented with ovarian undifferentiated sarcoma. Four years later, she presented with SLCT. She was successfully treated for both malignancies. Sequence results showed a germline intronic mutation in DICER1. This mutation results in an exact duplication of the six bases at the splice site at the intron 23 and exon 24 junction. Predicted improper splicing leads to inclusion of 10 bases of intronic sequence, frameshift and premature truncation of the protein disrupting the RNase IIIb domain. A second individual with SLCT was found to have an identical germline mutation. In each of the ovarian tumors, an additional somatic mutation in the RNase IIIb domain of DICER1 was found. In rare patients, germline intronic mutations in DICER1 that are predicted to cause incorrect splicing can also contribute to the pathogenesis of SLCT. PMID:26289771

  19. Genetic characterization of Common Eiders breeding in the Yukon-Kuskokwim Delta, Alaska

    USGS Publications Warehouse

    Sonsthagen, Sarah A.; Talbot, Sandra L.; McCracken, Kevin G.

    2007-01-01

    We assessed population genetic subdivision among four colonies of Common Eiders (Somateria mollissima v-nigrum) breeding in the Yukon-Kuskokwim Delta (YKD), Alaska, using microsatellite genotypes and DNA sequences with differing modes of inheritance. Significant, albeit low, levels of genetic differentiation were observed between mainland populations and Kigigak Island for nuclear intron lamin A and mitochondrial DNA (mtDNA) control region. Intercolony variation in haplotypic frequencies also was observed at mtDNA. Positive growth signatures assayed from microsatellites, nuclear introns, and mtDNA indicate recent colonization of the YKD, and may explain the low levels of structuring observed. Gene flow estimates based on microsatellites, nuclear introns, and mtDNA suggest asymmetrical gene flow between mainland colonies and Kigigak Island, with more individuals on average dispersing from mainland populations to Kigigak Island than vice versa. The directionality of gene flow observed may be explained by the colonization of the YKD from northern glacial refugia or by YKD metapopulation dynamics.

  20. The chloroplast tRNALys(UUU) gene from mustard (Sinapis alba) contains a class II intron potentially coding for a maturase-related polypeptide.

    PubMed

    Neuhaus, H; Link, G

    1987-01-01

    The trnK gene endocing the tRNALys(UUU) has been located on mustard (Sinapis alba) chloroplast DNA, 263 bp upstream of the psbA gene on the same strand. The nucleotide sequence of the trnK gene and its flanking regions as well as the putative transcription start and termination sites are shown. The 5' end of the transcript lies 121 bp upstream of the 5' tRNA coding region and is preceded by procaryotic-type "-10" and "-35" sequence elements, while the 3' end maps 2.77 kb downstream to a DNA region with possible stemloop secondary structure. The anticodon loop of the tRNALys is interrupted by a 2,574 bp intron containing a long open reading frame, which codes for 524 amino acids. Based on conserved stem and loop structures, this intron has characteristic features of a class II intron. A region near the carboxyl terminus of the derived polypeptide appears structurally related to maturases.

  1. RNA editing in the anticodon of tRNA Leu (CAA) occurs before group I intron splicing in plastids of a moss Takakia lepidozioides S. Hatt. & Inoue.

    PubMed

    Miyata, Y; Sugita, C; Maruyama, K; Sugita, M

    2008-03-01

    RNA editing of cytidine (C) to uridine (U) transitions occurs in plastids and mitochondria of most land plants. In this study, we amplified and sequenced the group I intron-containing tRNA Leu gene, trnL-CAA, from Takakia lepidozioides, a moss. DNA sequence analysis revealed that the T. lepidozioides tRNA Leu gene consisted of a 35-bp 5' exon, a 469-bp group I intron and a 50-bp 3' exon. The intron was inserted between the first and second position of the tRNA Leu anticodon. In general, plastid tRNA Leu genes with a group I intron code for a TAA anticodon in most land plants. This strongly suggests that the first nucleotide of the CAA anticodon could be edited in T. lepidozioides plastids. To investigate this possibility, we analysed cDNAs derived from the trnL-CAA transcripts. We demonstrated that the first nucleotide C of the anticodon was edited to create a canonical UAA anticodon in T. lepidozioides plastids. cDNA sequencing analyses of the spliced or unspliced tRNA Leu transcripts revealed that, while the spliced tRNA was completely edited, editing in the unspliced tRNAs were only partial. This is the first experimental evidence that the anticodon editing of tRNA occurs before RNA splicing in plastids. This suggests that this editing is a prerequisite to splicing of pre-tRNA Leu.

  2. Evaluation of non-coding variation in GLUT1 deficiency.

    PubMed

    Liu, Yu-Chi; Lee, Jia Wei Audrey; Bellows, Susannah T; Damiano, John A; Mullen, Saul A; Berkovic, Samuel F; Bahlo, Melanie; Scheffer, Ingrid E; Hildebrand, Michael S

    2016-12-01

    Loss-of-function mutations in SLC2A1, encoding glucose transporter-1 (GLUT-1), lead to dysfunction of glucose transport across the blood-brain barrier. Ten percent of cases with hypoglycorrhachia (fasting cerebrospinal fluid [CSF] glucose <2.2mmol/L) do not have mutations. We hypothesized that GLUT1 deficiency could be due to non-coding SLC2A1 variants. We performed whole exome sequencing of one proband with a GLUT1 phenotype and hypoglycorrhachia negative for SLC2A1 sequencing and copy number variants. We studied a further 55 patients with different epilepsies and low CSF glucose who did not have exonic mutations or copy number variants. We sequenced non-coding promoter and intronic regions. We performed mRNA studies for the recurrent intronic variant. The proband had a de novo splice site mutation five base pairs from the intron-exon boundary. Three of 55 patients had deep intronic SLC2A1 variants, including a recurrent variant in two. The recurrent variant produced less SLC2A1 mRNA transcript. Fasting CSF glucose levels show an age-dependent correlation, which makes the definition of hypoglycorrhachia challenging. Low CSF glucose levels may be associated with pathogenic SLC2A1 mutations including deep intronic SLC2A1 variants. Extending genetic screening to non-coding regions will enable diagnosis of more patients with GLUT1 deficiency, allowing implementation of the ketogenic diet to improve outcomes. © 2016 Mac Keith Press.

  3. Mobile Bacterial Group II Introns at the Crux of Eukaryotic Evolution

    PubMed Central

    Lambowitz, Alan M.; Belfort, Marlene

    2015-01-01

    SUMMARY This review focuses on recent developments in our understanding of group II intron function, the relationships of these introns to retrotransposons and spliceosomes, and how their common features have informed thinking about bacterial group II introns as key elements in eukaryotic evolution. Reverse transcriptase-mediated and host factor-aided intron retrohoming pathways are considered along with retrotransposition mechanisms to novel sites in bacteria, where group II introns are thought to have originated. DNA target recognition and movement by target-primed reverse transcription infer an evolutionary relationship among group II introns, non-LTR retrotransposons, such as LINE elements, and telomerase. Additionally, group II introns are almost certainly the progenitors of spliceosomal introns. Their profound similarities include splicing chemistry extending to RNA catalysis, reaction stereochemistry, and the position of two divalent metals that perform catalysis at the RNA active site. There are also sequence and structural similarities between group II introns and the spliceosome’s small nuclear RNAs (snRNAs) and between a highly conserved core spliceosomal protein Prp8 and a group II intron-like reverse transcriptase. It has been proposed that group II introns entered eukaryotes during bacterial endosymbiosis or bacterial-archaeal fusion, proliferated within the nuclear genome, necessitating evolution of the nuclear envelope, and fragmented giving rise to spliceosomal introns. Thus, these bacterial self-splicing mobile elements have fundamentally impacted the composition of extant eukaryotic genomes, including the human genome, most of which is derived from close relatives of mobile group II introns. PMID:25878921

  4. Rapid Construction of Stable Infectious Full-Length cDNA Clone of Papaya Leaf Distortion Mosaic Virus Using In-Fusion Cloning

    PubMed Central

    Tuo, Decai; Shen, Wentao; Yan, Pu; Li, Xiaoying; Zhou, Peng

    2015-01-01

    Papaya leaf distortion mosaic virus (PLDMV) is becoming a threat to papaya and transgenic papaya resistant to the related pathogen, papaya ringspot virus (PRSV). The generation of infectious viral clones is an essential step for reverse-genetics studies of viral gene function and cross-protection. In this study, a sequence- and ligation-independent cloning system, the In-Fusion® Cloning Kit (Clontech, Mountain View, CA, USA), was used to construct intron-less or intron-containing full-length cDNA clones of the isolate PLDMV-DF, with the simultaneous scarless assembly of multiple viral and intron fragments into a plasmid vector in a single reaction. The intron-containing full-length cDNA clone of PLDMV-DF was stably propagated in Escherichia coli. In vitro intron-containing transcripts were processed and spliced into biologically active intron-less transcripts following mechanical inoculation and then initiated systemic infections in Carica papaya L. seedlings, which developed similar symptoms to those caused by the wild-type virus. However, no infectivity was detected when the plants were inoculated with RNA transcripts from the intron-less construct because the instability of the viral cDNA clone in bacterial cells caused a non-sense or deletion mutation of the genomic sequence of PLDMV-DF. To our knowledge, this is the first report of the construction of an infectious full-length cDNA clone of PLDMV and the splicing of intron-containing transcripts following mechanical inoculation. In-Fusion cloning shortens the construction time from months to days. Therefore, it is a faster, more flexible, and more efficient method than the traditional multistep restriction enzyme-mediated subcloning procedure. PMID:26633465

  5. Rapid Construction of Stable Infectious Full-Length cDNA Clone of Papaya Leaf Distortion Mosaic Virus Using In-Fusion Cloning.

    PubMed

    Tuo, Decai; Shen, Wentao; Yan, Pu; Li, Xiaoying; Zhou, Peng

    2015-12-01

    Papaya leaf distortion mosaic virus (PLDMV) is becoming a threat to papaya and transgenic papaya resistant to the related pathogen, papaya ringspot virus (PRSV). The generation of infectious viral clones is an essential step for reverse-genetics studies of viral gene function and cross-protection. In this study, a sequence- and ligation-independent cloning system, the In-Fusion(®) Cloning Kit (Clontech, Mountain View, CA, USA), was used to construct intron-less or intron-containing full-length cDNA clones of the isolate PLDMV-DF, with the simultaneous scarless assembly of multiple viral and intron fragments into a plasmid vector in a single reaction. The intron-containing full-length cDNA clone of PLDMV-DF was stably propagated in Escherichia coli. In vitro intron-containing transcripts were processed and spliced into biologically active intron-less transcripts following mechanical inoculation and then initiated systemic infections in Carica papaya L. seedlings, which developed similar symptoms to those caused by the wild-type virus. However, no infectivity was detected when the plants were inoculated with RNA transcripts from the intron-less construct because the instability of the viral cDNA clone in bacterial cells caused a non-sense or deletion mutation of the genomic sequence of PLDMV-DF. To our knowledge, this is the first report of the construction of an infectious full-length cDNA clone of PLDMV and the splicing of intron-containing transcripts following mechanical inoculation. In-Fusion cloning shortens the construction time from months to days. Therefore, it is a faster, more flexible, and more efficient method than the traditional multistep restriction enzyme-mediated subcloning procedure.

  6. Frequencies of VNTR and RFLP polymorphisms associated with factor VIII gene in Singapore

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fong, I.; Lai, P.S.; Ouah, T.C.

    1994-09-01

    The allelic frequency of any polymorphism within a population determines its usefulness for genetic counselling. This is important in populations of non-Caucasian origin as RFLPs may significantly differ among ethnic groups. We report a study of five intragenic polymorphisms in factor VIII gene carried out in Singapore. The three PCR-based RFLP markers studied were Intron 18/Bcl I, Intron 19/Hind III and Intron 22/Xba I. In an analysis of 148 unrelated normal X chromosomes, the allele frequencies were found to be A1 = 0.18, A2 = 0.82 (Bcl I RFLP), A1 = 0.80, A2 = 0.20 (Hind III RFLP) and A1more » = 0.58, and A2 = 0.42 (Xba I RFLP). The heterozygosity rates of 74 females analyzed separately were 31%, 32% and 84.2%, respectively. Linkage disequilibrium was also observed to some degree between Bcl I and Hind III polymorphism in our population. We have also analyzed a sequence polymorphism in Intron 7 using hybridization with radioactive-labelled {sup 32}P allele-specific oligonucleotide probes. This polymorphism was not very polymorphic in our population with only 2% of 117 individuals analyzed being informative. However, the use of a hypervariable dinucleotide repeat sequence (VNTR) in Intron 13 showed that 25 of our of 27 (93%) females were heterozygous. Allele frequencies ranged from 1 to 55 %. We conclude that a viable strategy for molecular analysis of Hemophilia A families in our population should include the use of Intron 18/Bcl I and Intron 22/Xba I RFLP markers and the Intron 13 VNTR marker.« less

  7. MAPT as a predisposing gene for sporadic amyotrophic lateral sclerosis in the Chinese Han population

    PubMed Central

    Fang, Pu; Xu, Wenyuan; Wu, Chengsi; Zhu, Min; Li, Xiaobing; Hong, Daojun

    2013-01-01

    A previous study of European Caucasian patients with sporadic amyotrophic lateral sclerosis demonstrated that a polymorphism in the microtubule-associated protein Tau (MAPT) gene was significantly associated with sporadic amyotrophic lateral sclerosis pathogenesis. Here, we tested this association in 107 sporadic amyotrophic lateral sclerosis patients and 100 healthy controls from the Chinese Han population. We screened the mutation-susceptible regions of MAPT – the 3' and 5' untranslated regions as well as introns 9, 10, 11, and 12 – by direct sequencing, and identified 33 genetic variations. Two of these, 105788 A > G in intron 9 and 123972 T > A in intron 11, were not present in the control group. The age of onset in patients with the 105788 A > G and/or the 123972 T > A variant was younger than that in patients without either genetic variation. Moreover, the pa-tients with a genetic variation were more prone to bulbar palsy and breathing difficulties than those with the wild-type genotype. This led to a shorter survival period in patients with a MAPT genetic variant. Our study suggests that the MAPT gene is a potential risk gene for sporadic amyotrophic lateral sclerosis in the Chinese Han population. PMID:25206632

  8. Insights into the strategies used by related group II introns to adapt successfully for the colonisation of a bacterial genome

    PubMed Central

    Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco

    2014-01-01

    Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes. PMID:25482895

  9. Insights into the strategies used by related group II introns to adapt successfully for the colonisation of a bacterial genome.

    PubMed

    Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco

    2014-01-01

    Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes.

  10. The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture.

    PubMed

    Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen; Burge, Christopher B

    2017-12-27

    Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning ('intron definition') or exon-spanning ('exon definition') pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila , using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60-70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.

  11. BIALLELIC POLYMORPHISM IN THE INTRON REGION OF B-TUBULIN GENE OF CRYPTOSPORIDIUM PARASITES

    EPA Science Inventory

    Nucleotide sequencing of polymerase chain reaction-amplified intron region of the Cryptosporidium parvum B-tubulin gene in 26 human and 15 animal isolates revealed distinct genetic polymorphism between the human and bovine genotypes. The separation of 2 genotypes of C. parvum is...

  12. Group I introns are inherited through common ancestry in the nuclear-encoded rRNA of Zygnematales (Charophyceae).

    PubMed Central

    Bhattacharya, D; Surek, B; Rüsing, M; Damberger, S; Melkonian, M

    1994-01-01

    Group I introns are found in organellar genomes, in the genomes of eubacteria and phages, and in nuclear-encoded rRNAs. The origin and distribution of nuclear-encoded rRNA group I introns are not understood. To elucidate their evolutionary relationships, we analyzed diverse nuclear-encoded small-subunit rRNA group I introns including nine sequences from the green-algal order Zygnematales (Charophyceae). Phylogenetic analyses of group I introns and rRNA coding regions suggest that lateral transfers have occurred in the evolutionary history of group I introns and that, after transfer, some of these elements may form stable components of the host-cell nuclear genomes. The Zygnematales introns, which share a common insertion site (position 1506 relative to the Escherichia coli small-subunit rRNA), form one subfamily of group I introns that has, after its origin, been inherited through common ancestry. Since the first Zygnematales appear in the middle Devonian within the fossil record, the "1506" group I intron presumably has been a stable component of the Zygnematales small-subunit rRNA coding region for 350-400 million years. PMID:7937917

  13. Efficiency of introns from various origins in fish cells.

    PubMed

    Bétancourt, O H; Attal, J; Théron, M C; Puissant, C; Houdebine, L M

    1993-06-01

    Several vectors containing (1) regulatory regions from Rous sarcoma virus (RSV), human cytomegalovirus (CMV), and herpes simplex thymidine kinase (TK); (2) introns from early or late SV40 genes and from trout growth hormone gene (tGH); (3) chloramphenicol acetyltransferase gene (CAT); and (4) transcription terminators from SV40 were transfected into carp EPC cells, salmon CHSE cells, tilapia TO2 cells, quail QT6 cells, and hamster CHO cells. CAT activity was measured in extracts from several cell lines 3 days after transfection and in the fish EPC stable clones. The CMV and RSV promoters were the most potent in all cell types. The intron from late SV40 genes (VP1 intron) worked properly in QT6 and CHO cells but not in EPC and very weakly in TO2 cells. The tGH intron was efficient in all cell types but preferentially in fish cells. The small t intron from SV40 was processed in all cell types. The small t and, to a lesser extent, the tGH introns amplified expression of cat gene in stable clones, in comparison to the transiently transfected cells. These results indicate that elements from mammalian genes may not be properly recognized by the fish cellular machinery and in an unpredictable manner. This finding suggests that vectors prepared to express foreign genes in transfected cultured fish cells and transgenic fish should preferably contain DNA sequences from fish genes or, alternatively, those sequences from mammalian genes that have been previously proved to be compatible with the fish cellular machinery.

  14. Chloroplast genome expansion by intron multiplication in the basal psychrophilic euglenoid Eutreptiella pomquetensis

    PubMed Central

    Bennett, Matthew S.; Triemer, Richard E.; Preisfeld, Angelika

    2017-01-01

    Background Over the last few years multiple studies have been published showing a great diversity in size of chloroplast genomes (cpGenomes), and in the arrangement of gene clusters, in the Euglenales. However, while these genomes provided important insights into the evolution of cpGenomes across the Euglenales and within their genera, only two genomes were analyzed in regard to genomic variability between and within Euglenales and Eutreptiales. To better understand the dynamics of chloroplast genome evolution in early evolving Eutreptiales, this study focused on the cpGenome of Eutreptiella pomquetensis, and the spread and peculiarities of introns. Methods The Etl. pomquetensis cpGenome was sequenced, annotated and afterwards examined in structure, size, gene order and intron content. These features were compared with other euglenoid cpGenomes as well as those of prasinophyte green algae, including Pyramimonas parkeae. Results and Discussion With about 130,561 bp the chloroplast genome of Etl. pomquetensis, a basal taxon in the phototrophic euglenoids, was considerably larger than the two other Eutreptiales cpGenomes sequenced so far. Although the detected quadripartite structure resembled most green algae and plant chloroplast genomes, the gene content of the single copy regions in Etl. pomquetensis was completely different from those observed in green algae and plants. The gene composition of Etl. pomquetensis was extensively changed and turned out to be almost identical to other Eutreptiales and Euglenales, and not to P. parkeae. Furthermore, the cpGenome of Etl. pomquetensis was unexpectedly permeated by a high number of introns, which led to a substantially larger genome. The 51 identified introns of Etl. pomquetensis showed two major unique features: (i) more than half of the introns displayed a high level of pairwise identities; (ii) no group III introns could be identified in the protein coding genes. These findings support the hypothesis that group III introns are degenerated group II introns and evolved later. PMID:28852596

  15. Isolation and characterization of full-length putative alcohol dehydrogenase genes from polygonum minus

    NASA Astrophysics Data System (ADS)

    Hamid, Nur Athirah Abd; Ismail, Ismanizan

    2013-11-01

    Polygonum minus, locally named as Kesum is an aromatic herb which is high in secondary metabolite content. Alcohol dehydrogenase is an important enzyme that catalyzes the reversible oxidation of alcohol and aldehyde with the presence of NAD(P)(H) as co-factor. The main focus of this research is to identify the gene of ADH. The total RNA was extracted from leaves of P. minus which was treated with 150 μM Jasmonic acid. Full-length cDNA sequence of ADH was isolated via rapid amplification cDNA end (RACE). Subsequently, in silico analysis was conducted on the full-length cDNA sequence and PCR was done on genomic DNA to determine the exon and intron organization. Two sequences of ADH, designated as PmADH1 and PmADH2 were successfully isolated. Both sequences have ORF of 801 bp which encode 266 aa residues. Nucleotide sequence comparison of PmADH1 and PmADH2 indicated that both sequences are highly similar at the ORF region but divergent in the 3' untranslated regions (UTR). The amino acid is differ at the 107 residue; PmADH1 contains Gly (G) residue while PmADH2 contains Cys (C) residue. The intron-exon organization pattern of both sequences are also same, with 3 introns and 4 exons. Based on in silico analysis, both sequences contain "classical" short chain alcohol dehydrogenases/reductases ((c) SDRs) conserved domain. The results suggest that both sequences are the members of short chain alcohol dehydrogenase family.

  16. Choosing and Using Introns in Molecular Phylogenetics

    PubMed Central

    Creer, Simon

    2007-01-01

    Introns are now commonly used in molecular phylogenetics in an attempt to recover gene trees that are concordant with species trees, but there are a range of genomic, logistical and analytical considerations that are infrequently discussed in empirical studies that utilize intron data. This review outlines expedient approaches for locus selection, overcoming paralogy problems, recombination detection methods and the identification and incorporation of LVHs in molecular systematics. A range of parsimony and Bayesian analytical approaches are also described in order to highlight the methods that can currently be employed to align sequences and treat indels in subsequent analyses. By covering the main points associated with the generation and analysis of intron data, this review aims to provide a comprehensive introduction to using introns (or any non-coding nuclear data partition) in contemporary phylogenetics. PMID:19461984

  17. Comparative Analysis of Four Calypogeia Species Revealed Unexpected Change in Evolutionarily-Stable Liverwort Mitogenomes

    PubMed Central

    Ślipiko, Monika; Buczkowska-Chmielewska, Katarzyna; Bączkiewicz, Alina; Szczecińska, Monika; Sawicki, Jakub

    2017-01-01

    Liverwort mitogenomes are considered to be evolutionarily stable. A comparative analysis of four Calypogeia species revealed differences compared to previously sequenced liverwort mitogenomes. Such differences involve unexpected structural changes in the two genes, cox1 and atp1, which have lost three and two introns, respectively. The group I introns in the cox1 gene are proposed to have been lost by two-step localized retroprocessing, whereas one-step retroprocessing could be responsible for the disappearance of the group II introns in the atp1 gene. These cases represent the first identified losses of introns in mitogenomes of leafy liverworts (Jungermanniopsida) contrasting the stability of mitochondrial gene order with certain changes in the gene content and intron set in liverworts. PMID:29257096

  18. Short intronic repeat sequences facilitate circular RNA production.

    PubMed

    Liang, Dongming; Wilusz, Jeremy E

    2014-10-15

    Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼ 30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.

  19. Elongation Factor-1α Accurately Reconstructs Relationships Amongst Psyllid Families (Hemiptera: Psylloidea), with Possible Diagnostic Implications.

    PubMed

    Martoni, Francesco; Bulman, Simon R; Pitman, Andrew; Armstrong, Karen F

    2017-12-05

    The superfamily Psylloidea (Hemiptera: Sternorrhyncha) lacks a robust multigene phylogeny. This impedes our understanding of the evolution of this group of insects and, consequently, an accurate identification of individuals, of their plant host associations, and their roles as vectors of economically important plant pathogens. The conserved nuclear gene elongation factor-1 alpha (EF-1α) has been valuable as a higher-level phylogenetic marker in insects and it has also been widely used to investigate the evolution of intron/exon structure. To explore evolutionary relationships among Psylloidea, polymerase chain reaction amplification and nucleotide sequencing of a 250-bp EF-1α gene fragment was applied to psyllids belonging to five different families. Introns were detected in three individuals belonging to two families. The nine genera belonging to the family Aphalaridae all lacked introns, highlighting the possibility of using intron presence/absence as a diagnostic tool at a family level. When paired with cytochrome oxidase I gene sequences, the 250 bp EF-1α sequence appeared to be a very promising higher-level phylogenetic marker for psyllids. © The Author(s) 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  20. Genome Analysis Reveals Interplay between 5′UTR Introns and Nuclear mRNA Export for Secretory and Mitochondrial Genes

    PubMed Central

    Cenik, Can; Chua, Hon Nian; Zhang, Hui; Tarnawsky, Stefan P.; Akef, Abdalla; Derti, Adnan; Tasan, Murat; Moore, Melissa J.; Palazzo, Alexander F.; Roth, Frederick P.

    2011-01-01

    In higher eukaryotes, messenger RNAs (mRNAs) are exported from the nucleus to the cytoplasm via factors deposited near the 5′ end of the transcript during splicing. The signal sequence coding region (SSCR) can support an alternative mRNA export (ALREX) pathway that does not require splicing. However, most SSCR–containing genes also have introns, so the interplay between these export mechanisms remains unclear. Here we support a model in which the furthest upstream element in a given transcript, be it an intron or an ALREX–promoting SSCR, dictates the mRNA export pathway used. We also experimentally demonstrate that nuclear-encoded mitochondrial genes can use the ALREX pathway. Thus, ALREX can also be supported by nucleotide signals within mitochondrial-targeting sequence coding regions (MSCRs). Finally, we identified and experimentally verified novel motifs associated with the ALREX pathway that are shared by both SSCRs and MSCRs. Our results show strong correlation between 5′ untranslated region (5′UTR) intron presence/absence and sequence features at the beginning of the coding region. They also suggest that genes encoding secretory and mitochondrial proteins share a common regulatory mechanism at the level of mRNA export. PMID:21533221

  1. RPG: the Ribosomal Protein Gene database.

    PubMed

    Nakao, Akihiro; Yoshihama, Maki; Kenmochi, Naoya

    2004-01-01

    RPG (http://ribosome.miyazaki-med.ac.jp/) is a new database that provides detailed information about ribosomal protein (RP) genes. It contains data from humans and other organisms, including Drosophila melanogaster, Caenorhabditis elegans, Saccharo myces cerevisiae, Methanococcus jannaschii and Escherichia coli. Users can search the database by gene name and organism. Each record includes sequences (genomic, cDNA and amino acid sequences), intron/exon structures, genomic locations and information about orthologs. In addition, users can view and compare the gene structures of the above organisms and make multiple amino acid sequence alignments. RPG also provides information on small nucleolar RNAs (snoRNAs) that are encoded in the introns of RP genes.

  2. RPG: the Ribosomal Protein Gene database

    PubMed Central

    Nakao, Akihiro; Yoshihama, Maki; Kenmochi, Naoya

    2004-01-01

    RPG (http://ribosome.miyazaki-med.ac.jp/) is a new database that provides detailed information about ribosomal protein (RP) genes. It contains data from humans and other organisms, including Drosophila melanogaster, Caenorhabditis elegans, Saccharo myces cerevisiae, Methanococcus jannaschii and Escherichia coli. Users can search the database by gene name and organism. Each record includes sequences (genomic, cDNA and amino acid sequences), intron/exon structures, genomic locations and information about orthologs. In addition, users can view and compare the gene structures of the above organisms and make multiple amino acid sequence alignments. RPG also provides information on small nucleolar RNAs (snoRNAs) that are encoded in the introns of RP genes. PMID:14681386

  3. Characterizing the strand-specific distribution of non-CpG methylation in human pluripotent cells.

    PubMed

    Guo, Weilong; Chung, Wen-Yu; Qian, Minping; Pellegrini, Matteo; Zhang, Michael Q

    2014-03-01

    DNA methylation is an important defense and regulatory mechanism. In mammals, most DNA methylation occurs at CpG sites, and asymmetric non-CpG methylation has only been detected at appreciable levels in a few cell types. We are the first to systematically study the strand-specific distribution of non-CpG methylation. With the divide-and-compare strategy, we show that CHG and CHH methylation are not intrinsically different in human embryonic stem cells (ESCs) and induced pluripotent stem cells (iPSCs). We also find that non-CpG methylation is skewed between the two strands in introns, especially at intron boundaries and in highly expressed genes. Controlling for the proximal sequences of non-CpG sites, we show that the skew of non-CpG methylation in introns is mainly guided by sequence skew. By studying subgroups of transposable elements, we also found that non-CpG methylation is distributed in a strand-specific manner in both short interspersed nuclear elements (SINE) and long interspersed nuclear elements (LINE), but not in long terminal repeats (LTR). Finally, we show that on the antisense strand of Alus, a non-CpG site just downstream of the A-box is highly methylated. Together, the divide-and-compare strategy leads us to identify regions with strand-specific distributions of non-CpG methylation in humans.

  4. Occurrence of Can-SINEs and intron sequence evolution supports robust phylogeny of pinniped carnivores and their terrestrial relatives.

    PubMed

    Schröder, Christiane; Bleidorn, Christoph; Hartmann, Stefanie; Tiedemann, Ralph

    2009-12-15

    Investigating the dog genome we found 178965 introns with a moderate length of 200-1000 bp. A screening of these sequences against 23 different repeat libraries to find insertions of short interspersed elements (SINEs) detected 45276 SINEs. Virtually all of these SINEs (98%) belong to the tRNA-derived Can-SINE family. Can-SINEs arose about 55 million years ago before Carnivora split into two basal groups, the Caniformia (dog-like carnivores) and the Feliformia (cat-like carnivores). Genome comparisons of dog and cat recovered 506 putatively informative SINE loci for caniformian phylogeny. In this study we show how to use such genome information of model organisms to research the phylogeny of related non-model species of interest. Investigating a dataset including representatives of all major caniformian lineages, we analysed 24 randomly chosen loci for 22 taxa. All loci were amplifiable and revealed 17 parsimony-informative SINE insertions. The screening for informative SINE insertions yields a large amount of sequence information, in particular of introns, which contain reliable phylogenetic information as well. A phylogenetic analysis of intron- and SINE sequence data provided a statistically robust phylogeny which is congruent with the absence/presence pattern of our SINE markers. This phylogeny strongly supports a sistergroup relationship of Musteloidea and Pinnipedia. Within Pinnipedia, we see strong support from bootstrapping and the presence of a SINE insertion for a sistergroup relationship of the walrus with the Otariidae.

  5. G to A substitution in 5{prime} donor splice site of introns 18 and 48 of COL1A1 gene of type I collagen results in different splicing alternatives in osteogenesis imperfecta type I cell strains

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Willing, M.; Deschenes, S.

    We have identified a G to A substitution in the 5{prime} donor splice site of intron 18 of one COL1A1 allele in two unrelated families with osteogenesis imperfecta (OI) type I. A third OI type I family has a G to A substitution at the identical position in intron 48 of one COL1A1 allele. Both mutations abolish normal splicing and lead to reduced steady-state levels of mRNA from the mutant COL1A1 allele. The intron 18 mutation leads to both exon 18 skipping in the mRNA and to utilization of a single alternative splice site near the 3{prime} end of exonmore » 18. The latter results in deletion of the last 8 nucleotides of exon 18 from the mRNA, a shift in the translational reading-frame, and the creation of a premature termination codon in exon 19. Of the potential alternative 5{prime} splice sites in exon 18 and intron 18, the one utilized has a surrounding nucleotide sequence which most closely resembles that of the natural splice site. Although a G to A mutation was detected at the identical position in intron 48 of one COL1A1 allele in another OI type I family, nine complex alternative splicing patterns were identified by sequence analysis of cDNA clones derived from fibroblast mRNA from this cell strain. All result in partial or complete skipping of exon 48, with in-frame deletions of portions of exons 47 and/or 49. The different patterns of RNA splicing were not explained by their sequence homology with naturally occuring 5{prime} splice sites, but rather by recombination between highly homologous exon sequences, suggesting that we may not have identified the major splicing alternative(s) in this cell strain. Both G to A mutations result in decreased production of type I collagen, the common biochemical correlate of OI type I.« less

  6. Physical structure and chromosomal localization of a gene encoding human p58[sup clk-1], a cell division control related protein kinase

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Eipers, P.G.

    1992-01-01

    The gene for the human p58[sup clk[minus]1] protein kinase, a cell division control-related gene, has been mapped by somatic cell hybrid analyses, in situ localization with the chromosomal gene, and nested polymerase chain reaction amplification of microdissected chromosomes. These studies indicate that the expressed p58[sup clk[minus]1] chromosomal gene maps to 1p36, while a highly related p58[sup clk[minus]1] sequence of unknown nature maps to chromosome 15. Assignment of a p34[sup cdc2]-related gene to 1p36 region, including neuroblastoma, ductal carcinoma of the breast, malignant melanoma, Merkel cell carcinoma and endocrine neoplasia among others. Aberrant expression of this protein kinase negatively regulates normalmore » cellular growth. The p58[sup clk[minus]1] protein contains a central domain of 299 amino acids that is 46% identical to human p34[sup cdc2], the master mitotic protein kinase. This dissertation details the complete structure of the p58[sup clk[minus]1] chromosomal gene, including its putative promoter region, transcriptional start sites, exonic sequences, and intron/exon boundary sequences. The gene is 10 kb in size and contains 12 exons and 11 introns. Interestingly, the rather large 2.0 kb 3[prime] untranslated region is interrupted by an intron that separates a region containing numerous AUUUA destabilization motifs from the coding region. Furthermore, the expression of this gene in normal human tissues, as well as several human tumor cell samples and lines, is examined. The origin of multiple human transcripts from the same chromosomal gene, and the possible differential stability of these various transcripts, is discussed with regard to the transcriptional and post-transcriptional regulation of this gene. This is the first report of the chromosomal gene structure of a member of the p34[sup cdc2] supergene family.« less

  7. DNA double-strand break in vivo at the 3' extremity of exons located upstream of group II introns. Senescence and circular DNA introns in Podospora mitochondria.

    PubMed

    Sainsard-Chanet, A; Begel, O; Belcour, L

    1994-10-07

    In the filamentous fungus Podospora anserina, the unavoidable phenomenon of senescence is associated with the amplification of the first intron of the mitochondrial cox1 that accumulates as circular DNA molecules consisting of tandem repeats. This group II intron (cox1-i1 or alpha) is able to transpose and contains an open reading frame with significant amino acid similarity with reverse transcriptases. The generation of these intronic circular DNA molecules, their amplification and their involvement in the senescence process are unresolved questions. We demonstrate here that: (1) another group II intron, the fourth intron of gene cox1, cox1-i4, is also able to give precise DNA end to end junctions; (2) this intronic sequence can be found amplified during senescence, although to a lesser extent than cox1-i1; (3) the amplification of the DNA multimeric cox1-i1 molecules likely does not proceed by autonomous replication; (4) the generation of the DNA intronic circles does not require efficient intron splicing; (5) a DNA double-strand break occurs in vivo at the 3' extremity of the cox1-e1 and cox1-e4 exons preceding the group II introns that form circular DNAs. On the whole, these results show that the ability to form DNA circular molecules is a property of some group II introns and they demonstrate the occurrence of a specific DNA cleavage at or near the integration site of these group II introns. The results strongly suggest that this cleavage is involved in the formation of the group II intronic DNA circles and could also be involved in the phenomenon of group II intron homing.

  8. Branchpoint selection in the splicing of U12-dependent introns in vitro.

    PubMed

    McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A

    2002-05-01

    In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome.

  9. Branchpoint selection in the splicing of U12-dependent introns in vitro.

    PubMed Central

    McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A

    2002-01-01

    In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome. PMID:12022225

  10. A novel non-coding RNA within an intron of CDH2 and association of its SNP with non-syndromic cleft lip and palate.

    PubMed

    Kumari, Priyanka; Singh, Subodh Kumar; Raman, Rajiva

    2018-06-05

    Genome-wide linkage analysis and whole genome sequencing in a Van der Woude syndrome (VWS) family revealed that the SNP, rs539075, within intron 2 of the cadherin 2 gene (CDH2) co-segregated with the disease phenotype. A study with nonsyndromic cleft lip with or without cleft palate (NSCL ± P) cases (N = 292) and controls (N = 287) established association of this SNP with NSCL ± P as a risk factor. RT-PCR based expression analysis of the SNP-harbouring region of intron 2 of CDH2 in the clefted lip and/or palate tissues of 16 patients revealed that the mutant allele expressed in all those individuals having it (hetero-/homozygous), whereas the wild type allele expressed in <50% of the samples in which it was present. The intronic transcript was also present in the prospective lip and palate region of 13.5 dpc mouse embryo, detected by RNA in situ hybridization and RT-PCR. These results including the in silico, characterization of the ~200 nt-intronic transcript showed that conformationally it fits best with noncoding small RNA, possibly a precursor of miRNA. Its function in the orofacial organogenesis remains to be elucidated which will enable us to define the role of this mutant ncRNA in the clefting of lip and palate. Copyright © 2018 Elsevier B.V. All rights reserved.

  11. Characterization of shark complement factor I gene(s): genomic analysis of a novel shark-specific sequence.

    PubMed

    Shin, Dong-Ho; Webb, Barbara M; Nakao, Miki; Smith, Sylvia L

    2009-07-01

    Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and -d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (

  12. Characterization of shark complement factor I gene(s): genomic analysis of a novel shark-specific sequence

    PubMed Central

    Shin, Dong-Ho; Webb, Barbara M.; Nakao, Miki; Smith, Sylvia L.

    2009-01-01

    Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and –d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (≤) amino acid identities with each other, 35.4 ~ 39.6% and 62.8 ~ 65.9% with factor I of mammals and banded houndshark (Triakis scyllium), respectively. The modular structure of the GcIf is similar to that of mammals with one notable exception, the presence of a novel shark-specific sequence between the leader peptide (LP) and the factor I membrane attack complex (FIMAC) domain. The cDNA sequences differ only in the size and composition of the shark-specific region (SSR). Sequence analysis of each SSR has identified within the region two novel short sequences (SS1 and SS2) and three repeat sequences (RS1, 2 and 3). Genomic analysis has revealed the existence of three introns between the leader peptide and the FIMAC domain, tentatively designated intron 1, intron 2, and intron 3 which span 4067, 2293 and 2082 bp, respectively. Southern blot analysis suggests the presence of a single gene copy for each cDNA type. Phylogenetic analysis suggests that complement factor I of cartilaginous fish diverged prior to the emergence of mammals. All four GcIf cDNA species are expressed in four different tissues and the liver is the main tissue in which expression level of all four is high. This suggests that the expression of GcIf isotypes is tissue-dependent. PMID:19423168

  13. A sequence-based genetic map of Medicago truncatula and comparison of marker colinearity with M. sativa.

    PubMed Central

    Choi, Hong-Kyu; Kim, Dongjin; Uhm, Taesik; Limpens, Eric; Lim, Hyunju; Mun, Jeong-Hwan; Kalo, Peter; Penmetsa, R Varma; Seres, Andrea; Kulikova, Olga; Roe, Bruce A; Bisseling, Ton; Kiss, Gyorgy B; Cook, Douglas R

    2004-01-01

    A core genetic map of the legume Medicago truncatula has been established by analyzing the segregation of 288 sequence-characterized genetic markers in an F(2) population composed of 93 individuals. These molecular markers correspond to 141 ESTs, 80 BAC end sequence tags, and 67 resistance gene analogs, covering 513 cM. In the case of EST-based markers we used an intron-targeted marker strategy with primers designed to anneal in conserved exon regions and to amplify across intron regions. Polymorphisms were significantly more frequent in intron vs. exon regions, thus providing an efficient mechanism to map transcribed genes. Genetic and cytogenetic analysis produced eight well-resolved linkage groups, which have been previously correlated with eight chromosomes by means of FISH with mapped BAC clones. We anticipated that mapping of conserved coding regions would have utility for comparative mapping among legumes; thus 60 of the EST-based primer pairs were designed to amplify orthologous sequences across a range of legume species. As an initial test of this strategy, we used primers designed against M. truncatula exon sequences to rapidly map genes in M. sativa. The resulting comparative map, which includes 68 bridging markers, indicates that the two Medicago genomes are highly similar and establishes the basis for a Medicago composite map. PMID:15082563

  14. Sensing Self and Foreign Circular RNAs by Intron Identity.

    PubMed

    Chen, Y Grace; Kim, Myoungjoo V; Chen, Xingqi; Batista, Pedro J; Aoyama, Saeko; Wilusz, Jeremy E; Iwasaki, Akiko; Chang, Howard Y

    2017-07-20

    Circular RNAs (circRNAs) are single-stranded RNAs that are joined head to tail with largely unknown functions. Here we show that transfection of purified in vitro generated circRNA into mammalian cells led to potent induction of innate immunity genes and confers protection against viral infection. The nucleic acid sensor RIG-I is necessary to sense foreign circRNA, and RIG-I and foreign circRNA co-aggregate in cytoplasmic foci. CircRNA activation of innate immunity is independent of a 5' triphosphate, double-stranded RNA structure, or the primary sequence of the foreign circRNA. Instead, self-nonself discrimination depends on the intron that programs the circRNA. Use of a human intron to express a foreign circRNA sequence abrogates immune activation, and mature human circRNA is associated with diverse RNA binding proteins reflecting its endogenous splicing and biogenesis. These results reveal innate immune sensing of circRNA and highlight introns-the predominant output of mammalian transcription-as arbiters of self-nonself identity. Copyright © 2017 Elsevier Inc. All rights reserved.

  15. Insights into evolution in Andean Polystichum (Dryopteridaceae) from expanded understanding of the cytosolic phosphoglucose isomerase gene.

    PubMed

    Lyons, Brendan M; McHenry, Monique A; Barrington, David S

    2017-07-01

    Cytosolic phosphoglucose isomerase (pgiC) is an enzyme essential to glycolysis found universally in eukaryotes, but broad understanding of variation in the gene coding for pgiC is lacking for ferns. We used a substantially expanded representation of the gene for Andean species of the fern genus Polystichum to characterize pgiC in ferns relative to angiosperms, insects, and an amoebozoan; assess the impact of selection versus neutral evolutionary processes on pgiC; and explore evolutionary relationships of selected Andean species. The dataset of complete sequences comprised nine accessions representing seven species and one hybrid from the Andes and Serra do Mar. The aligned sequences of the full data set comprised 3376 base pairs (70% of the entire gene) including 17 exons and 15 introns from two central areas of the gene. The exons are highly conserved relative to angiosperms and retain substantial homology to insect pgiC, but intron length and structure are unique to the ferns. Average intron size is similar to angiosperms; intron number and location in insects are unlike those of the plants we considered. The introns included an array of indels and, in intron 7, an extensive microsatellite array with potential utility in analyzing population-level histories. Bayesian and maximum-parsimony analysis of 129 variable nucleotides in the Andean polystichums revealed that 59 (1.7% of the 3376 total) were phylogenetically informative; most of these united sister accessions. The phylogenetic trees for the Andean polystichums were incongruent with previously published cpDNA trees for the same taxa, likely the result of rapid evolutionary change in the introns and contrasting stability in the exons. The exons code a total of seven amino-acid substitutions. Comparison of non-synonymous to synonymous substitutions did not suggest that the pgiC gene is under selection in the Andes. Variation in pgiC including two additional accessions represented by incomplete sequences provided new insights into reticulate relationships among Andean taxa. Copyright © 2017 Elsevier Inc. All rights reserved.

  16. Phylogenetic Analysis of Nuclear-Encoded RNA Maturases

    PubMed Central

    Malik, Sunita; Upadhyaya, KC; Khurana, SM Paul

    2017-01-01

    Posttranscriptional processes, such as splicing, play a crucial role in gene expression and are prevalent not only in nuclear genes but also in plant mitochondria where splicing of group II introns is catalyzed by a class of proteins termed maturases. In plant mitochondria, there are 22 mitochondrial group II introns. matR, nMAT1, nMAT2, nMAT3, and nMAT4 proteins have been shown to be required for efficient splicing of several group II introns in Arabidopsis thaliana. Nuclear maturases (nMATs) are necessary for splicing of mitochondrial genes, leading to normal oxidative phosphorylation. Sequence analysis through phylogenetic tree (including bootstrapping) revealed high homology with maturase sequences of A thaliana and other plants. This study shows the phylogenetic relationship of nMAT proteins between A thaliana and other nonredundant plant species taken from BLASTP analysis. PMID:28607538

  17. Alternative splicing by participation of the group II intron ORF in extremely halotolerant and alkaliphilic Oceanobacillus iheyensis.

    PubMed

    Chee, Gab-Joo; Takami, Hideto

    2011-01-01

    Group II introns inserted into genes often undergo splicing at unexpected sites, and participate in the transcription of host genes. We identified five copies of a group II intron, designated Oi.Int, in the genome of an extremely halotolerant and alkaliphilic bacillus, Oceanobacillus iheyensis. The Oi.Int4 differs from the Oi.Int3 at four bases. The ligated exons of the Oi.Int4 could not be detected by RT-PCR assays in vivo or in vitro although group II introns can generally self-splice in vitro without the involvement of an intron-encoded open reading frame (ORF). In the Oi.Int4 mutants with base substitutions within the ORF, ligated exons were detected by in vitro self-splicing. It was clear that the ligation of exons during splicing is affected by the sequence of the intron-encoded ORF since the splice sites corresponded to the joining sites of the intron. In addition, the mutant introns showed unexpected multiple products with alternative 5' splice sites. These findings imply that alternative 5' splicing which causes a functional change of ligated exons presumably has influenced past adaptations of O. iheyensis to various environmental changes.

  18. Dispersion of the RmInt1 group II intron in the Sinorhizobium meliloti genome upon acquisition by conjugative transfer

    PubMed Central

    Nisa-Martínez, Rafael; Jiménez-Zurdo, José I.; Martínez-Abarca, Francisco; Muñoz-Adelantado, Estefanía; Toro, Nicolás

    2007-01-01

    RmInt1 is a self-splicing and mobile group II intron initially identified in the bacterium Sinorhizobium meliloti, which encodes a reverse transcriptase–maturase (Intron Encoded Protein, IEP) lacking the C-terminal DNA binding (D) and DNA endonuclease domains (En). RmInt1 invades cognate intronless homing sites (ISRm2011-2) by a mechanism known as retrohoming. This work describes how the RmInt1 intron spreads in the S.meliloti genome upon acquisition by conjugation. This process was revealed by using the wild-type intron RmInt1 and engineered intron-donor constructs based on ribozyme coding sequence (ΔORF)-derivatives with higher homing efficiency than the wild-type intron. The data demonstrate that RmInt1 propagates into the S.meliloti genome primarily by retrohoming with a strand bias related to replication of the chromosome and symbiotic megaplasmids. Moreover, we show that when expressed in trans from a separate plasmid, the IEP is able to mobilize genomic ΔORF ribozymes that afterward displayed wild-type levels of retrohoming. Our results contribute to get further understanding of how group II introns spread into bacterial genomes in nature. PMID:17158161

  19. Dispersion of the RmInt1 group II intron in the Sinorhizobium meliloti genome upon acquisition by conjugative transfer.

    PubMed

    Nisa-Martínez, Rafael; Jiménez-Zurdo, José I; Martínez-Abarca, Francisco; Muñoz-Adelantado, Estefanía; Toro, Nicolás

    2007-01-01

    RmInt1 is a self-splicing and mobile group II intron initially identified in the bacterium Sinorhizobium meliloti, which encodes a reverse transcriptase-maturase (Intron Encoded Protein, IEP) lacking the C-terminal DNA binding (D) and DNA endonuclease domains (En). RmInt1 invades cognate intronless homing sites (ISRm2011-2) by a mechanism known as retrohoming. This work describes how the RmInt1 intron spreads in the S.meliloti genome upon acquisition by conjugation. This process was revealed by using the wild-type intron RmInt1 and engineered intron-donor constructs based on ribozyme coding sequence (DeltaORF)-derivatives with higher homing efficiency than the wild-type intron. The data demonstrate that RmInt1 propagates into the S.meliloti genome primarily by retrohoming with a strand bias related to replication of the chromosome and symbiotic megaplasmids. Moreover, we show that when expressed in trans from a separate plasmid, the IEP is able to mobilize genomic DeltaORF ribozymes that afterward displayed wild-type levels of retrohoming. Our results contribute to get further understanding of how group II introns spread into bacterial genomes in nature.

  20. Long-read sequencing of nascent RNA reveals coupling among RNA processing events.

    PubMed

    Herzel, Lydia; Straube, Korinna; Neugebauer, Karla M

    2018-06-14

    Pre-mRNA splicing is accomplished by the spliceosome, a megadalton complex that assembles de novo on each intron. Because spliceosome assembly and catalysis occur cotranscriptionally, we hypothesized that introns are removed in the order of their transcription in genomes dominated by constitutive splicing. Remarkably little is known about splicing order and the regulatory potential of nascent transcript remodeling by splicing, due to the limitations of existing methods that focus on analysis of mature splicing products (mRNAs) rather than substrates and intermediates. Here, we overcome this obstacle through long-read RNA sequencing of nascent, multi-intron transcripts in the fission yeast Schizosaccharomyces pombe Most multi-intron transcripts were fully spliced, consistent with rapid cotranscriptional splicing. However, an unexpectedly high proportion of transcripts were either fully spliced or fully unspliced, suggesting that splicing of any given intron is dependent on the splicing status of other introns in the transcript. Supporting this, mild inhibition of splicing by a temperature-sensitive mutation in prp2 , the homolog of vertebrate U2AF65, increased the frequency of fully unspliced transcripts. Importantly, fully unspliced transcripts displayed transcriptional read-through at the polyA site and were degraded cotranscriptionally by the nuclear exosome. Finally, we show that cellular mRNA levels were reduced in genes with a high number of unspliced nascent transcripts during caffeine treatment, showing regulatory significance of cotranscriptional splicing. Therefore, overall splicing of individual nascent transcripts, 3' end formation, and mRNA half-life depend on the splicing status of neighboring introns, suggesting crosstalk among spliceosomes and the polyA cleavage machinery during transcription elongation. © 2018 Herzel et al.; Published by Cold Spring Harbor Laboratory Press.

  1. The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture

    PubMed Central

    Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen

    2017-01-01

    Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing. PMID:29280736

  2. The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture

    DOE PAGES

    Pai, Athma A.; Henriques, Telmo; McCue, Kayla; ...

    2017-12-27

    Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less

  3. The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pai, Athma A.; Henriques, Telmo; McCue, Kayla

    Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less

  4. Late-onset spastic paraplegia: Aberrant SPG11 transcripts generated by a novel splice site donor mutation.

    PubMed

    Kawarai, Toshitaka; Miyamoto, Ryosuke; Mori, Atsuko; Oki, Ryosuke; Tsukamoto-Miyashiro, Ai; Matsui, Naoko; Miyazaki, Yoshimichi; Orlacchio, Antonio; Izumi, Yuishin; Nishida, Yoshihiko; Kaji, Ryuji

    2015-12-15

    We identified a novel homozygous mutation in the splice site donor (SSD) of intron 30 (c.5866+1G>A) in consanguineous Japanese SPG11 siblings showing late-onset spastic paraplegia using the whole-exome sequencing. Phenotypic variability was observed, including age-at-onset, dysarthria and pes cavus. Coding DNA sequencing revealed that the mutation affected the recognition of the constitutive SSD of intron 30, splicing upstream onto a nearby cryptic SSD in exon 30. The use of constitutive splice sites of intron 29 was confirmed by sequencing. The mutant transcripts are mostly subject to degradation by the nonsense-mediated mRNA decay system. SPG11 transcripts, escaping from the nonsense-mediated mRNA decay pathway, would generate a truncated protein (p.Tyr1900Phefs5X) containing the first 1899 amino acids and followed by 4 aberrant amino acids. This study showed a successful clinical application of whole-exome sequencing in spastic paraplegia and demonstrated a further evidence of allelic heterogeneity in SPG11. The confirmation of aberrant transcript by splice site mutation is a prerequisite for a more precise molecular diagnosis. Copyright © 2015 Elsevier B.V. All rights reserved.

  5. Typing of artiodactyl MHC-DRB genes with the help of intronic simple repeated DNA sequences.

    PubMed

    Schwaiger, F W; Buitkamp, J; Weyers, E; Epplen, J T

    1993-02-01

    An efficient oligonucleotide typing method for the highly polymorphic MHC-DRB genes is described for artiodactyls like cattle, sheep and goat. By means of the polymerase chain reaction, the second exon of MHC-DRB is amplified as well as part of the adjacent intron containing a mixed simple repeat sequence. Using this primer combination we were able to amplify the MHC-DRB exons 2 and adjacent introns from all of the investigated 10 species of the family of Bovidae and giraffes. Therefore, the DRB genes of novel artiodactyl species can also be readily studied. Oligonucleotide probes specific for the polymorphisms of ungulate DRB genes are used with which sequences differing in at least one single base can be distinguished. Exonic polymorphism was found to be correlated with the allele lengths and the patterns of the repeat structures. Hence oligonucleotide probes specific for different simple repeats and polymorphic positions serve also for typing across species barriers. The strict correlation of sequence length and exonic polymorphism permits a preselection of specific oligonucleotides for hybridization. Thus more than 20 alleles can already be differentiated from each of the three species.

  6. [Sequence analysis of LEAFY homologous gene from Dendrobium moniliforme and application for identification of medicinal Dendrobium].

    PubMed

    Xing, Wen-Rui; Hou, Bei-Wei; Guan, Jing-Jiao; Luo, Jing; Ding, Xiao-Yu

    2013-04-01

    The LEAFY (LFY) homologous gene of Dendrobium moniliforme (L.) Sw. was cloned by new primers which were designed based on the conservative region of known sequences of orchid LEAFY gene. Partial LFY homologous gene was cloned by common PCR, then we got the complete LFY homologous gene Den LFY by Tail-PCR. The complete sequence of DenLFY gene was 3 575 bp which contained three exons and two introns. Using BLAST method, comparison analysis among the exon of LFY homologous gene indicted that the DenLFY gene had high identity with orchids LFY homologous, including the related fragment of PhalLFY (84%) in Phalaenopsis hybrid cultivar, LFY homologous gene in Oncidium (90%) and in other orchid (over 80%). Using MP analysis, Dendrobium is found to be the sister to Oncidium and Phalaenopsis. Homologous analysis demonstrated that the C-terminal amino acids were highly conserved. When the exons and introns were separately considered, exons and the sequence of amino acid were good markers for the function research of DenLFY gene. The second intron can be used in authentication research of Dendrobium based on the length polymorphism between Dendrobium moniliforme and Dendrobium officinale.

  7. Development and utilization of novel intron length polymorphic markers in foxtail millet (Setaria italica (L.) P. Beauv.).

    PubMed

    Gupta, Sarika; Kumari, Kajal; Das, Jyotirmoy; Lata, Charu; Puranik, Swati; Prasad, Manoj

    2011-07-01

    Introns are noncoding sequences in a gene that are transcribed to precursor mRNA but spliced out during mRNA maturation and are abundant in eukaryotic genomes. The availability of codominant molecular markers and saturated genetic linkage maps have been limited in foxtail millet (Setaria italica (L.) P. Beauv.). Here, we describe the development of 98 novel intron length polymorphic (ILP) markers in foxtail millet using sequence information of the model plant rice. A total of 575 nonredundant expressed sequence tag (EST) sequences were obtained, of which 327 and 248 unique sequences were from dehydration- and salinity-stressed suppression subtractive hybridization libraries, respectively. The BLAST analysis of 98 EST sequences suggests a nearly defined function for about 64% of them, and they were grouped into 11 different functional categories. All 98 ILP primer pairs showed a high level of cross-species amplification in two millets and two nonmillets species ranging from 90% to 100%, with a mean of ∼97%. The mean observed heterozygosity and Nei's average gene diversity 0.016 and 0.171, respectively, established the efficiency of the ILP markers for distinguishing the foxtail millet accessions. Based on 26 ILP markers, a reasonable dendrogram of 45 foxtail millet accessions was constructed, demonstrating the utility of ILP markers in germplasm characterizations and genomic relationships in millets and nonmillets species.

  8. Characterization and Expression of the Lucina pectinata Oxygen and Sulfide Binding Hemoglobin Genes

    PubMed Central

    López-Garriga, Juan; Cadilla, Carmen L.

    2016-01-01

    The clam Lucina pectinata lives in sulfide-rich muds and houses intracellular symbiotic bacteria that need to be supplied with hydrogen sulfide and oxygen. This clam possesses three hemoglobins: hemoglobin I (HbI), a sulfide-reactive protein, and hemoglobin II (HbII) and III (HbIII), which are oxygen-reactive. We characterized the complete gene sequence and promoter regions for the oxygen reactive hemoglobins and the partial structure and promoters of the HbI gene from Lucina pectinata. We show that HbI has two mRNA variants, where the 5’end had either a sequence of 96 bp (long variant) or 37 bp (short variant). The gene structure of the oxygen reactive Hbs is defined by having 4-exons/3-introns with conservation of intron location at B12.2 and G7.0 and the presence of pre-coding introns, while the partial gene structure of HbI has the same intron conservation but appears to have a 5-exon/ 4-intron structure. A search for putative transcription factor binding sites (TFBSs) was done with the promoters for HbII, HbIII, HbI short and HbI long. The HbII, HbIII and HbI long promoters showed similar predicted TFBSs. We also characterized MITE-like elements in the HbI and HbII gene promoters and intronic regions that are similar to sequences found in other mollusk genomes. The gene expression levels of the clam Hbs, from sulfide-rich and sulfide-poor environments showed a significant decrease of expression in the symbiont-containing tissue for those clams in a sulfide-poor environment, suggesting that the sulfide concentration may be involved in the regulation of these proteins. Gene expression evaluation of the two HbI mRNA variants indicated that the longer variant is expressed at higher levels than the shorter variant in both environments. PMID:26824233

  9. Gene structure and evolution of transthyretin in the order Chiroptera.

    PubMed

    Khwanmunee, Jiraporn; Leelawatwattana, Ladda; Prapunpoj, Porntip

    2016-02-01

    Bats are mammals in the order Chiroptera. Although many extensive morphologic and molecular genetics analyses have been attempted, phylogenetic relationships of bats has not been completely resolved. The paraphyly of microbats is of particular controversy that needs to be confirmed. In this study, we attempted to use the nucleotide sequence of transthyretin (TTR) intron 1 to resolve the relationship among bats. To explore its utility, the complete sequences of TTR gene and intron 1 region of bats in Vespertilionidae: genus Eptesicus (Eptesicus fuscus) and genus Myotis (Myotis brandtii, Myotis davidii, and Myotis lucifugus), and Pteropodidae (Pteropus alecto and Pteropus vampyrus) were extracted from the retrieved sequences, whereas those of Rhinoluphus affinis and Scotophilus kuhlii were amplified and sequenced. The derived overall amino sequences of bat TTRs were found to be very similar to those in other eutherians but differed from those in other classes of vertebrates. However, missing of amino acids from N-terminal or C-terminal region was observed. The phylogenetic analysis of amino acid sequences suggested bat and other eutherian TTRs lineal descent from a single most recent common ancestor which differed from those of non-placental mammals and the other classes of vertebrates. The splicing of bat TTR precursor mRNAs was similar to those of other eutherian but different from those of marsupial, bird, reptile and amphibian. Based on TTR intron 1 sequence, the inferred evolutionary relationship within Chiroptera revealed more closely relatedness of R. affinis to megabats than to microbats. Accordingly, the paraphyly of microbats was suggested.

  10. Intron retention regulates the expression of pectin methyl esterase inhibitor (Pmei) genes during wheat growth and development.

    PubMed

    Rocchi, V; Janni, M; Bellincampi, D; Giardina, T; D'Ovidio, R

    2012-03-01

    Pectin is an important component of the plant cell wall and its remodelling occurs during normal plant growth or following stress responses. Pectin is secreted into the cell wall in a highly methyl-esterified form and subsequently de-methyl-esterified by pectin methyl esterase (PME), whose activity is controlled by the pectin methyl esterase inhibitor protein (PMEI). Cereal cell wall contains a low amount of pectin; nonetheless the level and pattern of pectin methyl esterification play a primary role during development or pathogen infection. Since few data are available on the role of PMEI in plant development and defence of cereal species, we isolated and characterised three Pmei genes (Tdpmei2.1, Tdpmei2.2 and Tdpmei3) and their encoded products in wheat. Sequence comparisons showed a low level of intra- and inter-specific sequence conservation of PMEIs. Tdpmei2.1 and Tdpmei2.2 share 94% identity at protein level, but only 20% identity with the product of Tdpmei3. All three Tdpmei genes code for functional inhibitors of plant PMEs and do not inhibit microbial PMEs or a plant invertase. RT-PCR analyses demonstrated, for the first time to our knowledge, that Pmei genes are regulated by intron retention. Processed and unprocessed transcripts of Tdpmei2.1 and Tdpmei2.2 accumulated in several organs, but anthers contained only mature transcripts. Tdpmei3 lacks introns and its transcript accumulated mainly in stem internodes. These findings suggest that products encoded by these Tdpmei genes control organ- or tissue-specific activity of specific PME isoforms in wheat. © 2011 German Botanical Society and The Royal Botanical Society of the Netherlands.

  11. Unique CD44 intronic SNP is associated with tumor grade in breast cancer: a case control study and in silico analysis.

    PubMed

    Esmaeili, Rezvan; Abdoli, Nasrin; Yadegari, Fatemeh; Neishaboury, Mohamadreza; Farahmand, Leila; Kaviani, Ahmad; Majidzadeh-A, Keivan

    2018-01-01

    CD44 encoded by a single gene is a cell surface transmembrane glycoprotein. Exon 2 is one of the important exons to bind CD44 protein to hyaluronan. Experimental evidences show that hyaluronan-CD44 interaction intensifies the proliferation, migration, and invasion of breast cancer cells. Therefore, the current study aimed at investigating the association between specific polymorphisms in exon 2 and its flanking region of CD44 with predisposition to breast cancer. In the current study, 175 Iranian female patients with breast cancer and 175 age-matched healthy controls were recruited in biobank, Breast Cancer Research Center, Tehran, Iran. Single nucleotide polymorphisms of CD44 exon 2 and its flanking were analyzed via polymerase chain reaction and gene sequencing techniques. Association between the observed variation with breast cancer risk and clinico-pathological characteristics were studied. Subsequently, bioinformatics analysis was conducted to predict potential exonic splicing enhancer (ESE) motifs changed as the result of a mutation. A unique polymorphism of the gene encoding CD44 was identified at position 14 nucleotide upstream of exon 2 (A37692→G) by the sequencing method. The A > G polymorphism exhibited a significant association with higher-grades of breast cancer, although no significant relation was found between this polymorphism and breast cancer risk. Finally, computational analysis revealed that the intronic mutation generated a new consensus-binding motif for the splicing factor, SC35, within intron 1. The current study results indicated that A > G polymorphism was associated with breast cancer development; in addition, in silico analysis with ESE finder prediction software showed that the change created a new SC35 binding site.

  12. Representation of DNA sequences in genetic codon context with applications in exon and intron prediction.

    PubMed

    Yin, Changchuan

    2015-04-01

    To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulation annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.

  13. The Mitochondrial Genome of the Prasinophyte Prasinoderma coloniale Reveals Two Trans-Spliced Group I Introns in the Large Subunit rRNA Gene

    PubMed Central

    Pombert, Jean-François; Otis, Christian; Turmel, Monique; Lemieux, Claude

    2013-01-01

    Organelle genes are often interrupted by group I and or group II introns. Splicing of these mobile genetic occurs at the RNA level via serial transesterification steps catalyzed by the introns'own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI), we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V). This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl) at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI). Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint In each Prasinoderma intron corresponds to the same region where the open reading frame (ORF) occurs when present in cis-spliced orthologs. This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the possible implications of this interesting observation for trans-splicing of group I introns. PMID:24386369

  14. Differential splicing of human androgen receptor pre-mRNA in X-linked reifenstein syndrome, because of a deletion involving a putative branch site

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ris-Stalpers, C.; Verleun-Mooijman, M.C.T.; Blaeij, T.J.P. de

    1994-04-01

    The analysis of the androgen receptor (AR) gene, mRNA, and protein in a subject with X-linked Reifenstein syndrome (partial androgen insensitivity) is reported. The presence of two mature AR transcripts in genital skin fibroblasts of the patient is established, and, by reverse transcriptase-PCR and RNase transcription analysis, the wild-type transcript and a transcript in which exon 3 sequences are absent without disruption of the translational reading frame are identified. Sequencing and hybridization analysis show a deletion of >6 kb in intron 2 of the human AR gene, starting 18 bp upstream of exon 3. The deletion includes the putative branch-pointmore » sequence (BPS) but not the acceptor splice site on the intron 2/exon 3 boundary. The deletion of the putative intron 2 BPS results in 90% inhibition of wild-type splicing. The mutant transcript encodes an AR protein lacking the second zinc finger of the DNA-binding domain. Western/immunoblotting analysis is used to show that the mutant AR protein is expressed in genital skin fibroblasts of the patient. The residual 10% wild-type transcript can be the result of the use of a cryptic BPS located 63 bp upstream of the intron 2/exon 3 boundary of the mutant AR gene. The mutated AR protein has no transcription-activating potential and does not influence the transactivating properties of the wild-type AR, as tested in cotransfection studies. It is concluded that the partial androgen-insensitivity syndrome of this patient is the consequence of the limited amount of wild-type AR protein expressed in androgen target cells, resulting from the deletion of the intron 2 putative BPS. 42 refs., 6 figs., 1 tab.« less

  15. The splicing of tiny introns of Paramecium is controlled by MAGO.

    PubMed

    Contreras, Julia; Begley, Victoria; Marsella, Laura; Villalobo, Eduardo

    2018-07-15

    The exon junction complex (EJC) is a key element of the splicing machinery. The EJC core is composed of eIF4A3, MAGO, Y14 and MLN51. Few accessory proteins, such as CWC22 or UPF3, bind transiently to the EJC. The EJC has been implicated in the control of the splicing of long introns. To ascertain whether the EJC controls the splicing of short introns, we used Paramecium tetraurelia as a model organism, since it has thousands of very tiny introns. To elucidate whether EJC affects intron splicing in P. tetraurelia, we searched for EJC protein-coding genes, and silenced those genes coding for eIF4A3, MAGO and CWC22. We found that P. tetraurelia likely assembles an active EJC with only three of the core proteins, since MLN51 is lacking. Silencing of eIF4A3 or CWC22 genes, but not that of MAGO, caused lethality. Silencing of the MAGO gene caused either an increase, decrease, or no change in intron retention levels of some intron-containing mRNAs used as reporters. We suggest that a fine-tuning expression of EJC genes is required for steady intron removal in P. tetraurelia. Taking into consideration our results and those published by others, we conclude that the EJC controls splicing independently of the intron size. Copyright © 2018 Elsevier B.V. All rights reserved.

  16. Inferring Invasion History of Red Swamp Crayfish (Procambarus clarkii) in China from Mitochondrial Control Region and Nuclear Intron Sequences

    PubMed Central

    Li, Yanhe; Guo, Xianwu; Chen, Liping; Bai, Xiaohui; Wei, Xinlan; Zhou, Xiaoyun; Huang, Songqian; Wang, Weimin

    2015-01-01

    Identifying the dispersal pathways of an invasive species is useful for adopting the appropriate strategies to prevent and control its spread. However, these processes are exceedingly complex. So, it is necessary to apply new technology and collect representative samples for analysis. This study used Approximate Bayesian Computation (ABC) in combination with traditional genetic tools to examine extensive sample data and historical records to infer the invasion history of the red swamp crayfish, Procambarus clarkii, in China. The sequences of the mitochondrial control region and the proPOx intron in the nuclear genome of samples from 37 sites (35 in China and one each in Japan and the USA) were analyzed. The results of combined scenarios testing and historical records revealed a much more complex invasion history in China than previously believed. P. clarkii was most likely originally introduced into China from Japan from an unsampled source, and the species then expanded its range primarily into the middle and lower reaches and, to a lesser extent, into the upper reaches of the Changjiang River in China. No transfer was observed from the upper reaches to the middle and lower reaches of the Changjiang River. Human-mediated jump dispersal was an important dispersal pathway for P. clarkii. The results provide a better understanding of the evolutionary scenarios involved in the rapid invasion of P. clarkii in China. PMID:26132567

  17. Localization, structure and polymorphism of two paralogous Xenopus laevis mitochondrial malate dehydrogenase genes.

    PubMed

    Tlapakova, Tereza; Krylov, Vladimir; Macha, Jaroslav

    2005-01-01

    Two paralogous mitochondrial malate dehydrogenase 2 (Mdh2) genes of Xenopus laevis have been cloned and sequenced, revealing 95% identity. Fluorescence in-situ hybridization (FISH) combined with tyramide amplification discriminates both genes; Mdh2a was localized into chromosome q3 and Mdh2b into chromosome q8. One kb cDNA probes detect both genes with 85% accuracy. The remaining signals were on the paralogous counterpart. Introns interrupt coding sequences at the same nucleotide as defined for mouse. Restriction polymorphism has been detected in the first intron of Mdh2a, while the individual variability in intron 6 of Mdh2b gene is represented by an insertion of incomplete retrotransposon L1Xl. Rates of nucleotide substitutions indicate that both genes are under similar evolutionary constraints. X. laevis Mdh2 genes can be used as markers for physical mapping and linkage analysis.

  18. Pseudoexon activation increases phenotype severity in a Becker muscular dystrophy patient.

    PubMed

    Greer, Kane; Mizzi, Kayla; Rice, Emily; Kuster, Lukas; Barrero, Roberto A; Bellgard, Matthew I; Lynch, Bryan J; Foley, Aileen Reghan; O Rathallaigh, Eoin; Wilton, Steve D; Fletcher, Sue

    2015-07-01

    We report a dystrophinopathy patient with an in-frame deletion of DMD exons 45-47, and therefore a genetic diagnosis of Becker muscular dystrophy, who presented with a more severe than expected phenotype. Analysis of the patient DMD mRNA revealed an 82 bp pseudoexon, derived from intron 44, that disrupts the reading frame and is expected to yield a nonfunctional dystrophin. Since the sequence of the pseudoexon and canonical splice sites does not differ from the reference sequence, we concluded that the genomic rearrangement promoted recognition of the pseudoexon, causing a severe dystrophic phenotype. We characterized the deletion breakpoints and identified motifs that might influence selection of the pseudoexon. We concluded that the donor splice site was strengthened by juxtaposition of intron 47, and loss of intron 44 silencer elements, normally located downstream of the pseudoexon donor splice site, further enhanced pseudoexon selection and inclusion in the DMD transcript in this patient.

  19. A deep intronic mutation in the SLC12A3 gene leads to Gitelman syndrome.

    PubMed

    Nozu, Kandai; Iijima, Kazumoto; Nozu, Yoshimi; Ikegami, Ei; Imai, Takehide; Fu, Xue Jun; Kaito, Hiroshi; Nakanishi, Koichi; Yoshikawa, Norishige; Matsuo, Masafumi

    2009-11-01

    Many mutations have been detected in the SLC12A3 gene of Gitelman syndrome (GS, OMIM 263800) patients. In previous studies, only one mutant allele was detected in approximately 20 to 41% of patients with GS; however, the exact reason for the nonidentification has not been established. In this study, we used RT-PCR using mRNA to investigate for the first time transcript abnormalities caused by deep intronic mutation. Direct sequencing analysis of leukocyte DNA identified one base insertion in exon 6 (c.818_819insG), but no mutation was detected in another allele. We analyzed RNA extracted from leukocytes and urine sediments and detected unknown sequence containing 238bp between exons 13 and 14. The genomic DNA analysis of intron 13 revealed a single-base substitution (c.1670-191C>T) that creates a new donor splice site within the intron resulting in the inclusion of a novel cryptic exon in mRNA. This is the first report of creation of a splice site by a deep intronic single-nucleotide change in GS and the first report to detect the onset mechanism in a patient with GS and missing mutation in one allele. This molecular onset mechanism may partly explain the poor success rate of mutation detection in both alleles of patients with GS.

  20. Conservation and diversification of Msx protein in metazoan evolution.

    PubMed

    Takahashi, Hirokazu; Kamiya, Akiko; Ishiguro, Akira; Suzuki, Atsushi C; Saitou, Naruya; Toyoda, Atsushi; Aruga, Jun

    2008-01-01

    Msx (/msh) family genes encode homeodomain (HD) proteins that control ontogeny in many animal species. We compared the structures of Msx genes from a wide range of Metazoa (Porifera, Cnidaria, Nematoda, Arthropoda, Tardigrada, Platyhelminthes, Mollusca, Brachiopoda, Annelida, Echiura, Echinodermata, Hemichordata, and Chordata) to gain an understanding of the role of these genes in phylogeny. Exon-intron boundary analysis suggested that the position of the intron located N-terminally to the HDs was widely conserved in all the genes examined, including those of cnidarians. Amino acid (aa) sequence comparison revealed 3 new evolutionarily conserved domains, as well as very strong conservation of the HDs. Two of the three domains were associated with Groucho-like protein binding in both a vertebrate and a cnidarian Msx homolog, suggesting that the interaction between Groucho-like proteins and Msx proteins was established in eumetazoan ancestors. Pairwise comparison among the collected HDs and their C-flanking aa sequences revealed that the degree of sequence conservation varied depending on the animal taxa from which the sequences were derived. Highly conserved Msx genes were identified in the Vertebrata, Cephalochordata, Hemichordata, Echinodermata, Mollusca, Brachiopoda, and Anthozoa. The wide distribution of the conserved sequences in the animal phylogenetic tree suggested that metazoan ancestors had already acquired a set of conserved domains of the current Msx family genes. Interestingly, although strongly conserved sequences were recovered from the Vertebrata, Cephalochordata, and Anthozoa, the sequences from the Urochordata and Hydrozoa showed weak conservation. Because the Vertebrata-Cephalochordata-Urochordata and Anthozoa-Hydrozoa represent sister groups in the Chordata and Cnidaria, respectively, Msx sequence diversification may have occurred differentially in the course of evolution. We speculate that selective loss of the conserved domains in Msx family proteins contributed to the diversification of animal body organization.

  1. An RNA electrophoretic mobility shift and mutational analysis of rnp-4f 5′-UTR intron splicing regulatory proteins in Drosophila reveals a novel new role for a dADAR protein isoform

    PubMed Central

    Lakshmi, G. Girija; Ghosh, Sushmita; Jones, Gabriel P.; Parikh, Roshni; Rawlins, Bridgette A.; Vaughn, Jack C.

    2014-01-01

    Alternative splicing greatly enhances the diversity of proteins encoded by eukaryotic genomes, and is also important in gene expression control. In contrast to the great depth of knowledge as to molecular mechanisms in the splicing pathway itself, relatively little is known about the regulatory events behind this process. The 5′-UTR and 3′-UTR in pre-mRNAs play a variety of roles in controlling eukaryotic gene expression, including translational modulation, and nearly 4,000 of the roughly 14,000 protein coding genes in Drosophila contain introns of unknown functional significance in their 5′-UTR. Here we report the results of an RNA electrophoretic mobility shift analysis of Drosophila rnp-4f 5′-UTR intron 0 splicing regulatory proteins. The pre-mRNA potential regulatory element consists of an evolutionarily-conserved 177-nt stem-loop arising from pairing of intron 0 with part of adjacent exon 2. Incubation of in vitro transcribed probe with embryo protein extract is shown to result in two shifted RNA-protein bands, and protein extract from a dADAR null mutant fly line results in only one shifted band. A mutated stem-loop in which the conserved exon 2 primary sequence is changed but secondary structure maintained by introducing compensatory base changes results in diminished band shifts. To test the hypothesis that dADAR plays a role in intron splicing regulation in vivo, levels of unspliced rnp-4f mRNA in dADAR mutant were compared to wild-type via real-time qRT-PCR. The results show that during embryogenesis unspliced rnp-4f mRNA levels fall by up to 85% in the mutant, in support of the hypothesis. Taken together, these results demonstrate a novel role for dADAR protein in rnp-4f 5′-UTR alternative intron splicing regulation which is consistent with a previously proposed model. PMID:23026215

  2. A novel germline PALB2 deletion in Polish breast and ovarian cancer patients.

    PubMed

    Dansonka-Mieszkowska, Agnieszka; Kluska, Anna; Moes, Joanna; Dabrowska, Michalina; Nowakowska, Dorota; Niwinska, Anna; Derlatka, Pawel; Cendrowski, Krzysztof; Kupryjanczyk, Jolanta

    2010-02-02

    PALB2 protein was recently identified as a partner of BRCA1 and BRCA2 which determines their proper function in DNA repair. Initially, the entire coding sequence of the PALB2 gene with exon/intron boundaries was evaluated by the PCR-SSCP and direct sequencing methods on 70 ovarian carcinomas. Sequence variants of interest were further studied on enlarged groups of ovarian carcinomas (total 339 non-consecutive ovarian carcinomas), blood samples from 334 consecutive sporadic and 648 consecutive familial breast cancer patients, and 1310 healthy controls from central Poland. Ten types of sequence variants were detected, and among them four novel polymorphisms: c.2996+58T>C in intron 9; c.505C>A (p.L169I), c.618T>G (p.L206L), both in exon 4; and c.2135C>T (A712V) in exon 5 of the PALB2 gene. Another two polymorphisms, c.212-58A>C and c.2014G>C (E672Q) were always detected together, both in cancer (7.5% of patients) and control samples (4.9% of controls, p = 0.2). A novel germline truncating mutation, c.509_510delGA (p.R170fs) was found in exon 4: in 2 of 339 (0.6%) unrelated ovarian cancer patients, in 4 of 648 (0.6%) unrelated familial breast cancer patients, and in 1 of 1310 controls (0.08%, p = 0.1, p = 0.044, respectively). One ovarian cancer patient with the PALB2 mutation had also a germline nonsense mutation of the BRCA2 gene. The c.509_510delGA is a novel PALB2 mutation that increases the risk of familial breast cancer. Occurrence of the same PALB2 alteration in seven unrelated women suggests that c.509_510delGA (p.R170fs) is a recurrent mutation for Polish population.

  3. Regulation of expression of transgenes in developing fish.

    PubMed

    Moav, B; Liu, Z; Caldovic, L D; Gross, M L; Faras, A J; Hackett, P B

    1993-05-01

    The transcriptional regulatory elements of the beta-actin gene of carp (Cyprinus carpio) have been examined in zebrafish and goldfish harbouring transgenes. The high sequence conservation of the putative regulatory elements in the beta-actin genes of animals suggested that their function would be conserved, so that transgenic constructs with the same transcriptional control elements would promote similar levels of transgene expression in different species of transgenic animals. To test this assumption, we analysed the temporal expression of a reporter gene under the control of transcriptional control sequences from the carp beta-actin gene in zebrafish (Brachydanio rerio) and goldfish (Carrasius auratus). Our results indicated that, contrary to expectations, combinations of different transcriptional control elements affected the level, duration, and onset of gene expression differently in developing zebrafish and goldfish. The major differences in expression of beta-actin/CAT (chloramphenicol acetyltransferase) constructs in zebrafish and goldfish were: (1) overall expression was almost 100-fold higher in goldfish than in zebrafish embryos, (2) the first intron had an enhancing effect on gene expression in zebrafish but not in goldfish, and (3) the serum-responsive/CArG-containing regulatory element in the proximal promoter was not always required for maximal CAT activity in goldfish, but was required in zebrafish. These results suggest that in the zebrafish, but not in the goldfish, there may be interactions between motifs in the proximal promoter and the first intron which appear to be required for maximal enhancement of transcription.

  4. Impaired Spermatogenesis, Muscle, and Erythrocyte Function in U12 Intron Splicing-Defective Zrsr1 Mutant Mice.

    PubMed

    Horiuchi, Keiko; Perez-Cerezales, Serafín; Papasaikas, Panagiotis; Ramos-Ibeas, Priscila; López-Cardona, Angela Patricia; Laguna-Barraza, Ricardo; Fonseca Balvís, Noelia; Pericuesta, Eva; Fernández-González, Raul; Planells, Benjamín; Viera, Alberto; Suja, Jose Angel; Ross, Pablo Juan; Alén, Francisco; Orio, Laura; Rodriguez de Fonseca, Fernando; Pintado, Belén; Valcárcel, Juan; Gutiérrez-Adán, Alfonso

    2018-04-03

    The U2AF35-like ZRSR1 has been implicated in the recognition of 3' splice site during spliceosome assembly, but ZRSR1 knockout mice do not show abnormal phenotypes. To analyze ZRSR1 function and its precise role in RNA splicing, we generated ZRSR1 mutant mice containing truncating mutations within its RNA-recognition motif. Homozygous mutant mice exhibited severe defects in erythrocytes, muscle stretch, and spermatogenesis, along with germ cell sloughing and apoptosis, ultimately leading to azoospermia and male sterility. Testis RNA sequencing (RNA-seq) analyses revealed increased intron retention of both U2- and U12-type introns, including U12-type intron events in genes with key functions in spermatogenesis and spermatid development. Affected U2 introns were commonly found flanking U12 introns, suggesting functional cross-talk between the two spliceosomes. The splicing and tissue defects observed in mutant mice attributed to ZRSR1 loss of function suggest a physiological role for this factor in U12 intron splicing. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.

  5. Cytochrome oxidase subunit II gene in mitochondria of Oenothera has no intron

    PubMed Central

    Hiesel, Rudolf; Brennicke, Axel

    1983-01-01

    The cytochrome oxidase subunit II gene has been localized in the mitochondrial genome of Oenothera berteriana and the nucleotide sequence has been determined. The coding sequence contains 777 bp and, unlike the corresponding gene in Zea mays, is not interrupted by an intron. No TGA codon is found within the open reading frame. The codon CGG, as in the maize gene, is used in place of tryptophan codons of corresponding genes in other organisms. At position 742 in the Oenothera sequence the TGG of maize is changed into a CGG codon, where Trp is conserved as the amino acid in other organisms. Homologous sequences occur more than once in the mitochondrial genome as several mitochondrial DNA species hybridize with DNA probes of the cytochrome oxidase subunit II gene. ImagesFig. 5. PMID:16453484

  6. SinEx DB: a database for single exon coding sequences in mammalian genomes.

    PubMed

    Jorquera, Roddy; Ortiz, Rodrigo; Ossandon, F; Cárdenas, Juan Pablo; Sepúlveda, Rene; González, Carolina; Holmes, David S

    2016-01-01

    Eukaryotic genes are typically interrupted by intragenic, noncoding sequences termed introns. However, some genes lack introns in their coding sequence (CDS) and are generally known as 'single exon genes' (SEGs). In this work, a SEG is defined as a nuclear, protein-coding gene that lacks introns in its CDS. Whereas, many public databases of Eukaryotic multi-exon genes are available, there are only two specialized databases for SEGs. The present work addresses the need for a more extensive and diverse database by creating SinEx DB, a publicly available, searchable database of predicted SEGs from 10 completely sequenced mammalian genomes including human. SinEx DB houses the DNA and protein sequence information of these SEGs and includes their functional predictions (KOG) and the relative distribution of these functions within species. The information is stored in a relational database built with My SQL Server 5.1.33 and the complete dataset of SEG sequences and their functional predictions are available for downloading. SinEx DB can be interrogated by: (i) a browsable phylogenetic schema, (ii) carrying out BLAST searches to the in-house SinEx DB of SEGs and (iii) via an advanced search mode in which the database can be searched by key words and any combination of searches by species and predicted functions. SinEx DB provides a rich source of information for advancing our understanding of the evolution and function of SEGs.Database URL: www.sinex.cl. © The Author(s) 2016. Published by Oxford University Press.

  7. The legumin gene family: structure of a B type gene of Vicia faba and a possible legumin gene specific regulatory element.

    PubMed Central

    Bäumlein, H; Wobus, U; Pustell, J; Kafatos, F C

    1986-01-01

    The field bean, Vicia faba L. var. minor, possesses two sub-families of 11 S legumin genes named A and B. We isolated from a genomic library a B-type gene (LeB4) and determined its primary DNA sequence. Gene LeB4 codes for a 484 amino acid residue prepropolypeptide, encompassing a signal peptide of 22 amino acid residues, an acidic, very hydrophilic alpha-chain of 281 residues and a basic, somewhat hydrophobic beta-chain of 181 residues. The latter two coding regions are immediately contiguous, but each is interrupted by a short intron. Type A legumin genes from soybean and pea are known to have introns in the same two positions, in addition to an extra intron (within the alpha-coding sequence). Sequence comparisons of legumin genes from these three plants revealed a highly conserved sequence element of at least 28 bp, centered at approximately 100 bp upstream of each cap site. The element is absent from the equivalent position of all non-legumin and other plant and fungal genes examined. We tentatively name this element "legumin box" and suggest that it may have a function in the regulation of legumin gene expression. PMID:3960730

  8. An RRM–ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion

    PubMed Central

    Collins, Katherine M.; Kainov, Yaroslav A.; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A.

    2017-01-01

    Abstract RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1–ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. PMID:28379442

  9. An RRM-ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion.

    PubMed

    Collins, Katherine M; Kainov, Yaroslav A; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A; Makeyev, Eugene V; Ramos, Andres

    2017-06-20

    RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1-ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. High-throughput sequencing of human plasma RNA by using thermostable group II intron reverse transcriptases

    PubMed Central

    Qin, Yidan; Yao, Jun; Wu, Douglas C.; Nottingham, Ryan M.; Mohr, Sabine; Hunicke-Smith, Scott; Lambowitz, Alan M.

    2016-01-01

    Next-generation RNA-sequencing (RNA-seq) has revolutionized transcriptome profiling, gene expression analysis, and RNA-based diagnostics. Here, we developed a new RNA-seq method that exploits thermostable group II intron reverse transcriptases (TGIRTs) and used it to profile human plasma RNAs. TGIRTs have higher thermostability, processivity, and fidelity than conventional reverse transcriptases, plus a novel template-switching activity that can efficiently attach RNA-seq adapters to target RNA sequences without RNA ligation. The new TGIRT-seq method enabled construction of RNA-seq libraries from <1 ng of plasma RNA in <5 h. TGIRT-seq of RNA in 1-mL plasma samples from a healthy individual revealed RNA fragments mapping to a diverse population of protein-coding gene and long ncRNAs, which are enriched in intron and antisense sequences, as well as nearly all known classes of small ncRNAs, some of which have never before been seen in plasma. Surprisingly, many of the small ncRNA species were present as full-length transcripts, suggesting that they are protected from plasma RNases in ribonucleoprotein (RNP) complexes and/or exosomes. This TGIRT-seq method is readily adaptable for profiling of whole-cell, exosomal, and miRNAs, and for related procedures, such as HITS-CLIP and ribosome profiling. PMID:26554030

  11. The complete chloroplast DNA sequences of the charophycean green algae Staurastrum and Zygnema reveal that the chloroplast genome underwent extensive changes during the evolution of the Zygnematales

    PubMed Central

    Turmel, Monique; Otis, Christian; Lemieux, Claude

    2005-01-01

    Background The Streptophyta comprise all land plants and six monophyletic groups of charophycean green algae. Phylogenetic analyses of four genes from three cellular compartments support the following branching order for these algal lineages: Mesostigmatales, Chlorokybales, Klebsormidiales, Zygnematales, Coleochaetales and Charales, with the last lineage being sister to land plants. Comparative analyses of the Mesostigma viride (Mesostigmatales) and land plant chloroplast genome sequences revealed that this genome experienced many gene losses, intron insertions and gene rearrangements during the evolution of charophyceans. On the other hand, the chloroplast genome of Chaetosphaeridium globosum (Coleochaetales) is highly similar to its land plant counterparts in terms of gene content, intron composition and gene order, indicating that most of the features characteristic of land plant chloroplast DNA (cpDNA) were acquired from charophycean green algae. To gain further insight into when the highly conservative pattern displayed by land plant cpDNAs originated in the Streptophyta, we have determined the cpDNA sequences of the distantly related zygnematalean algae Staurastrum punctulatum and Zygnema circumcarinatum. Results The 157,089 bp Staurastrum and 165,372 bp Zygnema cpDNAs encode 121 and 125 genes, respectively. Although both cpDNAs lack an rRNA-encoding inverted repeat (IR), they are substantially larger than Chaetosphaeridium and land plant cpDNAs. This increased size is explained by the expansion of intergenic spacers and introns. The Staurastrum and Zygnema genomes differ extensively from one another and from their streptophyte counterparts at the level of gene order, with the Staurastrum genome more closely resembling its land plant counterparts than does Zygnema cpDNA. Many intergenic regions in Zygnema cpDNA harbor tandem repeats. The introns in both Staurastrum (8 introns) and Zygnema (13 introns) cpDNAs represent subsets of those found in land plant cpDNAs. They represent 16 distinct insertion sites, only five of which are shared by the two zygnematalean genomes. Three of these insertions sites have not been identified in Chaetosphaeridium cpDNA. Conclusion The chloroplast genome experienced substantial changes in overall structure, gene order, and intron content during the evolution of the Zygnematales. Most of the features considered earlier as typical of land plant cpDNAs probably originated before the emergence of the Zygnematales and Coleochaetales. PMID:16236178

  12. Genome-wide mapping of alternative splicing in Arabidopsis thaliana

    PubMed Central

    Filichkin, Sergei A.; Priest, Henry D.; Givan, Scott A.; Shen, Rongkun; Bryant, Douglas W.; Fox, Samuel E.; Wong, Weng-Keen; Mockler, Todd C.

    2010-01-01

    Alternative splicing can enhance transcriptome plasticity and proteome diversity. In plants, alternative splicing can be manifested at different developmental stages, and is frequently associated with specific tissue types or environmental conditions such as abiotic stress. We mapped the Arabidopsis transcriptome at single-base resolution using the Illumina platform for ultrahigh-throughput RNA sequencing (RNA-seq). Deep transcriptome sequencing confirmed a majority of annotated introns and identified thousands of novel alternatively spliced mRNA isoforms. Our analysis suggests that at least ∼42% of intron-containing genes in Arabidopsis are alternatively spliced; this is significantly higher than previous estimates based on cDNA/expressed sequence tag sequencing. Random validation confirmed that novel splice isoforms empirically predicted by RNA-seq can be detected in vivo. Novel introns detected by RNA-seq were substantially enriched in nonconsensus terminal dinucleotide splice signals. Alternative isoforms with premature termination codons (PTCs) comprised the majority of alternatively spliced transcripts. Using an example of an essential circadian clock gene, we show that intron retention can generate relatively abundant PTC+ isoforms and that this specific event is highly conserved among diverse plant species. Alternatively spliced PTC+ isoforms can be potentially targeted for degradation by the nonsense mediated mRNA decay (NMD) surveillance machinery or regulate the level of functional transcripts by the mechanism of regulated unproductive splicing and translation (RUST). We demonstrate that the relative ratios of the PTC+ and reference isoforms for several key regulatory genes can be considerably shifted under abiotic stress treatments. Taken together, our results suggest that like in animals, NMD and RUST may be widespread in plants and may play important roles in regulating gene expression. PMID:19858364

  13. mRNA-based detection of rare CFTR mutations improves genetic diagnosis of cystic fibrosis in populations with high genetic heterogeneity.

    PubMed

    Felício, V; Ramalho, A S; Igreja, S; Amaral, M D

    2017-03-01

    Even with advent of next generation sequencing complete sequencing of large disease-associated genes and intronic regions is economically not feasible. This is the case of cystic fibrosis transmembrane conductance regulator (CFTR), the gene responsible for cystic fibrosis (CF). Yet, to confirm a CF diagnosis, proof of CFTR dysfunction needs to be obtained, namely by the identification of two disease-causing mutations. Moreover, with the advent of mutation-based therapies, genotyping is an essential tool for CF disease management. There is, however, still an unmet need to genotype CF patients by fast, comprehensive and cost-effective approaches, especially in populations with high genetic heterogeneity (and low p.F508del incidence), where CF is now emerging with new diagnosis dilemmas (Brazil, Asia, etc). Herein, we report an innovative mRNA-based approach to identify CFTR mutations in the complete coding and intronic regions. We applied this protocol to genotype individuals with a suspicion of CF and only one or no CFTR mutations identified by routine methods. It successfully detected multiple intronic mutations unlikely to be detected by CFTR exon sequencing. We conclude that this is a rapid, robust and inexpensive method to detect any CFTR coding/intronic mutation (including rare ones) that can be easily used either as primary approach or after routine DNA analysis. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  14. Ribosomal DNA sequence divergence and group I introns within the Leucostoma species L. cinctum, L. persoonii, and L. parapersoonii sp. nov., ascomycetes that cause Cytospora canker of fruit trees.

    PubMed

    Adams, Gerard C; Surve-Iyer, Rupa S; Iezzoni, Amy F

    2002-01-01

    Leucostoma species that are the causal agents of Cytospora canker of stone and pome fruit trees were studied in detail. DNA sequence of the internal transcribed spacer regions and the 5.8S of the nuclear ribosomal DNA operon (ITS rDNA) supplied sufficient characters to assess the phylogenetic relationships among species of Leucostoma, Valsa, Valsella, and related anamorphs in Cytospora. Parsimony analysis of the aligned sequence divided Cytospora isolates from fruit trees into clades that generally agreed with the morphological species concepts, and with some of the phenetic groupings (PG 1-6) identified previously by isozyme analysis and cultural characteristics. Phylogenetic analysis inferred that isolates of L. persoonii formed two well-resolved clades distinct from isolates of L. cinctum. Phylogenetic analysis of the ITS rDNA, isozyme analysis, and cultural characteristics supported the inference that L. persoonii groups PG 2 and PG 3 were populations of a new species apparently more genetically different from L. persoonii PG 1 than from isolates representative of L. massariana, L. niveum, L. translucens, and Valsella melastoma. The new species, L. parapersoonii, was described. A diverse collection of isolates of L. cinctum, L. persoonii, and L. parapersoonii were examined for genetic variation using restriction fragment length polymorphism (RFLP) analysis of the ITS rDNA and the five prime end of the large subunit of the rDNA (LSU rDNA). HinfI and HpaII endonucleases were each useful in dividing the Leucostoma isolates into RFLP profiles corresponding to the isozyme phenetic groups, PG 1-6. RFLP analysis was more effective than isozyme analysis in uncovering variation among isolates of L. persoonii PG 1, but less effective within L. cinctum populations. Isolates representative of seven of the L. persoonii formae speciales proposed by G. Défago in 1935 were found to be genetically diverse isolates of PG 1. Two large insertions, 415 and 309 nucleotides long, in the small subunit (SSU) of the nuclear rDNA of L. cinctum were identified as Group 1 introns; intron 1 at position 943 and intron 2 at position 1199. The two introns were found to be consistently present in isolates of L. cinctum PG 4 and PG 5 and absent from L. cinctum PG 6 isolates, despite the similarity of the ITS sequence and teleomorph morphology. Intron 1 was of subgroup 1C1 whereas intron 2 was of an unknown subgroup. RFLP patterns and presence/absence of introns were useful characters for expediting the identification of cultures of Leucostoma isolated from stone and pome fruit cankers. RFLP patterns from 13 endonucleases provided an effective method for selecting an array of diverse PG 1 isolates useful in screening plant germplasm for disease-resistance.

  15. The origin of introns and their role in eukaryogenesis: a compromise solution to the introns-early versus introns-late debate?

    PubMed Central

    Koonin, Eugene V

    2006-01-01

    Background Ever since the discovery of 'genes in pieces' and mRNA splicing in eukaryotes, origin and evolution of spliceosomal introns have been considered within the conceptual framework of the 'introns early' versus 'introns late' debate. The 'introns early' hypothesis, which is closely linked to the so-called exon theory of gene evolution, posits that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. Under this scenario, the absence of spliceosomal introns in prokaryotes is considered to be a result of "genome streamlining". The 'introns late' hypothesis counters that spliceosomal introns emerged only in eukaryotes, and moreover, have been inserted into protein-coding genes continuously throughout the evolution of eukaryotes. Beyond the formal dilemma, the more substantial side of this debate has to do with possible roles of introns in the evolution of eukaryotes. Results I argue that several lines of evidence now suggest a coherent solution to the introns-early versus introns-late debate, and the emerging picture of intron evolution integrates aspects of both views although, formally, there seems to be no support for the original version of introns-early. Firstly, there is growing evidence that spliceosomal introns evolved from group II self-splicing introns which are present, usually, in small numbers, in many bacteria, and probably, moved into the evolving eukaryotic genome from the α-proteobacterial progenitor of the mitochondria. Secondly, the concept of a primordial pool of 'virus-like' genetic elements implies that self-splicing introns are among the most ancient genetic entities. Thirdly, reconstructions of the ancestral state of eukaryotic genes suggest that the last common ancestor of extant eukaryotes had an intron-rich genome. Thus, it appears that ancestors of spliceosomal introns, indeed, have existed since the earliest stages of life's evolution, in a formal agreement with the introns-early scenario. However, there is no evidence that these ancient introns ever became widespread before the emergence of eukaryotes, hence, the central tenet of introns-early, the role of introns in early evolution of proteins, has no support. However, the demonstration that numerous introns invaded eukaryotic genes at the outset of eukaryotic evolution and that subsequent intron gain has been limited in many eukaryotic lineages implicates introns as an ancestral feature of eukaryotic genomes and refutes radical versions of introns-late. Perhaps, most importantly, I argue that the intron invasion triggered other pivotal events of eukaryogenesis, including the emergence of the spliceosome, the nucleus, the linear chromosomes, the telomerase, and the ubiquitin signaling system. This concept of eukaryogenesis, in a sense, revives some tenets of the exon hypothesis, by assigning to introns crucial roles in eukaryotic evolutionary innovation. Conclusion The scenario of the origin and evolution of introns that is best compatible with the results of comparative genomics and theoretical considerations goes as follows: self-splicing introns since the earliest stages of life's evolution – numerous spliceosomal introns invading genes of the emerging eukaryote during eukaryogenesis – subsequent lineage-specific loss and gain of introns. The intron invasion, probably, spawned by the mitochondrial endosymbiont, might have critically contributed to the emergence of the principal features of the eukaryotic cell. This scenario combines aspects of the introns-early and introns-late views. Reviewers this article was reviewed by W. Ford Doolittle, James Darnell (nominated by W. Ford Doolittle), William Martin, and Anthony Poole. PMID:16907971

  16. Changes in exon–intron structure during vertebrate evolution affect the splicing pattern of exons

    PubMed Central

    Gelfman, Sahar; Burstein, David; Penn, Osnat; Savchenko, Anna; Amit, Maayan; Schwartz, Schraga; Pupko, Tal; Ast, Gil

    2012-01-01

    Exon–intron architecture is one of the major features directing the splicing machinery to the short exons that are located within long flanking introns. However, the evolutionary dynamics of exon–intron architecture and its impact on splicing is largely unknown. Using a comparative genomic approach, we analyzed 17 vertebrate genomes and reconstructed the ancestral motifs of both 3′ and 5′ splice sites, as also the ancestral length of exons and introns. Our analyses suggest that vertebrate introns increased in length from the shortest ancestral introns to the longest primate introns. An evolutionary analysis of splice sites revealed that weak splice sites act as a restrictive force keeping introns short. In contrast, strong splice sites allow recognition of exons flanked by long introns. Reconstruction of the ancestral state suggests these phenomena were not prevalent in the vertebrate ancestor, but appeared during vertebrate evolution. By calculating evolutionary rate shifts in exons, we identified cis-acting regulatory sequences that became fixed during the transition from early vertebrates to mammals. Experimental validations performed on a selection of these hexamers confirmed their regulatory function. We additionally revealed many features of exons that can discriminate alternative from constitutive exons. These features were integrated into a machine-learning approach to predict whether an exon is alternative. Our algorithm obtains very high predictive power (AUC of 0.91), and using these predictions we have identified and successfully validated novel alternatively spliced exons. Overall, we provide novel insights regarding the evolutionary constraints acting upon exons and their recognition by the splicing machinery. PMID:21974994

  17. Mitochondrial Group II Introns, Cytochrome c Oxidase, and Senescence in Podospora anserina†

    PubMed Central

    Begel, Odile; Boulay, Jocelyne; Albert, Beatrice; Dufour, Eric; Sainsard-Chanet, Annie

    1999-01-01

    Podospora anserina is a filamentous fungus with a limited life span. It expresses a degenerative syndrome called senescence, which is always associated with the accumulation of circular molecules (senDNAs) containing specific regions of the mitochondrial chromosome. A mobile group II intron (α) has been thought to play a prominent role in this syndrome. Intron α is the first intron of the cytochrome c oxidase subunit I gene (COX1). Mitochondrial mutants that escape the senescence process are missing this intron, as well as the first exon of the COX1 gene. We describe here the first mutant of P. anserina that has the α sequence precisely deleted and whose cytochrome c oxidase activity is identical to that of wild-type cells. The integration site of the intron is slightly modified, and this change prevents efficient homing of intron α. We show here that this mutant displays a senescence syndrome similar to that of the wild type and that its life span is increased about twofold. The introduction of a related group II intron into the mitochondrial genome of the mutant does not restore the wild-type life span. These data clearly demonstrate that intron α is not the specific senescence factor but rather an accelerator or amplifier of the senescence process. They emphasize the role that intron α plays in the instability of the mitochondrial chromosome and the link between this instability and longevity. Our results strongly support the idea that in Podospora, “immortality” can be acquired not by the absence of intron α but rather by the lack of active cytochrome c oxidase. PMID:10330149

  18. Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae.

    PubMed

    Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

    2014-10-01

    Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3' terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species.

  19. Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae

    PubMed Central

    Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

    2014-01-01

    Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3′ terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species. PMID:24736785

  20. Parallel Loss of Plastid Introns and Their Maturase in the Genus Cuscuta

    PubMed Central

    McNeal, Joel R.; Kuehl, Jennifer V.; Boore, Jeffrey L.; Leebens-Mack, Jim; dePamphilis, Claude W.

    2009-01-01

    Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta. PMID:19543388

  1. Parallel loss of plastid introns and their maturase in the genus Cuscuta.

    PubMed

    McNeal, Joel R; Kuehl, Jennifer V; Boore, Jeffrey L; Leebens-Mack, Jim; dePamphilis, Claude W

    2009-06-19

    Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta.

  2. Expression of a polyubiquitin promoter isolated from Gladiolus.

    PubMed

    Joung, Young Hee; Kamo, Kathryn

    2006-10-01

    A polyubiquitin promoter (GUBQ1) including its 5'UTR and intron was isolated from the floral monocot Gladiolus because high levels of expression could not be obtained using publicly available promoters isolated from either cereals or dicots. Sequencing of the promoter revealed highly conserved 5' and 3' intron splicing sites for the 1.234 kb intron. The coding sequence of the first two ubiquitin genes showed the highest homology (87 and 86%, respectively) to the ubiquitin genes of Nicotiana tabacum and Oryza sativa RUBQ2. Transient expression following gene gun bombardment showed that relative levels of GUS activity with the GUBQ1 promoter were comparable to the CaMV 35S promoter in gladiolus, tobacco, rose, rice, and the floral monocot freesia. The highest levels of GUS expression with GUBQ1 were attained with Gladiolus. The full-length GUBQ1 promoter including 5'UTR and intron were necessary for maximum GUS expression in Gladiolus. The relative GUS activity for the promoter only was 9%, and the activity for the promoter with 5'UTR and 399 bp of the full-length 1.234 kb intron was 41%. Arabidopsis plants transformed with uidA under GUBQ1 showed moderate GUS expression throughout young leaves and in the vasculature of older leaves. The highest levels of transient GUS expression in Gladiolus have been achieved using the GUBQ1 promoter. This promoter should be useful for genetic engineering of disease resistance in Gladiolus, rose, and freesia, where high levels of gene expression are important.

  3. Limited MHC class I intron 2 repertoire variation in bonobos.

    PubMed

    de Groot, Natasja G; Heijmans, Corrine M C; Helsen, Philippe; Otting, Nel; Pereboom, Zjef; Stevens, Jeroen M G; Bontrop, Ronald E

    2017-10-01

    Common chimpanzees (Pan troglodytes) experienced a selective sweep, probably caused by a SIV-like virus, which targeted their MHC class I repertoire. Based on MHC class I intron 2 data analyses, this selective sweep took place about 2-3 million years ago. As a consequence, common chimpanzees have a skewed MHC class I repertoire that is enriched for allotypes that are able to recognise conserved regions of the SIV proteome. The bonobo (Pan paniscus) shared an ancestor with common chimpanzees approximately 1.5 to 2 million years ago. To investigate whether the signature of this selective sweep is also detectable in bonobos, the MHC class I gene repertoire of two bonobo panels comprising in total 29 animals was investigated by Sanger sequencing. We identified 14 Papa-A, 20 Papa-B and 11 Papa-C alleles, of which eight, five and eight alleles, respectively, have not been reported previously. Within this pool of MHC class I variation, we recovered only 2 Papa-A, 3 Papa-B and 6 Papa-C intron 2 sequences. As compared to humans, bonobos appear to have an even more diminished MHC class I intron 2 lineage repertoire than common chimpanzees. This supports the notion that the selective sweep may have predated the speciation of common chimpanzees and bonobos. The further reduction of the MHC class I intron 2 lineage repertoire observed in bonobos as compared to the common chimpanzee may be explained by a founding effect or other subsequent selective processes.

  4. A conserved intronic U1 snRNP-binding sequence promotes trans-splicing in Drosophila

    PubMed Central

    Gao, Jun-Li; Fan, Yu-Jie; Wang, Xiu-Ye; Zhang, Yu; Pu, Jia; Li, Liang; Shao, Wei; Zhan, Shuai; Hao, Jianjiang

    2015-01-01

    Unlike typical cis-splicing, trans-splicing joins exons from two separate transcripts to produce chimeric mRNA and has been detected in most eukaryotes. Trans-splicing in trypanosomes and nematodes has been characterized as a spliced leader RNA-facilitated reaction; in contrast, its mechanism in higher eukaryotes remains unclear. Here we investigate mod(mdg4), a classic trans-spliced gene in Drosophila, and report that two critical RNA sequences in the middle of the last 5′ intron, TSA and TSB, promote trans-splicing of mod(mdg4). In TSA, a 13-nucleotide (nt) core motif is conserved across Drosophila species and is essential and sufficient for trans-splicing, which binds U1 small nuclear RNP (snRNP) through strong base-pairing with U1 snRNA. In TSB, a conserved secondary structure acts as an enhancer. Deletions of TSA and TSB using the CRISPR/Cas9 system result in developmental defects in flies. Although it is not clear how the 5′ intron finds the 3′ introns, compensatory changes in U1 snRNA rescue trans-splicing of TSA mutants, demonstrating that U1 recruitment is critical to promote trans-splicing in vivo. Furthermore, TSA core-like motifs are found in many other trans-spliced Drosophila genes, including lola. These findings represent a novel mechanism of trans-splicing, in which RNA motifs in the 5′ intron are sufficient to bring separate transcripts into close proximity to promote trans-splicing. PMID:25838544

  5. Mitogenome rearrangement in the cold-water scleractinian coral Lophelia pertusa (Cnidaria, Anthozoa) involves a long-term evolving group I intron.

    PubMed

    Emblem, Åse; Karlsen, Bård Ove; Evertsen, Jussi; Johansen, Steinar D

    2011-11-01

    Group I introns are genetic insertion elements that invade host genomes in a wide range of organisms. In metazoans, however, group I introns are extremely rare, so far only identified within mitogenomes of hexacorals and some sponges. We sequenced the complete mitogenome of the cold-water scleractinian coral Lophelia pertusa, the dominating deep sea reef-building coral species in the North Atlantic Ocean. The mitogenome (16,150 bp) has the same gene content but organized in a unique gene order compared to that of other known scleractinian corals. A complex group I intron (6460 bp) inserted in the ND5 gene (position 717) was found to host seven essential mitochondrial protein genes and one ribosomal RNA gene. Phylogenetic analysis supports a vertical inheritance pattern of the ND5-717 intron among hexacoral mitogenomes with no examples of intron loss. Structural assessments of the Lophelia intron revealed an unusual organization that lacks the universally conserved ωG at the 3' end, as well as a highly compact RNA core structure with overlapping ribozyme and protein coding capacities. Based on phylogenetic and structural analyses we reconstructed the evolutionary history of ND5-717, from its ancestral protist origin, through intron loss in some early metazoan lineages, and into a compulsory feature with functional implications in hexacorals. Copyright © 2011 Elsevier Inc. All rights reserved.

  6. Purifying Selection on Exonic Splice Enhancers in Intronless Genes

    PubMed Central

    Savisaar, Rosina; Hurst, Laurence D.

    2016-01-01

    Exonic splice enhancers (ESEs) are short nucleotide motifs, enriched near exon ends, that enhance the recognition of the splice site and thus promote splicing. Are intronless genes under selection to avoid these motifs so as not to attract the splicing machinery to an mRNA that should not be spliced, thereby preventing the production of an aberrant transcript? Consistent with this possibility, we find that ESEs in putative recent retrocopies are at a higher density and evolving faster than those in other intronless genes, suggesting that they are being lost. Moreover, intronless genes are less dense in putative ESEs than intron-containing ones. However, this latter difference is likely due to the skewed base composition of intronless sequences, a skew that is in line with the general GC richness of few exon genes. Indeed, after controlling for such biases, we find that both intronless and intron-containing genes are denser in ESEs than expected by chance. Importantly, nucleotide-controlled analysis of evolutionary rates at synonymous sites in ESEs indicates that the ESEs in intronless genes are under purifying selection in both human and mouse. We conclude that on the loss of introns, some but not all, ESE motifs are lost, the remainder having functions beyond a role in splice promotion. These results have implications for the design of intronless transgenes and for understanding the causes of selection on synonymous sites. PMID:26802218

  7. Evolutionary Dynamics of the Gametologous CTNNB1 Gene on the Z and W Chromosomes of Snakes.

    PubMed

    Laopichienpong, Nararat; Muangmai, Narongrit; Chanhome, Lawan; Suntrarachun, Sunutcha; Twilprawat, Panupon; Peyachoknagul, Surin; Srikulnath, Kornsorn

    2017-03-01

    Snakes exhibit genotypic sex determination with female heterogamety (ZZ males and ZW females), and the state of sex chromosome differentiation also varies among lineages. To investigate the evolutionary history of homologous genes located in the nonrecombining region of differentiated sex chromosomes in snakes, partial sequences of the gametologous CTNNB1 gene were analyzed for 12 species belonging to henophid (Cylindrophiidae, Xenopeltidae, and Pythonidae) and caenophid snakes (Viperidae, Elapidae, and Colubridae). Nonsynonymous/synonymous substitution ratios (Ka/Ks) in coding sequences were low (Ka/Ks < 1) between CTNNB1Z and CTNNB1W, suggesting that these 2 genes may have similar functional properties. However, frequencies of intron sequence substitutions and insertion–deletions were higher in CTNNB1Z than CTNNB1W, suggesting that Z-linked sequences evolved faster than W-linked sequences. Molecular phylogeny based on both intron and exon sequences showed the presence of 2 major clades: 1) Z-linked sequences of Caenophidia and 2) W-linked sequences of Caenophidia clustered with Z-linked sequences of Henophidia, which suggests that the sequence divergence between CTNNB1Z and CTNNB1W in Caenophidia may have occurred by the cessation of recombination after the split from Henophidia.

  8. Identification of genes in anonymous DNA sequences. Annual performance report, February 1, 1991--January 31, 1992

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fields, C.A.

    1996-06-01

    The objective of this project is the development of practical software to automate the identification of genes in anonymous DNA sequences from the human, and other higher eukaryotic genomes. A software system for automated sequence analysis, gm (gene modeler) has been designed, implemented, tested, and distributed to several dozen laboratories worldwide. A significantly faster, more robust, and more flexible version of this software, gm 2.0 has now been completed, and is being tested by operational use to analyze human cosmid sequence data. A range of efforts to further understand the features of eukaryoyic gene sequences are also underway. This progressmore » report also contains papers coming out of the project including the following: gm: a Tool for Exploratory Analysis of DNA Sequence Data; The Human THE-LTR(O) and MstII Interspersed Repeats are subfamilies of a single widely distruted highly variable repeat family; Information contents and dinucleotide compostions of plant intron sequences vary with evolutionary origin; Splicing signals in Drosophila: intron size, information content, and consensus sequences; Integration of automated sequence analysis into mapping and sequencing projects; Software for the C. elegans genome project.« less

  9. PERMANENT GENETIC RESOURCES: Consensus primers of cyp73 genes discriminate willow species and hybrids (Salix, Salicaceae).

    PubMed

    Trung, Le Quang; VAN Puyvelde, Karolien; Triest, Ludwig

    2008-03-01

    Consensus primers, based on exon sequences of the cyp73 gene family coding for cinnamate 4-hydroxylase (C4H) of the lignin biosynthesis pathway, were designed for the tetraploid willow species Salix alba and Salix fragilis. Diagnostic alleles at species level were observed among introns of three cyp73 genes and allowed unambiguous detection of the first generation and introgressed hybrids in populations. Progeny analysis of a female S. alba with a male introgressed hybrid confirmed the codominant inheritance of each intron. Sequences of the diagnostic alleles of both species were similar to those found in the hybrids. © 2007 The Authors.

  10. The complete chloroplast genome sequence of Dianthus superbus var. longicalycinus.

    PubMed

    Gurusamy, Raman; Lee, Do-Hyung; Park, SeonJoo

    2016-05-01

    The complete chloroplast genome (cpDNA) sequence of Dianthus superbus var. longicalycinus is an economically important traditional Chinese medicine was reported and characterized. The cpDNA of Dianthus superbus var. longicalycinus is 149,539 bp, with 36.3% GC content. A pair of inverted repeats (IRs) of 24,803 bp is separated by a large single-copy region (LSC, 82,805 bp) and a small single-copy region (SSC, 17,128 bp). It encodes 85 protein-coding genes, 36 tRNA genes and 8 rRNA genes. Of 129 individual genes, 13 genes encoded one intron and three genes have two introns.

  11. Fractal landscapes in biological systems: long-range correlations in DNA and interbeat heart intervals

    NASA Technical Reports Server (NTRS)

    Stanley, H. E.; Buldyrev, S. V.; Goldberger, A. L.; Hausdorff, J. M.; Havlin, S.; Mietus, J.; Sciortino, F.; Simons, M.

    1992-01-01

    Here we discuss recent advances in applying ideas of fractals and disordered systems to two topics of biological interest, both topics having common the appearance of scale-free phenomena, i.e., correlations that have no characteristic length scale, typically exhibited by physical systems near a critical point and dynamical systems far from equilibrium. (i) DNA nucleotide sequences have traditionally been analyzed using models which incorporate the possibility of short-range nucleotide correlations. We found, instead, a remarkably long-range power law correlation. We found such long-range correlations in intron-containing genes and in non-transcribed regulatory DNA sequences as well as intragenomic DNA, but not in cDNA sequences or intron-less genes. We also found that the myosin heavy chain family gene evolution increases the fractal complexity of the DNA landscapes, consistent with the intron-late hypothesis of gene evolution. (ii) The healthy heartbeat is traditionally thought to be regulated according to the classical principle of homeostasis, whereby physiologic systems operate to reduce variability and achieve an equilibrium-like state. We found, however, that under normal conditions, beat-to-beat fluctuations in heart rate display long-range power law correlations.

  12. The complete chloroplast genome sequence of Euonymus japonicus (Celastraceae).

    PubMed

    Choi, Kyoung Su; Park, SeonJoo

    2016-09-01

    The complete chloroplast (cp) genome sequence of the Euonymus japonicus, the first sequenced of the genus Euonymus, was reported in this study. The total length was 157 637 bp, containing a pair of 26 678 bp inverted repeat region (IR), which were separated by small single copy (SSC) region and large single copy (LSC) region of 18 340 bp and 85 941 bp, respectively. This genome contains 107 unique genes, including 74 coding genes, four rRNA genes, and 29 tRNA genes. Seventeen genes contain intron of E. japonicus, of which three genes (clpP, ycf3, and rps12) include two introns. The maximum likelihood (ML) phylogenetic analysis revealed that E. japonicus was closely related to Manihot and Populus.

  13. Factor IX[sub Madrid 2]: A deletion/insertion in Facotr IX gene which abolishes the sequence of the donor junction at the exon IV-intron d splice site

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Solera, J.; Magallon, M.; Martin-Villar, J.

    1992-02-01

    DNA from a patient with severe hemophilia B was evaluated by RFLP analysis, producing results which suggested the existence of a partial deletion within the factor IX gene. The deletion was further localized and characterized by PCR amplification and sequencing. The altered allele has a 4,442-bp deletion which removes both the donor splice site located at the 5[prime] end of intron d and the two last coding nucleotides located at the 3[prime] end of exon IV in the normal factor IX gene; this fragment has been inserted in inverted orientation. Two homologous sequences have been discovered at the ends ofmore » the deleted DNA fragment.« less

  14. Cloning of a CACTA transposon-like insertion in intron I of tomato invertase Lin5 gene and identification of transposase-like sequences of Solanaceae species.

    PubMed

    Proels, Reinhard K; Roitsch, Thomas

    2006-03-01

    Very few CACTA transposon-like sequences have been described in Solanaceae species. Sequence information has been restricted to partial transposase (TPase)-like fragments, and no target gene of CACTA-like transposon insertion has been described in tomato to date. In this manuscript, we report on a CACTA transposon-like insertion in intron I of tomato (Lycopersicon esculentum) invertase gene Lin5 and TPase-like sequences of several Solanaceae species. Consensus primers deduced from the TPase region of the tomato CACTA transposon-like element allowed the amplification of similar sequences from various Solanaceae species of different subfamilies including Solaneae (Solanum tuberosum), Cestreae (Nicotiana tabacum) and Datureae (Datura stramonium). This demonstrates the ubiquitous presence of CACTA-like elements in Solanaceae genomes. The obtained partial sequences are highly conserved, and allow further detection and detailed analysis of CACTA-like transposons throughout Solanaceae species. CACTA-like transposon sequences make possible the evaluation of their use for genome analysis, functional studies of genes and the evolutionary relationships between plant species.

  15. Molecular evolution of the CPP-like gene family in plants: insights from comparative genomics of Arabidopsis and rice.

    PubMed

    Yang, Zefeng; Gu, Shiliang; Wang, Xuefeng; Li, Wenjuan; Tang, Zaixiang; Xu, Chenwu

    2008-09-01

    CPP-like genes are members of a small family which features the existence of two similar Cys-rich domains termed CXC domains in their protein products and are distributed widely in plants and animals but do not exist in yeast. The members of this family in plants play an important role in development of reproductive tissue and control of cell division. To gain insights into how CPP-like genes evolved in plants, we conducted a comparative phylogenetic and molecular evolutionary analysis of the CPP-like gene family in Arabidopsis and rice. The results of phylogeny revealed that both gene loss and species-specific expansion contributed to the evolution of this family in Arabidopsis and rice. Both intron gain and intron loss were observed through intron/exon structure analysis for duplicated genes. Our results also suggested that positive selection was a major force during the evolution of CPP-like genes in plants, and most amino acid residues under positive selection were disproportionately located in the region outside the CXC domains. Further analysis revealed that two CXC domains and sequences connecting them might have coevolved during the long evolutionary period.

  16. A novel intronic mutation in the DDP1 gene in a family with X-linked dystonia-deafness syndrome.

    PubMed

    Ezquerra, Mario; Campdelacreu, Jaume; Muñoz, Esteban; Tolosa, Eduardo; Martí, María J

    2005-02-01

    X-linked dystonia-deafness syndrome (Mohr-Tranebjaerg syndrome) is a rare neurodegenerative disease characterized by hearing loss and dystonia. So far, 7 mutations in the coding region of the DDP1 gene have been described. They consist of frameshift, nonsense, missense mutations or deletions. To investigate the presence of mutations in the DDP1 gene in a family with dystonia-deafness syndrome. Seven members belonging to 2 generations of a family with 2 affected subjects underwent genetic analysis. Mutational screening in the DDP1 gene was made through DNA direct sequencing. We found an intronic mutation in the DDP1 gene. It consists of an A-to-C substitution in the position -23 in reference to the first nucleotide of exon 2 (IVS1-23A>C). The mutation was present in 2 affected men and their respective unaffected mothers, whereas it was absent in the healthy men from this family and in 90 healthy controls. Intronic mutations in the DDP1 gene can also cause X-linked dystonia-deafness syndrome. In our case, the effect of the mutation could be due to a splicing alteration.

  17. The presence of the NOS3 gene polymorphism for intron 4 mitigates the beneficial effects of exercise training on ambulatory blood pressure monitoring in adults.

    PubMed

    Sponton, Carlos H; Esposti, Rodrigo; Rodovalho, Cynara M; Ferreira, Maycon J; Jarrete, Aline P; Anaruma, Chadi P; Bacci, Mauricio; Zanesco, Angelina

    2014-06-15

    The number of studies that have evaluated exercise training (ET) and nitric oxide synthase (NOS)3 gene polymorphisms is scarce. The present study was designed to evaluate the relationship between exercise training and NOS3 polymorphisms at -786T>C, 894G>T, and intron 4b/a on blood pressure (BP) using 24-h ambulatory BP monitoring (ABPM), nitrate/nitrite levels (NOx), and redox state. Eighty-six volunteers (51 ± 0.6 yr old) were genotyped into nonpolymorphic and polymorphic groups for each of the three positions of NOS3 polymorphisms. Auscultatory BP, ABPM, SOD activity, catalase activity, NOx levels, and malondialdehyde levels were measured. DNA was extracted from leukocytes, and PCR followed by sequencing was applied for genotype analysis. Aerobic ET consisted of 24 sessions for 3 days/wk for 40 min at moderate intensity. This study was performed in a double-blind and crossover format. ET was effective in lowering office BP (systolic BP: 3.2% and diastolic BP: 3%) as well as ABPM (systolic BP: 2% and diastolic BP: 1.3%). Increased SOD and catalase activity (42.6% and 15.1%, respectively) were also observed. The NOS3 polymorphism for intron 4 mitigated the beneficial effect of ET for systolic BP (nonpolymorphic group: -3.0% and polymorphic group: -0.6%) and diastolic BP (nonpolymorphic group: -3.2% and polymorphic group: -0.5%), but it was not associated with NOx level and redox state. Paradoxical responses were found for positions T786-C and G894T for the NOS3 gene. Consistently, the presence of the polymorphism for intron 4 blunted the beneficial effects of ET in middle-aged adults. Possibly, this effect might be as consequence of intron 4 acting as a short intronic repeat RNA controlling endothelial NOS activity epigenetically. Copyright © 2014 the American Physiological Society.

  18. Conservation and Sex-Specific Splicing of the transformer Gene in the Calliphorids Cochliomyia hominivorax, Cochliomyia macellaria and Lucilia sericata

    PubMed Central

    Li, Fang; Vensko, Steven P.; Belikoff, Esther J.; Scott, Maxwell J.

    2013-01-01

    Transformer (TRA) promotes female development in several dipteran species including the Australian sheep blowfly Lucilia cuprina, the Mediterranean fruit fly, housefly and Drosophila melanogaster. tra transcripts are sex-specifically spliced such that only the female form encodes full length functional protein. The presence of six predicted TRA/TRA2 binding sites in the sex-specific female intron of the L. cuprina gene suggested that tra splicing is auto-regulated as in medfly and housefly. With the aim of identifying conserved motifs that may play a role in tra sex-specific splicing, here we have isolated and characterized the tra gene from three additional blowfly species, L. sericata, Cochliomyia hominivorax and C. macellaria. The blowfly adult male and female transcripts differ in the choice of splice donor site in the first intron, with males using a site downstream of the site used in females. The tra genes all contain a single TRA/TRA2 site in the male exon and a cluster of four to five sites in the male intron. However, overall the sex-specific intron sequences are poorly conserved in closely related blowflies. The most conserved regions are around the exon/intron junctions, the 3′ end of the intron and near the cluster of TRA/TRA2 sites. We propose a model for sex specific regulation of tra splicing that incorporates the conserved features identified in this study. In L. sericata embryos, the male tra transcript was first detected at around the time of cellular blastoderm formation. RNAi experiments showed that tra is required for female development in L. sericata and C. macellaria. The isolation of the tra gene from the New World screwworm fly C. hominivorax, a major livestock pest, will facilitate the development of a “male-only” strain for genetic control programs. PMID:23409170

  19. A mutation in yeast mitochondrial DNA results in a precise excision of the terminal intron of the cytochrome b gene.

    PubMed

    Hill, J; McGraw, P; Tzagoloff, A

    1985-03-25

    The yeast nuclear gene CBP2 was previously proposed to code for a protein necessary for processing of the terminal intron in the cytochrome b pre-mRNA (McGraw, P., and Tzagoloff, A. (1983) J. Biol. Chem. 258, 9459-9468). In the present study we describe a mitochondrial mutation capable of suppressing the respiratory deficiency of cbp2 mutants. The mitochondrial suppressor mutation has been shown to be the result of a precise excision of the last intervening sequence from the cytochrome b gene. Strains with the altered mitochondrial DNA have normal levels of mature cytochrome b mRNA and of cytochrome b and exhibit wild type growth on glycerol. These results confirm that CBP2 codes for a protein specifically required for splicing of the cytochrome b intron and further suggest that absence of the intervening sequence does not noticeably affect the expression of respiratory function in mitochondria.

  20. Intriguing Balancing Selection on the Intron 5 Region of LMBR1 in Human Population

    PubMed Central

    He, Fang; Wu, Dong-Dong; Kong, Qing-Peng; Zhang, Ya-Ping

    2008-01-01

    Background The intron 5 of gene LMBR1 is the cis-acting regulatory module for the sonic hedgehog (SHH) gene. Mutation in this non-coding region is associated with preaxial polydactyly, and may play crucial roles in the evolution of limb and skeletal system. Methodology/Principal Findings We sequenced a region of the LMBR1 gene intron 5 in East Asian human population, and found a significant deviation of Tajima's D statistics from neutrality taking human population growth into account. Data from HapMap also demonstrated extended linkage disequilibrium in the region in East Asian and European population, and significantly low degree of genetic differentiation among human populations. Conclusion/Significance We proposed that the intron 5 of LMBR1 was presumably subject to balancing selection during the evolution of modern human. PMID:18698406

  1. Sexing the Sciuridae: a simple and accurate set of molecular methods to determine sex in tree squirrels, ground squirrels and marmots.

    PubMed

    Gorrell, Jamieson C; Boutin, Stan; Raveh, Shirley; Neuhaus, Peter; Côté, Steeve D; Coltman, David W

    2012-09-01

    We determined the sequence of the male-specific minor histocompatibility complex antigen (Smcy) from the Y chromosome of seven squirrel species (Sciuridae, Rodentia). Based on conserved regions inside the Smcy intron sequence, we designed PCR primers for sex determination in these species that can be co-amplified with nuclear loci as controls. PCR co-amplification yields two products for males and one for females that are easily visualized as bands by agarose gel electrophoresis. Our method provides simple and reliable sex determination across a wide range of squirrel species. © 2012 Blackwell Publishing Ltd.

  2. The nuclear OXPHOS genes in insecta: a common evolutionary origin, a common cis-regulatory motif, a common destiny for gene duplicates

    PubMed Central

    Porcelli, Damiano; Barsanti, Paolo; Pesole, Graziano; Caggese, Corrado

    2007-01-01

    Background When orthologous sequences from species distributed throughout an optimal range of divergence times are available, comparative genomics is a powerful tool to address problems such as the identification of the forces that shape gene structure during evolution, although the functional constraints involved may vary in different genes and lineages. Results We identified and annotated in the MitoComp2 dataset the orthologs of 68 nuclear genes controlling oxidative phosphorylation in 11 Drosophilidae species and in five non-Drosophilidae insects, and compared them with each other and with their counterparts in three vertebrates (Fugu rubripes, Danio rerio and Homo sapiens) and in the cnidarian Nematostella vectensis, taking into account conservation of gene structure and regulatory motifs, and preservation of gene paralogs in the genome. Comparative analysis indicates that the ancestral insect OXPHOS genes were intron rich and that extensive intron loss and lineage-specific intron gain occurred during evolution. Comparison with vertebrates and cnidarians also shows that many OXPHOS gene introns predate the cnidarian/Bilateria evolutionary split. The nuclear respiratory gene element (NRG) has played a key role in the evolution of the insect OXPHOS genes; it is constantly conserved in the OXPHOS orthologs of all the insect species examined, while their duplicates either completely lack the element or possess only relics of the motif. Conclusion Our observations reinforce the notion that the common ancestor of most animal phyla had intron-rich gene, and suggest that changes in the pattern of expression of the gene facilitate the fixation of duplications in the genome and the development of novel genetic functions. PMID:18315839

  3. Multi-species comparative analysis of the equine ACE gene identifies a highly conserved potential transcription factor binding site in intron 16.

    PubMed

    Hamilton, Natasha A; Tammen, Imke; Raadsma, Herman W

    2013-01-01

    Angiotensin converting enzyme (ACE) is essential for control of blood pressure. The human ACE gene contains an intronic Alu indel (I/D) polymorphism that has been associated with variation in serum enzyme levels, although the functional mechanism has not been identified. The polymorphism has also been associated with cardiovascular disease, type II diabetes, renal disease and elite athleticism. We have characterized the ACE gene in horses of breeds selected for differing physical abilities. The equine gene has a similar structure to that of all known mammalian ACE genes. Nine common single nucleotide polymorphisms (SNPs) discovered in pooled DNA were found to be inherited in nine haplotypes. Three of these SNPs were located in intron 16, homologous to that containing the Alu polymorphism in the human. A highly conserved 18 bp sequence, also within that intron, was identified as being a potential binding site for the transcription factors Oct-1, HFH-1 and HNF-3β, and lies within a larger area of higher than normal homology. This putative regulatory element may contribute to regulation of the documented inter-individual variation in human circulating enzyme levels, for which a functional mechanism is yet to be defined. Two equine SNPs occurred within the conserved area in intron 16, although neither of them disrupted the putative binding site. We propose a possible regulatory mechanism of the ACE gene in mammalian species which was previously unknown. This advance will allow further analysis leading to a better understanding of the mechanisms underpinning the associations seen between the human Alu polymorphism and enzyme levels, cardiovascular disease states and elite athleticism.

  4. Multi-Species Comparative Analysis of the Equine ACE Gene Identifies a Highly Conserved Potential Transcription Factor Binding Site in Intron 16

    PubMed Central

    Hamilton, Natasha A.; Tammen, Imke; Raadsma, Herman W.

    2013-01-01

    Angiotensin converting enzyme (ACE) is essential for control of blood pressure. The human ACE gene contains an intronic Alu indel (I/D) polymorphism that has been associated with variation in serum enzyme levels, although the functional mechanism has not been identified. The polymorphism has also been associated with cardiovascular disease, type II diabetes, renal disease and elite athleticism. We have characterized the ACE gene in horses of breeds selected for differing physical abilities. The equine gene has a similar structure to that of all known mammalian ACE genes. Nine common single nucleotide polymorphisms (SNPs) discovered in pooled DNA were found to be inherited in nine haplotypes. Three of these SNPs were located in intron 16, homologous to that containing the Alu polymorphism in the human. A highly conserved 18 bp sequence, also within that intron, was identified as being a potential binding site for the transcription factors Oct-1, HFH-1 and HNF-3β, and lies within a larger area of higher than normal homology. This putative regulatory element may contribute to regulation of the documented inter-individual variation in human circulating enzyme levels, for which a functional mechanism is yet to be defined. Two equine SNPs occurred within the conserved area in intron 16, although neither of them disrupted the putative binding site. We propose a possible regulatory mechanism of the ACE gene in mammalian species which was previously unknown. This advance will allow further analysis leading to a better understanding of the mechanisms underpinning the associations seen between the human Alu polymorphism and enzyme levels, cardiovascular disease states and elite athleticism. PMID:23408978

  5. Rare intronic variants of TCF7L2 arising by selective sweeps in an indigenous population from Mexico.

    PubMed

    Acosta, Jose Luis; Hernández-Mondragón, Alma Cristal; Correa-Acosta, Laura Carolina; Cazañas-Padilla, Sandra Nathaly; Chávez-Florencio, Berenice; Ramírez-Vega, Elvia Yamilet; Monge-Cázares, Tulia; Aguilar-Salinas, Carlos A; Tusié-Luna, Teresa; Del Bosque-Plata, Laura

    2016-05-26

    Genetic variations of the TCF7L2 gene are associated with the development of Type 2 diabetes (T2D). The associated mutations have demonstrated an adaptive role in some human populations, but no studies have determined the impact of evolutionary forces on genetic diversity in indigenous populations from Mexico. Here, we sequenced and analyzed the variation of the TCF7L2 gene in three Amerindian populations and compared the results with whole-exon-sequencing of Mestizo populations from Sigma and the 1000 Genomes Project to assess the roles of selection and recombination in diversity. The diversity in the indigenous populations was biased to intronic regions. Most of the variation was low frequency. Only mutations rs77961654 and rs61724286 were located on exon 15. We did not observe variation in intronic region 4-6 in any of the three indigenous populations. In addition, we identified peaks of selective sweeps in the mestizo samples from the Sigma Project within this region. By replicating the analysis of association with T2D between case-controls from the Sigma Project, we determined that T2D was most highly associated with the rs7903146 risk allele and to a lesser extent with the other six variants. All associated markers were located in intronic region 4-6, and their r(2) values of linkage disequilibrium were significantly higher in the Mexican population than in Africans from the 1000 Genomes Project. We observed reticulations in both the haplotypes network analysis from seven marker associates and the neighborNet tree based on 6061 markers in the TCF7L2 gene identified from all samples of the 1000 Genomes Project. Finally, we identified two recombination hotspots in the upstream region and 3' end of the TCF7L2 gene. The lack of diversity in intronic region 4-6 in Indigenous populations could be an effect of selective sweeps generated by the selection of neighboring rare variants at T2D-associated mutations. The survivors' variants make the intronic region 4-6 the area of the greatest population differentiation within the TCF7L2 gene. The abundance of selective peak sweeps in the downstream region of the TCF7L2 gene suggests that the TCF7L2 gene is part of a region that is in constant recombination between populations.

  6. Evolutionary conservation and regulation of particular alternative splicing events in plant SR proteins

    PubMed Central

    Kalyna, Maria; Lopato, Sergiy; Voronin, Viktor; Barta, Andrea

    2006-01-01

    Alternative splicing is an important mechanism for fine tuning of gene expression at the post-transcriptional level. SR proteins govern splice site selection and spliceosome assembly. The Arabidopsis genome encodes 19 SR proteins, several of which have no orthologues in metazoan. Three of the plant specific subfamilies are characterized by the presence of a relatively long alternatively spliced intron located in their first RNA recognition motif, which potentially results in an extremely truncated protein. In atRSZ33, a member of the RS2Z subfamily, this alternative splicing event was shown to be autoregulated. Here we show that atRSp31, a member of the RS subfamily, does not autoregulate alternative splicing of its similarily positioned intron. Interestingly, this alternative splicing event is regulated by atRSZ33. We demonstrate that the positions of these long introns and their capability for alternative splicing are conserved from green algae to flowering plants. Moreover, in particular alternative splicing events the splicing signals are embedded into highly conserved sequences. In different taxa, these conserved sequences occur in at least one gene within a subfamily. The evolutionary preservation of alternative splice forms together with highly conserved intron features argues for additional functions hidden in the genes of these plant-specific SR proteins. PMID:16936312

  7. Gene replacements and insertions in rice by intron targeting using CRISPR-Cas9.

    PubMed

    Li, Jun; Meng, Xiangbing; Zong, Yuan; Chen, Kunling; Zhang, Huawei; Liu, Jinxing; Li, Jiayang; Gao, Caixia

    2016-09-12

    Sequence-specific nucleases have been exploited to create targeted gene knockouts in various plants(1), but replacing a fragment and even obtaining gene insertions at specific loci in plant genomes remain a serious challenge. Here, we report efficient intron-mediated site-specific gene replacement and insertion approaches that generate mutations using the non-homologous end joining (NHEJ) pathway using the clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated protein 9 (Cas9) system. Using a pair of single guide RNAs (sgRNAs) targeting adjacent introns and a donor DNA template including the same pair of sgRNA sites, we achieved gene replacements in the rice endogenous gene 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) at a frequency of 2.0%. We also obtained targeted gene insertions at a frequency of 2.2% using a sgRNA targeting one intron and a donor DNA template including the same sgRNA site. Rice plants harbouring the OsEPSPS gene with the intended substitutions were glyphosate-resistant. Furthermore, the site-specific gene replacements and insertions were faithfully transmitted to the next generation. These newly developed approaches can be generally used to replace targeted gene fragments and to insert exogenous DNA sequences into specific genomic sites in rice and other plants.

  8. A study of the relationships of cultivated peanut (Arachis hypogaea) and its most closely related wild species using intron sequences and microsatellite markers

    PubMed Central

    Moretzsohn, Márcio C.; Gouvea, Ediene G.; Inglis, Peter W.; Leal-Bertioli, Soraya C. M.; Valls, José F. M.; Bertioli, David J.

    2013-01-01

    Background and Aims The genus Arachis contains 80 described species. Section Arachis is of particular interest because it includes cultivated peanut, an allotetraploid, and closely related wild species, most of which are diploids. This study aimed to analyse the genetic relationships of multiple accessions of section Arachis species using two complementary methods. Microsatellites allowed the analysis of inter- and intraspecific variability. Intron sequences from single-copy genes allowed phylogenetic analysis including the separation of the allotetraploid genome components. Methods Intron sequences and microsatellite markers were used to reconstruct phylogenetic relationships in section Arachis through maximum parsimony and genetic distance analyses. Key Results Although high intraspecific variability was evident, there was good support for most species. However, some problems were revealed, notably a probable polyphyletic origin for A. kuhlmannii. The validity of the genome groups was well supported. The F, K and D genomes grouped close to the A genome group. The 2n = 18 species grouped closer to the B genome group. The phylogenetic tree based on the intron data strongly indicated that A. duranensis and A. ipaënsis are the ancestors of A. hypogaea and A. monticola. Intron nucleotide substitutions allowed the ages of divergences of the main genome groups to be estimated at a relatively recent 2·3–2·9 million years ago. This age and the number of species described indicate a much higher speciation rate for section Arachis than for legumes in general. Conclusions The analyses revealed relationships between the species and genome groups and showed a generally high level of intraspecific genetic diversity. The improved knowledge of species relationships should facilitate the utilization of wild species for peanut improvement. The estimates of speciation rates in section Arachis are high, but not unprecedented. We suggest these high rates may be linked to the peculiar reproductive biology of Arachis. PMID:23131301

  9. Extremely hypomorphic and severe deep intronic variants in the ABCA4 locus result in varying Stargardt disease phenotypes.

    PubMed

    Zernant, Jana; Lee, Winston; Nagasaki, Takayuki; Collison, Frederick T; Fishman, Gerald A; Bertelsen, Mette; Rosenberg, Thomas; Gouras, Peter; Tsang, Stephen H; Allikmets, Rando

    2018-05-30

    Autosomal recessive Stargardt disease (STGD1, MIM 248200) is caused by mutations in the ABCA4 gene. Complete sequencing of the ABCA4 locus in STGD1 patients identifies two expected disease-causing alleles in ~75% of patients and only one mutation in ~15% of patients. Recently, many possibly pathogenic variants in deep intronic sequences of ABCA4 have been identified in the latter group. We extended our analyses of deep intronic ABCA4 variants and determined that one of these, c.4253+43G>A (rs61754045), is present in 29/1155 (2.6%) of STGD1 patients. The variant is found at statistically significantly higher frequency in patients with only one pathogenic ABCA4 allele, 23/160 (14.38%), MAF=0.072, compared to MAF=0.013 in all STGD1 cases and MAF=0.006 in the matching general population (P<1x10-7). The variant, which is not predicted to have any effect on splicing, is the first reported intronic "extremely hypomorphic allele" in the ABCA4 locus; i.e., it is pathogenic only when in trans with a loss-of-function ABCA4 allele. It results in a distinct clinical phenotype characterized by late-onset of symptoms and foveal sparing. In ~70% of cases the variant was allelic with the c.6006-609T>A (rs575968112) variant, which was deemed non-pathogenic. Another rare deep intronic variant, c.5196+1056A>G (rs886044749), found in 5/834 (0.6%) of STGD1 cases is, conversely, a severe allele. This study determines pathogenicity for three non-coding variants in STGD1 patients of European descent accounting for ~3% of the disease. Defining disease-associated alleles in the non-coding sequences of the ABCA4 locus can be accomplished by integrated clinical and genetic analyses. Cold Spring Harbor Laboratory Press.

  10. Loss of a Trans-Splicing nad1 Intron from Geraniaceae and Transfer of the Maturase Gene matR to the Nucleus in Pelargonium

    PubMed Central

    Grewe, Felix; Zhu, Andan; Mower, Jeffrey P.

    2016-01-01

    The mitochondrial nad1 gene of seed plants has a complex structure, including four introns in cis or trans configurations and a maturase gene (matR) hosted within the final intron. In the geranium family (Geraniaceae), however, sequencing of representative species revealed that three of the four introns, including one in a trans configuration and another that hosts matR, were lost from the nad1 gene in their common ancestor. Despite the loss of the host intron, matR has been retained as a freestanding gene in most genera of the family, indicating that this maturase has additional functions beyond the splicing of its host intron. In the common ancestor of Pelargonium, matR was transferred to the nuclear genome, where it was split into two unlinked genes that encode either its reverse transcriptase or maturase domain. Both nuclear genes are transcribed and contain predicted mitochondrial targeting signals, suggesting that they express functional proteins that are imported into mitochondria. The nuclear localization and split domain structure of matR in the Pelargonium nuclear genome offers a unique opportunity to assess the function of these two domains using transgenic approaches. PMID:27664178

  11. Nonsynonymous substitution in abalone sperm fertilization genes exceeds substitution in introns and mitochondrial DNA

    PubMed Central

    Metz, Edward C.; Robles-Sikisaka, Refugio; Vacquier, Victor D.

    1998-01-01

    Strong positive Darwinian selection acts on two sperm fertilization proteins, lysin and 18-kDa protein, from abalone (Haliotis). To understand the phylogenetic context for this dramatic molecular evolution, we obtained sequences of mitochondrial cytochrome c oxidase subunit I (mtCOI), and genomic sequences of lysin, 18-kDa, and a G protein subunit. Based on mtDNA differentiation, four north Pacific abalone species diverged within the past 2 million years (Myr), and remaining north Pacific species diverged over a period of 4–20 Myr. Between-species nonsynonymous differences in lysin and 18-kDa exons exceed nucleotide differences in introns by 3.5- to 24-fold. Remarkably, in some comparisons nonsynonymous substitutions in lysin and 18-kDa genes exceed synonymous substitutions in mtCOI. Lysin and 18-kDa intron/exon segments were sequenced from multiple red abalone individuals collected over a 1,200-km range. Only two nucleotide changes and two sites of slippage variation were detected in a total of >29,000 nucleotides surveyed. However, polymorphism in mtCOI and a G protein intron was found in this species. This finding suggests that positive selection swept one lysin allele and one 18-kDa allele to fixation. Similarities between mtCOI and lysin gene trees indicate that rapid adaptive evolution of lysin has occurred consistently through the history of the group. Comparisons with mtCOI molecular clock calibrations suggest that nonsynonymous substitutions accumulate 2–50 times faster in lysin and 18-kDa genes than in rapidly evolving mammalian genes. PMID:9724763

  12. A base substitution in the donor site of intron 12 of KIT gene is responsible for the dominant white coat colour of blue fox (Alopex lagopus).

    PubMed

    Yan, S Q; Hou, J N; Bai, C Y; Jiang, Y; Zhang, X J; Ren, H L; Sun, B X; Zhao, Z H; Sun, J H

    2014-04-01

    The dominant white coat colour of farmed blue fox is inherited as a monogenic autosomal dominant trait and is suggested to be embryonic lethal in the homozygous state. In this study, the transcripts of KIT were identified by RT-PCR for a dominant white fox and a normal blue fox. Sequence analysis showed that the KIT transcript in normal blue fox contained the full-length coding sequence of 2919 bp (GenBank Acc. No KF530833), but in the dominant white individual, a truncated isoform lacking the entire exon 12 specifically co-expressed with the normal transcript. Genomic DNA sequencing revealed that a single nucleotide polymorphism (c.1867+1G>T) in intron 12 appeared only in the dominant white individuals and a 1-bp ins/del polymorphism in the same intron showed in individuals representing two different coat colours. Genotyping results of the SNP with PCR-RFLP in 185 individuals showed all 90 normal blue foxes were homozygous for the G allele, and all dominant white individuals were heterozygous. Due to the truncated protein with a deletion of 35 amino acids and an amino acid replacement (p.Pro623Ala) located in the conserved ATP binding domain, we propose that the mutant receptor had absent tyrosine kinase activity. These findings reveal that the base substitution at the first nucleotide of intron 12 of KIT gene, resulting in skipping of exon 12, is a causative mutation responsible for the dominant white phenotype of blue fox. © 2013 Stichting International Foundation for Animal Genetics.

  13. Plastid and mitochondrion genomic sequences from Arctic Chlorella sp. ArM0029B.

    PubMed

    Jeong, Haeyoung; Lim, Jong-Min; Park, Jihye; Sim, Young Mi; Choi, Han-Gu; Lee, Jungho; Jeong, Won-Joong

    2014-04-16

    Chorella is the representative taxon of Chlorellales in Trebouxiophyceae, and its chloroplast (cp) genomic information has been thought to depend only on studies concerning Chlorella vulgaris and GenBank information of C. variablis. Mitochondrial (mt) genomic information regarding Chlorella is currently unavailable. To elucidate the evolution of organelle genomes and genetic information of Chlorella, we have sequenced and characterized the cp and mt genomes of Arctic Chlorella sp. ArM0029B. The 119,989-bp cp genome lacking inverted repeats and 65,049-bp mt genome were sequenced. The ArM0029B cp genome contains 114 conserved genes, including 32 tRNA genes, 3 rRNA genes, and 79 genes encoding proteins. Chlorella cp genomes are highly rearranged except for a Chlorella-specific six-gene cluster, and the ArM0029B plastid resembles that of Chlorella variabilis except for a 15-kb gene cluster inversion. In the mt genome, 62 conserved genes, including 27 tRNA genes, 3 rRNA genes, and 32 genes encoding proteins were determined. The mt genome of ArM0029B is similar to that of the non-photosynthetic species Prototheca and Heicosporidium. The ArM0029B mt genome contains a group I intron, with an ORF containing two LAGLIDADG motifs, in cox1. The intronic ORF is shared by C. vulgaris and Prototheca. The phylogeny of the plastid genome reveals that ArM0029B showed a close relationship of Chlorella to Parachlorella and Oocystis within Chlorellales. The distribution of the cox1 intron at 721 support membership in the order Chlorellales. Mitochondrial phylogenomic analyses, however, indicated that ArM0029B shows a greater affinity to MX-AZ01 and Coccomyxa than to the Helicosporidium-Prototheca clade, although the detailed phylogenetic relationships among the three taxa remain to be resolved. The plastid genome of ArM0029B is similar to that of C. variabilis. The mt sequence of ArM0029B is the first genome to be reported for Chlorella. Chloroplast genome phylogeny supports monophyly of the seven investigated members of Chlorellales. The presence of the cox1 intron at 721 in all four investigated Chlorellales taxa indicates that the cox1 intron had been introduced in early Chorellales as a cis-splice form and that the cis-splicing intron was inherited to recent Chlorellales and was recently trans-spliced in Helicosporidium.

  14. Plastid and mitochondrion genomic sequences from Arctic Chlorella sp. ArM0029B

    PubMed Central

    2014-01-01

    Background Chorella is the representative taxon of Chlorellales in Trebouxiophyceae, and its chloroplast (cp) genomic information has been thought to depend only on studies concerning Chlorella vulgaris and GenBank information of C. variablis. Mitochondrial (mt) genomic information regarding Chlorella is currently unavailable. To elucidate the evolution of organelle genomes and genetic information of Chlorella, we have sequenced and characterized the cp and mt genomes of Arctic Chlorella sp. ArM0029B. Results The 119,989-bp cp genome lacking inverted repeats and 65,049-bp mt genome were sequenced. The ArM0029B cp genome contains 114 conserved genes, including 32 tRNA genes, 3 rRNA genes, and 79 genes encoding proteins. Chlorella cp genomes are highly rearranged except for a Chlorella-specific six-gene cluster, and the ArM0029B plastid resembles that of Chlorella variabilis except for a 15-kb gene cluster inversion. In the mt genome, 62 conserved genes, including 27 tRNA genes, 3 rRNA genes, and 32 genes encoding proteins were determined. The mt genome of ArM0029B is similar to that of the non-photosynthetic species Prototheca and Heicosporidium. The ArM0029B mt genome contains a group I intron, with an ORF containing two LAGLIDADG motifs, in cox1. The intronic ORF is shared by C. vulgaris and Prototheca. The phylogeny of the plastid genome reveals that ArM0029B showed a close relationship of Chlorella to Parachlorella and Oocystis within Chlorellales. The distribution of the cox1 intron at 721 support membership in the order Chlorellales. Mitochondrial phylogenomic analyses, however, indicated that ArM0029B shows a greater affinity to MX-AZ01 and Coccomyxa than to the Helicosporidium-Prototheca clade, although the detailed phylogenetic relationships among the three taxa remain to be resolved. Conclusions The plastid genome of ArM0029B is similar to that of C. variabilis. The mt sequence of ArM0029B is the first genome to be reported for Chlorella. Chloroplast genome phylogeny supports monophyly of the seven investigated members of Chlorellales. The presence of the cox1 intron at 721 in all four investigated Chlorellales taxa indicates that the cox1 intron had been introduced in early Chorellales as a cis-splice form and that the cis-splicing intron was inherited to recent Chlorellales and was recently trans-spliced in Helicosporidium. PMID:24735464

  15. Euglena Transcript Processing.

    PubMed

    McWatters, David C; Russell, Anthony G

    2017-01-01

    RNA transcript processing is an important stage in the gene expression pathway of all organisms and is subject to various mechanisms of control that influence the final levels of gene products. RNA processing involves events such as nuclease-mediated cleavage, removal of intervening sequences referred to as introns and modifications to RNA structure (nucleoside modification and editing). In Euglena, RNA transcript processing was initially examined in chloroplasts because of historical interest in the secondary endosymbiotic origin of this organelle in this organism. More recent efforts to examine mitochondrial genome structure and RNA maturation have been stimulated by the discovery of unusual processing pathways in other Euglenozoans such as kinetoplastids and diplonemids. Eukaryotes containing large genomes are now known to typically contain large collections of introns and regulatory RNAs involved in RNA processing events, and Euglena gracilis in particular has a relatively large genome for a protist. Studies examining the structure of nuclear genes and the mechanisms involved in nuclear RNA processing have revealed that indeed Euglena contains large numbers of introns in the limited set of genes so far examined and also possesses large numbers of specific classes of regulatory and processing RNAs, such as small nucleolar RNAs (snoRNAs). Most interestingly, these studies have also revealed that Euglena possesses novel processing pathways generating highly fragmented cytosolic ribosomal RNAs and subunits and non-conventional intron classes removed by unknown splicing mechanisms. This unexpected diversity in RNA processing pathways emphasizes the importance of identifying the components involved in these processing mechanisms and their evolutionary emergence in Euglena species.

  16. Bipolar localization of the group II intron Ll.LtrB is maintained in Escherichia coli deficient in nucleoid condensation, chromosome partitioning and DNA replication.

    PubMed

    Beauregard, Arthur; Chalamcharla, Venkata R; Piazza, Carol Lyn; Belfort, Marlene; Coros, Colin J

    2006-11-01

    Group II introns are mobile genetic elements that invade their cognate intron-minus alleles via an RNA intermediate, in a process known as retrohoming. They can also retrotranspose to ectopic sites at low frequency. In Escherichia coli, retrotransposition of the lactococcal group II intron, Ll.LtrB, occurs preferentially within the Ori and Ter macrodomains of the E. coli chromosome. These macrodomains migrate towards the poles of the cell, where the intron-encoded protein, LtrA, localizes. Here we investigate whether alteration of nucleoid condensation, chromosome partitioning and replication affect retrotransposition frequencies, as well as bipolar localization of the Ll.LtrB intron integration and LtrA distribution in E. coli. We thus examined these properties in the absence of the nucleoid-associated proteins H-NS, StpA and MukB, in variants of partitioning functions including the centromere-like sequence migS and the actin homologue MreB, as well as in the replication mutants DeltaoriC, seqA, tus and topoIV (ts). Although there were some dramatic fluctuations in retrotransposition levels in these hosts, bipolar localization of integration events was maintained. LtrA was consistently found in nucleoid-free regions, with its localization to the cellular poles being largely preserved in these hosts. Together, these results suggest that bipolar localization of group II intron retrotransposition results from the residence of the intron-encoded protein at the poles of the cell.

  17. The structure of the coding and 5'-flanking region of the type 1 iodothyronine deiodinase (dio1) gene is normal in a patient with suspected congenital dio1 deficiency.

    PubMed

    Toyoda, N; Kleinhaus, N; Larsen, P R

    1996-06-01

    We analyzed the exon-intron structure of the human type 1 deiodinase gene (dio1) and compared it with that of a patient with suspected congenital type 1 deiodinase (D1) deficiency. The hdio1 gene is identical in exon-intron arrangement to the mouse gene, with coding sequences and a selenocysteine insertion sequence (SECIS) element contained in four exons. There were no mutations in the sequences of exons 1-4 of the patient's genomic DNA. Functional studies by transient expression techniques showed no difference in basal promoter activity or T3 responsiveness between the patient's and the normal dio1 gene. A structural abnormality in the dio1 gene is not a likely explanation for this patient's D1-deficient phenotype.

  18. The complete chloroplast genome sequence of American bird pepper (Capsicum annuum var. glabriusculum).

    PubMed

    Zeng, Fan-chun; Gao, Cheng-wen; Gao, Li-zhi

    2016-01-01

    The complete chloroplast genome sequence of American bird pepper (Capsicum annuum var. glabriusculum) is reported and characterized in this study. The genome size is 156,612 bp, containing a pair of inverted repeats (IRs) of 25,776 bp separated by a large single-copy region of 87,213 bp and a small single-copy region of 17,851 bp. The chloroplast genome harbors 130 known genes, including 89 protein-coding genes, 8 ribosomal RNA genes, and 37 tRNA genes. A total of 18 of these genes are duplicated in the inverted repeat regions, 16 genes contain 1 intron, and 2 genes and one ycf have 2 introns.

  19. Chloroplast Genome Evolution in Early Diverged Leptosporangiate Ferns

    PubMed Central

    Kim, Hyoung Tae; Chung, Myong Gi; Kim, Ki-Joong

    2014-01-01

    In this study, the chloroplast (cp) genome sequences from three early diverged leptosporangiate ferns were completed and analyzed in order to understand the evolution of the genome of the fern lineages. The complete cp genome sequence of Osmunda cinnamomea (Osmundales) was 142,812 base pairs (bp). The cp genome structure was similar to that of eusporangiate ferns. The gene/intron losses that frequently occurred in the cp genome of leptosporangiate ferns were not found in the cp genome of O. cinnamomea. In addition, putative RNA editing sites in the cp genome were rare in O. cinnamomea, even though the sites were frequently predicted to be present in leptosporangiate ferns. The complete cp genome sequence of Diplopterygium glaucum (Gleicheniales) was 151,007 bp and has a 9.7 kb inversion between the trnL-CAA and trnV-GCA genes when compared to O. cinnamomea. Several repeated sequences were detected around the inversion break points. The complete cp genome sequence of Lygodium japonicum (Schizaeales) was 157,142 bp and a deletion of the rpoC1 intron was detected. This intron loss was shared by all of the studied species of the genus Lygodium. The GC contents and the effective numbers of co-dons (ENCs) in ferns varied significantly when compared to seed plants. The ENC values of the early diverged leptosporangiate ferns showed intermediate levels between eusporangiate and core leptosporangiate ferns. However, our phylogenetic tree based on all of the cp gene sequences clearly indicated that the cp genome similarity between O. cinnamomea (Osmundales) and eusporangiate ferns are symplesiomorphies, rather than synapomorphies. Therefore, our data is in agreement with the view that Osmundales is a distinct early diverged lineage in the leptosporangiate ferns. PMID:24823358

  20. Chloroplast genome evolution in early diverged leptosporangiate ferns.

    PubMed

    Kim, Hyoung Tae; Chung, Myong Gi; Kim, Ki-Joong

    2014-05-01

    In this study, the chloroplast (cp) genome sequences from three early diverged leptosporangiate ferns were completed and analyzed in order to understand the evolution of the genome of the fern lineages. The complete cp genome sequence of Osmunda cinnamomea (Osmundales) was 142,812 base pairs (bp). The cp genome structure was similar to that of eusporangiate ferns. The gene/intron losses that frequently occurred in the cp genome of leptosporangiate ferns were not found in the cp genome of O. cinnamomea. In addition, putative RNA editing sites in the cp genome were rare in O. cinnamomea, even though the sites were frequently predicted to be present in leptosporangiate ferns. The complete cp genome sequence of Diplopterygium glaucum (Gleicheniales) was 151,007 bp and has a 9.7 kb inversion between the trnL-CAA and trnVGCA genes when compared to O. cinnamomea. Several repeated sequences were detected around the inversion break points. The complete cp genome sequence of Lygodium japonicum (Schizaeales) was 157,142 bp and a deletion of the rpoC1 intron was detected. This intron loss was shared by all of the studied species of the genus Lygodium. The GC contents and the effective numbers of codons (ENCs) in ferns varied significantly when compared to seed plants. The ENC values of the early diverged leptosporangiate ferns showed intermediate levels between eusporangiate and core leptosporangiate ferns. However, our phylogenetic tree based on all of the cp gene sequences clearly indicated that the cp genome similarity between O. cinnamomea (Osmundales) and eusporangiate ferns are symplesiomorphies, rather than synapomorphies. Therefore, our data is in agreement with the view that Osmundales is a distinct early diverged lineage in the leptosporangiate ferns.

  1. Large Diversity of Nonstandard Genes and Dynamic Evolution of Chloroplast Genomes in Siphonous Green Algae (Bryopsidales, Chlorophyta)

    PubMed Central

    Leliaert, Frederik; Marcelino, Vanessa R

    2018-01-01

    Abstract Chloroplast genomes have undergone tremendous alterations through the evolutionary history of the green algae (Chloroplastida). This study focuses on the evolution of chloroplast genomes in the siphonous green algae (order Bryopsidales). We present five new chloroplast genomes, which along with existing sequences, yield a data set representing all but one families of the order. Using comparative phylogenetic methods, we investigated the evolutionary dynamics of genomic features in the order. Our results show extensive variation in chloroplast genome architecture and intron content. Variation in genome size is accounted for by the amount of intergenic space and freestanding open reading frames that do not show significant homology to standard plastid genes. We show the diversity of these nonstandard genes based on their conserved protein domains, which are often associated with mobile functions (reverse transcriptase/intron maturase, integrases, phage- or plasmid-DNA primases, transposases, integrases, ligases). Investigation of the introns showed proliferation of group II introns in the early evolution of the order and their subsequent loss in the core Halimedineae, possibly through RT-mediated intron loss. PMID:29635329

  2. Allelic association of sequence variants in the herpes virus entry mediator-B gene (PVRL2) with the severity of multiple sclerosis.

    PubMed

    Schmidt, S; Pericak-Vance, M A; Sawcer, S; Barcellos, L F; Hart, J; Sims, J; Prokop, A M; van der Walt, J; DeLoa, C; Lincoln, R R; Oksenberg, J R; Compston, A; Hauser, S L; Haines, J L; Gregory, S G

    2006-07-01

    Discrepant findings have been reported regarding an association of the apolipoprotein E (APOE) gene with the clinical course of multiple sclerosis (MS). To resolve these discrepancies, we examined common sequence variation in six candidate genes residing in a 380-kb genomic region surrounding and including the APOE locus for an association with MS severity. We genotyped at least three polymorphisms in each of six candidate genes in 1,540 Caucasian MS families (729 single-case and multiple-case families from the United States, 811 single-case families from the UK). By applying the quantitative transmission/disequilibrium test to a recently proposed MS severity score, the only statistically significant (P=0.003) association with MS severity was found for an intronic variant in the Herpes Virus Entry Mediator-B Gene PVRL2. Additional genotyping extended the association to a 16.6 kb block spanning intron 1 to intron 2 of the gene. Sequencing of PVRL2 failed to identify variants with an obvious functional role. In conclusion, the analysis of a very large data set suggests that genetic polymorphisms in PVRL2 may influence MS severity and supports the possibility that viral factors may contribute to the clinical course of MS, consistent with previous reports.

  3. Cloning and Genomic Organization of a Rhamnogalacturonase Gene from Locally Isolated Strain of Aspergillus niger.

    PubMed

    Damak, Naourez; Abdeljalil, Salma; Taeib, Noomen Hadj; Gargouri, Ali

    2015-08-01

    The rhg gene encoding a rhamnogalacturonase was isolated from the novel strain A1 of Aspergillus niger. It consists of an ORF of 1.505 kb encoding a putative protein of 446 amino acids with a predicted molecular mass of 47 kDa, belonging to the family 28 of glycosyl hydrolases. The nature and position of amino acids comprising the active site as well as the three-dimensional structure were well conserved between the A. niger CTM10548 and fungal rhamnogalacturonases. The coding region of the rhg gene is interrupted by three short introns of 56 (introns 1 and 3) and 52 (intron 2) bp in length. The comparison of the peptide sequence with A. niger rhg sequences revealed that the A1 rhg should be an endo-rhamnogalacturonases, more homologous to rhg A than rhg B A. niger known enzymes. The comparison of rhg nucleotide sequence from A. niger A1 with rhg A from A. niger shows several base changes. Most of these changes (59 %) are located at the third base of codons suggesting maintaining the same enzyme function. We used the rhamnogalacturonase A from Aspergillus aculeatus as a template to build a structural model of rhg A1 that adopted a right-handed parallel β-helix.

  4. Widespread alternative and aberrant splicing revealed by lariat sequencing

    PubMed Central

    Stepankiw, Nicholas; Raghavan, Madhura; Fogarty, Elizabeth A.; Grimson, Andrew; Pleiss, Jeffrey A.

    2015-01-01

    Alternative splicing is an important and ancient feature of eukaryotic gene structure, the existence of which has likely facilitated eukaryotic proteome expansions. Here, we have used intron lariat sequencing to generate a comprehensive profile of splicing events in Schizosaccharomyces pombe, amongst the simplest organisms that possess mammalian-like splice site degeneracy. We reveal an unprecedented level of alternative splicing, including alternative splice site selection for over half of all annotated introns, hundreds of novel exon-skipping events, and thousands of novel introns. Moreover, the frequency of these events is far higher than previous estimates, with alternative splice sites on average activated at ∼3% the rate of canonical sites. Although a subset of alternative sites are conserved in related species, implying functional potential, the majority are not detectably conserved. Interestingly, the rate of aberrant splicing is inversely related to expression level, with lowly expressed genes more prone to erroneous splicing. Although we validate many events with RNAseq, the proportion of alternative splicing discovered with lariat sequencing is far greater, a difference we attribute to preferential decay of aberrantly spliced transcripts. Together, these data suggest the spliceosome possesses far lower fidelity than previously appreciated, highlighting the potential contributions of alternative splicing in generating novel gene structures. PMID:26261211

  5. Sequence analysis of three mitochondrial DNA molecules reveals interesting differences among Saccharomyces yeasts

    PubMed Central

    Langkjær, R. B.; Casaregola, S.; Ussery, D. W.; Gaillardin, C.; Piškur, J.

    2003-01-01

    The complete sequences of mitochondrial DNA (mtDNA) from the two budding yeasts Saccharomyces castellii and Saccharomyces servazzii, consisting of 25 753 and 30 782 bp, respectively, were analysed and compared to Saccharomyces cerevisiae mtDNA. While some of the traits are very similar among Saccharomyces yeasts, others have highly diverged. The two mtDNAs are much more compact than that of S.cerevisiae and contain fewer introns and intergenic sequences, although they have almost the same coding potential. A few genes contain group I introns, but group II introns, otherwise found in S.cerevisiae mtDNA, are not present. Surprisingly, four genes (ATP6, COX2, COX3 and COB) in the mtDNA of S.servazzii contain, in total, five +1 frameshifts. mtDNAs of S.castellii, S.servazzii and S.cerevisiae contain all genes on the same strand, except for one tRNA gene. On the other hand, the gene order is very different. Several gene rearrangements have taken place upon separation of the Saccharomyces lineages, and even a part of the transcription units have not been preserved. It seems that the mechanism(s) involved in the generation of the rearrangements has had to ensure that all genes stayed encoded by the same DNA strand. PMID:12799436

  6. Deep sequencing with intronic capture enables identification of an APC exon 10 inversion in a patient with polyposis.

    PubMed

    Shirts, Brian H; Salipante, Stephen J; Casadei, Silvia; Ryan, Shawnia; Martin, Judith; Jacobson, Angela; Vlaskin, Tatyana; Koehler, Karen; Livingston, Robert J; King, Mary-Claire; Walsh, Tom; Pritchard, Colin C

    2014-10-01

    Single-exon inversions have rarely been described in clinical syndromes and are challenging to detect using Sanger sequencing. We report the case of a 40-year-old woman with adenomatous colon polyps too numerous to count and who had a complex inversion spanning the entire exon 10 in APC (the gene encoding for adenomatous polyposis coli), causing exon skipping and resulting in a frameshift and premature protein truncation. In this study, we employed complete APC gene sequencing using high-coverage next-generation sequencing by ColoSeq, analysis with BreakDancer and SLOPE software, and confirmatory transcript analysis. ColoSeq identified a complex small genomic rearrangement consisting of an inversion that results in translational skipping of exon 10 in the APC gene. This mutation would not have been detected by traditional sequencing or gene-dosage methods. We report a case of adenomatous polyposis resulting from a complex single-exon inversion. Our report highlights the benefits of large-scale sequencing methods that capture intronic sequences with high enough depth of coverage-as well as the use of informatics tools-to enable detection of small pathogenic structural rearrangements.

  7. Comparative genomic survey, exon-intron annotation and phylogenetic analysis of NAT-homologous sequences in archaea, protists, fungi, viruses, and invertebrates

    USDA-ARS?s Scientific Manuscript database

    We have previously published extensive genomic surveys [1-3], reporting NAT-homologous sequences in hundreds of sequenced bacterial, fungal and vertebrate genomes. We present here the results of our latest search of 2445 genomes, representing 1532 (70 archaeal, 1210 bacterial, 43 protist, 97 fungal,...

  8. Structural features of diverse Pin-II proteinase inhibitor genes from Capsicum annuum.

    PubMed

    Mahajan, Neha S; Dewangan, Veena; Lomate, Purushottam R; Joshi, Rakesh S; Mishra, Manasi; Gupta, Vidya S; Giri, Ashok P

    2015-02-01

    The proteinase inhibitor (PI) genes from Capsicum annuum were characterized with respect to their UTR, introns and promoter elements. The occurrence of PIs with circularly permuted domain organization was evident. Several potato inhibitor II (Pin-II) type proteinase inhibitor (PI) genes have been analyzed from Capsicum annuum (L.) with respect to their differential expression during plant defense response. However, complete gene characterization of any of these C. annuum PIs (CanPIs) has not been carried out so far. Complete gene architectures of a previously identified CanPI-7 (Beads-on-string, Type A) and a member of newly isolated Bracelet type B, CanPI-69 are reported in this study. The 5' UTR (untranslated region), 3'UTR, and intronic sequences of both the CanPI genes were obtained. The genomic sequence of CanPI-7 exhibited, exon 1 (49 base pair, bp) and exon 2 (740 bp) interrupted by a 294-bp long type I intron. We noted the occurrence of three multi-domain PIs (CanPI-69, 70, 71) with circularly permuted domain organization. CanPI-69 was found to possess exon 1 (49 bp), exon 2 (551 bp) and a 584-bp long type I intron. The upstream sequence analysis of CanPI-7 and CanPI-69 predicted various transcription factor-binding sites including TATA and CAAT boxes, hormone-responsive elements (ABRELATERD1, DOFCOREZM, ERELEE4), and a defense-responsive element (WRKY71OS). Binding of transcription factors such as zinc finger motif MADS-box and MYB to the promoter regions was confirmed using electrophoretic mobility shift assay followed by mass spectrometric identification. The 3' UTR analysis for 25 CanPI genes revealed unique/distinct 3' UTR sequence for each gene. Structures of three domain CanPIs of type A and B were predicted and further analyzed for their attributes. This investigation of CanPI gene architecture will enable the better understanding of the genetic elements present in CanPIs.

  9. Intronic L1 Retrotransposons and Nested Genes Cause Transcriptional Interference by Inducing Intron Retention, Exonization and Cryptic Polyadenylation

    PubMed Central

    Kaer, Kristel; Branovets, Jelena; Hallikma, Anni; Nigumann, Pilvi; Speek, Mart

    2011-01-01

    Background Transcriptional interference has been recently recognized as an unexpectedly complex and mostly negative regulation of genes. Despite a relatively few studies that emerged in recent years, it has been demonstrated that a readthrough transcription derived from one gene can influence the transcription of another overlapping or nested gene. However, the molecular effects resulting from this interaction are largely unknown. Methodology/Principal Findings Using in silico chromosome walking, we searched for prematurely terminated transcripts bearing signatures of intron retention or exonization of intronic sequence at their 3′ ends upstream to human L1 retrotransposons, protein-coding and noncoding nested genes. We demonstrate that transcriptional interference induced by intronic L1s (or other repeated DNAs) and nested genes could be characterized by intron retention, forced exonization and cryptic polyadenylation. These molecular effects were revealed from the analysis of endogenous transcripts derived from different cell lines and tissues and confirmed by the expression of three minigenes in cell culture. While intron retention and exonization were comparably observed in introns upstream to L1s, forced exonization was preferentially detected in nested genes. Transcriptional interference induced by L1 or nested genes was dependent on the presence or absence of cryptic splice sites, affected the inclusion or exclusion of the upstream exon and the use of cryptic polyadenylation signals. Conclusions/Significance Our results suggest that transcriptional interference induced by intronic L1s and nested genes could influence the transcription of the large number of genes in normal as well as in tumor tissues. Therefore, this type of interference could have a major impact on the regulation of the host gene expression. PMID:22022525

  10. The genome sequence of the colonial chordate, Botryllus schlosseri

    PubMed Central

    Voskoboynik, Ayelet; Neff, Norma F; Sahoo, Debashis; Newman, Aaron M; Pushkarev, Dmitry; Koh, Winston; Passarelli, Benedetto; Fan, H Christina; Mantalas, Gary L; Palmeri, Karla J; Ishizuka, Katherine J; Gissi, Carmela; Griggio, Francesca; Ben-Shlomo, Rachel; Corey, Daniel M; Penland, Lolita; White, Richard A; Weissman, Irving L; Quake, Stephen R

    2013-01-01

    Botryllus schlosseri is a colonial urochordate that follows the chordate plan of development following sexual reproduction, but invokes a stem cell-mediated budding program during subsequent rounds of asexual reproduction. As urochordates are considered to be the closest living invertebrate relatives of vertebrates, they are ideal subjects for whole genome sequence analyses. Using a novel method for high-throughput sequencing of eukaryotic genomes, we sequenced and assembled 580 Mbp of the B. schlosseri genome. The genome assembly is comprised of nearly 14,000 intron-containing predicted genes, and 13,500 intron-less predicted genes, 40% of which could be confidently parceled into 13 (of 16 haploid) chromosomes. A comparison of homologous genes between B. schlosseri and other diverse taxonomic groups revealed genomic events underlying the evolution of vertebrates and lymphoid-mediated immunity. The B. schlosseri genome is a community resource for studying alternative modes of reproduction, natural transplantation reactions, and stem cell-mediated regeneration. DOI: http://dx.doi.org/10.7554/eLife.00569.001 PMID:23840927

  11. The Anopheles stephensi odorant binding protein 1 (AsteObp1) gene: a new molecular marker for biological forms diagnosis.

    PubMed

    Gholizadeh, S; Firooziyan, S; Ladonni, H; Hajipirloo, H Mohammadzadeh; Djadid, N Dinparast; Hosseini, A; Raz, A

    2015-06-01

    Anopheles (Cellia) stephensi Liston 1901 is known as an Asian malaria vector. Three biological forms, namely "mysorensis", "intermediate", and "type" have been earlier reported in this species. Nevertheless, the present morphological and molecular information is insufficient to diagnose these forms. During this investigation, An. stephensi biological forms were morphologically identified and sequenced for odorant-binding protein 1 (Obp1) gene. Also, intron I sequences were used to construct phylogenetic trees. Despite nucleotide sequence variation in exon of AsteObp1, nearly 100% identity was observed at the amino acid level among the three biological forms. In order to overcome difficulties in using egg morphology characters, intron I sequences of An. stephensi Obp1 opens new molecular way to the identification of the main Asian malaria vector biological forms. However, multidisciplinary studies are needed to establish the taxonomic status of An. stephensi. Copyright © 2015 Elsevier B.V. All rights reserved.

  12. [Study of gene mutation in 62 hemophilia A children].

    PubMed

    Hu, Q; Liu, A G; Zhang, L Q; Zhang, A; Wang, Y Q; Wang, S M; Lu, Y J; Wang, X

    2017-11-02

    Objective: To analyze the mutation type of FⅧ gene in children with hemophilia A and to explore the relationship among hemophilia gene mutation spectrum, gene mutation and clinical phenotype. Method: Sixty-two children with hemophilia A from Department of Pediatric Hematology, Tongji Hospital of Tongji Medical College, Huazhong University of Science and Technology between January 2015 and March 2017 were enrolled. All patients were male, aged from 4 months to 7 years and F Ⅷ activity ranged 0.2%-11.0%. Fifty cases had severe, 10 cases had moderate and 2 cases had mild hemophilia A. DNA was isolated from peripheral blood in hemophilia A children and the target gene fragment was amplified by PCR, in combination with the second generation sequencing, 22 and 1 introns were detected. Negative cases were detected by the second generation sequencing and results were compared with those of the international FⅧ gene mutation database. Result: There were 20 cases (32%) of intron 22 inversion, 2 cases (3%) of intron 1 inversion, 18 cases (29%) of missense mutation, 5 cases (8%) of nonsense mutation, 7 cases (11%) of deletion mutation, 1 case(2%)of splice site mutation, 2 cases (3%) of large fragment deletion and 1 case of insertion mutation (2%). No mutation was detected in 2 cases (3%), and 4 cases (7%) failed to amplify. The correlation between phenotype and genotype showed that the most common gene mutation in severe hemophilia A was intron 22 inversion (20 cases), accounting for 40% of severe patients, followed by 11 cases of missense mutation (22%). The most common mutation in moderate hemophilia A was missense mutation (6 cases), accounting for 60% of moderate patients. Conclusion: The most frequent mutation type in hemophilia A was intron 22 inversion, followed by missense mutation, again for missing mutation. The relationship between phenotype and genotype: the most frequent gene mutation in severe hemophilia A is intron 22 inversion, followed by missense mutation; the most frequent gene mutation in medium hemophilia A is missense mutation.

  13. Colonization of Ireland: revisiting 'the pygmy shrew syndrome' using mitochondrial, Y chromosomal and microsatellite markers.

    PubMed

    McDevitt, A D; Vega, R; Rambau, R V; Yannic, G; Herman, J S; Hayden, T J; Searle, J B

    2011-12-01

    There is great uncertainty about how Ireland attained its current fauna and flora. Long-distance human-mediated colonization from southwestern Europe has been seen as a possible way that Ireland obtained many of its species; however, Britain has (surprisingly) been neglected as a source area for Ireland. The pygmy shrew has long been considered an illustrative model species, such that the uncertainty of the Irish colonization process has been dubbed 'the pygmy shrew syndrome'. Here, we used new genetic data consisting of 218 cytochrome (cyt) b sequences, 153 control region sequences, 17 Y-intron sequences and 335 microsatellite multilocus genotypes to distinguish between four possible hypotheses for the colonization of the British Isles, formulated in the context of previously published data. Cyt b sequences from western Europe were basal to those found in Ireland, but also to those found in the periphery of Britain and several offshore islands. Although the central cyt b haplotype in Ireland was found in northern Spain, we argue that it most likely occurred in Britain also, from where the pygmy shrew colonized Ireland as a human introduction during the Holocene. Y-intron and microsatellite data are consistent with this hypothesis, and the biological traits and distributional data of pygmy shrews argue against long-distance colonization from Spain. The compact starburst of the Irish cyt b expansion and the low genetic diversity across all markers strongly suggests a recent colonization. This detailed molecular study of the pygmy shrew provides a new perspective on an old colonization question.

  14. Colonization of Ireland: revisiting ‘the pygmy shrew syndrome' using mitochondrial, Y chromosomal and microsatellite markers

    PubMed Central

    McDevitt, A D; Vega, R; Rambau, R V; Yannic, G; Herman, J S; Hayden, T J; Searle, J B

    2011-01-01

    There is great uncertainty about how Ireland attained its current fauna and flora. Long-distance human-mediated colonization from southwestern Europe has been seen as a possible way that Ireland obtained many of its species; however, Britain has (surprisingly) been neglected as a source area for Ireland. The pygmy shrew has long been considered an illustrative model species, such that the uncertainty of the Irish colonization process has been dubbed ‘the pygmy shrew syndrome'. Here, we used new genetic data consisting of 218 cytochrome (cyt) b sequences, 153 control region sequences, 17 Y-intron sequences and 335 microsatellite multilocus genotypes to distinguish between four possible hypotheses for the colonization of the British Isles, formulated in the context of previously published data. Cyt b sequences from western Europe were basal to those found in Ireland, but also to those found in the periphery of Britain and several offshore islands. Although the central cyt b haplotype in Ireland was found in northern Spain, we argue that it most likely occurred in Britain also, from where the pygmy shrew colonized Ireland as a human introduction during the Holocene. Y-intron and microsatellite data are consistent with this hypothesis, and the biological traits and distributional data of pygmy shrews argue against long-distance colonization from Spain. The compact starburst of the Irish cyt b expansion and the low genetic diversity across all markers strongly suggests a recent colonization. This detailed molecular study of the pygmy shrew provides a new perspective on an old colonization question. PMID:21673740

  15. EvolMarkers: a database for mining exon and intron markers for evolution, ecology and conservation studies.

    PubMed

    Li, Chenhong; Riethoven, Jean-Jack M; Naylor, Gavin J P

    2012-09-01

    Recent innovations in next-generation sequencing have lowered the cost of genome projects. Nevertheless, sequencing entire genomes for all representatives in a study remains expensive and unnecessary for most studies in ecology, evolution and conservation. It is still more cost-effective and efficient to target and sequence single-copy nuclear gene markers for such studies. Many tools have been developed for identifying nuclear markers, but most of these have focused on particular taxonomic groups. We have built a searchable database, EvolMarkers, for developing single-copy coding sequence (CDS) and exon-primed-intron-crossing (EPIC) markers that is designed to work across a broad range of phylogenetic divergences. The database is made up of single-copy CDS derived from BLAST searches of a variety of metazoan genomes. Users can search the database for different types of markers (CDS or EPIC) that are common to different sets of input species with different divergence characteristics. EvolMarkers can be applied to any taxonomic group for which genome data are available for two or more species. We included 82 genomes in the first version of EvolMarkers and have found the methods to be effective across Placozoa, Cnidaria, Arthropod, Nematoda, Annelida, Mollusca, Echinodermata, Hemichordata, Chordata and plants. We demonstrate the effectiveness of searching for CDS markers within annelids and show how to find potentially useful intronic markers within the lizard Anolis. © 2012 Blackwell Publishing Ltd.

  16. Structural analysis of the 5{prime} region of mouse and human Huntington disease genes reveals conservation of putative promoter region and Di- and trinucleotide polymorphisms

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lin, Biaoyang; Nasir, J.; Kalchman, M.A.

    1995-02-10

    We have previously cloned and characterized the murine homologue of the Huntington disease (HD) gene and shown that it maps to mouse chromosome 5 within a region of conserved synteny with human chromosome 4p16.3. Here we present a detailed comparison of the sequence of the putative promoter and the organization of the 5{prime} genomic region of the murine (Hdh) and human HD genes encompassing the first five exons. We show that in this region these two genes share identical exon boundaries, but have different-size introns. Two dinucleotide (CT) and one trinucleotide intronic polymorphism in Hdh and an intronic CA polymorphismmore » in the HD gene were identified. Comparison of 940-bp sequence 5{prime} to the putative translation start site reveals a highly conserved region (78.8% nucleotide identity) between Hdh and the HD gene from nucleotide -56 to -206 (of Hdh). Neither Hdh nor the HD gene have typical TATA or CCAAT elements, but both show one putative AP2 binding site and numerous potential Sp1 binding sites. The high sequence identity between Hdh and the HD gene for approximately 200 bp 5{prime} to the putative translation start site indicates that these sequences may play a role in regulating expression of the Huntington disease gene. 30 refs., 4 figs., 2 tabs.« less

  17. zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs.

    PubMed

    Parekh, Swati; Ziegenhain, Christoph; Vieth, Beate; Enard, Wolfgang; Hellmann, Ines

    2018-06-01

    Single-cell RNA-sequencing (scRNA-seq) experiments typically analyze hundreds or thousands of cells after amplification of the cDNA. The high throughput is made possible by the early introduction of sample-specific bar codes (BCs), and the amplification bias is alleviated by unique molecular identifiers (UMIs). Thus, the ideal analysis pipeline for scRNA-seq data needs to efficiently tabulate reads according to both BC and UMI. zUMIs is a pipeline that can handle both known and random BCs and also efficiently collapse UMIs, either just for exon mapping reads or for both exon and intron mapping reads. If BC annotation is missing, zUMIs can accurately detect intact cells from the distribution of sequencing reads. Another unique feature of zUMIs is the adaptive downsampling function that facilitates dealing with hugely varying library sizes but also allows the user to evaluate whether the library has been sequenced to saturation. To illustrate the utility of zUMIs, we analyzed a single-nucleus RNA-seq dataset and show that more than 35% of all reads map to introns. Also, we show that these intronic reads are informative about expression levels, significantly increasing the number of detected genes and improving the cluster resolution. zUMIs flexibility makes if possible to accommodate data generated with any of the major scRNA-seq protocols that use BCs and UMIs and is the most feature-rich, fast, and user-friendly pipeline to process such scRNA-seq data.

  18. Isolation and characterization of a water stress-specific genomic gene, pwsi 18, from rice.

    PubMed

    Joshee, N; Kisaka, H; Kitagawa, Y

    1998-01-01

    One of the water stress-specific cDNA clones of rice characterised previously, wsi18, was selected for further study. The wsi18 gene can be induced by water stress conditions such as mannitol, NaCl, and dryness, but not by ABA, cold, or heat. A genomic clone for wsi18, pwsi18, contained about 1.7 kbp of the 5' upstream sequence, two introns, and the full coding sequence. The 5'-upstream sequence of pwsi18 contained putative cis-acting elements, namely an ABA-responsive element (ABRE), three G-boxes, three E-boxes, a MEF-2 sequence, four direct and two inverted repeats, and four sequences similar to DRE, which is involved in the dehydration response of Arabidopsis genes. The gusA reporter gene under the control of the pwsi18 promoter showed transient expression in response to water stress. Deletion of the downstream DRE-like sequence between the distal G-boxes-2 and -3 resulted in rather low GUS expression.

  19. Human intron-encoded Alu RNAs are processed and packaged into Wdr79-associated nucleoplasmic box H/ACA RNPs

    PubMed Central

    Jády, Beáta E.; Ketele, Amandine; Kiss, Tamás

    2012-01-01

    Alu repetitive sequences are the most abundant short interspersed DNA elements in the human genome. Full-length Alu elements are composed of two tandem sequence monomers, the left and right Alu arms, both derived from the 7SL signal recognition particle RNA. Since Alu elements are common in protein-coding genes, they are frequently transcribed into pre-mRNAs. Here, we demonstrate that the right arms of nascent Alu transcripts synthesized within pre-mRNA introns are processed into metabolically stable small RNAs. The intron-encoded Alu RNAs, termed AluACA RNAs, are structurally highly reminiscent of box H/ACA small Cajal body (CB) RNAs (scaRNAs). They are composed of two hairpin units followed by the essential H (AnAnnA) and ACA box motifs. The mature AluACA RNAs associate with the four H/ACA core proteins: dyskerin, Nop10, Nhp2, and Gar1. Moreover, the 3′ hairpin of AluACA RNAs carries two closely spaced CB localization motifs, CAB boxes (UGAG), which bind Wdr79 in a cumulative fashion. In contrast to canonical H/ACA scaRNPs, which concentrate in CBs, the AluACA RNPs accumulate in the nucleoplasm. Identification of 348 human AluACA RNAs demonstrates that intron-encoded AluACA RNAs represent a novel, large subgroup of H/ACA RNAs, which are apparently confined to human or primate cells. PMID:22892240

  20. Partial androgen insensitivity syndrome caused by a deep intronic mutation creating an alternative splice acceptor site of the AR gene.

    PubMed

    Ono, Hiroyuki; Saitsu, Hirotomo; Horikawa, Reiko; Nakashima, Shinichi; Ohkubo, Yumiko; Yanagi, Kumiko; Nakabayashi, Kazuhiko; Fukami, Maki; Fujisawa, Yasuko; Ogata, Tsutomu

    2018-02-02

    Although partial androgen insensitivity syndrome (PAIS) is caused by attenuated responsiveness to androgens, androgen receptor gene (AR) mutations on the coding regions and their splice sites have been identified only in <25% of patients with a diagnosis of PAIS. We performed extensive molecular studies including whole exome sequencing in a Japanese family with PAIS, identifying a deep intronic variant beyond the branch site at intron 6 of AR (NM_000044.4:c.2450-42 G > A). This variant created the splice acceptor motif that was accompanied by pyrimidine-rich sequence and two candidate branch sites. Consistent with this, reverse transcriptase (RT)-PCR experiments for cycloheximide-treated lymphoblastoid cell lines revealed a relatively large amount of aberrant mRNA produced by the newly created splice acceptor site and a relatively small amount of wildtype mRNA produced by the normal splice acceptor site. Furthermore, most of the aberrant mRNA was shown to undergo nonsense mediated decay (NMD) and, if a small amount of aberrant mRNA may have escaped NMD, such mRNA was predicted to generate a truncated AR protein missing some functional domains. These findings imply that the deep intronic mutation creating an alternative splice acceptor site resulted in the production of a relatively small amount of wildtype AR mRNA, leading to PAIS.

  1. Targeted Deep Resequencing Identifies Coding Variants in the PEAR1 Gene That Play a Role in Platelet Aggregation

    PubMed Central

    Kim, Yoonhee; Suktitipat, Bhoom; Yanek, Lisa R.; Faraday, Nauder; Wilson, Alexander F.; Becker, Diane M.; Becker, Lewis C.; Mathias, Rasika A.

    2013-01-01

    Platelet aggregation is heritable, and genome-wide association studies have detected strong associations with a common intronic variant of the platelet endothelial aggregation receptor1 (PEAR1) gene both in African American and European American individuals. In this study, we used a sequencing approach to identify additional exonic variants in PEAR1 that may also determine variability in platelet aggregation in the GeneSTAR Study. A 0.3 Mb targeted region on chromosome 1q23.1 including the entire PEAR1 gene was Sanger sequenced in 104 subjects (45% male, 49% African American, age = 52±13) selected on the basis of hyper- and hypo- aggregation across three different agonists (collagen, epinephrine, and adenosine diphosphate). Single-variant and multi-variant burden tests for association were performed. Of the 235 variants identified through sequencing, 61 were novel, and three of these were missense variants. More rare variants (MAF<5%) were noted in African Americans compared to European Americans (108 vs. 45). The common intronic GWAS-identified variant (rs12041331) demonstrated the most significant association signal in African Americans (p = 4.020×10−4); no association was seen for additional exonic variants in this group. In contrast, multi-variant burden tests indicated that exonic variants play a more significant role in European Americans (p = 0.0099 for the collective coding variants compared to p = 0.0565 for intronic variant rs12041331). Imputation of the individual exonic variants in the rest of the GeneSTAR European American cohort (N = 1,965) supports the results noted in the sequenced discovery sample: p = 3.56×10−4, 2.27×10−7, 5.20×10−5 for coding synonymous variant rs56260937 and collagen, epinephrine and adenosine diphosphate induced platelet aggregation, respectively. Sequencing approaches confirm that a common intronic variant has the strongest association with platelet aggregation in African Americans, and show that exonic variants play an additional role in platelet aggregation in European Americans. PMID:23704978

  2. Strong Signature of Natural Selection within an FHIT Intron Implicated in Prostate Cancer Risk

    PubMed Central

    Ding, Yan; Larson, Garrett; Rivas, Guillermo; Lundberg, Cathryn; Geller, Louis; Ouyang, Ching; Weitzel, Jeffrey; Archambeau, John; Slater, Jerry; Daly, Mary B.; Benson, Al B.; Kirkwood, John M.; O'Dwyer, Peter J.; Sutphen, Rebecca; Stewart, James A.; Johnson, David; Nordborg, Magnus; Krontiris, Theodore G.

    2008-01-01

    Previously, a candidate gene linkage approach on brother pairs affected with prostate cancer identified a locus of prostate cancer susceptibility at D3S1234 within the fragile histidine triad gene (FHIT), a tumor suppressor that induces apoptosis. Subsequent association tests on 16 SNPs spanning approximately 381 kb surrounding D3S1234 in Americans of European descent revealed significant evidence of association for a single SNP within intron 5 of FHIT. In the current study, re-sequencing and genotyping within a 28.5 kb region surrounding this SNP further delineated the association with prostate cancer risk to a 15 kb region. Multiple SNPs in sequences under evolutionary constraint within intron 5 of FHIT defined several related haplotypes with an increased risk of prostate cancer in European-Americans. Strong associations were detected for a risk haplotype defined by SNPs 138543, 142413, and 152494 in all cases (Pearson's χ2 = 12.34, df 1, P = 0.00045) and for the homozygous risk haplotype defined by SNPs 144716, 142413, and 148444 in cases that shared 2 alleles identical by descent with their affected brothers (Pearson's χ2 = 11.50, df 1, P = 0.00070). In addition to highly conserved sequences encompassing SNPs 148444 and 152413, population studies revealed strong signatures of natural selection for a 1 kb window covering the SNP 144716 in two human populations, the European American (π = 0.0072, Tajima's D = 3.31, 14 SNPs) and the Japanese (π = 0.0049, Fay & Wu's H = 8.05, 14 SNPs), as well as in chimpanzees (Fay & Wu's H = 8.62, 12 SNPs). These results strongly support the involvement of the FHIT intronic region in an increased risk of prostate cancer. PMID:18953408

  3. Deep developmental transcriptome sequencing uncovers numerous new genes and enhances gene annotation in the sponge Amphimedon queenslandica.

    PubMed

    Fernandez-Valverde, Selene L; Calcino, Andrew D; Degnan, Bernard M

    2015-05-15

    The demosponge Amphimedon queenslandica is amongst the few early-branching metazoans with an assembled and annotated draft genome, making it an important species in the study of the origin and early evolution of animals. Current gene models in this species are largely based on in silico predictions and low coverage expressed sequence tag (EST) evidence. Amphimedon queenslandica protein-coding gene models are improved using deep RNA-Seq data from four developmental stages and CEL-Seq data from 82 developmental samples. Over 86% of previously predicted genes are retained in the new gene models, although 24% have additional exons; there is also a marked increase in the total number of annotated 3' and 5' untranslated regions (UTRs). Importantly, these new developmental transcriptome data reveal numerous previously unannotated protein-coding genes in the Amphimedon genome, increasing the total gene number by 25%, from 30,060 to 40,122. In general, Amphimedon genes have introns that are markedly smaller than those in other animals and most of the alternatively spliced genes in Amphimedon undergo intron-retention; exon-skipping is the least common mode of alternative splicing. Finally, in addition to canonical polyadenylation signal sequences, Amphimedon genes are enriched in a number of unique AT-rich motifs in their 3' UTRs. The inclusion of developmental transcriptome data has substantially improved the structure and composition of protein-coding gene models in Amphimedon queenslandica, providing a more accurate and comprehensive set of genes for functional and comparative studies. These improvements reveal the Amphimedon genome is comprised of a remarkably high number of tightly packed genes. These genes have small introns and there is pervasive intron retention amongst alternatively spliced transcripts. These aspects of the sponge genome are more similar unicellular opisthokont genomes than to other animal genomes.

  4. Mitochondrion-to-Chloroplast DNA Transfers and Intragenomic Proliferation of Chloroplast Group II Introns in Gloeotilopsis Green Algae (Ulotrichales, Ulvophyceae).

    PubMed

    Turmel, Monique; Otis, Christian; Lemieux, Claude

    2016-09-19

    To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G planctonica and 262,888-bp G sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. Mitochondrion-to-Chloroplast DNA Transfers and Intragenomic Proliferation of Chloroplast Group II Introns in Gloeotilopsis Green Algae (Ulotrichales, Ulvophyceae)

    PubMed Central

    Turmel, Monique; Otis, Christian; Lemieux, Claude

    2016-01-01

    Abstract To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G. planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G. planctonica and 262,888-bp G. sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G. sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. PMID:27503298

  6. Variations of Human DNA Polymerase Genes as Biomarkers of Prostate Cancer Progression

    DTIC Science & Technology

    2011-07-01

    Forward sequence Reverse sequence Sequence contextb 1 g.39835C4Tc P169S 15 25 gTG GGG TC CTT g.39897C4T Intronic 22 15 AGA T GGt TA AAT g.39985T4C...Intronic 34 25 AGA TT tAA AAG g.40051C4Tc P184S 19 34 TGt CT GGA ATT 4 g.39835C4Tc P169S 19 29 gTG GGG TC CTT g.40051C4Tc P184S 23 34 TGt CT GGA ATT 6 g...39835C4Tc P169S 14 24 gTG GGG TC CTT g.40051C4Tc P184S 21 32 TGt CT GGA ATT 11 g.40055A4G D185G 28 35 TTC C AGA C AAG g.40073A4G Y191C 28 20 gGA T AtG CC

  7. Chromosomal localization and partial genomic structure of the human peroxisome proliferator activated receptor-gamma (hPPAR gamma) gene.

    PubMed

    Beamer, B A; Negri, C; Yen, C J; Gavrilova, O; Rumberger, J M; Durcan, M J; Yarnall, D P; Hawkins, A L; Griffin, C A; Burns, D K; Roth, J; Reitman, M; Shuldiner, A R

    1997-04-28

    We determined the chromosomal localization and partial genomic structure of the coding region of the human PPAR gamma gene (hPPAR gamma), a nuclear receptor important for adipocyte differentiation and function. Sequence analysis and long PCR of human genomic DNA with primers that span putative introns revealed that intron positions and sizes of hPPAR gamma are similar to those previously determined for the mouse PPAR gamma gene[13]. Fluorescent in situ hybridization localized hPPAR gamma to chromosome 3, band 3p25. Radiation hybrid mapping with two independent primer pairs was consistent with hPPAR gamma being within 1.5 Mb of marker D3S1263 on 3p25-p24.2. These sequences of the intron/exon junctions of the 6 coding exons shared by hPPAR gamma 1 and hPPAR gamma 2 will facilitate screening for possible mutations. Furthermore, D3S1263 is a suitable polymorphic marker for linkage analysis to evaluate PPAR gamma's potential contribution to genetic susceptibility to obesity, lipoatrophy, insulin resistance, and diabetes.

  8. Exon Shuffling and Origin of Scorpion Venom Biodiversity

    PubMed Central

    Wang, Xueli; Gao, Bin; Zhu, Shunyi

    2016-01-01

    Scorpion venom is a complex combinatorial library of peptides and proteins with multiple biological functions. A combination of transcriptomic and proteomic techniques has revealed its enormous molecular diversity, as identified by the presence of a large number of ion channel-targeted neurotoxins with different folds, membrane-active antimicrobial peptides, proteases, and protease inhibitors. Although the biodiversity of scorpion venom has long been known, how it arises remains unsolved. In this work, we analyzed the exon-intron structures of an array of scorpion venom protein-encoding genes and unexpectedly found that nearly all of these genes possess a phase-1 intron (one intron located between the first and second nucleotides of a codon) near the cleavage site of a signal sequence despite their mature peptides remarkably differ. This observation matches a theory of exon shuffling in the origin of new genes and suggests that recruitment of different folds into scorpion venom might be achieved via shuffling between body protein-coding genes and ancestral venom gland-specific genes that presumably contributed tissue-specific regulatory elements and secretory signal sequences. PMID:28035955

  9. Exon Shuffling and Origin of Scorpion Venom Biodiversity.

    PubMed

    Wang, Xueli; Gao, Bin; Zhu, Shunyi

    2016-12-26

    Scorpion venom is a complex combinatorial library of peptides and proteins with multiple biological functions. A combination of transcriptomic and proteomic techniques has revealed its enormous molecular diversity, as identified by the presence of a large number of ion channel-targeted neurotoxins with different folds, membrane-active antimicrobial peptides, proteases, and protease inhibitors. Although the biodiversity of scorpion venom has long been known, how it arises remains unsolved. In this work, we analyzed the exon-intron structures of an array of scorpion venom protein-encoding genes and unexpectedly found that nearly all of these genes possess a phase-1 intron (one intron located between the first and second nucleotides of a codon) near the cleavage site of a signal sequence despite their mature peptides remarkably differ. This observation matches a theory of exon shuffling in the origin of new genes and suggests that recruitment of different folds into scorpion venom might be achieved via shuffling between body protein-coding genes and ancestral venom gland-specific genes that presumably contributed tissue-specific regulatory elements and secretory signal sequences.

  10. Genomic organization and expression of the human MSH3 gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Watanabe, Atsushi; Ikejima, Miyoko; Suzuki, Noriko

    1996-02-01

    We have studied the expression and genomic organization of the human MSH3 gene, which encodes a human homologue of the bacterial DNA mismatch repair protein MutS. This gene is located upstream of the dihydrofolate reductase (DHFR) gene. Northern analysis has demonstrated that the hMSH3 gene is expressed in a variety of human tissues at low levels, like the DHFR gene. Characterization of cosmid clones has shown that the hMSH3 gene consists of 24 exons spanning at least 160 kb. All exon-intron junction sequences match the classical GT/AG rule, except that intron 6 has AT and AA at the ends. Twomore » major transcripts of 5.0 and 3.8 kb have been shown to be derived from the differential use of two polyadenylation sites. Elucidation of the complete genomic organization and the nucleotide sequences of the introns of the hMSH3 gene should be useful for studying the function of this gene and the possible involvement of specific mutations of the hMSH3 gene in some diseases. 34 refs., 5 figs., 1 tab.« less

  11. Characterization of Ferredoxin-Dependent Glutamine-Oxoglutarate Amidotransferase (Fd-GOGAT) Genes and Their Relationship with Grain Protein Content QTL in Wheat

    PubMed Central

    2014-01-01

    Background In higher plants, inorganic nitrogen is assimilated via the glutamate synthase cycle or GS-GOGAT pathway. GOGAT enzyme occurs in two distinct forms that use NADH (NADH-GOGAT) or Fd (Fd-GOGAT) as electron carriers. The goal of the present study was to characterize wheat Fd-GOGAT genes and to assess the linkage with grain protein content (GPC), an important quantitative trait controlled by multiple genes. Results We report the complete genomic sequences of the three homoeologous A, B and D Fd-GOGAT genes from hexaploid wheat (Triticum aestivum) and their localization and characterization. The gene is comprised of 33 exons and 32 introns for all the three homoeologues genes. The three genes show the same exon/intron number and size, with the only exception of a series of indels in intronic regions. The partial sequence of the Fd-GOGAT gene located on A genome was determined in two durum wheat (Triticum turgidum ssp. durum) cvs Ciccio and Svevo, characterized by different grain protein content. Genomic differences allowed the gene mapping in the centromeric region of chromosome 2A. QTL analysis was conducted in the Svevo×Ciccio RIL mapping population, previously evaluated in 5 different environments. The study co-localized the Fd-GOGAT-A gene with the marker GWM-339, identifying a significant major QTL for GPC. Conclusions The wheat Fd-GOGAT genes are highly conserved; both among the three homoeologous hexaploid wheat genes and in comparison with other plants. In durum wheat, an association was shown between the Fd-GOGAT allele of cv Svevo with increasing GPC - potentially useful in breeding programs. PMID:25099972

  12. Identification of processed Chinese medicinal materials using DNA mini-barcoding.

    PubMed

    Song, Ming; Dong, Gang-Qiang; Zhang, Ya-Qin; Liu, Xia; Sun, Wei

    2017-07-01

    Most of Chinese medicinal herbs are subjected to traditional processing procedures, including stir-frying, charring, steaming, boiling, and calcining before they are released into dispensaries. The marketing and identification of processed medicinal materials is a growing issue in the marketplace. However, conventional methods of identification have limitations, while DNA mini-barcoding, based on the sequencing of a short-standardized region, has received considerable attention as a new potential means to identify processed medicinal materials. In the present study, six DNA barcode loci including ITS2, psbA-trnH, rbcL, matK, trnL (UAA) intron and its P6 loop, were employed for the authentication of 45 processed samples belonging to 15 species. We evaluated the amplification efficiency of each locus. We also examined the identification accuracy of the potential mini-barcode locus, of trnL (UAA) intron P6 loop. Our results showed that the five primary barcode loci were successfully amplified in only 8.89%-20% of the processed samples, while the amplification rates of the trnL (UAA) intron P6 loop were higher, at 75.56% successful amplification. We compared the mini-barcode sequences with Genbank using the Blast program. The analysis showed that 45.23% samples could be identified to genus level, while only one sample could be identified to the species level. We conclude that trnL (UAA) p6 loop is a candidate mini-barcode that has shown its potential and may become a universal mini-barcode as complementary barcode for authenticity testing and will play an important role in medicinal materials control. Copyright © 2017 China Pharmaceutical University. Published by Elsevier B.V. All rights reserved.

  13. Novel USH2A compound heterozygous mutations cause RP/USH2 in a Chinese family.

    PubMed

    Liu, Xiaowen; Tang, Zhaohui; Li, Chang; Yang, Kangjuan; Gan, Guanqi; Zhang, Zibo; Liu, Jingyu; Jiang, Fagang; Wang, Qing; Liu, Mugen

    2010-03-17

    To identify the disease-causing gene in a four-generation Chinese family affected with retinitis pigmentosa (RP). Linkage analysis was performed with a panel of microsatellite markers flanking the candidate genetic loci of RP. These loci included 38 known RP genes. The complete coding region and exon-intron boundaries of Usher syndrome 2A (USH2A) were sequenced with the proband DNA to screen the disease-causing gene mutation. Restriction fragment length polymorphism (RFLP) analysis and direct DNA sequence analysis were done to demonstrate co-segregation of the USH2A mutations with the family disease. One hundred normal controls were used without the mutations. The disease-causing gene in this Chinese family was linked to the USH2A locus on chromosome 1q41. Direct DNA sequence analysis of USH2A identified two novel mutations in the patients: one missense mutation p.G1734R in exon 26 and a splice site mutation, IVS32+1G>A, which was found in the donor site of intron 32 of USH2A. Neither the p.G1734R nor the IVS32+1G>A mutation was found in the unaffected family members or the 100 normal controls. One patient with a homozygous mutation displayed only RP symptoms until now, while three patients with compound heterozygous mutations in the family of study showed both RP and hearing impairment. This study identified two novel mutations: p.G1734R and IVS32+1G>A of USH2A in a four-generation Chinese RP family. In this study, the heterozygous mutation and the homozygous mutation in USH2A may cause Usher syndrome Type II or RP, respectively. These two mutations expand the mutant spectrum of USH2A.

  14. An engineered Streptomyces hygroscopicus aph 7" gene mediates dominant resistance against hygromycin B in Chlamydomonas reinhardtii.

    PubMed

    Berthold, Peter; Schmitt, Rüdiger; Mages, Wolfgang

    2002-12-01

    We have developed a positively selectable marker for the green alga Chlamydomonas reinhardtii using the Streptomyces hygroscopicus aminoglycoside phosphotransferase gene (aph7"). Its expression is controlled by C. reinhardtii regulatory elements, namely, the beta2-tubulin gene promoter in combination with the first intron and the 3' untranslated region of the small subunit of ribulose bisphosphate carboxylase, rbcS2. C. reinhardtii cell-wall deficient and wild-type strains were transformed at rates up to 5 x 10(-5) with two constructs, pHyg3 and pHyg4 (intron-less). Transformants selected on plates with 10 microg/ml hygromycin B exhibited diverse levels of resistance of up to 200 microg/ml that were stably maintained for at least seven months; they contained two to five copies of the construct integrated in their genomes. Transcription of the chimeric aph7" gene, correct splicing of the rbcS2 intron, and polyadenylation of the transcripts have been verified by sequencing of RT-PCR products. Average co-transformation rates using pHyg3 and a second selectable plasmid were about 11%. This advocates the hygromycin-resistance plasmid, pHyg3, as a new versatile tool for the transformation of a broad range of C. reinhardtii strains without the sustained need for using auxotrophic mutants as recipients.

  15. Identification of a novel large deletion in a patient with severe factor V deficiency using an in-house F5 MLPA assay.

    PubMed

    Nuzzo, F; Paraboschi, E M; Straniero, L; Pavlova, A; Duga, S; Castoldi, E

    2015-01-01

    Factor V (FV) deficiency is a rare autosomal recessive bleeding disorder caused by mutations in the F5 gene. FV-deficient patients in whom no mutation or only one mutation is found may harbour large gene rearrangements, which are not detected by conventional mutation screening strategies. The aim of this study was to develop and validate a multiplex ligation-dependent probe amplification (MLPA) assay for the detection of large deletions and duplications in the F5 gene. Twenty-two MLPA probes targeting 19 of the 25 exons and the upstream and downstream regions of the F5 gene were designed and tested in 10 normal controls, a patient with a known heterozygous deletion of F5 exons 1-7 (positive control) and 14 genetically unexplained FV-deficient patients. MLPA results were confirmed by digital PCR on a QuantStudio(™) 3D Digital PCR System. The F5-specific probes yielded a reproducible peak profile in normal controls, correctly detected the known deletion in the positive control and suggested the presence of a novel deletion of exons 9-10 in a patient with undetectable FV levels and only one identified mutation. Follow-up by chip-based digital PCR, long-range PCR and direct sequencing confirmed that this patient carried a heterozygous F5 deletion of 1823 bp extending from intron 8 to intron 10. Bioinformatics sequence analysis pinpointed repetitive elements that might have originated the deletion. In conclusion, we have developed and validated an MLPA assay for the detection of gross F5 gene rearrangements. This assay may represent a valuable tool for the molecular diagnosis of FV deficiency. © 2014 John Wiley & Sons Ltd.

  16. Molecular evolution of multiple-level control of heme biosynthesis pathway in animal kingdom.

    PubMed

    Tzou, Wen-Shyong; Chu, Ying; Lin, Tzung-Yi; Hu, Chin-Hwa; Pai, Tun-Wen; Liu, Hsin-Fu; Lin, Han-Jia; Cases, Ildeofonso; Rojas, Ana; Sanchez, Mayka; You, Zong-Ye; Hsu, Ming-Wei

    2014-01-01

    Adaptation of enzymes in a metabolic pathway can occur not only through changes in amino acid sequences but also through variations in transcriptional activation, mRNA splicing and mRNA translation. The heme biosynthesis pathway, a linear pathway comprised of eight consecutive enzymes in animals, provides researchers with ample information for multiple types of evolutionary analyses performed with respect to the position of each enzyme in the pathway. Through bioinformatics analysis, we found that the protein-coding sequences of all enzymes in this pathway are under strong purifying selection, from cnidarians to mammals. However, loose evolutionary constraints are observed for enzymes in which self-catalysis occurs. Through comparative genomics, we found that in animals, the first intron of the enzyme-encoding genes has been co-opted for transcriptional activation of the genes in this pathway. Organisms sense the cellular content of iron, and through iron-responsive elements in the 5' untranslated regions of mRNAs and the intron-exon boundary regions of pathway genes, translational inhibition and exon choice in enzymes may be enabled, respectively. Pathway product (heme)-mediated negative feedback control can affect the transport of pathway enzymes into the mitochondria as well as the ubiquitin-mediated stability of enzymes. Remarkably, the positions of these controls on pathway activity are not ubiquitous but are biased towards the enzymes in the upstream portion of the pathway. We revealed that multiple-level controls on the activity of the heme biosynthesis pathway depend on the linear depth of the enzymes in the pathway, indicating a new strategy for discovering the molecular constraints that shape the evolution of a metabolic pathway.

  17. Mechanisms and Regulation of Alternative Pre-mRNA Splicing

    PubMed Central

    Lee, Yeon

    2015-01-01

    Precursor messenger RNA (pre-mRNA) splicing is a critical step in the posttranscriptional regulation of gene expression, providing significant expansion of the functional proteome of eukaryotic organisms with limited gene numbers. Split eukaryotic genes contain intervening sequences or introns disrupting protein-coding exons, and intron removal occurs by repeated assembly of a large and highly dynamic ribonucleoprotein complex termed the spliceosome, which is composed of five small nuclear ribonucleoprotein particles, U1, U2, U4/U6, and U5. Biochemical studies over the past 10 years have allowed the isolation as well as compositional, functional, and structural analysis of splicing complexes at distinct stages along the spliceosome cycle. The average human gene contains eight exons and seven introns, producing an average of three or more alternatively spliced mRNA isoforms. Recent high-throughput sequencing studies indicate that 100% of human genes produce at least two alternative mRNA isoforms. Mechanisms of alternative splicing include RNA–protein interactions of splicing factors with regulatory sites termed silencers or enhancers, RNA–RNA base-pairing interactions, or chromatin-based effects that can change or determine splicing patterns. Disease-causing mutations can often occur in splice sites near intron borders or in exonic or intronic RNA regulatory silencer or enhancer elements, as well as in genes that encode splicing factors. Together, these studies provide mechanistic insights into how spliceosome assembly, dynamics, and catalysis occur; how alternative splicing is regulated and evolves; and how splicing can be disrupted by cis- and trans-acting mutations leading to disease states. These findings make the spliceosome an attractive new target for small-molecule, antisense, and genome-editing therapeutic interventions. PMID:25784052

  18. The genetic association study between polymorphisms in uncoupling protein 2 and uncoupling protein 3 and metabolic data in dogs.

    PubMed

    Udagawa, Chihiro; Tada, Naomi; Asano, Junzo; Ishioka, Katsumi; Ochiai, Kazuhiko; Bonkobara, Makoto; Tsuchida, Shuichi; Omi, Toshinori

    2014-12-11

    The uncoupling proteins (UCPs) in the mitochondrial inner membrane are members of the mitochondrial anion carrier protein family that play an important role in energy homeostasis. Genetic association studies have shown that human UCP2 and UCP3 variants (SNPs and indels) are associated with obesity, insulin resistance, type 2 diabetes mellitus, and metabolic syndrome. The aim of this study was to examine the genetic association between polymorphisms in UCP2 and UCP3 and metabolic data in dogs. We identified 10 SNPs (9 intronic and 1 exonic) and 4 indels (intronic) in UCP2, and 13 SNPs (11 intronic and 2 exonic) and one indel (exonic) in UCP3, by DNA sequence analysis of 11 different dog breeds (n=119). An association study between these UCP2 and UCP3 variants and the biochemical parameters of glucose, total cholesterol, lactate dehydrogenase and triglyceride in Labrador Retrievers (n=50) showed that none of the UCP2 polymorphisms were significantly associated with the levels of these parameters. However, four UCP3 SNPs (intron 1) were significantly associated with total cholesterol levels. In addition, the allele frequencies of two of the four SNPs associated with higher total cholesterol levels in a breed that is susceptible to hypercholesterolemia (Shetland Sheepdogs, n=30), compared with the control breed (Shiba, n=30). The results obtained from a limited number of individuals suggest that the UCP3 gene in dogs may be associated with total cholesterol levels. The examination of larger sample sizes and further analysis will lead to increased precision of these results.

  19. Analysis for complete genomic sequence of HLA-B and HLA-C alleles in the Chinese Han population.

    PubMed

    Zhu, F; He, Y; Zhang, W; He, J; He, J; Xu, X; Lv, H; Yan, L

    2011-08-01

    In the present study, we have determined the complete genomic sequence and analysed the intron polymorphism of partial HLA-B and HLA-C alleles in the Chinese Han population. Over 3.0 kb DNA fragments of HLA-B and HLA-C loci were amplified by polymerase chain reaction from partial 5' untranslated region to 3' noncoding region respectively, and then the amplified products were sequenced. Full-length nucleotide sequences of 14 HLA-B alleles and 10 HLA-C alleles were obtained and have been submitted to GenBank and IMGT/HLA database. Two novel alleles of HLA-B*52:01:01:02 and HLA-B*59:01:01:02 were identified, and the complete genomic sequence of HLA-B*52:01:01:01 was firstly reported. Totally 157 and 167 polymorphism positions were found in the full-length genomic sequence of HLA-B and HLA-C loci respectively. Our results suggested that many single nucleotide polymorphisms existed in the exon and intron regions, and the data can provide useful information for understanding the evolution of HLA-B and HLA-C alleles. © 2011 Blackwell Publishing Ltd.

  20. Molecular evolution of Adh and LEAFY and the phylogenetic utility of their introns in Pyrus (Rosaceae)

    PubMed Central

    2011-01-01

    Background The genus Pyrus belongs to the tribe Pyreae (the former subfamily Maloideae) of the family Rosaceae, and includes one of the most important commercial fruit crops, pear. The phylogeny of Pyrus has not been definitively reconstructed. In our previous efforts, the internal transcribed spacer region (ITS) revealed a poorly resolved phylogeny due to non-concerted evolution of nrDNA arrays. Therefore, introns of low copy nuclear genes (LCNG) are explored here for improved resolution. However, paralogs and lineage sorting are still two challenges for applying LCNGs in phylogenetic studies, and at least two independent nuclear loci should be compared. In this work the second intron of LEAFY and the alcohol dehydrogenase gene (Adh) were selected to investigate their molecular evolution and phylogenetic utility. Results DNA sequence analyses revealed a complex ortholog and paralog structure of Adh genes in Pyrus and Malus, the pears and apples. Comparisons between sequences from RT-PCR and genomic PCR indicate that some Adh homologs are putatively nonfunctional. A partial region of Adh1 was sequenced for 18 Pyrus species and three subparalogs representing Adh1-1 were identified. These led to poorly resolved phylogenies due to low sequence divergence and the inclusion of putative recombinants. For the second intron of LEAFY, multiple inparalogs were discovered for both LFY1int2 and LFY2int2. LFY1int2 is inadequate for phylogenetic analysis due to lineage sorting of two inparalogs. LFY2int2-N, however, showed a relatively high sequence divergence and led to the best-resolved phylogeny. This study documents the coexistence of outparalogs and inparalogs, and lineage sorting of these paralogs and orthologous copies. It reveals putative recombinants that can lead to incorrect phylogenetic inferences, and presents an improved phylogenetic resolution of Pyrus using LFY2int2-N. Conclusions Our study represents the first phylogenetic analyses based on LCNGs in Pyrus. Ancient and recent duplications lead to a complex structure of Adh outparalogs and inparalogs in Pyrus and Malus, resulting in neofunctionalization, nonfunctionalization and possible subfunctionalization. Among all investigated orthologs, LFY2int2-N is the best nuclear marker for phylogenetic reconstruction of Pyrus due to suitable sequence divergence and the absence of lineage sorting. PMID:21917170

  1. Characterization of the molecular basis of group II intron RNA recognition by CRS1-CRM domains.

    PubMed

    Keren, Ido; Klipcan, Liron; Bezawork-Geleta, Ayenachew; Kolton, Max; Shaya, Felix; Ostersetzer-Biran, Oren

    2008-08-22

    CRM (chloroplast RNA splicing and ribosome maturation) is a recently recognized RNA-binding domain of ancient origin that has been retained in eukaryotic genomes only within the plant lineage. Whereas in bacteria CRM domains exist as single domain proteins involved in ribosome maturation, in plants they are found in a family of proteins that contain between one and four repeats. Several members of this family with multiple CRM domains have been shown to be required for the splicing of specific plastidic group II introns. Detailed biochemical analysis of one of these factors in maize, CRS1, demonstrated its high affinity and specific binding to the single group II intron whose splicing it facilitates, the plastid-encoded atpF intron RNA. Through its association with two intronic regions, CRS1 guides the folding of atpF intron RNA into its predicted "catalytically active" form. To understand how multiple CRM domains cooperate to achieve high affinity sequence-specific binding to RNA, we analyzed the RNA binding affinity and specificity associated with each individual CRM domain in CRS1; whereas CRM3 bound tightly to the RNA, CRM1 associated specifically with a unique region found within atpF intron domain I. CRM2, which demonstrated only low binding affinity, also seems to form specific interactions with regions localized to domains I, III, and IV. We further show that CRM domains share structural similarities and RNA binding characteristics with the well known RNA recognition motif domain.

  2. The role of transposable elements in the evolution of non-mammalian vertebrates and invertebrates

    PubMed Central

    2010-01-01

    Background Transposable elements (TEs) have played an important role in the diversification and enrichment of mammalian transcriptomes through various mechanisms such as exonization and intronization (the birth of new exons/introns from previously intronic/exonic sequences, respectively), and insertion into first and last exons. However, no extensive analysis has compared the effects of TEs on the transcriptomes of mammals, non-mammalian vertebrates and invertebrates. Results We analyzed the influence of TEs on the transcriptomes of five species, three invertebrates and two non-mammalian vertebrates. Compared to previously analyzed mammals, there were lower levels of TE introduction into introns, significantly lower numbers of exonizations originating from TEs and a lower percentage of TE insertion within the first and last exons. Although the transcriptomes of vertebrates exhibit significant levels of exonization of TEs, only anecdotal cases were found in invertebrates. In vertebrates, as in mammals, the exonized TEs are mostly alternatively spliced, indicating that selective pressure maintains the original mRNA product generated from such genes. Conclusions Exonization of TEs is widespread in mammals, less so in non-mammalian vertebrates, and very low in invertebrates. We assume that the exonization process depends on the length of introns. Vertebrates, unlike invertebrates, are characterized by long introns and short internal exons. Our results suggest that there is a direct link between the length of introns and exonization of TEs and that this process became more prevalent following the appearance of mammals. PMID:20525173

  3. Cloning and characterization of the nagA gene that encodes beta-n-acetylglucosaminidase from Aspergillus nidulans and its expression in Aspergillus oryzae.

    PubMed

    Kim, Sunhwa; Matsuo, Ichiro; Ajisaka, Katsumi; Nakajima, Harushi; Kitamoto, Katsuhiko

    2002-10-01

    We isolated a beta-N-acetylglucosaminidase encoding gene and its cDNA from the filamentous fungus Aspergillus nidulans, and designated it nagA. The nagA gene contained no intron and encoded a polypeptide of 603 amino acids with a putative 19-amino acid signal sequence. The deduced amino acid sequence was very similar to the sequence of Candida albicans Hex1 and Trichoderma harzianum Nag1. Yeast cells containing the nagA cDNA under the control of the GAL1 promoter expressed beta-N-acetylglucosaminidase activity. The chromosomal nagA gene of A. nidulans was disrupted by replacement with the argB marker gene. The disruptant strains expressed low levels of beta-N-acetylglucosaminidase activity and showed poor growth on a medium containing chitobiose as a carbon source. Aspergillus oryzae strain carrying the nagA gene under the control of the improved glaA promoter produced large amounts of beta-N-acetylglucosaminidase in a wheat bran solid culture.

  4. Big and slow: phylogenetic estimates of molecular evolution in baleen whales (suborder mysticeti).

    PubMed

    Jackson, J A; Baker, C S; Vant, M; Steel, D J; Medrano-González, L; Palumbi, S R

    2009-11-01

    Baleen whales are the largest animals that have ever lived. To develop an improved estimation of substitution rate for nuclear and mitochondrial DNA for this taxon, we implemented a relaxed-clock phylogenetic approach using three fossil calibration dates: the divergence between odontocetes and mysticetes approximately 34 million years ago (Ma), between the balaenids and balaenopterids approximately 28 Ma, and the time to most recent common ancestor within the Balaenopteridae approximately 12 Ma. We examined seven mitochondrial genomes, a large number of mitochondrial control region sequences (219 haplotypes for 465 bp) and nine nuclear introns representing five species of whales, within which multiple species-specific alleles were sequenced to account for within-species diversity (1-15 for each locus). The total data set represents >1.65 Mbp of mitogenome and nuclear genomic sequence. The estimated substitution rate for the humpback whale control region (3.9%/million years, My) was higher than previous estimates for baleen whales but slow relative to other mammal species with similar generation times (e.g., human-chimp mean rate > 20%/My). The mitogenomic third codon position rate was also slow relative to other mammals (mean estimate 1%/My compared with a mammalian average of 9.8%/My for the cytochrome b gene). The mean nuclear genomic substitution rate (0.05%/My) was substantially slower than average synonymous estimates for other mammals (0.21-0.37%/My across a range of studies). The nuclear and mitogenome rate estimates for baleen whales were thus roughly consistent with an 8- to 10-fold slowing due to a combination of large body size and long generation times. Surprisingly, despite the large data set of nuclear intron sequences, there was only weak and conflicting support for alternate hypotheses about the phylogeny of balaenopterid whales, suggesting that interspecies introgressions or a rapid radiation has obscured species relationships in the nuclear genome.

  5. Isolated familial somatotropinomas: clinical features and analysis of the MEN1 gene.

    PubMed

    De Menis, Ernesto; Prezant, Toni R

    2002-01-01

    Isolated familial somatotropinomas (IFS) rarely occurs in the absence of multiple endocrine neoplasia type I (MEN1) or the Carney complex. In the present study we report two Italian siblings affected by GH-secreting adenomas. There was no history of parental consanguinity. The sister presented at 18 years of age with secondary amenorrhea and acromegalic features and one of her two brothers presented with gigantism at the same age. Endocrinological investigations confirmed GH hypersecretion in both cases. Although a pituitary microadenoma was detected in both patients, transsphenoidal surgery was not successful. The sister received conventional radiotherapy and acromegaly is now considered controlled; the brother is being treated with octreotide LAR 30 mg monthly and the disease is considered clinically active. Patients, their parents and the unaffected brother underwent extensive evaluation, and no features of MEN1 or Carney complex were found. Analysis of polymorphic microsatellite markers from chromosome 11q13 (D11S599, D11S4945, D11S4939, D11S4938 and D11S987) showed that the acromegalic siblings had inherited different maternal chromosomes and shared the paternal chromosome. No pathogenic MEN1 sequence changes were detected by sequencing or dideoxy fingerprinting of the coding sequence (exons 2-10) and exon/intron junctions. Although mutations in the promoter, introns or untranslated regions of the MEN1 gene cannot be excluded, germline mutations within the coding region of this gene do not appear responsible for IFS in this family.

  6. A novel pathogenic splice acceptor site germline mutation in intron 14 of the APC gene in a Chinese family with familial adenomatous polyposis.

    PubMed

    Wang, Dan; Liang, Shengyun; Zhang, Zhao; Zhao, Guoru; Hu, Yuan; Liang, Shengran; Zhang, Xipeng; Banerjee, Santasree

    2017-03-28

    Familial adenomatous polyposis (FAP) is an autosomal dominant precancerous condition, clinically characterized by the presence of multiple colorectal adenomas or polyps. Patients with FAP has a high risk of developing colorectal cancer (CRC) from these colorectal adenomatous polyps by the mean age of diagnosis at 40 years. Germline mutations of the APC gene cause familial adenomatous polyposis (FAP). Colectomy has recommended for the FAP patients with significant polyposis. Here, we present a clinical molecular study of a four generation Chinese family with FAP. Clinical diagnosis of FAP has been done according to the phenotype, family history and medical records. Patient's blood samples were collected and genomic DNA was extracted. In order to identify the pathogenic mutation underlying the disease phenotype targeted next-generation sequencing and confirmatory sanger sequencing has undertaken. Targeted next generation sequencing identified a novel heterozygous splice-acceptor site mutation [c.1744-1G>A] in intron 14 of APC gene, which is co-segregated with the FAP phenotypes in the proband and amongst all the affected family members. This mutation is not present in unaffected family members and in normal healthy controls of same ethnic origin. According to the LOVD database for Chinese colorectal cancer patients, in Chinese population, 60% of the previously reported APC gene mutations causes FAP, are missense mutations. This novel splice-acceptor site mutation causing FAP in this Chinese family expands the germline mutation spectrum of the APC gene in the Chinese population.

  7. Post-transcriptional regulation mediated by specific neurofilament introns in vivo.

    PubMed

    Wang, Chen; Szaro, Ben G

    2016-04-01

    Neurons regulate genes post-transcriptionally to coordinate the supply of cytoskeletal proteins, such as the medium neurofilament (NEFM), with demand for structural materials in response to extracellular cues encountered by developing axons. By using a method for evaluating functionality of cis-regulatory gene elements in vivo through plasmid injection into Xenopus embryos, we discovered that splicing of a specific nefm intron was required for robust transgene expression, regardless of promoter or cell type. Transgenes utilizing the nefm 3'-UTR but substituting other nefm introns expressed little or no protein owing to defects in handling of the messenger (m)RNA as opposed to transcription or splicing. Post-transcriptional events at multiple steps, but mainly during nucleocytoplasmic export, contributed to these varied levels of protein expression. An intron of the β-globin gene was also able to promote expression in a manner identical to that of the nefm intron, implying a more general preference for certain introns in controlling nefm expression. These results expand our knowledge of intron-mediated gene expression to encompass neurofilaments, indicating an additional layer of complexity in the control of a cytoskeletal gene needed for developing and maintaining healthy axons. © 2016. Published by The Company of Biologists Ltd.

  8. Four novel cystic fibrosis mutations in splice junction sequences affecting the CFTR nucleotide binding folds

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Doerk, T.; Wulbrand, U.; Tuemmler, B.

    1993-03-01

    Single cases of the four novel splice site mutations 1525[minus]1 G [r arrow] A (intron 9), 3601[minus]2 A [r arrow] G (intron 18), 3850[minus]3 T [r arrow] G (intron 19), and 4374+1 G [r arrow] T (intron 23) were detected in the CFTR gene of cystic fibrosis patients of Indo-Iranian, Turkish, Polish, and Germany descent. The nucleotide substitutions at the +1, [minus]1, and [minus]2 positions all destroy splice sites and lead to severe disease alleles associated with features typical of gastrointestinal and pulmonary cystic fibrosis disease. The 3850[minus]3 T-to-G change was discovered in a very mildly affected 33-year-old [Delta]F508 compoundmore » heterozygote, suggesting that the T-to-G transversion at the less conserved [minus]3 position of the acceptor splice site may retain some wildtype function. 13 refs., 1 fig., 2 tabs.« less

  9. Molecular evolution of flavonoid dioxygenases in the family Apiaceae.

    PubMed

    Gebhardt, Yvonne; Witte, Simone; Forkmann, Gert; Lukacin, Richard; Matern, Ulrich; Martens, Stefan

    2005-06-01

    Plant species of the family Apiaceae are known to accumulate flavonoids mainly in the form of flavones and flavonols. Three 2-oxoglutarate-dependent dioxygenases, flavone synthase or flavanone 3 beta-hydroxylase and flavonol synthase are involved in the biosynthesis of these secondary metabolites. The corresponding genes were cloned recently from parsley (Petroselinum crispum) leaves. Flavone synthase I appears to be confined to the Apiaceae, and the unique occurrence as well as its high sequence similarity to flavanone 3beta-hydroxylase laid the basis for evolutionary studies. In order to examine the relationship of these two enzymes throughout the Apiaceae, RT-PCR based cloning and functional identification of flavone synthases I or flavanone 3beta-hydroxylases were accomplished from Ammi majus, Anethum graveolens, Apium graveolens, Pimpinella anisum, Conium maculatum and Daucus carota, yielding three additional synthase and three additional hydroxylase cDNAs. Molecular and phylogenetic analyses of these sequences were compatible with the phylogeny based on morphological characteristics and suggested that flavone synthase I most likely resulted from gene duplication of flavanone 3beta-hydroxylase, and functional diversification at some point during the development of the apiaceae subfamilies. Furthermore, the genomic sequences from Petroselinum crispum and Daucus carota revealed two introns in each of the synthases and a lack of introns in the hydroxylases. These results might be explained by intron losses from the hydroxylases occurring at a later stage of evolution.

  10. Mutation Spectrum of the ABCA4 Gene in a Greek Cohort with Stargardt Disease: Identification of Novel Mutations and Evidence of Three Prevalent Mutated Alleles

    PubMed Central

    Vassiliki, Kokkinou; George, Koutsodontis; Polixeni, Stamatiou; Christoforos, Giatzakis; Minas, Aslanides Ioannis; Stavrenia, Koukoula; Ioannis, Datseris

    2018-01-01

    Aim To evaluate the frequency and pattern of disease-associated mutations of ABCA4 gene among Greek patients with presumed Stargardt disease (STGD1). Materials and Methods A total of 59 patients were analyzed for ABCA4 mutations using the ABCR400 microarray and PCR-based sequencing of all coding exons and flanking intronic regions. MLPA analysis as well as sequencing of two regions in introns 30 and 36 reported earlier to harbor deep intronic disease-associated variants was used in 4 selected cases. Results An overall detection rate of at least one mutant allele was achieved in 52 of the 59 patients (88.1%). Direct sequencing improved significantly the complete characterization rate, that is, identification of two mutations compared to the microarray analysis (93.1% versus 50%). In total, 40 distinct potentially disease-causing variants of the ABCA4 gene were detected, including six previously unreported potentially pathogenic variants. Among the disease-causing variants, in this cohort, the most frequent was c.5714+5G>A representing 16.1%, while p.Gly1961Glu and p.Leu541Pro represented 15.2% and 8.5%, respectively. Conclusions By using a combination of methods, we completely molecularly diagnosed 48 of the 59 patients studied. In addition, we identified six previously unreported, potentially pathogenic ABCA4 mutations. PMID:29854428

  11. Intron Definition and a Branch Site Adenosine at nt 385 Control RNA Splicing of HPV16 E6*I and E7 Expression

    PubMed Central

    Ajiro, Masahiko; Jia, Rong; Zhang, Lifang; Liu, Xuefeng; Zheng, Zhi-Ming

    2012-01-01

    HPV16 E6 and E7, two viral oncogenes, are expressed from a single bicistronic pre-mRNA. In this report, we provide the evidence that the bicistronic pre-mRNA intron 1 contains three 5′ splice sites (5′ ss) and three 3′ splice sites (3′ ss) normally used in HPV16+ cervical cancer and its derived cell lines. The choice of two novel alternative 5′ ss (nt 221 5′ ss and nt 191 5′ ss) produces two novel isoforms of E6E7 mRNAs (E6*V and E6*VI). The nt 226 5′ ss and nt 409 3′ ss is preferentially selected over the other splice sites crossing over the intron to excise a minimal length of the intron in RNA splicing. We identified AACAAAC as the preferred branch point sequence (BPS) and an adenosine at nt 385 (underlined) in the BPS as a branch site to dictate the selection of the nt 409 3′ ss for E6*I splicing and E7 expression. Introduction of point mutations into the mapped BPS led to reduced U2 binding to the BPS and thereby inhibition of the second step of E6E7 splicing at the nt 409 3′ ss. Importantly, the E6E7 bicistronic RNA with a mutant BPS and inefficient splicing makes little or no E7 and the resulted E6 with mutations of 91QYNK94 to 91PSFW94 displays attenuate activity on p53 degradation. Together, our data provide structural basis of the E6E7 intron 1 for better understanding of how viral E6 and E7 expression is regulated by alternative RNA splicing. This study elucidates for the first time a mapped branch point in HPV16 genome involved in viral oncogene expression. PMID:23056301

  12. Forensic identification of Indian snakeroot (Rauvolfia serpentina Benth. ex Kurz) using DNA barcoding.

    PubMed

    Eurlings, Marcel C M; Lens, Frederic; Pakusza, Csilla; Peelen, Tamara; Wieringa, Jan J; Gravendeel, Barbara

    2013-05-01

    Indian snakeroot (Rauvolfia serpentina) is a valuable forest product, root extracts of which are used as an antihypertensive drug. Increasing demand led to overharvesting in the wild. Control of international trade is hampered by the inability to identify root samples to the species level. We therefore evaluated the potential of molecular identification by searching for species-specific DNA polymorphisms. We found two species-specific indels in the rps16 intron region for R. serpentina. Our DNA barcoding method was tested for its specificity, reproducibility, sensitivity and stability. We included samples of various tissues and ages, which had been treated differently for preservation. DNA extractions were tested in a range of amplification settings and dilutions. Species-specific rps16 intron sequences were obtained from 79 herbarium accessions and one confiscated root, encompassing 39 different species. Our results demonstrate that molecular analysis provides new perspectives for forensic identification of Indian snakeroot. © 2013 American Academy of Forensic Sciences.

  13. Association of ESR1 gene tagging SNPs with breast cancer risk

    PubMed Central

    Dunning, Alison M.; Healey, Catherine S.; Baynes, Caroline; Maia, Ana-Teresa; Scollen, Serena; Vega, Ana; Rodríguez, Raquel; Barbosa-Morais, Nuno L.; Ponder, Bruce A.J.; Low, Yen-Ling; Bingham, Sheila; Haiman, Christopher A.; Le Marchand, Loic; Broeks, Annegien; Schmidt, Marjanka K.; Hopper, John; Southey, Melissa; Beckmann, Matthias W.; Fasching, Peter A.; Peto, Julian; Johnson, Nichola; Bojesen, Stig E.; Nordestgaard, Børge; Milne, Roger L.; Benitez, Javier; Hamann, Ute; Ko, Yon; Schmutzler, Rita K.; Burwinkel, Barbara; Schürmann, Peter; Dörk, Thilo; Heikkinen, Tuomas; Nevanlinna, Heli; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kosma, Veli-Matti; Chen, Xiaoqing; Spurdle, Amanda; Change-Claude, Jenny; Flesch-Janys, Dieter; Couch, Fergus J.; Olson, Janet E.; Severi, Gianluca; Baglietto, Laura; Børresen-Dale, Anne-Lise; Kristensen, Vessela; Hunter, David J.; Hankinson, Susan E.; Devilee, Peter; Vreeswijk, Maaike; Lissowska, Jolanta; Brinton, Louise; Liu, Jianjun; Hall, Per; Kang, Daehee; Yoo, Keun-Young; Shen, Chen-Yang; Yu, Jyh-Cherng; Anton-Culver, Hoda; Ziogoas, Argyrios; Sigurdson, Alice; Struewing, Jeff; Easton, Douglas F.; Garcia-Closas, Montserrat; Humphreys, Manjeet K.; Morrison, Jonathan; Pharoah, Paul D.P.; Pooley, Karen A.; Chenevix-Trench, Georgia

    2009-01-01

    We have conducted a three-stage, comprehensive single nucleotide polymorphism (SNP)-tagging association study of ESR1 gene variants (SNPs) in more than 55 000 breast cancer cases and controls from studies within the Breast Cancer Association Consortium (BCAC). No large risks or highly significant associations were revealed. SNP rs3020314, tagging a region of ESR1 intron 4, is associated with an increase in breast cancer susceptibility with a dominant mode of action in European populations. Carriers of the c-allele have an odds ratio (OR) of 1.05 [95% Confidence Intervals (CI) 1.02–1.09] relative to t-allele homozygotes, P = 0.004. There is significant heterogeneity between studies, P = 0.002. The increased risk appears largely confined to oestrogen receptor-positive tumour risk. The region tagged by SNP rs3020314 contains sequence that is more highly conserved across mammalian species than the rest of intron 4, and it may subtly alter the ratio of two mRNA splice forms. PMID:19126777

  14. A novel splicing mutation in GALT gene causing Galactosemia in Ecuadorian family.

    PubMed

    De Lucca, M; Barba, C; Casique, L

    2017-07-01

    Classic Galactosemia (OMIM 230400) is an autosomal recessive disorder of galactose metabolism caused by mutations in the galactose-1-phosphate uridyl transferase (GALT) gene. This disease caused by the inability to metabolize galactose is potentially life-threatening but its pathophysiology has not been clearly defined. GALT gene presents high allelic heterogeneity and around 336 variations have been identified. Here, we report the case of a patient with Classic Galactosemia who was detected during a neonatal screening in Ecuador. Molecular study revealed a mutation in GALT gene intron 1, c.82+3A>G in homozygous condition, this mutation has not been previously reported. This gene variation was not found in any of the 119 healthy Ecuadorian individuals used as control. Furthermore, the mutation was the only alteration detected in the propositus's GALT after sequencing all exons and introns of this gene. In silico modeling predicted that the mutation was pathogenic. Copyright © 2017. Published by Elsevier B.V.

  15. Gene relocations within chloroplast genomes of Jasminum and Menodora (Oleaceae) are due to multiple, overlapping inversions.

    PubMed

    Lee, Hae-Lim; Jansen, Robert K; Chumley, Timothy W; Kim, Ki-Joong

    2007-05-01

    The chloroplast (cp) DNA sequence of Jasminum nudiflorum (Oleaceae-Jasmineae) is completed and compared with the large single-copy region sequences from 6 related species. The cp genomes of the tribe Jasmineae (Jasminum and Menodora) show several distinctive rearrangements, including inversions, gene duplications, insertions, inverted repeat expansions, and gene and intron losses. The ycf4-psaI region in Jasminum section Primulina was relocated as a result of 2 overlapping inversions of 21,169 and 18,414 bp. The 1st, larger inversion is shared by all members of the Jasmineae indicating that it occurred in the common ancestor of the tribe. Similar rearrangements were also identified in the cp genome of Menodora. In this case, 2 fragments including ycf4 and rps4-trnS-ycf3 genes were moved by 2 additional inversions of 14 and 59 kb that are unique to Menodora. Other rearrangements in the Oleaceae are confined to certain regions of the Jasminum and Menodora cp genomes, including the presence of highly repeated sequences and duplications of coding and noncoding sequences that are inserted into clpP and between rbcL and psaI. These insertions are correlated with the loss of 2 introns in clpP and a serial loss of segments of accD. The loss of the accD gene and clpP introns in both the monocot family Poaceae and the eudicot family Oleaceae are clearly independent evolutionary events. However, their genome organization is surprisingly similar despite the distant relationship of these 2 angiosperm families.

  16. Trans splicing in Leishmania enriettii and identification of ribonucleoprotein complexes containing the spliced leader and U2 equivalent RNAs

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Miller, S.I.; Wirth, D.F.

    1988-06-01

    The 5' ends of Leishmania mRNAs contain an identical 35-nucleotide sequence termed the spliced leader (SL) or 5' mini-exon. The SL sequence is at the 5' end of an 85-nucleotide primary transcript that contains a consensus eucaryotic 5' intron-exon splice junction immediately 3' to the SL. The SL is added to protein-coding genes immediately 3' to a consensus eucaryotic 3' intron-exon splice junction. The authors' previous work demonstrated possible intermediates in discontinuous mRNA processing that contain the 50 nucleotides of the SL primary transcript 3' to the SL, the SL intron sequence (SLIS). These RNAs have a 5' terminus atmore » the splice junction of the SL and the SLIS. The authors examined a Leishmania nuclear extract for these RNAs in ribonucleoprotein (RNP) particles. Density centrifugation analysis showed that the SL RNA is predominately in RNP complexes at 60S, while the SLIS-containing RNAs are in complexes at 40S. They also demonstrated that the SLIS can be released from polyadenylated RNA by incubation with a HeLa cell extract containing debranching enzymatic activity. These data suggested that Leishmania enriettii mRNAs are assembled by bimolecular or trans splicing as has been recently demonstrated for Trypanosoma brucei. Furthermore, they determined the partial sequence of the Leishmania U2 equivalent RNA and demonstrated that it cosediments with the SL RNA at 60S in a nuclear extract. These RNP particles may be analogous to so-called spliceosomes that have been demonstrated in other systems.« less

  17. Molecular characterization of a nuclear topoisomerase II from Nicotiana tabacum that functionally complements a temperature-sensitive topoisomerase II yeast mutant.

    PubMed

    Singh, B N; Mudgil, Yashwanti; Sopory, S K; Reddy, M K

    2003-07-01

    We have successfully expressed enzymatically active plant topoisomerase II in Escherichia coli for the first time, which has enabled its biochemical characterization. Using a PCR-based strategy, we obtained a full-length cDNA and the corresponding genomic clone of tobacco topoisomerase II. The genomic clone has 18 exons interrupted by 17 introns. Most of the 5' and 3' splice junctions follow the typical canonical consensus dinucleotide sequence GU-AG present in other plant introns. The position of introns and phasing with respect to primary amino acid sequence in tobacco TopII and Arabidopsis TopII are highly conserved, suggesting that the two genes are evolved from the common ancestral type II topoisomerase gene. The cDNA encodes a polypeptide of 1482 amino acids. The primary amino acid sequence shows a striking sequence similarity, preserving all the structural domains that are conserved among eukaryotic type II topoisomerases in an identical spatial order. We have expressed the full-length polypeptide in E. coli and purified the recombinant protein to homogeneity. The full-length polypeptide relaxed supercoiled DNA and decatenated the catenated DNA in a Mg(2+)- and ATP-dependent manner, and this activity was inhibited by 4'-(9-acridinylamino)-3'-methoxymethanesulfonanilide (m-AMSA). The immunofluorescence and confocal microscopic studies, with antibodies developed against the N-terminal region of tobacco recombinant topoisomerase II, established the nuclear localization of topoisomerase II in tobacco BY2 cells. The regulated expression of tobacco topoisomerase II gene under the GAL1 promoter functionally complemented a temperature-sensitive TopII(ts) yeast mutant.

  18. LEDGF/p75 interacts with mRNA splicing factors and targets HIV-1 integration to highly spliced genes

    PubMed Central

    Singh, Parmit Kumar; Plumb, Matthew R.; Ferris, Andrea L.; Iben, James R.; Wu, Xiaolin; Fadel, Hind J.; Luke, Brian T.; Esnault, Caroline; Poeschla, Eric M.; Hughes, Stephen H.; Kvaratskhelia, Mamuka; Levin, Henry L.

    2015-01-01

    The host chromatin-binding factor LEDGF/p75 interacts with HIV-1 integrase and directs integration to active transcription units. To understand how LEDGF/p75 recognizes transcription units, we sequenced 1 million HIV-1 integration sites isolated from cultured HEK293T cells. Analysis of integration sites showed that cancer genes were preferentially targeted, raising concerns about using lentivirus vectors for gene therapy. Additional analysis led to the discovery that introns and alternative splicing contributed significantly to integration site selection. These correlations were independent of transcription levels, size of transcription units, and length of the introns. Multivariate analysis with five parameters previously found to predict integration sites showed that intron density is the strongest predictor of integration density in transcription units. Analysis of previously published HIV-1 integration site data showed that integration density in transcription units in mouse embryonic fibroblasts also correlated strongly with intron number, and this correlation was absent in cells lacking LEDGF. Affinity purification showed that LEDGF/p75 is associated with a number of splicing factors, and RNA sequencing (RNA-seq) analysis of HEK293T cells lacking LEDGF/p75 or the LEDGF/p75 integrase-binding domain (IBD) showed that LEDGF/p75 contributes to splicing patterns in half of the transcription units that have alternative isoforms. Thus, LEDGF/p75 interacts with splicing factors, contributes to exon choice, and directs HIV-1 integration to transcription units that are highly spliced. PMID:26545813

  19. The complete chloroplast genome of Sinopodophyllum hexandrum Ying (Berberidaceae).

    PubMed

    Meng, Lihua; Liu, Ruijuan; Chen, Jianbing; Ding, Chenxu

    2017-05-01

    The complete nucleotide sequence of the Sinopodophyllum hexandrum Ying chloroplast genome (cpDNA) was determined based on next-generation sequencing technologies in this study. The genome was 157 203 bp in length, containing a pair of inverted repeat (IRa and IRb) regions of 25 960 bp, which were separated by a large single-copy (LSC) region of 87 065 bp and a small single-copy (SSC) region of 18 218 bp, respectively. The cpDNA contained 148 genes, including 96 protein-coding genes, 8 ribosomal RNA genes, and 44 tRNA genes. In these genes, eight harbored a single intron, and two (ycf3 and clpP) contained a couple of introns. The cpDNA AT content of S. hexandrum cpDNA is 61.5%.

  20. Phylogeny of lion tamarins (Leontopithecus spp) based on interphotoreceptor retinol binding protein intron sequences.

    PubMed

    Mundy, N I; Kelly, J

    2001-05-01

    The evolutionary relationships of the lion tamarins (Leontopithecus) were investigated using nuclear interphotoreceptor retinol binding protein (IRBP) intron sequences. Phylogenetic reconstructions strongly support the monophyly of the genus, and a sister relationship between the golden lion tamarin, Leontopithecus rosalia, and the black lion tamarin, L. chrysopygus, to the exclusion of the golden-headed lion tamarin, L. chrysomelas. The most parsimonious evolutionary reconstruction suggests that the ancestral lion tamarin and the common ancestor of L. rosalia and L. chrysopygus had predominantly black coats. This reconstruction is not consistent with a theory of orthogenetic evolution of coat color that was based on coat color evolution in marmosets and tamarins. An alternative reconstruction that is consistent with metachromism requires that ancestral lion tamarins had agouti hairs. Copyright 2001 Wiley-Liss, Inc.

  1. Latency-associated transcript (LAT) exon 1 controls herpes simplex virus species-specific phenotypes: reactivation in the guinea pig genital model and neuron subtype-specific latent expression of LAT.

    PubMed

    Bertke, Andrea S; Patel, Amita; Imai, Yumi; Apakupakul, Kathleen; Margolis, Todd P; Krause, Philip R

    2009-10-01

    Herpes simplex virus 1 (HSV-1) and HSV-2 cause similar acute infections but differ in their abilities to reactivate from trigeminal and lumbosacral dorsal root ganglia. During latency, HSV-1 and HSV-2 also preferentially express their latency-associated transcripts (LATs) in different sensory neuronal subtypes that are positive for A5 and KH10 markers, respectively. Chimeric virus studies showed that LAT region sequences influence both of these viral species-specific phenotypes. To further map the LAT region sequences responsible for these phenotypes, we constructed the chimeric virus HSV2-LAT-E1, in which exon 1 (from the LAT TATA to the intron splice site) was replaced by the corresponding sequence from HSV-1 LAT. In intravaginally infected guinea pigs, HSV2-LAT-E1 reactivated inefficiently relative to the efficiency of its rescuant and wild-type HSV-2, but it yielded similar levels of viral DNA, LAT, and ICP0 during acute and latent infection. HSV2-LAT-E1 preferentially expressed the LAT in A5+ neurons (as does HSV-1), while the chimeric viruses HSV2-LAT-P1 (LAT promoter swap) and HSV2-LAT-S1 (LAT sequence swap downstream of the promoter) exhibited neuron subtype-specific latent LAT expression phenotypes more similar to that of HSV-2 than that of HSV-1. Rescuant viruses displayed the wild-type HSV-2 phenotypes of efficient reactivation in the guinea pig genital model and a tendency to express LAT in KH10+ neurons. The region that is critical for HSV species-specific differences in latency and reactivation thus lies between the LAT TATA and the intron splice site, and minor differences in the 5' ends of chimeric sequences in HSV2-LAT-E1 and HSV2-LAT-S1 point to sequences immediately downstream of the LAT TATA.

  2. Latency-Associated Transcript (LAT) Exon 1 Controls Herpes Simplex Virus Species-Specific Phenotypes: Reactivation in the Guinea Pig Genital Model and Neuron Subtype-Specific Latent Expression of LAT▿

    PubMed Central

    Bertke, Andrea S.; Patel, Amita; Imai, Yumi; Apakupakul, Kathleen; Margolis, Todd P.; Krause, Philip R.

    2009-01-01

    Herpes simplex virus 1 (HSV-1) and HSV-2 cause similar acute infections but differ in their abilities to reactivate from trigeminal and lumbosacral dorsal root ganglia. During latency, HSV-1 and HSV-2 also preferentially express their latency-associated transcripts (LATs) in different sensory neuronal subtypes that are positive for A5 and KH10 markers, respectively. Chimeric virus studies showed that LAT region sequences influence both of these viral species-specific phenotypes. To further map the LAT region sequences responsible for these phenotypes, we constructed the chimeric virus HSV2-LAT-E1, in which exon 1 (from the LAT TATA to the intron splice site) was replaced by the corresponding sequence from HSV-1 LAT. In intravaginally infected guinea pigs, HSV2-LAT-E1 reactivated inefficiently relative to the efficiency of its rescuant and wild-type HSV-2, but it yielded similar levels of viral DNA, LAT, and ICP0 during acute and latent infection. HSV2-LAT-E1 preferentially expressed the LAT in A5+ neurons (as does HSV-1), while the chimeric viruses HSV2-LAT-P1 (LAT promoter swap) and HSV2-LAT-S1 (LAT sequence swap downstream of the promoter) exhibited neuron subtype-specific latent LAT expression phenotypes more similar to that of HSV-2 than that of HSV-1. Rescuant viruses displayed the wild-type HSV-2 phenotypes of efficient reactivation in the guinea pig genital model and a tendency to express LAT in KH10+ neurons. The region that is critical for HSV species-specific differences in latency and reactivation thus lies between the LAT TATA and the intron splice site, and minor differences in the 5′ ends of chimeric sequences in HSV2-LAT-E1 and HSV2-LAT-S1 point to sequences immediately downstream of the LAT TATA. PMID:19641003

  3. Detection of Splice Sites Using Support Vector Machine

    NASA Astrophysics Data System (ADS)

    Varadwaj, Pritish; Purohit, Neetesh; Arora, Bhumika

    Automatic identification and annotation of exon and intron region of gene, from DNA sequences has been an important research area in field of computational biology. Several approaches viz. Hidden Markov Model (HMM), Artificial Intelligence (AI) based machine learning and Digital Signal Processing (DSP) techniques have extensively and independently been used by various researchers to cater this challenging task. In this work, we propose a Support Vector Machine based kernel learning approach for detection of splice sites (the exon-intron boundary) in a gene. Electron-Ion Interaction Potential (EIIP) values of nucleotides have been used for mapping character sequences to corresponding numeric sequences. Radial Basis Function (RBF) SVM kernel is trained using EIIP numeric sequences. Furthermore this was tested on test gene dataset for detection of splice site by window (of 12 residues) shifting. Optimum values of window size, various important parameters of SVM kernel have been optimized for a better accuracy. Receiver Operating Characteristic (ROC) curves have been utilized for displaying the sensitivity rate of the classifier and results showed 94.82% accuracy for splice site detection on test dataset.

  4. Familial retinoblastoma due to intronic LINE-1 insertion causes aberrant and noncanonical mRNA splicing of the RB1 gene.

    PubMed

    Rodríguez-Martín, Carlos; Cidre, Florencia; Fernández-Teijeiro, Ana; Gómez-Mariano, Gema; de la Vega, Leticia; Ramos, Patricia; Zaballos, Ángel; Monzón, Sara; Alonso, Javier

    2016-05-01

    Retinoblastoma (RB, MIM 180200) is the paradigm of hereditary cancer. Individuals harboring a constitutional mutation in one allele of the RB1 gene have a high predisposition to develop RB. Here, we present the first case of familial RB caused by a de novo insertion of a full-length long interspersed element-1 (LINE-1) into intron 14 of the RB1 gene that caused a highly heterogeneous splicing pattern of RB1 mRNA. LINE-1 insertion was inferred by mRNA studies and full-length sequenced by massive parallel sequencing. Some of the aberrant mRNAs were produced by noncanonical acceptor splice sites, a new finding that up to date has not been described to occur upon LINE-1 retrotransposition. Our results clearly show that RNA-based strategies have the potential to detect disease-causing transposon insertions. It also confirms that the incorporation of new genetic approaches, such as massive parallel sequencing, contributes to characterize at the sequence level these unique and exceptional genetic alterations.

  5. Genomic structure and promoter functional analysis of GnRH3 gene in large yellow croaker (Larimichthys crocea).

    PubMed

    Huang, Wei; Zhang, Jianshe; Liao, Zhi; Lv, Zhenming; Wu, Huifei; Zhu, Aiyi; Wu, Changwen

    2016-01-15

    Gonadotropin-releasing hormone III (GnRH3) is considered to be a key neurohormone in fish reproduction control. In the present study, the cDNA and genomic sequences of GnRH3 were cloned and characterized from large yellow croaker Larimichthys crocea. The cDNA encoded a protein of 99 amino acids with four functional motifs. The full-length genome sequence was composed of 3797 nucleotides, including four exons and three introns. Higher identities of amino acid sequences and conserved exon-intron organizations were found between LcGnRH3 and other GnRH3 genes. In addition, some special features of the sequences were detected in partial species. For example, two specific residues (V and A) were found in the family Sciaenidae, and the unique 75-72 bp type of the open reading frame 2 and 3 existed in the family Cyprinidae. Analysis of the 2576 bp promoter fragment of LcGnRH3 showed a number of transcription factor binding sites, such as AP1, CREB, GATA-1, HSF, FOXA2, and FOXL1. Promoter functional analysis using an EGFP reporter fusion in zebrafish larvae presented positive signals in the brain, including the olfactory region, the terminal nerve ganglion, the telencephalon, and the hypothalamus. The expression pattern was generally consistent with the endogenous GnRH3 GFP-expressing transgenic zebrafish lines, but the details were different. These results indicate that the structure and function of LcGnRH3 are generally similar to the other teleost GnRH3 genes, but there exist some distinctions among them. Copyright © 2015 Elsevier B.V. All rights reserved.

  6. Intronic deletions in the SLC34A3 gene: A cautionary tale for mutation analysis of hereditary hypophosphatemic rickets with hypercalciuria

    PubMed Central

    Ichikawa, Shoji; Tuchman, Shamir; Padgett, Leah R.; Gray, Amie K.; Baluarte, H. Jorge; Econs, Michael J.

    2013-01-01

    Hereditary hypophosphatemic rickets with hypercalciuria (HHRH) is a rare metabolic disorder, characterized by hypophosphatemia, variable degrees of rickets/osteomalacia, and hypercalciuria secondary to increased serum 1,25-dihydroxyvitamin D [1,25(OH)2D] levels. HHRH is caused by mutations in the SLC34A3 gene, which encodes sodium-phosphate co-transporter type IIc. A 6 ½-year-old female presented with a history of nephrolithiasis. Her metabolic evaluation revealed increased 24- hour urine calcium excretion with high serum calcium, low intact parathyroid hormone (PTH) levels, and elevated 1,25(OH)2D level. In addition, the patient had low to low-normal serum phosphorus with high urine phosphorus. The patient had normal stature; without rachitic or boney deformities or a history of fractures. Genetic analysis of SLC34A3 revealed the patient to be a compound heterozygote for a novel single base pair deletion in exon 12 (c.1304delG) and 30-base pair deletion in intron 6 (g.1440–1469del). The single-base pair mutation causes a frameshift, which results in premature stop codon. The intronic deletion is likely caused by misalignment of the 4-basepair homologous repeats and results in the truncation of an already small intron to 63 bp, which would impair proper RNA splicing of the intron. This is the fourth unique intronic deletion identified in patients with HHRH, suggesting the frequent occurrence of sequence misalignments in SLC34A3 and the importance of screening introns in patients with HHRH. PMID:24176905

  7. Intronic deletions in the SLC34A3 gene: a cautionary tale for mutation analysis of hereditary hypophosphatemic rickets with hypercalciuria.

    PubMed

    Ichikawa, Shoji; Tuchman, Shamir; Padgett, Leah R; Gray, Amie K; Baluarte, H Jorge; Econs, Michael J

    2014-02-01

    Hereditary hypophosphatemic rickets with hypercalciuria (HHRH) is a rare metabolic disorder, characterized by hypophosphatemia, variable degrees of rickets/osteomalacia, and hypercalciuria secondary to increased serum 1,25-dihydroxyvitamin D [1,25(OH)2D] levels. HHRH is caused by mutations in the SLC34A3 gene, which encodes sodium-phosphate co-transporter type IIc. A 6-1/2-year-old female presented with a history of nephrolithiasis. Her metabolic evaluation revealed increased 24-hour urine calcium excretion with high serum calcium, low intact parathyroid hormone (PTH), and elevated 1,25(OH)2D. In addition, the patient had low to low-normal serum phosphorus with high urine phosphorus. The patient had normal stature; without rachitic or boney deformities or a history of fractures. Genetic analysis of SLC34A3 revealed the patient to be a compound heterozygote for a novel single base pair deletion in exon 12 (c.1304delG) and 30-base pair deletion in intron 6 (g.1440-1469del). The single-base pair mutation causes a frameshift, which results in premature stop codon. The intronic deletion is likely caused by misalignment of the 4-basepair homologous repeats and results in the truncation of an already small intron to 63bp, which would impair proper RNA splicing of the intron. This is the fourth unique intronic deletion identified in patients with HHRH, suggesting the frequent occurrence of sequence misalignments in SLC34A3 and the importance of screening introns in patients with HHRH. © 2013.

  8. An efficient method to find potentially universal population genetic markers, applied to metazoans

    PubMed Central

    2010-01-01

    Background Despite the impressive growth of sequence databases, the limited availability of nuclear markers that are sufficiently polymorphic for population genetics and phylogeography and applicable across various phyla restricts many potential studies, particularly in non-model organisms. Numerous introns have invariant positions among kingdoms, providing a potential source for such markers. Unfortunately, most of the few known EPIC (Exon Primed Intron Crossing) loci are restricted to vertebrates or belong to multigenic families. Results In order to develop markers with broad applicability, we designed a bioinformatic approach aimed at avoiding multigenic families while identifying intron positions conserved across metazoan phyla. We developed a program facilitating the identification of EPIC loci which allowed slight variation in intron position. From the Homolens databases we selected 29 gene families which contained 52 promising introns for which we designed 93 primer pairs. PCR tests were performed on several ascidians, echinoderms, bivalves and cnidarians. On average, 24 different introns per genus were amplified in bilaterians. Remarkably, five of the introns successfully amplified in all of the metazoan genera tested (a dozen genera, including cnidarians). The influence of several factors on amplification success was investigated. Success rate was not related to the phylogenetic relatedness of a taxon to the groups that most influenced primer design, showing that these EPIC markers are extremely conserved in animals. Conclusions Our new method now makes it possible to (i) rapidly isolate a set of EPIC markers for any phylum, even outside the animal kingdom, and thus, (ii) compare genetic diversity at potentially homologous polymorphic loci between divergent taxa. PMID:20836842

  9. Human growth hormone (GH1) gene polymorphism map in a normal-statured adult population

    PubMed Central

    Esteban, Cristina; Audí, Laura; Carrascosa, Antonio; Fernández-Cancio, Mónica; Pérez-Arroyo, Annalisa; Ulied, Angels; Andaluz, Pilar; Arjona, Rosa; Albisu, Marian; Clemente, María; Gussinyé, Miquel; Yeste, Diego

    2007-01-01

    Objective GH1 gene presents a complex map of single nucleotide polymorphisms (SNPs) in the entire promoter, coding and noncoding regions. The aim of the study was to establish the complete map of GH1 gene SNPs in our control normal population and to analyse its association with adult height. Design, subjects and measurements A systematic GH1 gene analysis was designed in a control population of 307 adults of both sexes with height normally distributed within normal range for the same population: −2 standard deviation scores (SDS) to +2 SDS. An analysis was performed on individual and combined genotype associations with adult height. Results Twenty-five SNPs presented a frequency over 1%: 11 in the promoter (P1 to P11), three in the 5′UTR region (P12 to P14), one in exon 1 (P15), three in intron 1 (P16 to P18), two in intron 2 (P19 and P20), two in exon 4 (P21 and P22) and three in intron 4 (P23 to P25). Twenty-nine additional changes with frequencies under 1% were found in 29 subjects. P8, P19, P20 and P25 had not been previously described. P6, P12, P17 and P25 accounted for 6·2% of the variation in adult height (P = 0·0007) in this population with genotypes A/G at P6, G/G at P6 and A/G at P12 decreasing height SDS (−0·063 ± 0·031, −0·693 ± 0·350 and −0·489 ± 0·265, Mean ± SE) and genotypes A/T at P17 and T/G at P25 increasing height SDS (+1·094 ± 0·456 and +1·184 ± 0·432). Conclusions This study established the GH1 gene sequence variation map in a normal adult height control population confirming the high density of SNPs in a relatively small gene. Our study shows that the more frequent SNPs did not significantly contribute to height determination, while only one promoter and two intronic SNPs contributed significantly to it. Studies in larger populations will have to confirm the associations and in vitro functional studies will elucidate the mechanisms involved. Systematic GH1 gene analysis in patients with growth delay and suspected GH deficiency/insufficiency will clarify whether different SNP frequencies and/or the presence of different sequence changes may be associated with phenotypes in them. PMID:17223997

  10. Cloning and characterization of largemouth bass ( Micropterus salmoides) myostatin encoding gene and its promoter

    NASA Astrophysics Data System (ADS)

    Li, Shengjie; Bai, Junjie; Wang, Lin

    2008-08-01

    Myostatin or GDF-8, a member of the transforming growth factor-β (TGF-β) superfamily, has been demonstrated to be a negative regulator of skeletal muscle mass in mammals. In the present study, we obtained a 5.64 kb sequence of myostatin encoding gene and its promoter from largemouth bass ( Micropterus salmoides). The myostatin encoding gene consisted of three exons (488 bp, 371 bp and 1779 bp, respectively) and two introns (390 bp and 855 bp, respectively). The intron-exon boundaries were conservative in comparison with those of mammalian myostatin encoding genes, whereas the size of introns was smaller than that of mammals. Sequence analysis of 1.569 kb of the largemouth bass myostatin gene promoter region revealed that it contained two TATA boxes, one CAAT box and nine putative E-boxes. Putative muscle growth response elements for myocyte enhancer factor 2 (MEF2), serum response factor (SRF), activator protein 1 (AP1), etc., and muscle-specific Mt binding site (MTBF) were also detected. Some of the transcription factor binding sites were conserved among five teleost species. This information will be useful for studying the transcriptional regulation of myostatin in fish.

  11. Molecular Targeting of Prostate Cancer During Androgen Ablation: Inhibition of CHES1/FOXN3

    DTIC Science & Technology

    2013-05-01

    the DNA sequences (~25^6 reads/sample) were mapped to the human genome reference sequence (hg19...tumor the AR has a genomic abnormality, placing the novel sequence 3’ of the transcriptional start site. However, it is unclear if a genomic alteration...exon/intron organization of the CHES1 gene was determined by BLAST analysis of the human genome using the 1,473-bp CHES1 cDNA sequence

  12. Comprehensive phylogenetic analysis of bacterial reverse transcriptases.

    PubMed

    Toro, Nicolás; Nisa-Martínez, Rafael

    2014-01-01

    Much less is known about reverse transcriptases (RTs) in prokaryotes than in eukaryotes, with most prokaryotic enzymes still uncharacterized. Two surveys involving BLAST searches for RT genes in prokaryotic genomes revealed the presence of large numbers of diverse, uncharacterized RTs and RT-like sequences. Here, using consistent annotation across all sequenced bacterial species from GenBank and other sources via RAST, available from the PATRIC (Pathogenic Resource Integration Center) platform, we have compiled the data for currently annotated reverse transcriptases from completely sequenced bacterial genomes. RT sequences are broadly distributed across bacterial phyla, but green sulfur bacteria and cyanobacteria have the highest levels of RT sequence diversity (≤85% identity) per genome. By contrast, phylum Actinobacteria, for which a large number of genomes have been sequenced, was found to have a low RT sequence diversity. Phylogenetic analyses revealed that bacterial RTs could be classified into 17 main groups: group II introns, retrons/retron-like RTs, diversity-generating retroelements (DGRs), Abi-like RTs, CRISPR-Cas-associated RTs, group II-like RTs (G2L), and 11 other groups of RTs of unknown function. Proteobacteria had the highest potential functional diversity, as they possessed most of the RT groups. Group II introns and DGRs were the most widely distributed RTs in bacterial phyla. Our results provide insights into bacterial RT phylogeny and the basis for an update of annotation systems based on sequence/domain homology.

  13. Comprehensive Phylogenetic Analysis of Bacterial Reverse Transcriptases

    PubMed Central

    Toro, Nicolás; Nisa-Martínez, Rafael

    2014-01-01

    Much less is known about reverse transcriptases (RTs) in prokaryotes than in eukaryotes, with most prokaryotic enzymes still uncharacterized. Two surveys involving BLAST searches for RT genes in prokaryotic genomes revealed the presence of large numbers of diverse, uncharacterized RTs and RT-like sequences. Here, using consistent annotation across all sequenced bacterial species from GenBank and other sources via RAST, available from the PATRIC (Pathogenic Resource Integration Center) platform, we have compiled the data for currently annotated reverse transcriptases from completely sequenced bacterial genomes. RT sequences are broadly distributed across bacterial phyla, but green sulfur bacteria and cyanobacteria have the highest levels of RT sequence diversity (≤85% identity) per genome. By contrast, phylum Actinobacteria, for which a large number of genomes have been sequenced, was found to have a low RT sequence diversity. Phylogenetic analyses revealed that bacterial RTs could be classified into 17 main groups: group II introns, retrons/retron-like RTs, diversity-generating retroelements (DGRs), Abi-like RTs, CRISPR-Cas-associated RTs, group II-like RTs (G2L), and 11 other groups of RTs of unknown function. Proteobacteria had the highest potential functional diversity, as they possessed most of the RT groups. Group II introns and DGRs were the most widely distributed RTs in bacterial phyla. Our results provide insights into bacterial RT phylogeny and the basis for an update of annotation systems based on sequence/domain homology. PMID:25423096

  14. Simultaneous mutation detection of three homoeologous genes in wheat by High Resolution Melting analysis and Mutation Surveyor.

    PubMed

    Dong, Chongmei; Vincent, Kate; Sharp, Peter

    2009-12-04

    TILLING (Targeting Induced Local Lesions IN Genomes) is a powerful tool for reverse genetics, combining traditional chemical mutagenesis with high-throughput PCR-based mutation detection to discover induced mutations that alter protein function. The most popular mutation detection method for TILLING is a mismatch cleavage assay using the endonuclease CelI. For this method, locus-specific PCR is essential. Most wheat genes are present as three similar sequences with high homology in exons and low homology in introns. Locus-specific primers can usually be designed in introns. However, it is sometimes difficult to design locus-specific PCR primers in a conserved region with high homology among the three homoeologous genes, or in a gene lacking introns, or if information on introns is not available. Here we describe a mutation detection method which combines High Resolution Melting (HRM) analysis of mixed PCR amplicons containing three homoeologous gene fragments and sequence analysis using Mutation Surveyor software, aimed at simultaneous detection of mutations in three homoeologous genes. We demonstrate that High Resolution Melting (HRM) analysis can be used in mutation scans in mixed PCR amplicons containing three homoeologous gene fragments. Combining HRM scanning with sequence analysis using Mutation Surveyor is sensitive enough to detect a single nucleotide mutation in the heterozygous state in a mixed PCR amplicon containing three homoeoloci. The method was tested and validated in an EMS (ethylmethane sulfonate)-treated wheat TILLING population, screening mutations in the carboxyl terminal domain of the Starch Synthase II (SSII) gene. Selected identified mutations of interest can be further analysed by cloning to confirm the mutation and determine the genomic origin of the mutation. Polyploidy is common in plants. Conserved regions of a gene often represent functional domains and have high sequence similarity between homoeologous loci. The method described here is a useful alternative to locus-specific based methods for screening mutations in conserved functional domains of homoeologous genes. This method can also be used for SNP (single nucleotide polymorphism) marker development and eco-TILLING in polyploid species.

  15. Mutually Exclusive Splicing of the Insect Dscam Pre-mRNA Directed by Competing Intronic RNA Secondary Structures

    PubMed Central

    Graveley, Brenton R.

    2008-01-01

    Summary Drosophila Dscam encodes 38,016 distinct axon guidance receptors through the mutually exclusive alternative splicing of 95 variable exons. Importantly, known mechanisms that ensure the mutually exclusive splicing of pairs of exons cannot explain this phenomenon in Dscam. I have identified two classes of conserved elements in the Dscam exon 6 cluster, which contains 48 alternative exons—the docking site, located in the intron downstream of constitutive exon 5, and the selector sequences, which are located upstream of each exon 6 variant. Strikingly, each selector sequence is complementary to a portion of the docking site, and this pairing juxtaposes one, and only one, alternative exon to the upstream constitutive exon. The mutually exclusive nature of the docking site:selector sequence interactions suggests that the formation of these competing RNA structures is a central component of the mechanism guaranteeing that only one exon 6 variant is included in each Dscam mRNA. PMID:16213213

  16. Gene-Based Single Nucleotide Polymorphism Markers for Genetic and Association Mapping in Common Bean

    PubMed Central

    2012-01-01

    Background In common bean, expressed sequence tags (ESTs) are an underestimated source of gene-based markers such as insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). However, due to the nature of these conserved sequences, detection of markers is difficult and portrays low levels of polymorphism. Therefore, development of intron-spanning EST-SNP markers can be a valuable resource for genetic experiments such as genetic mapping and association studies. Results In this study, a total of 313 new gene-based markers were developed at target genes. Intronic variation was deeply explored in order to capture more polymorphism. Introns were putatively identified after comparing the common bean ESTs with the soybean genome, and the primers were designed over intron-flanking regions. The intronic regions were evaluated for parental polymorphisms using the single strand conformational polymorphism (SSCP) technique and Sequenom MassARRAY system. A total of 53 new marker loci were placed on an integrated molecular map in the DOR364 × G19833 recombinant inbred line (RIL) population. The new linkage map was used to build a consensus map, merging the linkage maps of the BAT93 × JALO EEP558 and DOR364 × BAT477 populations. A total of 1,060 markers were mapped, with a total map length of 2,041 cM across 11 linkage groups. As a second application of the generated resource, a diversity panel with 93 genotypes was evaluated with 173 SNP markers using the MassARRAY-platform and KASPar technology. These results were coupled with previous SSR evaluations and drought tolerance assays carried out on the same individuals. This agglomerative dataset was examined, in order to discover marker-trait associations, using general linear model (GLM) and mixed linear model (MLM). Some significant associations with yield components were identified, and were consistent with previous findings. Conclusions In short, this study illustrates the power of intron-based markers for linkage and association mapping in common bean. The utility of these markers is discussed in relation with the usefulness of microsatellites, the molecular markers by excellence in this crop. PMID:22734675

  17. PepLine: a software pipeline for high-throughput direct mapping of tandem mass spectrometry data on genomic sequences.

    PubMed

    Ferro, Myriam; Tardif, Marianne; Reguer, Erwan; Cahuzac, Romain; Bruley, Christophe; Vermat, Thierry; Nugues, Estelle; Vigouroux, Marielle; Vandenbrouck, Yves; Garin, Jérôme; Viari, Alain

    2008-05-01

    PepLine is a fully automated software which maps MS/MS fragmentation spectra of trypsic peptides to genomic DNA sequences. The approach is based on Peptide Sequence Tags (PSTs) obtained from partial interpretation of QTOF MS/MS spectra (first module). PSTs are then mapped on the six-frame translations of genomic sequences (second module) giving hits. Hits are then clustered to detect potential coding regions (third module). Our work aimed at optimizing the algorithms of each component to allow the whole pipeline to proceed in a fully automated manner using raw nucleic acid sequences (i.e., genomes that have not been "reduced" to a database of ORFs or putative exons sequences). The whole pipeline was tested on controlled MS/MS spectra sets from standard proteins and from Arabidopsis thaliana envelope chloroplast samples. Our results demonstrate that PepLine competed with protein database searching softwares and was fast enough to potentially tackle large data sets and/or high size genomes. We also illustrate the potential of this approach for the detection of the intron/exon structure of genes.

  18. High-throughput sequencing methods to study neuronal RNA-protein interactions.

    PubMed

    Ule, Jernej

    2009-12-01

    UV-cross-linking and RNase protection, combined with high-throughput sequencing, have provided global maps of RNA sites bound by individual proteins or ribosomes. Using a stringent purification protocol, UV-CLIP (UV-cross-linking and immunoprecipitation) was able to identify intronic and exonic sites bound by splicing regulators in mouse brain tissue. Ribosome profiling has been used to quantify ribosome density on budding yeast mRNAs under different environmental conditions. Post-transcriptional regulation in neurons requires high spatial and temporal precision, as is evident from the role of localized translational control in synaptic plasticity. It remains to be seen if the high-throughput methods can be applied quantitatively to study the dynamics of RNP (ribonucleoprotein) remodelling in specific neuronal populations during the neurodegenerative process. It is certain, however, that applications of new biochemical techniques followed by high-throughput sequencing will continue to provide important insights into the mechanisms of neuronal post-transcriptional regulation.

  19. A novel type of EWS-CHOP fusion gene in myxoid liposarcoma

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Matsui, Yoshito; Ueda, Takafumi; Kubo, Takahiro

    2006-09-22

    The cytogenetic hallmark of myxoid type and round cell type liposarcoma consists of reciprocal translocation of t(12;16)(q13;p11) and t(12;22)(q13;q12), which results in fusion of TLS/FUS and CHOP, and EWS and CHOP, respectively. Nine structural variations of the TLS/FUS-CHOP chimeric transcript have been reported, however, only two types of EWS-CHOP have been described. We describe here a case of myxoid liposarcoma containing a novel EWS-CHOP chimeric transcript and identified the breakpoint occurring in intron 13 of EWS. Reverse transcription-polymerase chain reaction and direct sequence showed that exon 13 of EWS was in-frame fused to exon 2 of CHOP. Genomic analysis revealedmore » that the breaks were located in intron 13 of EWS and intron 1 of CHOP.« less

  20. A homozygous mutation in the stem II domain of RNU4ATAC causes typical Roifman syndrome.

    PubMed

    Dinur Schejter, Yael; Ovadia, Adi; Alexandrova, Roumiana; Thiruvahindrapuram, Bhooma; Pereira, Sergio L; Manson, David E; Vincent, Ajoy; Merico, Daniele; Roifman, Chaim M

    2017-01-01

    Roifman syndrome (OMIM# 616651) is a complex syndrome encompassing skeletal dysplasia, immunodeficiency, retinal dystrophy and developmental delay, and is caused by compound heterozygous mutations involving the Stem II region and one of the other domains of the RNU4ATAC gene. This small nuclear RNA gene is essential for minor intron splicing. The Canadian Centre for Primary Immunodeficiency Registry and Repository were used to derive patient information as well as tissues. Utilising RNA sequencing methodologies, we analysed samples from patients with Roifman syndrome and assessed intron retention. We demonstrate that a homozygous mutation in Stem II is sufficient to cause the full spectrum of features associated with typical Roifman syndrome. Further, we demonstrate the same pattern of aberration in minor intron retention as found in cases with compound heterozygous mutations.

  1. Is “Junk” DNA Mostly Intron DNA?

    PubMed Central

    Wong, Gane Ka-Shu; Passey, Douglas A.; Huang, Ying-zong; Yang, Zhiyong; Yu, Jun

    2000-01-01

    Among higher eukaryotes, very little of the genome codes for protein. What is in the rest of the genome, or the “junk” DNA, that, in Homo sapiens, is estimated to be almost 97% of the genome? Is it possible that much of this “junk” is intron DNA? This is not a question that can be answered just by looking at the published data, even from the finished genomes. One cannot assume that there are no genes in a sequenced region, just because no genes were annotated. We introduce another approach to this problem, based on an analysis of the cDNA-to-genomic alignments, in all of the complete or nearly-complete genomes from the multicellular organisms. Our conclusion is that, in animals but not in plants, most of the “junk” is intron DNA. PMID:11076852

  2. Structure of the human gene encoding the protein repair L-isoaspartyl (D-aspartyl) O-methyltransferase.

    PubMed

    DeVry, C G; Tsai, W; Clarke, S

    1996-11-15

    The protein L-isoaspartyl/D-aspartyl O-methyltransferase (EC 2.1.1.77) catalyzes the first step in the repair of proteins damaged in the aging process by isomerization or racemization reactions at aspartyl and asparaginyl residues. A single gene has been localized to human chromosome 6 and multiple transcripts arising through alternative splicing have been identified. Restriction enzyme mapping, subcloning, and DNA sequence analysis of three overlapping clones from a human genomic library in bacteriophage P1 indicate that the gene spans approximately 60 kb and is composed of 8 exons interrupted by 7 introns. Analysis of intron/exon splice junctions reveals that all of the donor and acceptor splice sites are in agreement with the mammalian consensus splicing sequence. Determination of transcription initiation sites by primer extension analysis of poly(A)+ mRNA from human brain identifies multiple start sites, with a major site 159 nucleotides upstream from the ATG start codon. Sequence analysis of the 5'-untranslated region demonstrates several potential cis-acting DNA elements including SP1, ETF, AP1, AP2, ARE, XRE, CREB, MED-1, and half-palindromic ERE motifs. The promoter of this methyltransferase gene lacks an identifiable TATA box but is characterized by a CpG island which begins approximately 723 nucleotides upstream of the major transcriptional start site and extends through exon 1 and into the first intron. These features are characteristic of housekeeping genes and are consistent with the wide tissue distribution observed for this methyltransferase activity.

  3. WES homozygosity mapping in a recessive form of Charcot-Marie-Tooth neuropathy reveals intronic GDAP1 variant leading to a premature stop codon.

    PubMed

    Masingue, Marion; Perrot, Jimmy; Carlier, Robert-Yves; Piguet-Lacroix, Guenaelle; Latour, Philippe; Stojkovic, Tanya

    2018-05-01

    Charcot-Marie-Tooth disease (CMT) refers to a group of clinically and genetically heterogeneous inherited neuropathies. Ganglioside-induced differentiation-associated protein 1 GDAP1-related CMT has been reported in an autosomal dominant or recessive form in patients presenting either axonal or demyelinating neuropathy. We report two Sri Lankan sisters born to consanguineous parents and presenting with a severe axonal sensorimotor neuropathy. The early onset of the disease, the distal and proximal weakness and atrophy leading to major disability, along with areflexia, and, most notably, vocal cord and diaphragm paralysis were highly evocative of a GDAP1-related CMT. However, sequencing of the coding regions of the gene was normal. Whole-exome sequencing (WES) was performed and revealed that the largest region of homozygosity was around GDAP1 with several variants, mostly in non-coding regions. In view of the high clinical suspicion of GDAP1 gene involvement, we examined the variants in this gene and this, along with functional studies, allowed us to identify an alternative splicing site revealing a cryptic in-frame stop codon in intron 4 responsible for a severe loss of wild-type GDAP1. This work is the first to describe a deleterious mutation in GDAP1 gene outside of coding sequences or intronic junctions and emphasizes the importance of interpreting molecular analysis, and in particular WES results, in light of the clinical and electrophysiological phenotype.

  4. Voltage-gated sodium channel polymorphism and metabolic resistance in pyrethroid-resistant Aedes aegypti from Brazil.

    PubMed

    Martins, Ademir Jesus; Lins, Rachel Mazzei Moura de Andrade; Linss, Jutta Gerlinde Birgitt; Peixoto, Alexandre Afranio; Valle, Denise

    2009-07-01

    The nature of pyrethroid resistance in Aedes aegypti Brazilian populations was investigated. Quantification of enzymes related to metabolic resistance in two distinct populations, located in the Northeast and Southeast regions, revealed increases in Glutathione-S-transferase (GST) and Esterase levels. Additionally, polymorphism was found in the IIS6 region of Ae. aegypti voltage-gated sodium channel (AaNa(V)), the pyrethroid target site. Sequences were classified in two haplotype groups, A and B, according to the size of the intron in that region. Rockefeller, a susceptible control lineage, contains only B sequences. In field populations, some A sequences present a substitution in the 1011 site (Ile/Met). When resistant and susceptible individuals were compared, the frequency of both A (with the Met mutation) and B sequences were slightly increased in resistant specimens. The involvement of the AaNa(V) polymorphism in pyrethroid resistance and the metabolic mechanisms that lead to potential cross-resistance between organophosphate and pyrethroids are discussed.

  5. CryoEM structure of the spliceosome immediately after branching

    PubMed Central

    Galej, Wojciech P.; Wilkinson, Max E.; Fica, Sebastian M.; Oubridge, Chris; Newman, Andrew J.; Nagai, Kiyoshi

    2016-01-01

    Pre-mRNA splicing proceeds by two consecutive trans-esterification reactions via a lariat-intron intermediate. We present the 3.8Å cryoEM structure of the spliceosome immediately after lariat formation. The 5’-splice site is cleaved but remains close to the catalytic Mg2+ site in the U2/U6 snRNA triplex, and the 5’-phosphate of the intron nucleotide G(+1) is linked to the branch adenosine 2’OH. The 5’-exon is held between the Prp8 N-terminal and Linker domains, and base-pairs with U5 snRNA loop 1. Non-Watson-Crick interactions between the branch helix and 5’-splice site dock the branch adenosine into the active site, while intron nucleotides +3 to +6 base-pair with the U6 snRNA ACAGAGA sequence. Isy1 and the step one factors Yju2 and Cwc25 stabilise docking of the branch helix. The intron downstream of the branch site emerges between the Prp8 RT and Linker domains and extends towards Prp16 helicase, suggesting a plausible mechanism of remodelling before exon ligation. PMID:27459055

  6. Recruitment of Staufen2 Enhances Dendritic Localization of an Intron-Containing CaMKIIα mRNA.

    PubMed

    Ortiz, Raúl; Georgieva, Maya V; Gutiérrez, Sara; Pedraza, Neus; Fernández-Moya, Sandra M; Gallego, Carme

    2017-07-05

    Regulation of mRNA localization is a conserved cellular process observed in many types of cells and organisms. Asymmetrical mRNA distribution plays a particularly important role in the nervous system, where local translation of localized mRNA represents a key mechanism in synaptic plasticity. CaMKIIα is a very abundant mRNA detected in neurites, consistent with its crucial role at glutamatergic synapses. Here, we report the presence of CaMKIIα mRNA isoforms that contain intron i16 in dendrites, RNA granules, and synaptoneurosomes from primary neurons and brain. This subpopulation of unspliced mRNA preferentially localizes to distal dendrites in a synaptic-activity-dependent manner. Staufen2, a well-established marker of RNA transport in dendrites, interacts with intron i16 sequences and enhances its distal dendritic localization, pointing to the existence of intron-mediated mechanisms in the molecular pathways that modulate dendritic transport and localization of synaptic mRNAs. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  7. One novel SNP of growth hormone gene and its associations with growth and carcass traits in ducks.

    PubMed

    Wu, Y; Pan, A L; Pi, J S; Pu, Y J; Du, J P; Liang, Z H; Shen, J

    2012-08-01

    In this study, the growth hormone (GH) gene was studied as a candidate gene for growth and carcass traits of three duck populations (Cherry Valley duck, Muscovy duck and Jingjiang duck). Three pairs of primers were designed to detect single nucleotide polymorphisms of introns 2, 3 and 4 of the GH gene by polymerase chain reaction-restriction fragment length polymorphism and sequencing methods. Only the products amplified from intron 2 displayed polymorphism. The results showed one novel polymorphism: a variation in intron 2 of GH gene (C172T, JN408701 and JN408702). It was associated with some growth and carcass traits in three duck populations including birth weight, 8-week weight, carcass weight, breast muscle weight, leg muscle weight, eviscerated weight, lean meat rate, dressing percentage, etc. And the TT and CT genotypes were associated with superior growth and carcass traits in carcass weight, dressing percentage and percentage of eviscerated weight. Therefore, the variation in intron 2 of GH may be a molecular marker for superior growth and carcass traits in above duck populations.

  8. Homing endonucleases from mobile group I introns: discovery to genome engineering

    PubMed Central

    2014-01-01

    Homing endonucleases are highly specific DNA cleaving enzymes that are encoded within genomes of all forms of microbial life including phage and eukaryotic organelles. These proteins drive the mobility and persistence of their own reading frames. The genes that encode homing endonucleases are often embedded within self-splicing elements such as group I introns, group II introns and inteins. This combination of molecular functions is mutually advantageous: the endonuclease activity allows surrounding introns and inteins to act as invasive DNA elements, while the splicing activity allows the endonuclease gene to invade a coding sequence without disrupting its product. Crystallographic analyses of representatives from all known homing endonuclease families have illustrated both their mechanisms of action and their evolutionary relationships to a wide range of host proteins. Several homing endonucleases have been completely redesigned and used for a variety of genome engineering applications. Recent efforts to augment homing endonucleases with auxiliary DNA recognition elements and/or nucleic acid processing factors has further accelerated their use for applications that demand exceptionally high specificity and activity. PMID:24589358

  9. The hemocyanin from a living fossil, the cephalopod Nautilus pompilius: protein structure, gene organization, and evolution.

    PubMed

    Bergmann, Sandra; Lieb, Bernhard; Ruth, Peter; Markl, Jürgen

    2006-03-01

    By electron microscopic and immunobiochemical analyses we have confirmed earlier evidence that Nautilus pompilius hemocyanin (NpH) is a ring-like decamer (M(r) = approximately 3.5 million), assembled from 10 identical copies of an approximately 350-kDa polypeptide. This subunit in turn is substructured into seven sequential covalently linked functional units of approximately 50 kDa each (FUs a-g). We have cloned and sequenced the cDNA encoding the complete polypeptide; it comprises 9198 bp and is subdivided into a 5' UTR of 58 bp, a 3' UTR of 365 bp, and an open reading frame for a signal peptide of 21 amino acids plus a polypeptide of 2903 amino acids (M(r) = 335,881). According to sequence alignments, the seven FUs of Nautilus hemocyanin directly correspond to the seven FU types of the previously sequenced hemocyanin "OdH" from the cephalopod Octopus dofleini. Thirteen potential N-glycosylation sites are distributed among the seven Nautilus hemocyanin FUs; the structural consequences of putatively attached glycans are discussed on the basis of the published X-ray structure for an Octopus dofleini and a Rapana thomasiana FU. Moreover, the complete gene structure of Nautilus hemocyanin was analyzed; it resembles that of Octopus hemocyanin with respect to linker introns but shows two internal introns that differ in position from the three internal introns of the Octopus hemocyanin gene. Multiple sequence alignments allowed calculation of a rather robust phylogenetic tree and a statistically firm molecular clock. This reveals that the last common ancestor of Nautilus and Octopus lived 415 +/- 24 million years ago, in close agreement with fossil records from the early Devonian.

  10. Identification of novel point mutations in splicing sites integrating whole-exome and RNA-seq data in myeloproliferative diseases.

    PubMed

    Spinelli, Roberta; Pirola, Alessandra; Redaelli, Sara; Sharma, Nitesh; Raman, Hima; Valletta, Simona; Magistroni, Vera; Piazza, Rocco; Gambacorti-Passerini, Carlo

    2013-11-01

    Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC 3 (NM_003786.3:c.1783-1G>A), KLHDC 1 (NM_172193.1:c.568-2A>G), HOOK 1 (NM_015888.4:c.1662-1G>A), SMAD 9 (NM_001127217.2:c.1004-1C>T), and DNAH 9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC 3, HOOK 1. In ABCC 3 and HOOK 1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK 1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4-6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis.

  11. SplicingTypesAnno: annotating and quantifying alternative splicing events for RNA-Seq data.

    PubMed

    Sun, Xiaoyong; Zuo, Fenghua; Ru, Yuanbin; Guo, Jiqiang; Yan, Xiaoyan; Sablok, Gaurav

    2015-04-01

    Alternative splicing plays a key role in the regulation of the central dogma. Four major types of alternative splicing have been classified as intron retention, exon skipping, alternative 5 splice sites or alternative donor sites, and alternative 3 splice sites or alternative acceptor sites. A few algorithms have been developed to detect splice junctions from RNA-Seq reads. However, there are few tools targeting at the major alternative splicing types at the exon/intron level. This type of analysis may reveal subtle, yet important events of alternative splicing, and thus help gain deeper understanding of the mechanism of alternative splicing. This paper describes a user-friendly R package, extracting, annotating and analyzing alternative splicing types for sequence alignment files from RNA-Seq. SplicingTypesAnno can: (1) provide annotation for major alternative splicing at exon/intron level. By comparing the annotation from GTF/GFF file, it identifies the novel alternative splicing sites; (2) offer a convenient two-level analysis: genome-scale annotation for users with high performance computing environment, and gene-scale annotation for users with personal computers; (3) generate a user-friendly web report and additional BED files for IGV visualization. SplicingTypesAnno is a user-friendly R package for extracting, annotating and analyzing alternative splicing types at exon/intron level for sequence alignment files from RNA-Seq. It is publically available at https://sourceforge.net/projects/splicingtypes/files/ or http://genome.sdau.edu.cn/research/software/SplicingTypesAnno.html. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  12. Abiotic Stresses Modulate Landscape of Poplar Transcriptome via Alternative Splicing, Differential Intron Retention, and Isoform Ratio Switching

    PubMed Central

    Filichkin, Sergei A.; Hamilton, Michael; Dharmawardhana, Palitha D.; Singh, Sunil K.; Sullivan, Christopher; Ben-Hur, Asa; Reddy, Anireddy S. N.; Jaiswal, Pankaj

    2018-01-01

    Abiotic stresses affect plant physiology, development, growth, and alter pre-mRNA splicing. Western poplar is a model woody tree and a potential bioenergy feedstock. To investigate the extent of stress-regulated alternative splicing (AS), we conducted an in-depth survey of leaf, root, and stem xylem transcriptomes under drought, salt, or temperature stress. Analysis of approximately one billion of genome-aligned RNA-Seq reads from tissue- or stress-specific libraries revealed over fifteen millions of novel splice junctions. Transcript models supported by both RNA-Seq and single molecule isoform sequencing (Iso-Seq) data revealed a broad array of novel stress- and/or tissue-specific isoforms. Analysis of Iso-Seq data also resulted in the discovery of 15,087 novel transcribed regions of which 164 show AS. Our findings demonstrate that abiotic stresses profoundly perturb transcript isoform profiles and trigger widespread intron retention (IR) events. Stress treatments often increased or decreased retention of specific introns – a phenomenon described here as differential intron retention (DIR). Many differentially retained introns were regulated in a stress- and/or tissue-specific manner. A subset of transcripts harboring super stress-responsive DIR events showed persisting fluctuations in the degree of IR across all treatments and tissue types. To investigate coordinated dynamics of intron-containing transcripts in the study we quantified absolute copy number of isoforms of two conserved transcription factors (TFs) using Droplet Digital PCR. This case study suggests that stress treatments can be associated with coordinated switches in relative ratios between fully spliced and intron-retaining isoforms and may play a role in adjusting transcriptome to abiotic stresses. PMID:29483921

  13. Whole-gene CFTR sequencing combined with digital RT-PCR improves genetic diagnosis of cystic fibrosis.

    PubMed

    Straniero, Letizia; Soldà, Giulia; Costantino, Lucy; Seia, Manuela; Melotti, Paola; Colombo, Carla; Asselta, Rosanna; Duga, Stefano

    2016-12-01

    Despite extensive screening, 1-5% of cystic fibrosis (CF) patients lack a definite molecular diagnosis. Next-generation sequencing (NGS) is making affordable genetic testing based on the identification of variants in extended genomic regions. In this frame, we analyzed 23 CF patients and one carrier by whole-gene CFTR resequencing: 4 were previously characterized and served as controls; 17 were cases lacking a complete diagnosis after a full conventional CFTR screening; 3 were consecutive subjects referring to our centers, not previously submitted to any screening. We also included in the custom NGS design the coding portions of the SCNN1A, SCNN1B and SCNN1G genes, encoding the subunits of the sodium channel ENaC, which were found to be mutated in CF-like patients. Besides 2 novel SCNN1B missense mutations, we identified 22 previously-known CFTR mutations, including 2 large deletions (whose breakpoints were precisely mapped), and novel deep-intronic variants, whose role on splicing was excluded by ex-vivo analyses. Finally, for 2 patients, compound heterozygotes for a CFTR mutation and the intron-9c.1210-34TG [11-12] T 5 allele-known to be associated with decreased CFTR mRNA levels-the molecular diagnosis was implemented by measuring the residual level of wild-type transcript by digital reverse transcription polymerase chain reaction performed on RNA extracted from nasal brushing.

  14. Evolution of the myosin heavy chain gene MYH14 and its intronic microRNA miR-499: muscle-specific miR-499 expression persists in the absence of the ancestral host gene.

    PubMed

    Bhuiyan, Sharmin Siddique; Kinoshita, Shigeharu; Wongwarangkana, Chaninya; Asaduzzaman, Md; Asakawa, Shuichi; Watabe, Shugo

    2013-07-06

    A novel sarcomeric myosin heavy chain gene, MYH14, was identified following the completion of the human genome project. MYH14 contains an intronic microRNA, miR-499, which is expressed in a slow/cardiac muscle specific manner along with its host gene; it plays a key role in muscle fiber-type specification in mammals. Interestingly, teleost fish genomes contain multiple MYH14 and miR-499 paralogs. However, the evolutionary history of MYH14 and miR-499 has not been studied in detail. In the present study, we identified MYH14/miR-499 loci on various teleost fish genomes and examined their evolutionary history by sequence and expression analyses. Synteny and phylogenetic analyses depict the evolutionary history of MYH14/miR-499 loci where teleost specific duplication and several subsequent rounds of species-specific gene loss events took place. Interestingly, miR-499 was not located in the MYH14 introns of certain teleost fish. An MYH14 paralog, lacking miR-499, exhibited an accelerated rate of evolution compared with those containing miR-499, suggesting a putative functional relationship between MYH14 and miR-499. In medaka, Oryzias latipes, miR-499 is present where MYH14 is completely absent in the genome. Furthermore, by using in situ hybridization and small RNA sequencing, miR-499 was expressed in the notochord at the medaka embryonic stage and slow/cardiac muscle at the larval and adult stages. Comparing the flanking sequences of MYH14/miR-499 loci between torafugu Takifugu rubripes, zebrafish Danio rerio, and medaka revealed some highly conserved regions, suggesting that cis-regulatory elements have been functionally conserved in medaka miR-499 despite the loss of its host gene. This study reveals the evolutionary history of the MYH14/miRNA-499 locus in teleost fish, indicating divergent distribution and expression of MYH14 and miR-499 genes in different teleost fish lineages. We also found that medaka miR-499 was even expressed in the absence of its host gene. To our knowledge, this is the first report that shows the conversion of intronic into non-intronic miRNA during the evolution of a teleost fish lineage.

  15. Variants in intron 13 of the ELMO1 gene are associated with diabetic nephropathy in African Americans

    PubMed Central

    Leak, T. S.; Perlegas, P.S.; Smith, S.G.; Keene, K.L.; Hicks, P.J.; Langefeld, C.D.; Mychaleckyj, J.C.; Rich, S.S.; Kirk, J.K.; Freedman, B.I.; Bowden, D.W.; Sale, M.M.

    2009-01-01

    Variants in the engulfment and cell motility 1 (ELMO1) gene are associated with nephropathy due to type 2 diabetes mellitus (T2DM) in a Japanese cohort. We comprehensively evaluated this gene in African American (AA) T2DM patients with end-stage renal disease (ESRD). Three hundred nine HapMap tagging SNPs and 9 reportedly associated SNPs were genotyped in 577 AA T2DM-ESRD patients and 596 AA non-diabetic controls, plus 43 non-diabetic European American controls and 45 Yoruba Nigerian samples for admixture adjustment. Replication analyses were conducted in 558 AAs with T2DM-ESRD and 564 controls without diabetes. Extension analyses included 328 AA with T2DM lacking nephropathy and 326 with non-diabetic ESRD. The original and replication analyses confirmed association with four SNPs in intron 13 (permutation p-values for combined analyses = 0.001-0.003), one in intron 1 (P=0.004) and one in intron 5 (P=0.002) with T2DM-associated ESRD. In a subsequent combined analysis of all 1,135 T2DM-ESRD cases and 1,160 controls, an additional 7 intron 13 SNPs produced evidence of association (P = 3.5×10-5 – P=0.05). No associations were seen with these SNPs in those with T2DM lacking nephropathy or with ESRD due to non-diabetic causes. Variants in intron 13 of the ELMO1 gene appear to confer risk for diabetic nephropathy in AA. PMID:19183347

  16. Assessing information content and interactive relationships of subgenomic DNA sequences of the MHC using complexity theory approaches based on the non-extensive statistical mechanics

    NASA Astrophysics Data System (ADS)

    Karakatsanis, L. P.; Pavlos, G. P.; Iliopoulos, A. C.; Pavlos, E. G.; Clark, P. M.; Duke, J. L.; Monos, D. S.

    2018-09-01

    This study combines two independent domains of science, the high throughput DNA sequencing capabilities of Genomics and complexity theory from Physics, to assess the information encoded by the different genomic segments of exonic, intronic and intergenic regions of the Major Histocompatibility Complex (MHC) and identify possible interactive relationships. The dynamic and non-extensive statistical characteristics of two well characterized MHC sequences from the homozygous cell lines, PGF and COX, in addition to two other genomic regions of comparable size, used as controls, have been studied using the reconstructed phase space theorem and the non-extensive statistical theory of Tsallis. The results reveal similar non-linear dynamical behavior as far as complexity and self-organization features. In particular, the low-dimensional deterministic nonlinear chaotic and non-extensive statistical character of the DNA sequences was verified with strong multifractal characteristics and long-range correlations. The nonlinear indices repeatedly verified that MHC sequences, whether exonic, intronic or intergenic include varying levels of information and reveal an interaction of the genes with intergenic regions, whereby the lower the number of genes in a region, the less the complexity and information content of the intergenic region. Finally we showed the significance of the intergenic region in the production of the DNA dynamics. The findings reveal interesting content information in all three genomic elements and interactive relationships of the genes with the intergenic regions. The results most likely are relevant to the whole genome and not only to the MHC. These findings are consistent with the ENCODE project, which has now established that the non-coding regions of the genome remain to be of relevance, as they are functionally important and play a significant role in the regulation of expression of genes and coordination of the many biological processes of the cell.

  17. “Highly evolvable malaria vectors: the genomes of 16 Anopheles mosquitoes”

    PubMed Central

    Neafsey, Daniel E.; Waterhouse, Robert M.; Abai, Mohammad R.; Aganezov, Sergey S.; Alekseyev, Max A.; Allen, James E.; Amon, James; Arcà, Bruno; Arensburger, Peter; Artemov, Gleb; Assour, Lauren A.; Basseri, Hamidreza; Berlin, Aaron; Birren, Bruce W.; Blandin, Stephanie A.; Brockman, Andrew I.; Burkot, Thomas R.; Burt, Austin; Chan, Clara S.; Chauve, Cedric; Chiu, Joanna C.; Christensen, Mikkel; Costantini, Carlo; Davidson, Victoria L.M.; Deligianni, Elena; Dottorini, Tania; Dritsou, Vicky; Gabriel, Stacey B.; Guelbeogo, Wamdaogo M.; Hall, Andrew B.; Han, Mira V.; Hlaing, Thaung; Hughes, Daniel S.T.; Jenkins, Adam M.; Jiang, Xiaofang; Jungreis, Irwin; Kakani, Evdoxia G.; Kamali, Maryam; Kemppainen, Petri; Kennedy, Ryan C.; Kirmitzoglou, Ioannis K.; Koekemoer, Lizette L.; Laban, Njoroge; Langridge, Nicholas; Lawniczak, Mara K.N.; Lirakis, Manolis; Lobo, Neil F.; Lowy, Ernesto; MacCallum, Robert M.; Mao, Chunhong; Maslen, Gareth; Mbogo, Charles; McCarthy, Jenny; Michel, Kristin; Mitchell, Sara N.; Moore, Wendy; Murphy, Katherine A.; Naumenko, Anastasia N.; Nolan, Tony; Novoa, Eva M.; O'Loughlin, Samantha; Oringanje, Chioma; Oshaghi, Mohammad A.; Pakpour, Nazzy; Papathanos, Philippos A.; Peery, Ashley N.; Povelones, Michael; Prakash, Anil; Price, David P.; Rajaraman, Ashok; Reimer, Lisa J.; Rinker, David C.; Rokas, Antonis; Russell, Tanya L.; Sagnon, N'Fale; Sharakhova, Maria V.; Shea, Terrance; Simão, Felipe A.; Simard, Frederic; Slotman, Michel A.; Somboon, Pradya; Stegniy, Vladimir; Struchiner, Claudio J.; Thomas, Gregg W.C.; Tojo, Marta; Topalis, Pantelis; Tubio, José M.C.; Unger, Maria F.; Vontas, John; Walton, Catherine; Wilding, Craig S.; Willis, Judith H.; Wu, Yi-Chieh; Yan, Guiyun; Zdobnov, Evgeny M.; Zhou, Xiaofan; Catteruccia, Flaminia; Christophides, George K.; Collins, Frank H.; Cornman, Robert S.; Crisanti, Andrea; Donnelly, Martin J.; Emrich, Scott J.; Fontaine, Michael C.; Gelbart, William; Hahn, Matthew W.; Hansen, Immo A.; Howell, Paul I.; Kafatos, Fotis C.; Kellis, Manolis; Lawson, Daniel; Louis, Christos; Luckhart, Shirley; Muskavitch, Marc A.T.; Ribeiro, José M.; Riehle, Michael A.; Sharakhov, Igor V.; Tu, Zhijian; Zwiebel, Laurence J.; Besansky, Nora J.

    2015-01-01

    Variation in vectorial capacity for human malaria among Anopheles mosquito species is determined by many factors, including behavior, immunity, and life history. To investigate the genomic basis of vectorial capacity and explore new avenues for vector control, we sequenced the genomes of 16 anopheline mosquito species from diverse locations spanning ~100 million years of evolution. Comparative analyses show faster rates of gene gain and loss, elevated gene shuffling on the X chromosome, and more intron losses, relative to Drosophila. Some determinants of vectorial capacity, such as chemosensory genes, do not show elevated turnover, but instead diversify through protein-sequence changes. This dynamism of anopheline genes and genomes may contribute to their flexible capacity to take advantage of new ecological niches, including adapting to humans as primary hosts. PMID:25554792

  18. Parallel computation of genome-scale RNA secondary structure to detect structural constraints on human genome.

    PubMed

    Kawaguchi, Risa; Kiryu, Hisanori

    2016-05-06

    RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR shows a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions. We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR .

  19. Intronic sequences are required for AINTEGUMENTA-LIKE6 expression in Arabidopsis flowers.

    PubMed

    Krizek, Beth A

    2015-10-12

    The AINTEGUMENTA-LIKE6/PLETHORA3 (AIL6/PLT3) gene of Arabidopsis thaliana is a key regulator of growth and patterning in both shoots and roots. AIL6 encodes an AINTEGUMENTA-LIKE/PLETHORA (AIL/PLT) transcription factor that is expressed in the root stem cell niche, the peripheral region of the shoot apical meristem and young lateral organ primordia. In flowers, AIL6 acts redundantly with AINTEGUMENTA (ANT) to regulate floral organ positioning, growth, identity and patterning. Experiments were undertaken to define the genomic regions required for AIL6 function and expression in flowers. Transgenic plants expressing a copy of the coding region of AIL6 in the context of 7.7 kb of 5' sequence and 919 bp of 3' sequence (AIL6:cAIL6-3') fail to fully complement AIL6 function when assayed in the ant-4 ail6-2 double mutant background. In contrast, a genomic copy of AIL6 with the same amount of 5' and 3' sequence (AIL6:gAIL6-3') can fully complement ant-4 ail6-2. In addition, a genomic copy of AIL6 with 590 bp of 5' sequence and 919 bp of 3' sequence (AIL6m:gAIL6-3') complements ant-4 ail6-2 and contains all regulatory elements needed to confer normal AIL6 expression in inflorescences. Efforts to map cis-regulatory elements reveal that the third intron of AIL6 contains enhancer elements that confer expression in young flowers but in a broader pattern than that of AIL6 mRNA in wild-type flowers. Some AIL6:gAIL6-3' and AIL6m:gAIL6-3' lines confer an over-rescue phenotype in the ant-4 ail6-2 background that is correlated with higher levels of AIL6 mRNA accumulation. The results presented here indicate that AIL6 intronic sequences serve as transcriptional enhancer elements. In addition, the results show that increased expression of AIL6 can partially compensate for loss of ANT function in flowers.

  20. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kerr, J.M.; Fisher, L.W.; Termine, J.D.

    The authors have isolated and partially sequenced the human bone sialoprotein gene (IBSP). IBSP has been sublocalized by in situ hybridization to chromosome 4q38-q31 and is composed of six small exons (51 to 159 bp) and 1 large exon ([approximately]2.6 kb). The intron/exon junctions defined by sequence analysis are of class O, retaining an intact coding triplet. Sequence analysis of the 5[prime] upstream region revealed a TATAA (nucleotides -30 to-25 from the transcriptional start point) and a CCAAT (nucleotides -56 to-52) box, both in the reverse orientation. Intron 1 contains interesting structural elements composed of polypyrimidine repeats followed by amore » poly(AC)[sub n] tract. Both types of structural elements have been detected in promoter regions of other genes and have been implicated in transcriptional regulation. Several differences between the previously published cDNA sequence and the authors' sequence have been identified, most of which are contained within the untranslated exon 1. Three base revisions in the coding region include a G to T (Gly to Val, amino acid 195), T to C (Val to Ala, amino acid 268), and T to A (Glu to Asp, amino acid 270). In conclusion, the genomic organization and potential regulatory elements of human IBSP have been elucidated. 42 refs., 4 figs., 1 tab.« less

  1. Increased complexity of circRNA expression during species evolution.

    PubMed

    Dong, Rui; Ma, Xu-Kai; Chen, Ling-Ling; Yang, Li

    2017-08-03

    Circular RNAs (circRNAs) are broadly identified from precursor mRNA (pre-mRNA) back-splicing across various species. Recent studies have suggested a cell-/tissue- specific manner of circRNA expression. However, the distinct expression pattern of circRNAs among species and its underlying mechanism still remain to be explored. Here, we systematically compared circRNA expression from human and mouse, and found that only a small portion of human circRNAs could be determined in parallel mouse samples. The conserved circRNA expression between human and mouse is correlated with the existence of orientation-opposite complementary sequences in introns that flank back-spliced exons in both species, but not the circRNA sequences themselves. Quantification of RNA pairing capacity of orientation-opposite complementary sequences across circRNA-flanking introns by Complementary Sequence Index (CSI) identifies that among all types of complementary sequences, SINEs, especially Alu elements in human, contribute the most for circRNA formation and that their diverse distribution across species leads to the increased complexity of circRNA expression during species evolution. Together, our integrated and comparative reference catalog of circRNAs in different species reveals a species-specific pattern of circRNA expression and suggests a previously under-appreciated impact of fast-evolved SINEs on the regulation of (circRNA) gene expression.

  2. Comparing COI and ITS as DNA barcode markers for mushrooms and allies (Agaricomycotina).

    PubMed

    Dentinger, Bryn T M; Didukh, Maryna Y; Moncalvo, Jean-Marc

    2011-01-01

    DNA barcoding is an approach to rapidly identify species using short, standard genetic markers. The mitochondrial cytochrome oxidase I gene (COI) has been proposed as the universal barcode locus, but its utility for barcoding in mushrooms (ca. 20,000 species) has not been established. We succeeded in generating 167 partial COI sequences (~450 bp) representing ~100 morphospecies from ~650 collections of Agaricomycotina using several sets of new primers. Large introns (~1500 bp) at variable locations were detected in ~5% of the sequences we obtained. We suspect that widespread presence of large introns is responsible for our low PCR success (~30%) with this locus. We also sequenced the nuclear internal transcribed spacer rDNA regions (ITS) to compare with COI. Among the small proportion of taxa for which COI could be sequenced, COI and ITS perform similarly as a barcode. However, in a densely sampled set of closely related taxa, COI was less divergent than ITS and failed to distinguish all terminal clades. Given our results and the wealth of ITS data already available in public databases, we recommend that COI be abandoned in favor of ITS as the primary DNA barcode locus in mushrooms.

  3. Nucleotide sequences of two genomic DNAs encoding peroxidase of Arabidopsis thaliana.

    PubMed

    Intapruk, C; Higashimura, N; Yamamoto, K; Okada, N; Shinmyo, A; Takano, M

    1991-02-15

    The peroxidase (EC 1.11.1.7)-encoding gene of Arabidopsis thaliana was screened from a genomic library using a cDNA encoding a neutral isozyme of horseradish, Armoracia rusticana, peroxidase (HRP) as a probe, and two positive clones were isolated. From the comparison with the sequences of the HRP-encoding genes, we concluded that two clones contained peroxidase-encoding genes, and they were named prxCa and prxEa. Both genes consisted of four exons and three introns; the introns had consensus nucleotides, GT and AG, at the 5' and 3' ends, respectively. The lengths of each putative exon of the prxEa gene were the same as those of the HRP-basic-isozyme-encoding gene, prxC3, and coded for 349 amino acids (aa) with a sequence homology of 89% to that encoded by prxC3. The prxCa gene was very close to the HRP-neutral-isozyme-encoding gene, prxC1b, and coded for 354 aa with 91% homology to that encoded by prxC1b. The aa sequence homology was 64% between the two peroxidases encoded by prxCa and prxEa.

  4. Comparing COI and ITS as DNA Barcode Markers for Mushrooms and Allies (Agaricomycotina)

    PubMed Central

    Dentinger, Bryn T. M.; Didukh, Maryna Y.; Moncalvo, Jean-Marc

    2011-01-01

    DNA barcoding is an approach to rapidly identify species using short, standard genetic markers. The mitochondrial cytochrome oxidase I gene (COI) has been proposed as the universal barcode locus, but its utility for barcoding in mushrooms (ca. 20,000 species) has not been established. We succeeded in generating 167 partial COI sequences (∼450 bp) representing ∼100 morphospecies from ∼650 collections of Agaricomycotina using several sets of new primers. Large introns (∼1500 bp) at variable locations were detected in ∼5% of the sequences we obtained. We suspect that widespread presence of large introns is responsible for our low PCR success (∼30%) with this locus. We also sequenced the nuclear internal transcribed spacer rDNA regions (ITS) to compare with COI. Among the small proportion of taxa for which COI could be sequenced, COI and ITS perform similarly as a barcode. However, in a densely sampled set of closely related taxa, COI was less divergent than ITS and failed to distinguish all terminal clades. Given our results and the wealth of ITS data already available in public databases, we recommend that COI be abandoned in favor of ITS as the primary DNA barcode locus in mushrooms. PMID:21966418

  5. Management of adult-onset methylmalonic acidemia with hypotonia and acute respiratory failure: A case report.

    PubMed

    Zhao, Zhanqi; Chu, Chan-Ching; Chang, Mei-Yun; Chang, Hao-Tai; Hsu, Yeong-Long

    2018-06-01

    Methylmalonic acidemia (MMA) is an autosomal recessive disease of organic acidemia. We report a 26-year-old male who presented with metabolic acidosis, acute renal failure required hemodialysis and acute respiratory failure required mechanical ventilation support. Progressive hypotonia of muscles made weaning from mechanical ventilator difficult. High level of serum methylmalonic acid and the mut genotype sequences confirmed the diagnosis of this adult-onset MMA. Two mut genotype sequences were found by analyzing all coding exons and exon-intron junctions. One genotype was well documented (Exon 6 Mutation, c. 1280G>A. p. G427D, heterozygous). The other mut genotype sequence had never been reported elsewhere (Intron 6 Novel, c. 1333-13_c. 1333-8delTTTTTC, heterozygous). Diet modification, medication, regular hemodialysis and physical rehabilitation. Weaning strategy adjusted with help of electrical impedance tomography. The muscle power of the patient gradually recovered. Extubation of the patient was successful and he was discharged without oxygen required. This case gives us the lesson that MMA can be newly diagnosed in adult patient. A new mut genotype sequence was discovered. The use of electrical impedance tomography to select a suitable method for inspiratory muscle training was possible and useful.

  6. Insertion of an SVA-E retrotransposon into the CASP8 gene is associated with protection against prostate cancer

    PubMed Central

    Stacey, Simon N.; Kehr, Birte; Gudmundsson, Julius; Zink, Florian; Jonasdottir, Aslaug; Gudjonsson, Sigurjon A.; Sigurdsson, Asgeir; Halldorsson, Bjarni V.; Agnarsson, Bjarni A.; Benediktsdottir, Kristrun R.; Aben, Katja K.H.; Vermeulen, Sita H.; Cremers, Ruben G.; Panadero, Angeles; Helfand, Brian T.; Cooper, Phillip R.; Donovan, Jenny L.; Hamdy, Freddie C.; Jinga, Viorel; Okamoto, Ichiro; Jonasson, Jon G.; Tryggvadottir, Laufey; Johannsdottir, Hrefna; Kristinsdottir, Anna M.; Masson, Gisli; Magnusson, Olafur T.; Iordache, Paul D.; Helgason, Agnar; Helgason, Hannes; Sulem, Patrick; Gudbjartsson, Daniel F.; Kong, Augustine; Jonsson, Eirikur; Barkardottir, Rosa B.; Einarsson, Gudmundur V.; Rafnar, Thorunn; Thorsteinsdottir, Unnur; Mates, Ioan N.; Neal, David E.; Catalona, William J.; Mayordomo, José I.; Kiemeney, Lambertus A.; Thorleifsson, Gudmar; Stefansson, Kari

    2016-01-01

    Transcriptional and splicing anomalies have been observed in intron 8 of the CASP8 gene (encoding procaspase-8) in association with cutaneous basal-cell carcinoma (BCC) and linked to a germline SNP rs700635. Here, we show that the rs700635[C] allele, which is associated with increased risk of BCC and breast cancer, is protective against prostate cancer [odds ratio (OR) = 0.91, P = 1.0 × 10−6]. rs700635[C] is also associated with failures to correctly splice out CASP8 intron 8 in breast and prostate tumours and in corresponding normal tissues. Investigation of rs700635[C] carriers revealed that they have a human-specific short interspersed element-variable number of tandem repeat-Alu (SINE-VNTR-Alu), subfamily-E retrotransposon (SVA-E) inserted into CASP8 intron 8. The SVA-E shows evidence of prior activity, because it has transduced some CASP8 sequences during subsequent retrotransposition events. Whole-genome sequence (WGS) data were used to tag the SVA-E with a surrogate SNP rs1035142[T] (r2 = 0.999), which showed associations with both the splicing anomalies (P = 6.5 × 10−32) and with protection against prostate cancer (OR = 0.91, P = 3.8 × 10−7). PMID:26740556

  7. Dynamic evolution of Geranium mitochondrial genomes through multiple horizontal and intracellular gene transfers.

    PubMed

    Park, Seongjun; Grewe, Felix; Zhu, Andan; Ruhlman, Tracey A; Sabir, Jamal; Mower, Jeffrey P; Jansen, Robert K

    2015-10-01

    The exchange of genetic material between cellular organelles through intracellular gene transfer (IGT) or between species by horizontal gene transfer (HGT) has played an important role in plant mitochondrial genome evolution. The mitochondrial genomes of Geraniaceae display a number of unusual phenomena including highly accelerated rates of synonymous substitutions, extensive gene loss and reduction in RNA editing. Mitochondrial DNA sequences assembled for 17 species of Geranium revealed substantial reduction in gene and intron content relative to the ancestor of the Geranium lineage. Comparative analyses of nuclear transcriptome data suggest that a number of these sequences have been functionally relocated to the nucleus via IGT. Evidence for rampant HGT was detected in several Geranium species containing foreign organellar DNA from diverse eudicots, including many transfers from parasitic plants. One lineage has experienced multiple, independent HGT episodes, many of which occurred within the past 5.5 Myr. Both duplicative and recapture HGT were documented in Geranium lineages. The mitochondrial genome of Geranium brycei contains at least four independent HGT tracts that are absent in its nearest relative. Furthermore, G. brycei mitochondria carry two copies of the cox1 gene that differ in intron content, providing insight into contrasting hypotheses on cox1 intron evolution. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  8. Improvement of a yeast self-excising integrative vector by prevention of expression leakage of the intronated Cre recombinase gene during plasmid maintenance in Escherichia coli.

    PubMed

    Agaphonov, Michael O

    2017-12-01

    The use of plasmids possessing a regulatable gene coding for a site-specific recombinase together with its recognition sequences significantly facilitates genome manipulations since it allows self-excision of the portion of the genetic construct integrated into the host genome. Stable maintenance of such plasmids in Escherichia coli, which is used for plasmid preparation, requires prevention of recombinase synthesis in this host, which can be achieved by interrupting the recombinase gene with an intron. Based on this approach, Saccharomyces cerevisiae and Hansenula polymorpha self-excising vectors possessing intronated gene for Cre recombinase and its recognition sites (LoxP) were previously constructed. However, this work shows instability of the H. polymorpha vectors during plasmid maintenance in E. coli cells. This could be due to recombination between the loxP sites caused by residual expression of the cre gene. Prevention of translation reinitiation on an internal methionine codon completely solved this problem. A similar modification was made in a self-excising vector designed for S. cerevisiae. Apart from substantial improvement of yeast self-excising vectors, the obtained results also narrow down the essential part of Cre sequence. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  9. Exonization of an Intronic LINE-1 Element Causing Becker Muscular Dystrophy as a Novel Mutational Mechanism in Dystrophin Gene.

    PubMed

    Gonçalves, Ana; Oliveira, Jorge; Coelho, Teresa; Taipa, Ricardo; Melo-Pires, Manuel; Sousa, Mário; Santos, Rosário

    2017-10-03

    A broad mutational spectrum in the dystrophin ( DMD ) gene, from large deletions/duplications to point mutations, causes Duchenne/Becker muscular dystrophy (D/BMD). Comprehensive genotyping is particularly relevant considering the mutation-centered therapies for dystrophinopathies. We report the genetic characterization of a patient with disease onset at age 13 years, elevated creatine kinase levels and reduced dystrophin labeling, where multiplex-ligation probe amplification (MLPA) and genomic sequencing failed to detect pathogenic variants. Bioinformatic, transcriptomic (real time PCR, RT-PCR), and genomic approaches (Southern blot, long-range PCR, and single molecule real-time sequencing) were used to characterize the mutation. An aberrant transcript was identified, containing a 103-nucleotide insertion between exons 51 and 52, with no similarity with the DMD gene. This corresponded to the partial exonization of a long interspersed nuclear element (LINE-1), disrupting the open reading frame. Further characterization identified a complete LINE-1 (~6 kb with typical hallmarks) deeply inserted in intron 51. Haplotyping and segregation analysis demonstrated that the mutation had a de novo origin. Besides underscoring the importance of mRNA studies in genetically unsolved cases, this is the first report of a disease-causing fully intronic LINE-1 element in DMD , adding to the diversity of mutational events that give rise to D/BMD.

  10. The genomic structure: proof of the role of non-coding DNA.

    PubMed

    Bouaynaya, Nidhal; Schonfeld, Dan

    2006-01-01

    We prove that the introns play the role of a decoy in absorbing mutations in the same way hollow uninhabited structures are used by the military to protect important installations. Our approach is based on a probability of error analysis, where errors are mutations which occur in the exon sequences. We derive the optimal exon length distribution, which minimizes the probability of error in the genome. Furthermore, to understand how can Nature generate the optimal distribution, we propose a diffusive random walk model for exon generation throughout evolution. This model results in an alpha stable exon length distribution, which is asymptotically equivalent to the optimal distribution. Experimental results show that both distributions accurately fit the real data. Given that introns also drive biological evolution by increasing the rate of unequal crossover between genes, we conclude that the role of introns is to maintain a genius balance between stability and adaptability in eukaryotic genomes.

  11. A 3.0-kb deletion including an erythroid cell-specific regulatory element in intron 1 of the ABO blood group gene in an individual with the Bm phenotype.

    PubMed

    Sano, R; Kuboya, E; Nakajima, T; Takahashi, Y; Takahashi, K; Kubo, R; Kominato, Y; Takeshita, H; Yamao, H; Kishida, T; Isa, K; Ogasawara, K; Uchikawa, M

    2015-04-01

    We developed a sequence-specific primer PCR (SSP-PCR) for detection of a 5.8-kb deletion (B(m) 5.8) involving an erythroid cell-specific regulatory element in intron 1 of the ABO blood group gene. Using this SSP-PCR, we performed genetic analysis of 382 individuals with Bm or ABm. The 5.8-kb deletion was found in 380 individuals, and disruption of the GATA motif in the regulatory element was found in one individual. Furthermore, a novel 3.0-kb deletion involving the element (B(m) 3.0) was demonstrated in the remaining individual. Comparisons of single-nucleotide polymorphisms and microsatellites in intron 1 between B(m) 5.8 and B(m) 3.0 suggested that these deletions occurred independently. © 2014 International Society of Blood Transfusion.

  12. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Abraitiene, Asta; US Department of Agriculture, Agricultural Research Service, Molecular Plant Pathology Laboratory, Room 214 Building 004 BARC-West, 10300 Baltimore Avenue, Beltsville, MD 20705; Zhao Yan

    Transient expression of engineered reporter RNAs encoding an intron-containing green fluorescent protein (GFP) from a Potato virus X-based expression vector previously demonstrated the nuclear targeting capability of the 359 nucleotide Potato spindle tuber viroid (PSTVd) RNA genome. To further delimit the putative nuclear-targeting signal, PSTVd subgenomic fragments were embedded within the intron, and recombinant reporter RNAs were inoculated onto Nicotiana benthamiana plants. Appearance of green fluorescence in leaf tissue inoculated with PSTVd-fragment-containing constructs indicated shuttling of the RNA into the nucleus by fragments as short as 80 nucleotides in length. Plant-to-plant variation in the timing of intron removal and subsequentmore » GFP fluorescence was observed; however, earliest and most abundant GFP expression was obtained with constructs containing the conserved hairpin I palindrome structure and embedded upper central conserved region. Our results suggest that this conserved sequence and/or the stem-loop structure it forms is sufficient for import of PSTVd into the nucleus.« less

  13. The Rise and Fall of the Gene.

    ERIC Educational Resources Information Center

    Mahadeva, Madhu; Randerson, Sherman

    1985-01-01

    Summarizes the current state of genetics, highlighting major historical events in the development of the field and discussing topics related to introns ("silent" or noncoding base sequences in eucaryotic genes) and exons (the coding parts of DNA). (JN)

  14. A CT-rich haplotype in intron 4 of SNCA confers risk for Lewy body pathology in Alzheimer’s disease and affects SNCA expression

    PubMed Central

    Lutz, Michael W.; Saul, Robert; Linnertz, Colton; Glenn, Omolara-Chinue; Roses, Allen D.; Chiba-Falek, Ornit

    2015-01-01

    INTRODUCTION We recently showed that tagging-SNPs across the SNCA locus were significantly associated with increased risk for LB pathology in AD cases. However, the actual genetic variant(s) that underlie the observed associations remain elusive. METHODS We used a bioinformatics algorithm to catalogue Structural-Variants in a region of SNCA-intron4, followed by phased-sequencing. We performed a genetic-association analysis in autopsy series of LBV/AD cases compared with AD-only controls. We investigated the biological functions by expression analysis using temporal-cortex samples. RESULTS We identified four distinct haplotypes within a highly-polymorphic-low-complexity CT-rich region. We showed that a specific haplotype conferred risk to develop LBV/AD. We demonstrated that the CT-rich site acts as an enhancer element, where the risk haplotype was significantly associated with elevated levels of SNCA-mRNA. DISCUSSION We have discovered a novel haplotype in a CT-rich region in SNCA that contributes to LB pathology in AD patients, possibly via cis-regulation of the gene expression. PMID:26079410

  15. High-purity circular RNA isolation method (RPAD) reveals vast collection of intronic circRNAs.

    PubMed

    Panda, Amaresh C; De, Supriyo; Grammatikakis, Ioannis; Munk, Rachel; Yang, Xiaoling; Piao, Yulan; Dudekula, Dawood B; Abdelmohsen, Kotb; Gorospe, Myriam

    2017-07-07

    High-throughput RNA sequencing methods coupled with specialized bioinformatic analyses have recently uncovered tens of thousands of unique circular (circ)RNAs, but their complete sequences, genes of origin and functions are largely unknown. Given that circRNAs lack free ends and are thus relatively stable, their association with microRNAs (miRNAs) and RNA-binding proteins (RBPs) can influence gene expression programs. While exoribonuclease treatment is widely used to degrade linear RNAs and enrich circRNAs in RNA samples, it does not efficiently eliminate all linear RNAs. Here, we describe a novel method for the isolation of highly pure circRNA populations involving RNase R treatment followed by Polyadenylation and poly(A)+ RNA Depletion (RPAD), which removes linear RNA to near completion. High-throughput sequencing of RNA prepared using RPAD from human cervical carcinoma HeLa cells and mouse C2C12 myoblasts led to two surprising discoveries: (i) many exonic circRNA (EcircRNA) isoforms share an identical backsplice sequence but have different body sizes and sequences, and (ii) thousands of novel intronic circular RNAs (IcircRNAs) are expressed in cells. In sum, isolating high-purity circRNAs using the RPAD method can enable quantitative and qualitative analyses of circRNA types and sequence composition, paving the way for the elucidation of circRNA functions. Published by Oxford University Press on behalf of Nucleic Acids Research 2017.

  16. High-purity circular RNA isolation method (RPAD) reveals vast collection of intronic circRNAs

    PubMed Central

    De, Supriyo; Grammatikakis, Ioannis; Munk, Rachel; Yang, Xiaoling; Piao, Yulan; Dudekula, Dawood B.; Gorospe, Myriam

    2017-01-01

    Abstract High-throughput RNA sequencing methods coupled with specialized bioinformatic analyses have recently uncovered tens of thousands of unique circular (circ)RNAs, but their complete sequences, genes of origin and functions are largely unknown. Given that circRNAs lack free ends and are thus relatively stable, their association with microRNAs (miRNAs) and RNA-binding proteins (RBPs) can influence gene expression programs. While exoribonuclease treatment is widely used to degrade linear RNAs and enrich circRNAs in RNA samples, it does not efficiently eliminate all linear RNAs. Here, we describe a novel method for the isolation of highly pure circRNA populations involving RNase R treatment followed by Polyadenylation and poly(A)+ RNA Depletion (RPAD), which removes linear RNA to near completion. High-throughput sequencing of RNA prepared using RPAD from human cervical carcinoma HeLa cells and mouse C2C12 myoblasts led to two surprising discoveries: (i) many exonic circRNA (EcircRNA) isoforms share an identical backsplice sequence but have different body sizes and sequences, and (ii) thousands of novel intronic circular RNAs (IcircRNAs) are expressed in cells. In sum, isolating high-purity circRNAs using the RPAD method can enable quantitative and qualitative analyses of circRNA types and sequence composition, paving the way for the elucidation of circRNA functions. PMID:28444238

  17. In silico identification and analysis of phytoene synthase genes in plants.

    PubMed

    Han, Y; Zheng, Q S; Wei, Y P; Chen, J; Liu, R; Wan, H J

    2015-08-14

    In this study, we examined phytoene synthetase (PSY), the first key limiting enzyme in the synthesis of carotenoids and catalyzing the formation of geranylgeranyl pyrophosphate in terpenoid biosynthesis. We used known amino acid sequences of the PSY gene in tomato plants to conduct a genome-wide search and identify putative candidates in 34 sequenced plants. A total of 101 homologous genes were identified. Phylogenetic analysis revealed that PSY evolved independently in algae as well as monocotyledonous and dicotyledonous plants. Our results showed that the amino acid structures exhibited 5 motifs (motifs 1 to 5) in algae and those in higher plants were highly conserved. The PSY gene structures showed that the number of intron in algae varied widely, while the number of introns in higher plants was 4 to 5. Identification of PSY genes in plants and the analysis of the gene structure may provide a theoretical basis for studying evolutionary relationships in future analyses.

  18. The active gene that encodes human High Mobility Group 1 protein (HMG1) contains introns and maps to chromosome 13

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ferrari, S.; Finelli, P.; Rocchi, M.

    The human genome contains a large number of sequences related to the cDNA for High Mobility Group 1 protein (HMG1), which so far has hampered the cloning and mapping of the active HMG1 gene. We show that the human HMG1 gene contains introns, while the HMG1-related sequences do not and most likely are retrotransposed pseudogenes. We identified eight YACs from the ICI and CEPH libraries that contain the human HMG1 gene. The HMG1 gene is similar in structure to the previously characterized murine homologue and maps to human chromosome 13 and q12, as determined by in situ hybridization. The mousemore » Hmg1 gene maps to the telomeric region of murine Chromosome 5, which is syntenic to the human 13q12 band. 18 refs., 3 figs.« less

  19. Diverse Forms of RPS9 Splicing Are Part of an Evolving Autoregulatory Circuit

    PubMed Central

    Plocik, Alex M.; Guthrie, Christine

    2012-01-01

    Ribosomal proteins are essential to life. While the functions of ribosomal protein-encoding genes (RPGs) are highly conserved, the evolution of their regulatory mechanisms is remarkably dynamic. In Saccharomyces cerevisiae, RPGs are unusual in that they are commonly present as two highly similar gene copies and in that they are over-represented among intron-containing genes. To investigate the role of introns in the regulation of RPG expression, we constructed 16 S. cerevisiae strains with precise deletions of RPG introns. We found that several yeast introns function to repress rather than to increase steady-state mRNA levels. Among these, the RPS9A and RPS9B introns were required for cross-regulation of the two paralogous gene copies, which is consistent with the duplication of an autoregulatory circuit. To test for similar intron function in animals, we performed an experimental test and comparative analyses for autoregulation among distantly related animal RPS9 orthologs. Overexpression of an exogenous RpS9 copy in Drosophila melanogaster S2 cells induced alternative splicing and degradation of the endogenous copy by nonsense-mediated decay (NMD). Also, analysis of expressed sequence tag data from distantly related animals, including Homo sapiens and Ciona intestinalis, revealed diverse alternatively-spliced RPS9 isoforms predicted to elicit NMD. We propose that multiple forms of splicing regulation among RPS9 orthologs from various eukaryotes operate analogously to translational repression of the alpha operon by S4, the distant prokaryotic ortholog. Thus, RPS9 orthologs appear to have independently evolved variations on a fundamental autoregulatory circuit. PMID:22479208

  20. Neurodegenerative disorder FTDP-17-related tau intron 10 +16C → T mutation increases tau exon 10 splicing and causes tauopathy in transgenic mice.

    PubMed

    Umeda, Tomohiro; Yamashita, Takenari; Kimura, Tetsuya; Ohnishi, Kiyouhisa; Takuma, Hiroshi; Ozeki, Tomoko; Takashima, Akihiko; Tomiyama, Takami; Mori, Hiroshi

    2013-07-01

    Frontotemporal dementia and parkinsonism linked to chromosome 17 (FTDP-17) is a neurodegenerative disorder caused by mutations in the tau gene. Many mutations identified in FTDP-17 have been shown to affect tau exon 10 splicing in vitro, which presumably causes pathologic imbalances in exon 10(-) [3-repeat (3R)] and exon 10(+) [4-repeat (4R)] tau expression and leads to intracellular inclusions of hyperphosphorylated tau in patient brains. However, no reports have investigated this theory using model mice with a tau intronic mutation. Herein, we generated new transgenic mice harboring the tau intron 10 +16C → T mutation. We prepared a transgene construct containing intronic sequences required for exon 10 splicing in the longest tau isoform cDNA. Although mice bearing the construct without the intronic mutation showed normal developmental changes of the tau isoform from 3R tau to equal amounts of 3R and 4R tau, mice with the mutation showed much higher levels of 4R tau at the adult stage. 4R tau was selectively recovered in insoluble brain fractions in their old age. Furthermore, these mice displayed abnormal tau phosphorylation, synapse loss and dysfunction, memory impairment, glial activation, tangle formation, and neuronal loss in an age-dependent manner. These findings provide the first evidence in a mouse model that a tau intronic mutation-induced imbalance of 3R and 4R tau could be a cause of tauopathy. Copyright © 2013 American Society for Investigative Pathology. Published by Elsevier Inc. All rights reserved.

  1. Association of IL-4 (intron 3) and IL-10 (-1082) gene polymorphisms with risk of mitral valve disease in children with rheumatic heart disease.

    PubMed

    Yousry, Sherif M; Sedky, Yasser; Sobieh, Alaa

    2016-10-01

    Aim Rheumatic heart disease is an inflammatory disease of cardiac tissue. The underlying pathogenic mechanisms highlight a complex interplay of immunological, genetic, and environmental factors. The aim of the present study was to investigate whether IL-4 (intron 3) and IL-10 (-1082) gene polymorphisms could be associated with susceptibility and/or severity of rheumatic heart disease among patients from the Egyptian population. Materials and methods A cohort of 140 Egyptian children with rheumatic heart disease and 100 healthy controls were enrolled in this case-control study. Genotyping for IL-4 (intron 3) and IL-10 (-1082) gene polymorphisms was carried out for all patients using a polymerase chain reaction-based analysis. No significant difference in the distribution of genotypes and allelic frequencies between rheumatic heart disease cases and controls for IL-4 (intron 3) (p=0.17; OR 1.07, 95% CI 0.82-3.74) and IL-10 (-1082) (p=0.49; OR 1.03, 95% CI 0.65-2.71) gene polymorphisms was observed. Further categorisation of patients into mitral valve disease and combined valve disease subgroups showed that cases with mitral valve disease have significantly higher frequency of the RP2 allele of IL-4 (intron 3) (p=0.03; OR 2.98, 95% CI 1.93-6.15) and the G allele of IL-10 (-1082) (p=0.04; OR 2.14, 95% CI 1.62-4.95) when compared with controls. Discussion Our study shows that IL-4 (intron 3) and IL-10 (-1082) gene polymorphisms are not significantly associated with susceptibility to rheumatic heart disease, but they might play a role in the pathogenesis of patients with mitral valve disease.

  2. Polyoma virus small tumor antigen pre-mRNA splicing requires cooperation between two 3' splice sites.

    PubMed Central

    Ge, H; Noble, J; Colgan, J; Manley, J L

    1990-01-01

    We have studied splicing of the polyoma virus early region pre-mRNA in vitro. This RNA is alternatively spliced in vivo to produce mRNA encoding the large, middle-sized (MTAg), and small (StAg) tumor antigens. Our primary interest was to learn how the 48-nucleotide StAg intron is excised, because the length of this intron is significantly less than the apparent minimum established for mammalian introns. Although the products of all three splices are detected in vitro, characterization of the pathway and sequence requirements of StAg splicing suggests that splicing factors interact with the precursor RNA in an unexpected way to catalyze removal of this intron. Specifically, StAg splicing uses either of two lariat branch points, one of which is located only 4 nucleotides from the 3' splice site. Furthermore, the StAg splice absolutely requires that the alternative MTAg 3' splice site, located 14 nucleotides downstream of the StAg 3' splice site, be intact. Insertion mutations that increase or decrease the quality of the MTAg pyrimidine stretch enhance or repress StAg as well as MTAg splicing, and a single-base change in the MTAg AG splice acceptor totally blocks both splices. These results demonstrate the ability of two 3' splice sites to cooperate with each other to bring about removal of a single intron. Images PMID:2159146

  3. Novel strains of mice deficient for the vesicular acetylcholine transporter: insights on transcriptional regulation and control of locomotor behavior.

    PubMed

    Martins-Silva, Cristina; De Jaeger, Xavier; Guzman, Monica S; Lima, Ricardo D F; Santos, Magda S; Kushmerick, Christopher; Gomez, Marcus V; Caron, Marc G; Prado, Marco A M; Prado, Vania F

    2011-03-10

    Defining the contribution of acetylcholine to specific behaviors has been challenging, mainly because of the difficulty in generating suitable animal models of cholinergic dysfunction. We have recently shown that, by targeting the vesicular acetylcholine transporter (VAChT) gene, it is possible to generate genetically modified mice with cholinergic deficiency. Here we describe novel VAChT mutant lines. VAChT gene is embedded within the first intron of the choline acetyltransferase (ChAT) gene, which provides a unique arrangement and regulation for these two genes. We generated a VAChT allele that is flanked by loxP sequences and carries the resistance cassette placed in a ChAT intronic region (FloxNeo allele). We show that mice with the FloxNeo allele exhibit differential VAChT expression in distinct neuronal populations. These mice show relatively intact VAChT expression in somatomotor cholinergic neurons, but pronounced decrease in other cholinergic neurons in the brain. VAChT mutant mice present preserved neuromuscular function, but altered brain cholinergic function and are hyperactive. Genetic removal of the resistance cassette rescues VAChT expression and the hyperactivity phenotype. These results suggest that release of ACh in the brain is normally required to "turn down" neuronal circuits controlling locomotion.

  4. Similar Ratios of Introns to Intergenic Sequence across Animal Genomes

    PubMed Central

    Wörheide, Gert

    2017-01-01

    Abstract One central goal of genome biology is to understand how the usage of the genome differs between organisms. Our knowledge of genome composition, needed for downstream inferences, is critically dependent on gene annotations, yet problems associated with gene annotation and assembly errors are usually ignored in comparative genomics. Here, we analyze the genomes of 68 species across 12 animal phyla and some single-cell eukaryotes for general trends in genome composition and transcription, taking into account problems of gene annotation. We show that, regardless of genome size, the ratio of introns to intergenic sequence is comparable across essentially all animals, with nearly all deviations dominated by increased intergenic sequence. Genomes of model organisms have ratios much closer to 1:1, suggesting that the majority of published genomes of nonmodel organisms are underannotated and consequently omit substantial numbers of genes, with likely negative impact on evolutionary interpretations. Finally, our results also indicate that most animals transcribe half or more of their genomes arguing against differences in genome usage between animal groups, and also suggesting that the transcribed portion is more dependent on genome size than previously thought. PMID:28633296

  5. Molecular evolution and diversification of snake toxin genes, revealed by analysis of intron sequences.

    PubMed

    Fujimi, T J; Nakajyo, T; Nishimura, E; Ogura, E; Tsuchiya, T; Tamiya, T

    2003-08-14

    The genes encoding erabutoxin (short chain neurotoxin) isoforms (Ea, Eb, and Ec), LsIII (long chain neurotoxin) and a novel long chain neurotoxin pseudogene were cloned from a Laticauda semifasciata genomic library. Short and long chain neurotoxin genes were also cloned from the genome of Laticauda laticaudata, a closely related species of L. semifasciata, by PCR. A putative matrix attached region (MAR) sequence was found in the intron I of the LsIII gene. Comparative analysis of 11 structurally relevant snake toxin genes (three-finger-structure toxins) revealed the molecular evolution of these toxins. Three-finger-structure toxin genes diverged from a common ancestor through two types of evolutionary pathways (long and short types), early in the course of evolution. At a later stage of evolution in each gene, the accumulation of mutations in the exons, especially exon II, by accelerated evolution may have caused the increased diversification in their functions. It was also revealed that the putative MAR sequence found in the LsIII gene was integrated into the gene after the species-level divergence.

  6. EF-1α DNA Sequences Indicate Multiple Origins of Introduced Populations of Essigella californica (Hemiptera: Aphididae).

    PubMed

    Théry, Thomas; Brockerhoff, Eckehard G; Carnegie, Angus J; Chen, Rui; Elms, Stephen R; Hullé, Maurice; Glatz, Richard; Ortego, Jaime; Qiao, Ge-Xia; Turpeau, Évelyne; Favret, Colin

    2017-06-01

    Aphids in the pine-feeding Nearctic genus Essigella (Sternorrhyncha, Aphididae, Lachninae) have been introduced in Europe, North Africa, Oceania, and South America. Mitochondrial, nuclear, and endosymbiont DNA sequences of 12 introduced populations from three continents confirm they all belong to Essigella californica (Essig, 1909). Intron sequence variation of the nuclear gene EF-1α has revealed the existence of four distinct groups. Group I gathers one population from China, where the species is newly reported, and several from Europe (France and Italy); Group II is represented by one population from Argentina; Group III includes two populations from Southern Australia with one from New Zealand; and Group IV corresponds to five populations from Eastern and South-Eastern Australia. These results indicate that introduced populations of E. californica have at least four source populations. They also show that intron variation of EF-1α can be a method to discriminate populations of asexually reproducing aphids. © The Authors 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  7. Alternative Splicing as a Target for Cancer Treatment.

    PubMed

    Martinez-Montiel, Nancy; Rosas-Murrieta, Nora Hilda; Anaya Ruiz, Maricruz; Monjaraz-Guzman, Eduardo; Martinez-Contreras, Rebeca

    2018-02-11

    Alternative splicing is a key mechanism determinant for gene expression in metazoan. During alternative splicing, non-coding sequences are removed to generate different mature messenger RNAs due to a combination of sequence elements and cellular factors that contribute to splicing regulation. A different combination of splicing sites, exonic or intronic sequences, mutually exclusive exons or retained introns could be selected during alternative splicing to generate different mature mRNAs that could in turn produce distinct protein products. Alternative splicing is the main source of protein diversity responsible for 90% of human gene expression, and it has recently become a hallmark for cancer with a full potential as a prognostic and therapeutic tool. Currently, more than 15,000 alternative splicing events have been associated to different aspects of cancer biology, including cell proliferation and invasion, apoptosis resistance and susceptibility to different chemotherapeutic drugs. Here, we present well established and newly discovered splicing events that occur in different cancer-related genes, their modification by several approaches and the current status of key tools developed to target alternative splicing with diagnostic and therapeutic purposes.

  8. A Targeted Oligonucleotide Enhancer of SMN2 Exon 7 Splicing Forms Competing Quadruplex and Protein Complexes in Functional Conditions

    PubMed Central

    Smith, Lindsay D.; Dickinson, Rachel L.; Lucas, Christian M.; Cousins, Alex; Malygin, Alexey A.; Weldon, Carika; Perrett, Andrew J.; Bottrill, Andrew R.; Searle, Mark S.; Burley, Glenn A.; Eperon, Ian C.

    2014-01-01

    Summary The use of oligonucleotides to activate the splicing of selected exons is limited by a poor understanding of the mechanisms affected. A targeted bifunctional oligonucleotide enhancer of splicing (TOES) anneals to SMN2 exon 7 and carries an exonic splicing enhancer (ESE) sequence. We show that it stimulates splicing specifically of intron 6 in the presence of repressing sequences in intron 7. Complementarity to the 5′ end of exon 7 increases U2AF65 binding, but the ESE sequence is required for efficient recruitment of U2 snRNP. The ESE forms at least three coexisting discrete states: a quadruplex, a complex containing only hnRNP F/H, and a complex enriched in the activator SRSF1. Neither hnRNP H nor quadruplex formation contributes to ESE activity. The results suggest that splicing limited by weak signals can be rescued by rapid exchange of TOES oligonucleotides in various complexes and raise the possibility that SR proteins associate transiently with ESEs. PMID:25263560

  9. Analysis of PAC1 receptor gene variants in Caucasian and African American infants dying of sudden infant death syndrome.

    PubMed

    Barrett, Karlene T; Rodikova, Ekaterina; Weese-Mayer, Debra E; Rand, Casey M; Marazita, Mary L; Cooper, Margaret E; Berry-Kravis, Elizabeth M; Bech-Hansen, N Torben; Wilson, Richard J A

    2013-12-01

    Stress peptide, pituitary adenylate cyclase-activating polypeptide (PACAP), has been implicated in sudden infant death syndrome (SIDS). The aim of this exploratory study was to determine whether variants in the gene encoding the PACAP-specific receptor, PAC1, are associated with SIDS in Caucasian and African American infants. Polymerase chain reaction and Sanger DNA sequencing was used to compare variants in the 5'-untranslated region, exons and intron-exon boundaries of the PAC1 gene in 96 SIDS cases and 96 race- and gender-matched controls. The intron 3 variant, A/G: rs758995 (variant 'h'), and the intron 6 variant, C/T: rs10081254 (variant 'n'), were significantly associated with SIDS in Caucasians and African Americans, respectively (p < 0.05). Also associated with SIDS were interactions between the variants rs2302475 (variant 'i') in PAC1 and rs8192597 and rs2856966 in PACAP among Caucasians (p < 0.02) and rs2267734 (variant 'q') in PAC1 and rs1893154 in PACAP among African Americans (p < 0.01). However, none of these differences survived post hoc analysis. Overall, this study does not support a strong association between variants in the PAC1 gene and SIDS; however, a number of potential associations between race-specific variants and SIDS were identified that warrant targeted investigations in future studies. ©2013 Foundation Acta Paediatrica. Published by John Wiley & Sons Ltd.

  10. An equine chromosome 3 inversion is associated with the tobiano spotting pattern in German horse breeds.

    PubMed

    Haase, B; Jude, R; Brooks, S A; Leeb, T

    2008-06-01

    The tobiano white-spotting pattern is one of several known depigmentation phenotypes in horses and is desired by many horse breeders and owners. The tobiano spotting phenotype is inherited as an autosomal dominant trait. Horses that are heterozygous or homozygous for the tobiano allele (To) are phenotypically indistinguishable. A SNP associated with To had previously been identified in intron 13 of the equine KIT gene and was used for an indirect gene test. The test was useful in several horse breeds. However, genotyping this sequence variant in the Lewitzer horse breed revealed that 14% of horses with the tobiano pattern did not show the polymorphism in intron 13 and consequently the test was not useful to identify putative homozygotes for To within this breed. Speculations were raised that an independent mutation might cause the tobiano spotting pattern in this breed. Recently, the putative causative mutation for To was described as a large chromosomal inversion on equine chromosome 3. One of the inversion breakpoints is approximately 70 kb downstream of the KIT gene and probably disrupts a regulatory element of the KIT gene. We obtained genotypes for the intron 13 SNP and the chromosomal inversion for 204 tobiano spotted horses and 24 control animals of several breeds. The genotyping data confirmed that the chromosomal inversion was perfectly associated with the To allele in all investigated horses. Therefore, the new test is suitable to discriminate heterozygous To/+ and homozygous To/To horses in the investigated breeds.

  11. A few nucleotide polymorphisms are sufficient to recruit nuclear factors differentially to the intron 1 of HPV-16 intratypic variants.

    PubMed

    López-Urrutia, Eduardo; Valdés, Jesús; Bonilla-Moreno, Raúl; Martínez-Salazar, Martha; Martínez-Garcia, Martha; Berumen, Jaime; Villegas-Sepúlveda, Nicolás

    2012-06-01

    The HPV-16 E6/E7 genes, which contain intron 1, are processed by alternative splicing and its transcripts are detected with a heterogeneous profile in tumours cells. Frequently, the HPV-16 positive carcinoma cells bear viral variants that contain single nucleotide polymorphisms into its DNA sequence. We were interested in analysing the contribution of this polymorphism to the heterogeneity in the pattern of the E6/E7 spliced transcripts. Using the E6/E7 sequences from three closely related HPV-16 variants, we have shown that a few nucleotide changes are sufficient to produce heterogeneity in the splicing profile. Furthermore, using mutants that contained a single SNP, we also showed that one nucleotide change was sufficient to reproduce the heterogeneous splicing profile. Additionally, a difference of two or three SNPs among these viral sequences was sufficient to recruit differentially several splicing factors to the polymorphic E6/E7 transcripts. Moreover, only one SNP was sufficient to alter the binding site of at least one splicing factor, changing the ability of splicing factors to bind the transcript. Finally, the factors that were differentially bound to the short form of intron 1 of one of these E6/E7 variants were identified as TIA1 and/or TIAR and U1-70k, while U2AF65, U5-52k and PTB were preferentially bound to the transcript of the other variants. Copyright © 2012 Elsevier B.V. All rights reserved.

  12. Intraspecific variations of Dekkera/Brettanomyces bruxellensis genome studied by capillary electrophoresis separation of the intron splice site profiles.

    PubMed

    Vigentini, Ileana; De Lorenzis, Gabriella; Picozzi, Claudia; Imazio, Serena; Merico, Annamaria; Galafassi, Silvia; Piškur, Jure; Foschino, Roberto

    2012-06-15

    In enology, "Brett" character refers to the wine spoilage caused by the yeast Dekkera/Brettanomyces bruxellensis and its production of volatile phenolic off-flavours. However, the spoilage potential of this yeast is strain-dependent. Therefore, a rapid and reliable recognition at the strain level is a key point to avoid serious economic losses. The present work provides an operative tool to assess the genetic intraspecific variation in this species through the use of introns as molecular targets. Firstly, the available partial D./B. bruxellensis genome sequence was investigated in order to build primers annealing to introns 5' splice site sequence (ISS). This analysis allowed the detection of a non-random vocabulary flanking the site and, exploiting this feature, the creation of specific probes for strain discrimination. Secondly, the separation of the intron splice site PCR fragments was obtained throughout the set up of a capillary electrophoresis protocol, giving a 94% repeatability threshold in our experimental conditions. The comparison of results obtained with ISS-PCR/CE versus the ones performed by mtDNA RFLP revealed that the former protocol is more discriminating and allowed a reliable identification at strain level. Actually sixty D./B. bruxellensis isolates were recognised as unique strains, showing a level of similarity below 79% and confirming the high genetic polymorphism existing within the species. Two main clusters were grouped at similarity levels of about 46% and 47%, respectively, showing a poor correlation with the geographic area of isolation. Moreover, from the evolutionary point of view, the proposed technique could determine the frequency of the genome rearrangements that can occur in D./B. bruxellesis populations. Copyright © 2012 Elsevier B.V. All rights reserved.

  13. What does it take to resolve relationships and to identify species with molecular markers? An example from the epiphytic Rhipsalideae (Cactaceae).

    PubMed

    Korotkova, Nadja; Borsch, Thomas; Quandt, Dietmar; Taylor, Nigel P; Müller, Kai F; Barthlott, Wilhelm

    2011-09-01

    The Cactaceae are a major New World plant family and popular in horticulture. Still, taxonomic units and species limits have been difficult to define, and molecular phylogenetic studies so far have yielded largely unresolved trees, so relationships within Cactaceae remain insufficiently understood. This study focuses on the predominantly epiphytic tribe Rhipsalideae and evaluates the utility of a spectrum of plastid genomic regions. • We present a phylogenetic study including 52 of the 53 Rhipsalideae species and all the infraspecific taxa. Seven regions (trnK intron, matK, rbcL, rps3-rpl16, rpl16 intron, psbA-trnH, trnQ-rps16), ca. 5600 nucleotides (nt) were sequenced per sample. The regions used were evaluated for their phylogenetic performance and performance in DNA-based species recognition based on operational taxonomic units (OTUs) defined beforehand. • The Rhipsalideae are monophyletic and contain five clades that correspond to the genera Rhipsalis, Lepismium, Schlumbergera, Hatiora, and Rhipsalidopsis. The species-level tree was well resolved and supported; the rpl16 and trnK introns yielded the best phylogenetic signal. Although the psbA-trnH and trnQ-rps16 spacers were the most successful individual regions for OTU identification, their success rate did not significantly exceed 70%. The highest OTU identification rate of 97% was found using the combination of psbA-trnH, rps3-rpl16, trnK intron, and trnQ-rps16 as a minimum possible marker length (ca. 1660 nt). • The phylogenetic performance of a marker is not determined by the level of sequence variability, and species discrimination power does not necessarily correlate with phylogenetic utility.

  14. New Splice Site Acceptor Mutation in AIRE Gene in Autoimmune Polyendocrine Syndrome Type 1

    PubMed Central

    Mora, Mireia; Hanzu, Felicia A.; Pradas-Juni, Marta; Aranda, Gloria B.; Halperin, Irene; Puig-Domingo, Manuel; Aguiló, Sira; Fernández-Rebollo, Eduardo

    2014-01-01

    Autoimmune polyglandular syndrome type 1 (APS-1, OMIM 240300) is a rare autosomal recessive disorder, characterized by the presence of at least two of three major diseases: hypoparathyroidism, Addison’s disease, and chronic mucocutaneous candidiasis. We aim to identify the molecular defects and investigate the clinical and mutational characteristics in an index case and other members of a consanguineous family. We identified a novel homozygous mutation in the splice site acceptor (SSA) of intron 5 (c.653-1G>A) in two siblings with different clinical outcomes of APS-1. Coding DNA sequencing revealed that this AIRE mutation potentially compromised the recognition of the constitutive SSA of intron 5, splicing upstream onto a nearby cryptic SSA in intron 5. Surprisingly, the use of an alternative SSA entails the uncovering of a cryptic donor splice site in exon 5. This new transcript generates a truncated protein (p.A214fs67X) containing the first 213 amino acids and followed by 68 aberrant amino acids. The mutation affects the proper splicing, not only at the acceptor but also at the donor splice site, highlighting the complexity of recognizing suitable splicing sites and the importance of sequencing the intron-exon junctions for a more precise molecular diagnosis and correct genetic counseling. As both siblings were carrying the same mutation but exhibited a different APS-1 onset, and one of the brothers was not clinically diagnosed, our finding highlights the possibility to suspect mutations in the AIRE gene in cases of childhood chronic candidiasis and/or hypoparathyroidism otherwise unexplained, especially when the phenotype is associated with other autoimmune diseases. PMID:24988226

  15. Screening for microsatellite instability target genes in colorectal cancers

    PubMed Central

    Vilkki, S; Launonen, V; Karhu, A; Sistonen, P; Vastrik, I; Aaltonen, L

    2002-01-01

    Background: Defects in the DNA repair system lead to genetic instability because replication errors are not corrected. This type of genetic instability is a key event in the malignant progression of HNPCC and a subset of sporadic colon cancers and mutation rates are particularly high at short repetitive sequences. Somatic deletions of coding mononucleotide repeats have been detected, for example, in the TGFßRII and BAX genes, and recently many novel target genes for microsatellite instability (MSI) have been proposed. Novel target genes are likely to be discovered in the future. More data should be created on background mutation rates in MSI tumours to evaluate mutation rates observed in the candidate target genes. Methods: Mutation rates in 14 neutral intronic repeats were evaluated in MSI tumours. Bioinformatic searches combined with keywords related to cancer and tumour suppressor or CRC related gene homology were used to find new candidate MSI target genes. By comparison of mutation frequencies observed in intronic mononucleotide repeats versus exonic coding repeats of potential MSI target genes, the significance of the exonic mutations was estimated. Results: As expected, the length of an intronic mononucleotide repeat correlated positively with the number of slippages for both G/C and A/T repeats (p=0.0020 and p=0.0012, respectively). BRCA1, CtBP1, and Rb1 associated CtIP and other candidates were found in a bioinformatic search combined with keywords related to cancer. Sequencing showed a significantly increased mutation rate in the exonic A9 repeat of CtIP (25/109=22.9%) as compared with similar intronic repeats (p≤0.001). Conclusions: We propose a new candidate MSI target gene CtIP to be evaluated in further studies. PMID:12414815

  16. Insertion of part of an intron into the 5[prime] untranslated region of a Caenorhabditis elegans gene converts it into a trans-spliced gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Conrad, R.; Thomas, J.; Spieth, J.

    In nematodes, the RNA products of some genes are trans-spliced to a 22-nucleotide spliced leader (SL), while the RNA products of other genes are not. In Caenorhabditis elegans, there are two SLs, Sl1 and SL2, donated by two distinct small nuclear ribonucleoprotein particles in a process functionally quite similar to nuclear intron removal. The authors demonstrate here that it is possible to convert a non-trans-spliced gene into a trans-spliced gene by placement of an intron missing only the 5[prime] splice site into the 5[prime] untranslated region. Stable transgenic strains were isolated expressing a gene in which 69 nucleotides of amore » vit-5 intron, including the 3[prime] splice site, were inserted into the 5[prime] untranslated region of a vit-2/vit-6 fusion gene. The RNA product of this gene was examined by primer extension and PCR amplification. Although the vit-2/vit-6 transgene product is not normally trans-spliced, the majority of transcripts from this altered gene were trans-spliced to SL1. They termed the region of a trans-spliced mRNA precursor between the 5[prime] end and the first 3[prime] splice site an 'outrun'. The results suggest that if a transcript begins with intronlike sequence followed by a 3[prime] splice site, this alone may constitute an outrun and be sufficient to demarcate a transcript as a trans-splice acceptor. These findings leave open the possibility that specific sequences are required to increase the efficiency of trans-splicing.« less

  17. Characterization of novel RS1 exonic deletions in juvenile X-linked retinoschisis

    PubMed Central

    D’Souza, Leera; Cukras, Catherine; Antolik, Christian; Craig, Candice; He, Hong; Li, Shibo; Hejtmancik, James F.; Sieving, Paul A.; Wang, Xinjing

    2013-01-01

    Purpose X-linked juvenile retinoschisis (XLRS) is a vitreoretinal dystrophy characterized by schisis (splitting) of the inner layers of the neuroretina. Mutations within the retinoschisis (RS1) gene are responsible for this disease. The mutation spectrum consists of amino acid substitutions, splice site variations, small indels, and larger genomic deletions. Clinically, genomic deletions are rarely reported. Here, we characterize two novel full exonic deletions: one encompassing exon 1 and the other spanning exons 4–5 of the RS1 gene. We also report the clinical findings in these patients with XLRS with two different exonic deletions. Methods Unrelated XLRS men and boys and their mothers (if available) were enrolled for molecular genetics evaluation. The patients also underwent ophthalmologic examination and in some cases electroretinogram (ERG) recording. All the exons and the flanking intronic regions of the RS1 gene were analyzed with direct sequencing. Two patients with exonic deletions were further evaluated with array comparative genomic hybridization to define the scope of the genomic aberrations. After the deleted genomic region was identified, primer walking followed by direct sequencing was used to determine the exact breakpoints. Results Two novel exonic deletions of the RS1 gene were identified: one including exon 1 and the other spanning exons 4 and 5. The exon 1 deletion extends from the 5′ region of the RS1 gene (including the promoter) through intron 1 (c.(−35)-1723_c.51+2664del4472). The exon 4–5 deletion spans introns 3 to intron 5 (c.185–1020_c.522+1844del5764). Conclusions Here we report two novel exonic deletions within the RS1 gene locus. We have also described the clinical presentations and hypothesized the genomic mechanisms underlying these schisis phenotypes. PMID:24227916

  18. Characterization of novel RS1 exonic deletions in juvenile X-linked retinoschisis.

    PubMed

    D'Souza, Leera; Cukras, Catherine; Antolik, Christian; Craig, Candice; Lee, Ji-Yun; He, Hong; Li, Shibo; Smaoui, Nizar; Hejtmancik, James F; Sieving, Paul A; Wang, Xinjing

    2013-01-01

    X-linked juvenile retinoschisis (XLRS) is a vitreoretinal dystrophy characterized by schisis (splitting) of the inner layers of the neuroretina. Mutations within the retinoschisis (RS1) gene are responsible for this disease. The mutation spectrum consists of amino acid substitutions, splice site variations, small indels, and larger genomic deletions. Clinically, genomic deletions are rarely reported. Here, we characterize two novel full exonic deletions: one encompassing exon 1 and the other spanning exons 4-5 of the RS1 gene. We also report the clinical findings in these patients with XLRS with two different exonic deletions. Unrelated XLRS men and boys and their mothers (if available) were enrolled for molecular genetics evaluation. The patients also underwent ophthalmologic examination and in some cases electroretinogram (ERG) recording. All the exons and the flanking intronic regions of the RS1 gene were analyzed with direct sequencing. Two patients with exonic deletions were further evaluated with array comparative genomic hybridization to define the scope of the genomic aberrations. After the deleted genomic region was identified, primer walking followed by direct sequencing was used to determine the exact breakpoints. Two novel exonic deletions of the RS1 gene were identified: one including exon 1 and the other spanning exons 4 and 5. The exon 1 deletion extends from the 5' region of the RS1 gene (including the promoter) through intron 1 (c.(-35)-1723_c.51+2664del4472). The exon 4-5 deletion spans introns 3 to intron 5 (c.185-1020_c.522+1844del5764). Here we report two novel exonic deletions within the RS1 gene locus. We have also described the clinical presentations and hypothesized the genomic mechanisms underlying these schisis phenotypes.

  19. Complete Sequence of the Intronless Mitochondrial Genome of the Saccharomyces cerevisiae Strain CW252

    PubMed Central

    2018-01-01

    ABSTRACT The mitochondrial genomes of Saccharomyces cerevisiae strains contain up to 13 introns. An intronless recombinant genome introduced into the nuclear background of S. cerevisiae strain W303 gave the S. cerevisiae CW252 strain, which is used to model mitochondrial respiratory pathologies. The complete sequence of this mitochondrial genome was obtained using a hybrid assembling methodology. PMID:29700138

  20. Molecular and phylogenetic characterization of the homoeologous EPSP Synthase genes of allohexaploid wheat, Triticum aestivum (L.).

    PubMed

    Aramrak, Attawan; Kidwell, Kimberlee K; Steber, Camille M; Burke, Ian C

    2015-10-23

    5-Enolpyruvylshikimate-3-phosphate synthase (EPSPS) is the sixth and penultimate enzyme in the shikimate biosynthesis pathway, and is the target of the herbicide glyphosate. The EPSPS genes of allohexaploid wheat (Triticum aestivum, AABBDD) have not been well characterized. Herein, the three homoeologous copies of the allohexaploid wheat EPSPS gene were cloned and characterized. Genomic and coding DNA sequences of EPSPS from the three related genomes of allohexaploid wheat were isolated using PCR and inverse PCR approaches from soft white spring "Louise'. Development of genome-specific primers allowed the mapping and expression analysis of TaEPSPS-7A1, TaEPSPS-7D1, and TaEPSPS-4A1 on chromosomes 7A, 7D, and 4A, respectively. Sequence alignments of cDNA sequences from wheat and wheat relatives served as a basis for phylogenetic analysis. The three genomic copies of wheat EPSPS differed by insertion/deletion and single nucleotide polymorphisms (SNPs), largely in intron sequences. RT-PCR analysis and cDNA cloning revealed that EPSPS is expressed from all three genomic copies. However, TaEPSPS-4A1 is expressed at much lower levels than TaEPSPS-7A1 and TaEPSPS-7D1 in wheat seedlings. Phylogenetic analysis of 1190-bp cDNA clones from wheat and wheat relatives revealed that: 1) TaEPSPS-7A1 is most similar to EPSPS from the tetraploid AB genome donor, T. turgidum (99.7 % identity); 2) TaEPSPS-7D1 most resembles EPSPS from the diploid D genome donor, Aegilops tauschii (100 % identity); and 3) TaEPSPS-4A1 resembles EPSPS from the diploid B genome relative, Ae. speltoides (97.7 % identity). Thus, EPSPS sequences in allohexaploid wheat are preserved from the most two recent ancestors. The wheat EPSPS genes are more closely related to Lolium multiflorum and Brachypodium distachyon than to Oryza sativa (rice). The three related EPSPS homoeologues of wheat exhibited conservation of the exon/intron structure and of coding region sequence, but contained significant sequence variation within intron regions. The genome-specific primers developed will enable future characterization of natural and induced variation in EPSPS sequence and expression. This can be useful in investigating new causes of glyphosate herbicide resistance.

  1. Functional and comparative genomics analyses of pmp22 in medaka fish

    PubMed Central

    Itou, Junji; Suyama, Mikita; Imamura, Yukio; Deguchi, Tomonori; Fujimori, Kazuhiro; Yuba, Shunsuke; Kawarabayasi, Yutaka; Kawasaki, Takashi

    2009-01-01

    Background Pmp22, a member of the junction protein family Claudin/EMP/PMP22, plays an important role in myelin formation. Increase of pmp22 transcription causes peripheral neuropathy, Charcot-Marie-Tooth disease type1A (CMT1A). The pathophysiological phenotype of CMT1A is aberrant axonal myelination which induces a reduction in nerve conduction velocity (NCV). Several CMT1A model rodents have been established by overexpressing pmp22. Thus, it is thought that pmp22 expression must be tightly regulated for correct myelin formation in mammals. Interestingly, the myelin sheath is also present in other jawed vertebrates. The purpose of this study is to analyze the evolutionary conservation of the association between pmp22 transcription level and vertebrate myelin formation, and to find the conserved non-coding sequences for pmp22 regulation by comparative genomics analyses between jawed fishes and mammals. Results A transgenic pmp22 over-expression medaka fish line was established. The transgenic fish had approximately one fifth the peripheral NCV values of controls, and aberrant myelination of transgenic fish in the peripheral nerve system (PNS) was observed. We successfully confirmed that medaka fish pmp22 has the same exon-intron structure as mammals, and identified some known conserved regulatory motifs. Furthermore, we found novel conserved sequences in the first intron and 3'UTR. Conclusion Medaka fish undergo abnormalities in the PNS when pmp22 transcription increases. This result indicates that an adequate pmp22 transcription level is necessary for correct myelination of jawed vertebrates. Comparison of pmp22 orthologs between distantly related species identifies evolutionary conserved sequences that contribute to precise regulation of pmp22 expression. PMID:19534778

  2. Molecular cloning of the mouse gene coding for {alpha}{sub 2}-macroglobulin and targeting of the gene in embryonic stem cells

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Umans, L.; Serneels, L.; Hilliker, C.

    1994-08-01

    The authors have cloned the mouse gene coding for {alpha}{sub 2}-macroglobulin in overlapping {lambda} clones and have analyzed its structure. The gene contains 36 exons, coding for the 4.8-kb cDNA that we cloned previously. Including putative control elements in the 5{prime} flanking region, the gene covers about 45 kb. A region of 3.8 kb, stretching from 835 bases upstream of the cDNA start site to exon 4, including all intervening sequences, was sequenced completely. The analysis demonstrated that the putative promoter region of the mouse A2M gene differed considerably from the known promoter sequences of the human A2M gene andmore » of the rat acute-phas A2M gene. Comparison of the exon-intron structure of all known genes of the A2M family confirmed that the rat acute phase A2M gene is more closely related to the human gene than to the mouse A2M gene. To generate mice with the A2M gene inactivated, an insertion type of construct containing 7.5 kb of genomic DNA of the mouse strain 129/J, encompassing exons 16 to 19, was synthesized. A hygromycin marker gene was embedded in intron 17. After electroporation, 198 hygromycin-resistant ES cell lines were isolated and analyzed by Southern blotting. Five ES cell lines were obtained with one allele of the mouse A2M gene targeted by this insertion construct, demonstrating that the position and the characteristics of the vector served the intended goal.« less

  3. Investigation of the estrogen receptor-alpha gene with type 2 diabetes and/or nephropathy in African-American and European-American populations.

    PubMed

    Gallagher, Carla J; Keene, Keith L; Mychaleckyj, Josyf C; Langefeld, Carl D; Hirschhorn, Joel N; Henderson, Brian E; Gordon, Candace J; Freedman, Barry I; Rich, Stephen S; Bowden, Donald W; Sale, Michèle M

    2007-03-01

    The estrogen receptor-alpha gene (ESR1) was selected as a positional candidate under a type 2 diabetes linkage peak at 6q24-27. A total of 42 ESR1 single nucleotide polymorphisms (SNPs) were genotyped in 380 African-American type 2 diabetic case subjects with end-stage renal disease (ESRD) and 276 African-American control subjects. A total of 22 ancestry informative markers were also genotyped, and the program Admixmap was used to adjust allelic and haplotypic association tests for individual estimates of admixture. The most significant association with type 2 diabetes-ESRD was with rs1033182 in intron 2 (P = 0.013, admixture-adjusted P(a) = 0.021). Genotyping 17 SNPs across a region of ESR1 intron 1-intron 2 in an expanded population of 851 case and 635 control subjects supported association with rs1033182 (P = 0.004, P(a) = 0.027) and with an independent six-SNP haplotype of high linkage disequilibrium spanning 6.4 kb (P < 0.0001, P(a) < 0.0001). The same 17 ESR1 SNPs were genotyped in 300 European-American type 2 diabetes-ESRD case subjects and 310 European-American control subjects. Two intron 2 SNPs, rs2431260 (P = 0.015) and rs1709183 (P = 0.019), and a four-SNP haplotype containing these SNPs (P = 0.033) were associated with type 2 diabetes and/or ESRD. Results suggest that intron 1 and intron 2 of the ESR1 gene may contain functionally important regions related to type 2 diabetes or ESRD risk.

  4. Mosquito genomics. Highly evolvable malaria vectors: the genomes of 16 Anopheles mosquitoes.

    PubMed

    Neafsey, Daniel E; Waterhouse, Robert M; Abai, Mohammad R; Aganezov, Sergey S; Alekseyev, Max A; Allen, James E; Amon, James; Arcà, Bruno; Arensburger, Peter; Artemov, Gleb; Assour, Lauren A; Basseri, Hamidreza; Berlin, Aaron; Birren, Bruce W; Blandin, Stephanie A; Brockman, Andrew I; Burkot, Thomas R; Burt, Austin; Chan, Clara S; Chauve, Cedric; Chiu, Joanna C; Christensen, Mikkel; Costantini, Carlo; Davidson, Victoria L M; Deligianni, Elena; Dottorini, Tania; Dritsou, Vicky; Gabriel, Stacey B; Guelbeogo, Wamdaogo M; Hall, Andrew B; Han, Mira V; Hlaing, Thaung; Hughes, Daniel S T; Jenkins, Adam M; Jiang, Xiaofang; Jungreis, Irwin; Kakani, Evdoxia G; Kamali, Maryam; Kemppainen, Petri; Kennedy, Ryan C; Kirmitzoglou, Ioannis K; Koekemoer, Lizette L; Laban, Njoroge; Langridge, Nicholas; Lawniczak, Mara K N; Lirakis, Manolis; Lobo, Neil F; Lowy, Ernesto; MacCallum, Robert M; Mao, Chunhong; Maslen, Gareth; Mbogo, Charles; McCarthy, Jenny; Michel, Kristin; Mitchell, Sara N; Moore, Wendy; Murphy, Katherine A; Naumenko, Anastasia N; Nolan, Tony; Novoa, Eva M; O'Loughlin, Samantha; Oringanje, Chioma; Oshaghi, Mohammad A; Pakpour, Nazzy; Papathanos, Philippos A; Peery, Ashley N; Povelones, Michael; Prakash, Anil; Price, David P; Rajaraman, Ashok; Reimer, Lisa J; Rinker, David C; Rokas, Antonis; Russell, Tanya L; Sagnon, N'Fale; Sharakhova, Maria V; Shea, Terrance; Simão, Felipe A; Simard, Frederic; Slotman, Michel A; Somboon, Pradya; Stegniy, Vladimir; Struchiner, Claudio J; Thomas, Gregg W C; Tojo, Marta; Topalis, Pantelis; Tubio, José M C; Unger, Maria F; Vontas, John; Walton, Catherine; Wilding, Craig S; Willis, Judith H; Wu, Yi-Chieh; Yan, Guiyun; Zdobnov, Evgeny M; Zhou, Xiaofan; Catteruccia, Flaminia; Christophides, George K; Collins, Frank H; Cornman, Robert S; Crisanti, Andrea; Donnelly, Martin J; Emrich, Scott J; Fontaine, Michael C; Gelbart, William; Hahn, Matthew W; Hansen, Immo A; Howell, Paul I; Kafatos, Fotis C; Kellis, Manolis; Lawson, Daniel; Louis, Christos; Luckhart, Shirley; Muskavitch, Marc A T; Ribeiro, José M; Riehle, Michael A; Sharakhov, Igor V; Tu, Zhijian; Zwiebel, Laurence J; Besansky, Nora J

    2015-01-02

    Variation in vectorial capacity for human malaria among Anopheles mosquito species is determined by many factors, including behavior, immunity, and life history. To investigate the genomic basis of vectorial capacity and explore new avenues for vector control, we sequenced the genomes of 16 anopheline mosquito species from diverse locations spanning ~100 million years of evolution. Comparative analyses show faster rates of gene gain and loss, elevated gene shuffling on the X chromosome, and more intron losses, relative to Drosophila. Some determinants of vectorial capacity, such as chemosensory genes, do not show elevated turnover but instead diversify through protein-sequence changes. This dynamism of anopheline genes and genomes may contribute to their flexible capacity to take advantage of new ecological niches, including adapting to humans as primary hosts. Copyright © 2015, American Association for the Advancement of Science.

  5. Genetic association of ubiquilin with Alzheimer's disease and related quantitative measures.

    PubMed

    Kamboh, M I; Minster, R L; Feingold, E; DeKosky, S T

    2006-03-01

    The gene coding for ubiquilin 1 (UBQLN1) is located near a linkage peak on chromosome 9q22.2 and it also impacts the function of presenilin proteins involved in early-onset Alzheimer's disease (AD). Recently, genetic variation in UBQLN1 has been shown to affect the risk of AD in two independent family-based samples. The purpose of this study was to confirm the reported association in a large case-control sample and to also examine the association of UBQLN1 SNPs with quantitative measures of AD progression, namely age-at-onset (AAO), disease duration and Mini-Mental State Examination (MMSE) score. We examined the associations of three SNPs in the UBQLN1 gene (intron 6/A>C, intron 8/T>C and intron 9/A>G) in up to 978 LOAD cases and 808 controls. All SNPs were in significant linkage disequilibrium (P<0.0001). While modest significant associations were observed in the single-site regression analysis, 3-site haplotype analysis revealed significant associations (P<0.0001 for overall haplotype analysis). One common haplotype (H4) defined by intron 6/A-intron 8/C-intron 9/G alleles was associated with AD risk and one less common haplotype (H5) defined by intron 6/C-intron 8/C-intron 9/A alleles was associated with protection. The adjusted odds ratios with potentially one and two copies of risk haplotype H4 were 1.5 (95% CI: 0.99-2.26; P=0.054) and 3.66 (95% CI: 1.43-9.39; P=0.007), respectively, and odds ratio for haplotype H5 carriers was 0.31 (95% CI: 0.10-0.95; P=0.0398). In addition to disease risk, the homozygosity of the risk haplotype was also associated with older AAO, longer disease duration and lower MMSE score. In summary, our data from a large case-control cohort indicate that genetic variation in the UBQLN1 gene has a modest effect on risk, AAO and disease duration of AD. Our haplotype data suggest the presence of additional putative functional variants either in the UBQLN1 gene or nearby genes and provide strong justification for additional work in this region on chromosome 9.

  6. Tenebrio molitor antifreeze protein gene identification and regulation.

    PubMed

    Qin, Wensheng; Walker, Virginia K

    2006-02-15

    The yellow mealworm, Tenebrio molitor, is a freeze susceptible, stored product pest. Its winter survival is facilitated by the accumulation of antifreeze proteins (AFPs), encoded by a small gene family. We have now isolated 11 different AFP genomic clones from 3 genomic libraries. All the clones had a single coding sequence, with no evidence of intervening sequences. Three genomic clones were further characterized. All have putative TATA box sequences upstream of the coding regions and multiple potential poly(A) signal sequences downstream of the coding regions. A TmAFP regulatory region, B1037, conferred transcriptional activity when ligated to a luciferase reporter sequence and after transfection into an insect cell line. A 143 bp core promoter including a TATA box sequence was identified. Its promoter activity was increased 4.4 times by inserting an exotic 245 bp intron into the construct, similar to the enhancement of transgenic expression seen in several other systems. The addition of a duplication of the first 120 bp sequence from the 143 bp core promoter decreased promoter activity by half. Although putative hormonal response sequences were identified, none of the five hormones tested enhanced reporter activity. These studies on the mechanisms of AFP transcriptional control are important for the consideration of any transfer of freeze-resistance phenotypes to beneficial hosts.

  7. The complete chloroplast genome sequence of the medicinal plant Andrographis paniculata.

    PubMed

    Ding, Ping; Shao, Yanhua; Li, Qian; Gao, Junli; Zhang, Runjing; Lai, Xiaoping; Wang, Deqin; Zhang, Huiye

    2016-07-01

    The complete chloroplast genome of Andrographis paniculata, an important medicinal plant with great economic value, has been studied in this article. The genome size is 150,249 bp in length, with 38.3% GC content. A pair of inverted repeats (IRs, 25,300 bp) are separated by a large single copy region (LSC, 82,459 bp) and a small single-copy region (SSC, 17,190 bp). The chloroplast genome contains 114 unique genes, 80 protein-coding genes, 30 tRNA genes and 4 rRNA genes. In these genes, 15 genes contained 1 intron and 3 genes comprised of 2 introns.

  8. Phylogenetic relationships among morphotypes of Caesalpinia echinata Lam. (Caesalpinioideae: Leguminosae) evidenced by trnL intron sequences

    NASA Astrophysics Data System (ADS)

    Juchum, Fabrício Sacramento; Costa, Marco Antônio; Amorim, André Márcio; Corrêa, Ronan Xavier

    2008-11-01

    Caesalpinia echinata (brazilwood or Pernambuco wood) comprises a complex of three morphological leaf variants, characterized by differences in the number and size of the pinnae and leaflets, and occurring in allopatric and sympatric populations. The present study evaluates the utility of the chloroplast DNA trnL intron in a phylogenetic analysis of the three leaf variants along with other species of Caesalpinia and generic relatives. Our study supports the hypothesis that the name C. echinata designates a species complex and provides evidence that one of the forms, the highly divergent C. echinata large-leafleted variant, represents a distinct taxon.

  9. Systems analysis of cis-regulatory motifs in C4 photosynthesis genes using maize and rice leaf transcriptomic data during a process of de-etiolation

    PubMed Central

    Xu, Jiajia; Bräutigam, Andrea; Weber, Andreas P. M.; Zhu, Xin-Guang

    2016-01-01

    Identification of potential cis-regulatory motifs controlling the development of C4 photosynthesis is a major focus of current research. In this study, we used time-series RNA-seq data collected from etiolated maize and rice leaf tissues sampled during a de-etiolation process to systematically characterize the expression patterns of C4-related genes and to further identify potential cis elements in five different genomic regions (i.e. promoter, 5′UTR, 3′UTR, intron, and coding sequence) of C4 orthologous genes. The results demonstrate that although most of the C4 genes show similar expression patterns, a number of them, including chloroplast dicarboxylate transporter 1, aspartate aminotransferase, and triose phosphate transporter, show shifted expression patterns compared with their C3 counterparts. A number of conserved short DNA motifs between maize C4 genes and their rice orthologous genes were identified not only in the promoter, 5′UTR, 3′UTR, and coding sequences, but also in the introns of core C4 genes. We also identified cis-regulatory motifs that exist in maize C4 genes and also in genes showing similar expression patterns as maize C4 genes but that do not exist in rice C3 orthologs, suggesting a possible recruitment of pre-existing cis-elements from genes unrelated to C4 photosynthesis into C4 photosynthesis genes during C4 evolution. PMID:27436282

  10. Evaluation of the arrestin gene in patients with retinitis pigmentosa or an allied disease

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    DeStefano, D.J.; Berson, E.L.; Dryja, T.P.

    1994-09-01

    Arrestin, also called 48K protein or S-antigen, plays a role in deactivating rhodopsin, the photosensitive, seven-helix, G-protein receptor found in rod photoreceptors. In Drosophila, null mutations in arrestin genes cause a light-dependent photoreceptor degeneration. It is possible that a comparable photoreceptor degeneration in humans is caused by defects in the rod arrestin gene. In order to evaluate this possibility, we are characterizing the human arrestin locus on chromosome 2q. We screened a genomic library (5 million plaques) using an arrestin cDNA clone. Sixty-eight hybridizing clones were identified; portions of 7 clones were sequenced to determine the intron sequence flanking themore » exons. We are using SSCP analysis and direct genomic sequencing to screen the entire coding region, splice donor and acceptor sites, and the promoter region of the arrestin gene in 188 patients with autosomal dominant and 104 patients with autosomal recessive retinitis pigmentosa. We have already obtained flanking intron sequences necessary for SSCP analysis for 13 of 16 exons. So far, we have identified 4 silent base changes at codons 67 (TGC-to-TGT), 107 (CTG-to-CTC), 163 (GCC-to-GCT), and 288 (CTG-to-TGT), all with allele frequencies at 1% or less. Several other variant bands detected by SSCP analysis are currently being sequenced.« less

  11. Reduced DNA methylation of FKBP5 in Cushing's syndrome.

    PubMed

    Resmini, Eugenia; Santos, Alicia; Aulinas, Anna; Webb, Susan M; Vives-Gilabert, Yolanda; Cox, Olivia; Wand, Gary; Lee, Richard S

    2016-12-01

    FKBP5 encodes a co-chaperone of HSP90 protein that regulates intracellular glucocorticoid receptor sensitivity. When it is bound to the glucocorticoid receptor complex, cortisol binds with lower affinity to glucocorticoid receptor. Cushing's syndrome is associated with memory deficits, smaller hippocampal volumes, and wide range of cognitive impairments. We aimed at evaluating blood DNA methylation of FKBP5 and its relationship with memory and hippocampal volumes in Cushing's syndrome patients. Polymorphism rs1360780 in FKBP5 has also been assessed to determine whether genetic variations can also govern CpG methylation. Thirty-two Cushing's syndrome patients and 32 matched controls underwent memory tests, 3-Tesla MRI of the brain, and DNA extraction from total leukocytes. DNA samples were bisulfite treated, PCR amplified, and pyrosequenced to assess a total of 41CpG-dinucleotides in the introns 1, 2, 5, and 7 of FKBP5. Significantly lower intronic FKBP5 DNA methylation in CS patients compared to controls was observed in ten CpG-dinucleotides. DNA methylation at these CpGs correlated with left and right HV (Intron-2-Region-2-CpG-3: LHV, r = 0.73, p = 0.02; RHV, r = 0.58, p = 0.03). Cured and active CS patients showed both lower methylation of intron 2 (92.37, 91.8, and 93.34 %, respectively, p = 0.03 for both) and of intron 7 (77.08, 73.74, and 79.71 %, respectively, p = 0.02 and p < 0.01) than controls. Twenty-two subjects had the CC genotype, 34 had the TC genotype, and eight had the TT genotype. Lower average DNA methylation in intron 7 was observed in the TT subjects compared to CC (72.5vs. 79.5 %, p = 0.02) and to TC (72.5 vs. 79.0 %, p = 0.03). Our data demonstrate, for the first time, a reduction of intronic DNA methylation of FKBP5 in CS patients.

  12. Structure of the human type IV collagen COL4A6 gene, which is mutated in Alport syndrome-associated leiomyomatosis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang, Xu; Zhou, Jing; Reeders, S.T.

    1996-05-01

    Basement membrane (type IV) collagen, a subfamily of the collagen protein family, is encoded by six distinct genes in mammals. Three of those, COL4A3, COL4A4, and COL4A5, are linked with Alport syndrome (hereditary nephritis). Patients with leimoyomatosis associated with Alport syndrome have been shown to have deletions in the 5{prime} end of the COL4A6 gene, in addition to having deletions in COL4A6. The human COL4A6 gene is reported to be 425 kb as determined by mapping of overlapping YAC clones by probes for its 5{prime} and 3{prime} ends. In the present study we describe the complete exon/intron size pattern ofmore » the human COL4A6 gene. The 12 {lambda} phage clones characterized in the study spanned a total of 110 kb, including 85 kb of the actual gene and 25 kb of flanking sequences. The overlapping clones contained all 46 exons of the gene and all introns, except for intron 2. Since the total size of the exons and all introns except for intron 2 is about 85 kb, intron 2 must be about 340 kb. All exons of the gene were assigned to EcoRI restriction fragments to facilitate analysis of the gene in patients with leiomyomatosis associated with Alport syndrome. The exon size pattern of COL4A6 is highly homologous with that of the human and mouse COL4A2 genes, with 27 of the 46 exons of COL4A6 being identical in size between the genes. 42 refs., 2 figs., 3 tabs.« less

  13. Evolutionary pattern of mutation in the factor IX genes of great apes: How does it compare to the pattern of recent germline mutation in patients with hemophilia B?

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Grouse, L.H.; Ketterling, R.P.; Sommer, S.S.

    Most mutations causing hemophilia B have arisen within the past 150 years. By correcting for multiple biases, the underlying rates of spontaneous germline mutation have been estimated in the factor IX gene. From these rates, an underlying pattern of mutation has emerged. To determine if this pattern compares to a underlying pattern found in the great apes, sequence changes were determined in intronic regions of the factor IX gene. The following species were studied: Gorilla gorilla, Pan troglodytes (chimpanzee), Pongo pygmacus (orangutan) and Homo sapiens. Intronic sequences at least 200 bp from a splice junction were randomly chosen, amplified bymore » cross-species PCR, and sequenced. These regions are expected to be subject to little if any selective pressure. Early diverged species of Old World monkeys were also studied to help determine the direction of mutational changes. A total of 62 sequence changes were observed. Initial data suggest that the average pattern since evolution of the great apes has a paucity of transitions at CpG dinucleotides and an excess of microinsertions to microdeletions when compared to the pattern observed in humans during the past 150 years (p<.05). A larger study is in progress to confirm these results.« less

  14. Phylogeographic analysis of introns and mitochondrial DNA in the clam Ruditapes decussatus uncovers the effects of Pleistocene glaciations and endogenous barriers to gene flow.

    PubMed

    Cordero, David; Peña, Juan B; Saavedra, Carlos

    2014-02-01

    Studies on the phylogeography of species inhabiting the Mediterranean and the nearby coasts of the NE Atlantic Ocean (MEDAT) have found subdivision and/or phylogeographic structure in one or more of the Atlantic, western Mediterranean and eastern Mediterranean basins. This structure has been explained as the result of past population fragmentation caused by Pleistocene sea level changes and current patterns of marine circulation. However, the increasing use of nuclear markers has revealed that these two factors alone are not enough to explain the phylogeographic patterns, and an additional role has been suggested for endogenous barriers to gene flow or natural selection. In this article we examined the role of these factors in Ruditapes decussatus, a commercial clam species native to MEDAT. A genetic analysis of 11 populations was carried out by examining 6 introns with a PCR-RFLP technique. We found subdivision in three regions: Atlantic (ATL), western Mediterranean plus Tunisia (WMED), and Aegean and Adriatic seas (AEGAD). Two introns (Ech and Tbp) showed alleles that were restricted to AEGAD. Sequencing a subsample of individuals for these introns indicated that AEGAD-specific alleles were separate clades, thus revealing a phylogeographic brake at the WMED-AEGAD boundary. Sequencing of the mitochondrial COI locus confirmed this phylogeographic break. Dating of the AEGAD mitochondrial haplotypes and nuclear alleles with a Bayesian MCMC method revealed that they shared common ancestors in the Pleistocene. These results can be explained in the framework of Pleistocene sea level drops and patterns of gene flow in MEDAT. An additional observation was a lack of differentiation at COI between the ATL and WMED, in sharp contrast with 4 introns that showed clear genetic subdivision. Neutrality tests did not support the hypothesis of a selective sweep acting on mtDNA to explain the contrasting levels of differentiation between mitochondrial and nuclear markers across the ATL-WMED transition, and we argue that the difference between markers is best explained by the existence of an endogenous genetic barrier, rather than by a physical barrier to larval migration alone. Copyright © 2013 Elsevier Inc. All rights reserved.

  15. Splicing stimulates siRNA formation at Drosophila DNA double-strand breaks

    PubMed Central

    Merk, Karin; Breinig, Marco; Böttcher, Romy; Krebs, Stefan; Blum, Helmut; Boutros, Michael

    2017-01-01

    DNA double-strand breaks trigger the production of locus-derived siRNAs in fruit flies, human cells and plants. At least in flies, their biogenesis depends on active transcription running towards the break. Since siRNAs derive from a double-stranded RNA precursor, a major question is how broken DNA ends can generate matching sense and antisense transcripts. We performed a genome-wide RNAi-screen in cultured Drosophila cells, which revealed that in addition to DNA repair factors, many spliceosome components are required for efficient siRNA generation. We validated this observation through site-specific DNA cleavage with CRISPR-cas9 followed by deep sequencing of small RNAs. DNA breaks in intron-less genes or upstream of a gene’s first intron did not efficiently trigger siRNA production. When DNA double-strand breaks were induced downstream of an intron, however, this led to robust siRNA generation. Furthermore, a downstream break slowed down splicing of the upstream intron and a detailed analysis of siRNA coverage at the targeted locus revealed that unspliced pre-mRNA contributes the sense strand to the siRNA precursor. Since splicing factors are stimulating the response but unspliced transcripts are entering the siRNA biogenesis, the spliceosome is apparently stalled in a pre-catalytic state and serves as a signaling hub. We conclude that convergent transcription at DNA breaks is stimulated by a splicing dependent control process. The resulting double-stranded RNA is converted into siRNAs that instruct the degradation of cognate mRNAs. In addition to a potential role in DNA repair, the break-induced transcription may thus be a means to cull improper RNAs from the transcriptome of Drosophila melanogaster. Since the splicing factors identified in our screen also stimulated siRNA production from high copy transgenes, it is possible that this surveillance mechanism serves in genome defense beyond DNA double-strand breaks. PMID:28628606

  16. The mitochondrial genome of the arbuscular mycorrhizal fungus Gigaspora margarita reveals two unsuspected trans-splicing events of group I introns.

    PubMed

    Pelin, Adrian; Pombert, Jean-François; Salvioli, Alessandra; Bonen, Linda; Bonfante, Paola; Corradi, Nicolas

    2012-05-01

    • Arbuscular mycorrhizal fungi (AMF) are ubiquitous organisms that benefit ecosystems through the establishment of an association with the roots of most plants: the mycorrhizal symbiosis. Despite their ecological importance, however, these fungi have been poorly studied at the genome level. • In this study, total DNA from the AMF Gigaspora margarita was subjected to a combination of 454 and Illumina sequencing, and the resulting reads were used to assemble its mitochondrial genome de novo. This genome was annotated and compared with those of other relatives to better comprehend the evolution of the AMF lineage. • The mitochondrial genome of G. margarita is unique in many ways, exhibiting a large size (97 kbp) and elevated GC content (45%). This genome also harbors molecular events that were previously unknown to occur in fungal mitochondrial genomes, including trans-splicing of group I introns from two different genes coding for the first subunit of the cytochrome oxidase and for the small subunit of the rRNA. • This study reports the second published genome from an AMF organelle, resulting in relevant DNA sequence information from this poorly studied fungal group, and providing new insights into the frequency, origin and evolution of trans-spliced group I introns found across the mitochondrial genomes of distantly related organisms. © 2012 The Authors. New Phytologist © 2012 New Phytologist Trust.

  17. [Analysis of USH2A gene mutation in a Chinese family affected with Usher syndrome].

    PubMed

    Li, Pengcheng; Liu, Fei; Zhang, Mingchang; Wang, Qiufen; Liu, Mugen

    2015-08-01

    To investigate the disease-causing mutation in a Chinese family affected with Usher syndrome type II. All of the 11 members from the family underwent comprehensive ophthalmologic examination and hearing test, and their genomic DNA were isolated from venous leukocytes. PCR and direct sequencing of USH2A gene were performed for the proband. Wild type and mutant type minigene vectors containing exon 42, intron 42 and exon 43 of the USH2A gene were constructed and transfected into Hela cells by lipofectamine reagent. Reverse transcription (RT)-PCR was carried out to verify the splicing of the minigenes. Pedigree analysis and clinical diagnosis indicated that the patients have suffered from autosomal recessive Usher syndrome type II. DNA sequencing has detected a homozygous c.8559-2A>G mutation of the USH2A gene in the proband, which has co-segregated with the disease in the family. The mutation has affected a conserved splice site in intron 42, which has led to inactivation of the splice site. Minigene experiment has confirmed the retaining of intron 42 in mature mRNA. The c.8559-2A>G mutation in the USH2A gene probably underlies the Usher syndrome type II in this family. The splice site mutation has resulted in abnormal splicing of USH2A pre-mRNA.

  18. In situ genetic correction of F8 intron 22 inversion in hemophilia A patient-specific iPSCs.

    PubMed

    Wu, Yong; Hu, Zhiqing; Li, Zhuo; Pang, Jialun; Feng, Mai; Hu, Xuyun; Wang, Xiaolin; Lin-Peng, Siyuan; Liu, Bo; Chen, Fangping; Wu, Lingqian; Liang, Desheng

    2016-01-08

    Nearly half of severe Hemophilia A (HA) cases are caused by F8 intron 22 inversion (Inv22). This 0.6-Mb inversion splits the 186-kb F8 into two parts with opposite transcription directions. The inverted 5' part (141 kb) preserves the first 22 exons that are driven by the intrinsic F8 promoter, leading to a truncated F8 transcript due to the lack of the last 627 bp coding sequence of exons 23-26. Here we describe an in situ genetic correction of Inv22 in patient-specific induced pluripotent stem cells (iPSCs). By using TALENs, the 627 bp sequence plus a polyA signal was precisely targeted at the junction of exon 22 and intron 22 via homologous recombination (HR) with high targeting efficiencies of 62.5% and 52.9%. The gene-corrected iPSCs retained a normal karyotype following removal of drug selection cassette using a Cre-LoxP system. Importantly, both F8 transcription and FVIII secretion were rescued in the candidate cell types for HA gene therapy including endothelial cells (ECs) and mesenchymal stem cells (MSCs) derived from the gene-corrected iPSCs. This is the first report of an efficient in situ genetic correction of the large inversion mutation using a strategy of targeted gene addition.

  19. Exonization of an Intronic LINE-1 Element Causing Becker Muscular Dystrophy as a Novel Mutational Mechanism in Dystrophin Gene

    PubMed Central

    Gonçalves, Ana; Coelho, Teresa; Melo-Pires, Manuel; Sousa, Mário

    2017-01-01

    A broad mutational spectrum in the dystrophin (DMD) gene, from large deletions/duplications to point mutations, causes Duchenne/Becker muscular dystrophy (D/BMD). Comprehensive genotyping is particularly relevant considering the mutation-centered therapies for dystrophinopathies. We report the genetic characterization of a patient with disease onset at age 13 years, elevated creatine kinase levels and reduced dystrophin labeling, where multiplex-ligation probe amplification (MLPA) and genomic sequencing failed to detect pathogenic variants. Bioinformatic, transcriptomic (real time PCR, RT-PCR), and genomic approaches (Southern blot, long-range PCR, and single molecule real-time sequencing) were used to characterize the mutation. An aberrant transcript was identified, containing a 103-nucleotide insertion between exons 51 and 52, with no similarity with the DMD gene. This corresponded to the partial exonization of a long interspersed nuclear element (LINE-1), disrupting the open reading frame. Further characterization identified a complete LINE-1 (~6 kb with typical hallmarks) deeply inserted in intron 51. Haplotyping and segregation analysis demonstrated that the mutation had a de novo origin. Besides underscoring the importance of mRNA studies in genetically unsolved cases, this is the first report of a disease-causing fully intronic LINE-1 element in DMD, adding to the diversity of mutational events that give rise to D/BMD. PMID:28972564

  20. In situ genetic correction of F8 intron 22 inversion in hemophilia A patient-specific iPSCs

    PubMed Central

    Wu, Yong; Hu, Zhiqing; Li, Zhuo; Pang, Jialun; Feng, Mai; Hu, Xuyun; Wang, Xiaolin; Lin-Peng, Siyuan; Liu, Bo; Chen, Fangping; Wu, Lingqian; Liang, Desheng

    2016-01-01

    Nearly half of severe Hemophilia A (HA) cases are caused by F8 intron 22 inversion (Inv22). This 0.6-Mb inversion splits the 186-kb F8 into two parts with opposite transcription directions. The inverted 5′ part (141 kb) preserves the first 22 exons that are driven by the intrinsic F8 promoter, leading to a truncated F8 transcript due to the lack of the last 627 bp coding sequence of exons 23–26. Here we describe an in situ genetic correction of Inv22 in patient-specific induced pluripotent stem cells (iPSCs). By using TALENs, the 627 bp sequence plus a polyA signal was precisely targeted at the junction of exon 22 and intron 22 via homologous recombination (HR) with high targeting efficiencies of 62.5% and 52.9%. The gene-corrected iPSCs retained a normal karyotype following removal of drug selection cassette using a Cre-LoxP system. Importantly, both F8 transcription and FVIII secretion were rescued in the candidate cell types for HA gene therapy including endothelial cells (ECs) and mesenchymal stem cells (MSCs) derived from the gene-corrected iPSCs. This is the first report of an efficient in situ genetic correction of the large inversion mutation using a strategy of targeted gene addition. PMID:26743572

  1. Expression analysis and in silico characterization of intronic long noncoding RNAs in renal cell carcinoma: emerging functional associations

    PubMed Central

    2013-01-01

    Background Intronic and intergenic long noncoding RNAs (lncRNAs) are emerging gene expression regulators. The molecular pathogenesis of renal cell carcinoma (RCC) is still poorly understood, and in particular, limited studies are available for intronic lncRNAs expressed in RCC. Methods Microarray experiments were performed with custom-designed arrays enriched with probes for lncRNAs mapping to intronic genomic regions. Samples from 18 primary RCC tumors and 11 nontumor adjacent matched tissues were analyzed. Meta-analyses were performed with microarray expression data from three additional human tissues (normal liver, prostate tumor and kidney nontumor samples), and with large-scale public data for epigenetic regulatory marks and for evolutionarily conserved sequences. Results A signature of 29 intronic lncRNAs differentially expressed between RCC and nontumor samples was obtained (false discovery rate (FDR) <5%). A signature of 26 intronic lncRNAs significantly correlated with the RCC five-year patient survival outcome was identified (FDR <5%, p-value ≤0.01). We identified 4303 intronic antisense lncRNAs expressed in RCC, of which 22% were significantly (p <0.05) cis correlated with the expression of the mRNA in the same locus across RCC and three other human tissues. Gene Ontology (GO) analysis of those loci pointed to 'regulation of biological processes’ as the main enriched category. A module map analysis of the protein-coding genes significantly (p <0.05) trans correlated with the 20% most abundant lncRNAs, identified 51 enriched GO terms (p <0.05). We determined that 60% of the expressed lncRNAs are evolutionarily conserved. At the genomic loci containing the intronic RCC-expressed lncRNAs, a strong association (p <0.001) was found between their transcription start sites and genomic marks such as CpG islands, RNA Pol II binding and histones methylation and acetylation. Conclusion Intronic antisense lncRNAs are widely expressed in RCC tumors. Some of them are significantly altered in RCC in comparison with nontumor samples. The majority of these lncRNAs is evolutionarily conserved and possibly modulated by epigenetic modifications. Our data suggest that these RCC lncRNAs may contribute to the complex network of regulatory RNAs playing a role in renal cell malignant transformation. PMID:24238219

  2. Exome sequence analysis and follow up genotyping implicates rare ULK1 variants to be involved in susceptibility to schizophrenia

    PubMed Central

    Al Eissa, Mariam M.; Fiorentino, Alessia; Sharp, Sally I.; O'Brien, Niamh L.; Wolfe, Kate; Giaroli, Giovanni; Curtis, David; Bass, Nicholas J.

    2017-01-01

    Summary Schizophrenia (SCZ) is a severe, highly heritable psychiatric disorder. Elucidation of the genetic architecture of the disorder will facilitate greater understanding of the altered underlying neurobiological mechanisms. The aim of this study was to identify likely aetiological variants in subjects affected with SCZ. Exome sequence data from a SCZ cas–control sample from Sweden was analysed for likely aetiological variants using a weighted burden test. Suggestive evidence implicated the UNC‐51‐like kinase (ULK1) gene, and it was observed that four rare variants that were more common in the Swedish SCZ cases were also more common in UK10K SCZ cases, as compared to obesity cases. These three missense variants and one intronic variant were genotyped in the University College London cohort of 1304 SCZ cases and 1348 ethnically matched controls. All four variants were more common in the SCZ cases than controls and combining them produced a result significant at P = 0.02. The results presented here demonstrate the importance of following up exome sequencing studies using additional datasets. The roles of ULK1 in autophagy and mTOR signalling strengthen the case that these pathways may be important in the pathophysiology of SCZ. The findings reported here await independent replication. PMID:29148569

  3. Coordinated tissue-specific regulation of adjacent alternative 3′ splice sites in C. elegans

    PubMed Central

    Ragle, James Matthew; Katzman, Sol; Akers, Taylor F.; Barberan-Soler, Sergio; Zahler, Alan M.

    2015-01-01

    Adjacent alternative 3′ splice sites, those separated by ≤18 nucleotides, provide a unique problem in the study of alternative splicing regulation; there is overlap of the cis-elements that define the adjacent sites. Identification of the intron's 3′ end depends upon sequence elements that define the branchpoint, polypyrimidine tract, and terminal AG dinucleotide. Starting with RNA-seq data from germline-enriched and somatic cell-enriched Caenorhabditis elegans samples, we identify hundreds of introns with adjacent alternative 3′ splice sites. We identify 203 events that undergo tissue-specific alternative splicing. For these, the regulation is monodirectional, with somatic cells preferring to splice at the distal 3′ splice site (furthest from the 5′ end of the intron) and germline cells showing a distinct shift toward usage of the adjacent proximal 3′ splice site (closer to the 5′ end of the intron). Splicing patterns in somatic cells follow C. elegans consensus rules of 3′ splice site definition; a short stretch of pyrimidines preceding an AG dinucleotide. Splicing in germline cells occurs at proximal 3′ splice sites that lack a preceding polypyrimidine tract, and in three instances the germline-specific site lacks the AG dinucleotide. We provide evidence that use of germline-specific proximal 3′ splice sites is conserved across Caenorhabditis species. We propose that there are differences between germline and somatic cells in the way that the basal splicing machinery functions to determine the intron terminus. PMID:25922281

  4. Multiple independent origins of mitochondrial control region duplications in the order Psittaciformes

    PubMed Central

    Schirtzinger, Erin E.; Tavares, Erika S.; Gonzales, Lauren A.; Eberhard, Jessica R.; Miyaki, Cristina Y.; Sanchez, Juan J.; Hernandez, Alexis; Müeller, Heinrich; Graves, Gary R.; Fleischer, Robert C.; Wright, Timothy F.

    2012-01-01

    Mitochondrial genomes are generally thought to be under selection for compactness, due to their small size, consistent gene content, and a lack of introns or intergenic spacers. As more animal mitochondrial genomes are fully sequenced, rearrangements and partial duplications are being identified with increasing frequency, particularly in birds (Class Aves). In this study, we investigate the evolutionary history of mitochondrial control region states within the avian order Psittaciformes (parrots and cockatoos). To this aim, we reconstructed a comprehensive multi-locus phylogeny of parrots, used PCR of three diagnostic fragments to classify the mitochondrial control region state as single or duplicated, and mapped these states onto the phylogeny. We further sequenced 44 selected species to validate these inferences of control region state. Ancestral state reconstruction using a range of weighting schemes identified six independent origins of mitochondrial control region duplications within Psittaciformes. Analysis of sequence data showed that varying levels of mitochondrial gene and tRNA homology and degradation were present within a given clade exhibiting duplications. Levels of divergence between control regions within an individual varied from 0–10.9% with the differences occurring mainly between 51 and 225 nucleotides 3′ of the goose hairpin in domain I. Further investigations into the fates of duplicated mitochondrial genes, the potential costs and benefits of having a second control region, and the complex relationship between evolutionary rates, selection, and time since duplication are needed to fully explain these patterns in the mitochondrial genome. PMID:22543055

  5. Association of GSK3B With Alzheimer Disease and Frontotemporal Dementia

    PubMed Central

    Schaffer, Barbara A. J.; Bertram, Lars; Miller, Bruce L.; Mullin, Kristina; Weintraub, Sandra; Johnson, Nancy; Bigio, Eileen H.; Mesulam, Marsel; Wiedau-Pazos, Martina; Jackson, George R.; Cummings, Jeffrey L.; Cantor, Rita M.; Levey, Allan I.; Tanzi, Rudolph E.; Geschwind, Daniel H.

    2009-01-01

    Background Deposits of abnormally hyperphosphorylated tau are a hallmark of several dementias, including Alzheimer disease (AD), and about 10% of familial frontotemporal dementia (FTD) cases are caused by mutations in the tau gene. As a known tau kinase, GSK3B is a promising candidate gene in the remaining cases of FTD and in AD, for which tau mutations have not been found. Objective To examine the promoter of GSK3B and all 12 exons, including the surrounding intronic sequence, in patients with FTD, patients with AD, and aged healthy subjects to identify single-nucleotide polymorphisms associated with disease. Design, Setting, and Participants Single-nucleotide polymorphism frequency was examined in a case-control cohort of 48 patients with probable AD, 102 patients with FTD, 38 patients with primary progressive aphasia, and 85 aged healthy subjects. Results were followed up in 2 independent AD family samples consisting of 437 multiplex families with AD (National Institute of Mental Health Genetics Initiative AD Study) or 150 sibships discordant for AD (Consortium on Alzheimer’s Genetics Study). Results Several rare sequence variants in GSK3B were identified in the case-control study. An intronic polymorphism (IVS2−68G>A) occurred at more than twice the frequency among patients with FTD (10.8%) and patients with AD (14.6%) than in aged healthy subjects (4.1%). The polymorphism showed association with disease in both follow-up samples independently, although only the Consortium on Alzheimer’s Genetics sample showed the same direction of association as the case-control sample. Conclusions To our knowledge, this is the first evidence that a gene known to be involved in tau phosphorylation, GSK3B, is associated with risk for primary neurodegenerative dementias. This supports previous work in animal models suggesting that such genes are therapeutic targets. PMID:18852354

  6. The prediction of human exons by oligonucleotide composition and discriminant analysis of spliceable open reading frames

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.

    1994-12-31

    Discriminant analysis is applied to the problem of recognition 5`-, internal and 3`-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotide in protein coding and nation regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5`-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF codingmore » potential, donor splice site potential and composition of downstream introit region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5`- intron region, donor splice site, coding region, acceptor splice site and Y-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89%, for exon sequences and 98% for intron sequences. A discriminant function for 3`-exon prediction includes octanucleolide composition of upstream nation region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test of 181 complete genes with specificity 73%, and 89% exons are exactly or partially predicted. On the average, 85% of nucleotides were predicted accurately with specificity 91%.« less

  7. Genomic organization of plant aminopropyl transferases.

    PubMed

    Rodríguez-Kessler, Margarita; Delgado-Sánchez, Pablo; Rodríguez-Kessler, Gabriela Theresia; Moriguchi, Takaya; Jiménez-Bremont, Juan Francisco

    2010-07-01

    Aminopropyl transferases like spermidine synthase (SPDS; EC 2.5.1.16), spermine synthase and thermospermine synthase (SPMS, tSPMS; EC 2.5.1.22) belong to a class of widely distributed enzymes that use decarboxylated S-adenosylmethionine as an aminopropyl donor and putrescine or spermidine as an amino acceptor to form in that order spermidine, spermine or thermospermine. We describe the analysis of plant genomic sequences encoding SPDS, SPMS, tSPMS and PMT (putrescine N-methyltransferase; EC 2.1.1.53). Genome organization (including exon size, gain and loss, as well as intron number, size, loss, retention, placement and phase, and the presence of transposons) of plant aminopropyl transferase genes were compared between the genomic sequences of SPDS, SPMS and tSPMS from Zea mays, Oryza sativa, Malus x domestica, Populus trichocarpa, Arabidopsis thaliana and Physcomitrella patens. In addition, the genomic organization of plant PMT genes, proposed to be derived from SPDS during the evolution of alkaloid metabolism, is illustrated. Herein, a particular conservation and arrangement of exon and intron sequences between plant SPDS, SPMS and PMT genes that clearly differs with that of ACL5 genes, is shown. The possible acquisition of the plant SPMS exon II and, in particular exon XI in the monocot SPMS genes, is a remarkable feature that allows their differentiation from SPDS genes. In accordance with our in silico analysis, functional complementation experiments of the maize ZmSPMS1 enzyme (previously considered to be SPDS) in yeast demonstrated its spermine synthase activity. Another significant aspect is the conservation of intron sequences among SPDS and PMT paralogs. In addition the existence of microsynteny among some SPDS paralogs, especially in P. trichocarpa and A. thaliana, supports duplication events of plant SPDS genes. Based in our analysis, we hypothesize that SPMS genes appeared with the divergence of vascular plants by a processes of gene duplication and the acquisition of unique exons of as-yet unknown origin. 2010 Elsevier Masson SAS. All rights reserved.

  8. Gaucher disease: A G[sup +1][yields]A[sup +1] IVS2 splice donor site mutation causing exon 2 skipping in the acid [beta]-glucosidase mRNA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    He, Guo-Shun; Grabowski, G.A.

    1992-10-01

    Gaucher disease is the most frequent lysosomal storage disease and the most prevalent Jewish genetic disease. About 30 identified missense mutations are causal to the defective activity of acid [beta]-glucosidase in this disease. cDNAs were characterized from a moderately affected 9-year-old Ashkenazi Jewish Gaucher disease type 1 patient whose 80-years-old, enzyme-deficient, 1226G (Asn[sup 370][yields]Ser [N370S]) homozygous grandfather was nearly asymptomatic. Sequence analyses revealed four populations of cDNAs with either the 1226G mutation, an exact exon 2 ([Delta] EX2) deletion, a deletion of exon 2 and the first 115 bp of exon 3 ([Delta] EX2-3), or a completely normal sequence. Aboutmore » 50% of the cDNAs were the [Delta] EX2, the [Delta] EX2-3, and the normal cDNAs, in a ratio of 6:3:1. Specific amplification and characterization of exon 2 and 5[prime] and 3[prime] intronic flanking sequences from the structural gene demonstrated clones with either the normal sequence or with a G[sup +1][yields]A[sup +1] transition at the exon 2/intron 2 boundary. This mutation destroyed the splice donor consensus site (U1 binding site) for mRNA processing. This transition also was present at the corresponding exon/intron boundary of the highly homologous pseudogene. This new mutation, termed [open quotes]IVS2 G[sup +1],[close quotes] is the first in the Ashkenazi Jewish population. The occurrence of this [open quotes]pseudogene[close quotes]-type mutation in the structural gene indicates the role of acid [beta]-glucosidase pseudogene and structural gene rearrangements in the pathogenesis of this disease. 33 refs., 8 figs., 1 tab.« less

  9. Identification of novel point mutations in splicing sites integrating whole-exome and RNA-seq data in myeloproliferative diseases

    PubMed Central

    Spinelli, Roberta; Pirola, Alessandra; Redaelli, Sara; Sharma, Nitesh; Raman, Hima; Valletta, Simona; Magistroni, Vera; Piazza, Rocco; Gambacorti-Passerini, Carlo

    2013-01-01

    Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC3 (NM_003786.3:c.1783-1G>A), KLHDC1 (NM_172193.1:c.568-2A>G), HOOK1 (NM_015888.4:c.1662-1G>A), SMAD9 (NM_001127217.2:c.1004-1C>T), and DNAH9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC3, HOOK1. In ABCC3 and HOOK1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4–6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis. PMID:24498620

  10. Porcine calbindin-D9k gene: expression in endometrium, myometrium, and placenta in the absence of a functional estrogen response element in intron A.

    PubMed

    Krisinger, J; Jeung, E B; Simmen, R C; Leung, P C

    1995-01-01

    The expression of Calbindin-D9k (CaBP-9k) in the pig uterus and placenta was measured by Northern blot analysis and reverse transcription polymerase chain reaction (PCR), respectively. Progesterone (P4) administration to ovariectomized pigs decreased CaBP-9k mRNA levels. Expression of endometrial CaBP-9k mRNA was high on pregnancy Days 10-12 and below the detection limit on Days 15 and 18. On Day 60, expression could be detected at low levels. In myometrium and placenta, CaBP-9k mRNA expression was not detectable by Northern analysis using total RNA. Reverse-transcribed RNA from both tissues demonstrated the presence of CaBP-9k transcripts by means of PCR. The partial CaBP-9k gene was amplified by PCR and cloned to determine the sequence of intron A. In contrast to the rat CaBP-9k gene, the pig gene does not contain a functional estrogen response element (ERE) within this region. A similar ERE-like sequence located at the identical location was examined by gel retardation analysis and failed to bind the estradiol receptor. A similar disruption of this ERE-like sequence has been described in the human CaBP-9k gene, which is not expressed at any level in placenta, myometrium, or endometrium. It is concluded that the pig CaBP-9k gene is regulated in these reproductive tissues in a manner distinct from that in rat and human tissues. The regulation is probably due to a regulatory region outside of intron A, which in the rat gene contains the key cis element for uterine expression of the CaBP-9k gene.

  11. Using a minigene approach to characterize a novel splice site mutation in human F7 gene causing inherited factor VII deficiency in a Chinese pedigree.

    PubMed

    Yu, T; Wang, X; Ding, Q; Fu, Q; Dai, J; Lu, Y; Xi, X; Wang, H

    2009-11-01

    Factor VII deficiency which transmitted as an autosomal recessive disorder is a rare haemorrhagic condition. The aim of this study was to identify the molecular genetic defect and determine its functional consequences in a Chinese pedigree with FVII deficiency. The proband was diagnosed as inherited coagulation FVII deficiency by reduced plasma levels of FVII activity (4.4%) and antigen (38.5%). All nine exons and their flanking sequence of F7 gene were amplified by polymerase chain reaction (PCR) for the proband and the PCR products were directly sequenced. The compound heterozygous mutations of F7 (NM_000131.3) c.572-1G>A and F7 (NM_000131.3) c.1165T>G; p.Cys389Gly were identified in the proband's F7 gene. To investigate the splicing patterns associated with F7 c.572-1G>A, ectopic transcripts in leucocytes of the proband were analyzed. F7 minigenes, spanning from intron 4 to intron 7 and carrying either an A or a G at position -1 of intron 5, were constructed and transiently transfected into human embryonic kidney (HEK) 293T cells, followed by RT-PCR analysis. The aberrant transcripts from the F7 c.572-1G>A mutant allele were not detected by ectopic transcription study. Sequencing of the RT-PCR products from the mutant transfectant demonstrated the production of an erroneously spliced mRNA with exon 6 skipping, whereas a normal splicing occurred in the wide type transfectant. The aberrant mRNA produced from the F7 c.572-1G>A mutant allele is responsible for the factor VII deficiency in this pedigree.

  12. Characterization of the human gene (TBXAS1) encoding thromboxane synthase.

    PubMed

    Miyata, A; Yokoyama, C; Ihara, H; Bandoh, S; Takeda, O; Takahashi, E; Tanabe, T

    1994-09-01

    The gene encoding human thromboxane synthase (TBXAS1) was isolated from a human EMBL3 genomic library using human platelet thromboxane synthase cDNA as a probe. Nucleotide sequencing revealed that the human thromboxane synthase gene spans more than 75 kb and consists of 13 exons and 12 introns, of which the splice donor and acceptor sites conform to the GT/AG rule. The exon-intron boundaries of the thromboxane synthase gene were similar to those of the human cytochrome P450 nifedipine oxidase gene (CYP3A4) except for introns 9 and 10, although the primary sequences of these enzymes exhibited 35.8% identity each other. The 1.2-kb of the 5'-flanking region sequence contained potential binding sites for several transcription factors (AP-1, AP-2, GATA-1, CCAAT box, xenobiotic-response element, PEA-3, LF-A1, myb, basic transcription element and cAMP-response element). Primer-extension analysis indicated the multiple transcription-start sites, and the major start site was identified as an adenine residue located 142 bases upstream of the translation-initiation site. However, neither a typical TATA box nor a typical CAAT box is found within the 100-b upstream of the translation-initiation site. Southern-blot analysis revealed the presence of one copy of the thromboxane synthase gene per haploid genome. Furthermore, a fluorescence in situ hybridization study revealed that the human gene for thromboxane synthase is localized to band q33-q34 of the long arm of chromosome 7. A tissue-distribution study demonstrated that thromboxane synthase mRNA is widely expressed in human tissues and is particularly abundant in peripheral blood leukocyte, spleen, lung and liver. The low but significant levels of mRNA were observed in kidney, placenta and thymus.

  13. Structure of the horseradish peroxidase isozyme C genes.

    PubMed

    Fujiyama, K; Takemura, H; Shibayama, S; Kobayashi, K; Choi, J K; Shinmyo, A; Takano, M; Yamada, Y; Okada, H

    1988-05-02

    We have isolated, cloned and characterized three cDNAs and two genomic DNAs corresponding to the mRNAs and genes for the horseradish (Armoracia rusticana) peroxidase isoenzyme C (HPR C). The amino acid sequence of HRP C1, deduced from the nucleotide sequence of one of the cDNA clone, pSK1, contained the same primary sequence as that of the purified enzyme established by Welinder [FEBS Lett. 72, 19-23 (1976)] with additional sequences at the N and C terminal. All three inserts in the cDNA clones, pSK1, pSK2 and pSK3, coded the same size of peptide (308 amino acid residues) if these are processed in the same way, and the amino acid sequence were homologous to each other by 91-94%. Functional amino acids, including His40, His170, Tyr185 and Arg183 and S-S-bond-forming Cys, were conserved in the three isozymes, but a few N-glycosylation sites were not the same. Two HRP C isoenzyme genomic genes, prxC1 and prxC2, were tandem on the chromosomal DNA and each gene consisted of four exons and three introns. The positions in the exons interrupted by introns were the same in two genes. We observed a putative promoter sequence 5' upstream and a poly(A) signal 3' downstream in both genes. The gene product of prxC1 might be processed with a signal sequence of 30 amino acid residues at the N terminus and a peptide consisting of 15 amino acid residues at the C terminus.

  14. Sequence variants in ARHGAP15, COLQ and FAM155A associate with diverticular disease and diverticulitis

    PubMed Central

    Sigurdsson, Snaevar; Alexandersson, Kristjan F.; Sulem, Patrick; Feenstra, Bjarke; Gudmundsdottir, Steinunn; Halldorsson, Gisli H.; Olafsson, Sigurgeir; Sigurdsson, Asgeir; Rafnar, Thorunn; Thorgeirsson, Thorgeir; Sørensen, Erik; Nordholm-Carstensen, Andreas; Burcharth, Jakob; Andersen, Jens; Jørgensen, Henrik Stig; Possfelt-Møller, Emma; Ullum, Henrik; Thorleifsson, Gudmar; Masson, Gisli; Thorsteinsdottir, Unnur; Melbye, Mads; Gudbjartsson, Daniel F.; Stefansson, Tryggvi; Jonsdottir, Ingileif; Stefansson, Kari

    2017-01-01

    Diverticular disease is characterized by pouches (that is, diverticulae) due to weakness in the bowel wall, which can become infected and inflamed causing diverticulitis, with potentially severe complications. Here, we test 32.4 million sequence variants identified through whole-genome sequencing (WGS) of 15,220 Icelanders for association with diverticular disease (5,426 cases) and its more severe form diverticulitis (2,764 cases). Subsequently, 16 sequence variants are followed up in a diverticular disease sample from Denmark (5,970 cases, 3,020 controls). In the combined Icelandic and Danish data sets we observe significant association of intronic variants in ARHGAP15 (Rho GTPase-activating protein 15; rs4662344-T: P=1.9 × 10−18, odds ratio (OR)=1.23) and COLQ (collagen-like tail subunit of asymmetric acetylcholinesterase; rs7609897-T: P=1.5 × 10−10, OR=0.87) with diverticular disease and in FAM155A (family with sequence similarity 155A; rs67153654-A: P=3.0 × 10−11, OR=0.82) with diverticulitis. These are the first loci shown to associate with diverticular disease in a genome-wide study. PMID:28585551

  15. RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease.

    PubMed

    Xiong, Hui Y; Alipanahi, Babak; Lee, Leo J; Bretschneider, Hannes; Merico, Daniele; Yuen, Ryan K C; Hua, Yimin; Gueroussov, Serge; Najafabadi, Hamed S; Hughes, Timothy R; Morris, Quaid; Barash, Yoseph; Krainer, Adrian R; Jojic, Nebojsa; Scherer, Stephen W; Blencowe, Benjamin J; Frey, Brendan J

    2015-01-09

    To facilitate precision medicine and whole-genome annotation, we developed a machine-learning technique that scores how strongly genetic variants affect RNA splicing, whose alteration contributes to many diseases. Analysis of more than 650,000 intronic and exonic variants revealed widespread patterns of mutation-driven aberrant splicing. Intronic disease mutations that are more than 30 nucleotides from any splice site alter splicing nine times as often as common variants, and missense exonic disease mutations that have the least impact on protein function are five times as likely as others to alter splicing. We detected tens of thousands of disease-causing mutations, including those involved in cancers and spinal muscular atrophy. Examination of intronic and exonic variants found using whole-genome sequencing of individuals with autism revealed misspliced genes with neurodevelopmental phenotypes. Our approach provides evidence for causal variants and should enable new discoveries in precision medicine. Copyright © 2015, American Association for the Advancement of Science.

  16. Pectinases From Sphenophorus levis Vaurie, 1978 (Coleoptera: Curculionidae): Putative Accessory Digestive Enzymes

    PubMed Central

    Evangelista, Danilo Elton; de Paula, Fernando Fonseca Pereira; Rodrigues, André; Henrique-Silva, Flávio

    2015-01-01

    The cell wall in plants offers protection against invading organisms and is mainly composed of the polysaccharides pectin, cellulose, and hemicellulose, which can be degraded by plant cell wall degrading enzymes (PCWDEs). Such enzymes are often synthesized by free living microorganisms or endosymbionts that live in the gut of some animals, including certain phytophagous insects. Thus, the ability of an insect to degrade the cell wall was once thought to be related to endosymbiont enzyme activity. However, recent studies have revealed that some phytophagous insects are able to synthesize their own PCWDEs by endogenous genes, although questions regarding the origin of these genes remain unclear. This study describes two pectinases from the sugarcane weevil, Sphenophorus levis Vaurie, 1978 (Sl-pectinases), which is considered one of the most serious agricultural pests in Brazil. Two cDNA sequences identified in a cDNA library of the insect larvae coding for a pectin methylesterase (PME) and an endo-polygalacturonase (endo-PG)—denominated Sl-PME and Sl-endoPG, respectively—were isolated and characterized. The quantitative real-time reverse transcriptase polymerase chain reaction expression profile for both Sl-pectinases showed mRNA production mainly in the insect feeding stages and exclusively in midgut tissue of the larvae. This analysis, together Western blotting data, suggests that Sl-pectinases have a digestive role. Phylogenetic analyses indicate that Sl-PME and Sl-endoPG sequences are closely related to bacteria and fungi, respectively. Moreover, the partial genomic sequences of the pectinases were amplified from insect fat body DNA, which was certified to be free of endosymbiotic DNA. The analysis of genomic sequences revealed the existence of two small introns with 53 and 166 bp in Sl-endoPG, which is similar to the common pattern in fungal introns. In contrast, no intron was identified in the Sl-PME genomic sequence, as generally observed in bacteria. These data support the theory of horizontal gene transfer proposed for the origin of insect pectinases, reinforcing the acquisition of PME genes from bacteria and endo-PG genes from fungi. PMID:25673050

  17. Splicing predictions reliably classify different types of alternative splicing

    PubMed Central

    Busch, Anke; Hertel, Klemens J.

    2015-01-01

    Alternative splicing is a key player in the creation of complex mammalian transcriptomes and its misregulation is associated with many human diseases. Multiple mRNA isoforms are generated from most human genes, a process mediated by the interplay of various RNA signature elements and trans-acting factors that guide spliceosomal assembly and intron removal. Here, we introduce a splicing predictor that evaluates hundreds of RNA features simultaneously to successfully differentiate between exons that are constitutively spliced, exons that undergo alternative 5′ or 3′ splice-site selection, and alternative cassette-type exons. Surprisingly, the splicing predictor did not feature strong discriminatory contributions from binding sites for known splicing regulators. Rather, the ability of an exon to be involved in one or multiple types of alternative splicing is dictated by its immediate sequence context, mainly driven by the identity of the exon's splice sites, the conservation around them, and its exon/intron architecture. Thus, the splicing behavior of human exons can be reliably predicted based on basic RNA sequence elements. PMID:25805853

  18. Similar Ratios of Introns to Intergenic Sequence across Animal Genomes.

    PubMed

    Francis, Warren R; Wörheide, Gert

    2017-06-01

    One central goal of genome biology is to understand how the usage of the genome differs between organisms. Our knowledge of genome composition, needed for downstream inferences, is critically dependent on gene annotations, yet problems associated with gene annotation and assembly errors are usually ignored in comparative genomics. Here, we analyze the genomes of 68 species across 12 animal phyla and some single-cell eukaryotes for general trends in genome composition and transcription, taking into account problems of gene annotation. We show that, regardless of genome size, the ratio of introns to intergenic sequence is comparable across essentially all animals, with nearly all deviations dominated by increased intergenic sequence. Genomes of model organisms have ratios much closer to 1:1, suggesting that the majority of published genomes of nonmodel organisms are underannotated and consequently omit substantial numbers of genes, with likely negative impact on evolutionary interpretations. Finally, our results also indicate that most animals transcribe half or more of their genomes arguing against differences in genome usage between animal groups, and also suggesting that the transcribed portion is more dependent on genome size than previously thought. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  19. A molecular phylogenetic study of the subtribe Glycininae (Leguminosae) derived from the chloroplast DNA rps16 intron sequences.

    PubMed

    Lee, J; Hymowitz, T

    2001-11-01

    Phylogenetic relationships among 13 genera of the subtribe Glycininae, two genera of the allied subtribe Diocleinae that were included within Glycininae by Polhill, and two genera of the subtribe Erythrininae as outgroups were inferred from chloroplast DNA rps16 intron sequence variation. Pairwise sequence divergence values ranged from identity between Teramnus mollis and T. micans and between T. flexilis and T. labialis to 7.89% between Pueraria wallichii and Pseudeminia comosa across all accessions. Phylogenies estimated using parsimony and neighbor-joining methods revealed that (1) Glycininae is monophyletic if Pachyrhizus and Calopogonium (both Diocleinae) are included within Glycininae; (2) the genus Teramnus is closely related to Glycine, and Amphicarpaea showed a sister relationship to the clade comprising Teramnus and Glycine; (3) the expanded Glycininae including two genera of Diocleinae is divided into three branches, temporarily named I (comprising the rest of the examined taxa), II (Pueraria wallichii), and III (Mastersia), but their relationships are equivocal; and (4) the genus Pueraria, regarded as a closely related genus to Glycine, is not monophyletic and should be divided into at least four genera (a hypothesis supported previously by Lackey).

  20. Genomic resources and genetic diversity of captive lesser kudu (Tragelaphus imberbis).

    PubMed

    Bock, Friederike; Gallus, Susanne; Janke, Axel; Hailer, Frank; Steck, Beatrice L; Kumar, Vikas; Nilsson, Maria A

    2014-01-01

    The lesser kudu (Tragelaphus imberbis) is a spiral-horned antelope native to northeastern Africa. Individuals kept in zoological gardens are suspected to be highly inbred due to few founder individuals and a small breeding stock. A morphological study suggested two distinct subspecies of the lesser kudu. However, subspecies designation and population structure in zoological gardens has not been analyzed using molecular markers. We analyzed one mitochondrial marker and two nuclear intron loci (total: 2,239 nucleotides) in 52 lesser kudu individuals. Of these, 48 individuals were bred in captivity and sampled from seven different zoos. The four remaining individuals were recently captured in Somalia and are currently held in the Maktoum zoo. Maternally inherited mitochondrial sequences indicate substantial amounts of genetic variation in the zoo populations, while the biparentally inherited intron sequences are, as expected, less variable. The analyzed individuals show 10 mitochondrial haplotypes with a maximal distance of 10 mutational steps. No prominent subspecies structure is detectable in this study. For further studies of the lesser kudu population genetics, we present microsatellite markers from a low-coverage genome survey using 454 sequencing technology. © 2014 Wiley Periodicals, Inc.

  1. Mutation analysis of the chromosome 14q24.3 dihydrolipoyl succinyltransferase (DLST) gene in patients with early-onset Alzheimer disease.

    PubMed

    Cruts, M; Backhovens, H; Van Gassen, G; Theuns, J; Wang, S Y; Wehnert, A; van Duijn, C M; Karlsson, T; Hofman, A; Adolfsson, R

    1995-10-13

    Linkage analysis studies have indicated that the chromosome band 14q24.3 harbours a major gene for familial early-onset Alzheimer's disease (AD). Recently we localized the chromosome 14 AD gene (AD3) in the 6.4 cM interval between the markers D14S289 and D14S61. We mapped the gene encoding dihydrolipoyl succinyltransferase (DLST), the E2k component of human alpha-ketoglutarate dehydrogenase complex (KGDHC), in the AD3 candidate region using yeast artificial chromosomes (YACs). The DLST gene is a candidate for the AD3 gene since deficiencies in KGDHC activity have been observed in brain tissue and fibroblasts of AD patients. The 15 exons and the promoter region of the DLST gene were analysed for mutations in chromosome 14 linked AD cases and in two series of unrelated early-onset AD cases (onset age < 55 years). Sequence variations in intronic sequences (introns 3, 5 and 10) or silent mutations in exonic sequences (exons 8 and 14) were identified. However, no AD related mutations were observed, suggesting that the DLST gene is not the chromosome 14 AD3 gene.

  2. Association between polymorphisms of estrogen receptor 2 and benign prostatic hyperplasia

    PubMed Central

    KIM, SU KANG; CHUNG, JOO-HO; PARK, HYUN CHUL; KIM, JUN HO; ANN, JAE HONG; PARK, HUN KUK; LEE, SANG HYUP; YOO, KOO HAN; LEE, BYUNG-CHEOL; KIM, YOUNG OCK

    2015-01-01

    Estrogens and estrogen receptors (ESRs) have been implicated in the stimulation of aberrant prostate growth and the development of prostate diseases. The aim of the present study was to investigate four single nucleotide polymorphisms (SNPs) of the ESR2 gene in order to examine whether ESR2 is a susceptibility gene for benign prostatic hyperplasia (BPH). In order to evaluate whether an association exists between ESR2 and BPH risk, four polymorphisms [rs4986938 (intron), rs17766755 (intron), rs12435857 (intron) and rs1256049 (Val328Val)] of the ESR2 gene were genotyped by direct sequencing. A total of 94 patients with BPH and 79 control subjects were examined. SNPStats and Haploview version 4.2 we used for the genetic analysis. Multiple logistic regression models (codominant1, codominant2, dominant, recessive and log-additive) were produced in order to obtain the odds ratio, 95% confidence interval and P-value. Three SNPs (rs4986938, rs17766755 and rs12435857) showed significant associations with BPH (rs4986938, P=0.015 in log-additive model; rs17766755, P=0.033 in codominant1 model, P=0.019 in dominant model and P=0.020 in log-additive model; rs12435857, P=0.023 in dominant model and P=0.011 in log-additive model). The minor alleles of these SNPs increased the risk of BPH, and the AAC haplotype showed significant association with BPH (χ2=6.34, P=0.0118). These data suggest that the ESR2 gene may be associated with susceptibility to BPH. PMID:26640585

  3. Association between polymorphisms of estrogen receptor 2 and benign prostatic hyperplasia.

    PubMed

    Kim, Su Kang; Chung, Joo-Ho; Park, Hyun Chul; Kim, Jun Ho; Ann, Jae Hong; Park, Hun Kuk; Lee, Sang Hyup; Yoo, Koo Han; Lee, Byung-Cheol; Kim, Young Ock

    2015-11-01

    Estrogens and estrogen receptors (ESRs) have been implicated in the stimulation of aberrant prostate growth and the development of prostate diseases. The aim of the present study was to investigate four single nucleotide polymorphisms (SNPs) of the ESR2 gene in order to examine whether ESR2 is a susceptibility gene for benign prostatic hyperplasia (BPH). In order to evaluate whether an association exists between ESR2 and BPH risk, four polymorphisms [rs4986938 (intron), rs17766755 (intron), rs12435857 (intron) and rs1256049 (Val328Val)] of the ESR2 gene were genotyped by direct sequencing. A total of 94 patients with BPH and 79 control subjects were examined. SNPStats and Haploview version 4.2 we used for the genetic analysis. Multiple logistic regression models (codominant1, codominant2, dominant, recessive and log-additive) were produced in order to obtain the odds ratio, 95% confidence interval and P-value. Three SNPs (rs4986938, rs17766755 and rs12435857) showed significant associations with BPH (rs4986938, P=0.015 in log-additive model; rs17766755, P=0.033 in codominant1 model, P=0.019 in dominant model and P=0.020 in log-additive model; rs12435857, P=0.023 in dominant model and P=0.011 in log-additive model). The minor alleles of these SNPs increased the risk of BPH, and the AAC haplotype showed significant association with BPH (χ 2 =6.34, P=0.0118). These data suggest that the ESR2 gene may be associated with susceptibility to BPH.

  4. Identification of an Intronic Splicing Enhancer Essential for the Inclusion of FGFR2 Exon IIIc*S⃞

    PubMed Central

    Seth, Puneet; Miller, Heather B.; Lasda, Erika L.; Pearson, James L.; Garcia-Blanco, Mariano A.

    2008-01-01

    The ligand specificity of fibroblast growth factor receptor 2 (FGFR2) is determined by the alternative splicing of exons 8 (IIIb) or 9 (IIIc). Exon IIIb is included in epithelial cells, whereas exon IIIc is included in mesenchymal cells. Although a number of cis elements and trans factors have been identified that play a role in exon IIIb inclusion in epithelium, little is known about the activation of exon IIIc in mesenchyme. We report here the identification of a splicing enhancer required for IIIc inclusion. This 24-nucleotide (nt) downstream intronic splicing enhancer (DISE) is located within intron 9 immediately downstream of exon IIIc. DISE was able to activate the inclusion of heterologous exons rat FGFR2 IIIb and human β-globin exon 2 in cell lines from different tissues and species and also in HeLa cell nuclear extracts in vitro. DISE was capable of replacing the intronic activator sequence 1 (IAS1), a known IIIb splicing enhancer and vice versa. This fact, together with the requirement for DISE to be close to the 5′-splice site and the ability of DISE to promote binding of U1 snRNP, suggested that IAS1 and DISE belong to the same class of cis-acting elements. PMID:18256031

  5. Multiple splicing defects in an intronic false exon.

    PubMed

    Sun, H; Chasin, L A

    2000-09-01

    Splice site consensus sequences alone are insufficient to dictate the recognition of real constitutive splice sites within the typically large transcripts of higher eukaryotes, and large numbers of pseudoexons flanked by pseudosplice sites with good matches to the consensus sequences can be easily designated. In an attempt to identify elements that prevent pseudoexon splicing, we have systematically altered known splicing signals, as well as immediately adjacent flanking sequences, of an arbitrarily chosen pseudoexon from intron 1 of the human hprt gene. The substitution of a 5' splice site that perfectly matches the 5' consensus combined with mutation to match the CAG/G sequence of the 3' consensus failed to get this model pseudoexon included as the central exon in a dhfr minigene context. Provision of a real 3' splice site and a consensus 5' splice site and removal of an upstream inhibitory sequence were necessary and sufficient to confer splicing on the pseudoexon. This activated context also supported the splicing of a second pseudoexon sequence containing no apparent enhancer. Thus, both the 5' splice site sequence and the polypyrimidine tract of the pseudoexon are defective despite their good agreement with the consensus. On the other hand, the pseudoexon body did not exert a negative influence on splicing. The introduction into the pseudoexon of a sequence selected for binding to ASF/SF2 or its replacement with beta-globin exon 2 only partially reversed the effect of the upstream negative element and the defective polypyrimidine tract. These results support the idea that exon-bridging enhancers are not a prerequisite for constitutive exon definition and suggest that intrinsically defective splice sites and negative elements play important roles in distinguishing the real splicing signal from the vast number of false splicing signals.

  6. Tumor necrosis factor-alpha and interleukin-4 gene polymorphisms in Chinese patients with gout.

    PubMed

    Chen, M-L; Tsai, F-J; Tsai, C-H; Huang, C-M

    2007-01-01

    The purpose of this study was to examine whether polymorphisms of interleukin-4 (IL-4) (promoter-590 and intron 3) and tumor necrosis factor-alpha (TNF-alpha) promoter-308 genes are markers of susceptibility to or clinical manifestations of gout in Taiwanese patients. The study included 196 Taiwanese patients with gout and 103 unrelated healthy control subjects living in central Taiwan. Polymorphisms of the IL-4 (promoter-590 and intron 3) and TNF-alpha (promoter-308) genes were typed from genomic DNA. Allelic frequencies and carriage rates were then compared between gout patients and control subjects. The correlation between allelic frequencies, carriage rates and clinical manifestations of gout were evaluated. No significant differences were observed in the allelic frequencies and carriage rates of the IL-4 (promoter-590 and intron 3) and TNF-alpha gene polymorphisms between patients with gout and healthy control subjects. Furthermore, the IL-4 (promoter-590 and intron 3) and TNF-alpha genotypes were not found to be associated with the clinical and laboratory profiles in gout patients. However, there was a significant difference in the TNF-alphapolymorphism genotype between patients with and without hypertriglyceridemia (P=0.001, xi2=11.47, OR=10.3, 95%CI=3.57-29.7). The results of our study suggest that polymorphisms of the IL-4 (promoter-590 and intron 3) and TNF-alpha promoter-308 genes are not related to gout in Chinese patients in Taiwan.

  7. Novel nuclear intron-spanning primers for Arecaceae evolutionary biology.

    PubMed

    Bacon, Christine D; Feltus, F Alex; Paterson, Andrew H; Bailey, C Donovan

    2008-01-01

    In this study, 96 nuclear 'conserved intron-scanning primers' were screened across subfamilies the Arecaceae (palms) for potential use in research focused on palm evolutionary biology. Primers were evaluated based on their ability to amplify single polymerase chain reaction products in Arecaceae, the clarity of sequencing reads, and the interspecific variability observed. Ultimately, the results suggest that: (i) seven of the loci are likely to be suitable when comparing non-Arecaceae outgroups and Arecaceae ingroups; (ii) seven loci may be of use when comparing subfamilies of Arecaceae; and (iii) four of the loci may be of use when comparing closely related genera. © 2007 Blackwell Publishing Ltd No claim to original US government works.

  8. Low-copy nuclear primers and ycf1 primers in Cactaceae.

    PubMed

    Franck, Alan R; Cochrane, Bruce J; Garey, James R

    2012-10-01

    To increase the number of variable regions available for phylogenetic study in the Cactaceae, primers were developed for a portion of the plastid ycf1 gene and intron-spanning regions of two low-copy nuclear genes (isi1, nhx1). • Primers were tested on several families within Caryophyllales, focusing on the Cactaceae. Gel electrophoresis indicated positive amplification in most samples. Sequences of these three regions (isi1, nhx1, ycf1) from Harrisia exhibited variation similar to or greater than two plastid regions (atpB-rbcL intergenic spacer and rpl16 intron). • The isi, nhx, and ycf1 primers amplify phylogenetically useful information applicable to the Cactaceae and other families in the Caryophyllales.

  9. Novel p53 tumour suppressor mutations in cases of spindle cell sarcoma, pleomorphic sarcoma and fibrosarcoma in cats.

    PubMed

    Mayr, B; Reifinger, M; Alton, K; Schaffner, G

    1998-06-01

    Twenty feline neoplasms were sequenced in the region from exons 5 to 8 for the presence of tumour suppressor gene p53 mutations. In a spindle cell sarcoma of the bladder, a missense mutation (codon 164 AAG-->GAG, lysine-->glutamic acid) in exon 5 was detected. In a pleomorphic sarcoma, a 23 bp deletion involving the splicing junction between intron 5 and exon 6 was observed. In a fibrosarcoma, a 6 bp deletion of p53 covering 2 bp of exon 7 and 4 bp of intron 7, including the splicing junction, was found. The study demonstrates three new p53 mutations in different types of sarcomas in cats.

  10. Reduced mutation rate in exons due to differential mismatch repair

    PubMed Central

    Mularoni, Loris; Muiños, Ferran; Gonzalez-Perez, Abel; López-Bigas, Núria

    2017-01-01

    While recent studies have revealed higher than anticipated heterogeneity of mutation rate across genomic regions, mutations in exons and introns are assumed to be generated at the same rate. Here we find fewer somatic mutations in exons than expected based on their sequence content, and demonstrate that this is not due to purifying selection. Moreover, we show that it is caused by higher mismatch repair activity in exonic than in intronic regions. Our findings have important implications for our understanding of mutational and DNA repair processes, our knowledge of the evolution of eukaryotic genes, and practical ramifications for the study of the evolution of both tumors and species. PMID:29106418

  11. Beta-globin LCR and intron elements cooperate and direct spatial reorganization for gene therapy.

    PubMed

    Buzina, Alla; Lo, Mandy Y M; Moffett, Angela; Hotta, Akitsu; Fussner, Eden; Bharadwaj, Rikki R; Pasceri, Peter; Garcia-Martinez, J Victor; Bazett-Jones, David P; Ellis, James

    2008-04-11

    The Locus Control Region (LCR) requires intronic elements within beta-globin transgenes to direct high level expression at all ectopic integration sites. However, these essential intronic elements cannot be transmitted through retrovirus vectors and their deletion may compromise the therapeutic potential for gene therapy. Here, we systematically regenerate functional beta-globin intron 2 elements that rescue LCR activity directed by 5'HS3. Evaluation in transgenic mice demonstrates that an Oct-1 binding site and an enhancer in the intron cooperate to increase expression levels from LCR globin transgenes. Replacement of the intronic AT-rich region with the Igmu 3'MAR rescues LCR activity in single copy transgenic mice. Importantly, a combination of the Oct-1 site, Igmu 3'MAR and intronic enhancer in the BGT158 cassette directs more consistent levels of expression in transgenic mice. By introducing intron-modified transgenes into the same genomic integration site in erythroid cells, we show that BGT158 has the greatest transcriptional induction. 3D DNA FISH establishes that induction stimulates this small 5'HS3 containing transgene and the endogenous locus to spatially reorganize towards more central locations in erythroid nuclei. Electron Spectroscopic Imaging (ESI) of chromatin fibers demonstrates that ultrastructural heterochromatin is primarily perinuclear and does not reorganize. Finally, we transmit intron-modified globin transgenes through insulated self-inactivating (SIN) lentivirus vectors into erythroid cells. We show efficient transfer and robust mRNA and protein expression by the BGT158 vector, and virus titer improvements mediated by the modified intron 2 in the presence of an LCR cassette composed of 5'HS2-4. Our results have important implications for the mechanism of LCR activity at ectopic integration sites. The modified transgenes are the first to transfer intronic elements that potentiate LCR activity and are designed to facilitate correction of hemoglobinopathies using single copy vectors.

  12. Biclustering as a method for RNA local multiple sequence alignment.

    PubMed

    Wang, Shu; Gutell, Robin R; Miranker, Daniel P

    2007-12-15

    Biclustering is a clustering method that simultaneously clusters both the domain and range of a relation. A challenge in multiple sequence alignment (MSA) is that the alignment of sequences is often intended to reveal groups of conserved functional subsequences. Simultaneously, the grouping of the sequences can impact the alignment; precisely the kind of dual situation biclustering is intended to address. We define a representation of the MSA problem enabling the application of biclustering algorithms. We develop a computer program for local MSA, BlockMSA, that combines biclustering with divide-and-conquer. BlockMSA simultaneously finds groups of similar sequences and locally aligns subsequences within them. Further alignment is accomplished by dividing both the set of sequences and their contents. The net result is both a multiple sequence alignment and a hierarchical clustering of the sequences. BlockMSA was tested on the subsets of the BRAliBase 2.1 benchmark suite that display high variability and on an extension to that suite to larger problem sizes. Also, alignments were evaluated of two large datasets of current biological interest, T box sequences and Group IC1 Introns. The results were compared with alignments computed by ClustalW, MAFFT, MUCLE and PROBCONS alignment programs using Sum of Pairs (SPS) and Consensus Count. Results for the benchmark suite are sensitive to problem size. On problems of 15 or greater sequences, BlockMSA is consistently the best. On none of the problems in the test suite are there appreciable differences in scores among BlockMSA, MAFFT and PROBCONS. On the T box sequences, BlockMSA does the most faithful job of reproducing known annotations. MAFFT and PROBCONS do not. On the Intron sequences, BlockMSA, MAFFT and MUSCLE are comparable at identifying conserved regions. BlockMSA is implemented in Java. Source code and supplementary datasets are available at http://aug.csres.utexas.edu/msa/

  13. Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization.

    PubMed

    Seibt, Kathrin M; Wenke, Torsten; Muders, Katja; Truberg, Bernd; Schmidt, Thomas

    2016-05-01

    Short interspersed nuclear elements (SINEs) are highly abundant non-autonomous retrotransposons that are widespread in plants. They are short in size, non-coding, show high sequence diversity, and are therefore mostly not or not correctly annotated in plant genome sequences. Hence, comparative studies on genomic SINE populations are rare. To explore the structural organization and impact of SINEs, we comparatively investigated the genome sequences of the Solanaceae species potato (Solanum tuberosum), tomato (Solanum lycopersicum), wild tomato (Solanum pennellii), and two pepper cultivars (Capsicum annuum). Based on 8.5 Gbp sequence data, we annotated 82 983 SINE copies belonging to 10 families and subfamilies on a base pair level. Solanaceae SINEs are dispersed over all chromosomes with enrichments in distal regions. Depending on the genome assemblies and gene predictions, 30% of all SINE copies are associated with genes, particularly frequent in introns and untranslated regions (UTRs). The close association with genes is family specific. More than 10% of all genes annotated in the Solanaceae species investigated contain at least one SINE insertion, and we found genes harbouring up to 16 SINE copies. We demonstrate the involvement of SINEs in gene and genome evolution including the donation of splice sites, start and stop codons and exons to genes, enlargement of introns and UTRs, generation of tandem-like duplications and transduction of adjacent sequence regions. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  14. Mhc class II B gene evolution in East African cichlid fishes.

    PubMed

    Figueroa, F; Mayer, W E; Sültmann, H; O'hUigin, C; Tichy, H; Satta, Y; Takezaki, N; Takahata, N; Klein, J

    2000-06-01

    A distinctive feature of essential major histocompatibility complex (Mhc) loci is their polymorphism characterized by large genetic distances between alleles and long persistence times of allelic lineages. Since the lineages often span several successive speciations, we investigated the behavior of the Mhc alleles during or close to the speciation phase. We sequenced exon 2 of the class II B locus 4 from 232 East African cichlid fishes representing 32 related species. The divergence times of the (sub)species ranged from 6,000 to 8.4 million years. Two types of evolutionary analysis were used to elucidate the pattern of exon 2 sequence divergence. First, phylogenetic methods were applied to reconstruct the most likely evolutionary pathways leading from the last common ancestor of the set to the extant sequences, and to assess the probable mechanisms involved in allelic diversification. Second, pairwise comparisons of sequences were carried out to detect differences seemingly incompatible with origin by nonparallel point mutations. The analysis revealed point mutations to be the most important mechanism behind allelic divergences, with recombination playing only an auxiliary part. Comparison of sequences from related species revealed evidence of random allelic (lineage) losses apparently associated with speciation. Sharing of identical alleles could be demonstrated between species that diverged 2 million years ago. The phylogeny of the exon was incongruent with that of the flanking introns, indicating either a high degree of convergent evolution at the peptide-binding region-encoding sites, or intron homogenization.

  15. Contribution of human growth hormone-releasing hormone receptor (GHRHR) gene sequence variation to isolated severe growth hormone deficiency (ISGHD) and normal adult height.

    PubMed

    Camats, Núria; Fernández-Cancio, Mónica; Carrascosa, Antonio; Andaluz, Pilar; Albisu, M Ángeles; Clemente, María; Gussinyé, Miquel; Yeste, Diego; Audí, Laura

    2012-10-01

    Molecular causes of isolated severe growth hormone deficiency (ISGHD) in several genes have been established. The aim of this study was to analyse the contribution of growth hormone-releasing hormone receptor (GHRHR) gene sequence variation to GH deficiency in a series of prepubertal ISGHD patients and to normal adult height. A systematic GHRHR gene sequence analysis was performed in 69 ISGHD patients and 60 normal adult height controls (NAHC). Four GHRHR single-nucleotide polymorphisms (SNPs) were genotyped in 248 additional NAHC. An analysis was performed on individual SNPs and combined genotype associations with diagnosis in ISGHD patients and with height-SDS in NAHC. Twenty-one SNPs were found. P3, P13, P15 and P20 had not been previously described. Patients and controls shared 12 SNPs (P1, P2, P4-P11, P16 and P21). Significantly different frequencies of the heterozygous genotype and alternate allele were detected in P9 (exon 4, rs4988498) and P12 (intron 6, rs35609199); P9 heterozygous genotype frequencies were similar in patients and the shortest control group (heights between -2 and -1 SDS) and significantly different in controls (heights between -1 and +2 SDS). GHRHR P9 together with 4 GH1 SNP genotypes contributed to 6·2% of height-SDS variation in the entire 308 NAHC. This study established the GHRHR gene sequence variation map in ISGHD patients and NAHC. No evidence of GHRHR mutation contribution to ISGHD was found in this population, although P9 and P12 SNP frequencies were significantly different between ISGHD and NAHC. Thus, the gene sequence may contribute to normal adult height, as demonstrated in NAHC. © 2012 Blackwell Publishing Ltd.

  16. A general framework for optimization of probes for gene expression microarray and its application to the fungus Podospora anserina.

    PubMed

    Bidard, Frédérique; Imbeaud, Sandrine; Reymond, Nancie; Lespinet, Olivier; Silar, Philippe; Clavé, Corinne; Delacroix, Hervé; Berteaux-Lecellier, Véronique; Debuchy, Robert

    2010-06-18

    The development of new microarray technologies makes custom long oligonucleotide arrays affordable for many experimental applications, notably gene expression analyses. Reliable results depend on probe design quality and selection. Probe design strategy should cope with the limited accuracy of de novo gene prediction programs, and annotation up-dating. We present a novel in silico procedure which addresses these issues and includes experimental screening, as an empirical approach is the best strategy to identify optimal probes in the in silico outcome. We used four criteria for in silico probe selection: cross-hybridization, hairpin stability, probe location relative to coding sequence end and intron position. This latter criterion is critical when exon-intron gene structure predictions for intron-rich genes are inaccurate. For each coding sequence (CDS), we selected a sub-set of four probes. These probes were included in a test microarray, which was used to evaluate the hybridization behavior of each probe. The best probe for each CDS was selected according to three experimental criteria: signal-to-noise ratio, signal reproducibility, and representative signal intensities. This procedure was applied for the development of a gene expression Agilent platform for the filamentous fungus Podospora anserina and the selection of a single 60-mer probe for each of the 10,556 P. anserina CDS. A reliable gene expression microarray version based on the Agilent 44K platform was developed with four spot replicates of each probe to increase statistical significance of analysis.

  17. Three tiers of genome evolution in reptiles

    PubMed Central

    Organ, Chris L.; Moreno, Ricardo Godínez; Edwards, Scott V.

    2008-01-01

    Characterization of reptilian genomes is essential for understanding the overall diversity and evolution of amniote genomes, because reptiles, which include birds, constitute a major fraction of the amniote evolutionary tree. To better understand the evolution and diversity of genomic characteristics in Reptilia, we conducted comparative analyses of online sequence data from Alligator mississippiensis (alligator) and Sphenodon punctatus (tuatara) as well as genome size and karyological data from a wide range of reptilian species. At the whole-genome and chromosomal tiers of organization, we find that reptilian genome size distribution is consistent with a model of continuous gradual evolution while genomic compartmentalization, as manifested in the number of microchromosomes and macrochromosomes, appears to have undergone early rapid change. At the sequence level, the third genomic tier, we find that exon size in Alligator is distributed in a pattern matching that of exons in Gallus (chicken), especially in the 101—200 bp size class. A small spike in the fraction of exons in the 301 bp—1 kb size class is also observed for Alligator, but more so for Sphenodon. For introns, we find that members of Reptilia have a larger fraction of introns within the 101 bp–2 kb size class and a lower fraction of introns within the 5–30 kb size class than do mammals. These findings suggest that the mode of reptilian genome evolution varies across three hierarchical levels of the genome, a pattern consistent with a mosaic model of genomic evolution. PMID:21669810

  18. Comprehensive profiling of rhizome-associated alternative splicing and alternative polyadenylation in moso bamboo (Phyllostachys edulis).

    PubMed

    Wang, Taotao; Wang, Huiyuan; Cai, Dawei; Gao, Yubang; Zhang, Hangxiao; Wang, Yongsheng; Lin, Chentao; Ma, Liuyin; Gu, Lianfeng

    2017-08-01

    Moso bamboo (Phyllostachys edulis) represents one of the fastest-spreading plants in the world, due in part to its well-developed rhizome system. However, the post-transcriptional mechanism for the development of the rhizome system in bamboo has not been comprehensively studied. We therefore used a combination of single-molecule long-read sequencing technology and polyadenylation site sequencing (PAS-seq) to re-annotate the bamboo genome, and identify genome-wide alternative splicing (AS) and alternative polyadenylation (APA) in the rhizome system. In total, 145 522 mapped full-length non-chimeric (FLNC) reads were analyzed, resulting in the correction of 2241 mis-annotated genes and the identification of 8091 previously unannotated loci. Notably, more than 42 280 distinct splicing isoforms were derived from 128 667 intron-containing full-length FLNC reads, including a large number of AS events associated with rhizome systems. In addition, we characterized 25 069 polyadenylation sites from 11 450 genes, 6311 of which have APA sites. Further analysis of intronic polyadenylation revealed that LTR/Gypsy and LTR/Copia were two major transposable elements within the intronic polyadenylation region. Furthermore, this study provided a quantitative atlas of poly(A) usage. Several hundred differential poly(A) sites in the rhizome-root system were identified. Taken together, these results suggest that post-transcriptional regulation may potentially have a vital role in the underground rhizome-root system. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  19. Molecular Evolution of the Non-Coding Eosinophil Granule Ontogeny Transcript

    PubMed Central

    Rose, Dominic; Stadler, Peter F.

    2011-01-01

    Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs). The evolutionary history of mlncRNAs is still largely uncharted territory. In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT), an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs). EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny, analyze patterns of sequence conservation, and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrate here for the first time that the whole EGOT locus is highly structured, containing several evolutionary conserved, and thermodynamic stable secondary structures. Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element. PMID:22303364

  20. Utility of 17 chloroplast genes for inferring the phylogeny of the basal angiosperms.

    PubMed

    Graham, S W; Olmstead, R G

    2000-11-01

    Sequences from 14 slowly evolving chloroplast genes (including three highly conserved introns) were obtained for representative basal angiosperm and seed-plant taxa, using novel primers described here. These data were combined with published sequences from atpB, rbcL, and newly obtained sequences from ndhF. Combined data from these 17 genes permit sturdy, well-resolved inference of major aspects of basal angiosperm relationships, demonstrating that the new primers are valuable tools for sorting out the deepest events in flowering plant phylogeny. Sequences from the inverted repeat (IR) proved to be particularly reliable (low homoplasy, high retention index). Representatives of Cabomba and Illicium were the first two successive branches of the angiosperms in an initial sampling of 19 exemplar taxa. This result was strongly supported by bootstrap analysis and by two small insertion/deletion events in the slowly evolving introns. Several paleoherb groups (representatives of Piperales) formed a strongly supported clade with taxa representing core woody magnoliids (Laurales, Magnoliales, and Winteraceae). The monophyly of the sampled eudicots and monocots was also well supported. Analyses of three major partitions of the data showed many of the same clades and supported the rooting seen with all the data combined. While Amborella trichopoda was supported as the sister group of the remaining angiosperms when we added Amborella and Nymphaea odorata to the analysis, a strongly conflicting rooting was observed when Amborella alone was added.

Top