Sultan, Laure D.; Grewe, Felix; Rolle, Katarzyna; Abudraham, Sivan; Shevtsov, Sofia; Klipcan, Liron; Barciszewski, Jan; Dietrich, André
2016-01-01
Group II introns are large catalytic RNAs that are ancestrally related to nuclear spliceosomal introns. Sequences corresponding to group II RNAs are found in many prokaryotes and are particularly prevalent within plants organellar genomes. Proteins encoded within the introns themselves (maturases) facilitate the splicing of their own host pre-RNAs. Mitochondrial introns in plants have diverged considerably in sequence and have lost their maturases. In angiosperms, only a single maturase has been retained in the mitochondrial DNA: the matR gene found within NADH dehydrogenase 1 (nad1) intron 4. Its conservation across land plants and RNA editing events, which restore conserved amino acids, indicates that matR encodes a functional protein. However, the biological role of MatR remains unclear. Here, we performed an in vivo investigation of the roles of MatR in Brassicaceae. Directed knockdown of matR expression via synthetically designed ribozymes altered the processing of various introns, including nad1 i4. Pull-down experiments further indicated that MatR is associated with nad1 i4 and several other intron-containing pre-mRNAs. MatR may thus represent an intermediate link in the gradual evolutionary transition from the intron-specific maturases in bacteria into their versatile spliceosomal descendants in the nucleus. The similarity between maturases and the core spliceosomal Prp8 protein further supports this intriguing theory. PMID:27760804
Isolation and identification of gene-specific microRNAs.
Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao
2006-01-01
Prediction of microRNA (miRNA) candidates using computer programming has identified hundreds and hundreds of genomic hairpin sequences, of which, the functions remain to be determined. Because direct transfection of hairpin-like miRNA precursors (pre)-miRNAs in mammalian cells is not always sufficient to trigger effective RNA-induced gene-silencing complex (RISC) assembly, a key step for RNA interference (RNAi)-related gene silencing, we developed an intronic miRNA-expressing system to overcome this problem, and successfully increased the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. By insertion of a hairpin-like pre-miRNA structure into the intron region of a gene, this intronic miRNA biogenesis system has been found to depend on a coupled interaction of nascent precursor messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA was transcribed by RNA type II polymerases, coexpressed with a primary gene transcript, and excised out of its encoding gene transcript by intracellular RNA splicing and processing mechanisms. Currently, some ribonuclease III endonucleases have been found to be involved in the processing of spliced introns and probably facilitating the intronic miRNA maturation. Using this miRNA-expressing system, we have shown for the first time that the intron-derived miRNAs were able to induce strong RNAi effects in not only human and mouse cells but also zebrafish, chicken embryos, and adult mice. Based on the strand complementarity between the designed miRNA and its target gene sequence, we have also developed a miRNA isolation protocol to purify and identify the mature miRNAs generated by the intronic miRNA-expressing system. Several intronic miRNA identities and structures are currently confirmed to be active in vitro and in vivo. According to this proof- of-principle method, we now have the knowledge to design pre-miRNA inserts that are more efficient and effective for the intronic miRNA-expressing system.
Isolation and identification of gene-specific microRNAs.
Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao
2013-01-01
Computer programming has identified hundreds of genomic hairpin sequences, many with functions remain to be determined. Because direct transfection of hairpin-like miRNA precursors (pre)-miRNAs in mammalian cells is not always sufficient to trigger effective RNA-induced gene silencing complex (RISC) assembly, a key step for RNA interference (RNAi)-related gene silencing, we developed an intronic miRNA-expressing system to overcome this problem by inserting a hairpin-like pre-miRNA structure into the intron region of a gene and successfully increased the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. This intronic miRNA biogenesis has been found to depend on a coupled interaction of nascent precursor messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA was transcribed by RNA type II polymerases, coexpressed with a primary gene transcript, and excised out of its encoding gene transcript by intracellular RNA splicing and processing mechanisms. Currently, some ribonuclease III endonucleases have been found to be involved in the processing of spliced introns and probably facilitating the intronic miRNA maturation. Using this miRNA generation system, we have shown for the first time that the intron-derived miRNAs were able to induce strong RNAi effects in not only human and mouse cells but also zebrafishes, chicken embryos, and adult mice. We have also developed an miRNA isolation protocol, based on the complementarity between the designed miRNA and its target gene sequence, to purify and identify the mature miRNAs generated by the intronic miRNA-expressing system. Several intronic miRNA identities and structures are currently confirmed to be active in vitro and in vivo. According to this proven-of-principle method, we now have full knowledge to design pre-miRNA inserts that are more efficient and effective for the intronic miRNA-expressing systems.
Sultan, Laure D; Mileshina, Daria; Grewe, Felix; Rolle, Katarzyna; Abudraham, Sivan; Głodowicz, Paweł; Niazi, Adnan Khan; Keren, Ido; Shevtsov, Sofia; Klipcan, Liron; Barciszewski, Jan; Mower, Jeffrey P; Dietrich, André; Ostersetzer-Biran, Oren
2016-11-01
Group II introns are large catalytic RNAs that are ancestrally related to nuclear spliceosomal introns. Sequences corresponding to group II RNAs are found in many prokaryotes and are particularly prevalent within plants organellar genomes. Proteins encoded within the introns themselves (maturases) facilitate the splicing of their own host pre-RNAs. Mitochondrial introns in plants have diverged considerably in sequence and have lost their maturases. In angiosperms, only a single maturase has been retained in the mitochondrial DNA: the matR gene found within NADH dehydrogenase 1 (nad1) intron 4. Its conservation across land plants and RNA editing events, which restore conserved amino acids, indicates that matR encodes a functional protein. However, the biological role of MatR remains unclear. Here, we performed an in vivo investigation of the roles of MatR in Brassicaceae. Directed knockdown of matR expression via synthetically designed ribozymes altered the processing of various introns, including nad1 i4. Pull-down experiments further indicated that MatR is associated with nad1 i4 and several other intron-containing pre-mRNAs. MatR may thus represent an intermediate link in the gradual evolutionary transition from the intron-specific maturases in bacteria into their versatile spliceosomal descendants in the nucleus. The similarity between maturases and the core spliceosomal Prp8 protein further supports this intriguing theory. © 2016 American Society of Plant Biologists. All rights reserved.
Nishimura, Yuki; Kamikawa, Ryoma; Hashimoto, Tetsuo; Inagaki, Yuji
2014-01-01
Mitochondrial (mt) genome sequences, which often bear introns, have been sampled from phylogenetically diverse eukaryotes. Thus, we can anticipate novel insights into intron evolution from previously unstudied mt genomes. We here investigated the origins and evolution of three introns in the mt genome of the haptophyte Chrysochromulina sp. NIES-1333, which was sequenced completely in this study. All the three introns were characterized as group II, on the basis of predicted secondary structure, and the conserved sequence motifs at the 5′ and 3′ termini. Our comparative studies on diverse mt genomes prompt us to propose that the Chrysochromulina mt genome laterally acquired the introns from mt genomes in distantly related eukaryotes. Many group II introns harbor intronic open reading frames for the proteins (intron-encoded proteins or IEPs), which likely facilitate the splicing of their host introns. However, we propose that a “free-standing,” IEP-like protein, which is not encoded within any introns in the Chrysochromulina mt genome, is involved in the splicing of the first cox1 intron that lacks any open reading frames. PMID:25054084
Ancient nature of alternative splicing and functions of introns
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, Kemin; Salamov, Asaf; Kuo, Alan
Using four genomes: Chamydomonas reinhardtii, Agaricus bisporus, Aspergillus carbonarius, and Sporotricum thermophile with EST coverage of 2.9x, 8.9x, 29.5x, and 46.3x respectively, we identified 11 alternative splicing (AS) types that were dominated by intron retention (RI; biased toward short introns) and found 15, 35, 52, and 63percent AS of multiexon genes respectively. Genes with AS were more ancient, and number of AS correlated with number of exons, expression level, and maximum intron length of the gene. Introns with tendency to be retained had either stop codons or length of 3n+1 or 3n+2 presumably triggering nonsense-mediated mRNA decay (NMD), but intronsmore » retained in major isoforms (0.2-6percent of all introns) were biased toward 3n length and stop codon free. Stopless introns were biased toward phase 0, but 3n introns favored phase 1 that introduced more flexible and hydrophilic amino acids on both ends of introns which would be less disruptive to protein structure. We proposed a model in which minor RI intron could evolve into major RI that could facilitate intron loss through exonization.« less
Forks in the tracks: Group II introns, spliceosomes, telomeres and beyond.
Agrawal, Rajendra Kumar; Wang, Hong-Wei; Belfort, Marlene
2016-12-01
Group II introns are large catalytic RNAs that form a ribonucleoprotein (RNP) complex by binding to an intron-encoded protein (IEP). The IEP, which facilitates both RNA splicing and intron mobility, has multiple activities including reverse transcriptase. Recent structures of a group II intron RNP complex and of IEPs from diverse bacteria fuel arguments that group II introns are ancestrally related to eukaryotic spliceosomes as well as to telomerase and viruses. Furthermore, recent structural studies of various functional states of the spliceosome allow us to draw parallels between the group II intron RNP and the spliceosome. Here we present an overview of these studies, with an emphasis on the structure of the IEPs in their isolated and RNA-bound states and on their evolutionary relatedness. In addition, we address the conundrum of the free, albeit truncated IEPs forming dimers, whereas the IEP bound to the intron ribozyme is a monomer in the mature RNP. Future studies needed to resolve some of the outstanding issues related to group II intron RNP function and dynamics are also discussed.
Analysis of nonuniformity in intron phase distribution.
Fedorov, A; Suboch, G; Bujakov, M; Fedorova, L
1992-01-01
The distribution of different intron groups with respect to phases has been analyzed. It has been established that group II introns and nuclear introns have a minimum frequency of phase 2 introns. Since the phase of introns is an extremely conservative measure the observed minimum reflects evolutionary processes. A sample of all known, group I introns was too small to provide a valid characteristic of their phase distribution. The findings observed for the unequal distribution of phases cannot be explained solely on the basis of the mobile properties of introns. One of the most likely explanations for this nonuniformity in the intron phase distribution is the process of exon shuffling. It is proposed that group II introns originated at the early stages of evolution and were involved in the process of exon shuffling. PMID:1598214
Short intronic repeat sequences facilitate circular RNA production
Liang, Dongming
2014-01-01
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
Philippi, Susanne; Lorain, Stéphanie; Beley, Cyriaque; Peccate, Cécile; Précigout, Guillaume; Spuler, Simone; Garcia, Luis
2015-07-15
The modification of the pre-mRNA cis-splicing process employing a pre-mRNA trans-splicing molecule (PTM) is an attractive strategy for the in situ correction of genes whose careful transcription regulation and full-length expression is determinative for protein function, as it is the case for the dysferlin (DYSF, Dysf) gene. Loss-of-function mutations of DYSF result in different types of muscular dystrophy mainly manifesting as limb girdle muscular dystrophy 2B (LGMD2B) and Miyoshi muscular dystrophy 1 (MMD1). We established a 3' replacement strategy for mutated DYSF pre-mRNAs induced by spliceosome-mediated pre-mRNA trans-splicing (SmaRT) by the use of a PTM. In contrast to previously established SmaRT strategies, we particularly focused on the identification of a suitable pre-mRNA target intron other than the optimization of the PTM design. By targeting DYSF pre-mRNA introns harbouring differentially defined 3' splice sites (3' SS), we found that target introns encoding weakly defined 3' SSs were trans-spliced successfully in vitro in human LGMD2B myoblasts as well as in vivo in skeletal muscle of wild-type and Dysf(-/-) mice. For the first time, we demonstrate rescue of Dysf protein by SmaRT in vivo. Moreover, we identified concordant qualities among the successfully targeted Dysf introns and targeted endogenous introns in previously reported SmaRT approaches that might facilitate a selective choice of target introns in future SmaRT strategies. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco
2014-01-01
Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes. PMID:25482895
Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco
2014-01-01
Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes.
Putative cross-kingdom horizontal gene transfer in sponge (Porifera) mitochondria.
Rot, Chagai; Goldfarb, Itay; Ilan, Micha; Huchon, Dorothée
2006-09-14
The mitochondrial genome of Metazoa is usually a compact molecule without introns. Exceptions to this rule have been reported only in corals and sea anemones (Cnidaria), in which group I introns have been discovered in the cox1 and nad5 genes. Here we show several lines of evidence demonstrating that introns can also be found in the mitochondria of sponges (Porifera). A 2,349 bp fragment of the mitochondrial cox1 gene was sequenced from the sponge Tetilla sp. (Spirophorida). This fragment suggests the presence of a 1143 bp intron. Similar to all the cnidarian mitochondrial introns, the putative intron has group I intron characteristics. The intron is present in the cox1 gene and encodes a putative homing endonuclease. In order to establish the distribution of this intron in sponges, the cox1 gene was sequenced from several representatives of the demosponge diversity. The intron was found only in the sponge order Spirophorida. A phylogenetic analysis of the COI protein sequence and of the intron open reading frame suggests that the intron may have been transmitted horizontally from a fungus donor. Little is known about sponge-associated fungi, although in the last few years the latter have been frequently isolated from sponges. We suggest that the horizontal gene transfer of a mitochondrial intron was facilitated by a symbiotic relationship between fungus and sponge. Ecological relationships are known to have implications at the genomic level. Here, an ecological relationship between sponge and fungus is suggested based on the genomic analysis.
Short intronic repeat sequences facilitate circular RNA production.
Liang, Dongming; Wilusz, Jeremy E
2014-10-15
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼ 30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.
Putative cross-kingdom horizontal gene transfer in sponge (Porifera) mitochondria
Rot, Chagai; Goldfarb, Itay; Ilan, Micha; Huchon, Dorothée
2006-01-01
Background The mitochondrial genome of Metazoa is usually a compact molecule without introns. Exceptions to this rule have been reported only in corals and sea anemones (Cnidaria), in which group I introns have been discovered in the cox1 and nad5 genes. Here we show several lines of evidence demonstrating that introns can also be found in the mitochondria of sponges (Porifera). Results A 2,349 bp fragment of the mitochondrial cox1 gene was sequenced from the sponge Tetilla sp. (Spirophorida). This fragment suggests the presence of a 1143 bp intron. Similar to all the cnidarian mitochondrial introns, the putative intron has group I intron characteristics. The intron is present in the cox1 gene and encodes a putative homing endonuclease. In order to establish the distribution of this intron in sponges, the cox1 gene was sequenced from several representatives of the demosponge diversity. The intron was found only in the sponge order Spirophorida. A phylogenetic analysis of the COI protein sequence and of the intron open reading frame suggests that the intron may have been transmitted horizontally from a fungus donor. Conclusion Little is known about sponge-associated fungi, although in the last few years the latter have been frequently isolated from sponges. We suggest that the horizontal gene transfer of a mitochondrial intron was facilitated by a symbiotic relationship between fungus and sponge. Ecological relationships are known to have implications at the genomic level. Here, an ecological relationship between sponge and fungus is suggested based on the genomic analysis. PMID:16972986
Processing of Archaebacterial Intron-Containing tRNA Gene Transcripts.
1987-07-31
1{ 1. Project Goals: A. To determine the mechanism of tRNA intron processing in the halophilic archaebacteria. B. Characterize and compare the...enzyme(s) responsible for the removal of 5’-flanking sequences from halophilic and sulfur-dependent tRNA gene transcripts. C. Examine the structure and...distribution of tRNA introns in the halophilic archaebacteria. 2. Accomplishments: A. Intron processing mechanism We have succeeded in our primary
Origin and evolution of spliceosomal introns
2012-01-01
Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section. PMID:22507701
Kramer, Marianne C.; Liang, Dongming; Tatomer, Deirdre C.; Gold, Beth; March, Zachary M.; Cherry, Sara; Wilusz, Jeremy E.
2015-01-01
Thousands of eukaryotic protein-coding genes are noncanonically spliced to produce circular RNAs. Bioinformatics has indicated that long introns generally flank exons that circularize in Drosophila, but the underlying mechanisms by which these circular RNAs are generated are largely unknown. Here, using extensive mutagenesis of expression plasmids and RNAi screening, we reveal that circularization of the Drosophila laccase2 gene is regulated by both intronic repeats and trans-acting splicing factors. Analogous to what has been observed in humans and mice, base-pairing between highly complementary transposable elements facilitates backsplicing. Long flanking repeats (∼400 nucleotides [nt]) promote circularization cotranscriptionally, whereas pre-mRNAs containing minimal repeats (<40 nt) generate circular RNAs predominately after 3′ end processing. Unlike the previously characterized Muscleblind (Mbl) circular RNA, which requires the Mbl protein for its biogenesis, we found that Laccase2 circular RNA levels are not controlled by Mbl or the Laccase2 gene product but rather by multiple hnRNP (heterogeneous nuclear ribonucleoprotein) and SR (serine–arginine) proteins acting in a combinatorial manner. hnRNP and SR proteins also regulate the expression of other Drosophila circular RNAs, including Plexin A (PlexA), suggesting a common strategy for regulating backsplicing. Furthermore, the laccase2 flanking introns support efficient circularization of diverse exons in Drosophila and human cells, providing a new tool for exploring the functional consequences of circular RNA expression across eukaryotes. PMID:26450910
RNA structure in splicing: An evolutionary perspective.
Lin, Chien-Ling; Taggart, Allison J; Fairbrother, William G
2016-09-01
Pre-mRNA splicing is a key post-transcriptional regulation process in which introns are excised and exons are ligated together. A novel class of structured intron was recently discovered in fish. Simple expansions of complementary AC and GT dimers at opposite boundaries of an intron were found to form a bridging structure, thereby enforcing correct splice site pairing across the intron. In some fish introns, the RNA structures are strong enough to bypass the need of regulatory protein factors for splicing. Here, we discuss the prevalence and potential functions of highly structured introns. In humans, structured introns usually arise through the co-occurrence of C and G-rich repeats at intron boundaries. We explore the potentially instructive example of the HLA receptor genes. In HLA pre-mRNA, structured introns flank the exons that encode the highly polymorphic β sheet cleft, making the processing of the transcript robust to variants that disrupt splicing factor binding. While selective forces that have shaped HLA receptor are fairly atypical, numerous other highly polymorphic genes that encode receptors contain structured introns. Finally, we discuss how the elevated mutation rate associated with the simple repeats that often compose structured intron can make structured introns themselves rapidly evolving elements.
Parallel Loss of Plastid Introns and Their Maturase in the Genus Cuscuta
McNeal, Joel R.; Kuehl, Jennifer V.; Boore, Jeffrey L.; Leebens-Mack, Jim; dePamphilis, Claude W.
2009-01-01
Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta. PMID:19543388
Parallel loss of plastid introns and their maturase in the genus Cuscuta.
McNeal, Joel R; Kuehl, Jennifer V; Boore, Jeffrey L; Leebens-Mack, Jim; dePamphilis, Claude W
2009-06-19
Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta.
Koonin, Eugene V
2006-01-01
Background Ever since the discovery of 'genes in pieces' and mRNA splicing in eukaryotes, origin and evolution of spliceosomal introns have been considered within the conceptual framework of the 'introns early' versus 'introns late' debate. The 'introns early' hypothesis, which is closely linked to the so-called exon theory of gene evolution, posits that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. Under this scenario, the absence of spliceosomal introns in prokaryotes is considered to be a result of "genome streamlining". The 'introns late' hypothesis counters that spliceosomal introns emerged only in eukaryotes, and moreover, have been inserted into protein-coding genes continuously throughout the evolution of eukaryotes. Beyond the formal dilemma, the more substantial side of this debate has to do with possible roles of introns in the evolution of eukaryotes. Results I argue that several lines of evidence now suggest a coherent solution to the introns-early versus introns-late debate, and the emerging picture of intron evolution integrates aspects of both views although, formally, there seems to be no support for the original version of introns-early. Firstly, there is growing evidence that spliceosomal introns evolved from group II self-splicing introns which are present, usually, in small numbers, in many bacteria, and probably, moved into the evolving eukaryotic genome from the α-proteobacterial progenitor of the mitochondria. Secondly, the concept of a primordial pool of 'virus-like' genetic elements implies that self-splicing introns are among the most ancient genetic entities. Thirdly, reconstructions of the ancestral state of eukaryotic genes suggest that the last common ancestor of extant eukaryotes had an intron-rich genome. Thus, it appears that ancestors of spliceosomal introns, indeed, have existed since the earliest stages of life's evolution, in a formal agreement with the introns-early scenario. However, there is no evidence that these ancient introns ever became widespread before the emergence of eukaryotes, hence, the central tenet of introns-early, the role of introns in early evolution of proteins, has no support. However, the demonstration that numerous introns invaded eukaryotic genes at the outset of eukaryotic evolution and that subsequent intron gain has been limited in many eukaryotic lineages implicates introns as an ancestral feature of eukaryotic genomes and refutes radical versions of introns-late. Perhaps, most importantly, I argue that the intron invasion triggered other pivotal events of eukaryogenesis, including the emergence of the spliceosome, the nucleus, the linear chromosomes, the telomerase, and the ubiquitin signaling system. This concept of eukaryogenesis, in a sense, revives some tenets of the exon hypothesis, by assigning to introns crucial roles in eukaryotic evolutionary innovation. Conclusion The scenario of the origin and evolution of introns that is best compatible with the results of comparative genomics and theoretical considerations goes as follows: self-splicing introns since the earliest stages of life's evolution – numerous spliceosomal introns invading genes of the emerging eukaryote during eukaryogenesis – subsequent lineage-specific loss and gain of introns. The intron invasion, probably, spawned by the mitochondrial endosymbiont, might have critically contributed to the emergence of the principal features of the eukaryotic cell. This scenario combines aspects of the introns-early and introns-late views. Reviewers this article was reviewed by W. Ford Doolittle, James Darnell (nominated by W. Ford Doolittle), William Martin, and Anthony Poole. PMID:16907971
Kramer, Marianne C; Liang, Dongming; Tatomer, Deirdre C; Gold, Beth; March, Zachary M; Cherry, Sara; Wilusz, Jeremy E
2015-10-15
Thousands of eukaryotic protein-coding genes are noncanonically spliced to produce circular RNAs. Bioinformatics has indicated that long introns generally flank exons that circularize in Drosophila, but the underlying mechanisms by which these circular RNAs are generated are largely unknown. Here, using extensive mutagenesis of expression plasmids and RNAi screening, we reveal that circularization of the Drosophila laccase2 gene is regulated by both intronic repeats and trans-acting splicing factors. Analogous to what has been observed in humans and mice, base-pairing between highly complementary transposable elements facilitates backsplicing. Long flanking repeats (∼ 400 nucleotides [nt]) promote circularization cotranscriptionally, whereas pre-mRNAs containing minimal repeats (<40 nt) generate circular RNAs predominately after 3' end processing. Unlike the previously characterized Muscleblind (Mbl) circular RNA, which requires the Mbl protein for its biogenesis, we found that Laccase2 circular RNA levels are not controlled by Mbl or the Laccase2 gene product but rather by multiple hnRNP (heterogeneous nuclear ribonucleoprotein) and SR (serine-arginine) proteins acting in a combinatorial manner. hnRNP and SR proteins also regulate the expression of other Drosophila circular RNAs, including Plexin A (PlexA), suggesting a common strategy for regulating backsplicing. Furthermore, the laccase2 flanking introns support efficient circularization of diverse exons in Drosophila and human cells, providing a new tool for exploring the functional consequences of circular RNA expression across eukaryotes. © 2015 Kramer et al.; Published by Cold Spring Harbor Laboratory Press.
Localized Retroprocessing as a Model of Intron Loss in the Plant Mitochondrial Genome
Cuenca, Argelia; Ross, T. Gregory; Graham, Sean W.; Barrett, Craig F.; Davis, Jerrold I.; Seberg, Ole; Petersen, Gitte
2016-01-01
Loss of introns in plant mitochondrial genes is commonly explained by retroprocessing. Under this model, an mRNA is reverse transcribed and integrated back into the genome, simultaneously affecting the contents of introns and edited sites. To evaluate the extent to which retroprocessing explains intron loss, we analyzed patterns of intron content and predicted RNA editing for whole mitochondrial genomes of 30 species in the monocot order Alismatales. In this group, we found an unusually high degree of variation in the intron content, even expanding the hitherto known variation among angiosperms. Some species have lost some two-third of the cis-spliced introns. We found a strong correlation between intron content and editing frequency, and detected 27 events in which intron loss is consistent with the presence of nucleotides in an edited state, supporting retroprocessing. However, we also detected seven cases of intron loss not readily being explained by retroprocession. Our analyses are also not consistent with the entire length of a fully processed cDNA copy being integrated into the genome, but instead indicate that retroprocessing usually occurs for only part of the gene. In some cases, several rounds of retroprocessing may explain intron loss in genes completely devoid of introns. A number of taxa retroprocessing seem to be very common and a possibly ongoing process. It affects the entire mitochondrial genome. PMID:27435795
Intermediate introns in nuclear genes of euglenids - are they a distinct type?
Milanowski, Rafał; Gumińska, Natalia; Karnkowska, Anna; Ishikawa, Takao; Zakryś, Bożena
2016-02-29
Nuclear genes of euglenids contain two major types of introns: conventional spliceosomal and nonconventional introns. The latter are characterized by variable non-canonical borders, RNA secondary structure that brings intron ends together, and an unknown mechanism of removal. Some researchers also distinguish intermediate introns, which combine features of both types. They form a stable RNA secondary structure and are classified into two subtypes depending on whether they contain one (intermediate/nonconventional subtype) or both (conventional/intermediate subtype) canonical spliceosomal borders. However, it has been also postulated that most introns classified as intermediate could simply be special cases of conventional or nonconventional introns. Sequences of tubB, hsp90 and gapC genes from six strains of Euglena agilis were obtained. They contain four, six, and two or three introns, respectively (the third intron in the gapC gene is unique for just one strain). Conventional introns were present at three positions: two in the tubB gene (at one position conventional/intermediate introns were also found) and one in the gapC gene. Nonconventional introns are present at ten positions: two in the tubB gene (at one position intermediate/nonconventional introns were also found), six in hsp90 (at four positions intermediate/nonconventional introns were also found), and two in the gapC gene. Sequence and RNA secondary structure analyses of nonconventional introns confirmed that their most strongly conserved elements are base pairing nucleotides at positions +4, +5 and +6/ -8, -7 and -6 (in most introns CAG/CTG nucleotides were observed). It was also confirmed that the presence of the 5' GT/C end in intermediate/nonconventional introns is not the result of kinship with conventional introns, but is due to evolutionary pressure to preserve the purine at the 5' end. However, an example of a nonconventional intron with GC-AG ends was shown, suggesting the possibility of intron type conversion between nonconventional and conventional. Furthermore, an analysis of conventional introns revealed that the ability to form a stable RNA secondary structure by some introns is probably not a result of their relationship with nonconventional introns. It was also shown that acquisition of new nonconventional introns is an ongoing process and can be observed at the level of a single species. In the recently acquired intron in the gapC gene an extended direct repeats at the intron-exon junctions are present, suggesting that double-strand break repair process could be the source of new nonconventional introns.
Functional understanding of the diverse exon-intron structures of human GPCR genes.
Hammond, Dorothy A; Olman, Victor; Xu, Ying
2014-02-01
The GPCR genes have a variety of exon-intron structures even though their proteins are all structurally homologous. We have examined all human GPCR genes with at least two functional protein isoforms, totaling 199, aiming to gain an understanding of what may have contributed to the large diversity of the exon-intron structures of the GPCR genes. The 199 genes have a total of 808 known protein splicing isoforms with experimentally verified functions. Our analysis reveals that 1301 (80.6%) adjacent exon-exon pairs out of the total of 1,613 in the 199 genes have either exactly one exon skipped or the intron in-between retained in at least one of the 808 protein splicing isoforms. This observation has a statistical significance p-value of 2.051762 * e(-09), assuming that the observed splicing isoforms are independent of the exon-intron structures. Our interpretation of this observation is that the exon boundaries of the GPCR genes are not randomly determined; instead they may be selected to facilitate specific alternative splicing for functional purposes.
Mohr, Sabine; Ghanem, Eman; Smith, Whitney; Sheeter, Dennis; Qin, Yidan; King, Olga; Polioudakis, Damon; Iyer, Vishwanath R; Hunicke-Smith, Scott; Swamy, Sajani; Kuersten, Scott; Lambowitz, Alan M
2013-07-01
Mobile group II introns encode reverse transcriptases (RTs) that function in intron mobility ("retrohoming") by a process that requires reverse transcription of a highly structured, 2-2.5-kb intron RNA with high processivity and fidelity. Although the latter properties are potentially useful for applications in cDNA synthesis and next-generation RNA sequencing (RNA-seq), group II intron RTs have been difficult to purify free of the intron RNA, and their utility as research tools has not been investigated systematically. Here, we developed general methods for the high-level expression and purification of group II intron-encoded RTs as fusion proteins with a rigidly linked, noncleavable solubility tag, and we applied them to group II intron RTs from bacterial thermophiles. We thus obtained thermostable group II intron RT fusion proteins that have higher processivity, fidelity, and thermostability than retroviral RTs, synthesize cDNAs at temperatures up to 81°C, and have significant advantages for qRT-PCR, capillary electrophoresis for RNA-structure mapping, and next-generation RNA sequencing. Further, we find that group II intron RTs differ from the retroviral enzymes in template switching with minimal base-pairing to the 3' ends of new RNA templates, making it possible to efficiently and seamlessly link adaptors containing PCR-primer binding sites to cDNA ends without an RNA ligase step. This novel template-switching activity enables facile and less biased cloning of nonpolyadenylated RNAs, such as miRNAs or protein-bound RNA fragments. Our findings demonstrate novel biochemical activities and inherent advantages of group II intron RTs for research, biotechnological, and diagnostic methods, with potentially wide applications.
Beta-globin LCR and intron elements cooperate and direct spatial reorganization for gene therapy.
Buzina, Alla; Lo, Mandy Y M; Moffett, Angela; Hotta, Akitsu; Fussner, Eden; Bharadwaj, Rikki R; Pasceri, Peter; Garcia-Martinez, J Victor; Bazett-Jones, David P; Ellis, James
2008-04-11
The Locus Control Region (LCR) requires intronic elements within beta-globin transgenes to direct high level expression at all ectopic integration sites. However, these essential intronic elements cannot be transmitted through retrovirus vectors and their deletion may compromise the therapeutic potential for gene therapy. Here, we systematically regenerate functional beta-globin intron 2 elements that rescue LCR activity directed by 5'HS3. Evaluation in transgenic mice demonstrates that an Oct-1 binding site and an enhancer in the intron cooperate to increase expression levels from LCR globin transgenes. Replacement of the intronic AT-rich region with the Igmu 3'MAR rescues LCR activity in single copy transgenic mice. Importantly, a combination of the Oct-1 site, Igmu 3'MAR and intronic enhancer in the BGT158 cassette directs more consistent levels of expression in transgenic mice. By introducing intron-modified transgenes into the same genomic integration site in erythroid cells, we show that BGT158 has the greatest transcriptional induction. 3D DNA FISH establishes that induction stimulates this small 5'HS3 containing transgene and the endogenous locus to spatially reorganize towards more central locations in erythroid nuclei. Electron Spectroscopic Imaging (ESI) of chromatin fibers demonstrates that ultrastructural heterochromatin is primarily perinuclear and does not reorganize. Finally, we transmit intron-modified globin transgenes through insulated self-inactivating (SIN) lentivirus vectors into erythroid cells. We show efficient transfer and robust mRNA and protein expression by the BGT158 vector, and virus titer improvements mediated by the modified intron 2 in the presence of an LCR cassette composed of 5'HS2-4. Our results have important implications for the mechanism of LCR activity at ectopic integration sites. The modified transgenes are the first to transfer intronic elements that potentiate LCR activity and are designed to facilitate correction of hemoglobinopathies using single copy vectors.
Introns: The Functional Benefits of Introns in Genomes.
Jo, Bong-Seok; Choi, Sun Shim
2015-12-01
The intron has been a big biological mystery since it was first discovered in several aspects. First, all of the completely sequenced eukaryotes harbor introns in the genomic structure, whereas no prokaryotes identified so far carry introns. Second, the amount of total introns varies in different species. Third, the length and number of introns vary in different genes, even within the same species genome. Fourth, all introns are copied into RNAs by transcription and DNAs by replication processes, but intron sequences do not participate in protein-coding sequences. The existence of introns in the genome should be a burden to some cells, because cells have to consume a great deal of energy to copy and excise them exactly at the correct positions with the help of complicated spliceosomal machineries. The existence throughout the long evolutionary history is explained, only if selective advantages of carrying introns are assumed to be given to cells to overcome the negative effect of introns. In that regard, we summarize previous research about the functional roles or benefits of introns. Additionally, several other studies strongly suggesting that introns should not be junk will be introduced.
zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs.
Parekh, Swati; Ziegenhain, Christoph; Vieth, Beate; Enard, Wolfgang; Hellmann, Ines
2018-06-01
Single-cell RNA-sequencing (scRNA-seq) experiments typically analyze hundreds or thousands of cells after amplification of the cDNA. The high throughput is made possible by the early introduction of sample-specific bar codes (BCs), and the amplification bias is alleviated by unique molecular identifiers (UMIs). Thus, the ideal analysis pipeline for scRNA-seq data needs to efficiently tabulate reads according to both BC and UMI. zUMIs is a pipeline that can handle both known and random BCs and also efficiently collapse UMIs, either just for exon mapping reads or for both exon and intron mapping reads. If BC annotation is missing, zUMIs can accurately detect intact cells from the distribution of sequencing reads. Another unique feature of zUMIs is the adaptive downsampling function that facilitates dealing with hugely varying library sizes but also allows the user to evaluate whether the library has been sequenced to saturation. To illustrate the utility of zUMIs, we analyzed a single-nucleus RNA-seq dataset and show that more than 35% of all reads map to introns. Also, we show that these intronic reads are informative about expression levels, significantly increasing the number of detected genes and improving the cluster resolution. zUMIs flexibility makes if possible to accommodate data generated with any of the major scRNA-seq protocols that use BCs and UMIs and is the most feature-rich, fast, and user-friendly pipeline to process such scRNA-seq data.
Identification of human short introns
Abebrese, Emmanuel L.; Arnold, Zachary R.; Armstrong, Katharine; Burns, Lindsay; Day, R. Thomas; Hsu, Daniel G.; Jarrell, Katherine; Luo, Yi; Mugayo, Daphine
2017-01-01
Canonical pre-mRNA splicing requires snRNPs and associated splicing factors to excise conserved intronic sequences, with a minimum intron length required for efficient splicing. Non-canonical splicing–intron excision without the spliceosome–has been documented; most notably, some tRNAs and the XBP1 mRNA contain short introns that are not removed by the spliceosome. There have been some efforts to identify additional short introns, but little is known about how many short introns are processed from mRNAs. Here, we report an approach to identify RNA short introns from RNA-Seq data, discriminating against small genomic deletions. We identify hundreds of short introns conserved among multiple human cell lines. These short introns are often alternatively spliced and are found in a variety of RNAs–both mRNAs and lncRNAs. Short intron splicing efficiency is increased by secondary structure, and we detect both canonical and non-canonical short introns. In many cases, splicing of these short introns from mRNAs is predicted to alter the reading frame and change protein output. Our findings imply that standard gene prediction models which often assume a lower limit for intron size fail to predict short introns effectively. We conclude that short introns are abundant in the human transcriptome, and short intron splicing represents an added layer to mRNA regulation. PMID:28520720
Characterization of the molecular basis of group II intron RNA recognition by CRS1-CRM domains.
Keren, Ido; Klipcan, Liron; Bezawork-Geleta, Ayenachew; Kolton, Max; Shaya, Felix; Ostersetzer-Biran, Oren
2008-08-22
CRM (chloroplast RNA splicing and ribosome maturation) is a recently recognized RNA-binding domain of ancient origin that has been retained in eukaryotic genomes only within the plant lineage. Whereas in bacteria CRM domains exist as single domain proteins involved in ribosome maturation, in plants they are found in a family of proteins that contain between one and four repeats. Several members of this family with multiple CRM domains have been shown to be required for the splicing of specific plastidic group II introns. Detailed biochemical analysis of one of these factors in maize, CRS1, demonstrated its high affinity and specific binding to the single group II intron whose splicing it facilitates, the plastid-encoded atpF intron RNA. Through its association with two intronic regions, CRS1 guides the folding of atpF intron RNA into its predicted "catalytically active" form. To understand how multiple CRM domains cooperate to achieve high affinity sequence-specific binding to RNA, we analyzed the RNA binding affinity and specificity associated with each individual CRM domain in CRS1; whereas CRM3 bound tightly to the RNA, CRM1 associated specifically with a unique region found within atpF intron domain I. CRM2, which demonstrated only low binding affinity, also seems to form specific interactions with regions localized to domains I, III, and IV. We further show that CRM domains share structural similarities and RNA binding characteristics with the well known RNA recognition motif domain.
Crystal structure of group II intron domain 1 reveals a template for RNA assembly
Zhao, Chen; Rajashankar, Kanagalaghatta R.; Marcia, Marco; ...
2015-10-26
Although the importance of large noncoding RNAs is increasingly appreciated, our understanding of their structures and architectural dynamics remains limited. In particular, we know little about RNA folding intermediates and how they facilitate the productive assembly of RNA tertiary structures. In this paper, we report the crystal structure of an obligate intermediate that is required during the earliest stages of group II intron folding. Composed of domain 1 from the Oceanobacillus iheyensis group II intron (266 nucleotides), this intermediate retains native-like features but adopts a compact conformation in which the active site cleft is closed. Transition between this closed andmore » the open (native) conformation is achieved through discrete rotations of hinge motifs in two regions of the molecule. Finally, the open state is then stabilized by sequential docking of downstream intron domains, suggesting a 'first come, first folded' strategy that may represent a generalizable pathway for assembly of large RNA and ribonucleoprotein structures.« less
Bao, Penghui; Wu, Qi-Jia; Yin, Ping; Jiang, Yanfei; Wang, Xu; Xie, Mao-Hua; Sun, Tao; Huang, Lin; Mo, Ding-Ding; Zhang, Yi
2008-01-01
Self-splicing of group I introns is accomplished by two sequential ester-transfer reactions mediated by sequential binding of two different guanosine ligands, but it is yet unclear how the binding is coordinated at a single G-binding site. Using a three-piece trans-splicing system derived from the Candida intron, we studied the effect of the prior GTP binding on the later ωG binding by assaying the ribozyme activity in the second reaction. We showed that adding GTP simultaneously with and prior to the esterified ωG in a substrate strongly accelerated the second reaction, suggesting that the early binding of GTP facilitates the subsequent binding of ωG. GTP-mediated facilitation requires C2 amino and C6 carbonyl groups on the Watson–Crick edge of the base but not the phosphate or sugar groups, suggesting that the base triple interactions between GTP and the binding site are important for the subsequent ωG binding. Strikingly, GTP binding loosens a few local structures of the ribozyme including that adjacent to the base triple, providing structural basis for a rapid exchange of ωG for bound GTP. PMID:18978026
Recurrent Loss of Specific Introns during Angiosperm Evolution
Wang, Hao; Devos, Katrien M.; Bennetzen, Jeffrey L.
2014-01-01
Numerous instances of presence/absence variations for introns have been documented in eukaryotes, and some cases of recurrent loss of the same intron have been suggested. However, there has been no comprehensive or phylogenetically deep analysis of recurrent intron loss. Of 883 cases of intron presence/absence variation that we detected in five sequenced grass genomes, 93 were confirmed as recurrent losses and the rest could be explained by single losses (652) or single gains (118). No case of recurrent intron gain was observed. Deep phylogenetic analysis often indicated that apparent intron gains were actually numerous independent losses of the same intron. Recurrent loss exhibited extreme non-randomness, in that some introns were removed independently in many lineages. The two larger genomes, maize and sorghum, were found to have a higher rate of both recurrent loss and overall loss and/or gain than foxtail millet, rice or Brachypodium. Adjacent introns and small introns were found to be preferentially lost. Intron loss genes exhibited a high frequency of germ line or early embryogenesis expression. In addition, flanking exon A+T-richness and intron TG/CG ratios were higher in retained introns. This last result suggests that epigenetic status, as evidenced by a loss of methylated CG dinucleotides, may play a role in the process of intron loss. This study provides the first comprehensive analysis of recurrent intron loss, makes a series of novel findings on the patterns of recurrent intron loss during the evolution of the grass family, and provides insight into the molecular mechanism(s) underlying intron loss. PMID:25474210
An efficient method to find potentially universal population genetic markers, applied to metazoans
2010-01-01
Background Despite the impressive growth of sequence databases, the limited availability of nuclear markers that are sufficiently polymorphic for population genetics and phylogeography and applicable across various phyla restricts many potential studies, particularly in non-model organisms. Numerous introns have invariant positions among kingdoms, providing a potential source for such markers. Unfortunately, most of the few known EPIC (Exon Primed Intron Crossing) loci are restricted to vertebrates or belong to multigenic families. Results In order to develop markers with broad applicability, we designed a bioinformatic approach aimed at avoiding multigenic families while identifying intron positions conserved across metazoan phyla. We developed a program facilitating the identification of EPIC loci which allowed slight variation in intron position. From the Homolens databases we selected 29 gene families which contained 52 promising introns for which we designed 93 primer pairs. PCR tests were performed on several ascidians, echinoderms, bivalves and cnidarians. On average, 24 different introns per genus were amplified in bilaterians. Remarkably, five of the introns successfully amplified in all of the metazoan genera tested (a dozen genera, including cnidarians). The influence of several factors on amplification success was investigated. Success rate was not related to the phylogenetic relatedness of a taxon to the groups that most influenced primer design, showing that these EPIC markers are extremely conserved in animals. Conclusions Our new method now makes it possible to (i) rapidly isolate a set of EPIC markers for any phylum, even outside the animal kingdom, and thus, (ii) compare genetic diversity at potentially homologous polymorphic loci between divergent taxa. PMID:20836842
Mitochondrial Group II Introns, Cytochrome c Oxidase, and Senescence in Podospora anserina†
Begel, Odile; Boulay, Jocelyne; Albert, Beatrice; Dufour, Eric; Sainsard-Chanet, Annie
1999-01-01
Podospora anserina is a filamentous fungus with a limited life span. It expresses a degenerative syndrome called senescence, which is always associated with the accumulation of circular molecules (senDNAs) containing specific regions of the mitochondrial chromosome. A mobile group II intron (α) has been thought to play a prominent role in this syndrome. Intron α is the first intron of the cytochrome c oxidase subunit I gene (COX1). Mitochondrial mutants that escape the senescence process are missing this intron, as well as the first exon of the COX1 gene. We describe here the first mutant of P. anserina that has the α sequence precisely deleted and whose cytochrome c oxidase activity is identical to that of wild-type cells. The integration site of the intron is slightly modified, and this change prevents efficient homing of intron α. We show here that this mutant displays a senescence syndrome similar to that of the wild type and that its life span is increased about twofold. The introduction of a related group II intron into the mitochondrial genome of the mutant does not restore the wild-type life span. These data clearly demonstrate that intron α is not the specific senescence factor but rather an accelerator or amplifier of the senescence process. They emphasize the role that intron α plays in the instability of the mitochondrial chromosome and the link between this instability and longevity. Our results strongly support the idea that in Podospora, “immortality” can be acquired not by the absence of intron α but rather by the lack of active cytochrome c oxidase. PMID:10330149
Three distinct modes of intron dynamics in the evolution of eukaryotes.
Carmel, Liran; Wolf, Yuri I; Rogozin, Igor B; Koonin, Eugene V
2007-07-01
Several contrasting scenarios have been proposed for the origin and evolution of spliceosomal introns, a hallmark of eukaryotic genes. A comprehensive probabilistic model to obtain a definitive reconstruction of intron evolution was developed and applied to 391 sets of conserved genes from 19 eukaryotic species. It is inferred that a relatively high intron density was reached early, i.e., the last common ancestor of eukaryotes contained >2.15 introns/kilobase, and the last common ancestor of multicellular life forms harbored approximately 3.4 introns/kilobase, a greater intron density than in most of the extant fungi and in some animals. The rates of intron gain and intron loss appear to have been dropping during the last approximately 1.3 billion years, with the decline in the gain rate being much steeper. Eukaryotic lineages exhibit three distinct modes of evolution of the intron-exon structure. The primary, balanced mode, apparently, operates in all lineages. In this mode, intron gain and loss are strongly and positively correlated, in contrast to previous reports on inverse correlation between these processes. The second mode involves an elevated rate of intron loss and is prevalent in several lineages, such as fungi and insects. The third mode, characterized by elevated rate of intron gain, is seen only in deep branches of the tree, indicating that bursts of intron invasion occurred at key points in eukaryotic evolution, such as the origin of animals. Intron dynamics could depend on multiple mechanisms, and in the balanced mode, gain and loss of introns might share common mechanistic features.
Transposition of an intron in yeast mitochondria requires a protein encoded by that intron.
Macreadie, I G; Scott, R M; Zinn, A R; Butow, R A
1985-06-01
The optional 1143 bp intron in the yeast mitochondrial 21S rRNA gene (omega +) is nearly quantitatively inserted in genetic crosses into 21S rRNA alleles that lack it (omega -). The intron contains an open reading frame that can encode a protein of 235 amino acids, but no function has been ascribed to this sequence. We previously found an in vivo double-strand break in omega - DNA at or close to the intron insertion site only in zygotes of omega + X omega - crosses that appears with the same kinetics as intron insertion. We now show that mutations in the intron open reading frame that would alter the translation product simultaneously inhibit nonreciprocal omega recombination and the in vivo double-strand break in omega - DNA. These results provide evidence that the open reading frame encodes a protein required for intron transposition and support the role of the double-strand break in the process.
Hypervariable and highly divergent intron-exon organizations in the chordate Oikopleura dioica.
Edvardsen, Rolf B; Lerat, Emmanuelle; Maeland, Anne Dorthea; Flåt, Mette; Tewari, Rita; Jensen, Marit F; Lehrach, Hans; Reinhardt, Richard; Seo, Hee-Chan; Chourrout, Daniel
2004-10-01
Oikopleura dioica is a pelagic tunicate with a very small genome and a very short life cycle. In order to investigate the intron-exon organizations in Oikopleura, we have isolated and characterized ribosomal protein EF-1alpha, Hox, and alpha-tubulin genes. Their intron positions have been compared with those of the same genes from various invertebrates and vertebrates, including four species with entirely sequenced genomes. Oikopleura genes, like Caenorhabditis genes, have introns at a large number of nonconserved positions, which must originate from late insertions or intron sliding of ancient insertions. Both species exhibit hypervariable intron-exon organization within their alpha-tubulin gene family. This is due to localization of most nonconserved intron positions in single members of this gene family. The hypervariability and divergence of intron positions in Oikopleura and Caenorhabditis may be related to the predominance of short introns, the processing of which is not very dependent upon the exonic environment compared to large introns. Also, both species have an undermethylated genome, and the control of methylation-induced point mutations imposes a control on exon size, at least in vertebrate genes. That introns placed at such variable positions in Oikopleura or C. elegans may serve a specific purpose is not easy to infer from our current knowledge and hypotheses on intron functions. We propose that new introns are retained in species with very short life cycles, because illegitimate exchanges including gene conversion are repressed. We also speculate that introns placed at gene-specific positions may contribute to suppressing these exchanges and thereby favor their own persistence.
Sainsard-Chanet, A; Begel, O; Belcour, L
1994-10-07
In the filamentous fungus Podospora anserina, the unavoidable phenomenon of senescence is associated with the amplification of the first intron of the mitochondrial cox1 that accumulates as circular DNA molecules consisting of tandem repeats. This group II intron (cox1-i1 or alpha) is able to transpose and contains an open reading frame with significant amino acid similarity with reverse transcriptases. The generation of these intronic circular DNA molecules, their amplification and their involvement in the senescence process are unresolved questions. We demonstrate here that: (1) another group II intron, the fourth intron of gene cox1, cox1-i4, is also able to give precise DNA end to end junctions; (2) this intronic sequence can be found amplified during senescence, although to a lesser extent than cox1-i1; (3) the amplification of the DNA multimeric cox1-i1 molecules likely does not proceed by autonomous replication; (4) the generation of the DNA intronic circles does not require efficient intron splicing; (5) a DNA double-strand break occurs in vivo at the 3' extremity of the cox1-e1 and cox1-e4 exons preceding the group II introns that form circular DNAs. On the whole, these results show that the ability to form DNA circular molecules is a property of some group II introns and they demonstrate the occurrence of a specific DNA cleavage at or near the integration site of these group II introns. The results strongly suggest that this cleavage is involved in the formation of the group II intronic DNA circles and could also be involved in the phenomenon of group II intron homing.
Pimentel, Harold; Parra, Marilyn; Gee, Sherry L.; ...
2015-11-03
Differentiating erythroblasts execute a dynamic alternative splicing program shown here to include extensive and diverse intron retention (IR) events. Cluster analysis revealed hundreds of developmentallydynamic introns that exhibit increased IR in mature erythroblasts, and are enriched in functions related to RNA processing such as SF3B1 spliceosomal factor. Distinct, developmentally-stable IR clusters are enriched in metal-ion binding functions and include mitoferrin genes SLC25A37 and SLC25A28 that are critical for iron homeostasis. Some IR transcripts are abundant, e.g. comprising ~50% of highly-expressed SLC25A37 and SF3B1 transcripts in late erythroblasts, and thereby limiting functional mRNA levels. IR transcripts tested were predominantly nuclearlocalized. Splicemore » site strength correlated with IR among stable but not dynamic intron clusters, indicating distinct regulation of dynamically-increased IR in late erythroblasts. Retained introns were preferentially associated with alternative exons with premature termination codons (PTCs). High IR was observed in disease-causing genes including SF3B1 and the RNA binding protein FUS. Comparative studies demonstrated that the intron retention program in erythroblasts shares features with other tissues but ultimately is unique to erythropoiesis. Finally, we conclude that IR is a multi-dimensional set of processes that post-transcriptionally regulate diverse gene groups during normal erythropoiesis, misregulation of which could be responsible for human disease.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pimentel, Harold; Parra, Marilyn; Gee, Sherry L.
Differentiating erythroblasts execute a dynamic alternative splicing program shown here to include extensive and diverse intron retention (IR) events. Cluster analysis revealed hundreds of developmentallydynamic introns that exhibit increased IR in mature erythroblasts, and are enriched in functions related to RNA processing such as SF3B1 spliceosomal factor. Distinct, developmentally-stable IR clusters are enriched in metal-ion binding functions and include mitoferrin genes SLC25A37 and SLC25A28 that are critical for iron homeostasis. Some IR transcripts are abundant, e.g. comprising ~50% of highly-expressed SLC25A37 and SF3B1 transcripts in late erythroblasts, and thereby limiting functional mRNA levels. IR transcripts tested were predominantly nuclearlocalized. Splicemore » site strength correlated with IR among stable but not dynamic intron clusters, indicating distinct regulation of dynamically-increased IR in late erythroblasts. Retained introns were preferentially associated with alternative exons with premature termination codons (PTCs). High IR was observed in disease-causing genes including SF3B1 and the RNA binding protein FUS. Comparative studies demonstrated that the intron retention program in erythroblasts shares features with other tissues but ultimately is unique to erythropoiesis. Finally, we conclude that IR is a multi-dimensional set of processes that post-transcriptionally regulate diverse gene groups during normal erythropoiesis, misregulation of which could be responsible for human disease.« less
Zmudjak, Michal; Colas des Francs-Small, Catherine; Keren, Ido; Shaya, Felix; Belausov, Eduard; Small, Ian; Ostersetzer-Biran, Oren
2013-07-01
The coding regions of many mitochondrial genes in plants are interrupted by intervening sequences that are classified as group II introns. Their splicing is essential for the expression of the genes they interrupt and hence for respiratory function, and is facilitated by various protein cofactors. Despite the importance of these cofactors, only a few of them have been characterized. CRS1-YhbY domain (CRM) is a recently recognized RNA-binding domain that is present in several characterized splicing factors in plant chloroplasts. The Arabidopsis genome encodes 16 CRM proteins, but these are largely uncharacterized. Here, we analyzed the intracellular location of one of these hypothetical proteins in Arabidopsis, mitochondrial CAF-like splicing factor 1 (mCSF1; At4 g31010), and analyzed the growth phenotypes and organellar activities associated with mcsf1 mutants in plants. Our data indicated that mCSF1 resides within mitochondria and its functions are essential during embryogenesis. Mutant plants with reduced mCSF1 displayed inhibited germination and retarded growth phenotypes that were tightly associated with reduced complex I and IV activities. Analogously to the functions of plastid-localized CRM proteins, analysis of the RNA profiles in wildtype and mcsf1 plants showed that mCSF1 acts in the splicing of many of the group II intron RNAs in Arabidopsis mitochondria. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.
Celis, Juan Sebastián; Edgell, David R; Stelbrink, Björn; Wibberg, Daniel; Hauffe, Torsten; Blom, Jochen; Kalinowski, Jörn; Wilke, Thomas
2017-01-01
Group I introns and homing endonuclease genes (HEGs) are mobile genetic elements, capable of invading target sequences in intron-less genomes. LAGLIDADG HEGs are the largest family of endonucleases, playing a key role in the mobility of group I introns in a process known as 'homing'. Group I introns and HEGs are rare in metazoans, and can be mainly found inserted in the COXI gene of some sponges and cnidarians, including stony corals (Scleractinia) and mushroom corals (Corallimorpharia). Vertical and horizontal intron transfer mechanisms have been proposed as explanations for intron occurrence in cnidarians. However, the central role of LAGLIDADG motifs in intron mobility mechanisms remains poorly understood. To resolve questions regarding the evolutionary origin and distribution of group I introns and HEGs in Scleractinia and Corallimorpharia, we examined intron/HEGs sequences within a comprehensive phylogenetic framework. Analyses of LAGLIDADG motif conservation showed a high degree of degradation in complex Scleractinia and Corallimorpharia. Moreover, the two motifs lack the respective acidic residues necessary for metal-ion binding and catalysis, potentially impairing horizontal intron mobility. In contrast, both motifs are highly conserved within robust Scleractinia, indicating a fully functional endonuclease capable of promoting horizontal intron transference. A higher rate of non-synonymous substitutions (Ka) detected in the HEGs of complex Scleractinia and Corallimorpharia suggests degradation of the HEG, whereas lower Ka rates in robust Scleractinia are consistent with a scenario of purifying selection. Molecular-clock analyses and ancestral inference of intron type indicated an earlier intron insertion in complex Scleractinia and Corallimorpharia in comparison to robust Scleractinia. These findings suggest that the lack of horizontal intron transfers in the former two groups is related to an age-dependent degradation of the endonuclease activity. Moreover, they also explain the peculiar geographical patterns of introns in stony and mushroom corals.
Pandey, Manmohan; Kumar, Ravindra; Srivastava, Prachi; Agarwal, Suyash; Srivastava, Shreya; Nagpure, Naresh S; Jena, Joy K; Kushwaha, Basdeo
2018-03-16
Mining and characterization of Simple Sequence Repeat (SSR) markers from whole genomes provide valuable information about biological significance of SSR distribution and also facilitate development of markers for genetic analysis. Whole genome sequencing (WGS)-SSR Annotation Tool (WGSSAT) is a graphical user interface pipeline developed using Java Netbeans and Perl scripts which facilitates in simplifying the process of SSR mining and characterization. WGSSAT takes input in FASTA format and automates the prediction of genes, noncoding RNA (ncRNA), core genes, repeats and SSRs from whole genomes followed by mapping of the predicted SSRs onto a genome (classified according to genes, ncRNA, repeats, exonic, intronic, and core gene region) along with primer identification and mining of cross-species markers. The program also generates a detailed statistical report along with visualization of mapped SSRs, genes, core genes, and RNAs. The features of WGSSAT were demonstrated using Takifugu rubripes data. This yielded a total of 139 057 SSR, out of which 113 703 SSR primer pairs were uniquely amplified in silico onto a T. rubripes (fugu) genome. Out of 113 703 mined SSRs, 81 463 were from coding region (including 4286 exonic and 77 177 intronic), 7 from RNA, 267 from core genes of fugu, whereas 105 641 SSR and 601 SSR primer pairs were uniquely mapped onto the medaka genome. WGSSAT is tested under Ubuntu Linux. The source code, documentation, user manual, example dataset and scripts are available online at https://sourceforge.net/projects/wgssat-nbfgr.
Cianciulli, Antonia; Calvello, Rosa; Panaro, Maria A
2015-04-01
In the homologous genes studied, the exons and introns alternated in the same order in mouse and human. We studied, in both species: corresponding short segments of introns, whole corresponding introns and complete homologous genes. We considered the total number of nucleotides and the number and orientation of the SINE inserts. Comparisons of mouse and human data series showed that at the level of individual relatively short segments of intronic sequences the stochastic variability prevails in the local structuring, but at higher levels of organization a deterministic component emerges, conserved in mouse and human during the divergent evolution, despite the ample re-editing of the intronic sequences and the fact that processes such as SINE spread had taken place in an independent way in the two species. Intron conservation is negatively correlated with the SINE occupancy, suggesting that virus inserts interfere with the conservation of the sequences inherited from the common ancestor. Copyright © 2015 Elsevier Ltd. All rights reserved.
Isolation and Identification of Gene-Specific MicroRNAs.
Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao
2018-01-01
Computer programming has identified hundreds of genomic hairpin sequences, many with functions yet to be determined. Because transfection of hairpin-like microRNA precursors (pre-miRNAs) into mammalian cells is not always sufficient to trigger RNA-induced gene silencing complex (RISC) assembly, a key step for inducing RNA interference (RNAi)-related gene silencing, we have developed an intronic miRNA expression system to overcome this problem by inserting a hairpin-like pre-miRNA structure into the intron region of a gene, and hence successfully increase the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. This intronic miRNA biogenesis mechanism has been found to depend on a coupled interaction of nascent messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA so obtained is transcribed by type-II RNA polymerases, coexpressed within a primary gene transcript, and then excised out of the gene transcript by intracellular RNA splicing and processing machineries. After that, ribonuclease III (RNaseIII) endonucleases further process the spliced introns into mature miRNAs. Using this intronic miRNA expression system, we have shown for the first time that the intron-derived miRNAs are able to elicit strong RNAi effects in not only human and mouse cells in vitro but also in zebrafishes, chicken embryos, and adult mice in vivo. We have also developed a miRNA isolation protocol, based on the complementarity between the designed miRNA and its targeted gene sequence, to purify and identify the mature miRNAs generated. As a result, several intronic miRNA identities and structures have been confirmed. According to this proof-of-principle methodology, we now have full knowledge to design various intronic pre-miRNA inserts that are more efficient and effective for inducing specific gene silencing effects in vitro and in vivo.
Sinha, Rahul; Goyal, Pankaj; Grapputo, Alessandro
2011-01-01
Background Insertions of spliceosomal introns are very rare events during evolution of vertebrates and the mechanisms governing creation of novel intron(s) remain obscure. Largely, gene structures of melanocortin (MC) receptors are characterized by intron-less architecture. However, recently a few exceptions have been reported in some fishes. This warrants a systematic survey of MC receptors for understanding intron insertion events during vertebrate evolution. Methodology/Principal Findings We have compiled an extended list of MC receptors from different vertebrate genomes with variations in fishes. Notably, the closely linked MC2Rs and MC5Rs from a group of ray-finned fishes have three and one intron insertion(s), respectively, with conserved positions and intron phase. In both genes, one novel insertion was in the highly conserved DRY motif at the end of helix TM3. Further, the proto-splice site MAG↑R is maintained at intron insertion sites in these two genes. However, the orthologs of these receptors from zebrafish and tetrapods are intron-less, suggesting these introns are simultaneously created in selected fishes. Surprisingly, these novel introns are traceable only in four fish genomes. We found that these fish genomes are severely compacted after the separation from zebrafish. Furthermore, we also report novel intron insertions in P2Y receptors and in CHRM3. Finally, we report ultrasmall introns in MC2R genes from selected fishes. Conclusions/Significance The current repository of MC receptors illustrates that fishes have no MC3R ortholog. MC2R, MC5R, P2Y receptors and CHRM3 have novel intron insertions only in ray-finned fishes that underwent genome compaction. These receptors share one intron at an identical position suggestive of being inserted contemporaneously. In addition to repetitive elements, genome compaction is now believed to be a new hallmark that promotes intron insertions, as it requires rapid DNA breakage and subsequent repair processes to gain back normal functionality. PMID:21850219
Nisa-Martínez, Rafael; Jiménez-Zurdo, José I.; Martínez-Abarca, Francisco; Muñoz-Adelantado, Estefanía; Toro, Nicolás
2007-01-01
RmInt1 is a self-splicing and mobile group II intron initially identified in the bacterium Sinorhizobium meliloti, which encodes a reverse transcriptase–maturase (Intron Encoded Protein, IEP) lacking the C-terminal DNA binding (D) and DNA endonuclease domains (En). RmInt1 invades cognate intronless homing sites (ISRm2011-2) by a mechanism known as retrohoming. This work describes how the RmInt1 intron spreads in the S.meliloti genome upon acquisition by conjugation. This process was revealed by using the wild-type intron RmInt1 and engineered intron-donor constructs based on ribozyme coding sequence (ΔORF)-derivatives with higher homing efficiency than the wild-type intron. The data demonstrate that RmInt1 propagates into the S.meliloti genome primarily by retrohoming with a strand bias related to replication of the chromosome and symbiotic megaplasmids. Moreover, we show that when expressed in trans from a separate plasmid, the IEP is able to mobilize genomic ΔORF ribozymes that afterward displayed wild-type levels of retrohoming. Our results contribute to get further understanding of how group II introns spread into bacterial genomes in nature. PMID:17158161
Nisa-Martínez, Rafael; Jiménez-Zurdo, José I; Martínez-Abarca, Francisco; Muñoz-Adelantado, Estefanía; Toro, Nicolás
2007-01-01
RmInt1 is a self-splicing and mobile group II intron initially identified in the bacterium Sinorhizobium meliloti, which encodes a reverse transcriptase-maturase (Intron Encoded Protein, IEP) lacking the C-terminal DNA binding (D) and DNA endonuclease domains (En). RmInt1 invades cognate intronless homing sites (ISRm2011-2) by a mechanism known as retrohoming. This work describes how the RmInt1 intron spreads in the S.meliloti genome upon acquisition by conjugation. This process was revealed by using the wild-type intron RmInt1 and engineered intron-donor constructs based on ribozyme coding sequence (DeltaORF)-derivatives with higher homing efficiency than the wild-type intron. The data demonstrate that RmInt1 propagates into the S.meliloti genome primarily by retrohoming with a strand bias related to replication of the chromosome and symbiotic megaplasmids. Moreover, we show that when expressed in trans from a separate plasmid, the IEP is able to mobilize genomic DeltaORF ribozymes that afterward displayed wild-type levels of retrohoming. Our results contribute to get further understanding of how group II introns spread into bacterial genomes in nature.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Xu; Zhou, Jing; Reeders, S.T.
1996-05-01
Basement membrane (type IV) collagen, a subfamily of the collagen protein family, is encoded by six distinct genes in mammals. Three of those, COL4A3, COL4A4, and COL4A5, are linked with Alport syndrome (hereditary nephritis). Patients with leimoyomatosis associated with Alport syndrome have been shown to have deletions in the 5{prime} end of the COL4A6 gene, in addition to having deletions in COL4A6. The human COL4A6 gene is reported to be 425 kb as determined by mapping of overlapping YAC clones by probes for its 5{prime} and 3{prime} ends. In the present study we describe the complete exon/intron size pattern ofmore » the human COL4A6 gene. The 12 {lambda} phage clones characterized in the study spanned a total of 110 kb, including 85 kb of the actual gene and 25 kb of flanking sequences. The overlapping clones contained all 46 exons of the gene and all introns, except for intron 2. Since the total size of the exons and all introns except for intron 2 is about 85 kb, intron 2 must be about 340 kb. All exons of the gene were assigned to EcoRI restriction fragments to facilitate analysis of the gene in patients with leiomyomatosis associated with Alport syndrome. The exon size pattern of COL4A6 is highly homologous with that of the human and mouse COL4A2 genes, with 27 of the 46 exons of COL4A6 being identical in size between the genes. 42 refs., 2 figs., 3 tabs.« less
The role of transposable elements in the evolution of non-mammalian vertebrates and invertebrates
2010-01-01
Background Transposable elements (TEs) have played an important role in the diversification and enrichment of mammalian transcriptomes through various mechanisms such as exonization and intronization (the birth of new exons/introns from previously intronic/exonic sequences, respectively), and insertion into first and last exons. However, no extensive analysis has compared the effects of TEs on the transcriptomes of mammals, non-mammalian vertebrates and invertebrates. Results We analyzed the influence of TEs on the transcriptomes of five species, three invertebrates and two non-mammalian vertebrates. Compared to previously analyzed mammals, there were lower levels of TE introduction into introns, significantly lower numbers of exonizations originating from TEs and a lower percentage of TE insertion within the first and last exons. Although the transcriptomes of vertebrates exhibit significant levels of exonization of TEs, only anecdotal cases were found in invertebrates. In vertebrates, as in mammals, the exonized TEs are mostly alternatively spliced, indicating that selective pressure maintains the original mRNA product generated from such genes. Conclusions Exonization of TEs is widespread in mammals, less so in non-mammalian vertebrates, and very low in invertebrates. We assume that the exonization process depends on the length of introns. Vertebrates, unlike invertebrates, are characterized by long introns and short internal exons. Our results suggest that there is a direct link between the length of introns and exonization of TEs and that this process became more prevalent following the appearance of mammals. PMID:20525173
A conserved intronic U1 snRNP-binding sequence promotes trans-splicing in Drosophila
Gao, Jun-Li; Fan, Yu-Jie; Wang, Xiu-Ye; Zhang, Yu; Pu, Jia; Li, Liang; Shao, Wei; Zhan, Shuai; Hao, Jianjiang
2015-01-01
Unlike typical cis-splicing, trans-splicing joins exons from two separate transcripts to produce chimeric mRNA and has been detected in most eukaryotes. Trans-splicing in trypanosomes and nematodes has been characterized as a spliced leader RNA-facilitated reaction; in contrast, its mechanism in higher eukaryotes remains unclear. Here we investigate mod(mdg4), a classic trans-spliced gene in Drosophila, and report that two critical RNA sequences in the middle of the last 5′ intron, TSA and TSB, promote trans-splicing of mod(mdg4). In TSA, a 13-nucleotide (nt) core motif is conserved across Drosophila species and is essential and sufficient for trans-splicing, which binds U1 small nuclear RNP (snRNP) through strong base-pairing with U1 snRNA. In TSB, a conserved secondary structure acts as an enhancer. Deletions of TSA and TSB using the CRISPR/Cas9 system result in developmental defects in flies. Although it is not clear how the 5′ intron finds the 3′ introns, compensatory changes in U1 snRNA rescue trans-splicing of TSA mutants, demonstrating that U1 recruitment is critical to promote trans-splicing in vivo. Furthermore, TSA core-like motifs are found in many other trans-spliced Drosophila genes, including lola. These findings represent a novel mechanism of trans-splicing, in which RNA motifs in the 5′ intron are sufficient to bring separate transcripts into close proximity to promote trans-splicing. PMID:25838544
RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease.
Xiong, Hui Y; Alipanahi, Babak; Lee, Leo J; Bretschneider, Hannes; Merico, Daniele; Yuen, Ryan K C; Hua, Yimin; Gueroussov, Serge; Najafabadi, Hamed S; Hughes, Timothy R; Morris, Quaid; Barash, Yoseph; Krainer, Adrian R; Jojic, Nebojsa; Scherer, Stephen W; Blencowe, Benjamin J; Frey, Brendan J
2015-01-09
To facilitate precision medicine and whole-genome annotation, we developed a machine-learning technique that scores how strongly genetic variants affect RNA splicing, whose alteration contributes to many diseases. Analysis of more than 650,000 intronic and exonic variants revealed widespread patterns of mutation-driven aberrant splicing. Intronic disease mutations that are more than 30 nucleotides from any splice site alter splicing nine times as often as common variants, and missense exonic disease mutations that have the least impact on protein function are five times as likely as others to alter splicing. We detected tens of thousands of disease-causing mutations, including those involved in cancers and spinal muscular atrophy. Examination of intronic and exonic variants found using whole-genome sequencing of individuals with autism revealed misspliced genes with neurodevelopmental phenotypes. Our approach provides evidence for causal variants and should enable new discoveries in precision medicine. Copyright © 2015, American Association for the Advancement of Science.
Euglena Transcript Processing.
McWatters, David C; Russell, Anthony G
2017-01-01
RNA transcript processing is an important stage in the gene expression pathway of all organisms and is subject to various mechanisms of control that influence the final levels of gene products. RNA processing involves events such as nuclease-mediated cleavage, removal of intervening sequences referred to as introns and modifications to RNA structure (nucleoside modification and editing). In Euglena, RNA transcript processing was initially examined in chloroplasts because of historical interest in the secondary endosymbiotic origin of this organelle in this organism. More recent efforts to examine mitochondrial genome structure and RNA maturation have been stimulated by the discovery of unusual processing pathways in other Euglenozoans such as kinetoplastids and diplonemids. Eukaryotes containing large genomes are now known to typically contain large collections of introns and regulatory RNAs involved in RNA processing events, and Euglena gracilis in particular has a relatively large genome for a protist. Studies examining the structure of nuclear genes and the mechanisms involved in nuclear RNA processing have revealed that indeed Euglena contains large numbers of introns in the limited set of genes so far examined and also possesses large numbers of specific classes of regulatory and processing RNAs, such as small nucleolar RNAs (snoRNAs). Most interestingly, these studies have also revealed that Euglena possesses novel processing pathways generating highly fragmented cytosolic ribosomal RNAs and subunits and non-conventional intron classes removed by unknown splicing mechanisms. This unexpected diversity in RNA processing pathways emphasizes the importance of identifying the components involved in these processing mechanisms and their evolutionary emergence in Euglena species.
Functionality of In vitro Reconstituted Group II Intron RmInt1-Derived Ribonucleoprotein Particles.
Molina-Sánchez, Maria D; García-Rodríguez, Fernando M; Toro, Nicolás
2016-01-01
The functional unit of mobile group II introns is a ribonucleoprotein particle (RNP) consisting of the intron-encoded protein (IEP) and the excised intron RNA. The IEP has reverse transcriptase activity but also promotes RNA splicing, and the RNA-protein complex triggers site-specific DNA insertion by reverse splicing, in a process called retrohoming. In vitro reconstituted ribonucleoprotein complexes from the Lactococcus lactis group II intron Ll.LtrB, which produce a double strand break, have recently been studied as a means of developing group II intron-based gene targeting methods for higher organisms. The Sinorhizobium meliloti group II intron RmInt1 is an efficient mobile retroelement, the dispersal of which appears to be linked to transient single-stranded DNA during replication. The RmInt1IEP lacks the endonuclease domain (En) and cannot cut the bottom strand to generate the 3' end to initiate reverse transcription. We used an Escherichia coli expression system to produce soluble and active RmInt1 IEP and reconstituted RNPs with purified components in vitro . The RNPs generated were functional and reverse-spliced into a single-stranded DNA target. This work constitutes the starting point for the use of group II introns lacking DNA endonuclease domain-derived RNPs for highly specific gene targeting methods.
Functionality of In vitro Reconstituted Group II Intron RmInt1-Derived Ribonucleoprotein Particles
Molina-Sánchez, Maria D.; García-Rodríguez, Fernando M.; Toro, Nicolás
2016-01-01
The functional unit of mobile group II introns is a ribonucleoprotein particle (RNP) consisting of the intron-encoded protein (IEP) and the excised intron RNA. The IEP has reverse transcriptase activity but also promotes RNA splicing, and the RNA-protein complex triggers site-specific DNA insertion by reverse splicing, in a process called retrohoming. In vitro reconstituted ribonucleoprotein complexes from the Lactococcus lactis group II intron Ll.LtrB, which produce a double strand break, have recently been studied as a means of developing group II intron-based gene targeting methods for higher organisms. The Sinorhizobium meliloti group II intron RmInt1 is an efficient mobile retroelement, the dispersal of which appears to be linked to transient single-stranded DNA during replication. The RmInt1IEP lacks the endonuclease domain (En) and cannot cut the bottom strand to generate the 3′ end to initiate reverse transcription. We used an Escherichia coli expression system to produce soluble and active RmInt1 IEP and reconstituted RNPs with purified components in vitro. The RNPs generated were functional and reverse-spliced into a single-stranded DNA target. This work constitutes the starting point for the use of group II introns lacking DNA endonuclease domain-derived RNPs for highly specific gene targeting methods. PMID:27730127
Intron retention in viruses and cellular genes: Detention, border controls and passports.
Rekosh, David; Hammarskjold, Marie-Louise
2018-05-01
Intron retention (IR), where one or more introns remain in the RNA after splicing, was long thought to be rare in mammalian cells, albeit common in plants and some viruses. Largely due to the development of better methods for RNA analysis, it has now been recognized that IR is much more common than previously thought and that this mechanism is likely to play an important role in mammalian gene regulation. To date, most publications and reviews about IR have described the resulting mRNAs as "dead end" products, with no direct consequence for the proteome. However, there are also many reports of mRNAs with retained introns giving rise to alternative protein isoforms. Although this was originally revealed in viral systems, there are now numerous examples of bona fide cellular proteins that are translated from mRNAs with retained introns. These new isoforms have sometimes been shown to have important regulatory functions. In this review, we highlight recent developments in this area and the research on viruses that led the way to the realization of the many ways in which mRNAs with retained introns can be regulated. This article is categorized under: RNA Processing > Splicing Mechanisms RNA Processing > Splicing Regulation/Alternative Splicing RNA Export and Localization > Nuclear Export/Import RNA Interactions with Proteins and Other Molecules > RNA-Protein Complexes. © 2018 Wiley Periodicals, Inc.
Processing of Archaebacterial Intron-Containing tRNA Gene Transcripts
1988-07-27
number) The overall goal of this project is to develop an understanding of tRNA gene structure and transcript processing in the halophilic Archaebacteria...containing precursor tRNAs in the halophilic Archaebecteria suggest that tRNATr p may be the only interrupted tR?4A gene in these organisms...1 August 1986 RESEARCH OBJECTIVE: To determine the mechanism of tRNA intron processing in the halophilic archaebacteria; characterize the enzyme
Muddukrishna, Bhavana; Jackson, Christopher A; Yu, Michael C
2017-06-01
Protein arginine methylation occurs on spliceosomal components and spliceosome-associated proteins, but how this modification contributes to their function in pre-mRNA splicing remains sparse. Here we provide evidence that protein arginine methylation of the yeast SR-/hnRNP-like protein Npl3 plays a role in facilitating efficient splicing of the SUS1 intron that harbors a non-consensus 5' splice site and branch site. In yeast cells lacking the major protein arginine methyltransferase HMT1, we observed a change in the co-transcriptional recruitment of the U1 snRNP subunit Snp1 and Npl3 to pre-mRNAs harboring both consensus (ECM33 and ASC1) and non-consensus (SUS1) 5' splice site and branch site. Using an Npl3 mutant that phenocopies wild-type Npl3 when expressed in Δhmt1 cells, we showed that the arginine methylation of Npl3 is responsible for this. Examination of pre-mRNA splicing efficiency in these mutants reveals the requirement of Npl3 methylation for the efficient splicing of SUS1 intron 1, but not of ECM33 or ASC1. Changing the 5' splice site and branch site in SUS1 intron 1 to the consensus form restored splicing efficiency in an Hmt1-independent manner. Results from biochemical studies show that methylation of Npl3 promotes its optimal association with the U1 snRNP through its association with the U1 snRNP subunit Mud1. Based on these data, we propose a model in which Hmt1, via arginine methylation of Npl3, facilitates U1 snRNP engagement with the pre-mRNA to promote usage of non-consensus splice sites by the splicing machinery. Published by Elsevier B.V.
Beamer, B A; Negri, C; Yen, C J; Gavrilova, O; Rumberger, J M; Durcan, M J; Yarnall, D P; Hawkins, A L; Griffin, C A; Burns, D K; Roth, J; Reitman, M; Shuldiner, A R
1997-04-28
We determined the chromosomal localization and partial genomic structure of the coding region of the human PPAR gamma gene (hPPAR gamma), a nuclear receptor important for adipocyte differentiation and function. Sequence analysis and long PCR of human genomic DNA with primers that span putative introns revealed that intron positions and sizes of hPPAR gamma are similar to those previously determined for the mouse PPAR gamma gene[13]. Fluorescent in situ hybridization localized hPPAR gamma to chromosome 3, band 3p25. Radiation hybrid mapping with two independent primer pairs was consistent with hPPAR gamma being within 1.5 Mb of marker D3S1263 on 3p25-p24.2. These sequences of the intron/exon junctions of the 6 coding exons shared by hPPAR gamma 1 and hPPAR gamma 2 will facilitate screening for possible mutations. Furthermore, D3S1263 is a suitable polymorphic marker for linkage analysis to evaluate PPAR gamma's potential contribution to genetic susceptibility to obesity, lipoatrophy, insulin resistance, and diabetes.
Dorababu, Patchva; Nagesh, Narayana; Linga, Vijay Gandhi; Gundeti, Sadashivudu; Kutala, Vijay Kumar; Reddanna, Pallu; Digumarti, Raghunadharao
2012-04-01
To explore the role of genetic variants of thiopurine methyltransferase (TPMT) and inosine triphosphate pyrophosphatase (ITPA) in 6-mercaptopurine (6-MP)-induced toxicity in Indian children with acute lymphoblastic leukemia (ALL). Children with ALL receiving 6-MP in maintenance phase of treatment (n = 90) were enrolled in the study. Bidirectional sequencing of TPMT (whole gene) and ITPA (exon 2, exon 3, and intron 2) was undertaken, and correlation between genotype and 6-MP toxicity was assessed. Five variations were observed in TPMT, including two exonic variations, TPMT*12 (374 C > T) and TPMT*3C (719A > G), and three intronic, intron 3 (12356 C > T), intron 4 (16638 C > T), and TPMT rs2842949. Two exonic, ITPA exon -2 (94 C → A) and exon 3 of ITPA (138 G > A), and one intronic, ITPA intron 2 (A→C), variations were observed in ITPA. Multifactor dimensionality reduction analysis of all the genetic variants showed independent association of ITPA 94 C→A as well as synergic epistatic interactions, i.e., TPMT*12 × ITPA ex3, ITPA ex2 × TPMT*12 × ITPA ex3, and TPMT*3C × ITPA ex2 × TPMT*12 × ITPA ex3, in determining hematological toxicity. This is further substantiated by a multiple linear regression model, which showed moderate predictability of toxicity with these variants (area under the curve = 0.70, p = 0.004). Our results suggest that apart from the individual effect of ITPA 94 C→A, epistatic interactions between the variations of TPMT (*3C, *12) and ITPA (ex2, ex3) are associated with the 6-MP toxicity. Testing these variants facilitates tailoring of the 6-MP therapy in children with ALL.
Li, Fang; Vensko, Steven P.; Belikoff, Esther J.; Scott, Maxwell J.
2013-01-01
Transformer (TRA) promotes female development in several dipteran species including the Australian sheep blowfly Lucilia cuprina, the Mediterranean fruit fly, housefly and Drosophila melanogaster. tra transcripts are sex-specifically spliced such that only the female form encodes full length functional protein. The presence of six predicted TRA/TRA2 binding sites in the sex-specific female intron of the L. cuprina gene suggested that tra splicing is auto-regulated as in medfly and housefly. With the aim of identifying conserved motifs that may play a role in tra sex-specific splicing, here we have isolated and characterized the tra gene from three additional blowfly species, L. sericata, Cochliomyia hominivorax and C. macellaria. The blowfly adult male and female transcripts differ in the choice of splice donor site in the first intron, with males using a site downstream of the site used in females. The tra genes all contain a single TRA/TRA2 site in the male exon and a cluster of four to five sites in the male intron. However, overall the sex-specific intron sequences are poorly conserved in closely related blowflies. The most conserved regions are around the exon/intron junctions, the 3′ end of the intron and near the cluster of TRA/TRA2 sites. We propose a model for sex specific regulation of tra splicing that incorporates the conserved features identified in this study. In L. sericata embryos, the male tra transcript was first detected at around the time of cellular blastoderm formation. RNAi experiments showed that tra is required for female development in L. sericata and C. macellaria. The isolation of the tra gene from the New World screwworm fly C. hominivorax, a major livestock pest, will facilitate the development of a “male-only” strain for genetic control programs. PMID:23409170
Huang, Xue-Yong; Niu, Jin; Sun, Ming-Xi; Zhu, Jun; Gao, Ju-Fang; Yang, Jun; Zhou, Que; Yang, Zhong-Nan
2013-01-01
Arabidopsis thaliana CYCLIN-DEPEDENT KINASE G1 (CDKG1) belongs to the family of cyclin-dependent protein kinases that were originally characterized as cell cycle regulators in eukaryotes. Here, we report that CDKG1 regulates pre-mRNA splicing of CALLOSE SYNTHASE5 (CalS5) and, therefore, pollen wall formation. The knockout mutant cdkg1 exhibits reduced male fertility with impaired callose synthesis and abnormal pollen wall formation. The sixth intron in CalS5 pre-mRNA, a rare type of intron with a GC 5′ splice site, is abnormally spliced in cdkg1. RNA immunoprecipitation analysis suggests that CDKG1 is associated with this intron. CDKG1 contains N-terminal Ser/Arg (RS) motifs and interacts with splicing factor Arginine/Serine-Rich Zinc Knuckle-Containing Protein33 (RSZ33) through its RS region to regulate proper splicing. CDKG1 and RS-containing Zinc Finger Protein22 (SRZ22), a splicing factor interacting with RSZ33 and U1 small nuclear ribonucleoprotein particle (snRNP) component U1-70k, colocalize in nuclear speckles and reside in the same complex. We propose that CDKG1 is recruited to U1 snRNP through RSZ33 to facilitate the splicing of the sixth intron of CalS5. PMID:23404887
Beauregard, Arthur; Chalamcharla, Venkata R; Piazza, Carol Lyn; Belfort, Marlene; Coros, Colin J
2006-11-01
Group II introns are mobile genetic elements that invade their cognate intron-minus alleles via an RNA intermediate, in a process known as retrohoming. They can also retrotranspose to ectopic sites at low frequency. In Escherichia coli, retrotransposition of the lactococcal group II intron, Ll.LtrB, occurs preferentially within the Ori and Ter macrodomains of the E. coli chromosome. These macrodomains migrate towards the poles of the cell, where the intron-encoded protein, LtrA, localizes. Here we investigate whether alteration of nucleoid condensation, chromosome partitioning and replication affect retrotransposition frequencies, as well as bipolar localization of the Ll.LtrB intron integration and LtrA distribution in E. coli. We thus examined these properties in the absence of the nucleoid-associated proteins H-NS, StpA and MukB, in variants of partitioning functions including the centromere-like sequence migS and the actin homologue MreB, as well as in the replication mutants DeltaoriC, seqA, tus and topoIV (ts). Although there were some dramatic fluctuations in retrotransposition levels in these hosts, bipolar localization of integration events was maintained. LtrA was consistently found in nucleoid-free regions, with its localization to the cellular poles being largely preserved in these hosts. Together, these results suggest that bipolar localization of group II intron retrotransposition results from the residence of the intron-encoded protein at the poles of the cell.
Deveson, Ira W.; Holleley, Clare E.; Blackburn, James; Marshall Graves, Jennifer A.; Mattick, John S.; Waters, Paul D.; Georges, Arthur
2017-01-01
In many vertebrates, sex of offspring is determined by external environmental cues rather than by sex chromosomes. In reptiles, for instance, temperature-dependent sex determination (TSD) is common. Despite decades of work, the mechanism by which temperature is converted into a sex-determining signal remains mysterious. This is partly because it is difficult to distinguish the primary molecular events of TSD from the confounding downstream signatures of sexual differentiation. We use the Australian central bearded dragon, in which chromosomal sex determination is overridden at high temperatures to produce sex-reversed female offspring, as a unique model to identify TSD-specific features of the transcriptome. We show that an intron is retained in mature transcripts from each of two Jumonji family genes, JARID2 and JMJD3, in female dragons that have been sex-reversed by temperature but not in normal chromosomal females or males. JARID2 is a component of the master chromatin modifier Polycomb Repressive Complex 2, and the mammalian sex-determining factor SRY is directly regulated by an independent but closely related Jumonji family member. We propose that the perturbation of JARID2/JMJD3 function by intron retention alters the epigenetic landscape to override chromosomal sex-determining cues, triggering sex reversal at extreme temperatures. Sex reversal may then facilitate a transition from genetic sex determination to TSD, with JARID2/JMJD3 intron retention preserved as the decisive regulatory signal. Significantly, we also observe sex-associated differential retention of the equivalent introns in JARID2/JMJD3 transcripts expressed in embryonic gonads from TSD alligators and turtles, indicative of a reptile-wide mechanism controlling TSD. PMID:28630932
Deveson, Ira W; Holleley, Clare E; Blackburn, James; Marshall Graves, Jennifer A; Mattick, John S; Waters, Paul D; Georges, Arthur
2017-06-01
In many vertebrates, sex of offspring is determined by external environmental cues rather than by sex chromosomes. In reptiles, for instance, temperature-dependent sex determination (TSD) is common. Despite decades of work, the mechanism by which temperature is converted into a sex-determining signal remains mysterious. This is partly because it is difficult to distinguish the primary molecular events of TSD from the confounding downstream signatures of sexual differentiation. We use the Australian central bearded dragon, in which chromosomal sex determination is overridden at high temperatures to produce sex-reversed female offspring, as a unique model to identify TSD-specific features of the transcriptome. We show that an intron is retained in mature transcripts from each of two Jumonji family genes, JARID2 and JMJD3 , in female dragons that have been sex-reversed by temperature but not in normal chromosomal females or males. JARID2 is a component of the master chromatin modifier Polycomb Repressive Complex 2, and the mammalian sex-determining factor SRY is directly regulated by an independent but closely related Jumonji family member. We propose that the perturbation of JARID2/JMJD3 function by intron retention alters the epigenetic landscape to override chromosomal sex-determining cues, triggering sex reversal at extreme temperatures. Sex reversal may then facilitate a transition from genetic sex determination to TSD, with JARID2/JMJD3 intron retention preserved as the decisive regulatory signal. Significantly, we also observe sex-associated differential retention of the equivalent introns in JARID2/JMJD3 transcripts expressed in embryonic gonads from TSD alligators and turtles, indicative of a reptile-wide mechanism controlling TSD.
Porcelli, Damiano; Barsanti, Paolo; Pesole, Graziano; Caggese, Corrado
2007-01-01
Background When orthologous sequences from species distributed throughout an optimal range of divergence times are available, comparative genomics is a powerful tool to address problems such as the identification of the forces that shape gene structure during evolution, although the functional constraints involved may vary in different genes and lineages. Results We identified and annotated in the MitoComp2 dataset the orthologs of 68 nuclear genes controlling oxidative phosphorylation in 11 Drosophilidae species and in five non-Drosophilidae insects, and compared them with each other and with their counterparts in three vertebrates (Fugu rubripes, Danio rerio and Homo sapiens) and in the cnidarian Nematostella vectensis, taking into account conservation of gene structure and regulatory motifs, and preservation of gene paralogs in the genome. Comparative analysis indicates that the ancestral insect OXPHOS genes were intron rich and that extensive intron loss and lineage-specific intron gain occurred during evolution. Comparison with vertebrates and cnidarians also shows that many OXPHOS gene introns predate the cnidarian/Bilateria evolutionary split. The nuclear respiratory gene element (NRG) has played a key role in the evolution of the insect OXPHOS genes; it is constantly conserved in the OXPHOS orthologs of all the insect species examined, while their duplicates either completely lack the element or possess only relics of the motif. Conclusion Our observations reinforce the notion that the common ancestor of most animal phyla had intron-rich gene, and suggest that changes in the pattern of expression of the gene facilitate the fixation of duplications in the genome and the development of novel genetic functions. PMID:18315839
Contribution of Mobile Group II Introns to Sinorhizobium meliloti Genome Evolution.
Toro, Nicolás; Martínez-Abarca, Francisco; Molina-Sánchez, María D; García-Rodríguez, Fernando M; Nisa-Martínez, Rafael
2018-01-01
Mobile group II introns are ribozymes and retroelements that probably originate from bacteria. Sinorhizobium meliloti , the nitrogen-fixing endosymbiont of legumes of genus Medicago , harbors a large number of these retroelements. One of these elements, RmInt1, has been particularly successful at colonizing this multipartite genome. Many studies have improved our understanding of RmInt1 and phylogenetically related group II introns, their mobility mechanisms, spread and dynamics within S. meliloti and closely related species. Although RmInt1 conserves the ancient retroelement behavior, its evolutionary history suggests that this group II intron has played a role in the short- and long-term evolution of the S. meliloti genome. We will discuss its proposed role in genome evolution by controlling the spread and coexistence of potentially harmful mobile genetic elements, by ectopic transposition to different genetic loci as a source of early genomic variation and by generating sequence variation after a very slow degradation process, through intron remnants that may have continued to evolve, contributing to bacterial speciation.
Contribution of Mobile Group II Introns to Sinorhizobium meliloti Genome Evolution
Toro, Nicolás; Martínez-Abarca, Francisco; Molina-Sánchez, María D.; García-Rodríguez, Fernando M.; Nisa-Martínez, Rafael
2018-01-01
Mobile group II introns are ribozymes and retroelements that probably originate from bacteria. Sinorhizobium meliloti, the nitrogen-fixing endosymbiont of legumes of genus Medicago, harbors a large number of these retroelements. One of these elements, RmInt1, has been particularly successful at colonizing this multipartite genome. Many studies have improved our understanding of RmInt1 and phylogenetically related group II introns, their mobility mechanisms, spread and dynamics within S. meliloti and closely related species. Although RmInt1 conserves the ancient retroelement behavior, its evolutionary history suggests that this group II intron has played a role in the short- and long-term evolution of the S. meliloti genome. We will discuss its proposed role in genome evolution by controlling the spread and coexistence of potentially harmful mobile genetic elements, by ectopic transposition to different genetic loci as a source of early genomic variation and by generating sequence variation after a very slow degradation process, through intron remnants that may have continued to evolve, contributing to bacterial speciation. PMID:29670598
Splicing of goose parvovirus pre-mRNA influences cytoplasmic translation of the processed mRNA
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Long; Pintel, David J., E-mail: pinteld@missouri.edu
2012-04-25
Translation of goose parvovirus (GPV) 72 kDa Rep 1 is initiated from unspliced P9-generated mRNAs in ORF1 from the first in-frame AUG (537 AUG); however, this AUG is bypassed in spliced P9-generated RNA: translation of the 52 kDa Rep 2 protein from spliced RNA is initiated in ORF2 at the next AUG downstream (650 AUG). Usage of the 537 AUG was restored in spliced RNA when the GPV intron was replaced with a chimeric SV40 intron, or following specific mutations of the GPV intron which did not appear in the final spliced mRNA. Additionally, 650 AUG usage was gained inmore » unspliced RNA when the GPV intron splice sites were debilitated. Splicing-dependent regulation of translation initiation was mediated in cis by GPV RNA surrounding the target AUGs. Thus, nuclear RNA processing of GPV P9-generated pre-mRNAs has a complex, but significant, effect on alternative translation initiation of the GPV Rep proteins.« less
The m6A pathway facilitates sex determination in Drosophila
Kan, Lijuan; Grozhik, Anya V.; Vedanayagam, Jeffrey; Patil, Deepak P.; Pang, Nan; Lim, Kok-Seong; Huang, Yi-Chun; Joseph, Brian; Lin, Ching-Jung; Despic, Vladimir; Guo, Jian; Yan, Dong; Kondo, Shu; Deng, Wu-Min; Dedon, Peter C.; Jaffrey, Samie R.; Lai, Eric C.
2017-01-01
The conserved modification N6-methyladenosine (m6A) modulates mRNA processing and activity. Here, we establish the Drosophila system to study the m6A pathway. We first apply miCLIP to map m6A across embryogenesis, characterize its m6A ‘writer’ complex, validate its YTH ‘readers’ CG6422 and YT521-B, and generate mutants in five m6A factors. While m6A factors with additional roles in splicing are lethal, m6A-specific mutants are viable but present certain developmental and behavioural defects. Notably, m6A facilitates the master female determinant Sxl, since multiple m6A components enhance female lethality in Sxl sensitized backgrounds. The m6A pathway regulates Sxl processing directly, since miCLIP data reveal Sxl as a major intronic m6A target, and female-specific Sxl splicing is compromised in multiple m6A pathway mutants. YT521-B is a dominant m6A effector for Sxl regulation, and YT521-B overexpression can induce female-specific Sxl splicing. Overall, our transcriptomic and genetic toolkit reveals in vivo biologic function for the Drosophila m6A pathway. PMID:28675155
Phase distribution of spliceosomal introns: implications for intron origin
Nguyen, Hung D; Yoshihama, Maki; Kenmochi, Naoya
2006-01-01
Background The origin of spliceosomal introns is the central subject of the introns-early versus introns-late debate. The distribution of intron phases is non-uniform, with an excess of phase-0 introns. Introns-early explains this by speculating that a fraction of present-day introns were present between minigenes in the progenote and therefore must lie in phase-0. In contrast, introns-late predicts that the nonuniformity of intron phase distribution reflects the nonrandomness of intron insertions. Results In this paper, we tested the two theories using analyses of intron phase distribution. We inferred the evolution of intron phase distribution from a dataset of 684 gene orthologs from seven eukaryotes using a maximum likelihood method. We also tested whether the observed intron phase distributions from 10 eukaryotes can be explained by intron insertions on a genome-wide scale. In contrast to the prediction of introns-early, the inferred evolution of intron phase distribution showed that the proportion of phase-0 introns increased over evolution. Consistent with introns-late, the observed intron phase distributions matched those predicted by an intron insertion model quite well. Conclusion Our results strongly support the introns-late hypothesis of the origin of spliceosomal introns. PMID:16959043
Pea chloroplast tRNA(Lys) (UUU) gene: transcription and analysis of an intron-containing gene.
Boyer, S K; Mullet, J E
1988-07-01
The pea chloroplast trnK gene which encodes tRNA(Lys) (UUU) was sequenced. TrnK is located 210 bp upstream from the promoter of psbA and immediately downstream from the 3'-end of rbcL. The gene is transcribed from the same DNA strand as psbA and rbcL. A 2447 bp intron with class II features is located in the trnK anticodon loop. The intron contains a 506 amino acid open reading frame which could encode an RNA maturase. The primary transcript of trnK is 2.9 kb long; its 5'-end was identified as a site of transcription initiation by in vitro transcription experiments. The 5'-terminus is adjacent to DNA sequences previously identified as transcription promoter elements. The most abundant trnK transcript is 2.5 kb long with termini corresponding to the 5' and 3' ends of the trnK exons. Intron specific RNAs were not detected. This suggests that RNA processing which produces tRNA(Lys) leads to rapid degradation of intron sequences.
Tuo, Decai; Shen, Wentao; Yan, Pu; Li, Xiaoying; Zhou, Peng
2015-01-01
Papaya leaf distortion mosaic virus (PLDMV) is becoming a threat to papaya and transgenic papaya resistant to the related pathogen, papaya ringspot virus (PRSV). The generation of infectious viral clones is an essential step for reverse-genetics studies of viral gene function and cross-protection. In this study, a sequence- and ligation-independent cloning system, the In-Fusion® Cloning Kit (Clontech, Mountain View, CA, USA), was used to construct intron-less or intron-containing full-length cDNA clones of the isolate PLDMV-DF, with the simultaneous scarless assembly of multiple viral and intron fragments into a plasmid vector in a single reaction. The intron-containing full-length cDNA clone of PLDMV-DF was stably propagated in Escherichia coli. In vitro intron-containing transcripts were processed and spliced into biologically active intron-less transcripts following mechanical inoculation and then initiated systemic infections in Carica papaya L. seedlings, which developed similar symptoms to those caused by the wild-type virus. However, no infectivity was detected when the plants were inoculated with RNA transcripts from the intron-less construct because the instability of the viral cDNA clone in bacterial cells caused a non-sense or deletion mutation of the genomic sequence of PLDMV-DF. To our knowledge, this is the first report of the construction of an infectious full-length cDNA clone of PLDMV and the splicing of intron-containing transcripts following mechanical inoculation. In-Fusion cloning shortens the construction time from months to days. Therefore, it is a faster, more flexible, and more efficient method than the traditional multistep restriction enzyme-mediated subcloning procedure. PMID:26633465
Tuo, Decai; Shen, Wentao; Yan, Pu; Li, Xiaoying; Zhou, Peng
2015-12-01
Papaya leaf distortion mosaic virus (PLDMV) is becoming a threat to papaya and transgenic papaya resistant to the related pathogen, papaya ringspot virus (PRSV). The generation of infectious viral clones is an essential step for reverse-genetics studies of viral gene function and cross-protection. In this study, a sequence- and ligation-independent cloning system, the In-Fusion(®) Cloning Kit (Clontech, Mountain View, CA, USA), was used to construct intron-less or intron-containing full-length cDNA clones of the isolate PLDMV-DF, with the simultaneous scarless assembly of multiple viral and intron fragments into a plasmid vector in a single reaction. The intron-containing full-length cDNA clone of PLDMV-DF was stably propagated in Escherichia coli. In vitro intron-containing transcripts were processed and spliced into biologically active intron-less transcripts following mechanical inoculation and then initiated systemic infections in Carica papaya L. seedlings, which developed similar symptoms to those caused by the wild-type virus. However, no infectivity was detected when the plants were inoculated with RNA transcripts from the intron-less construct because the instability of the viral cDNA clone in bacterial cells caused a non-sense or deletion mutation of the genomic sequence of PLDMV-DF. To our knowledge, this is the first report of the construction of an infectious full-length cDNA clone of PLDMV and the splicing of intron-containing transcripts following mechanical inoculation. In-Fusion cloning shortens the construction time from months to days. Therefore, it is a faster, more flexible, and more efficient method than the traditional multistep restriction enzyme-mediated subcloning procedure.
Enyeart, Peter J; Mohr, Georg; Ellington, Andrew D; Lambowitz, Alan M
2014-01-13
Mobile group II introns are bacterial retrotransposons that combine the activities of an autocatalytic intron RNA (a ribozyme) and an intron-encoded reverse transcriptase to insert site-specifically into DNA. They recognize DNA target sites largely by base pairing of sequences within the intron RNA and achieve high DNA target specificity by using the ribozyme active site to couple correct base pairing to RNA-catalyzed intron integration. Algorithms have been developed to program the DNA target site specificity of several mobile group II introns, allowing them to be made into 'targetrons.' Targetrons function for gene targeting in a wide variety of bacteria and typically integrate at efficiencies high enough to be screened easily by colony PCR, without the need for selectable markers. Targetrons have found wide application in microbiological research, enabling gene targeting and genetic engineering of bacteria that had been intractable to other methods. Recently, a thermostable targetron has been developed for use in bacterial thermophiles, and new methods have been developed for using targetrons to position recombinase recognition sites, enabling large-scale genome-editing operations, such as deletions, inversions, insertions, and 'cut-and-pastes' (that is, translocation of large DNA segments), in a wide range of bacteria at high efficiency. Using targetrons in eukaryotes presents challenges due to the difficulties of nuclear localization and sub-optimal magnesium concentrations, although supplementation with magnesium can increase integration efficiency, and directed evolution is being employed to overcome these barriers. Finally, spurred by new methods for expressing group II intron reverse transcriptases that yield large amounts of highly active protein, thermostable group II intron reverse transcriptases from bacterial thermophiles are being used as research tools for a variety of applications, including qRT-PCR and next-generation RNA sequencing (RNA-seq). The high processivity and fidelity of group II intron reverse transcriptases along with their novel template-switching activity, which can directly link RNA-seq adaptor sequences to cDNAs during reverse transcription, open new approaches for RNA-seq and the identification and profiling of non-coding RNAs, with potentially wide applications in research and biotechnology.
Moretzsohn, Márcio C.; Gouvea, Ediene G.; Inglis, Peter W.; Leal-Bertioli, Soraya C. M.; Valls, José F. M.; Bertioli, David J.
2013-01-01
Background and Aims The genus Arachis contains 80 described species. Section Arachis is of particular interest because it includes cultivated peanut, an allotetraploid, and closely related wild species, most of which are diploids. This study aimed to analyse the genetic relationships of multiple accessions of section Arachis species using two complementary methods. Microsatellites allowed the analysis of inter- and intraspecific variability. Intron sequences from single-copy genes allowed phylogenetic analysis including the separation of the allotetraploid genome components. Methods Intron sequences and microsatellite markers were used to reconstruct phylogenetic relationships in section Arachis through maximum parsimony and genetic distance analyses. Key Results Although high intraspecific variability was evident, there was good support for most species. However, some problems were revealed, notably a probable polyphyletic origin for A. kuhlmannii. The validity of the genome groups was well supported. The F, K and D genomes grouped close to the A genome group. The 2n = 18 species grouped closer to the B genome group. The phylogenetic tree based on the intron data strongly indicated that A. duranensis and A. ipaënsis are the ancestors of A. hypogaea and A. monticola. Intron nucleotide substitutions allowed the ages of divergences of the main genome groups to be estimated at a relatively recent 2·3–2·9 million years ago. This age and the number of species described indicate a much higher speciation rate for section Arachis than for legumes in general. Conclusions The analyses revealed relationships between the species and genome groups and showed a generally high level of intraspecific genetic diversity. The improved knowledge of species relationships should facilitate the utilization of wild species for peanut improvement. The estimates of speciation rates in section Arachis are high, but not unprecedented. We suggest these high rates may be linked to the peculiar reproductive biology of Arachis. PMID:23131301
Group I introns are widespread in archaea.
Nawrocki, Eric P; Jones, Thomas A; Eddy, Sean R
2018-05-18
Group I catalytic introns have been found in bacterial, viral, organellar, and some eukaryotic genomes, but not in archaea. All known archaeal introns are bulge-helix-bulge (BHB) introns, with the exception of a few group II introns. It has been proposed that BHB introns arose from extinct group I intron ancestors, much like eukaryotic spliceosomal introns are thought to have descended from group II introns. However, group I introns have little sequence conservation, making them difficult to detect with standard sequence similarity searches. Taking advantage of recent improvements in a computational homology search method that accounts for both conserved sequence and RNA secondary structure, we have identified 39 group I introns in a wide range of archaeal phyla, including examples of group I introns and BHB introns in the same host gene.
Wu, Baojun; Hao, Weilong
2014-04-16
Group I introns are highly dynamic and mobile, featuring extensive presence-absence variation and widespread horizontal transfer. Group I introns can invade intron-lacking alleles via intron homing powered by their own encoded homing endonuclease gene (HEG) after horizontal transfer or via reverse splicing through an RNA intermediate. After successful invasion, the intron and HEG are subject to degeneration and sequential loss. It remains unclear whether these mechanisms can fully address the high dynamics and mobility of group I introns. Here, we found that HEGs undergo a fast gain-and-loss turnover comparable with introns in the yeast mitochondrial 21S-rRNA gene, which is unexpected, as the intron and HEG are generally believed to move together as a unit. We further observed extensively mosaic sequences in both the introns and HEGs, and evidence of gene conversion between HEG-containing and HEG-lacking introns. Our findings suggest horizontal transfer and gene conversion can accelerate HEG/intron degeneration and loss, or rescue and propagate HEG/introns, and ultimately result in high HEG/intron turnover rate. Given that up to 25% of the yeast mitochondrial genome is composed of introns and most mitochondrial introns are group I introns, horizontal transfer and gene conversion could have served as an important mechanism in introducing mitochondrial intron diversity, promoting intron mobility and consequently shaping mitochondrial genome architecture.
Efficiency of introns from various origins in fish cells.
Bétancourt, O H; Attal, J; Théron, M C; Puissant, C; Houdebine, L M
1993-06-01
Several vectors containing (1) regulatory regions from Rous sarcoma virus (RSV), human cytomegalovirus (CMV), and herpes simplex thymidine kinase (TK); (2) introns from early or late SV40 genes and from trout growth hormone gene (tGH); (3) chloramphenicol acetyltransferase gene (CAT); and (4) transcription terminators from SV40 were transfected into carp EPC cells, salmon CHSE cells, tilapia TO2 cells, quail QT6 cells, and hamster CHO cells. CAT activity was measured in extracts from several cell lines 3 days after transfection and in the fish EPC stable clones. The CMV and RSV promoters were the most potent in all cell types. The intron from late SV40 genes (VP1 intron) worked properly in QT6 and CHO cells but not in EPC and very weakly in TO2 cells. The tGH intron was efficient in all cell types but preferentially in fish cells. The small t intron from SV40 was processed in all cell types. The small t and, to a lesser extent, the tGH introns amplified expression of cat gene in stable clones, in comparison to the transiently transfected cells. These results indicate that elements from mammalian genes may not be properly recognized by the fish cellular machinery and in an unpredictable manner. This finding suggests that vectors prepared to express foreign genes in transfected cultured fish cells and transgenic fish should preferably contain DNA sequences from fish genes or, alternatively, those sequences from mammalian genes that have been previously proved to be compatible with the fish cellular machinery.
Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W
1996-02-15
Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U).
Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W
1996-01-01
Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U). PMID:8604302
Bergkessel, Megan; Whitworth, Gregg B.; Guthrie, Christine
2011-01-01
Gene expression in eukaryotic cells is profoundly influenced by the post-transcriptional processing of mRNAs, including the splicing of introns in the nucleus and both nuclear and cytoplasmic degradation pathways. These processes have the potential to affect both the steady-state levels and the kinetics of changes to levels of intron-containing transcripts. Here we report the use of a splicing isoform-specific microarray platform to investigate the effects of diverse stress conditions on pre-mRNA processing. Interestingly, we find that diverse stresses cause distinct patterns of changes at this level. The responses we observed are most dramatic for the RPGs and can be categorized into three major classes. The first is characterized by accumulation of RPG pre-mRNA and is seen in multiple types of amino acid starvation regimes; the magnitude of splicing inhibition correlates with the severity of the stress. The second class is characterized by a rapid decrease in both pre- and mature RPG mRNA and is seen in many stresses that inactivate the TORC1 kinase complex. These decreases depend on nuclear turnover of the intron-containing pre-RNAs. The third class is characterized by a decrease in RPG pre-mRNA, with only a modest reduction in the mature species; this response is observed in hyperosmotic and cation-toxic stresses. We show that casein kinase 2 (CK2) makes important contributions to the changes in pre-mRNA processing, particularly for the first two classes of stress responses. In total, our data suggest that complex post-transcriptional programs cooperate to fine-tune expression of intron-containing transcripts in budding yeast. PMID:21697354
Bergkessel, Megan; Whitworth, Gregg B; Guthrie, Christine
2011-08-01
Gene expression in eukaryotic cells is profoundly influenced by the post-transcriptional processing of mRNAs, including the splicing of introns in the nucleus and both nuclear and cytoplasmic degradation pathways. These processes have the potential to affect both the steady-state levels and the kinetics of changes to levels of intron-containing transcripts. Here we report the use of a splicing isoform-specific microarray platform to investigate the effects of diverse stress conditions on pre-mRNA processing. Interestingly, we find that diverse stresses cause distinct patterns of changes at this level. The responses we observed are most dramatic for the RPGs and can be categorized into three major classes. The first is characterized by accumulation of RPG pre-mRNA and is seen in multiple types of amino acid starvation regimes; the magnitude of splicing inhibition correlates with the severity of the stress. The second class is characterized by a rapid decrease in both pre- and mature RPG mRNA and is seen in many stresses that inactivate the TORC1 kinase complex. These decreases depend on nuclear turnover of the intron-containing pre-RNAs. The third class is characterized by a decrease in RPG pre-mRNA, with only a modest reduction in the mature species; this response is observed in hyperosmotic and cation-toxic stresses. We show that casein kinase 2 (CK2) makes important contributions to the changes in pre-mRNA processing, particularly for the first two classes of stress responses. In total, our data suggest that complex post-transcriptional programs cooperate to fine-tune expression of intron-containing transcripts in budding yeast.
Widespread alternative and aberrant splicing revealed by lariat sequencing
Stepankiw, Nicholas; Raghavan, Madhura; Fogarty, Elizabeth A.; Grimson, Andrew; Pleiss, Jeffrey A.
2015-01-01
Alternative splicing is an important and ancient feature of eukaryotic gene structure, the existence of which has likely facilitated eukaryotic proteome expansions. Here, we have used intron lariat sequencing to generate a comprehensive profile of splicing events in Schizosaccharomyces pombe, amongst the simplest organisms that possess mammalian-like splice site degeneracy. We reveal an unprecedented level of alternative splicing, including alternative splice site selection for over half of all annotated introns, hundreds of novel exon-skipping events, and thousands of novel introns. Moreover, the frequency of these events is far higher than previous estimates, with alternative splice sites on average activated at ∼3% the rate of canonical sites. Although a subset of alternative sites are conserved in related species, implying functional potential, the majority are not detectably conserved. Interestingly, the rate of aberrant splicing is inversely related to expression level, with lowly expressed genes more prone to erroneous splicing. Although we validate many events with RNAseq, the proportion of alternative splicing discovered with lariat sequencing is far greater, a difference we attribute to preferential decay of aberrantly spliced transcripts. Together, these data suggest the spliceosome possesses far lower fidelity than previously appreciated, highlighting the potential contributions of alternative splicing in generating novel gene structures. PMID:26261211
Agaphonov, Michael O
2017-12-01
The use of plasmids possessing a regulatable gene coding for a site-specific recombinase together with its recognition sequences significantly facilitates genome manipulations since it allows self-excision of the portion of the genetic construct integrated into the host genome. Stable maintenance of such plasmids in Escherichia coli, which is used for plasmid preparation, requires prevention of recombinase synthesis in this host, which can be achieved by interrupting the recombinase gene with an intron. Based on this approach, Saccharomyces cerevisiae and Hansenula polymorpha self-excising vectors possessing intronated gene for Cre recombinase and its recognition sites (LoxP) were previously constructed. However, this work shows instability of the H. polymorpha vectors during plasmid maintenance in E. coli cells. This could be due to recombination between the loxP sites caused by residual expression of the cre gene. Prevention of translation reinitiation on an internal methionine codon completely solved this problem. A similar modification was made in a self-excising vector designed for S. cerevisiae. Apart from substantial improvement of yeast self-excising vectors, the obtained results also narrow down the essential part of Cre sequence. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Splicing-Related Features of Introns Serve to Propel Evolution
Luo, Yuping; Li, Chun; Gong, Xi; Wang, Yanlu; Zhang, Kunshan; Cui, Yaru; Sun, Yi Eve; Li, Siguang
2013-01-01
The role of spliceosomal intronic structures played in evolution has only begun to be elucidated. Comparative genomic analyses of fungal snoRNA sequences, which are often contained within introns and/or exons, revealed that about one-third of snoRNA-associated introns in three major snoRNA gene clusters manifested polymorphisms, likely resulting from intron loss and gain events during fungi evolution. Genomic deletions can clearly be observed as one mechanism underlying intron and exon loss, as well as generation of complex introns where several introns lie in juxtaposition without intercalating exons. Strikingly, by tracking conserved snoRNAs in introns, we found that some introns had moved from one position to another by excision from donor sites and insertion into target sties elsewhere in the genome without needing transposon structures. This study revealed the origin of many newly gained introns. Moreover, our analyses suggested that intron-containing sequences were more prone to sustainable structural changes than DNA sequences without introns due to intron's ability to jump within the genome via unknown mechanisms. We propose that splicing-related structural features of introns serve as an additional motor to propel evolution. PMID:23516505
Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M
2017-01-01
Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.
Long-read sequencing of nascent RNA reveals coupling among RNA processing events.
Herzel, Lydia; Straube, Korinna; Neugebauer, Karla M
2018-06-14
Pre-mRNA splicing is accomplished by the spliceosome, a megadalton complex that assembles de novo on each intron. Because spliceosome assembly and catalysis occur cotranscriptionally, we hypothesized that introns are removed in the order of their transcription in genomes dominated by constitutive splicing. Remarkably little is known about splicing order and the regulatory potential of nascent transcript remodeling by splicing, due to the limitations of existing methods that focus on analysis of mature splicing products (mRNAs) rather than substrates and intermediates. Here, we overcome this obstacle through long-read RNA sequencing of nascent, multi-intron transcripts in the fission yeast Schizosaccharomyces pombe Most multi-intron transcripts were fully spliced, consistent with rapid cotranscriptional splicing. However, an unexpectedly high proportion of transcripts were either fully spliced or fully unspliced, suggesting that splicing of any given intron is dependent on the splicing status of other introns in the transcript. Supporting this, mild inhibition of splicing by a temperature-sensitive mutation in prp2 , the homolog of vertebrate U2AF65, increased the frequency of fully unspliced transcripts. Importantly, fully unspliced transcripts displayed transcriptional read-through at the polyA site and were degraded cotranscriptionally by the nuclear exosome. Finally, we show that cellular mRNA levels were reduced in genes with a high number of unspliced nascent transcripts during caffeine treatment, showing regulatory significance of cotranscriptional splicing. Therefore, overall splicing of individual nascent transcripts, 3' end formation, and mRNA half-life depend on the splicing status of neighboring introns, suggesting crosstalk among spliceosomes and the polyA cleavage machinery during transcription elongation. © 2018 Herzel et al.; Published by Cold Spring Harbor Laboratory Press.
Extensive intron gain in the ancestor of placental mammals
2011-01-01
Background Genome-wide studies of intron dynamics in mammalian orthologous genes have found convincing evidence for loss of introns but very little for intron turnover. Similarly, large-scale analysis of intron dynamics in a few vertebrate genomes has identified only intron losses and no gains, indicating that intron gain is an extremely rare event in vertebrate evolution. These studies suggest that the intron-rich genomes of vertebrates do not allow intron gain. The aim of this study was to search for evidence of de novo intron gain in domesticated genes from an analysis of their exon/intron structures. Results A phylogenomic approach has been used to analyse all domesticated genes in mammals and chordates that originated from the coding parts of transposable elements. Gain of introns in domesticated genes has been reconstructed on well established mammalian, vertebrate and chordate phylogenies, and examined as to where and when the gain events occurred. The locations, sizes and amounts of de novo introns gained in the domesticated genes during the evolution of mammals and chordates has been analyzed. A significant amount of intron gain was found only in domesticated genes of placental mammals, where more than 70 cases were identified. De novo gained introns show clear positional bias, since they are distributed mainly in 5' UTR and coding regions, while 3' UTR introns are very rare. In the coding regions of some domesticated genes up to 8 de novo gained introns have been found. Intron densities in Eutheria-specific domesticated genes and in older domesticated genes that originated early in vertebrates are lower than those for normal mammalian and vertebrate genes. Surprisingly, the majority of intron gains have occurred in the ancestor of placentals. Conclusions This study provides the first evidence for numerous intron gains in the ancestor of placental mammals and demonstrates that adequate taxon sampling is crucial for reconstructing intron evolution. The findings of this comprehensive study slightly challenge the current view on the evolutionary stasis in intron dynamics during the last 100 - 200 My. Domesticated genes could constitute an excellent system on which to analyse the mechanisms of intron gain in placental mammals. Reviewers: this article was reviewed by Dan Graur, Eugene V. Koonin and Jürgen Brosius. PMID:22112745
Patterns and rates of intron divergence between humans and chimpanzees
Gazave, Elodie; Marqués-Bonet, Tomàs; Fernando, Olga; Charlesworth, Brian; Navarro, Arcadi
2007-01-01
Background Introns, which constitute the largest fraction of eukaryotic genes and which had been considered to be neutral sequences, are increasingly acknowledged as having important functions. Several studies have investigated levels of evolutionary constraint along introns and across classes of introns of different length and location within genes. However, thus far these studies have yielded contradictory results. Results We present the first analysis of human-chimpanzee intron divergence, in which differences in the number of substitutions per intronic site (Ki) can be interpreted as the footprint of different intensities and directions of the pressures of natural selection. Our main findings are as follows: there was a strong positive correlation between intron length and divergence; there was a strong negative correlation between intron length and GC content; and divergence rates vary along introns and depending on their ordinal position within genes (for instance, first introns are more GC rich, longer and more divergent, and divergence is lower at the 3' and 5' ends of all types of introns). Conclusion We show that the higher divergence of first introns is related to their larger size. Also, the lower divergence of short introns suggests that they may harbor a relatively greater proportion of regulatory elements than long introns. Moreover, our results are consistent with the presence of functionally relevant sequences near the 5' and 3' ends of introns. Finally, our findings suggest that other parts of introns may also be under selective constraints. PMID:17309804
Evaluation of the mechanisms of intron loss and gain in the social amoebae Dictyostelium.
Ma, Ming-Yue; Che, Xun-Ru; Porceddu, Andrea; Niu, Deng-Ke
2015-12-18
Spliceosomal introns are a common feature of eukaryotic genomes. To approach a comprehensive understanding of intron evolution on Earth, studies should look beyond repeatedly studied groups such as animals, plants, and fungi. The slime mold Dictyostelium belongs to a supergroup of eukaryotes not covered in previous studies. We found 441 precise intron losses in Dictyostelium discoideum and 202 precise intron losses in Dictyostelium purpureum. Consistent with these observations, Dictyostelium discoideum was found to have significantly more copies of reverse transcriptase genes than Dictyostelium purpureum. We also found that the lost introns are significantly further from the 5' end of genes than the conserved introns. Adjacent introns were prone to be lost simultaneously in Dictyostelium discoideum. In both Dictyostelium species, the exonic sequences flanking lost introns were found to have a significantly higher GC content than those flanking conserved introns. Together, these observations support a reverse-transcription model of intron loss in which intron losses were caused by gene conversion between genomic DNA and cDNA reverse transcribed from mature mRNA. We also identified two imprecise intron losses in Dictyostelium discoideum that may have resulted from genomic deletions. Ninety-eight putative intron gains were also observed. Consistent with previous studies of other lineages, the source sequences were found in only a small number of cases, with only two instances of intron gain identified in Dictyostelium discoideum. Although they diverged very early from animals and fungi, Dictyostelium species have similar mechanisms of intron loss.
Tissue- and case-specific retention of intron 40 in mature dystrophin mRNA.
Nishida, Atsushi; Minegishi, Maki; Takeuchi, Atsuko; Niba, Emma Tabe Eko; Awano, Hiroyuki; Lee, Tomoko; Iijima, Kazumoto; Takeshima, Yasuhiro; Matsuo, Masafumi
2015-06-01
The dystrophin gene, which is mutated in Duchenne muscular dystrophy (DMD), comprises 79 exons that show multiple alternative splicing events. Intron retention, a type of alternative splicing, may control gene expression. We examined intron retention in dystrophin introns by reverse-transcription PCR from skeletal muscle, focusing on the nine shortest (all <1000 bp), because these are more likely to be retained. Only one, intron 40, was retained in mRNA; sequencing revealed insertion of a complete intron 40 (851 nt) between exons 40 and 41. The intron 40 retention product accounted for 1.2% of the total product but had a premature stop codon at the fifth intronic codon. Intron 40 retention was most strongly observed in the kidney (36.6%) and was not obtained from the fetal liver, lung, spleen or placenta. This indicated that intron retention is a tissue-specific event whose level varies among tissues. In two DMD patients, intron 40 retention was observed in one patient but not in the other. Examination of splicing regulatory factors revealed that intron 40 had the highest guanine-cytosine content of all examined introns in a 30-nt segment at its 3' end. Further studies are needed to clarify the biological role of intron 40-retained dystrophin mRNA.
Jády, Beáta E.; Ketele, Amandine; Kiss, Tamás
2012-01-01
Alu repetitive sequences are the most abundant short interspersed DNA elements in the human genome. Full-length Alu elements are composed of two tandem sequence monomers, the left and right Alu arms, both derived from the 7SL signal recognition particle RNA. Since Alu elements are common in protein-coding genes, they are frequently transcribed into pre-mRNAs. Here, we demonstrate that the right arms of nascent Alu transcripts synthesized within pre-mRNA introns are processed into metabolically stable small RNAs. The intron-encoded Alu RNAs, termed AluACA RNAs, are structurally highly reminiscent of box H/ACA small Cajal body (CB) RNAs (scaRNAs). They are composed of two hairpin units followed by the essential H (AnAnnA) and ACA box motifs. The mature AluACA RNAs associate with the four H/ACA core proteins: dyskerin, Nop10, Nhp2, and Gar1. Moreover, the 3′ hairpin of AluACA RNAs carries two closely spaced CB localization motifs, CAB boxes (UGAG), which bind Wdr79 in a cumulative fashion. In contrast to canonical H/ACA scaRNPs, which concentrate in CBs, the AluACA RNPs accumulate in the nucleoplasm. Identification of 348 human AluACA RNAs demonstrates that intron-encoded AluACA RNAs represent a novel, large subgroup of H/ACA RNAs, which are apparently confined to human or primate cells. PMID:22892240
Intron open reading frames as mobile elements and evolution of a group I intron.
Sellem, C H; Belcour, L
1997-05-01
Group I introns are proposed to have become mobile following the acquisition of open reading frames (ORFs) that encode highly specific DNA endonucleases. This proposal implies that intron ORFs could behave as autonomously mobile entities. This was supported by abundant circumstantial evidence but no experiment of ORF transfer from an ORF-containing intron to its ORF-less counterpart has been described. In this paper we present such experiments, which demonstrate the efficient mobility of the mitochondrial nad1-i4-orf1 between two Podospora strains. The homing of this mobile ORF was accompanied by a bidirectional co-conversion that did not systematically involve the whole intron sequence. Orf1 acquisition would be the most recent step in the evolution of the nad1-i4 intron, which has resulted in many strains of Podospora having an intron with two ORFs (biorfic) and four splicing pathways. We show that two of the splicing events that operate in this biorfic intron, as evidenced by PCR experiments, are generated by a 5'-alternative splice site, which is most probably a remnant of the monoorfic ancestral form of the intron. We propose a sequential evolution model that is consistent with the four organizations of the corresponding nad1 locus that we found among various species of the Pyrenomycete family; these organizations consist of no intron, an intron alone, a monoorfic intron, and a biorfic intron.
Mechanisms Used for Genomic Proliferation by Thermophilic Group II Introns
Mohr, Georg; Ghanem, Eman; Lambowitz, Alan M.
2010-01-01
Mobile group II introns, which are found in bacterial and organellar genomes, are site-specific retroelments hypothesized to be evolutionary ancestors of spliceosomal introns and retrotransposons in higher organisms. Most bacteria, however, contain no more than one or a few group II introns, making it unclear how introns could have proliferated to higher copy numbers in eukaryotic genomes. An exception is the thermophilic cyanobacterium Thermosynechococcus elongatus, which contains 28 closely related copies of a group II intron, constituting ∼1.3% of the genome. Here, by using a combination of bioinformatics and mobility assays at different temperatures, we identified mechanisms that contribute to the proliferation of T. elongatus group II introns. These mechanisms include divergence of DNA target specificity to avoid target site saturation; adaptation of some intron-encoded reverse transcriptases to splice and mobilize multiple degenerate introns that do not encode reverse transcriptases, leading to a common splicing apparatus; and preferential insertion within other mobile introns or insertion elements, which provide new unoccupied sites in expanding non-essential DNA regions. Additionally, unlike mesophilic group II introns, the thermophilic T. elongatus introns rely on elevated temperatures to help promote DNA strand separation, enabling access to a larger number of DNA target sites by base pairing of the intron RNA, with minimal constraint from the reverse transcriptase. Our results provide insight into group II intron proliferation mechanisms and show that higher temperatures, which are thought to have prevailed on Earth during the emergence of eukaryotes, favor intron proliferation by increasing the accessibility of DNA target sites. We also identify actively mobile thermophilic introns, which may be useful for structural studies, gene targeting in thermophiles, and as a source of thermostable reverse transcriptases. PMID:20543989
Schuster, Astrid; Lopez, Jose V; Becking, Leontine E; Kelly, Michelle; Pomponi, Shirley A; Wörheide, Gert; Erpenbeck, Dirk; Cárdenas, Paco
2017-03-20
Mitochondrial introns intermit coding regions of genes and feature characteristic secondary structures and splicing mechanisms. In metazoans, mitochondrial introns have only been detected in sponges, cnidarians, placozoans and one annelid species. Within demosponges, group I and group II introns are present in six families. Based on different insertion sites within the cox1 gene and secondary structures, four types of group I and two types of group II introns are known, which can harbor up to three encoding homing endonuclease genes (HEG) of the LAGLIDADG family (group I) and/or reverse transcriptase (group II). However, only little is known about sponge intron mobility, transmission, and origin due to the lack of a comprehensive dataset. We analyzed the largest dataset on sponge mitochondrial group I introns to date: 95 specimens, from 11 different sponge genera which provided novel insights into the evolution of group I introns. For the first time group I introns were detected in four genera of the sponge family Scleritodermidae (Scleritoderma, Microscleroderma, Aciculites, Setidium). We demonstrated that group I introns in sponges aggregate in the most conserved regions of cox1. We showed that co-occurrence of two introns in cox1 is unique among metazoans, but not uncommon in sponges. However, this combination always associates an active intron with a degenerating one. Earlier hypotheses of HGT were confirmed and for the first time VGT and secondary losses of introns conclusively demonstrated. This study validates the subclass Spirophorina (Tetractinellida) as an intron hotspot in sponges. Our analyses confirm that most sponge group I introns probably originated from fungi. DNA barcoding is discussed and the application of alternative primers suggested.
Zerbato, Madeleine; Holic, Nathalie; Moniot-Frin, Sophie; Ingrao, Dina; Galy, Anne; Perea, Javier
2013-01-01
Group II introns are self-splicing mobile elements found in prokaryotes and eukaryotic organelles. These introns propagate by homing into precise genomic locations, following assembly of a ribonucleoprotein complex containing the intron-encoded protein (IEP) and the spliced intron RNA. Engineered group II introns are now commonly used tools for targeted genomic modifications in prokaryotes but not in eukaryotes. We speculate that the catalytic activation of currently known group II introns is limited in eukaryotic cells. The brown algae Pylaiella littoralis Pl.LSU/2 group II intron is uniquely capable of in vitro ribozyme activity at physiological level of magnesium but this intron remains poorly characterized. We purified and characterized recombinant Pl.LSU/2 IEP. Unlike most IEPs, Pl.LSU/2 IEP displayed a reverse transcriptase activity without intronic RNA. The Pl.LSU/2 intron could be engineered to splice accurately in Saccharomyces cerevisiae and splicing efficiency was increased by the maturase activity of the IEP. However, spliced transcripts were not expressed. Furthermore, intron splicing was not detected in human cells. While further tool development is needed, these data provide the first functional characterization of the PI.LSU/2 IEP and the first evidence that the Pl.LSU/2 group II intron splicing occurs in vivo in eukaryotes in an IEP-dependent manner.
Intron self-complementarity enforces exon inclusion in a yeast pre-mRNA
Howe, Kenneth James; Ares, Manuel
1997-01-01
Skipping of internal exons during removal of introns from pre-mRNA must be avoided for proper expression of most eukaryotic genes. Despite significant understanding of the mechanics of intron removal, mechanisms that ensure inclusion of internal exons in multi-intron pre-mRNAs remain mysterious. Using a natural two-intron yeast gene, we have identified distinct RNA–RNA complementarities within each intron that prevent exon skipping and ensure inclusion of internal exons. We show that these complementarities are positioned to act as intron identity elements, bringing together only the appropriate 5′ splice sites and branchpoints. Destroying either intron self-complementarity allows exon skipping to occur, and restoring the complementarity using compensatory mutations rescues exon inclusion, indicating that the elements act through formation of RNA secondary structure. Introducing new pairing potential between regions near the 5′ splice site of intron 1 and the branchpoint of intron 2 dramatically enhances exon skipping. Similar elements identified in single intron yeast genes contribute to splicing efficiency. Our results illustrate how intron secondary structure serves to coordinate splice site pairing and enforce exon inclusion. We suggest that similar elements in vertebrate genes could assist in the splicing of very large introns and in the evolution of alternative splicing. PMID:9356473
Crotti, Lia; Lewandowska, Marzena A; Schwartz, Peter J; Insolia, Roberto; Pedrazzini, Matteo; Bussani, Erica; Dagradi, Federica; George, Alfred L; Pagani, Franco
2009-02-01
Genetic screening of long QT syndrome (LQTS) fails to identify disease-causing mutations in about 30% of patients. So far, molecular screening has focused mainly on coding sequence mutations or on substitutions at canonical splice sites. The purpose of this study was to explore the possibility that intronic variants not at canonical splice sites might affect splicing regulatory elements, lead to aberrant transcripts, and cause LQTS. Molecular screening was performed through DHPLC and sequence analysis. The role of the intronic mutation identified was assessed with a hybrid minigene splicing assay. A three-generation LQTS family was investigated. Molecular screening failed to identify an obvious disease-causing mutation in the coding sequences of the major LQTS genes but revealed an intronic A-to-G substitution in KCNH2 (IVS9-28A/G) cosegregating with the clinical phenotype in family members. In vitro analysis proved that the mutation disrupts the acceptor splice site definition by affecting the branch point (BP) sequence and promoting intron retention. We further demonstrated a tight functional relationship between the BP and the polypyrimidine tract, whose weakness is responsible for the pathological effect of the IVS9-28A/G mutation. We identified a novel BP mutation in KCNH2 that disrupts the intron 9 acceptor splice site definition and causes LQT2. The present finding demonstrates that intronic mutations affecting pre-mRNA processing may contribute to the failure of traditional molecular screening in identifying disease-causing mutations in LQTS subjects and offers a rationale strategy for the reduction of genotype-negative cases.
Hong, Jin-Sung; Ryu, Ki-Hyun; Kwon, Soon-Jae; Kim, Jin-Won; Kim, Kwang-Soo; Park, Kyong-Cheul
2013-01-01
Polygalacturonase (PG) gene is a typical gene family present in eukaryotes. Forty-nine PGs were mined from the genomes of Neurospora crassa and five Aspergillus species. The PGs were classified into 3 clades such as clade 1 for rhamno-PGs, clade 2 for exo-PGs and clade 3 for exo- and endo-PGs, which were further grouped into 13 sub-clades based on the polypeptide sequence similarity. In gene structure analysis, a total of 124 introns were present in 44 genes and five genes lacked introns to give an average of 2.5 introns per gene. Intron phase distribution was 64.5% for phase 0, 21.8% for phase 1, and 13.7% for phase 2, respectively. The introns varied in their sequences and their lengths ranged from 20 bp to 424 bp with an average of 65.9 bp, which is approximately half the size of introns in other fungal genes. There were 29 homologous intron blocks and 26 of those were sub-clade specific. Intron losses were counted in 18 introns in which no obvious phase preference for intron loss was observed. Eighteen introns were placed at novel positions, which is considerably higher than those of plant PGs. In an evolutionary sense both intron loss and gain must have taken place for shaping the current PGs in these fungi. Together with the small intron size, low conservation of homologous intron blocks and higher number of novel introns, PGs of fungal species seem to have recently undergone highly dynamic evolution. PMID:25288950
Recruitment of Staufen2 Enhances Dendritic Localization of an Intron-Containing CaMKIIα mRNA.
Ortiz, Raúl; Georgieva, Maya V; Gutiérrez, Sara; Pedraza, Neus; Fernández-Moya, Sandra M; Gallego, Carme
2017-07-05
Regulation of mRNA localization is a conserved cellular process observed in many types of cells and organisms. Asymmetrical mRNA distribution plays a particularly important role in the nervous system, where local translation of localized mRNA represents a key mechanism in synaptic plasticity. CaMKIIα is a very abundant mRNA detected in neurites, consistent with its crucial role at glutamatergic synapses. Here, we report the presence of CaMKIIα mRNA isoforms that contain intron i16 in dendrites, RNA granules, and synaptoneurosomes from primary neurons and brain. This subpopulation of unspliced mRNA preferentially localizes to distal dendrites in a synaptic-activity-dependent manner. Staufen2, a well-established marker of RNA transport in dendrites, interacts with intron i16 sequences and enhances its distal dendritic localization, pointing to the existence of intron-mediated mechanisms in the molecular pathways that modulate dendritic transport and localization of synaptic mRNAs. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
Homing endonucleases from mobile group I introns: discovery to genome engineering
2014-01-01
Homing endonucleases are highly specific DNA cleaving enzymes that are encoded within genomes of all forms of microbial life including phage and eukaryotic organelles. These proteins drive the mobility and persistence of their own reading frames. The genes that encode homing endonucleases are often embedded within self-splicing elements such as group I introns, group II introns and inteins. This combination of molecular functions is mutually advantageous: the endonuclease activity allows surrounding introns and inteins to act as invasive DNA elements, while the splicing activity allows the endonuclease gene to invade a coding sequence without disrupting its product. Crystallographic analyses of representatives from all known homing endonuclease families have illustrated both their mechanisms of action and their evolutionary relationships to a wide range of host proteins. Several homing endonucleases have been completely redesigned and used for a variety of genome engineering applications. Recent efforts to augment homing endonucleases with auxiliary DNA recognition elements and/or nucleic acid processing factors has further accelerated their use for applications that demand exceptionally high specificity and activity. PMID:24589358
Da Lage, Jean-Luc; Maczkowiak, Frédérique; Cariou, Marie-Louise
2011-01-01
Most eukaryotes have at least some genes interrupted by introns. While it is well accepted that introns were already present at moderate density in the last eukaryote common ancestor, the conspicuous diversity of intron density among genomes suggests a complex evolutionary history, with marked differences between phyla. The question of the rates of intron gains and loss in the course of evolution and factors influencing them remains controversial. We have investigated a single gene family, alpha-amylase, in 55 species covering a variety of animal phyla. Comparison of intron positions across phyla suggests a complex history, with a likely ancestral intronless gene undergoing frequent intron loss and gain, leading to extant intron/exon structures that are highly variable, even among species from the same phylum. Because introns are known to play no regulatory role in this gene and there is no alternative splicing, the structural differences may be interpreted more easily: intron positions, sizes, losses or gains may be more likely related to factors linked to splicing mechanisms and requirements, and to recognition of introns and exons, or to more extrinsic factors, such as life cycle and population size. We have shown that intron losses outnumbered gains in recent periods, but that “resets” of intron positions occurred at the origin of several phyla, including vertebrates. Rates of gain and loss appear to be positively correlated. No phase preference was found. We also found evidence for parallel gains and for intron sliding. Presence of introns at given positions was correlated to a strong protosplice consensus sequence AG/G, which was much weaker in the absence of intron. In contrast, recent intron insertions were not associated with a specific sequence. In animal Amy genes, population size and generation time seem to have played only minor roles in shaping gene structures. PMID:21611157
Mobile Bacterial Group II Introns at the Crux of Eukaryotic Evolution
Lambowitz, Alan M.; Belfort, Marlene
2015-01-01
SUMMARY This review focuses on recent developments in our understanding of group II intron function, the relationships of these introns to retrotransposons and spliceosomes, and how their common features have informed thinking about bacterial group II introns as key elements in eukaryotic evolution. Reverse transcriptase-mediated and host factor-aided intron retrohoming pathways are considered along with retrotransposition mechanisms to novel sites in bacteria, where group II introns are thought to have originated. DNA target recognition and movement by target-primed reverse transcription infer an evolutionary relationship among group II introns, non-LTR retrotransposons, such as LINE elements, and telomerase. Additionally, group II introns are almost certainly the progenitors of spliceosomal introns. Their profound similarities include splicing chemistry extending to RNA catalysis, reaction stereochemistry, and the position of two divalent metals that perform catalysis at the RNA active site. There are also sequence and structural similarities between group II introns and the spliceosome’s small nuclear RNAs (snRNAs) and between a highly conserved core spliceosomal protein Prp8 and a group II intron-like reverse transcriptase. It has been proposed that group II introns entered eukaryotes during bacterial endosymbiosis or bacterial-archaeal fusion, proliferated within the nuclear genome, necessitating evolution of the nuclear envelope, and fragmented giving rise to spliceosomal introns. Thus, these bacterial self-splicing mobile elements have fundamentally impacted the composition of extant eukaryotic genomes, including the human genome, most of which is derived from close relatives of mobile group II introns. PMID:25878921
De novo insertion of an intron into the mammalian sex determining gene, SRY
O’Neill, Rachel J. Waugh; Brennan, Francine E.; Delbridge, Margaret L.; Crozier, Ross H.; Graves, Jennifer A. Marshall
1998-01-01
Two theories have been proposed to explain the evolution of introns within eukaryotic genes. The introns early theory, or “exon theory of genes,” proposes that introns are ancient and that recombination within introns provided new exon structure, and thus new genes. The introns late theory, or “insertional theory of introns,” proposes that ancient genes existed as uninterrupted exons and that introns have been introduced during the course of evolution. There is still controversy as to how intron–exon structure evolved and whether the majority of introns are ancient or novel. Although there is extensive evidence in support of the introns early theory, phylogenetic comparisons of several genes indicate recent gain and loss of introns within these genes. However, no example has been shown of a protein coding gene, intronless in its ancestral form, which has acquired an intron in a derived form. The mammalian sex determining gene, SRY, is intronless in all mammals studied to date, as is the gene from which it recently evolved. However, we report here comparisons of genomic and cDNA sequences that now provide evidence of a de novo insertion of an intron into the SRY gene of dasyurid marsupials. This recently (approximately 45 million years ago) inserted sequence is not homologous with known transposable elements. Our data demonstrate that introns may be inserted as spliced units within a developmentally crucial gene without disrupting its function. PMID:9465071
NASA Astrophysics Data System (ADS)
Zhao, Xiaoqing; Li, Hong; Bao, Tonglaga; Ying, Zhiqiang
2012-09-01
Many experiment evidences showed that sequence structures of introns and intron loss/gain can influence gene expression, but current mechanisms did not refer to the functions of post-spliced introns directly. We propose that postspliced introns play their functions in gene expression by interacting with their mRNA sequences and the interaction is characterized by the matched segments between introns and their CDS. In this study, we investigated the interaction characters with length series by improved Smith-Waterman local alignment software for the ribosomal protein genes in C. elegans and D. melanogaster. Our results showed that RF values of five intron groups are significantly high in the central non-conserved region and very low in 5'-end and 3'-end splicing region. It is interesting that the number of the optimal matched regions gradually increases with intron length. Distributions of the optimal matched regions are different for five intron groups. Our study revealed that there are more interaction regions between longer introns and their CDS than shorter, and it provides a positive pattern for regulating the gene expression.
SON is a spliceosome-associated factor required for mitotic progression.
Huen, Michael S Y; Sy, Shirley M H; Leung, Ka Man; Ching, Yick-Pang; Tipoe, George L; Man, Cornelia; Dong, Shuo; Chen, Junjie
2010-07-01
The eukaryotic RNA splicing machinery is dedicated to the daunting task of excising intronic sequences on the many nascent RNA transcripts in a cell, and in doing so facilitates proper translation of its transcriptome. Notably, emerging evidence suggests that RNA splicing may also play direct roles in maintaining genome stability. Here we report the identification of the RNA/DNA-binding protein SON as a component of spliceosome that plays pleiotropic roles during mitotic progression. We found that SON is essential for cell proliferation, and that its inactivation triggers a MAD2-dependent mitotic delay. Moreover, SON deficiency is accompanied by defective chromosome congression, compromised chromosome segregation and cytokinesis, which in turn contributes to cellular aneuploidy and cell death. In summary, our study uncovers a specific link between SON and mitosis, and highlights the potential of RNA processing as additional regulatory mechanisms that govern cell proliferation and division. © 2010 Landes Bioscience
SON is a spliceosome-associated factor required for mitotic progression
Sy, Shirley MH; Leung, Ka Man; Ching, Yick-Pang; Tipoe, George L; Man, Cornelia; Dong, Shuo
2010-01-01
The eukaryotic RNA splicing machinery is dedicated to the daunting task of excising intronic sequences on the many nascent RNA transcripts in a cell, and in doing so facilitates proper translation of its transcriptome. Notably, emerging evidence suggests that RNA splicing may also play direct roles in maintaining genome stability. Here we report the identification of the RNA/DNA-binding protein SON as a component of spliceosome that plays pleiotropic roles during mitotic progression. We found that SON is essential for cell proliferation, and that its inactivation triggers a MAD2-dependent mitotic delay. Moreover, SON deficiency is accompanied by defective chromosome congression, compromised chromosome segregation and cytokinesis, which in turn contributes to cellular aneuploidy and cell death. In summary, our study uncovers a specific link between SON and mitosis, and highlights the potential of RNA processing as additional regulatory mechanisms that govern cell proliferation and division. PMID:20581448
Intron-loss evolution of hatching enzyme genes in Teleostei
2010-01-01
Background Hatching enzyme, belonging to the astacin metallo-protease family, digests egg envelope at embryo hatching. Orthologous genes of the enzyme are found in all vertebrate genomes. Recently, we found that exon-intron structures of the genes were conserved among tetrapods, while the genes of teleosts frequently lost their introns. Occurrence of such intron losses in teleostean hatching enzyme genes is an uncommon evolutionary event, as most eukaryotic genes are generally known to be interrupted by introns and the intron insertion sites are conserved from species to species. Here, we report on extensive studies of the exon-intron structures of teleostean hatching enzyme genes for insight into how and why introns were lost during evolution. Results We investigated the evolutionary pathway of intron-losses in hatching enzyme genes of 27 species of Teleostei. Hatching enzyme genes of basal teleosts are of only one type, which conserves the 9-exon-8-intron structure of an assumed ancestor. On the other hand, otocephalans and euteleosts possess two types of hatching enzyme genes, suggesting a gene duplication event in the common ancestor of otocephalans and euteleosts. The duplicated genes were classified into two clades, clades I and II, based on phylogenetic analysis. In otocephalans and euteleosts, clade I genes developed a phylogeny-specific structure, such as an 8-exon-7-intron, 5-exon-4-intron, 4-exon-3-intron or intron-less structure. In contrast to the clade I genes, the structures of clade II genes were relatively stable in their configuration, and were similar to that of the ancestral genes. Expression analyses revealed that hatching enzyme genes were high-expression genes, when compared to that of housekeeping genes. When expression levels were compared between clade I and II genes, clade I genes tends to be expressed more highly than clade II genes. Conclusions Hatching enzyme genes evolved to lose their introns, and the intron-loss events occurred at the specific points of teleostean phylogeny. We propose that the high-expression hatching enzyme genes frequently lost their introns during the evolution of teleosts, while the low-expression genes maintained the exon-intron structure of the ancestral gene. PMID:20796321
Remarkable sequence conservation of the last intron in the PKD1 gene.
Rodova, Marianna; Islam, M Rafiq; Peterson, Kenneth R; Calvet, James P
2003-10-01
The last intron of the PKD1 gene (intron 45) was found to have exceptionally high sequence conservation across four mammalian species: human, mouse, rat, and dog. This conservation did not extend to the comparable intron in pufferfish. Pairwise comparisons for intron 45 showed 91% identity (human vs. dog) to 100% identity (mouse vs. rat) for an average for all four species of 94% identity. In contrast, introns 43 and 44 of the PKD1 gene had average pairwise identities of 57% and 54%, and exons 43, 44, and 45 and the coding region of exon 46 had average pairwise identities of 80%, 84%, 82%, and 80%. Intron 45 is 90 to 95 bp in length, with the major region of sequence divergence being in a central 4-bp to 9-bp variable region. RNA secondary structure analysis of intron 45 predicts a branching stem-loop structure in which the central variable region lies in one loop and the putative branch point sequence lies in another loop, suggesting that the intron adopts a specific stem-loop structure that may be important for its removal. Although intron 45 appears to conform to the class of small, G-triplet-containing introns that are spliced by a mechanism utilizing intron definition, its high sequence conservation may be a reflection of constraints imposed by a unique mechanism that coordinates splicing of this last PKD1 intron with polyadenylation.
Zerbato, Madeleine; Holic, Nathalie; Moniot-Frin, Sophie; Ingrao, Dina; Galy, Anne; Perea, Javier
2013-01-01
Group II introns are self-splicing mobile elements found in prokaryotes and eukaryotic organelles. These introns propagate by homing into precise genomic locations, following assembly of a ribonucleoprotein complex containing the intron-encoded protein (IEP) and the spliced intron RNA. Engineered group II introns are now commonly used tools for targeted genomic modifications in prokaryotes but not in eukaryotes. We speculate that the catalytic activation of currently known group II introns is limited in eukaryotic cells. The brown algae Pylaiella littoralis Pl.LSU/2 group II intron is uniquely capable of in vitro ribozyme activity at physiological level of magnesium but this intron remains poorly characterized. We purified and characterized recombinant Pl.LSU/2 IEP. Unlike most IEPs, Pl.LSU/2 IEP displayed a reverse transcriptase activity without intronic RNA. The Pl.LSU/2 intron could be engineered to splice accurately in Saccharomyces cerevisiae and splicing efficiency was increased by the maturase activity of the IEP. However, spliced transcripts were not expressed. Furthermore, intron splicing was not detected in human cells. While further tool development is needed, these data provide the first functional characterization of the PI.LSU/2 IEP and the first evidence that the Pl.LSU/2 group II intron splicing occurs in vivo in eukaryotes in an IEP-dependent manner. PMID:23505475
Janbon, Guilhem
2018-01-01
In Cryptococcus neoformans, nearly all genes are interrupted by small introns. In recent years, genome annotation and genetic analysis have illuminated the major roles these introns play in the biology of this pathogenic yeast. Introns are necessary for gene expression and alternative splicing can regulate gene expression in response to environmental cues. In addition, recent studies have revealed that C. neoformans introns help to prevent transposon dissemination and protect genome integrity. These characteristics of cryptococcal introns are probably not unique to Cryptococcus, and this yeast likely can be considered as a model for intron-related studies in fungi.
Branchpoint selection in the splicing of U12-dependent introns in vitro.
McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A
2002-05-01
In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome.
Branchpoint selection in the splicing of U12-dependent introns in vitro.
McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A
2002-01-01
In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome. PMID:12022225
Ishida, Ken; Kuboshima, Megumi; Morita, Hiroto; Maeda, Hiroshi; Okamoto, Ayako; Takeuchi, Michio; Yamagata, Youhei
2014-01-01
Alternative splicing is thought to be a means for diversification of products by mRNA modification. Although some intron retentions are predicted by transcriptome analysis in Aspergillus oryzae, its physiological significance remains unknown. We found that intron retention occurred occasionally in the serine-type carboxypeptidase gene, ocpG. Analysis under various culture conditions revealed that extracellular nitrogen conditions influence splicing patterns; this suggested that there might be a correlation between splicing efficiency and the necessity of OcpG activity for obtaining a nitrogen source. Since further analysis showed that splicing occurred independently in each intron, we constructed ocpG intron-exchanging strain by interchanging the positions of intron-1 and intron-2. The splicing pattern indicated the probability that ocpG intron retention was affected by the secondary structures of intronic mRNA.
Copertino, D W; Christopher, D A; Hallick, R B
1991-01-01
The splicing of a 409 nucleotide intron from the Euglena gracilis chloroplast ribosomal protein S3 gene (rps3) was examined by cDNA cloning and sequencing, and northern hybridization. Based on the characterization of a partially spliced pre-mRNA, the intron was characterized as a 'mixed' twintron, composed of a 311 nucleotide group II intron internal to a 98 nucleotide group III intron. Twintron excision is via a 2-step sequential splicing pathway, with removal of the internal group II intron preceding excision of the external group III intron. Based on secondary structural analysis of the twintron, we propose that group III introns may represent highly degenerate versions of group II introns. The existence of twintrons is interpreted as evidence that group II introns were inserted during the evolution of Euglena chloroplast genes from a common ancestor with eubacteria, archaebacteria, cyanobacteria, and other chloroplasts. Images PMID:1721702
Nguyen Dinh, Sy; Sai, Than Zaw Tun; Nawaz, Ghazala; Lee, Kwanuk; Kang, Hunseung
2016-08-20
Despite the increasing understanding of the regulation of chloroplast gene expression in plants, the importance of intron splicing and processing of chloroplast RNA transcripts under stress conditions is largely unknown. Here, to understand how abiotic stresses affect the intron splicing and expression patterns of chloroplast genes in dicots and monocots, we carried out a comprehensive analysis of the intron splicing and expression patterns of chloroplast genes in the coffee plant (Coffea arabica) as a dicot and rice (Oryza sativa) as a monocot under abiotic stresses, including drought, cold, or combined drought and heat stresses. The photosynthetic activity of both coffee plants and rice seedlings was significantly reduced under all stress conditions tested. Analysis of the transcript levels of chloroplast genes revealed that the splicing of tRNAs and mRNAs in coffee plants and rice seedlings were significantly affected by abiotic stresses. Notably, abiotic stresses affected differently the splicing of chloroplast tRNAs and mRNAs in coffee plants and rice seedlings. The transcript levels of most chloroplast genes were markedly downregulated in both coffee plants and rice seedlings upon stress treatment. Taken together, these results suggest that coffee and rice plants respond to abiotic stresses via regulating the intron splicing and expression of different sets of chloroplast genes. Copyright © 2016 Elsevier GmbH. All rights reserved.
Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab; Khandekar, Sushant; Crawford, Erin; Zirbel, Craig L; Leisner, Scott; Prakash, Ashwin; Fedorova, Larisa; Fedorov, Alexei
2014-09-10
Orthologous introns have identical positions relative to the coding sequence in orthologous genes of different species. By analyzing the complete genomes of five plants we generated a database of 40,512 orthologous intron groups of dicotyledonous plants, 28,519 orthologous intron groups of angiosperms, and 15,726 of land plants (moss and angiosperms). Multiple sequence alignments of each orthologous intron group were obtained using the Mafft algorithm. The number of conserved regions in plant introns appeared to be hundreds of times fewer than that in mammals or vertebrates. Approximately three quarters of conserved intronic regions among angiosperms and dicots, in particular, correspond to alternatively-spliced exonic sequences. We registered only a handful of conserved intronic ncRNAs of flowering plants. However, the most evolutionarily conserved intronic region, which is ubiquitous for all plants examined in this study, including moss, possessed multiple structural features of tRNAs, which caused us to classify it as a putative tRNA-like ncRNA. Intronic sequences encoding tRNA-like structures are not unique to plants. Bioinformatics examination of the presence of tRNA inside introns revealed an unusually long-term association of four glycine tRNAs inside the Vac14 gene of fish, amniotes, and mammals. Copyright © 2014 Elsevier B.V. All rights reserved.
Perrineau, Marie-Mathilde; Price, Dana C.; Mohr, Georg
2015-01-01
Group II introns are closely linked to eukaryote evolution because nuclear spliceosomal introns and the small RNAs associated with the spliceosome are thought to trace their ancient origins to these mobile elements. Therefore, elucidating how group II introns move, and how they lose mobility can potentially shed light on fundamental aspects of eukaryote biology. To this end, we studied five strains of the unicellular red alga Porphyridium purpureum that surprisingly contain 42 group II introns in their plastid genomes. We focused on a subset of these introns that encode mobility-conferring intron-encoded proteins (IEPs) and found them to be distributed among the strains in a lineage-specific manner. The reverse transcriptase and maturase domains were present in all lineages but the DNA endonuclease domain was deleted in vertically inherited introns, demonstrating a key step in the loss of mobility. P. purpureum plastid intron RNAs had a classic group IIB secondary structure despite variability in the DIII and DVI domains. We report for the first time the presence of twintrons (introns-within-introns, derived from the same mobile element) in Rhodophyta. The P. purpureum IEPs and their mobile introns provide a valuable model for the study of mobile retroelements in eukaryotes and offer promise for biotechnological applications. PMID:26157604
Frazier, Courtney L.; San Filippo, Joseph; Lambowitz, Alan M.; Mills, David A.
2003-01-01
Despite their commercial importance, there are relatively few facile methods for genomic manipulation of the lactic acid bacteria. Here, the lactococcal group II intron, Ll.ltrB, was targeted to insert efficiently into genes encoding malate decarboxylase (mleS) and tetracycline resistance (tetM) within the Lactococcus lactis genome. Integrants were readily identified and maintained in the absence of a selectable marker. Since splicing of the Ll.ltrB intron depends on the intron-encoded protein, targeted invasion with an intron lacking the intron open reading frame disrupted TetM and MleS function, and MleS activity could be partially restored by expressing the intron-encoded protein in trans. Restoration of splicing from intron variants lacking the intron-encoded protein illustrates how targeted group II introns could be used for conditional expression of any gene. Furthermore, the modified Ll.ltrB intron was used to separately deliver a phage resistance gene (abiD) and a tetracycline resistance marker (tetM) into mleS, without the need for selection to drive the integration or to maintain the integrant. Our findings demonstrate the utility of targeted group II introns as a potential food-grade mechanism for delivery of industrially important traits into the genomes of lactococci. PMID:12571038
Evolution of introns in the archaeal world.
Tocchini-Valentini, Giuseppe D; Fruscoloni, Paolo; Tocchini-Valentini, Glauco P
2011-03-22
The self-splicing group I introns are removed by an autocatalytic mechanism that involves a series of transesterification reactions. They require RNA binding proteins to act as chaperones to correctly fold the RNA into an active intermediate structure in vivo. Pre-tRNA introns in Bacteria and in higher eukaryote plastids are typical examples of self-splicing group I introns. By contrast, two striking features characterize RNA splicing in the archaeal world. First, self-splicing group I introns cannot be found, to this date, in that kingdom. Second, the RNA splicing scenario in Archaea is uniform: All introns, whether in pre-tRNA or elsewhere, are removed by tRNA splicing endonucleases. We suggest that in Archaea, the protein recruited for splicing is the preexisting tRNA splicing endonuclease and that this enzyme, together with the ligase, takes over the task of intron removal in a more efficient fashion than the ribozyme. The extinction of group I introns in Archaea would then be a consequence of recruitment of the tRNA splicing endonuclease. We deal here with comparative genome analysis, focusing specifically on the integration of introns into genes coding for 23S rRNA molecules, and how this newly acquired intron has to be removed to regenerate a functional RNA molecule. We show that all known oligomeric structures of the endonuclease can recognize and cleave a ribosomal intron, even when the endonuclease derives from a strain lacking rRNA introns. The persistence of group I introns in mitochondria and chloroplasts would be explained by the inaccessibility of these introns to the endonuclease.
Pombert, Jean-François; Otis, Christian; Turmel, Monique; Lemieux, Claude
2013-01-01
Organelle genes are often interrupted by group I and or group II introns. Splicing of these mobile genetic occurs at the RNA level via serial transesterification steps catalyzed by the introns'own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI), we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V). This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl) at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI). Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint In each Prasinoderma intron corresponds to the same region where the open reading frame (ORF) occurs when present in cis-spliced orthologs. This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the possible implications of this interesting observation for trans-splicing of group I introns. PMID:24386369
Song, Zhongshu; Bakeer, Walid; Marshall, James W.; Yakasai, Ahmed A.; Khalid, Rozida Mohd; Collemare, Jerome; Skellam, Elizabeth; Tharreau, Didier; Lebrun, Marc-Henri; Lazarus, Colin M.; Bailey, Andrew M.; Simpson, Thomas J.
2015-01-01
The ACE1 and RAP1 genes from the avirulence signalling gene cluster of the rice blast fungus Magnaporthe oryzae were expressed in Aspergillus oryzae and M. oryzae itself. Expression of ACE1 alone produced a polyenyl pyrone (magnaporthepyrone), which is regioselectively epoxidised and hydrolysed to give different diols, 6 and 7, in the two host organisms. Analysis of the three introns present in ACE1 determined that A. oryzae does not process intron 2 correctly, while M. oryzae processes all introns correctly in both appressoria and mycelia. Co-expression of ACE1 and RAP1 in A. oryzae produced an amide 8 which is similar to the PKS-NRPS derived backbone of the cytochalasans. Biological testing on rice leaves showed that neither the diols 6 and 7, nor amide 8 was responsible for the observed ACE1 mediated avirulence, however, gene cluster analysis suggests that the true avirulence signalling compound may be a tyrosine-derived cytochalasan compound. PMID:29142718
Bacterial group II introns: not just splicing.
Toro, Nicolás; Jiménez-Zurdo, José Ignacio; García-Rodríguez, Fernando Manuel
2007-04-01
Group II introns are both catalytic RNAs (ribozymes) and mobile retroelements that were discovered almost 14 years ago. It has been suggested that eukaryotic mRNA introns might have originated from the group II introns present in the alphaproteobacterial progenitor of the mitochondria. Bacterial group II introns are of considerable interest not only because of their evolutionary significance, but also because they could potentially be used as tools for genetic manipulation in biotechnology and for gene therapy. This review summarizes what is known about the splicing mechanisms and mobility of bacterial group II introns, and describes the recent development of group II intron-based gene-targetting methods. Bacterial group II intron diversity, evolutionary relationships, and behaviour in bacteria are also discussed.
Structure of a group II intron in complex with its reverse transcriptase.
Qu, Guosheng; Kaushal, Prem Singh; Wang, Jia; Shigematsu, Hideki; Piazza, Carol Lyn; Agrawal, Rajendra Kumar; Belfort, Marlene; Wang, Hong-Wei
2016-06-01
Bacterial group II introns are large catalytic RNAs related to nuclear spliceosomal introns and eukaryotic retrotransposons. They self-splice, yielding mature RNA, and integrate into DNA as retroelements. A fully active group II intron forms a ribonucleoprotein complex comprising the intron ribozyme and an intron-encoded protein that performs multiple activities including reverse transcription, in which intron RNA is copied into the DNA target. Here we report cryo-EM structures of an endogenously spliced Lactococcus lactis group IIA intron in its ribonucleoprotein complex form at 3.8-Å resolution and in its protein-depleted form at 4.5-Å resolution, revealing functional coordination of the intron RNA with the protein. Remarkably, the protein structure reveals a close relationship between the reverse transcriptase catalytic domain and telomerase, whereas the active splicing center resembles the spliceosomal Prp8 protein. These extraordinary similarities hint at intricate ancestral relationships and provide new insights into splicing and retromobility.
Rogozin, Igor B; Wolf, Yuri I; Sorokin, Alexander V; Mirkin, Boris G; Koonin, Eugene V
2003-09-02
Sequencing of eukaryotic genomes allows one to address major evolutionary problems, such as the evolution of gene structure. We compared the intron positions in 684 orthologous gene sets from 8 complete genomes of animals, plants, fungi, and protists and constructed parsimonious scenarios of evolution of the exon-intron structure for the respective genes. Approximately one-third of the introns in the malaria parasite Plasmodium falciparum are shared with at least one crown group eukaryote; this number indicates that these introns have been conserved through >1.5 billion years of evolution that separate Plasmodium from the crown group. Paradoxically, humans share many more introns with the plant Arabidopsis thaliana than with the fly or nematode. The inferred evolutionary scenario holds that the common ancestor of Plasmodium and the crown group and, especially, the common ancestor of animals, plants, and fungi had numerous introns. Most of these ancestral introns, which are retained in the genomes of vertebrates and plants, have been lost in fungi, nematodes, arthropods, and probably Plasmodium. In addition, numerous introns have been inserted into vertebrate and plant genes, whereas, in other lineages, intron gain was much less prominent.
Csuros, Miklos; Rogozin, Igor B.; Koonin, Eugene V.
2011-01-01
Protein-coding genes in eukaryotes are interrupted by introns, but intron densities widely differ between eukaryotic lineages. Vertebrates, some invertebrates and green plants have intron-rich genes, with 6–7 introns per kilobase of coding sequence, whereas most of the other eukaryotes have intron-poor genes. We reconstructed the history of intron gain and loss using a probabilistic Markov model (Markov Chain Monte Carlo, MCMC) on 245 orthologous genes from 99 genomes representing the three of the five supergroups of eukaryotes for which multiple genome sequences are available. Intron-rich ancestors are confidently reconstructed for each major group, with 53 to 74% of the human intron density inferred with 95% confidence for the Last Eukaryotic Common Ancestor (LECA). The results of the MCMC reconstruction are compared with the reconstructions obtained using Maximum Likelihood (ML) and Dollo parsimony methods. An excellent agreement between the MCMC and ML inferences is demonstrated whereas Dollo parsimony introduces a noticeable bias in the estimations, typically yielding lower ancestral intron densities than MCMC and ML. Evolution of eukaryotic genes was dominated by intron loss, with substantial gain only at the bases of several major branches including plants and animals. The highest intron density, 120 to 130% of the human value, is inferred for the last common ancestor of animals. The reconstruction shows that the entire line of descent from LECA to mammals was intron-rich, a state conducive to the evolution of alternative splicing. PMID:21935348
Phylogenetic Analysis of Nuclear-Encoded RNA Maturases
Malik, Sunita; Upadhyaya, KC; Khurana, SM Paul
2017-01-01
Posttranscriptional processes, such as splicing, play a crucial role in gene expression and are prevalent not only in nuclear genes but also in plant mitochondria where splicing of group II introns is catalyzed by a class of proteins termed maturases. In plant mitochondria, there are 22 mitochondrial group II introns. matR, nMAT1, nMAT2, nMAT3, and nMAT4 proteins have been shown to be required for efficient splicing of several group II introns in Arabidopsis thaliana. Nuclear maturases (nMATs) are necessary for splicing of mitochondrial genes, leading to normal oxidative phosphorylation. Sequence analysis through phylogenetic tree (including bootstrapping) revealed high homology with maturase sequences of A thaliana and other plants. This study shows the phylogenetic relationship of nMAT proteins between A thaliana and other nonredundant plant species taken from BLASTP analysis. PMID:28607538
2011-01-01
Background The most frequent case of horizontal transfer in plants involves a group I intron in the mitochondrial gene cox1, which has been acquired via some 80 separate plant-to-plant transfer events among 833 diverse angiosperms examined. This homing intron encodes an endonuclease thought to promote the intron's promiscuous behavior. A promising experimental approach to study endonuclease activity and intron transmission involves somatic cell hybridization, which in plants leads to mitochondrial fusion and genome recombination. However, the cox1 intron has not yet been found in the ideal group for plant somatic genetics - the Solanaceae. We therefore undertook an extensive survey of this family to find members with the intron and to learn more about the evolutionary history of this exceptionally mobile genetic element. Results Although 409 of the 426 species of Solanaceae examined lack the cox1 intron, it is uniformly present in three phylogenetically disjunct clades. Despite strong overall incongruence of cox1 intron phylogeny with angiosperm phylogeny, two of these clades possess nearly identical intron sequences and are monophyletic in intron phylogeny. These two clades, and possibly the third also, contain a co-conversion tract (CCT) downstream of the intron that is extended relative to all previously recognized CCTs in angiosperm cox1. Re-examination of all published cox1 genes uncovered additional cases of extended co-conversion and identified a rare case of putative intron loss, accompanied by full retention of the CCT. Conclusions We infer that the cox1 intron was separately and recently acquired by at least three different lineages of Solanaceae. The striking identity of the intron and CCT from two of these lineages suggests that one of these three intron captures may have occurred by a within-family transfer event. This is consistent with previous evidence that horizontal transfer in plants is biased towards phylogenetically local events. The discovery of extended co-conversion suggests that other cox1 conversions may be longer than realized but obscured by the exceptional conservation of plant mitochondrial sequences. Our findings provide further support for the rampant-transfer model of cox1 intron evolution and recommend the Solanaceae as a model system for the experimental analysis of cox1 intron transfer in plants. PMID:21943226
2013-01-01
Background Accurate and complete identification of mobile elements is a challenging task in the current era of sequencing, given their large numbers and frequent truncations. Group II intron retroelements, which consist of a ribozyme and an intron-encoded protein (IEP), are usually identified in bacterial genomes through their IEP; however, the RNA component that defines the intron boundaries is often difficult to identify because of a lack of strong sequence conservation corresponding to the RNA structure. Compounding the problem of boundary definition is the fact that a majority of group II intron copies in bacteria are truncated. Results Here we present a pipeline of 11 programs that collect and analyze group II intron sequences from GenBank. The pipeline begins with a BLAST search of GenBank using a set of representative group II IEPs as queries. Subsequent steps download the corresponding genomic sequences and flanks, filter out non-group II introns, assign introns to phylogenetic subclasses, filter out incomplete and/or non-functional introns, and assign IEP sequences and RNA boundaries to the full-length introns. In the final step, the redundancy in the data set is reduced by grouping introns into sets of ≥95% identity, with one example sequence chosen to be the representative. Conclusions These programs should be useful for comprehensive identification of group II introns in sequence databases as data continue to rapidly accumulate. PMID:24359548
Montandon, P E; Vasserot, A; Stutz, E
1986-01-01
We retrieved a 1.6 kbp intron separating two exons of the psb C gene which codes for the 44 kDa reaction center protein of photosystem II. This intron is 3 to 4 times the size of all previously sequenced Euglena gracilis chloroplast introns. It contains an open reading frame of 458 codons potentially coding for a basic protein of 54 kDa of yet unknown function. The intron boundaries follow consensus sequences established for chloroplast introns related to class II and nuclear pre-mRNA introns. Its 3'-terminal segment has structural features similar to class II mitochondrial introns with an invariant base A as possible branch point for lariat formation.
Exon definition as a potential negative force against intron losses in evolution.
Niu, Deng-Ke
2008-11-13
Previous studies have indicated that the wide variation in intron density (the number of introns per gene) among different eukaryotes largely reflects varying degrees of intron loss during evolution. The most popular model, which suggests that organisms lose introns through a mechanism in which reverse-transcribed cDNA recombines with the genomic DNA, concerns only one mutational force. Using exons as the units of splicing-site recognition, exon definition constrains the length of exons. An intron-loss event results in fusion of flanking exons and thus a larger exon. The large size of the newborn exon may cause splicing errors, i.e., exon skipping, if the splicing of pre-mRNAs is initiated by exon definition. By contrast, if the splicing of pre-mRNAs is initiated by intron definition, intron loss does not matter. Exon definition may thus be a selective force against intron loss. An organism with a high frequency of exon definition is expected to experience a low rate of intron loss throughout evolution and have a high density of spliceosomal introns. The majority of spliceosomal introns in vertebrates may be maintained during evolution not because of potential functions, but because of their splicing mechanism (i.e., exon definition). Further research is required to determine whether exon definition is a negative force in maintaining the high intron density of vertebrates. This article was reviewed by Dr. Scott W. Roy (nominated by Dr. John Logsdon), Dr.Eugene V. Koonin, and Dr. Igor B. Rogozin (nominated by Dr. Mikhail Gelfand). For the full reviews,please go to the Reviewers' comments section.
Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting
Piazza, Carol Lyn; Smith, Dorie
2018-01-01
Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis, inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. PMID:29905149
Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting.
Qu, Guosheng; Piazza, Carol Lyn; Smith, Dorie; Belfort, Marlene
2018-06-15
Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis , inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. © 2018, Qu et al.
Detailed analysis of complex single molecule FRET data with the software MASH
NASA Astrophysics Data System (ADS)
Hadzic, Mélodie C. A. S.; Kowerko, Danny; Börner, Richard; Zelger-Paulus, Susann; Sigel, Roland K. O.
2016-04-01
The processing and analysis of surface-immobilized single molecule FRET (Förster resonance energy transfer) data follows systematic steps (e.g. single molecule localization, clearance of different sources of noise, selection of the conformational and kinetic model, etc.) that require a solid knowledge in optics, photophysics, signal processing and statistics. The present proceeding aims at standardizing and facilitating procedures for single molecule detection by guiding the reader through an optimization protocol for a particular experimental data set. Relevant features were determined from single molecule movies (SMM) imaging Cy3- and Cy5-labeled Sc.ai5γ group II intron molecules synthetically recreated, to test the performances of four different detection algorithms. Up to 120 different parameterizations per method were routinely evaluated to finally establish an optimum detection procedure. The present protocol is adaptable to any movie displaying surface-immobilized molecules, and can be easily reproduced with our home-written software MASH (multifunctional analysis software for heterogeneous data) and script routines (both available in the download section of www.chem.uzh.ch/rna).
DOE Office of Scientific and Technical Information (OSTI.GOV)
Greenspan, D.S.; Papenberg, K.A.; Marchuk, D.A.
1994-09-01
Type V collagen is the only fibrillar collagen which has yet to be implicated in the pathogenesis of genetic diseases in humans or mice. To begin examining the possible role of type V collagen in genetic disease, we have previously mapped COL5A1, the gene for the {alpha}1 chain of type V collagen, to 9q23.2{r_arrow}q34.3 and described two restriction site polymorphisms which allowed us to exclude COL5A1 as candidate gene for nail-patella syndrome. We have now used these polymorphisms to exclude COL5A1 as candidate gene for tuberous sclerosis complex 1 and Ehlers-Danlos syndrome type II. In addition, we describe a CAmore » repeat, with observed heterozygosity of about 0.5, in a COL5A1 intron, which has allowed us to exclude COL5A1 as a candidate gene in hereditary hemorrhagic telangiectasia and to place COL5A1 on the CEPH family genetic map between markers D9S66 and D9S67. We have also determined the entire intron/exon organization of COL5A1, which will facilitate characterization of mutations in genetic diseases with which COL5A1 may be linked in future studies.« less
Limited MHC class I intron 2 repertoire variation in bonobos.
de Groot, Natasja G; Heijmans, Corrine M C; Helsen, Philippe; Otting, Nel; Pereboom, Zjef; Stevens, Jeroen M G; Bontrop, Ronald E
2017-10-01
Common chimpanzees (Pan troglodytes) experienced a selective sweep, probably caused by a SIV-like virus, which targeted their MHC class I repertoire. Based on MHC class I intron 2 data analyses, this selective sweep took place about 2-3 million years ago. As a consequence, common chimpanzees have a skewed MHC class I repertoire that is enriched for allotypes that are able to recognise conserved regions of the SIV proteome. The bonobo (Pan paniscus) shared an ancestor with common chimpanzees approximately 1.5 to 2 million years ago. To investigate whether the signature of this selective sweep is also detectable in bonobos, the MHC class I gene repertoire of two bonobo panels comprising in total 29 animals was investigated by Sanger sequencing. We identified 14 Papa-A, 20 Papa-B and 11 Papa-C alleles, of which eight, five and eight alleles, respectively, have not been reported previously. Within this pool of MHC class I variation, we recovered only 2 Papa-A, 3 Papa-B and 6 Papa-C intron 2 sequences. As compared to humans, bonobos appear to have an even more diminished MHC class I intron 2 lineage repertoire than common chimpanzees. This supports the notion that the selective sweep may have predated the speciation of common chimpanzees and bonobos. The further reduction of the MHC class I intron 2 lineage repertoire observed in bonobos as compared to the common chimpanzee may be explained by a founding effect or other subsequent selective processes.
Structural and Functional Characterization of Ribosomal Protein Gene Introns in Sponges
Perina, Drago; Korolija, Marina; Mikoč, Andreja; Roller, Maša; Pleše, Bruna; Imešek, Mirna; Morrow, Christine; Batel, Renato; Ćetković, Helena
2012-01-01
Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with “higher” metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales. PMID:22880015
Structural and functional characterization of ribosomal protein gene introns in sponges.
Perina, Drago; Korolija, Marina; Mikoč, Andreja; Roller, Maša; Pleše, Bruna; Imešek, Mirna; Morrow, Christine; Batel, Renato; Ćetković, Helena
2012-01-01
Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with "higher" metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales.
Schwarze, Ulrike; Hata, Ryu-Ichiro; McKusick, Victor A.; Shinkai, Hiroshi; Hoyme, H. Eugene; Pyeritz, Reed E.; Byers, Peter H.
2004-01-01
Splice site mutations in the COL1A2 gene of type I collagen can give rise to forms of Ehlers-Danlos syndrome (EDS) because of partial or complete skipping of exon 6, as well as to mild, moderate, or lethal forms of osteogenesis imperfecta as a consequence of skipping of other exons. We identified three unrelated individuals with a rare recessively inherited form of EDS (characterized by joint hypermobility, skin hyperextensibility, and cardiac valvular defects); in two of them, COL1A2 messenger RNA (mRNA) instability results from compound heterozygosity for splice site mutations in the COL1A2 gene, and, in the third, it results from homozygosity for a nonsense codon. The splice site mutations led to use of cryptic splice donor sites, creation of a downstream premature termination codon, and extremely unstable mRNA. In the wild-type allele, the two introns (IVS11 and IVS24) in which these mutations occurred were usually spliced slowly in relation to their respective immediate upstream introns. In the mutant alleles, the upstream intron was removed, so that exon skipping could not occur. In the context of the mutation in IVS24, computer-generated folding of a short stretch of mRNA surrounding the mutation site demonstrated realignment of the relationships between the donor and acceptor sites that could facilitate use of a cryptic donor site. These findings suggest that the order of intron removal is an important variable in prediction of mutation outcome at splice sites and that folding of the nascent mRNA could be one element that contributes to determination of order of splicing. The complete absence of proα2(I) chains has the surprising effect of producing cardiac valvular disease without bone involvement. PMID:15077201
Schwarze, Ulrike; Hata, Ryu-Ichiro; McKusick, Victor A; Shinkai, Hiroshi; Hoyme, H Eugene; Pyeritz, Reed E; Byers, Peter H
2004-05-01
Splice site mutations in the COL1A2 gene of type I collagen can give rise to forms of Ehlers-Danlos syndrome (EDS) because of partial or complete skipping of exon 6, as well as to mild, moderate, or lethal forms of osteogenesis imperfecta as a consequence of skipping of other exons. We identified three unrelated individuals with a rare recessively inherited form of EDS (characterized by joint hypermobility, skin hyperextensibility, and cardiac valvular defects); in two of them, COL1A2 messenger RNA (mRNA) instability results from compound heterozygosity for splice site mutations in the COL1A2 gene, and, in the third, it results from homozygosity for a nonsense codon. The splice site mutations led to use of cryptic splice donor sites, creation of a downstream premature termination codon, and extremely unstable mRNA. In the wild-type allele, the two introns (IVS11 and IVS24) in which these mutations occurred were usually spliced slowly in relation to their respective immediate upstream introns. In the mutant alleles, the upstream intron was removed, so that exon skipping could not occur. In the context of the mutation in IVS24, computer-generated folding of a short stretch of mRNA surrounding the mutation site demonstrated realignment of the relationships between the donor and acceptor sites that could facilitate use of a cryptic donor site. These findings suggest that the order of intron removal is an important variable in prediction of mutation outcome at splice sites and that folding of the nascent mRNA could be one element that contributes to determination of order of splicing. The complete absence of pro alpha 2(I) chains has the surprising effect of producing cardiac valvular disease without bone involvement.
Nomiyama, H; Kuhara, S; Kukita, T; Otsuka, T; Sakaki, Y
1981-01-01
The 26S ribosomal RNA gene of Physarum polycephalum is interrupted by two introns, and we have previously determined the sequence of one of them (intron 1) (Nomiyama et al. Proc.Natl.Acad.Sci.USA 78, 1376-1380, 1981). In this study we sequenced the second intron (intron 2) of about 0.5 kb length and its flanking regions, and found that one nucleotide at each junction is identical in intron 1 and intron 2, though the junction regions share no other sequence homology. Comparison of the flanking exon sequences to E. coli 23S rRNA sequences shows that conserved sequences are interspersed with tracts having little homology. In particular, the region encompassing the intron 2 interruption site is highly conserved. The E. coli ribosomal protein L1 binding region is also conserved. Images PMID:6171776
Exon–intron organization of genes in the slime mold Physarum polycephalum
Trzcinska-Danielewicz, Joanna; Fronk, Jan
2000-01-01
The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon–intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon–intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon–intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3′-ends. PMID:10982858
SURVEY AND SUMMARY: exon-intron organization of genes in the slime mold Physarum polycephalum.
Trzcinska-Danielewicz, J; Fronk, J
2000-09-15
The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon-intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon-intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon-intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3'-ends.
RBFOX and PTBP1 proteins regulate the alternative splicing of micro-exons in human brain transcripts
Sanchez-Pulido, Luis; Haerty, Wilfried
2015-01-01
Ninety-four percent of mammalian protein-coding exons exceed 51 nucleotides (nt) in length. The paucity of micro-exons (≤ 51 nt) suggests that their recognition and correct processing by the splicing machinery present greater challenges than for longer exons. Yet, because thousands of human genes harbor processed micro-exons, specialized mechanisms may be in place to promote their splicing. Here, we survey deep genomic data sets to define 13,085 micro-exons and to study their splicing mechanisms and molecular functions. More than 60% of annotated human micro-exons exhibit a high level of sequence conservation, an indicator of functionality. While most human micro-exons require splicing-enhancing genomic features to be processed, the splicing of hundreds of micro-exons is enhanced by the adjacent binding of splice factors in the introns of pre-messenger RNAs. Notably, splicing of a significant number of micro-exons was found to be facilitated by the binding of RBFOX proteins, which promote their inclusion in the brain, muscle, and heart. Our analyses suggest that accurate regulation of micro-exon inclusion by RBFOX proteins and PTBP1 plays an important role in the maintenance of tissue-specific protein–protein interactions. PMID:25524026
Evolution of Mhc-DRB introns: implications for the origin of primates.
Kupfermann, H; Satta, Y; Takahata, N; Tichy, H; Klein, J
1999-06-01
Introns are generally believed to evolve too rapidly and too erratically to be of much use in phylogenetic reconstructions. Few phylogenetically informative intron sequences are available, however, to ascertain the validity of this supposition. In the present study the supposition was tested on the example of the mammalian class II major histocompatibility complex (Mhc) genes of the DRB family. Since the Mhc genes evolve under balancing selection and are believed to recombine or rearrange frequently, the evolution of their introns could be expected to be particularly rapid and subject to scrambling. Sequences of intron 4 and 5 DRB genes were obtained from polymerase chain reaction-amplified fragments of genomic DNA from representatives of six eutherian orders-Primates, Scandentia, Chiroptera, Dermoptera, Lagomorpha, and Insectivora. Although short stretches of the introns have indeed proved to be unalignable, the bulk of the intron sequences from all six orders, spanning >85 million years (my) of evolution, could be aligned and used in a study of the tempo and mode of intron evolution. The analysis has revealed the Mhc introns to evolve at a rate similar to that of other genes and of synonymous sites of non-Mhc genes. No evidence of homogenization or large-scale scrambling of the intron sequences could be found. The Mhc introns apparently evolve largely by point mutations and insertions/deletions. The phylogenetic signals contained in the intron sequences could be used to identify Scandentia as the sister group of Primates, to support the existence of the Archonta superorder, and to confirm the monophyly of the Chiroptera.
Gonzalez, P; Barroso, G; Labarère, J
1999-04-01
The complete gene sequence and secondary structure of the mitochondrial LSU rRNA from the cultivated Basidiomycota Agrocybe aegerita was derived by chromosome walking. The A.aegerita LSU rRNA gene (13 526 nt) represents, to date, the longest described, due to the highest number of introns (eight) and the occurrence of six long nucleotidic extensions. Seven introns belong to group I, while the intronic sequence i5 constitutes the first typical group II intron reported in a fungal mitochondrial LSU rDNA. As with most fungal LSU rDNA introns reported to date, four introns (i5-i8) are distributed in domain V associated with the peptidyl-transferase activity. One intron (i1) is located in domain I, and three (i2-i4) in domain II. The introns i2-i8 possess homologies with other fungal, algal or protozoan introns located at the same position in LSU rDNAs. One of them (i6) is located at the same insertion site as most Ascomycota or algae LSU introns, suggesting a possible inheritance from a common ancestor. On the contrary, intron i1 is located at a so-far unreported insertion site. Among the six unusual nucleotide extensions, five are located in domain I and one in domain V. This is the first report of a mitochondrial LSU rRNA gene sequence and secondary structure for the whole Basidiomycota division.
Lyons, Brendan M; McHenry, Monique A; Barrington, David S
2017-07-01
Cytosolic phosphoglucose isomerase (pgiC) is an enzyme essential to glycolysis found universally in eukaryotes, but broad understanding of variation in the gene coding for pgiC is lacking for ferns. We used a substantially expanded representation of the gene for Andean species of the fern genus Polystichum to characterize pgiC in ferns relative to angiosperms, insects, and an amoebozoan; assess the impact of selection versus neutral evolutionary processes on pgiC; and explore evolutionary relationships of selected Andean species. The dataset of complete sequences comprised nine accessions representing seven species and one hybrid from the Andes and Serra do Mar. The aligned sequences of the full data set comprised 3376 base pairs (70% of the entire gene) including 17 exons and 15 introns from two central areas of the gene. The exons are highly conserved relative to angiosperms and retain substantial homology to insect pgiC, but intron length and structure are unique to the ferns. Average intron size is similar to angiosperms; intron number and location in insects are unlike those of the plants we considered. The introns included an array of indels and, in intron 7, an extensive microsatellite array with potential utility in analyzing population-level histories. Bayesian and maximum-parsimony analysis of 129 variable nucleotides in the Andean polystichums revealed that 59 (1.7% of the 3376 total) were phylogenetically informative; most of these united sister accessions. The phylogenetic trees for the Andean polystichums were incongruent with previously published cpDNA trees for the same taxa, likely the result of rapid evolutionary change in the introns and contrasting stability in the exons. The exons code a total of seven amino-acid substitutions. Comparison of non-synonymous to synonymous substitutions did not suggest that the pgiC gene is under selection in the Andes. Variation in pgiC including two additional accessions represented by incomplete sequences provided new insights into reticulate relationships among Andean taxa. Copyright © 2017 Elsevier Inc. All rights reserved.
Oliveira, Patrícia; Sanges, Remo; Huntsman, David; Stupka, Elia; Oliveira, Carla
2012-01-01
Cadherins are cell–cell adhesion proteins essential for the maintenance of tissue architecture and integrity, and their impairment is often associated with human cancer. Knowledge regarding regulatory mechanisms associated with cadherin misexpression in cancer is scarce. Specific features of the intronic-structure and intronic-based regulatory mechanisms in the cadherin superfamily are unidentified. This study aims at systematically characterizing the intronic portion of cadherin superfamily members and the identification of intronic regions constituting putative targets/triggers of regulation, using a bioinformatic approach and biological data mining. Our study demonstrates that the cadherin superfamily genes harbour specific characteristics in comparison to all non-cadherin genes, both from the genomic and transcriptional standpoints. Cadherin superfamily genes display higher average total intron number and significantly longer introns than other genes and across the entire vertebrate lineage. Moreover, in the human genome, we observed an uncommon high frequency of MIR (mammalian-wide interspersed repeats) and MaLR (mammalian-wide interspersed repeats, a subtype of LTR) regulatory-associated repetitive elements at 5′-located introns, concomitantly with increased de novo intronic transcription. Using this approach, we identified cadherin intronic-specific sites that may constitute novel targets/triggers of cadherin superfamily expression regulation. These findings pinpoint the need to identify mechanisms affecting particularly MIR and MaLR elements located in introns 2 and 3 of human cadherin genes, possibly important in the expression modulation of this superfamily in homeostasis and cancer. PMID:22317972
Wolff, G; Burger, G; Lang, B F; Kück, U
1993-01-01
The mitochondrial DNA from the colourless alga Prototheca wickerhamii contains two mosaic genes as was revealed from complete sequencing of the circular extranuclear genome. The genes for the large subunit of the ribosomal RNA (LSUrRNA) as well as for subunit I of the cytochrome oxidase (coxI) carry two and three intronic sequences respectively. On the basis of their canonical nucleotide sequences they can be classified as group I introns. Phylogenetic comparisons of the coxI protein sequences allow us to conclude that the P.wickerhamii mtDNA is much closer related to higher plant mtDNAs than to those of the chlorophyte alga C.reinhardtii. The comparison of the intron sequences revealed several unusual features: (1) The P.wickerhamii introns are structurally related to mitochondrial introns from various ascomycetous fungi. (2) Phylogenetic analyses indicate a close relationship between fungal and algal intronic sequences. (3) The P. wickerhamii introns are located at positions within the structural genes which can be considered as preferred intron insertion sites in homologous mitochondrial genes from fungi or liverwort. In all cases, the sequences adjacent to the insertion sites are very well conserved over large evolutionary distances. Our finding of highly similar introns in fungi and algae is consistent with the idea that introns have already been present in the bacterial ancestors of present day mitochondria and evolved concomitantly with the organelles. PMID:7680126
Vaughn, J C; Mason, M T; Sper-Whitis, G L; Kuhlman, P; Palmer, J D
1995-11-01
We present phylogenetic evidence that a group I intron in an angiosperm mitochondrial gene arose recently by horizontal transfer from a fungal donor species. A 1,716-bp fragment of the mitochondrial coxI gene from the angiosperm Peperomia polybotrya was amplified via the polymerase chain reaction and sequenced. Comparison to other coxI genes revealed a 966-bp group I intron, which, based on homology with the related yeast coxI intron aI4, potentially encodes a 279-amino-acid site-specific DNA endonuclease. This intron, which is believed to function as a ribozyme during its own splicing, is not present in any of 19 coxI genes examined from other diverse vascular plant species. Phylogenetic analysis of intron origin was carried out using three different tree-generating algorithms, and on a variety of nucleotide and amino acid data sets from the intron and its flanking exon sequences. These analyses show that the Peperomia coxI gene intron and exon sequences are of fundamentally different evolutionary origin. The Peperomia intron is more closely related to several fungal mitochondrial introns, two of which are located at identical positions in coxI, than to identically located coxI introns from the land plant Marchantia and the green alga Prototheca. Conversely, the exon sequence of this gene is, as expected, most closely related to other angiosperm coxI genes. These results, together with evidence suggestive of co-conversion of exonic markers immediately flanking the intron insertion site, lead us to conclude that the Peperomia coxI intron probably arose by horizontal transfer from a fungal donor, using the double-strand-break repair pathway. The donor species may have been one of the symbiotic mycorrhizal fungi that live in close obligate association with most plants.
[Detection of factor VIII intron 1 inversion in severe haemophilia A].
Liang, Yan; Yan, Zhen-yu; Yan, Mei; Hua, Bao-lai; Xiao, Bai; Zhao, Yong-qiang; Liu, Jing-zhong
2009-06-01
Screening the intron 1 inversion of factor VIII (FVIII) in the population of severe haemophilia A(HA) in China and performing carrier detection and prenatal diagnosis. Using LD-PCR to detect intron 22 inversions and multiple-PCR within two tubes to intron 1 inversions in severe HA patients. Carrier detection and prenatal diagnosis were performed in affected families. Linkage analysis and DNA sequencing were used to verify these tests. One hundred and eighteen patients were seven diagnosed as intron 22 inversions and 7 were intron 1 inversions out of 247 severe HA patients. The prevalence of the intron 1 inversion in Chinese severe haemophilia A patients was 2.8% (7/247). Six women from family A and 2 from family B were diagnosed as carriers. One fetus from family A was affected fetus. Intron 1 inversion could be detected directly by multiple-PCR within two tubes. This method made the strategy more perfective in carrier and prenatal diagnosis of haemophilia A.
Introns Protect Eukaryotic Genomes from Transcription-Associated Genetic Instability.
Bonnet, Amandine; Grosso, Ana R; Elkaoutari, Abdessamad; Coleno, Emeline; Presle, Adrien; Sridhara, Sreerama C; Janbon, Guilhem; Géli, Vincent; de Almeida, Sérgio F; Palancade, Benoit
2017-08-17
Transcription is a source of genetic instability that can notably result from the formation of genotoxic DNA:RNA hybrids, or R-loops, between the nascent mRNA and its template. Here we report an unexpected function for introns in counteracting R-loop accumulation in eukaryotic genomes. Deletion of endogenous introns increases R-loop formation, while insertion of an intron into an intronless gene suppresses R-loop accumulation and its deleterious impact on transcription and recombination in yeast. Recruitment of the spliceosome onto the mRNA, but not splicing per se, is shown to be critical to attenuate R-loop formation and transcription-associated genetic instability. Genome-wide analyses in a number of distant species differing in their intron content, including human, further revealed that intron-containing genes and the intron-richest genomes are best protected against R-loop accumulation and subsequent genetic instability. Our results thereby provide a possible rationale for the conservation of introns throughout the eukaryotic lineage. Copyright © 2017 Elsevier Inc. All rights reserved.
Splicing of a group II intron involved in the conjugative transfer of pRS01 in lactococci.
Mills, D A; McKay, L L; Dunny, G M
1996-06-01
Analysis of a region involved in the conjugative transfer of the lactococcal conjugative element pRS01 has revealed a bacteria] group II intron. Splicing of this lactococcal intron (designated Ll.ltrB) in vivo resulted in the ligation of two exon messages (ltrBE1 and ltrBE2) which encoded a putative conjugative relaxase essential for the transfer of pRS01. Like many group II introns, the Ll.ltrB intron possessed an open reading frame (ltrA) with homology to reverse transcriptases. Remarkably, sequence analysis of ltrA suggested a greater similarity to open reading frames encoded by eukaryotic mitochondrial group II introns than to those identified to date from other bacteria. Several insertional mutations within ltrA resulted in plasmids exhibiting a conjugative transfer-deficient phenotype. These results provide the first direct evidence for splicing of a prokaryotic group II intron in vivo and suggest that conjugative transfer is a mechanism for group II intron dissemination in bacteria.
Imprecise intron losses are less frequent than precise intron losses but are not rare in plants.
Ma, Ming-Yue; Zhu, Tao; Li, Xue-Nan; Lan, Xin-Ran; Liu, Heng-Yuan; Yang, Yu-Fei; Niu, Deng-Ke
2015-05-27
In this study, we identified 19 intron losses, including 11 precise intron losses (PILs), six imprecise intron losses (IILs), one de-exonization, and one exon deletion in tomato and potato, and 17 IILs in Arabidopsis thaliana. Comparative analysis of related genomes confirmed that all of the IILs have been fixed during evolution. Consistent with previous studies, our results indicate that PILs are a major type of intron loss. However, at least in plants, IILs are unlikely to be as rare as previously reported. This article was reviewed by Jun Yu and Zhang Zhang. For complete reviews, see the Reviewers' Reports section.
Bhattacharya, D; Surek, B; Rüsing, M; Damberger, S; Melkonian, M
1994-01-01
Group I introns are found in organellar genomes, in the genomes of eubacteria and phages, and in nuclear-encoded rRNAs. The origin and distribution of nuclear-encoded rRNA group I introns are not understood. To elucidate their evolutionary relationships, we analyzed diverse nuclear-encoded small-subunit rRNA group I introns including nine sequences from the green-algal order Zygnematales (Charophyceae). Phylogenetic analyses of group I introns and rRNA coding regions suggest that lateral transfers have occurred in the evolutionary history of group I introns and that, after transfer, some of these elements may form stable components of the host-cell nuclear genomes. The Zygnematales introns, which share a common insertion site (position 1506 relative to the Escherichia coli small-subunit rRNA), form one subfamily of group I introns that has, after its origin, been inherited through common ancestry. Since the first Zygnematales appear in the middle Devonian within the fossil record, the "1506" group I intron presumably has been a stable component of the Zygnematales small-subunit rRNA coding region for 350-400 million years. PMID:7937917
Organellar maturases: A window into the evolution of the spliceosome.
Schmitz-Linneweber, Christian; Lampe, Marie-Kristin; Sultan, Laure D; Ostersetzer-Biran, Oren
2015-09-01
During the evolution of eukaryotic genomes, many genes have been interrupted by intervening sequences (introns) that must be removed post-transcriptionally from RNA precursors to form mRNAs ready for translation. The origin of nuclear introns is still under debate, but one hypothesis is that the spliceosome and the intron-exon structure of genes have evolved from bacterial-type group II introns that invaded the eukaryotic genomes. The group II introns were most likely introduced into the eukaryotic genome from an α-proteobacterial predecessor of mitochondria early during the endosymbiosis event. These self-splicing and mobile introns spread through the eukaryotic genome and later degenerated. Pieces of introns became part of the general splicing machinery we know today as the spliceosome. In addition, group II introns likely brought intron maturases with them to the nucleus. Maturases are found in most bacterial introns, where they act as highly specific splicing factors for group II introns. In the spliceosome, the core protein Prp8 shows homology to group II intron-encoded maturases. While maturases are entirely intron specific, their descendant of the spliceosomal machinery, the Prp8 protein, is an extremely versatile splicing factor with multiple interacting proteins and RNAs. How could such a general player in spliceosomal splicing evolve from the monospecific bacterial maturases? Analysis of the organellar splicing machinery in plants may give clues on the evolution of nuclear splicing. Plants encode various proteins which are closely related to bacterial maturases. The organellar genomes contain one maturase each, named MatK in chloroplasts and MatR in mitochondria. In addition, several maturase genes have been found in the nucleus as well, which are acting on mitochondrial pre-RNAs. All plant maturases show sequence deviation from their progenitor bacterial maturases, and interestingly are all acting on multiple organellar group II intron targets. Moreover, they seem to function in the splicing of group II introns together with a number of additional nuclear-encoded splicing factors, possibly acting as an organellar proto-spliceosome. Together, this makes them interesting models for the early evolution of nuclear spliceosomal splicing. In this review, we summarize recent advances in our understanding of the role of plant maturases and their accessory factors in plants. This article is part of a Special Issue entitled: Chloroplast Biogenesis. Copyright © 2015 Elsevier B.V. All rights reserved.
Insertion of a self-splicing intron into the mtDNA of atriploblastic animal
DOE Office of Scientific and Technical Information (OSTI.GOV)
Valles, Y.; Halanych, K.; Boore, J.L.
2006-04-14
Nephtys longosetosa is a carnivorous polychaete worm that lives in the intertidal and subtidal zones with worldwide distribution (pleijel&rouse2001). Its mitochondrial genome has the characteristics typical of most metazoans: 37 genes; circular molecule; almost no intergenic sequence; and no significant gene rearrangements when compared to other annelid mtDNAs (booremoritz19981995). Ubiquitous features as small intergenic regions and lack of introns suggested that metazoan mtDNAs are under strong selective pressures to reduce their genome size allowing for faster replication requirements (booremoritz19981995Lynch2005). Yet, in 1996 two type I introns were found in the mtDNA of the basal metazoan Metridium senile (FigureX). Breaking amore » long-standing rule (absence of introns in metazoan mtDNA), this finding was later supported by the further presence of group I introns in other cnidarians. Interestingly, only the class Anthozoa within cnidarians seems to harbor such introns. Although several hundreds of triploblastic metazoan mtDNAs have been sequenced, this study is the first evidence of mitochondrial introns in triploblastic metazoans. The cox1 gene of N. longosetosa has an intron of almost 2 kbs in length. This finding represents as well the first instance of a group II intron (anthozoans harbor group I introns) in all metazoan lineages. Opposite trends are observed within plants, fungi and protist mtDNAs, where introns (both group I and II) and other non-coding sequences are widespread. Plant, fungal and protist mtDNA structure and organization differ enormously from that of metazoan mtDNA. Both, plant and fungal mtDNA are dynamic molecules that undergo high rates of recombination, contain long intergenic spacer regions and harbor both group I and group II introns. However, as metazoans they have a conserved gene content. Protists, on the other hand have a striking variation of gene content and introns that account for the genome size variation. In contrast to this mtDNA structure and organization diversity, current genome level studies point to a monophyletic origin of the mitochondria (REFS), raising questions such as: what are the pressures at work shaping the evolution of the mitochondrial genome at 'higher' levels? What drives the absence of introns and other non-coding spacers in metazoan mtDNA? What characteristics must have an intron to be maintained in an environment where 'extra chromosomes' are usually selected against?« less
Lambert, Marie-Pierre; Terrone, Sophie; Giraud, Guillaume; Benoit-Pilven, Clara; Cluet, David; Combaret, Valérie; Mortreux, Franck; Auboeuf, Didier; Bourgeois, Cyril F
2018-06-21
The Repressor Element 1-silencing transcription factor (REST) represses a number of neuronal genes in non-neuronal cells or in undifferentiated neural progenitors. Here, we report that the DEAD box RNA helicase DDX17 controls important REST-related processes that are critical during the early phases of neuronal differentiation. First, DDX17 associates with REST, promotes its binding to the promoter of a subset of REST-targeted genes and co-regulates REST transcriptional repression activity. During neuronal differentiation, we observed a downregulation of DDX17 along with that of the REST complex that contributes to the activation of neuronal genes. Second, DDX17 and its paralog DDX5 regulate the expression of several proneural microRNAs that are known to target the REST complex during neurogenesis, including miR-26a/b that are also direct regulators of DDX17 expression. In this context, we propose a new mechanism by which RNA helicases can control the biogenesis of intronic miRNAs. We show that the processing of the miR-26a2 precursor is dependent on RNA helicases, owing to an intronic regulatory region that negatively impacts on both miRNA processing and splicing of its host intron. Our work places DDX17 in the heart of a pathway involving REST and miRNAs that allows neuronal gene repression.
Musiyenko, Alla; Bitko, Vira; Barik, Sailen
2008-03-01
MicroRNAs (miRNAs) are endogenous noncoding RNAs that down-regulate gene expression by promoting cleavage or translational arrest of target mRNAs. While most miRNAs are transcribed from their own dedicated genes, some map to introns of 'host' transcripts, the biological significance of which remains unknown. Here, we show that prostate cells are naturally devoid of EGF-like domain 7 (Egfl7) transcripts and hence also deficient in a miRNA, miR-126*, generated from splicing and processing of its ninth intron. Use of recombinant and synthetic miRNAs or a specific antagomir established a role of miR-126* in silencing prostein in non-endothelial cells. We mapped two miR-126*-binding sites in the 3'UTR of the prostein mRNA required for translational repression. Transfection of synthetic miR-126* into prostate cancer LNCaP cells strongly reduced the translation of prostein. Interestingly, loss of prostein correlated with reduction of LNCaP cell migration and invasion. Thus, the robust expression of prostein protein in the prostate cells results from a combination of transcriptional activation of the prostein gene and absence of intronic miRNA-126* due to the prostate-specific repression of the Egfl7 gene. We conclude that intronic miRNAs from tissue-specific transcripts, or their natural absence, make cardinal contributions to cellular gene expression and phenotype. These findings also open the door to tissue-specific miRNA therapy.
Maf1 Protein, Repressor of RNA Polymerase III, Indirectly Affects tRNA Processing*
Karkusiewicz, Iwona; Turowski, Tomasz W.; Graczyk, Damian; Towpik, Joanna; Dhungel, Nripesh; Hopper, Anita K.; Boguta, Magdalena
2011-01-01
Maf1 is negative regulator of RNA polymerase III in yeast. We observed high levels of both primary transcript and end-matured, intron-containing pre-tRNAs in the maf1Δ strain. This pre-tRNA accumulation could be overcome by transcription inhibition, arguing against a direct role of Maf1 in tRNA maturation and suggesting saturation of processing machinery by the increased amounts of primary transcripts. Saturation of the tRNA exportin, Los1, is one reason why end-matured intron-containing pre-tRNAs accumulate in maf1Δ cells. However, it is likely possible that other components of the processing pathway are also limiting when tRNA transcription is increased. According to our model, Maf1-mediated transcription control and nuclear export by Los1 are two major stages of tRNA biosynthesis that are regulated by environmental conditions in a coordinated manner. PMID:21940626
Maf1 protein, repressor of RNA polymerase III, indirectly affects tRNA processing.
Karkusiewicz, Iwona; Turowski, Tomasz W; Graczyk, Damian; Towpik, Joanna; Dhungel, Nripesh; Hopper, Anita K; Boguta, Magdalena
2011-11-11
Maf1 is negative regulator of RNA polymerase III in yeast. We observed high levels of both primary transcript and end-matured, intron-containing pre-tRNAs in the maf1Δ strain. This pre-tRNA accumulation could be overcome by transcription inhibition, arguing against a direct role of Maf1 in tRNA maturation and suggesting saturation of processing machinery by the increased amounts of primary transcripts. Saturation of the tRNA exportin, Los1, is one reason why end-matured intron-containing pre-tRNAs accumulate in maf1Δ cells. However, it is likely possible that other components of the processing pathway are also limiting when tRNA transcription is increased. According to our model, Maf1-mediated transcription control and nuclear export by Los1 are two major stages of tRNA biosynthesis that are regulated by environmental conditions in a coordinated manner.
NASA Technical Reports Server (NTRS)
Breault, D. T.; Lichtler, A. C.; Rowe, D. W.
1997-01-01
Collagen reporter gene constructs have be used to identify cell-specific sequences needed for transcriptional activation. The elements required for endogenous levels of COL1A1 expression, however, have not been elucidated. The human COL1A1 minigene is expressed at high levels and likely harbors sequence elements required for endogenous levels of activity. Using stably transfected osteoblastic Py1a cells, we studied a series of constructs (pOBColCAT) designed to characterize further the elements required for high level of expression. pOBColCAT, which contains the COL1A1 first intron, was expressed at 50-100-fold higher levels than ColCAT 3.6, which lacks the first intron. This difference is best explained by improved mRNA processing rather than a transcriptional effect. Furthermore, variation in activity observed with the intron deletion constructs is best explained by altered mRNA splicing. Two major regions of the human COL1A1 minigene, the 3'-flanking sequences and the minigene body, were introduced into pOBColCAT to assess both transcriptional enhancing activity and the effect on mRNA stability. Analysis of the minigene body, which includes the first five exons and introns fused with the terminal six introns and exons, revealed an orientation-independent 5-fold increase in CAT activity. In contrast the 3'-flanking sequences gave rise to a modest 61% increase in CAT activity. Neither region increased the mRNA half-life of the parent construct, suggesting that CAT-specific mRNA instability elements may serve as dominant negative regulators of stability. This study suggests that other sites within the body of the COL1A1 minigene are important for high expression, e.g. during periods of rapid extracellular matrix production.
2011-01-01
Background One member of the W family of human endogenous retroviruses (HERV) appears to have been functionally adopted by the human host. Nevertheless, a highly diversified and regulated transcription from a range of HERV-W elements has been observed in human tissues and cells. Aberrant expression of members of this family has also been associated with human disease such as multiple sclerosis (MS) and schizophrenia. It is not known whether this broad expression of HERV-W elements represents transcriptional leakage or specific transcription initiated from the retroviral promoter in the long terminal repeat (LTR) region. Therefore, potential influences of genomic context, structure and orientation on the expression levels of individual HERV-W elements in normal human tissues were systematically investigated. Results Whereas intronic HERV-W elements with a pseudogene structure exhibited a strong anti-sense orientation bias, intronic elements with a proviral structure and solo LTRs did not. Although a highly variable expression across tissues and elements was observed, systematic effects of context, structure and orientation were also observed. Elements located in intronic regions appeared to be expressed at higher levels than elements located in intergenic regions. Intronic elements with proviral structures were expressed at higher levels than those elements bearing hallmarks of processed pseudogenes or solo LTRs. Relative to their corresponding genes, intronic elements integrated on the sense strand appeared to be transcribed at higher levels than those integrated on the anti-sense strand. Moreover, the expression of proviral elements appeared to be independent from that of their corresponding genes. Conclusions Intronic HERV-W provirus integrations on the sense strand appear to have elicited a weaker negative selection than pseudogene integrations of transcripts from such elements. Our current findings suggest that the previously observed diversified and tissue-specific expression of elements in the HERV-W family is the result of both directed transcription (involving both the LTR and internal sequence) and leaky transcription of HERV-W elements in normal human tissues. PMID:21226900
Crosstalk between Wnt Signaling and RNA Processing in Colorectal Cancer.
Bordonaro, Michael
2013-01-01
RNA processing involves a variety of processes affecting gene expression, including the removal of introns through RNA splicing, as well as 3' end processing (cleavage and polyadenylation). Alternative RNA processing is fundamentally important for gene regulation, and aberrant processing is associated with the initiation and progression of cancer. Deregulated Wnt signaling, which is the initiating event in the development of most cases of human colorectal cancer (CRC), has been linked to modified RNA processing, which may contribute to Wnt-mediated colonic carcinogenesis. Crosstalk between Wnt signaling and alternative RNA splicing with relevance to CRC includes effects on the expression of Rac1b, an alternatively spliced gene associated with tumorigenesis, which exhibits alternative RNA splicing that is influenced by Wnt activity. In addition, Tcf4, a crucial component of Wnt signaling, also exhibits alternative splicing, which is likely involved in colonic tumorigenesis. Modulation of 3' end formation, including of the Wnt target gene COX-2, also can influence the neoplastic process, with implications for CRC. While many human genes are dependent on introns and splicing for normal levels of gene expression, naturally intronless genes exist with a unique metabolism that allows for intron-independent gene expression. Effects of Wnt activity on the RNA metabolism of the intronless Wnt-target gene c-jun is a likely contributor to cancer development. Further, butyrate, a breakdown product of dietary fiber and a histone deacetylase inhibitor, upregulates Wnt activity in CRC cells, and also modulates RNA processing; therefore, the interplay between Wnt activity, the modulation of this activity by butyrate, and differential RNA metabolism in colonic cells can significantly influence tumorigenesis. Determining the role played by altered RNA processing in Wnt-mediated neoplasia may lead to novel interventions aimed at restoring normal RNA metabolism for therapeutic benefit. Therefore, this minireview presents a brief overview of several aspects of RNA processing of relevance to cancer, which potentially influence, or are influenced by, Wnt signaling activity.
Identification of processed Chinese medicinal materials using DNA mini-barcoding.
Song, Ming; Dong, Gang-Qiang; Zhang, Ya-Qin; Liu, Xia; Sun, Wei
2017-07-01
Most of Chinese medicinal herbs are subjected to traditional processing procedures, including stir-frying, charring, steaming, boiling, and calcining before they are released into dispensaries. The marketing and identification of processed medicinal materials is a growing issue in the marketplace. However, conventional methods of identification have limitations, while DNA mini-barcoding, based on the sequencing of a short-standardized region, has received considerable attention as a new potential means to identify processed medicinal materials. In the present study, six DNA barcode loci including ITS2, psbA-trnH, rbcL, matK, trnL (UAA) intron and its P6 loop, were employed for the authentication of 45 processed samples belonging to 15 species. We evaluated the amplification efficiency of each locus. We also examined the identification accuracy of the potential mini-barcode locus, of trnL (UAA) intron P6 loop. Our results showed that the five primary barcode loci were successfully amplified in only 8.89%-20% of the processed samples, while the amplification rates of the trnL (UAA) intron P6 loop were higher, at 75.56% successful amplification. We compared the mini-barcode sequences with Genbank using the Blast program. The analysis showed that 45.23% samples could be identified to genus level, while only one sample could be identified to the species level. We conclude that trnL (UAA) p6 loop is a candidate mini-barcode that has shown its potential and may become a universal mini-barcode as complementary barcode for authenticity testing and will play an important role in medicinal materials control. Copyright © 2017 China Pharmaceutical University. Published by Elsevier B.V. All rights reserved.
Schäfer, B; Merlos-Lange, A M; Anderl, C; Welser, F; Zimmer, M; Wolf, K
1991-01-01
In this paper we report the inability of four group I introns in the gene encoding subunit I of cytochrome c oxidase (cox1) and the group II intron in the apocytochrome b gene (cob) to splice autocatalytically. Furthermore we present the characterization of the first cox1 intron in the mutator strain anar-14 and the construction and characterization of strains with intronless mitochondrial genomes. We provide evidence that removal of introns at the DNA level (termed DNA splicing) is dependent on an active RNA maturase. Finally we demonstrate that the absence of introns does not abolish homologous mitochondrial recombination.
Colonization of heterochromatic genes by transposable elements in Drosophila.
Dimitri, Patrizio; Junakovic, Nikolaj; Arcà, Bruno
2003-04-01
As a further step toward understanding transposable element-host genome interactions, we investigated the molecular anatomy of introns from five heterochromatic and 22 euchromatic protein-coding genes of Drosophila melanogaster. A total of 79 kb of intronic sequences from heterochromatic genes and 355 kb of intronic sequences from euchromatic genes have been used in Blast searches against Drosophila transposable elements (TEs). The results show that TE-homologous sequences belonging to 19 different families represent about 50% of intronic DNA from heterochromatic genes. In contrast, only 0.1% of the euchromatic intron DNA exhibits homology to known TEs. Intraspecific and interspecific size polymorphisms of introns were found, which are likely to be associated with changes in TE-related sequences. Together, the enrichment in TEs and the apparent dynamic state of heterochromatic introns suggest that TEs contribute significantly to the evolution of genes located in heterochromatin.
An intron within the 16S ribosomal RNA gene of the archaeon Pyrobaculum aerophilum
NASA Technical Reports Server (NTRS)
Burggraf, S.; Larsen, N.; Woese, C. R.; Stetter, K. O.
1993-01-01
The 16S rRNA genes of Pyrobaculum aerophilum and Pyrobaculum islandicum were amplified by the polymerase chain reaction, and the resulting products were sequenced directly. The two organisms are closely related by this measure (over 98% similar). However, they differ in that the (lone) 16S rRNA gene of Pyrobaculum aerophilum contains a 713-bp intron not seen in the corresponding gene of Pyrobaculum islandicum. To our knowledge, this is the only intron so far reported in the small subunit rRNA gene of a prokaryote. Upon excision the intron is circularized. A secondary structure model of the intron-containing rRNA suggests a splicing mechanism of the same type as that invoked for the tRNA introns of the Archaea and Eucarya and 23S rRNAs of the Archaea. The intron contains an open reading frame whose protein translation shows no certain homology with any known protein sequence.
The group II intron maturase: a reverse transcriptase and splicing factor go hand in hand.
Zhao, Chen; Pyle, Anna Marie
2017-12-01
The splicing of group II introns in vivo requires the assistance of a multifunctional intron encoded protein (IEP, or maturase). Each IEP is also a reverse-transcriptase enzyme that enables group II introns to behave as mobile genetic elements. During splicing or retro-transposition, each group II intron forms a tight, specific complex with its own encoded IEP, resulting in a highly reactive holoenzyme. This review focuses on the structural basis for IEP function, as revealed by recent crystal structures of an IEP reverse transcriptase domain and cryo-EM structures of an IEP-intron complex. These structures explain how the same IEP scaffold is utilized for intron recognition, splicing and reverse transcription, while providing a physical basis for understanding the evolutionary transformation of the IEP into the eukaryotic splicing factor Prp8. Copyright © 2017 Elsevier Ltd. All rights reserved.
Asakura, Yukari; Barkan, Alice
2007-12-01
The CRM domain is a recently recognized RNA binding domain found in three group II intron splicing factors in chloroplasts, in a bacterial protein that associates with ribosome precursors, and in a family of uncharacterized proteins in plants. To elucidate the functional repertoire of proteins with CRM domains, we studied CFM2 (for CRM Family Member 2), which harbors four CRM domains. RNA coimmunoprecipitation assays showed that CFM2 in maize (Zea mays) chloroplasts is associated with the group I intron in pre-trnL-UAA and group II introns in the ndhA and ycf3 pre-mRNAs. T-DNA insertions in the Arabidopsis thaliana ortholog condition a defective-seed phenotype (strong allele) or chlorophyll-deficient seedlings with impaired splicing of the trnL group I intron and the ndhA, ycf3-int1, and clpP-int2 group II introns (weak alleles). CFM2 and two previously described CRM proteins are bound simultaneously to the ndhA and ycf3-int1 introns and act in a nonredundant fashion to promote their splicing. With these findings, CRM domain proteins are implicated in the activities of three classes of catalytic RNA: group I introns, group II introns, and 23S rRNA.
Nisa-Martínez, Rafael; Laporte, Philippe; Jiménez-Zurdo, José Ignacio; Frugier, Florian; Crespi, Martin; Toro, Nicolás
2013-01-01
Some bacterial group II introns are widely used for genetic engineering in bacteria, because they can be reprogrammed to insert into the desired DNA target sites. There is considerable interest in developing this group II intron gene targeting technology for use in eukaryotes, but nuclear genomes present several obstacles to the use of this approach. The nuclear genomes of eukaryotes do not contain group II introns, but these introns are thought to have been the progenitors of nuclear spliceosomal introns. We investigated the expression and subcellular localization of the bacterial RmInt1 group II intron-encoded protein (IEP) in Arabidopsis thaliana protoplasts. Following the expression of translational fusions of the wild-type protein and several mutant variants with EGFP, the full-length IEP was found exclusively in the nucleolus, whereas the maturase domain alone targeted EGFP to nuclear speckles. The distribution of the bacterial RmInt1 IEP in plant cell protoplasts suggests that the compartmentalization of eukaryotic cells into nucleus and cytoplasm does not prevent group II introns from invading the host genome. Furthermore, the trafficking of the IEP between the nucleolus and the speckles upon maturase inactivation is consistent with the hypothesis that the spliceosomal machinery evolved from group II introns.
Nisa-Martínez, Rafael; Laporte, Philippe; Jiménez-Zurdo, José Ignacio; Frugier, Florian; Crespi, Martin; Toro, Nicolás
2013-01-01
Some bacterial group II introns are widely used for genetic engineering in bacteria, because they can be reprogrammed to insert into the desired DNA target sites. There is considerable interest in developing this group II intron gene targeting technology for use in eukaryotes, but nuclear genomes present several obstacles to the use of this approach. The nuclear genomes of eukaryotes do not contain group II introns, but these introns are thought to have been the progenitors of nuclear spliceosomal introns. We investigated the expression and subcellular localization of the bacterial RmInt1 group II intron-encoded protein (IEP) in Arabidopsis thaliana protoplasts. Following the expression of translational fusions of the wild-type protein and several mutant variants with EGFP, the full-length IEP was found exclusively in the nucleolus, whereas the maturase domain alone targeted EGFP to nuclear speckles. The distribution of the bacterial RmInt1 IEP in plant cell protoplasts suggests that the compartmentalization of eukaryotic cells into nucleus and cytoplasm does not prevent group II introns from invading the host genome. Furthermore, the trafficking of the IEP between the nucleolus and the speckles upon maturase inactivation is consistent with the hypothesis that the spliceosomal machinery evolved from group II introns. PMID:24391881
Comparative Analysis of Vertebrate Dystrophin Loci Indicate Intron Gigantism as a Common Feature
Pozzoli, Uberto; Elgar, Greg; Cagliani, Rachele; Riva, Laura; Comi, Giacomo P.; Bresolin, Nereo; Bardoni, Alessandra; Sironi, Manuela
2003-01-01
The human DMD gene is the largest known to date, spanning > 2000 kb on the X chromosome. The gene size is mainly accounted for by huge intronic regions. We sequenced 190 kb of Fugu rubripes (pufferfish) genomic DNA corresponding to the complete dystrophin gene (FrDMD) and provide the first report of gene structure and sequence comparison among dystrophin genomic sequences from different vertebrate organisms. Almost all intron positions and phases are conserved between FrDMD and its mammalian counterparts, and the predicted protein product of the Fugu gene displays 55% identity and 71% similarity to human dystrophin. In analogy to the human gene, FrDMD presents several-fold longer than average intronic regions. Analysis of intron sequences of the human and murine genes revealed that they are extremely conserved in size and that a similar fraction of total intron length is represented by repetitive elements; moreover, our data indicate that intron expansion through repeat accumulation in the two orthologs is the result of independent insertional events. The hypothesis that intron length might be functionally relevant to the DMD gene regulation is proposed and substantiated by the finding that dystrophin intron gigantism is common to the three vertebrate genes. [Supplemental material is available online at www.genome.org.] PMID:12727896
Mechanism for DNA transposons to generate introns on genomic scales
Huff, Jason T.; Zilberman, Daniel; Roy, Scott W.
2017-01-01
Discovered four decades ago, the existence of introns was one of the most unexpected findings in molecular biology1. Introns are sequences interrupting genes that must be removed as part of mRNA production. Genome sequencing projects have documented that most eukaryotic genes contain at least one and frequently many introns2,3. Comparison of these genomes reveals a history of long evolutionary periods with little intron gain punctuated by episodes of rapid, extensive gain2,3. However, no detailed mechanism for such episodic intron generation has been empirically supported on a sufficient scale, despite several proposals4–8. Here we show how short non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from gene sequence duplicated upon transposon insertion, allowing perfect splicing out of RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between preexisting nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases2 and prevalence of nucleosome-sized exons9 observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism plausibly accounting for episodes of rapid, extensive intron gain during eukaryotic evolution2,3. PMID:27760113
Zeng, Xian-Chun; Nie, Yao; Luo, Xuesong; Wu, Shifen; Shi, Wanxia; Zhang, Lei; Liu, Yichen; Cao, Hanjun; Yang, Ye; Zhou, Jianping
2013-03-01
The full-length cDNA sequences of two novel cysteine-rich peptides (referred to as HsVx1 and MmKTx1) were obtained from scorpions. The two peptides represent a novel class of cysteine-rich peptides with a unique cysteine pattern. The genomic sequence of HsVx1 is composed of three exons interrupted by two introns that are localized in the mature peptide encoding region and inserted in phase 1 and phase 2, respectively. Such a genomic organization markedly differs from those of other peptides from scorpions described previously. Genome-wide search for the orthologs of HsVx1 identified 59 novel cysteine-rich peptides from arthropods. These peptides share a consistent cysteine pattern with HsVx1. Genomic comparison revealed extensive intron length differences and intronic number and position polymorphisms among the genes of these peptides. Further analysis identified 30 cases of intron sliding, 1 case of intron gain and 22 cases of intron loss occurred with the genes of the HsVx1 and HsVx1-like peptides. It is interesting to see that three HsVx1-like peptides XP_001658928, XP_001658929 and XP_001658930 were derived from a single gene (XP gene): the former two were generated from alternative splicing; the third one was encoded by a DNA region in the reverse complementary strand of the third intron of the XP gene. These findings strongly suggest that the genes of these cysteine-rich peptides were evolved by intron sliding, intron gain/loss, gene recombination and alternative splicing events in response to selective forces without changing their cysteine pattern. The evolution of these genes is dominated by intron sliding and intron loss. Copyright © 2012 Elsevier Inc. All rights reserved.
Circular RNAs: Unexpected outputs of many protein-coding genes
Wilusz, Jeremy E.
2017-01-01
ABSTRACT Pre-mRNAs from thousands of eukaryotic genes can be non-canonically spliced to generate circular RNAs, some of which accumulate to higher levels than their associated linear mRNA. Recent work has revealed widespread mechanisms that dictate whether the spliceosome generates a linear or circular RNA. For most genes, circular RNA biogenesis via backsplicing is far less efficient than canonical splicing, but circular RNAs can accumulate due to their long half-lives. Backsplicing is often initiated when complementary sequences from different introns base pair and bring the intervening splice sites close together. This process is further regulated by the combinatorial action of RNA binding proteins, which allow circular RNAs to be expressed in unique patterns. Some genes do not require complementary sequences to generate RNA circles and instead take advantage of exon skipping events. It is still unclear what most mature circular RNAs do, but future investigations into their functions will be facilitated by recently described methods to modulate circular RNA levels. PMID:27571848
Gonzalez, P; Barroso, G; Labarère, J
1998-10-05
The Basidiomycota Agrocybe aegerita (Aa) mitochondrial cox1 gene (6790 nucleotides), encoding a protein of 527aa (58377Da), is split by four large subgroup IB introns possessing site-specific endonucleases assumed to be involved in intron mobility. When compared to other fungal COX1 proteins, the Aa protein is closely related to the COX1 one of the Basidiomycota Schizophyllum commune (Sc). This clade reveals a relationship with the studied Ascomycota ones, with the exception of Schizosaccharomyces pombe (Sp) which ranges in an out-group position compared with both higher fungi divisions. When comparison is extended to other kingdoms, fungal COX1 sequences are found to be more related to algae and plant ones (more than 57.5% aa similarity) than to animal sequences (53.6% aa similarity), contrasting with the previously established close relationship between fungi and animals, based on comparisons of nuclear genes. The four Aa cox1 introns are homologous to Ascomycota or algae cox1 introns sharing the same location within the exonic sequences. The percentages of identity of the intronic nucleotide sequences suggest a possible acquisition by lateral transfers of ancestral copies or of their derived sequences. These identities extend over the whole intronic sequences, arguing in favor of a transfer of the complete intron rather than a transfer limited to the encoded ORF. The intron i4 shares 74% of identity, at the nucleotidic level, with the Podospora anserina (Pa) intron i14, and up to 90.5% of aa similarity between the encoded proteins, i.e. the highest values reported to date between introns of two phylogenetically distant species. This low divergence argues for a recent lateral transfer between the two species. On the contrary, the low sequence identities (below 36%) observed between Aa i1 and the homologous Sp i1 or Prototheca wickeramii (Pw) i1 suggest a long evolution time after the separation of these sequences. The introns i2 and i3 possessed intermediate percentages of identity with their homologous Ascomycota introns. This is the first report of the complete nucleotide sequence and molecular organization of a mitochondrial cox1 gene of any member of the Basidiomycota division.
Haut, Donald D.; Pintel, D. J.
1998-01-01
Alternative splicing of pre-mRNAs plays a critical role in maximizing the coding capacity of the small parvovirus genome. The small-intron region of minute virus of mice (MVM) pre-mRNAs undergoes an unusual pattern of overlapping alternative splicing—using two donors (D1 and D2) and two acceptors (A1 and A2) within a region of 120 nucleotides—that determines the steady-state ratios of the various viral mRNAs. In this report, we show that the determinants that govern excision of the small intron are complex and are also required for efficient definition of the upstream exon. For the MVM small intron in its natural context, the two donors appear to compete for the splicing machinery: the position of D1 favors its usage, while the primary sequence of D2 must be more like the consensus sequence than is D1 to be used efficiently. We have genetically defined the branch points that are used for generation of the major and minor spliced forms and show that recognition of components of the small-intron acceptors is likely to be the dominant determinant in alternative small-intron excision. We have also identified a G-rich intronic enhancer sequence within the small intron that is essential for splicing of the minor form (D2 to A2) but not the major form (D1 to A1) of MVM mRNAs and is required for efficient definition of the upstream NS2-specific exon. In its natural context, the small intron appears to be excised by a mechanism consistent with intron definition. When the MVM small intron is expanded, various parameters of its excision are altered, indicating that critical cis-acting signals are context dependent. Relative use of the donors and acceptors is altered, and the upstream NS2-specific exon is no longer efficiently defined. The fact that definition of the upstream NS2-specific exon can be achieved by the MVM small intron in its natural context, but not when it is expanded, suggests that the multiple determinants that govern definition and excision of the small intron are required, in concert, for upstream exon definition. Our data are consistent with a model in which alternative splicing of the MVM P4-generated pre-mRNAs is governed by a hybrid of intron- and exon-defining mechanisms. PMID:9499034
Hill, J; McGraw, P; Tzagoloff, A
1985-03-25
The yeast nuclear gene CBP2 was previously proposed to code for a protein necessary for processing of the terminal intron in the cytochrome b pre-mRNA (McGraw, P., and Tzagoloff, A. (1983) J. Biol. Chem. 258, 9459-9468). In the present study we describe a mitochondrial mutation capable of suppressing the respiratory deficiency of cbp2 mutants. The mitochondrial suppressor mutation has been shown to be the result of a precise excision of the last intervening sequence from the cytochrome b gene. Strains with the altered mitochondrial DNA have normal levels of mature cytochrome b mRNA and of cytochrome b and exhibit wild type growth on glycerol. These results confirm that CBP2 codes for a protein specifically required for splicing of the cytochrome b intron and further suggest that absence of the intervening sequence does not noticeably affect the expression of respiratory function in mitochondria.
Intron retention and nuclear loss of SFPQ are molecular hallmarks of ALS.
Luisier, Raphaelle; Tyzack, Giulia E; Hall, Claire E; Mitchell, Jamie S; Devine, Helen; Taha, Doaa M; Malik, Bilal; Meyer, Ione; Greensmith, Linda; Newcombe, Jia; Ule, Jernej; Luscombe, Nicholas M; Patani, Rickie
2018-05-22
Mutations causing amyotrophic lateral sclerosis (ALS) strongly implicate ubiquitously expressed regulators of RNA processing. To understand the molecular impact of ALS-causing mutations on neuronal development and disease, we analysed transcriptomes during in vitro differentiation of motor neurons (MNs) from human control and patient-specific VCP mutant induced-pluripotent stem cells (iPSCs). We identify increased intron retention (IR) as a dominant feature of the splicing programme during early neural differentiation. Importantly, IR occurs prematurely in VCP mutant cultures compared with control counterparts. These aberrant IR events are also seen in independent RNAseq data sets from SOD1- and FUS-mutant MNs. The most significant IR is seen in the SFPQ transcript. The SFPQ protein binds extensively to its retained intron, exhibits lower nuclear abundance in VCP mutant cultures and is lost from nuclei of MNs in mouse models and human sporadic ALS. Collectively, we demonstrate SFPQ IR and nuclear loss as molecular hallmarks of familial and sporadic ALS.
Reenacting the birth of an intron
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hellsten, Uffe; Aspden, Julie L.; Rio, Donald C.
2011-07-01
An intron is an extended genomic feature whose function requires multiple constrained positions - donor and acceptor splice sites, a branch point, a polypyrimidine tract and suitable splicing enhancers - that may be distributed over hundreds or thousands of nucleotides. New introns are therefore unlikely to emerge by incremental accumulation of functional sub-elements. Here we demonstrate that a functional intron can be created de novo in a single step by a segmental genomic duplication. This experiment recapitulates in vivo the birth of an intron that arose in the ancestral jawed vertebrate lineage nearly half a billion years ago.
Jia, Fan; Rock, Christopher D.
2013-01-01
MicroRNAs (miRNAs) are ~21-nucleotide long endogenous small RNAs that regulate gene expression through post-transcriptional or transcriptional gene silencing (PTGS/TGS) and/or translational inhibition. miRNAs can arise from the “exon” of a MIRNA gene, from an intron (e.g. mirtrons in animals), or from the antisense strand of a protein coding gene (natural antisense microRNAs, nat-miRNAs). Here we demonstrate that two functionally related miRNAs, miR842 and miR846, arise from the same transcription unit but from alternate splicing isoforms. miR846 is expressed only from Isoform1 while in Isoforms2 and -3, a part of pre-miR846 containing the miRNA* sequence is included in the intron. The splicing of the intron truncates the pre-MIRNA and disrupts the expression of the mature miR846.. We name this novel phenomenon splicing-regulated miRNA. Abscisic acid (ABA) is shown to mediate the alternative splicing event by reducing the functional Isoform1 and increasing the non-functional Isoform3, thus repressing the expression of miR846 concomitant with accumulation of an ABA-inducible target jacalin At5g28520 mRNA, whose cleavage was shown by modified 5′-RACE. This regulation shows the functional importance of splicing-regulated miRNA and suggests possible mechanisms for altered ABA response phenotypes of miRNA biogenesis mutants. A. lyrata-MIR842 and Aly-MIR846 have conserved genomic arrangements with A. thaliana and candidate target jacalins, similar primary transcript structures and intron processing, and better miRNA-miRNA* pairings, suggesting that the interactions between ABA, MIR842, MIR846 and jacalins are similar in A. lyrata. Together, splicing-regulated miRNAs, nat-miRNAs/inc-miRNAs and mirtrons illustrate the complexity of MIRNA genes, the importance of introns in the biogenesis and regulation of miRNAs, and raise questions about the processes and molecular mechanisms that drive MIRNA evolution. PMID:23341152
The relationship between RNA catalytic processes
NASA Astrophysics Data System (ADS)
Cedergren, Robert; Lang, B. Franz; Gravel, Denis
1988-09-01
Proposals that an RNA-based genetic system preceeded DNA, stem from the ability of RNA to store genetic information and to promote simple catalysis. However, to be a valid basis for the RNA world, RNA catalysis must demonstrate or be related to intrinsic chemical properties which could have existed in primordial times. We analyze this question by first classifying RNA catalysis and related processes according to their mechanism. We define: (A) thedisjunct nucleophile class which leads to 5'-phosphates. These include Group I and II intron splicing, nuclear mRNA splicing and RNase P reactions. Although Group I introns and its excision mechanism is likely to have existed in primordial times, present-day examples have arisen independently in different phyla much more recently. Comparative methodology indicates that RNase P catalysis originated before the divergence of the major kingdoms. In addition, alldisjunct nucleophile reactions can be interrelated by a proposed mechanism involving a distant 2-OH nucleophile. (B) theconjunct nucleophile class leading to 3'-phosphates. This class is composed of self-cleaving RNAs found in plant viruses and the newt. We propose that tRNA splicing is related to this mechanism rather than the previous one. The presence of introns in tRNA genes of eukaryotes and archaebacteria supports the idea that tRNA splicing predates the divergence of these cell types.
Changes in exon–intron structure during vertebrate evolution affect the splicing pattern of exons
Gelfman, Sahar; Burstein, David; Penn, Osnat; Savchenko, Anna; Amit, Maayan; Schwartz, Schraga; Pupko, Tal; Ast, Gil
2012-01-01
Exon–intron architecture is one of the major features directing the splicing machinery to the short exons that are located within long flanking introns. However, the evolutionary dynamics of exon–intron architecture and its impact on splicing is largely unknown. Using a comparative genomic approach, we analyzed 17 vertebrate genomes and reconstructed the ancestral motifs of both 3′ and 5′ splice sites, as also the ancestral length of exons and introns. Our analyses suggest that vertebrate introns increased in length from the shortest ancestral introns to the longest primate introns. An evolutionary analysis of splice sites revealed that weak splice sites act as a restrictive force keeping introns short. In contrast, strong splice sites allow recognition of exons flanked by long introns. Reconstruction of the ancestral state suggests these phenomena were not prevalent in the vertebrate ancestor, but appeared during vertebrate evolution. By calculating evolutionary rate shifts in exons, we identified cis-acting regulatory sequences that became fixed during the transition from early vertebrates to mammals. Experimental validations performed on a selection of these hexamers confirmed their regulatory function. We additionally revealed many features of exons that can discriminate alternative from constitutive exons. These features were integrated into a machine-learning approach to predict whether an exon is alternative. Our algorithm obtains very high predictive power (AUC of 0.91), and using these predictions we have identified and successfully validated novel alternatively spliced exons. Overall, we provide novel insights regarding the evolutionary constraints acting upon exons and their recognition by the splicing machinery. PMID:21974994
Two CRM protein subfamilies cooperate in the splicing of group IIB introns in chloroplasts.
Asakura, Yukari; Bayraktar, Omer Ali; Barkan, Alice
2008-11-01
Chloroplast genomes in angiosperms encode approximately 20 group II introns, approximately half of which are classified as subgroup IIB. The splicing of all but one of the subgroup IIB introns requires a heterodimer containing the peptidyl-tRNA hydrolase homolog CRS2 and one of two closely related proteins, CAF1 or CAF2, that harbor a recently recognized RNA binding domain called the CRM domain. Two CRS2/CAF-dependent introns require, in addition, a CRM domain protein called CFM2 that is only distantly related to CAF1 and CAF2. Here, we show that CFM3, a close relative of CFM2, associates in vivo with those CRS2/CAF-dependent introns that are not CFM2 ligands. Mutant phenotypes in rice and Arabidopsis support a role for CFM3 in the splicing of most of the introns with which it associates. These results show that either CAF1 or CAF2 and either CFM2 or CFM3 simultaneously bind most chloroplast subgroup IIB introns in vivo, and that the CAF and CFM subunits play nonredundant roles in splicing. These results suggest that the expansion of the CRM protein family in plants resulted in two subfamilies that play different roles in group II intron splicing, with further diversification within a subfamily to accommodate multiple intron ligands.
The splicing of tiny introns of Paramecium is controlled by MAGO.
Contreras, Julia; Begley, Victoria; Marsella, Laura; Villalobo, Eduardo
2018-07-15
The exon junction complex (EJC) is a key element of the splicing machinery. The EJC core is composed of eIF4A3, MAGO, Y14 and MLN51. Few accessory proteins, such as CWC22 or UPF3, bind transiently to the EJC. The EJC has been implicated in the control of the splicing of long introns. To ascertain whether the EJC controls the splicing of short introns, we used Paramecium tetraurelia as a model organism, since it has thousands of very tiny introns. To elucidate whether EJC affects intron splicing in P. tetraurelia, we searched for EJC protein-coding genes, and silenced those genes coding for eIF4A3, MAGO and CWC22. We found that P. tetraurelia likely assembles an active EJC with only three of the core proteins, since MLN51 is lacking. Silencing of eIF4A3 or CWC22 genes, but not that of MAGO, caused lethality. Silencing of the MAGO gene caused either an increase, decrease, or no change in intron retention levels of some intron-containing mRNAs used as reporters. We suggest that a fine-tuning expression of EJC genes is required for steady intron removal in P. tetraurelia. Taking into consideration our results and those published by others, we conclude that the EJC controls splicing independently of the intron size. Copyright © 2018 Elsevier B.V. All rights reserved.
Gomes, Felipe E E S; Arantes, Thales D; Fernandes, José A L; Ferreira, Leonardo C; Romero, Héctor; Bosco, Sandra M G; Oliveira, Maria T B; Del Negro, Gilda M B; Theodoro, Raquel C
2018-01-01
Cryptococcosis, one of the most important systemic mycosis in the world, is caused by different genotypes of Cryptococcus neoformans and Cryptococcus gattii , which differ in their ecology, epidemiology, and antifungal susceptibility. Therefore, the search for new molecular markers for genotyping, pathogenicity and drug susceptibility is necessary. Group I introns fulfill the requisites for such task because (i) they are polymorphic sequences; (ii) their self-splicing is inhibited by some drugs; and (iii) their correct splicing under parasitic conditions is indispensable for pathogen survival. Here, we investigated the presence of group I introns in the mitochondrial LSU rRNA gene in 77 Cryptococcus isolates and its possible relation to drug susceptibility. Sequencing revealed two new introns in the LSU rRNA gene. All the introns showed high sequence similarity to other mitochondrial introns from distinct fungi, supporting the hypothesis of an ancient non-allelic invasion. Intron presence was statistically associated with those genotypes reported to be less pathogenic ( p < 0.001). Further virulence assays are needed to confirm this finding. In addition, in vitro antifungal tests indicated that the presence of LSU rRNA introns may influence the minimum inhibitory concentration (MIC) of amphotericin B and 5-fluorocytosine. These findings point to group I introns in the mitochondrial genome of Cryptococcus as potential molecular markers for antifungal resistance, as well as therapeutic targets.
Chee, Gab-Joo; Takami, Hideto
2011-01-01
Group II introns inserted into genes often undergo splicing at unexpected sites, and participate in the transcription of host genes. We identified five copies of a group II intron, designated Oi.Int, in the genome of an extremely halotolerant and alkaliphilic bacillus, Oceanobacillus iheyensis. The Oi.Int4 differs from the Oi.Int3 at four bases. The ligated exons of the Oi.Int4 could not be detected by RT-PCR assays in vivo or in vitro although group II introns can generally self-splice in vitro without the involvement of an intron-encoded open reading frame (ORF). In the Oi.Int4 mutants with base substitutions within the ORF, ligated exons were detected by in vitro self-splicing. It was clear that the ligation of exons during splicing is affected by the sequence of the intron-encoded ORF since the splice sites corresponded to the joining sites of the intron. In addition, the mutant introns showed unexpected multiple products with alternative 5' splice sites. These findings imply that alternative 5' splicing which causes a functional change of ligated exons presumably has influenced past adaptations of O. iheyensis to various environmental changes.
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture.
Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen; Burge, Christopher B
2017-12-27
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning ('intron definition') or exon-spanning ('exon definition') pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila , using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60-70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.
Rasschaert, Perrine; Dambrine, Ginette; Rasschaert, Denis; Laurent, Sylvie
2016-01-01
ABSTRACT Interplay between alternative splicing and the Microprocessor may have differential effects on the expression of intronic miRNAs organized into clusters. We used a viral model — the LAT long non-coding RNA (LAT lncRNA) of Marek's disease oncogenic herpesvirus (MDV-1), which has the mdv1-miR-M8-M6-M7-M10 cluster embedded in its first intron — to assess the impact of splicing modifications on the biogenesis of each of the miRNAs from the cluster. Drosha silencing and alternative splicing of an extended exon 2 of the LAT lncRNA from a newly identified 3′ splice site (SS) at the end of the second miRNA of the cluster showed that mdv1-miR-M6 was a 5′-tailed mirtron. We have thus identified the first 5′-tailed mirtron within a cluster of miRNAs for which alternative splicing is directly associated with differential expression of the other miRNAs of the cluster, with an increase in intronic mdv1-miR-M8 expression and a decrease in expression of the exonic mdv1-miR-M7, and indirectly associated with regulation of the host transcript. According to the alternative 3SS used for the host intron splicing, the mdv1-miR-M6 is processed as a mirtron by the spliceosome, dispatching the other miRNAs of the cluster into intron and exon, or as a canonical miRNA by the Microprocessor complex. The viral mdv1-miR-M6 mirtron is the first mirtron described that can also follow the canonical pathway. PMID:27715458
Melangath, Geetha; Sen, Titash; Kumar, Rakesh; Bawa, Pushpinder; Srinivasan, Subha; Vijayraghavan, Usha
2017-01-01
Budding yeast spliceosomal factors ScSlu7 and ScPrp18 interact and mediate intron 3'ss choice during second step pre-mRNA splicing. The fission yeast genome with abundant multi-intronic transcripts, degenerate splice signals and SR proteins is an apt unicellular fungal model to deduce roles for core spliceosomal factors in alternative splice-site choice, intron retention and to study the cellular implications of regulated splicing. From our custom microarray data we deduce a stringent reproducible subset of S. pombe alternative events. We examined the role of factors SpSlu7 or SpPrp18 for these splice events and investigated the relationship to growth phase and stress. Wild-type log and stationary phase cells showed ats1+ exon 3 skipped and intron 3 retained transcripts. Interestingly the non-consensus 5'ss in ats1+ intron 3 caused SpSlu7 and SpPrp18 dependent intron retention. We validated the use of an alternative 5'ss in dtd1+ intron 1 and of an upstream alternative 3'ss in DUF3074 intron 1. The dtd1+ intron 1 non-canonical 5'ss yielded an alternative mRNA whose levels increased in stationary phase. Utilization of dtd1+ intron 1 sub-optimal 5' ss required functional SpPrp18 and SpSlu7 while compromise in SpSlu7 function alone hampered the selection of the DUF3074 intron 1 non canonical 3'ss. We analysed the relative abundance of these splice isoforms during mild thermal, oxidative and heavy metal stress and found stress-specific splice patterns for ats1+ and DUF3074 intron 1 some of which were SpSlu7 and SpPrp18 dependent. By studying ats1+ splice isoforms during compromised transcription elongation rates in wild-type, spslu7-2 and spprp18-5 mutant cells we found dynamic and intron context-specific effects in splice-site choice. Our work thus shows the combinatorial effects of splice site strength, core splicing factor functions and transcription elongation kinetics to dictate alternative splice patterns which in turn serve as an additional recourse of gene regulation in fission yeast.
Kumar, Rakesh; Bawa, Pushpinder; Srinivasan, Subha
2017-01-01
Budding yeast spliceosomal factors ScSlu7 and ScPrp18 interact and mediate intron 3’ss choice during second step pre-mRNA splicing. The fission yeast genome with abundant multi-intronic transcripts, degenerate splice signals and SR proteins is an apt unicellular fungal model to deduce roles for core spliceosomal factors in alternative splice-site choice, intron retention and to study the cellular implications of regulated splicing. From our custom microarray data we deduce a stringent reproducible subset of S. pombe alternative events. We examined the role of factors SpSlu7 or SpPrp18 for these splice events and investigated the relationship to growth phase and stress. Wild-type log and stationary phase cells showed ats1+ exon 3 skipped and intron 3 retained transcripts. Interestingly the non-consensus 5’ss in ats1+ intron 3 caused SpSlu7 and SpPrp18 dependent intron retention. We validated the use of an alternative 5’ss in dtd1+ intron 1 and of an upstream alternative 3’ss in DUF3074 intron 1. The dtd1+ intron 1 non-canonical 5’ss yielded an alternative mRNA whose levels increased in stationary phase. Utilization of dtd1+ intron 1 sub-optimal 5’ ss required functional SpPrp18 and SpSlu7 while compromise in SpSlu7 function alone hampered the selection of the DUF3074 intron 1 non canonical 3’ss. We analysed the relative abundance of these splice isoforms during mild thermal, oxidative and heavy metal stress and found stress-specific splice patterns for ats1+ and DUF3074 intron 1 some of which were SpSlu7 and SpPrp18 dependent. By studying ats1+ splice isoforms during compromised transcription elongation rates in wild-type, spslu7-2 and spprp18-5 mutant cells we found dynamic and intron context-specific effects in splice-site choice. Our work thus shows the combinatorial effects of splice site strength, core splicing factor functions and transcription elongation kinetics to dictate alternative splice patterns which in turn serve as an additional recourse of gene regulation in fission yeast. PMID:29236736
Lamech, Lilian T.; Mallam, Anna L.; Lambowitz, Alan M.
2014-01-01
The Neurospora crassa mitochondrial tyrosyl-tRNA synthetase (mtTyrRS; CYT-18 protein) evolved a new function as a group I intron splicing factor by acquiring the ability to bind group I intron RNAs and stabilize their catalytically active RNA structure. Previous studies showed: (i) CYT-18 binds group I introns by using both its N-terminal catalytic domain and flexibly attached C-terminal anticodon-binding domain (CTD); and (ii) the catalytic domain binds group I introns specifically via multiple structural adaptations that occurred during or after the divergence of Peziomycotina and Saccharomycotina. However, the function of the CTD and how it contributed to the evolution of splicing activity have been unclear. Here, small angle X-ray scattering analysis of CYT-18 shows that both CTDs of the homodimeric protein extend outward from the catalytic domain, but move inward to bind opposite ends of a group I intron RNA. Biochemical assays show that the isolated CTD of CYT-18 binds RNAs non-specifically, possibly contributing to its interaction with the structurally different ends of the intron RNA. Finally, we find that the yeast mtTyrRS, which diverged from Pezizomycotina fungal mtTyrRSs prior to the evolution of splicing activity, binds group I intron and other RNAs non-specifically via its CTD, but lacks further adaptations needed for group I intron splicing. Our results suggest a scenario of constructive neutral (i.e., pre-adaptive) evolution in which an initial non-specific interaction between the CTD of an ancestral fungal mtTyrRS and a self-splicing group I intron was “fixed” by an intron RNA mutation that resulted in protein-dependent splicing. Once fixed, this interaction could be elaborated by further adaptive mutations in both the catalytic domain and CTD that enabled specific binding of group I introns. Our results highlight a role for non-specific RNA binding in the evolution of RNA-binding proteins. PMID:25536042
Lamech, Lilian T; Mallam, Anna L; Lambowitz, Alan M
2014-12-01
The Neurospora crassa mitochondrial tyrosyl-tRNA synthetase (mtTyrRS; CYT-18 protein) evolved a new function as a group I intron splicing factor by acquiring the ability to bind group I intron RNAs and stabilize their catalytically active RNA structure. Previous studies showed: (i) CYT-18 binds group I introns by using both its N-terminal catalytic domain and flexibly attached C-terminal anticodon-binding domain (CTD); and (ii) the catalytic domain binds group I introns specifically via multiple structural adaptations that occurred during or after the divergence of Peziomycotina and Saccharomycotina. However, the function of the CTD and how it contributed to the evolution of splicing activity have been unclear. Here, small angle X-ray scattering analysis of CYT-18 shows that both CTDs of the homodimeric protein extend outward from the catalytic domain, but move inward to bind opposite ends of a group I intron RNA. Biochemical assays show that the isolated CTD of CYT-18 binds RNAs non-specifically, possibly contributing to its interaction with the structurally different ends of the intron RNA. Finally, we find that the yeast mtTyrRS, which diverged from Pezizomycotina fungal mtTyrRSs prior to the evolution of splicing activity, binds group I intron and other RNAs non-specifically via its CTD, but lacks further adaptations needed for group I intron splicing. Our results suggest a scenario of constructive neutral (i.e., pre-adaptive) evolution in which an initial non-specific interaction between the CTD of an ancestral fungal mtTyrRS and a self-splicing group I intron was "fixed" by an intron RNA mutation that resulted in protein-dependent splicing. Once fixed, this interaction could be elaborated by further adaptive mutations in both the catalytic domain and CTD that enabled specific binding of group I introns. Our results highlight a role for non-specific RNA binding in the evolution of RNA-binding proteins.
Torrado, Mario; Iglesias, Raquel; Nespereira, Beatriz; Centeno, Alberto; López, Eduardo; Mikhailov, Alexander T
2009-07-01
The cardiac ankyrin repeat domain 1 protein (ANKRD1, also known as CARP) has been extensively characterized with regard to its proposed functions as a cardio-enriched transcriptional co-factor and stress-inducible myofibrillar protein. The present results show the occurrence of alternative splicing by intron retention events in the pig and human ankrd1 gene. In pig heart, ankrd1 is expressed as four alternatively spliced transcripts, three of which have non-excised introns: ankrd1-contained introns 6, 7 and 8 (i.e., ankrd1-i6,7,8), ankrd1-contained introns 7 and 8 (i.e., ankrd1-i7,8), and ankrd1 retained only intron 8 (i.e., ankrd1-i8). In the human heart, two orthologues of porcine intron-retaining ankrd1 variants (i.e., ankrd1-i8 and ankrd1-i7,8) are detected. We demonstrate that these newly-identified intron-retaining ankrd1 transcripts are functionally intact, efficiently translated into protein in vitro and exported to the cytoplasm in cardiomyocytes in vivo. In the piglet heart, both the intronless and intron-retaining ankrd1 mRNAs are co-expressed in a chamber-dependent manner being more abundant in the left as compared to the right myocardium. Our data further indicate co-upregulation of the ankrd1 spliced variants in myocardium in the porcine model of diastolic heart failure. Most significantly, we demonstrate that in vivo forced expression of recombinant intronless ankrd1 markedly increases the levels of intron-retaining ankrd1 variants (but not of the endogenous main transcript) in piglet myocardium, suggesting that ANKRD1 may positively regulate the expression of its own intron-containing RNAs in response to cardiac stress. Overall, our findings demonstrate that in cardiomyocytes ANKRD1 can exist in multiple isoforms which may contribute to the functional diversity of this factor in heart development and disease.
Landscape of the spliced leader trans-splicing mechanism in Schistosoma mansoni.
Boroni, Mariana; Sammeth, Michael; Gava, Sandra Grossi; Jorge, Natasha Andressa Nogueira; Macedo, Andréa Mara; Machado, Carlos Renato; Mourão, Marina Moraes; Franco, Glória Regina
2018-03-01
Spliced leader dependent trans-splicing (SLTS) has been described as an important RNA regulatory process that occurs in different organisms, including the trematode Schistosoma mansoni. We identified more than seven thousand putative SLTS sites in the parasite, comprising genes with a wide spectrum of functional classes, which underlines the SLTS as a ubiquitous mechanism in the parasite. Also, SLTS gene expression levels span several orders of magnitude, showing that SLTS frequency is not determined by the expression level of the target gene, but by the presence of particular gene features facilitating or hindering the trans-splicing mechanism. Our in-depth investigation of SLTS events demonstrates widespread alternative trans-splicing (ATS) acceptor sites occurring in different regions along the entire gene body, highlighting another important role of SLTS generating alternative RNA isoforms in the parasite, besides the polycistron resolution. Particularly for introns where SLTS directly competes for the same acceptor substrate with cis-splicing, we identified for the first time additional and important features that might determine the type of splicing. Our study substantially extends the current knowledge of RNA processing by SLTS in S. mansoni, and provide basis for future studies on the trans-splicing mechanism in other eukaryotes.
Li, Yang I; Sanchez-Pulido, Luis; Haerty, Wilfried; Ponting, Chris P
2015-01-01
Ninety-four percent of mammalian protein-coding exons exceed 51 nucleotides (nt) in length. The paucity of micro-exons (≤ 51 nt) suggests that their recognition and correct processing by the splicing machinery present greater challenges than for longer exons. Yet, because thousands of human genes harbor processed micro-exons, specialized mechanisms may be in place to promote their splicing. Here, we survey deep genomic data sets to define 13,085 micro-exons and to study their splicing mechanisms and molecular functions. More than 60% of annotated human micro-exons exhibit a high level of sequence conservation, an indicator of functionality. While most human micro-exons require splicing-enhancing genomic features to be processed, the splicing of hundreds of micro-exons is enhanced by the adjacent binding of splice factors in the introns of pre-messenger RNAs. Notably, splicing of a significant number of micro-exons was found to be facilitated by the binding of RBFOX proteins, which promote their inclusion in the brain, muscle, and heart. Our analyses suggest that accurate regulation of micro-exon inclusion by RBFOX proteins and PTBP1 plays an important role in the maintenance of tissue-specific protein-protein interactions. © 2015 Li et al.; Published by Cold Spring Harbor Laboratory Press.
Lipphardt, Mark F; Deryal, Mustafa; Ong, Mei Fang; Schmidt, Werner; Mahlknecht, Ulrich
2013-01-01
Estrogen and progesterone hormones are key regulators of a wide variety of biological processes. In addition to their influence on reproduction, cell differentiation and apoptosis, they affect inflammatory response, cell metabolism and most importantly, they regulate physiological breast tissue proliferation and differentiation as well as the development and progression of breast cancer. In order to assess whether genetic variants in the steroid hormone receptor gene ESR1 (estrogen receptor alpha) had an effect on sporadic breast cancer susceptibility, we assessed 7 ESR1 single nucleotide polymorphisms (SNPs) for associations with breast cancer susceptibility and clinical parameters in 221 breast cancer patients and 221 controls, respectively. We identified ESR1 intron SNP +2464 C/T (rs3020314) and ESR1 intron SNP -4576 A/C (rs1514348) to correlate with breast cancer susceptibility and progesterone receptor expression status. Patients genotyped CT for ESR1 intron SNP +2464 (rs3020314) (p ≤ 0.045) or genotyped AC for ESR1 intron SNP -4576 (rs1514348) (p ≤ 0.000026) were identified to carry a significant risk as to the development of breast cancer in the Central European Caucasian population (both together: p ≤ 0.000488). Our study could confirm previous associations and revealed new associations of SNP rs1514348 with susceptibility to breast cancer and clinical outcome, which might be used as new additional SNP markers.
Yamada, Takashi; Onimatsu, Hideki; Van Etten, James L.
2007-01-01
Chlorella viruses or chloroviruses are large, icosahedral, plaque‐forming, double‐stranded‐DNA—containing viruses that replicate in certain strains of the unicellular green alga Chlorella. DNA sequence analysis of the 330‐kbp genome of Paramecium bursaria chlorella virus 1 (PBCV‐1), the prototype of this virus family (Phycodnaviridae), predict ∼366 protein‐encoding genes and 11 tRNA genes. The predicted gene products of ∼50% of these genes resemble proteins of known function, including many that are completely unexpected for a virus. In addition, the chlorella viruses have several features and encode many gene products that distinguish them from most viruses. These products include: (1) multiple DNA methyltransferases and DNA site‐specific endonucleases, (2) the enzymes required to glycosylate their proteins and synthesize polysaccharides such as hyaluronan and chitin, (3) a virus‐encoded K+ channel (called Kcv) located in the internal membrane of the virions, (4) a SET domain containing protein (referred to as vSET) that dimethylates Lys27 in histone 3, and (5) PBCV‐1 has three types of introns; a self‐splicing intron, a spliceosomal processed intron, and a small tRNA intron. Accumulating evidence indicates that the chlorella viruses have a very long evolutionary history. This review mainly deals with research on the virion structure, genome rearrangements, gene expression, cell wall degradation, polysaccharide synthesis, and evolution of PBCV‐1 as well as other related viruses. PMID:16877063
Dutta, Debargh; Gunasekera, Devi; Ragni, Margaret V; Pratt, Kathleen P
2016-12-27
The most frequent mutations resulting in hemophilia A are an intron 22 or intron 1 gene inversion, which together cause ∼50% of severe hemophilia A cases. We report a simple and accurate RNA-based assay to detect these mutations in patients and heterozygous carriers. The assays do not require specialized equipment or expensive reagents; therefore, they may provide useful and economic protocols that could be standardized for central laboratory testing. RNA is purified from a blood sample, and reverse transcription nested polymerase chain reaction (RT-NPCR) reactions amplify DNA fragments with the F8 sequence spanning the exon 22 to 23 splice site (intron 22 inversion test) or the exon 1 to 2 splice site (intron 1 inversion test). These sequences will be amplified only from F8 RNA without an intron 22 or intron 1 inversion mutation, respectively. Additional RT-NPCR reactions are then carried out to amplify the inverted sequences extending from F8 exon 19 to the first in-frame stop codon within intron 22 or a chimeric transcript containing F8 exon 1 and the VBP1 gene. These latter 2 products are produced only by individuals with an intron 22 or intron 1 inversion mutation, respectively. The intron 22 inversion mutations may be further classified (eg, as type 1 or type 2, reflecting the specific homologous recombination sites) by the standard DNA-based "inverse-shifting" PCR assay if desired. Efficient Bcl I and T4 DNA ligase enzymes that cleave and ligate DNA in minutes were used, which is a substantial improvement over previous protocols that required overnight incubations. These protocols can accurately detect F8 inversion mutations via same-day testing of patient samples.
Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F
2014-10-01
Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3' terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species.
Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F
2014-01-01
Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3′ terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species. PMID:24736785
Mollusk genes encoding lysine tRNA (UUU) contain introns.
Matsuo, M; Abe, Y; Saruta, Y; Okada, N
1995-11-20
New intron-containing genes encoding tRNAs were discovered when genomic DNA isolated from various animal species was amplified by the polymerase chain reaction (PCR) with primers based on sequences of rabbit tRNA(Lys). From sequencing analysis of the products of PCR, we found that introns are present in several genes encoding tRNA(Lys) in mollusks, such as Loligo bleekeri (squid) and Octopus vulgaris (octopus). These introns were specific to genes encoding tRNA(Lys)(CUU) and were not present in genes encoding tRNA(Lys)(CUU). In addition, the sequences of the introns were different from one another. To confirm the results of our initial experiments, we isolated and sequenced genes encoding tRNA(Lys)(CUU) and tRNA(Lys)(UUU). The gene for tRNA(Lys)(UUU) from squid contained an intron, whose sequence was the same as that identified by PCR, and the gene formed a cluster with a corresponding pseudogene. Several DNA regions of 2.1 kb containing this cluster appeared to be tandemly arrayed in the squid genome. By contrast, the gene encoding tRNA(Lys)(CUU) did not contain an intron, as shown also by PCR. The tRNA(Lys)(UUU) that corresponded to the analyzed gene was isolated and characterized. The present study provides the first example of an intron-containing gene encoding a tRNA in mollusks and suggests the universality of introns in such genes in higher eukaryotes.
Pre-Mrna Introns as a Model for Cryptographic Algorithm:. Theory and Experiments
NASA Astrophysics Data System (ADS)
Regoli, Massimo
2010-01-01
The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. In particular the RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.
a Simple Symmetric Algorithm Using a Likeness with Introns Behavior in RNA Sequences
NASA Astrophysics Data System (ADS)
Regoli, Massimo
2009-02-01
The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences has some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algoritnm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.
Bornstein, P; McKay, J; Liska, D J; Apone, S; Devarayalu, S
1988-01-01
The first intron of the human collagen alpha 1(I) gene contains several positively and negatively acting elements. We have studied the transcription of collagen-human growth hormone fusion genes, containing deletions and rearrangements of collagen intronic sequences, by transient transfection of chick tendon fibroblasts and NIH 3T3 cells. In chick tendon fibroblasts, but not in 3T3 cells, inversion of intronic sequences containing a previously studied 274-base-pair segment, A274, resulted in markedly reduced human growth hormone mRNA levels as determined by an RNase protection assay. This inhibitory effect was largely alleviated when deletions were introduced in the collagen promoter of plasmids containing negatively oriented intronic sequences. Evidence for interaction of the promoter with the intronic segment, A274, was obtained by gel mobility shift assays. We suggest that promoter-intron interactions, mediated by DNA-binding proteins, regulate collagen gene transcription. Inversion of intronic segments containing critical interactive elements might then lead to an altered geometry and reduced activity of a transcriptional complex in those cells with sufficiently high levels of appropriate transcription factors. We further suggest that the deleted promoter segment plays a key role in directing DNA interactions involved in transcriptional control. Images PMID:3211130
The in vivo use of alternate 3'-splice sites in group I introns.
Sellem, C H; Belcour, L
1994-04-11
Alternative splicing of group I introns has been postulated as a possible mechanism that would ensure the translation of proteins encoded into intronic open reading frames, discontinuous with the upstream exon and lacking an initiation signal. Alternate splice sites were previously depicted according to secondary structures of several group I introns. We present here strong evidence that, in the case of Podospora anserina nad 1-i4 and cox1-i7 mitochondrial introns, alternative splicing events do occur in vivo. Indeed, by PCR experiments we have detected molecules whose sequence is precisely that expected if the predicted alternate 3'-splice sites were used.
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen
2017-01-01
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing. PMID:29280736
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Pai, Athma A.; Henriques, Telmo; McCue, Kayla; ...
2017-12-27
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pai, Athma A.; Henriques, Telmo; McCue, Kayla
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less
Lakshmi, G. Girija; Ghosh, Sushmita; Jones, Gabriel P.; Parikh, Roshni; Rawlins, Bridgette A.; Vaughn, Jack C.
2014-01-01
Alternative splicing greatly enhances the diversity of proteins encoded by eukaryotic genomes, and is also important in gene expression control. In contrast to the great depth of knowledge as to molecular mechanisms in the splicing pathway itself, relatively little is known about the regulatory events behind this process. The 5′-UTR and 3′-UTR in pre-mRNAs play a variety of roles in controlling eukaryotic gene expression, including translational modulation, and nearly 4,000 of the roughly 14,000 protein coding genes in Drosophila contain introns of unknown functional significance in their 5′-UTR. Here we report the results of an RNA electrophoretic mobility shift analysis of Drosophila rnp-4f 5′-UTR intron 0 splicing regulatory proteins. The pre-mRNA potential regulatory element consists of an evolutionarily-conserved 177-nt stem-loop arising from pairing of intron 0 with part of adjacent exon 2. Incubation of in vitro transcribed probe with embryo protein extract is shown to result in two shifted RNA-protein bands, and protein extract from a dADAR null mutant fly line results in only one shifted band. A mutated stem-loop in which the conserved exon 2 primary sequence is changed but secondary structure maintained by introducing compensatory base changes results in diminished band shifts. To test the hypothesis that dADAR plays a role in intron splicing regulation in vivo, levels of unspliced rnp-4f mRNA in dADAR mutant were compared to wild-type via real-time qRT-PCR. The results show that during embryogenesis unspliced rnp-4f mRNA levels fall by up to 85% in the mutant, in support of the hypothesis. Taken together, these results demonstrate a novel role for dADAR protein in rnp-4f 5′-UTR alternative intron splicing regulation which is consistent with a previously proposed model. PMID:23026215
Zhang, Libin; Leibowitz, Michael J; Zhang, Yi
2009-02-18
Self-splicing of group I intron from the 26S rRNA of Candida albicans is essential for maturation of the host RNA. Here, we demonstrated that the co-transcriptional splicing of the intron in vitro was blocked by antisense oligonucleotides (AONs) targeting the P3-P7 core of the intron. The core-targeted AON effectively and specifically inhibited the intron splicing from its host RNA in living C. albicans. Furthermore, flow cytometry experiments showed that the growth inhibition was caused by a fungicidal effect. For the first time, we showed that an AON targeting the ribozyme core folding specifically inhibits the endogenous ribozyme splicing in living cells and specifically kills the intron-containing fungal strains, which sheds light on the development of antifungal drugs in the future.
He, Yi; Ahmad, Dawood; Zhang, Xu; Zhang, Yu; Wu, Lei; Jiang, Peng; Ma, Hongxiang
2018-04-19
Fusarium head blight (FHB), a devastating disease in wheat worldwide, results in yield loses and mycotoxin, such as deoxynivalenol (DON), accumulation in infected grains. DON also facilitates the pathogen colonization and spread of FHB symptoms during disease development. UDP-glycosyltransferase enzymes (UGTs) are known to contribute to detoxification and enhance FHB resistance by glycosylating DON into DON-3-glucoside (D3G) in wheat. However, a comprehensive investigation of wheat (Triticum aestivum) UGT genes is still lacking. In this study, we carried out a genome-wide analysis of family-1 UDP glycosyltransferases in wheat based on the PSPG conserved box that resulted in the identification of 179 putative UGT genes. The identified genes were clustered into 16 major phylogenetic groups with a lack of phylogenetic group K. The UGT genes were invariably distributed among all the chromosomes of the 3 genomes. At least 10 intron insertion events were found in the UGT sequences, where intron 4 was observed as the most conserved intron. The expression analysis of the wheat UGT genes using both online microarray data and quantitative real-time PCR verification suggested the distinct role of UGT genes in different tissues and developmental stages. The expression of many UGT genes was up-regulated after Fusarium graminearum inoculation, and six of the genes were further verified by RT-qPCR. We identified 179 UGT genes from wheat using the available sequenced wheat genome. This study provides useful insight into the phylogenetic structure, distribution, and expression patterns of family-1 UDP glycosyltransferases in wheat. The results also offer a foundation for future work aimed at elucidating the molecular mechanisms underlying the resistance to FHB and DON accumulation.
Paulsrud, Per; Lindblad, Peter
1998-01-01
We examined the genetic diversity of Nostoc symbionts in some lichens by using the tRNALeu (UAA) intron as a genetic marker. The nucleotide sequence was analyzed in the context of the secondary structure of the transcribed intron. Cyanobacterial tRNALeu (UAA) introns were specifically amplified from freshly collected lichen samples without previous DNA extraction. The lichen species used in the present study were Nephroma arcticum, Peltigera aphthosa, P. membranacea, and P. canina. Introns with different sizes around 300 bp were consistently obtained. Multiple clones from single PCRs were screened by using their single-stranded conformational polymorphism pattern, and the nucleotide sequence was determined. No evidence for sample heterogenity was found. This implies that the symbiont in situ is not a diverse community of cyanobionts but, rather, one Nostoc strain. Furthermore, each lichen thallus contained only one intron type, indicating that each thallus is colonized only once or that there is a high degree of specificity. The same cyanobacterial intron sequence was also found in samples of one lichen species from different localities. In a phylogenetic analysis, the cyanobacterial lichen sequences grouped together with the sequences from two free-living Nostoc strains. The size differences in the intron were due to insertions and deletions in highly variable regions. The sequence data were used in discussions concerning specificity and biology of the lichen symbiosis. It is concluded that the tRNALeu (UAA) intron can be of great value when examining cyanobacterial diversity. PMID:9435083
Analysis of Nearly One Thousand Mammalian Mirtrons Reveals Novel Features of Dicer Substrates
Shenker, Sol; Mohammed, Jaaved; Lai, Eric C.
2015-01-01
Mirtrons are microRNA (miRNA) substrates that utilize the splicing machinery to bypass the necessity of Drosha cleavage for their biogenesis. Expanding our recent efforts for mammalian mirtron annotation, we use meta-analysis of aggregate datasets to identify ~500 novel mouse and human introns that confidently generate diced small RNA duplexes. These comprise nearly 1000 total loci distributed in four splicing-mediated biogenesis subclasses, with 5'-tailed mirtrons as, by far, the dominant subtype. Thus, mirtrons surprisingly comprise a substantial fraction of endogenous Dicer substrates in mammalian genomes. Although mirtron-derived small RNAs exhibit overall expression correlation with their host mRNAs, we observe a subset with substantial differences that suggest regulated processing or accumulation. We identify characteristic sequence, length, and structural features of mirtron loci that distinguish them from bulk introns, and find that mirtrons preferentially emerge from genes with larger numbers of introns. While mirtrons generate miRNA-class regulatory RNAs, we also find that mirtrons exhibit many features that distinguish them from canonical miRNAs. We observe that conventional mirtron hairpins are substantially longer than Drosha-generated pre-miRNAs, indicating that the characteristic length of canonical pre-miRNAs is not a general feature of Dicer substrate hairpins. In addition, mammalian mirtrons exhibit unique patterns of ordered 5' and 3' heterogeneity, which reveal hidden complexity in miRNA processing pathways. These include broad 3'-uridylation of mirtron hairpins, atypically heterogeneous 5' termini that may result from exonucleolytic processing, and occasionally robust decapitation of the 5' guanine (G) of mirtron-5p species defined by splicing. Altogether, this study reveals that this extensive class of non-canonical miRNA bears a multitude of characteristic properties, many of which raise general mechanistic questions regarding the processing of endogenous hairpin transcripts. PMID:26325366
Three Group-I introns in 18S rDNA of Endosymbiotic Algae of Paramecium bursaria from Japan
NASA Astrophysics Data System (ADS)
Hoshina, Ryo; Kamako, Shin-ichiro; Imamura, Nobutaka
2004-08-01
In the nuclear encoded small subunit ribosomal DNA (18S rDNA) of symbiotic alga of Paramecium bursaria (F36 collected in Japan) possesses three intron-like insertions (Hoshina et al., unpubl. data, 2003). The present study confirmed these exact lengths and insertion sites by reverse transcription-PCR. Two of them were inserted at Escherichia coli 16S rRNA genic position 943 and 1512 that are frequent intron insertion positions, but another insertion position (nearly 1370) was the first finding. Their secondary structures suggested they belong to Group-I intron; one belongs to subgroup IE, others belong to subgroup IC1. Similarity search indicated these introns are ancestral ones.
Defective minor spliceosome mRNA processing results in isolated familial growth hormone deficiency
Argente, Jesús; Flores, Raquel; Gutiérrez-Arumí, Armand; Verma, Bhupendra; Martos-Moreno, Gabriel Á; Cuscó, Ivon; Oghabian, Ali; Chowen, Julie A; Frilander, Mikko J; Pérez-Jurado, Luis A
2014-01-01
The molecular basis of a significant number of cases of isolated growth hormone deficiency remains unknown. We describe three sisters affected with severe isolated growth hormone deficiency and pituitary hypoplasia caused by biallelic mutations in the RNPC3 gene, which codes for a minor spliceosome protein required for U11/U12 small nuclear ribonucleoprotein (snRNP) formation and splicing of U12-type introns. We found anomalies in U11/U12 di-snRNP formation and in splicing of multiple U12-type introns in patient cells. Defective transcripts include preprohormone convertases SPCS2 and SPCS3 and actin-related ARPC5L genes, which are candidates for the somatotroph-restricted dysfunction. The reported novel mechanism for familial growth hormone deficiency demonstrates that general mRNA processing defects of the minor spliceosome can lead to very narrow tissue-specific consequences. Subject Categories Genetics, Gene Therapy ' Genetic Disease; Metabolism PMID:24480542
[Alternative splicing regulation: implications in cancer diagnosis and treatment].
Martínez-Montiel, Nancy; Rosas-Murrieta, Nora; Martínez-Contreras, Rebeca
2015-04-08
The accurate expression of the genetic information is regulated by processes like mRNA splicing, proposed after the discoveries of Phil Sharp and Richard Roberts, who demonstrated the existence of intronic sequences, present in almost every structural eukaryotic gene, which should be precisely removed. This intron removal is called "splicing", which generates different proteins from a single mRNA, with different or even antagonistic functions. We currently know that alternative splicing is the most important source of protein diversity, given that 70% of the human genes undergo splicing and that mutations causing defects in this process could originate up to 50% of genetic diseases, including cancer. When these defects occur in genes involved in cell adhesion, proliferation and cell cycle regulation, there is an impact on cancer progression, rising the opportunity to diagnose and treat some types of cancer according to a particular splicing profile. Copyright © 2013 Elsevier España, S.L.U. All rights reserved.
Selfish DNA: homing endonucleases find a home.
Edgell, David R
2009-02-10
Self-splicing group I introns come in two flavours - those with a homing endonuclease to promote mobility of the intron, and those without an endonuclease. How homing endonucleases and self-splicing introns associate to form a composite selfish genetic element is a question of long-standing interest. Recent work has revealed that a shared characteristic of both introns and endonucleases, the targeting of conserved sequences, may provide the impetus for the evolution of composite mobile genetic elements.
Barnea-Yizhar, Ofer; Ram, Sigal; Kovalev, Ekaterina; Azriel, Aviva; Rand, Ulfert; Nakayama, Manabu; Hauser, Hansjörg; Gepstein, Lior; Levi, Ben-Zion
2016-01-01
Interferon Regulatory Factor-8 (IRF-8) serves as a key factor in the hierarchical differentiation towards monocyte/dendritic cell lineages. While much insight has been accumulated into the mechanisms essential for its hematopoietic specific expression, the mode of restricting IRF-8 expression in non-hematopoietic cells is still unknown. Here we show that the repression of IRF-8 expression in restrictive cells is mediated by its 3rd intron. Removal of this intron alleviates the repression of Bacterial Artificial Chromosome (BAC) IRF-8 reporter gene in these cells. Fine deletion analysis points to conserved regions within this intron mediating its restricted expression. Further, the intron alone selectively initiates gene silencing only in expression-restrictive cells. Characterization of this intron’s properties points to its role as an initiator of sustainable gene silencing inducing chromatin condensation with suppressive histone modifications. This intronic element cannot silence episomal transgene expression underlining a strict chromatin-dependent silencing mechanism. We validated this chromatin-state specificity of IRF-8 intron upon in-vitro differentiation of induced pluripotent stem cells (iPSCs) into cardiomyocytes. Taken together, the IRF-8 3rd intron is sufficient and necessary to initiate gene silencing in non-hematopoietic cells, highlighting its role as a nucleation core for repressed chromatin during differentiation. PMID:27257682
Emblem, Åse; Karlsen, Bård Ove; Evertsen, Jussi; Johansen, Steinar D
2011-11-01
Group I introns are genetic insertion elements that invade host genomes in a wide range of organisms. In metazoans, however, group I introns are extremely rare, so far only identified within mitogenomes of hexacorals and some sponges. We sequenced the complete mitogenome of the cold-water scleractinian coral Lophelia pertusa, the dominating deep sea reef-building coral species in the North Atlantic Ocean. The mitogenome (16,150 bp) has the same gene content but organized in a unique gene order compared to that of other known scleractinian corals. A complex group I intron (6460 bp) inserted in the ND5 gene (position 717) was found to host seven essential mitochondrial protein genes and one ribosomal RNA gene. Phylogenetic analysis supports a vertical inheritance pattern of the ND5-717 intron among hexacoral mitogenomes with no examples of intron loss. Structural assessments of the Lophelia intron revealed an unusual organization that lacks the universally conserved ωG at the 3' end, as well as a highly compact RNA core structure with overlapping ribozyme and protein coding capacities. Based on phylogenetic and structural analyses we reconstructed the evolutionary history of ND5-717, from its ancestral protist origin, through intron loss in some early metazoan lineages, and into a compulsory feature with functional implications in hexacorals. Copyright © 2011 Elsevier Inc. All rights reserved.
Reduced mutation rate in exons due to differential mismatch repair
Mularoni, Loris; Muiños, Ferran; Gonzalez-Perez, Abel; López-Bigas, Núria
2017-01-01
While recent studies have revealed higher than anticipated heterogeneity of mutation rate across genomic regions, mutations in exons and introns are assumed to be generated at the same rate. Here we find fewer somatic mutations in exons than expected based on their sequence content, and demonstrate that this is not due to purifying selection. Moreover, we show that it is caused by higher mismatch repair activity in exonic than in intronic regions. Our findings have important implications for our understanding of mutational and DNA repair processes, our knowledge of the evolution of eukaryotic genes, and practical ramifications for the study of the evolution of both tumors and species. PMID:29106418
Johanson, Urban; Karlsson, Maria; Johansson, Ingela; Gustavsson, Sofia; Sjövall, Sara; Fraysse, Laure; Weig, Alfons R.; Kjellbom, Per
2001-01-01
Major intrinsic proteins (MIPs) facilitate the passive transport of small polar molecules across membranes. MIPs constitute a very old family of proteins and different forms have been found in all kinds of living organisms, including bacteria, fungi, animals, and plants. In the genomic sequence of Arabidopsis, we have identified 35 different MIP-encoding genes. Based on sequence similarity, these 35 proteins are divided into four different subfamilies: plasma membrane intrinsic proteins, tonoplast intrinsic proteins, NOD26-like intrinsic proteins also called NOD26-like MIPs, and the recently discovered small basic intrinsic proteins. In Arabidopsis, there are 13 plasma membrane intrinsic proteins, 10 tonoplast intrinsic proteins, nine NOD26-like intrinsic proteins, and three small basic intrinsic proteins. The gene structure in general is conserved within each subfamily, although there is a tendency to lose introns. Based on phylogenetic comparisons of maize (Zea mays) and Arabidopsis MIPs (AtMIPs), it is argued that the general intron patterns in the subfamilies were formed before the split of monocotyledons and dicotyledons. Although the gene structure is unique for each subfamily, there is a common pattern in how transmembrane helices are encoded on the exons in three of the subfamilies. The nomenclature for plant MIPs varies widely between different species but also between subfamilies in the same species. Based on the phylogeny of all AtMIPs, a new and more consistent nomenclature is proposed. The complete set of AtMIPs, together with the new nomenclature, will facilitate the isolation, classification, and labeling of plant MIPs from other species. PMID:11500536
Post-transcriptional regulation mediated by specific neurofilament introns in vivo.
Wang, Chen; Szaro, Ben G
2016-04-01
Neurons regulate genes post-transcriptionally to coordinate the supply of cytoskeletal proteins, such as the medium neurofilament (NEFM), with demand for structural materials in response to extracellular cues encountered by developing axons. By using a method for evaluating functionality of cis-regulatory gene elements in vivo through plasmid injection into Xenopus embryos, we discovered that splicing of a specific nefm intron was required for robust transgene expression, regardless of promoter or cell type. Transgenes utilizing the nefm 3'-UTR but substituting other nefm introns expressed little or no protein owing to defects in handling of the messenger (m)RNA as opposed to transcription or splicing. Post-transcriptional events at multiple steps, but mainly during nucleocytoplasmic export, contributed to these varied levels of protein expression. An intron of the β-globin gene was also able to promote expression in a manner identical to that of the nefm intron, implying a more general preference for certain introns in controlling nefm expression. These results expand our knowledge of intron-mediated gene expression to encompass neurofilaments, indicating an additional layer of complexity in the control of a cytoskeletal gene needed for developing and maintaining healthy axons. © 2016. Published by The Company of Biologists Ltd.
Horiuchi, Keiko; Perez-Cerezales, Serafín; Papasaikas, Panagiotis; Ramos-Ibeas, Priscila; López-Cardona, Angela Patricia; Laguna-Barraza, Ricardo; Fonseca Balvís, Noelia; Pericuesta, Eva; Fernández-González, Raul; Planells, Benjamín; Viera, Alberto; Suja, Jose Angel; Ross, Pablo Juan; Alén, Francisco; Orio, Laura; Rodriguez de Fonseca, Fernando; Pintado, Belén; Valcárcel, Juan; Gutiérrez-Adán, Alfonso
2018-04-03
The U2AF35-like ZRSR1 has been implicated in the recognition of 3' splice site during spliceosome assembly, but ZRSR1 knockout mice do not show abnormal phenotypes. To analyze ZRSR1 function and its precise role in RNA splicing, we generated ZRSR1 mutant mice containing truncating mutations within its RNA-recognition motif. Homozygous mutant mice exhibited severe defects in erythrocytes, muscle stretch, and spermatogenesis, along with germ cell sloughing and apoptosis, ultimately leading to azoospermia and male sterility. Testis RNA sequencing (RNA-seq) analyses revealed increased intron retention of both U2- and U12-type introns, including U12-type intron events in genes with key functions in spermatogenesis and spermatid development. Affected U2 introns were commonly found flanking U12 introns, suggesting functional cross-talk between the two spliceosomes. The splicing and tissue defects observed in mutant mice attributed to ZRSR1 loss of function suggest a physiological role for this factor in U12 intron splicing. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
De Wachter, R; Neefs, J M; Goris, A; Van de Peer, Y
1992-01-01
The nucleotide sequence of the gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis was determined. It revealed the presence of a group I intron with a length of 411 nucleotides. This is the third occurrence of such an intron discovered in a small subunit rRNA gene encoded by a eukaryotic nuclear genome. The other two occurrences are in Pneumocystis carinii, a fungus of uncertain taxonomic status, and Ankistrodesmus stipitatus, a green alga. The nucleotides of the conserved core structure of 101 group I intron sequences present in different genes and genome types were aligned and their evolutionary relatedness was examined. This revealed a cluster including all group I introns hitherto found in eukaryotic nuclear genes coding for small and large subunit rRNAs. A secondary structure model was designed for the area of the Ustilago maydis small ribosomal subunit RNA precursor where the intron is situated. It shows that the internal guide sequence pairing with the intron boundaries fits between two helices of the small subunit rRNA, and that minimal rearrangement of base pairs suffices to achieve the definitive secondary structure of the 18S rRNA upon splicing. PMID:1561081
Genomic structure of two ras family genes in the slime mold Physarum polycephalum.
Trzcińska-Danielewicz, Joanna; Kozlowski, Piotr; Gierdal, Katarzyna; Wiejak, Jolanta; Jagielski, Adam; Toczko, Kazimierz; Fronk, Jan
2002-08-01
Genomic structure of two Physarum polycephalum ras family genes, Ppras2 and Pprap1, has been determined, including the upstream region of the latter. The genes are interrupted by three and four introns, respectively. The first intron of Ppras2 has the same location within the coding sequence as the first intron in another ras homolog from this organism, Ppras1 [Trzcińska-Danielewicz, J., Kozlowski, P., and Toczko, K. (1996). "Cloning and genomic sequence of the Physarum polycephalum Ppras1 gene, a homologue of the ras protooncogene", Gene 169, pp. 143-144]. All introns, ranging from 53 to ca. 460 base pairs, have the canonical 5' and 3' ends, are greatly enriched in pyrimidines in the coding strand and have frequent pyrimidines-only tracts. These latter features seem to be responsible for the difficulties in cloning and sequencing of parts of these genes. Short sequences shared with P. polycephalum transposon-like repeats are common in the introns, indicating a possible role of transposition in intron evolution. In all three ras family genes phase zero introns are located mostly between sequences coding for regular protein secondary structure elements.
Kaer, Kristel; Branovets, Jelena; Hallikma, Anni; Nigumann, Pilvi; Speek, Mart
2011-01-01
Background Transcriptional interference has been recently recognized as an unexpectedly complex and mostly negative regulation of genes. Despite a relatively few studies that emerged in recent years, it has been demonstrated that a readthrough transcription derived from one gene can influence the transcription of another overlapping or nested gene. However, the molecular effects resulting from this interaction are largely unknown. Methodology/Principal Findings Using in silico chromosome walking, we searched for prematurely terminated transcripts bearing signatures of intron retention or exonization of intronic sequence at their 3′ ends upstream to human L1 retrotransposons, protein-coding and noncoding nested genes. We demonstrate that transcriptional interference induced by intronic L1s (or other repeated DNAs) and nested genes could be characterized by intron retention, forced exonization and cryptic polyadenylation. These molecular effects were revealed from the analysis of endogenous transcripts derived from different cell lines and tissues and confirmed by the expression of three minigenes in cell culture. While intron retention and exonization were comparably observed in introns upstream to L1s, forced exonization was preferentially detected in nested genes. Transcriptional interference induced by L1 or nested genes was dependent on the presence or absence of cryptic splice sites, affected the inclusion or exclusion of the upstream exon and the use of cryptic polyadenylation signals. Conclusions/Significance Our results suggest that transcriptional interference induced by intronic L1s and nested genes could influence the transcription of the large number of genes in normal as well as in tumor tissues. Therefore, this type of interference could have a major impact on the regulation of the host gene expression. PMID:22022525
Molecular characterization and expression analysis of WRKY family genes in Dendrobium officinale.
Wang, Tao; Song, Zheng; Wei, Li; Li, Lubin
2018-03-01
The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators, and the members regulate multiple biological processes. However, there is limited information on WRKYs in Dendrobium officinale. In this study, 52 WRKY family genes of D. officinale were surveyed for the first time. Conserved domain, phylogenetic, exon-intron construction, and expression analyses were performed for the DoWRKY genes. Two major types of intron splicing (PR and VQR introns) were found, and the intron insertion position was observed to be relatively conserved in the conserved DoWRKY domains. The expression profiles of nine DoWRKYs were analyzed in cold- and methyl jasmonate (MeJA)-treated D. officinale seedlings; the DoWRKYs showed significant expression changes at different levels, which suggested their vital roles in stress tolerance. Moreover, the expression trends of most of the DoWRKYs after the simultaneous cold stress and MeJA treatment were the opposite of those of DoWRKYs after the individual cold stress and MeJA treatments, suggesting that the two stresses might have antagonistic effects and affect the adaptive capacity of the plants to stresses. Twelve DoWRKY genes were differentially expressed between symbiotic and asymbiotic germinated seeds; all were upregulated in the symbiotic germinated seeds except DoWRKY16. These differences in expression of DoWRKYs might be involved in promoting in vitro symbiotic germination of seeds with Tulasnella-like fungi. Our findings will be useful for further studies on the WRKY family genes in orchids.
Lu, Jiamiao; Williams, James A.; Luke, Jeremy; Zhang, Feijie; Chu, Kirk; Kay, Mark A.
2017-01-01
We previously developed a mini-intronic plasmid (MIP) expression system in which the essential bacterial elements for plasmid replication and selection are placed within an engineered intron contained within a universal 5′ UTR noncoding exon. Like minicircle DNA plasmids (devoid of bacterial backbone sequences), MIP plasmids overcome transcriptional silencing of the transgene. However, in addition MIP plasmids increase transgene expression by 2 and often >10 times higher than minicircle vectors in vivo and in vitro. Based on these findings, we examined the effects of the MIP intronic sequences in a recombinant adeno-associated virus (AAV) vector system. Recombinant AAV vectors containing an intron with a bacterial replication origin and bacterial selectable marker increased transgene expression by 40 to 100 times in vivo when compared with conventional AAV vectors. Therefore, inclusion of this noncoding exon/intron sequence upstream of the coding region can substantially enhance AAV-mediated gene expression in vivo. PMID:27903072
Invariant U2 snRNA nucleotides form a stem loop to recognize the intron early in splicing
Perriman, Rhonda; Ares, Manuel
2010-01-01
U2 snRNA-intron branchpoint pairing is a critical step in pre-mRNA recognition by the splicing apparatus, but the mechanism by which these two RNAs engage each other is unknown. Here we identify a new U2 snRNA structure, the branchpoint interaction stem-loop (BSL), that presents the U2 nucleotides that will contact the intron. We provide evidence that the BSL forms prior to interaction with the intron, and is disrupted by the DExD/H protein Prp5p during engagement of the snRNA with the intron. In vitro splicing complex assembly in a BSL-destabilized mutant extract suggests that the BSL is required at a previously unrecognized step between commitment complex and prespliceosome formation. The extreme evolutionary conservation of the BSL suggests it represents an ancient structural solution to the problem of intron branchpoint recognition by dynamic RNA elements that must serve multiple functions at other times during splicing. PMID:20471947
PIECE 2.0: an update for the plant gene structure comparison and evolution database
USDA-ARS?s Scientific Manuscript database
PIECE (Plant Intron Exon Comparision and Evolution) is a web-accessible database that houses intron and exon information of plant genes. PIECE serves as a resource for biologists interested in comparing intron-exon organization and provides valuable insights into the evolution of gene structure in ...
Functional comparison of three transformer gene introns regulating conditional female lethality
USDA-ARS?s Scientific Manuscript database
The trasformer gene plays a critical role in the sex determination pathways of many insects. We cloned two transformer gene introns from Anastrepha suspensa, the Caribbean fruit fly. These introns have sequences that putatively have a role in sex-specific splicing patterns that affect sex determinat...
Choosing and Using Introns in Molecular Phylogenetics
Creer, Simon
2007-01-01
Introns are now commonly used in molecular phylogenetics in an attempt to recover gene trees that are concordant with species trees, but there are a range of genomic, logistical and analytical considerations that are infrequently discussed in empirical studies that utilize intron data. This review outlines expedient approaches for locus selection, overcoming paralogy problems, recombination detection methods and the identification and incorporation of LVHs in molecular systematics. A range of parsimony and Bayesian analytical approaches are also described in order to highlight the methods that can currently be employed to align sequences and treat indels in subsequent analyses. By covering the main points associated with the generation and analysis of intron data, this review aims to provide a comprehensive introduction to using introns (or any non-coding nuclear data partition) in contemporary phylogenetics. PMID:19461984
Ślipiko, Monika; Buczkowska-Chmielewska, Katarzyna; Bączkiewicz, Alina; Szczecińska, Monika; Sawicki, Jakub
2017-01-01
Liverwort mitogenomes are considered to be evolutionarily stable. A comparative analysis of four Calypogeia species revealed differences compared to previously sequenced liverwort mitogenomes. Such differences involve unexpected structural changes in the two genes, cox1 and atp1, which have lost three and two introns, respectively. The group I introns in the cox1 gene are proposed to have been lost by two-step localized retroprocessing, whereas one-step retroprocessing could be responsible for the disappearance of the group II introns in the atp1 gene. These cases represent the first identified losses of introns in mitogenomes of leafy liverworts (Jungermanniopsida) contrasting the stability of mitochondrial gene order with certain changes in the gene content and intron set in liverworts. PMID:29257096
Bio—Cryptography: A Possible Coding Role for RNA Redundancy
NASA Astrophysics Data System (ADS)
Regoli, M.
2009-03-01
The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions," are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behavior in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.
Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun
2016-06-04
Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.
RNA chaperone StpA loosens interactions of the tertiary structure in the td group I intron in vivo
Waldsich, Christina; Grossberger, Rupert; Schroeder, Renée
2002-01-01
Efficient splicing of the td group I intron in vivo is dependent on the ribosome. In the absence of translation, the pre-mRNA is trapped in nonnative-splicing-incompetent conformations. Alternatively, folding of the pre-mRNA can be promoted by the RNA chaperone StpA or by the group I intron-specific splicing factor Cyt-18. To understand the mechanism of action of RNA chaperones, we probed the impact of StpA on the structure of the td intron in vivo. Our data suggest that StpA loosens tertiary interactions. The most prominent structural change was the opening of the base triples, which are involved in the correct orientation of the two major intron core domains. In line with the destabilizing activity of StpA, splicing of mutant introns with a reduced structural stability is sensitive to StpA. In contrast, Cyt-18 strengthens tertiary contacts, thereby rescuing splicing of structurally compromised td mutants in vivo. Our data provide direct evidence for protein-induced conformational changes within catalytic RNA in vivo. Whereas StpA resolves tertiary contacts enabling the RNA to refold, Cyt-18 contributes to the overall compactness of the td intron in vivo. PMID:12208852
Lunghi, Matteo; Spano, Furio; Magini, Alessandro; Emiliani, Carla; Carruthers, Vern B; Di Cristina, Manlio
2016-02-01
Apicomplexan parasites including Toxoplasma gondii and Plasmodium species have complex life cycles that include multiple hosts and differentiation through several morphologically distinct stages requiring marked changes in gene expression. This review highlights emerging evidence implicating regulation of mRNA splicing as a mechanism to prime these parasites for rapid gene expression upon differentiation. We summarize the most important insights in alternative splicing including its role in regulating gene expression by decreasing mRNA abundance via 'Regulated Unproductive Splicing and Translation'. As a related but less well-understood mechanism, we discuss also our recent work suggesting a role for intron retention for precluding translation of stage specific isoforms of T. gondii glycolytic enzymes. We additionally provide new evidence that intron retention might be a widespread mechanism during parasite differentiation. Supporting this notion, recent genome-wide analysis of Toxoplasma and Plasmodium suggests intron retention is more pervasive than heretofore thought. These findings parallel recent emergence of intron retention being more prevalent in mammals than previously believed, thereby adding to the established roles in plants, fungi and unicellular eukaryotes. Deeper mechanistic studies of intron retention will provide important insight into its role in regulating gene expression in apicomplexan parasites and more general in eukaryotic organisms.
Two distinct promoters drive transcription of the human D1A dopamine receptor gene.
Lee, S H; Minowa, M T; Mouradian, M M
1996-10-11
The human D1A dopamine receptor gene has a GC-rich, TATA-less promoter located upstream of a small, noncoding exon 1, which is separated from the coding exon 2 by a 116-base pair (bp)-long intron. Serial 3'-deletions of the 5'-noncoding region of this gene, including the intron and 5'-end of exon 2, resulted in 80 and 40% decrease in transcriptional activity of the upstream promoter in two D1A-expressing neuroblastoma cell lines, SK-N-MC and NS20Y, respectively. To investigate the function of this region, the intron and 245 bp at the 5'-end of exon 2 were investigated. Transient expression analyses using various chloramphenicol acetyltransferase constructs showed that the transcriptional activity of the intron is higher than that of the upstream promoter by 12-fold in SK-N-MC cells and by 5.5-fold in NS20Y cells in an orientation-dependent manner, indicating that the D1A intron is a strong promoter. Primer extension and ribonuclease protection assays revealed that transcription driven by the intron promoter is initiated at the junction of intron and exon 2 and at a cluster of nucleotides located 50 bp downstream from this junction. The same transcription start sites are utilized by the chloramphenicol acetyltransferase constructs employed in transfections as well as by the D1A gene expressed within the human caudate. The relative abundance of D1A transcripts originating from the upstream promoter compared with those transcribed from the intron promoter is 1.5-2.9 times in SK-N-MC cells and 2 times in the human caudate. Transcript stability studies in SK-N-MC cells revealed that longer D1A mRNA molecules containing exon 1 are degraded 1.8 times faster than shorter transcripts lacking exon 1. Although gel mobility shift assay could not detect DNA-protein interaction at the D1A intron, competitive co-transfection using the intron as competitor confirmed the presence of trans-acting factors at the intron. These data taken together indicate that the human D1A gene has two functional TATA-less promoters, both in D1A expressing cultured neuroblastoma cells and in the human striatum.
Barvkar, Vitthal T; Pardeshi, Varsha C; Kale, Sandip M; Kadoo, Narendra Y; Gupta, Vidya S
2012-05-08
The glycosylation process, catalyzed by ubiquitous glycosyltransferase (GT) family enzymes, is a prevalent modification of plant secondary metabolites that regulates various functions such as hormone homeostasis, detoxification of xenobiotics and biosynthesis and storage of secondary metabolites. Flax (Linum usitatissimum L.) is a commercially grown oilseed crop, important because of its essential fatty acids and health promoting lignans. Identification and characterization of UDP glycosyltransferase (UGT) genes from flax could provide valuable basic information about this important gene family and help to explain the seed specific glycosylated metabolite accumulation and other processes in plants. Plant genome sequencing projects are useful to discover complexity within this gene family and also pave way for the development of functional genomics approaches. Taking advantage of the newly assembled draft genome sequence of flax, we identified 137 UDP glycosyltransferase (UGT) genes from flax using a conserved signature motif. Phylogenetic analysis of these protein sequences clustered them into 14 major groups (A-N). Expression patterns of these genes were investigated using publicly available expressed sequence tag (EST), microarray data and reverse transcription quantitative real time PCR (RT-qPCR). Seventy-three per cent of these genes (100 out of 137) showed expression evidence in 15 tissues examined and indicated varied expression profiles. The RT-qPCR results of 10 selected genes were also coherent with the digital expression analysis. Interestingly, five duplicated UGT genes were identified, which showed differential expression in various tissues. Of the seven intron loss/gain positions detected, two intron positions were conserved among most of the UGTs, although a clear relationship about the evolution of these genes could not be established. Comparison of the flax UGTs with orthologs from four other sequenced dicot genomes indicated that seven UGTs were flax diverged. Flax has a large number of UGT genes including few flax diverged ones. Phylogenetic analysis and expression profiles of these genes identified tissue and condition specific repertoire of UGT genes from this crop. This study would facilitate precise selection of candidate genes and their further characterization of substrate specificities and in planta functions.
2012-01-01
Background The glycosylation process, catalyzed by ubiquitous glycosyltransferase (GT) family enzymes, is a prevalent modification of plant secondary metabolites that regulates various functions such as hormone homeostasis, detoxification of xenobiotics and biosynthesis and storage of secondary metabolites. Flax (Linum usitatissimum L.) is a commercially grown oilseed crop, important because of its essential fatty acids and health promoting lignans. Identification and characterization of UDP glycosyltransferase (UGT) genes from flax could provide valuable basic information about this important gene family and help to explain the seed specific glycosylated metabolite accumulation and other processes in plants. Plant genome sequencing projects are useful to discover complexity within this gene family and also pave way for the development of functional genomics approaches. Results Taking advantage of the newly assembled draft genome sequence of flax, we identified 137 UDP glycosyltransferase (UGT) genes from flax using a conserved signature motif. Phylogenetic analysis of these protein sequences clustered them into 14 major groups (A-N). Expression patterns of these genes were investigated using publicly available expressed sequence tag (EST), microarray data and reverse transcription quantitative real time PCR (RT-qPCR). Seventy-three per cent of these genes (100 out of 137) showed expression evidence in 15 tissues examined and indicated varied expression profiles. The RT-qPCR results of 10 selected genes were also coherent with the digital expression analysis. Interestingly, five duplicated UGT genes were identified, which showed differential expression in various tissues. Of the seven intron loss/gain positions detected, two intron positions were conserved among most of the UGTs, although a clear relationship about the evolution of these genes could not be established. Comparison of the flax UGTs with orthologs from four other sequenced dicot genomes indicated that seven UGTs were flax diverged. Conclusions Flax has a large number of UGT genes including few flax diverged ones. Phylogenetic analysis and expression profiles of these genes identified tissue and condition specific repertoire of UGT genes from this crop. This study would facilitate precise selection of candidate genes and their further characterization of substrate specificities and in planta functions. PMID:22568875
Towards barcode markers in Fungi: an intron map of Ascomycota mitochondria.
Santamaria, Monica; Vicario, Saverio; Pappadà, Graziano; Scioscia, Gaetano; Scazzocchio, Claudio; Saccone, Cecilia
2009-06-16
A standardized and cost-effective molecular identification system is now an urgent need for Fungi owing to their wide involvement in human life quality. In particular the potential use of mitochondrial DNA species markers has been taken in account. Unfortunately, a serious difficulty in the PCR and bioinformatic surveys is due to the presence of mobile introns in almost all the fungal mitochondrial genes. The aim of this work is to verify the incidence of this phenomenon in Ascomycota, testing, at the same time, a new bioinformatic tool for extracting and managing sequence databases annotations, in order to identify the mitochondrial gene regions where introns are missing so as to propose them as species markers. The general trend towards a large occurrence of introns in the mitochondrial genome of Fungi has been confirmed in Ascomycota by an extensive bioinformatic analysis, performed on all the entries concerning 11 mitochondrial protein coding genes and 2 mitochondrial rRNA (ribosomal RNA) specifying genes, belonging to this phylum, available in public nucleotide sequence databases. A new query approach has been developed to retrieve effectively introns information included in these entries. After comparing the new query-based approach with a blast-based procedure, with the aim of designing a faithful Ascomycota mitochondrial intron map, the first method appeared clearly the most accurate. Within this map, despite the large pervasiveness of introns, it is possible to distinguish specific regions comprised in several genes, including the full NADH dehydrogenase subunit 6 (ND6) gene, which could be considered as barcode candidates for Ascomycota due to their paucity of introns and to their length, above 400 bp, comparable to the lower end size of the length range of barcodes successfully used in animals. The development of the new query system described here would answer the pressing requirement to improve drastically the bioinformatics support to the DNA Barcode Initiative. The large scale investigation of Ascomycota mitochondrial introns performed through this tool, allowing to exclude the introns-rich sequences from the barcode candidates exploration, could be the first step towards a mitochondrial barcoding strategy for these organisms, similar to the standard approach employed in metazoans.
On splice site prediction using weight array models: a comparison of smoothing techniques
NASA Astrophysics Data System (ADS)
Taher, Leila; Meinicke, Peter; Morgenstern, Burkhard
2007-11-01
In most eukaryotic genes, protein-coding exons are separated by non-coding introns which are removed from the primary transcript by a process called "splicing". The positions where introns are cut and exons are spliced together are called "splice sites". Thus, computational prediction of splice sites is crucial for gene finding in eukaryotes. Weight array models are a powerful probabilistic approach to splice site detection. Parameters for these models are usually derived from m-tuple frequencies in trusted training data and subsequently smoothed to avoid zero probabilities. In this study we compare three different ways of parameter estimation for m-tuple frequencies, namely (a) non-smoothed probability estimation, (b) standard pseudo counts and (c) a Gaussian smoothing procedure that we recently developed.
Meher, J K; Meher, P K; Dash, G N; Raval, M K
2012-01-01
The first step in gene identification problem based on genomic signal processing is to convert character strings into numerical sequences. These numerical sequences are then analysed spectrally or using digital filtering techniques for the period-3 peaks, which are present in exons (coding areas) and absent in introns (non-coding areas). In this paper, we have shown that single-indicator sequences can be generated by encoding schemes based on physico-chemical properties. Two new methods are proposed for generating single-indicator sequences based on hydration energy and dipole moments. The proposed methods produce high peak at exon locations and effectively suppress false exons (intron regions having greater peak than exon regions) resulting in high discriminating factor, sensitivity and specificity.
Contrasting population structure from nuclear intron sequences and mtDNA of humpback whales.
Palumbi, S R; Baker, C S
1994-05-01
Powerful analyses of population structure require information from multiple genetic loci. To help develop a molecular toolbox for obtaining this information, we have designed universal oligonucleotide primers that span conserved intron-exon junctions in a wide variety of animal phyla. We test the utility of exon-primed, intron-crossing amplifications by analyzing the variability of actin intron sequences from humpback, blue, and bowhead whales and comparing the results with mitochondrial DNA (mtDNA) haplotype data. Humpback actin introns fall into two major clades that exist in different frequencies in different oceanic populations. It is surprising that Hawaii and California populations, which are very distinct in mtDNAs, are similar in actin intron alleles. This discrepancy between mtDNA and nuclear DNA results may be due either to differences in genetic drift in mitochondrial and nuclear genes or to preferential movement of males, which do not transmit mtDNA to offspring, between separate breeding grounds. Opposing mtDNA and nuclear DNA results can help clarify otherwise hidden patterns of structure in natural populations.
Tooley, Paul W; Bandyopadhyay, Ranajit; Carras, Marie M; Pazoutová, Sylvie
2006-04-01
Isolates of Claviceps causing ergot on sorghum in India were analysed by AFLP analysis, and by analysis of DNA sequences of the EF-1alpha gene intron 4 and beta-tubulin gene intron 3 region. Of 89 isolates assayed from six states in India, four were determined to be C. sorghi, and the rest C. africana. A relatively low level of genetic diversity was observed within the Indian C. africana population. No evidence of genetic exchange between C. africana and C. sorghi was observed in either AFLP or DNA sequence analysis. Phylogenetic analysis was conducted using DNA sequences from 14 different Claviceps species. A multigene phylogeny based on the EF-1alpha gene intron 4, the beta-tubulin gene intron 3 region, and rDNA showed that C. sorghi grouped most closely with C. gigantea and C. africana. Although the Claviceps species we analysed were closely related, they colonize hosts that are taxonomically very distinct suggesting that there is no direct coevolution of Claviceps with its hosts.
Park, Linda K.; Hauser, Lorenz
2018-01-01
Interspecific hybridization is often seen as a major conservation issue, potentially threatening endangered species and decreasing biodiversity. In natural populations, the conservation implications of hybridization depends on both on anthropogenic factors and the evolutionary processes maintaining the hybrid zone. However, the timeline and patterns of hybridization in the hybrid zone are often not known. Therefore, species conservation becomes a concern when recent anthropogenic changes influence hybridization and not if hybridization is part of a long-term process. Here, we use sequence data from one mitochondrial gene, three nuclear introns and one nuclear exon to estimate the direction, geographic extent, frequency and possible timeline of hybridization between three rockfish species (Sebastes auriculatus, S. caurinus, S. maliger) in the Salish Sea, Washington, USA. We show that (i) introgression occurred much more frequently in the Salish Sea than on the outer coast, (ii) introgression was highly asymmetrical from S. maliger into the other two species, (iii) almost 40% of individuals in the Salish Sea were hybrids, with frequency of hybrids increasing with isolation from the coast, and (iv) all hybrids were later generation backcrosses rather than F1 hybrids. Our results suggest long-standing low-level hybridization rather than recent onset of interbreeding because of human induced environmental change, possibly facilitated by specific environmental conditions in the sub-basins of the Salish Sea, and by differences in population sizes during recolonization of the area after the last glaciation. This rockfish hybrid system, with asymmetrical introgression and the maintenance of parental species, may prove useful to study both mechanisms that maintain species boundaries and that facilitate speciation in the presence of rapid environmental change. PMID:29566070
Hardy, C M; Clark-Walker, G D
1991-07-01
The cytochrome oxidase subunit 1 gene (COX1) in K. lactis K8 mtDNA spans 8,826 bp and contains five exons (termed E1-E5) totalling 1,602 bp that show 88% nucleotide base matching and 91% amino acid homology to the equivalent gene in S. cerevisiae. The four introns (termed K1 cox1.1-1.4) contain open reading frames encoding proteins of 786, 333, 319 and 395 amino acids respectively that potentially encode maturase enzymes. The first intron belongs to group II whereas the remaining three are group I type B. Introns K1 cox1.1, 1.3, and 1.4 are found at identical locations to introns Sc cox1.2, 1.5 a, and 1.5 b respectively from S. cerevisiae. Horizontal transfer of an intron between recent progenitors of K. lactis and S. cerevisiae is suggested by the observation that K1 cox1.1 and Sc cox1.2 show 96% base matching. Sequence comparisons between K1 cox1.3/Sc cox1.5 a and K1 cox1.4/Sc cox1.5 b suggest that these introns are likely to have been present in the ancestral COX1 gene of these yeasts. Intron K1 cox1.2 is not found in S. cerevisiae and appears at an unique location in K. lactis. A feature of the DNA sequences of the group I introns K1 cox1.2, 1.3, and 1.4 is the presence of 11 GC-rich clusters inserted into both coding and noncoding regions. Immediately downstream of the COX1 gene is the ATPase subunit 8 gene (A8) that shows 82.6% base matching to its counterpart in S. cerevisiae mtDNA.
Bennett, Matthew S.; Triemer, Richard E.; Preisfeld, Angelika
2017-01-01
Background Over the last few years multiple studies have been published showing a great diversity in size of chloroplast genomes (cpGenomes), and in the arrangement of gene clusters, in the Euglenales. However, while these genomes provided important insights into the evolution of cpGenomes across the Euglenales and within their genera, only two genomes were analyzed in regard to genomic variability between and within Euglenales and Eutreptiales. To better understand the dynamics of chloroplast genome evolution in early evolving Eutreptiales, this study focused on the cpGenome of Eutreptiella pomquetensis, and the spread and peculiarities of introns. Methods The Etl. pomquetensis cpGenome was sequenced, annotated and afterwards examined in structure, size, gene order and intron content. These features were compared with other euglenoid cpGenomes as well as those of prasinophyte green algae, including Pyramimonas parkeae. Results and Discussion With about 130,561 bp the chloroplast genome of Etl. pomquetensis, a basal taxon in the phototrophic euglenoids, was considerably larger than the two other Eutreptiales cpGenomes sequenced so far. Although the detected quadripartite structure resembled most green algae and plant chloroplast genomes, the gene content of the single copy regions in Etl. pomquetensis was completely different from those observed in green algae and plants. The gene composition of Etl. pomquetensis was extensively changed and turned out to be almost identical to other Eutreptiales and Euglenales, and not to P. parkeae. Furthermore, the cpGenome of Etl. pomquetensis was unexpectedly permeated by a high number of introns, which led to a substantially larger genome. The 51 identified introns of Etl. pomquetensis showed two major unique features: (i) more than half of the introns displayed a high level of pairwise identities; (ii) no group III introns could be identified in the protein coding genes. These findings support the hypothesis that group III introns are degenerated group II introns and evolved later. PMID:28852596
Manukjan, Georgi; Bösing, Hendrik; Schmugge, Markus; Strauß, Gabriele; Schulze, Harald
2017-11-01
Thrombocytopenia absent radii (TAR) syndrome is clearly defined by the combination of radial aplasia and reduced platelet counts. The genetics of TAR syndrome has recently been resolved and comprises a microdeletion on Chromosome 1 including the RBM8A gene and a single nucleotide polymorphism (SNP) either at the 5' untranslated region (5'UTR) or within the first intron of RBM8A. Although phenotypically readily diagnosed after birth, the genetic determination of particular SNPs in TAR syndrome harbours valuable information to evaluate disease severity and treatment decisions. Here, we present clinical data in a cohort of 38 patients and observed that platelet counts in individuals with 5'UTR SNP are significantly lower compared to patients bearing the SNP in intron 1. Moreover, elevated haemoglobin values could only be assessed in patients with 5'UTR SNP whereas white blood cell count is unaffected, indicating that frequently observed anaemia in TAR patients could also be SNP-dependent whereas leucocytosis does not correlate with genetic background. However, this report on a large cohort provides an overview of important haematological characteristics in TAR patients, facilitating evaluation of the various traits in this disease and indicating the importance of genetic validation for TAR syndrome. © 2017 John Wiley & Sons Ltd.
Nanoplasmonic probes of RNA folding and assembly during pre-mRNA splicing
NASA Astrophysics Data System (ADS)
Nguyen, Anh H.; Lee, Jong Uk; Sim, Sang Jun
2016-02-01
RNA splicing plays important roles in transcriptome and proteome diversity. Herein, we describe the use of a nanoplasmonic system that unveils RNA folding and assembly during pre-mRNA splicing wherein the quantification of mRNA splice variants is not taken into account. With a couple of SERS-probes and plasmonic probes binding at the boundary sites of exon-2/intron-2 and intron-2/exon-3 of the pre-mature RNA of the β-globin gene, the splicing process brings the probes into the plasmonic bands. For plasmonic probes, a plasmon shift increase of ~29 nm, corresponding to intron removal and exon-2 and exon-3 connection to form the mRNA molecule, is measured by plasmonic coupling. The increased scattering intensity and surface-enhanced Raman scattering (SERS) fingerprinting reveal the clear dynamics of pre-mRNA splicing. Moreover, a time-resolved experiment of individual RNA molecules exhibited a successful splicing and an inhibited splicing event by 33 μM biflavonoid isoginkgetin, a general inhibitor of RNA splicing. The results suggest that the RNA splicing is successfully monitored with the nanoplasmonic system. Thus, this platform can be useful for studying RNA nanotechnology, biomolecular folding, alternative splicing, and maturation of microRNA.
Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun
2016-01-01
Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to “Gopoong” and “K-1” were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information. PMID:27271615
Gupta, Kushol; Contreras, Lydia M; Smith, Dorie; Qu, Guosheng; Huang, Tao; Spruce, Lynn A; Seeholzer, Steven H; Belfort, Marlene; Van Duyne, Gregory D
2014-04-01
The stable ribonucleoprotein (RNP) complex formed between the Lactococcus lactis group II intron and its self-encoded LtrA protein is essential for the intron's genetic mobility. In this study, we report the biochemical, compositional, hydrodynamic and structural properties of active group II intron RNP particles (+A) isolated from its native host using a novel purification scheme. We employed small-angle X-ray scattering to determine the structural properties of these particles as they exist in solution. Using sucrose as a contrasting agent, we derived a two-phase quaternary model of the protein-RNA complex. This approach revealed that the spatial properties of the complex are largely defined by the RNA component, with the protein dimer located near the center of mass. A transfer RNA fusion engineered into domain II of the intron provided a distinct landmark consistent with this interpretation. Comparison of the derived +A RNP shape with that of the previously reported precursor intron (ΔA) particle extends previous findings that the loosely packed precursor RNP undergoes a dramatic conformational change as it compacts into its active form. Our results provide insights into the quaternary arrangement of these RNP complexes in solution, an important step to understanding the transition of the group II intron from the precursor to a species fully active for DNA invasion.
Wang, Xinye; Xu, Xindong; Lu, Xingyu; Zhang, Yuanbin; Pan, Weiqing
2015-01-01
Alternative splicing is a molecular process that contributes greatly to the diversification of proteome and to gene functions. Understanding the mechanisms of stage-specific alternative splicing can provide a better understanding of the development of eukaryotes and the functions of different genes. Schistosoma japonicum is an infectious blood-dwelling trematode with a complex lifecycle that causes the tropical disease schistosomiasis. In this study, we analyzed the transcriptome of Schistosoma japonicum to discover alternative splicing events in this parasite, by applying RNA-seq to cDNA library of adults and schistosomula. Results were validated by RT-PCR and sequencing. We found 11,623 alternative splicing events among 7,099 protein encoding genes and average proportion of alternative splicing events per gene was 42.14%. We showed that exon skip is the most common type of alternative splicing events as found in high eukaryotes, whereas intron retention is the least common alternative splicing type. According to intron boundary analysis, the parasite possesses same intron boundaries as other organisms, namely the classic “GT-AG” rule. And in alternative spliced introns or exons, this rule is less strict. And we have attempted to detect alternative splicing events in genes encoding proteins with signal peptides and transmembrane helices, suggesting that alternative splicing could change subcellular locations of specific gene products. Our results indicate that alternative splicing is prevalent in this parasitic worm, and that the worm is close to its hosts. The revealed secretome involved in alternative splicing implies new perspective into understanding interaction between the parasite and its host. PMID:26407301
Arabidopsis Chloroplast Mini-Ribonuclease III Participates in rRNA Maturation and Intron Recycling
Hotto, Amber M.; Castandet, Benoît; Gilet, Laetitia; Higdon, Andrea; Condon, Ciarán; Stern, David B.
2015-01-01
RNase III proteins recognize double-stranded RNA structures and catalyze endoribonucleolytic cleavages that often regulate gene expression. Here, we characterize the functions of RNC3 and RNC4, two Arabidopsis thaliana chloroplast Mini-RNase III-like enzymes sharing 75% amino acid sequence identity. Whereas rnc3 and rnc4 null mutants have no visible phenotype, rnc3/rnc4 (rnc3/4) double mutants are slightly smaller and chlorotic compared with the wild type. In Bacillus subtilis, the RNase Mini-III is integral to 23S rRNA maturation. In Arabidopsis, we observed imprecise maturation of 23S rRNA in the rnc3/4 double mutant, suggesting that exoribonucleases generated staggered ends in the absence of specific Mini-III-catalyzed cleavages. A similar phenotype was found at the 3′ end of the 16S rRNA, and the primary 4.5S rRNA transcript contained 3′ extensions, suggesting that Mini-III catalyzes several processing events of the polycistronic rRNA precursor. The rnc3/4 mutant showed overaccumulation of a noncoding RNA complementary to the 4.5S-5S rRNA intergenic region, and its presence correlated with that of the extended 4.5S rRNA precursor. Finally, we found rnc3/4-specific intron degradation intermediates that are probable substrates for Mini-III and show that B. subtilis Mini-III is also involved in intron regulation. Overall, this study extends our knowledge of the key role of Mini-III in intron and noncoding RNA regulation and provides important insight into plastid rRNA maturation. PMID:25724636
Xu, Tao; Kim, Bo Mi; Kwak, Kyung Jin; Jung, Hyun Ju; Kang, Hunseung
2016-01-01
The minor U12 introns are removed from precursor mRNAs by the U12 intron-specific minor spliceosome. Among the seven ribonucleoproteins unique to the minor spliceosome, denoted as U11/U12-20K, U11/U12-25K, U11/U12-31K, U11/U12-65K, U11-35K, U11-48K, and U11-59K, the roles of only U11/U12-31K and U11/U12-65K have been demonstrated in U12 intron splicing and plant development. Here, the functional role of the Arabidopsis homolog of human U11-48K in U12 intron splicing and the development of Arabidopsis thaliana was examined using transgenic knockdown plants. The u11-48k mutants exhibited several defects in growth and development, such as severely arrested primary inflorescence stems, formation of serrated leaves, production of many rosette leaves after bolting, and delayed senescence. The splicing of most U12 introns analyzed was impaired in the u11-48k mutants. Comparative analysis of the splicing defects and phenotypes among the u11/u12-31k, u11-48k, and u11/12-65k mutants showed that the severity of abnormal development was closely correlated with the degree of impairment in U12 intron splicing. Taken together, these results provide compelling evidence that the Arabidopsis homolog of human U11-48K protein, as well as U11/U12-31K and U11/U12-65K proteins, is necessary for correct splicing of U12 introns and normal plant growth and development. PMID:27091878
Deng, Youjin; Zhang, Qihui; Ming, Ray; Lin, Longji; Lin, Xiangzhi; Lin, Yiying; Li, Xiao; Xie, Baogui; Wen, Zhiqiang
2016-06-30
Hypomyces aurantius is a mycoparasite that causes cobweb disease, a most serious disease of cultivated mushrooms. Intra-species identification is vital for disease control, however the lack of genomic data makes development of molecular markers challenging. Small size, high copy number, and high mutation rate of fungal mitochondrial genome makes it a good candidate for intra and inter species differentiation. In this study, the mitochondrial genome of H. H.a0001 was determined from genomic DNA using Illumina sequencing. The roughly 72 kb genome shows all major features found in other Hypocreales: 14 common protein genes, large and small subunit rRNAs genes and 27 tRNAs genes. Gene arrangement comparison showed conserved gene orders in Hypocreales mitochondria are relatively conserved, with the exception of Acremonium chrysogenum and Acremonium implicatum. Mitochondrial genome comparison also revealed that intron length primarily contributes to mitogenome size variation. Seventeen introns were detected in six conserved genes: five in cox1, four in rnl, three in cob, two each in atp6 and cox3, and one in cox2. Four introns were found to contain two introns or open reading frames: cox3-i2 is a twintron containing two group IA type introns; cox2-i1 is a group IB intron encoding two homing endonucleases; and cox1-i4 and cox1-i3 both contain two open reading frame (ORFs). Analyses combining secondary intronic structures, insertion sites, and similarities of homing endonuclease genes reveal two group IA introns arranged side by side within cox3-i2. Mitochondrial data for H. aurantius provides the basis for further studies relating to population genetics and species identification.
Deng, Youjin; Zhang, Qihui; Ming, Ray; Lin, Longji; Lin, Xiangzhi; Lin, Yiying; Li, Xiao; Xie, Baogui; Wen, Zhiqiang
2016-01-01
Hypomyces aurantius is a mycoparasite that causes cobweb disease, a most serious disease of cultivated mushrooms. Intra-species identification is vital for disease control, however the lack of genomic data makes development of molecular markers challenging. Small size, high copy number, and high mutation rate of fungal mitochondrial genome makes it a good candidate for intra and inter species differentiation. In this study, the mitochondrial genome of H. H.a0001 was determined from genomic DNA using Illumina sequencing. The roughly 72 kb genome shows all major features found in other Hypocreales: 14 common protein genes, large and small subunit rRNAs genes and 27 tRNAs genes. Gene arrangement comparison showed conserved gene orders in Hypocreales mitochondria are relatively conserved, with the exception of Acremonium chrysogenum and Acremonium implicatum. Mitochondrial genome comparison also revealed that intron length primarily contributes to mitogenome size variation. Seventeen introns were detected in six conserved genes: five in cox1, four in rnl, three in cob, two each in atp6 and cox3, and one in cox2. Four introns were found to contain two introns or open reading frames: cox3-i2 is a twintron containing two group IA type introns; cox2-i1 is a group IB intron encoding two homing endonucleases; and cox1-i4 and cox1-i3 both contain two open reading frame (ORFs). Analyses combining secondary intronic structures, insertion sites, and similarities of homing endonuclease genes reveal two group IA introns arranged side by side within cox3-i2. Mitochondrial data for H. aurantius provides the basis for further studies relating to population genetics and species identification. PMID:27376282
Chen, Meng-Yun; Liang, Dan; Zhang, Peng
2017-08-01
The interordinal relationships of Laurasiatherian mammals are currently one of the most controversial questions in mammalian phylogenetics. Previous studies mainly relied on coding sequences (CDS) and seldom used noncoding sequences. Here, by data mining public genome data, we compiled an intron data set of 3,638 genes (all introns from a protein-coding gene are considered as a gene) (19,055,073 bp) and a CDS data set of 10,259 genes (20,994,285 bp), covering all major lineages of Laurasiatheria (except Pholidota). We found that the intron data contained stronger and more congruent phylogenetic signals than the CDS data. In agreement with this observation, concatenation and species-tree analyses of the intron data set yielded well-resolved and identical phylogenies, whereas the CDS data set produced weakly supported and incongruent results. Further analyses showed that the phylogeny inferred from the intron data is highly robust to data subsampling and change in outgroup, but the CDS data produced unstable results under the same conditions. Interestingly, gene tree statistical results showed that the most frequently observed gene tree topologies for the CDS and intron data are identical, suggesting that the major phylogenetic signal within the CDS data is actually congruent with that within the intron data. Our final result of Laurasiatheria phylogeny is (Eulipotyphla,((Chiroptera, Perissodactyla),(Carnivora, Cetartiodactyla))), favoring a close relationship between Chiroptera and Perissodactyla. Our study 1) provides a well-supported phylogenetic framework for Laurasiatheria, representing a step towards ending the long-standing "hard" polytomy and 2) argues that intron within genome data is a promising data resource for resolving rapid radiation events across the tree of life. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Frequencies of VNTR and RFLP polymorphisms associated with factor VIII gene in Singapore
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fong, I.; Lai, P.S.; Ouah, T.C.
1994-09-01
The allelic frequency of any polymorphism within a population determines its usefulness for genetic counselling. This is important in populations of non-Caucasian origin as RFLPs may significantly differ among ethnic groups. We report a study of five intragenic polymorphisms in factor VIII gene carried out in Singapore. The three PCR-based RFLP markers studied were Intron 18/Bcl I, Intron 19/Hind III and Intron 22/Xba I. In an analysis of 148 unrelated normal X chromosomes, the allele frequencies were found to be A1 = 0.18, A2 = 0.82 (Bcl I RFLP), A1 = 0.80, A2 = 0.20 (Hind III RFLP) and A1more » = 0.58, and A2 = 0.42 (Xba I RFLP). The heterozygosity rates of 74 females analyzed separately were 31%, 32% and 84.2%, respectively. Linkage disequilibrium was also observed to some degree between Bcl I and Hind III polymorphism in our population. We have also analyzed a sequence polymorphism in Intron 7 using hybridization with radioactive-labelled {sup 32}P allele-specific oligonucleotide probes. This polymorphism was not very polymorphic in our population with only 2% of 117 individuals analyzed being informative. However, the use of a hypervariable dinucleotide repeat sequence (VNTR) in Intron 13 showed that 25 of our of 27 (93%) females were heterozygous. Allele frequencies ranged from 1 to 55 %. We conclude that a viable strategy for molecular analysis of Hemophilia A families in our population should include the use of Intron 18/Bcl I and Intron 22/Xba I RFLP markers and the Intron 13 VNTR marker.« less
Pydiura, Nikolay; Pirko, Yaroslav; Galinousky, Dmitry; Postovoitova, Anastasiia; Yemets, Alla; Kilchevsky, Aleksandr; Blume, Yaroslav
2018-06-08
Flax (Linum usitatissimum L.) is a valuable food and fiber crop cultivated for its quality fiber and seed oil. α-, β-, γ-tubulins and actins are the main structural proteins of the cytoskeleton. α- and γ-tubulin and actin genes have not been characterized yet in the flax genome. In this study, we have identified 6 α-tubulin genes, 13 β-tubulin genes, 2 γ-tubulin genes, and 15 actin genes in the flax genome and analysed the phylogenetic relationships between flax and A. thaliana tubulin and actin genes. Six α-tubulin genes are represented by 3 paralogous pairs, among 13 β-tubulin genes 7 different isotypes can be distinguished, 6 of which are encoded by two paralogous genes each. γ-tubulin is represented by a paralogous pair of genes one of which may be not functional. Fifteen actin genes represent 7 paralogous pairs - 7 actin isotypes and a sequentially duplicated copy of one of the genes of one of the isotypes. Exon-intron structure analysis has shown intron length polymorphism within the β-tubulin genes and intron number variation among the α-tubulin gene: 3 or 4 introns are found in two or four genes, respectively. Intron positioning occurs at conservative sites, as observed in numerous other plant species. Flax actin genes show both intron length polymorphisms and variation in the number of intron that may be 2 or 3. These data will be useful to support further studies on the specificity, functioning, regulation and evolution of the flax cytoskeleton proteins. This article is protected by copyright. All rights reserved.
Ciarlo, Eleonora; Massone, Sara; Penna, Ilaria; Nizzari, Mario; Gigoni, Arianna; Dieci, Giorgio; Russo, Claudio; Florio, Tullio; Cancedda, Ranieri; Pagano, Aldo
2013-03-01
Recent studies indicated that sortilin-related receptor 1 (SORL1) is a risk gene for late-onset Alzheimer's disease (AD), although its role in the aetiology and/or progression of this disorder is not fully understood. Here, we report the finding of a non-coding (nc) RNA (hereafter referred to as 51A) that maps in antisense configuration to intron 1 of the SORL1 gene. 51A expression drives a splicing shift of SORL1 from the synthesis of the canonical long protein variant A to an alternatively spliced protein form. This process, resulting in a decreased synthesis of SORL1 variant A, is associated with impaired processing of amyloid precursor protein (APP), leading to increased Aβ formation. Interestingly, we found that 51A is expressed in human brains, being frequently upregulated in cerebral cortices from individuals with Alzheimer's disease. Altogether, these findings document a novel ncRNA-dependent regulatory pathway that might have relevant implications in neurodegeneration.
Localization of a bacterial group II intron-encoded protein in human cells.
Reinoso-Colacio, Mercedes; García-Rodríguez, Fernando Manuel; García-Cañadas, Marta; Amador-Cubero, Suyapa; García Pérez, José Luis; Toro, Nicolás
2015-08-05
Group II introns are mobile retroelements that self-splice from precursor RNAs to form ribonucleoparticles (RNP), which can invade new specific genomic DNA sites. This specificity can be reprogrammed, for insertion into any desired DNA site, making these introns useful tools for bacterial genetic engineering. However, previous studies have suggested that these elements may function inefficiently in eukaryotes. We investigated the subcellular distribution, in cultured human cells, of the protein encoded by the group II intron RmInt1 (IEP) and several mutants. We created fusions with yellow fluorescent protein (YFP) and with a FLAG epitope. We found that the IEP was localized in the nucleus and nucleolus of the cells. Remarkably, it also accumulated at the periphery of the nuclear matrix. We were also able to identify spliced lariat intron RNA, which co-immunoprecipitated with the IEP, suggesting that functional RmInt1 RNPs can be assembled in cultured human cells.
Localization of a bacterial group II intron-encoded protein in human cells
Reinoso-Colacio, Mercedes; García-Rodríguez, Fernando Manuel; García-Cañadas, Marta; Amador-Cubero, Suyapa; Pérez, José Luis García; Toro, Nicolás
2015-01-01
Group II introns are mobile retroelements that self-splice from precursor RNAs to form ribonucleoparticles (RNP), which can invade new specific genomic DNA sites. This specificity can be reprogrammed, for insertion into any desired DNA site, making these introns useful tools for bacterial genetic engineering. However, previous studies have suggested that these elements may function inefficiently in eukaryotes. We investigated the subcellular distribution, in cultured human cells, of the protein encoded by the group II intron RmInt1 (IEP) and several mutants. We created fusions with yellow fluorescent protein (YFP) and with a FLAG epitope. We found that the IEP was localized in the nucleus and nucleolus of the cells. Remarkably, it also accumulated at the periphery of the nuclear matrix. We were also able to identify spliced lariat intron RNA, which co-immunoprecipitated with the IEP, suggesting that functional RmInt1 RNPs can be assembled in cultured human cells. PMID:26244523
Chery, Joyce G; Sass, Chodon; Specht, Chelsea D
2017-09-01
We developed a bioinformatic pipeline that leverages a publicly available genome and published transcriptomes to design primers in conserved coding sequences flanking targeted introns of single-copy nuclear loci. Paullinieae (Sapindaceae) is used to demonstrate the pipeline. Transcriptome reads phylogenetically closer to the lineage of interest are aligned to the closest genome. Single-nucleotide polymorphisms are called, generating a "pseudoreference" closer to the lineage of interest. Several filters are applied to meet the criteria of single-copy nuclear loci with introns of a desired size. Primers are designed in conserved coding sequences flanking introns. Using this pipeline, we developed nine single-copy nuclear intron markers for Paullinieae. This pipeline is highly flexible and can be used for any group with available genomic and transcriptomic resources. This pipeline led to the development of nine variable markers for phylogenetic study without generating sequence data de novo.
Sugita, Mamoru; Shinozaki, Kazuo; Sugiura, Masahiro
1985-01-01
The nucleotide sequence of a tRNALys(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNAGly(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long. Images PMID:16593561
Sugita, M; Shinozaki, K; Sugiura, M
1985-06-01
The nucleotide sequence of a tRNA(Lys)(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNA(Gly)(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long.
Grewe, Felix; Zhu, Andan; Mower, Jeffrey P.
2016-01-01
The mitochondrial nad1 gene of seed plants has a complex structure, including four introns in cis or trans configurations and a maturase gene (matR) hosted within the final intron. In the geranium family (Geraniaceae), however, sequencing of representative species revealed that three of the four introns, including one in a trans configuration and another that hosts matR, were lost from the nad1 gene in their common ancestor. Despite the loss of the host intron, matR has been retained as a freestanding gene in most genera of the family, indicating that this maturase has additional functions beyond the splicing of its host intron. In the common ancestor of Pelargonium, matR was transferred to the nuclear genome, where it was split into two unlinked genes that encode either its reverse transcriptase or maturase domain. Both nuclear genes are transcribed and contain predicted mitochondrial targeting signals, suggesting that they express functional proteins that are imported into mitochondria. The nuclear localization and split domain structure of matR in the Pelargonium nuclear genome offers a unique opportunity to assess the function of these two domains using transgenic approaches. PMID:27664178
Stephan, W.; Rodriguez, V. S.; Zhou, B.; Parsch, J.
1994-01-01
Three distinctly different alleles of the metallothionein gene Mtn have been identified in natural Drosophila melanogaster populations: Mtn(.3), Mtn(1), and Dp(Mtn(1)), where the latter designates a tandem duplication of Mtn(1). In Drosophila simulans, only Mtn(.3)-type alleles have been found. It has been suggested that Mtn(.3) is the ancestral allele and demonstrated that a presumed two-step transition from Mtn(.3) to Mtn(1) to Dp(Mtn(1)) is accompanied by an approximate 5-fold increase in RNA levels. We analyzed the evolutionary genetics of the Mtn locus of Drosophila ananassae, a distant relative of D. melanogaster and D. simulans within the melanogaster species group. The Mtn gene of D. ananassae is most similar to Mtn(.3). (i) it is identical with Mtn(.3) at the amino acid level, but differs from Mtn(1) in its terminal codon; (ii) its 3' UTR contains a characteristic extra DNA segment of about 50 bp which is present in Mtn(.3), but lacking in Mtn(1); (iii) duplications of Mtn were not found in a worldwide sample of 110 wild D. ananassae chromosomes. However, the intron of the Mtn gene in D. ananassae is only 69 bp long, whereas the length of the Mtn(.3) and Mtn(1) introns is 265 bp; and it lacks a polypyrimidine stretch upstream of the 3' splice site in contrast to the much greater pyrimidine-richness found in the Mtn(.3) and Mtn(1) introns. A short intron (67 bp) was also identified in a D. pseudoobscura Mtn allele, suggesting that the short intron is the ancestral form and that the transition from the short to the long intron occurred within the melanogaster species group. We discuss the significance of this observation with regard to the recently proposed classification of D. melanogaster introns into two groups: short introns (<90 bp) which tend to lack polypyrimidine stretches, and longer ones which have strong 3' splice signals similar to mammalian introns. A database search revealed that this length dimorphism is an evolutionarily conserved feature of Drosophila introns; transitions from one size class to the other appear to be rare between closely related species (e.g., within the melanogaster subgroup). PMID:8001781
Genome-Wide Analysis of the NADK Gene Family in Plants
Li, Wen-Yan; Wang, Xiang; Li, Ri; Li, Wen-Qiang; Chen, Kun-Ming
2014-01-01
Background NAD(H) kinase (NADK) is the key enzyme that catalyzes de novo synthesis of NADP(H) from NAD(H) for NADP(H)-based metabolic pathways. In plants, NADKs form functional subfamilies. Studies of these families in Arabidopsis thaliana indicate that they have undergone considerable evolutionary selection; however, the detailed evolutionary history and functions of the various NADKs in plants are not clearly understood. Principal Findings We performed a comparative genomic analysis that identified 74 NADK gene homologs from 24 species representing the eight major plant lineages within the supergroup Plantae: glaucophytes, rhodophytes, chlorophytes, bryophytes, lycophytes, gymnosperms, monocots and eudicots. Phylogenetic and structural analysis classified these NADK genes into four well-conserved subfamilies with considerable variety in the domain organization and gene structure among subfamily members. In addition to the typical NAD_kinase domain, additional domains, such as adenylate kinase, dual-specificity phosphatase, and protein tyrosine phosphatase catalytic domains, were found in subfamily II. Interestingly, NADKs in subfamily III exhibited low sequence similarity (∼30%) in the kinase domain within the subfamily and with the other subfamilies. These observations suggest that gene fusion and exon shuffling may have occurred after gene duplication, leading to specific domain organization seen in subfamilies II and III, respectively. Further analysis of the exon/intron structures showed that single intron loss and gain had occurred, yielding the diversified gene structures, during the process of structural evolution of NADK family genes. Finally, both available global microarray data analysis and qRT-RCR experiments revealed that the NADK genes in Arabidopsis and Oryza sativa show different expression patterns in different developmental stages and under several different abiotic/biotic stresses and hormone treatments, underscoring the functional diversity and functional divergence of the NADK family in plants. Conclusions These findings will facilitate further studies of the NADK family and provide valuable information for functional validation of this family in plants. PMID:24968225
Sharangdhar, Tejaswini; Sugimoto, Yoichiro; Heraud-Farlow, Jacqueline; Fernández-Moya, Sandra M; Ehses, Janina; Ruiz de Los Mozos, Igor; Ule, Jernej; Kiebler, Michael A
2017-10-01
Dendritic localization and hence local mRNA translation contributes to synaptic plasticity in neurons. Staufen2 (Stau2) is a well-known neuronal double-stranded RNA-binding protein (dsRBP) that has been implicated in dendritic mRNA localization. The specificity of Stau2 binding to its target mRNAs remains elusive. Using individual-nucleotide resolution CLIP (iCLIP), we identified significantly enriched Stau2 binding to the 3'-UTRs of 356 transcripts. In 28 (7.9%) of those, binding occurred to a retained intron in their 3'-UTR The strongest bound 3'-UTR intron was present in the longest isoform of Calmodulin 3 ( Calm3 L ) mRNA Calm3 L 3'-UTR contains six Stau2 crosslink clusters, four of which are in this retained 3'-UTR intron. The Calm3 L mRNA localized to neuronal dendrites, while lack of the 3'-UTR intron impaired its dendritic localization. Importantly, Stau2 mediates this dendritic localization via the 3'-UTR intron, without affecting its stability. Also, NMDA-mediated synaptic activity specifically promoted the dendritic mRNA localization of the Calm3 L isoform, while inhibition of synaptic activity reduced it substantially. Together, our results identify the retained intron as a critical element in recruiting Stau2, which then allows for the localization of Calm3 L mRNA to distal dendrites. © 2017 The Authors.
Voorbij, Annemarie M W Y; van Steenbeek, Frank G; Vos-Loohuis, Manon; Martens, Ellen E C P; Hanson-Nilsson, Jeanette M; van Oost, Bernard A; Kooistra, Hans S; Leegwater, Peter A
2011-01-01
Dwarfism in German shepherd dogs is due to combined pituitary hormone deficiency of unknown genetic cause. We localized the recessively inherited defect by a genome wide approach to a region on chromosome 9 with a lod score of 9.8. The region contains LHX3, which codes for a transcription factor essential for pituitary development. Dwarfs have a deletion of one of six 7 bp repeats in intron 5 of LHX3, reducing the intron size to 68 bp. One dwarf was compound heterozygous for the deletion and an insertion of an asparagine residue in the DNA-binding homeodomain of LHX3, suggesting involvement of the gene in the disorder. An exon trapping assay indicated that the shortened intron is not spliced efficiently, probably because it is too small. We applied bisulfite conversion of cytosine to uracil in RNA followed by RT-PCR to analyze the splicing products. The aberrantly spliced RNA molecules resulted from either skipping of exon 5 or retention of intron 5. The same splicing defects were observed in cDNA derived from the pituitary of dwarfs. A survey of similarly mutated introns suggests that there is a minimal distance requirement between the splice donor and branch site of 50 nucleotides. In conclusion, a contraction of a DNA repeat in intron 5 of canine LHX3 leads to deficient splicing and is associated with pituitary dwarfism.
Voorbij, Annemarie M. W. Y.; van Steenbeek, Frank G.; Vos-Loohuis, Manon; Martens, Ellen E. C. P.; Hanson-Nilsson, Jeanette M.; van Oost, Bernard A.; Kooistra, Hans S.; Leegwater, Peter A.
2011-01-01
Dwarfism in German shepherd dogs is due to combined pituitary hormone deficiency of unknown genetic cause. We localized the recessively inherited defect by a genome wide approach to a region on chromosome 9 with a lod score of 9.8. The region contains LHX3, which codes for a transcription factor essential for pituitary development. Dwarfs have a deletion of one of six 7 bp repeats in intron 5 of LHX3, reducing the intron size to 68 bp. One dwarf was compound heterozygous for the deletion and an insertion of an asparagine residue in the DNA-binding homeodomain of LHX3, suggesting involvement of the gene in the disorder. An exon trapping assay indicated that the shortened intron is not spliced efficiently, probably because it is too small. We applied bisulfite conversion of cytosine to uracil in RNA followed by RT-PCR to analyze the splicing products. The aberrantly spliced RNA molecules resulted from either skipping of exon 5 or retention of intron 5. The same splicing defects were observed in cDNA derived from the pituitary of dwarfs. A survey of similarly mutated introns suggests that there is a minimal distance requirement between the splice donor and branch site of 50 nucleotides. In conclusion, a contraction of a DNA repeat in intron 5 of canine LHX3 leads to deficient splicing and is associated with pituitary dwarfism. PMID:22132174
Mobile group II intron based gene targeting in Lactobacillus plantarum WCFS1.
Sasikumar, Ponnusamy; Paul, Eldho; Gomathi, Sivasamy; Abhishek, Albert; Sasikumar, Sundaresan; Selvam, Govindan Sadasivam
2016-10-01
The usage of recombinant lactic acid bacteria for delivery of therapeutic proteins to the mucosa has been emerging. In the present study, an attempt was made to engineer a thyA mutant of Lactobacillus plantarum (L. plantarum) using lactococcal group II intron Ll.LtrB for the development of biologically contained recombinant L. plantarum for prevention of calcium oxalate stone disease. The 3 kb Ll.LtrB intron donor cassettes from the source vector pACD4C was PCR amplified, ligated into pSIP series of lactobacillus vector pLp_3050sAmyA, yielding a novel vector pLpACD4C (8.6 kb). The quantitative real-time PCR experiment shows 94-fold increased expression of Ll.LtrB intron and 14-fold increased expression of ltrA gene in recombinant L. plantarum containing pLpACD4C. In order to target the thyA gene, the potential intron RNA binding sites in the thyA gene of L. plantarum was predicted with help of computer algorithm. The insertion location 188|189s of thyA gene (lowest E-0.134) was chosen and the wild type intron Ll.LtrB was PCR modified, yielding a retargeted intron of pLpACDthyA. The retargeted intron was expressed by using induction peptide (sppIP), subsequently the integration of intron in thyA gene was identified by PCR screening and finally ThyA - mutant of L. plantarum (ThyA18) was detected. In vitro growth curve result showed that in the absence of thymidine, colony forming units of mutant ThyA18 was decreased, whereas high thymidine concentration (10 μM) supported the growth of the culture until saturation. In conclusion, ThyA - mutant of L. plantarum (ThyA18) constructed in this study will be used as a biologically contained recombinant probiotic to deliver oxalate decarboxylase into the lumen for treatment of hyperoxaluria and calcium oxalate stone deposition. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Norris, Vic; Root-Bernstein, Robert
2009-06-04
In the "ecosystems-first" approach to the origins of life, networks of non-covalent assemblies of molecules (composomes), rather than individual protocells, evolved under the constraints of molecular complementarity. Composomes evolved into the hyperstructures of modern bacteria. We extend the ecosystems-first approach to explain the origin of eukaryotic cells through the integration of mixed populations of bacteria. We suggest that mutualism and symbiosis resulted in cellular mergers entailing the loss of redundant hyperstructures, the uncoupling of transcription and translation, and the emergence of introns and multiple chromosomes. Molecular complementarity also facilitated integration of bacterial hyperstructures to perform cytoskeletal and movement functions.
Brouard, Jean-Simon; Turmel, Monique; Otis, Christian; Lemieux, Claude
2016-01-01
The chloroplast genome sustained extensive changes in architecture during the evolution of the Chlorophyceae, a morphologically and ecologically diverse class of green algae belonging to the Chlorophyta; however, the forces driving these changes are poorly understood. The five orders recognized in the Chlorophyceae form two major clades: the CS clade consisting of the Chlamydomonadales and Sphaeropleales, and the OCC clade consisting of the Oedogoniales, Chaetophorales, and Chaetopeltidales. In the OCC clade, considerable variations in chloroplast DNA (cpDNA) structure, size, gene order, and intron content have been observed. The large inverted repeat (IR), an ancestral feature characteristic of most green plants, is present in Oedogonium cardiacum (Oedogoniales) but is lacking in the examined members of the Chaetophorales and Chaetopeltidales. Remarkably, the Oedogonium 35.5-kb IR houses genes that were putatively acquired through horizontal DNA transfer. To better understand the dynamics of chloroplast genome evolution in the Oedogoniales, we analyzed the cpDNA of a second representative of this order, Oedocladium carolinianum . The Oedocladium cpDNA was sequenced and annotated. The evolutionary distances separating Oedocladium and Oedogonium cpDNAs and two other pairs of chlorophycean cpDNAs were estimated using a 61-gene data set. Phylogenetic analysis of an alignment of group IIA introns from members of the OCC clade was performed. Secondary structures and insertion sites of oedogonialean group IIA introns were analyzed. The 204,438-bp Oedocladium genome is 7.9 kb larger than the Oedogonium genome, but its repertoire of conserved genes is remarkably similar and gene order differs by only one reversal. Although the 23.7-kb IR is missing the putative foreign genes found in Oedogonium , it contains sequences coding for a putative phage or bacterial DNA primase and a hypothetical protein. Intergenic sequences are 1.5-fold longer and dispersed repeats are more abundant, but a smaller fraction of the Oedocladium genome is occupied by introns. Six additional group II introns are present, five of which lack ORFs and carry highly similar sequences to that of the ORF-less IIA intron shared with Oedogonium . Secondary structure analysis of the group IIA introns disclosed marked differences in the exon-binding sites; however, each intron showed perfect or nearly perfect base pairing interactions with its target site. Our results suggest that chloroplast genes rearrange more slowly in the Oedogoniales than in the Chaetophorales and raise questions as to what was the nature of the foreign coding sequences in the IR of the common ancestor of the Oedogoniales. They provide the first evidence for intragenomic proliferation of group IIA introns in the Viridiplantae, revealing that intron spread in the Oedocladium lineage likely occurred by retrohoming after sequence divergence of the exon-binding sites.
Cosme, Ruth S Cruz; Yamamura, Yasuhiro; Tang, Qiyi
2009-04-01
Human cytomegalovirus (HCMV), a member of the beta subgroup of the family Herpesviridae, causes serious health problems worldwide. HCMV gene expression in host cells is a well-defined sequential process: immediate-early (IE) gene expression, early-gene expression, DNA replication, and late-gene expression. The most abundant IE gene, major IE (MIE) gene pre-mRNA, needs to be spliced before being exported to the cytoplasm for translation. In this study, the regulation of MIE gene splicing was investigated; in so doing, we found that polypyrimidine tract binding proteins (PTBs) strongly repressed MIE gene production in cotransfection assays. In addition, we discovered that the repressive effects of PTB could be rescued by splicing factor U2AF. Taken together, the results suggest that PTBs inhibit MIE gene splicing by competing with U2AF65 for binding to the polypyrimidine tract in pre-mRNA. In intron deletion mutation assays and RNA detection experiments (reverse transcription [RT]-PCR and real-time RT-PCR), we further observed that PTBs target all the introns of the MIE gene, especially intron 2, and affect gene splicing, which was reflected in the variation in the ratio of pre-mRNA to mRNA. Using transfection assays, we demonstrated that PTB knockdown cells induce a higher degree of MIE gene splicing/expression. Consistently, HCMV can produce more viral proteins and viral particles in PTB knockdown cells after infection. We conclude that PTB inhibits HCMV replication by interfering with MIE gene splicing through competition with U2AF for binding to the polypyrimidine tract in MIE gene introns.
Cosme, Ruth S. Cruz; Yamamura, Yasuhiro; Tang, Qiyi
2009-01-01
Human cytomegalovirus (HCMV), a member of the β subgroup of the family Herpesviridae, causes serious health problems worldwide. HCMV gene expression in host cells is a well-defined sequential process: immediate-early (IE) gene expression, early-gene expression, DNA replication, and late-gene expression. The most abundant IE gene, major IE (MIE) gene pre-mRNA, needs to be spliced before being exported to the cytoplasm for translation. In this study, the regulation of MIE gene splicing was investigated; in so doing, we found that polypyrimidine tract binding proteins (PTBs) strongly repressed MIE gene production in cotransfection assays. In addition, we discovered that the repressive effects of PTB could be rescued by splicing factor U2AF. Taken together, the results suggest that PTBs inhibit MIE gene splicing by competing with U2AF65 for binding to the polypyrimidine tract in pre-mRNA. In intron deletion mutation assays and RNA detection experiments (reverse transcription [RT]-PCR and real-time RT-PCR), we further observed that PTBs target all the introns of the MIE gene, especially intron 2, and affect gene splicing, which was reflected in the variation in the ratio of pre-mRNA to mRNA. Using transfection assays, we demonstrated that PTB knockdown cells induce a higher degree of MIE gene splicing/expression. Consistently, HCMV can produce more viral proteins and viral particles in PTB knockdown cells after infection. We conclude that PTB inhibits HCMV replication by interfering with MIE gene splicing through competition with U2AF for binding to the polypyrimidine tract in MIE gene introns. PMID:19144709
Novel methodologies for spectral classification of exon and intron sequences
NASA Astrophysics Data System (ADS)
Kwan, Hon Keung; Kwan, Benjamin Y. M.; Kwan, Jennifer Y. Y.
2012-12-01
Digital processing of a nucleotide sequence requires it to be mapped to a numerical sequence in which the choice of nucleotide to numeric mapping affects how well its biological properties can be preserved and reflected from nucleotide domain to numerical domain. Digital spectral analysis of nucleotide sequences unfolds a period-3 power spectral value which is more prominent in an exon sequence as compared to that of an intron sequence. The success of a period-3 based exon and intron classification depends on the choice of a threshold value. The main purposes of this article are to introduce novel codes for 1-sequence numerical representations for spectral analysis and compare them to existing codes to determine appropriate representation, and to introduce novel thresholding methods for more accurate period-3 based exon and intron classification of an unknown sequence. The main findings of this study are summarized as follows: Among sixteen 1-sequence numerical representations, the K-Quaternary Code I offers an attractive performance. A windowed 1-sequence numerical representation (with window length of 9, 15, and 24 bases) offers a possible speed gain over non-windowed 4-sequence Voss representation which increases as sequence length increases. A winner threshold value (chosen from the best among two defined threshold values and one other threshold value) offers a top precision for classifying an unknown sequence of specified fixed lengths. An interpolated winner threshold value applicable to an unknown and arbitrary length sequence can be estimated from the winner threshold values of fixed length sequences with a comparable performance. In general, precision increases as sequence length increases. The study contributes an effective spectral analysis of nucleotide sequences to better reveal embedded properties, and has potential applications in improved genome annotation.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2016-09-19
To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G planctonica and 262,888-bp G sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2016-01-01
Abstract To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G. planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G. planctonica and 262,888-bp G. sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G. sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. PMID:27503298
DOE Office of Scientific and Technical Information (OSTI.GOV)
Conrad, R.; Thomas, J.; Spieth, J.
In nematodes, the RNA products of some genes are trans-spliced to a 22-nucleotide spliced leader (SL), while the RNA products of other genes are not. In Caenorhabditis elegans, there are two SLs, Sl1 and SL2, donated by two distinct small nuclear ribonucleoprotein particles in a process functionally quite similar to nuclear intron removal. The authors demonstrate here that it is possible to convert a non-trans-spliced gene into a trans-spliced gene by placement of an intron missing only the 5[prime] splice site into the 5[prime] untranslated region. Stable transgenic strains were isolated expressing a gene in which 69 nucleotides of amore » vit-5 intron, including the 3[prime] splice site, were inserted into the 5[prime] untranslated region of a vit-2/vit-6 fusion gene. The RNA product of this gene was examined by primer extension and PCR amplification. Although the vit-2/vit-6 transgene product is not normally trans-spliced, the majority of transcripts from this altered gene were trans-spliced to SL1. They termed the region of a trans-spliced mRNA precursor between the 5[prime] end and the first 3[prime] splice site an 'outrun'. The results suggest that if a transcript begins with intronlike sequence followed by a 3[prime] splice site, this alone may constitute an outrun and be sufficient to demarcate a transcript as a trans-splice acceptor. These findings leave open the possibility that specific sequences are required to increase the efficiency of trans-splicing.« less
Crystal structures of a group II intron maturase reveal a missing link in spliceosome evolution.
Zhao, Chen; Pyle, Anna Marie
2016-06-01
Group II introns are self-splicing ribozymes that are essential in many organisms, and they have been hypothesized to share a common evolutionary ancestor with the spliceosome. Although structural similarity of RNA components supports this connection, it is of interest to determine whether associated protein factors also share an evolutionary heritage. Here we present the crystal structures of reverse transcriptase (RT) domains from two group II intron-encoded proteins (maturases) from Roseburia intestinalis and Eubacterium rectale, obtained at 1.2-Å and 2.1-Å resolution, respectively. These domains are more similar in architecture to the spliceosomal Prp8 RT-like domain than to any other RTs, and they share substantial similarity with flaviviral RNA polymerases. The RT domain itself is sufficient for binding intron RNA with high affinity and specificity, and it is contained within an active RT enzyme. These studies provide a foundation for understanding structure-function relationships within group II intron-maturase complexes.
Deep intronic GPR143 mutation in a Japanese family with ocular albinism
Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei
2015-01-01
Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease. PMID:26061757
Deep intronic GPR143 mutation in a Japanese family with ocular albinism.
Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei
2015-06-10
Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease.
Spliced RNA of woodchuck hepatitis virus.
Ogston, C W; Razman, D G
1992-07-01
Polymerase chain reaction was used to investigate RNA splicing in liver of woodchucks infected with woodchuck hepatitis virus (WHV). Two spliced species were detected, and the splice junctions were sequenced. The larger spliced RNA has an intron of 1300 nucleotides, and the smaller spliced sequence shows an additional downstream intron of 1104 nucleotides. We did not detect singly spliced sequences from which the smaller intron alone was removed. Control experiments showed that spliced sequences are present in both RNA and DNA in infected liver, showing that the viral reverse transcriptase can use spliced RNA as template. Spliced sequences were detected also in virion DNA prepared from serum. The upstream intron produces a reading frame that fuses the core to the polymerase polypeptide, while the downstream intron causes an inframe deletion in the polymerase open reading frame. Whereas the splicing patterns in WHV are superficially similar to those reported recently in hepatitis B virus, we detected no obvious homology in the coding capacity of spliced RNAs from these two viruses.
Intronic splicing mutations in PTCH1 cause Gorlin syndrome.
Bholah, Zaynab; Smith, Miriam J; Byers, Helen J; Miles, Emma K; Evans, D Gareth; Newman, William G
2014-09-01
Gorlin syndrome is an autosomal dominant disorder characterized by multiple early-onset basal cell carcinoma, odontogenic keratocysts and skeletal abnormalities. It is caused by heterozygous mutations in the tumour suppressor PTCH1. Routine clinical genetic testing, by Sanger sequencing and multiplex ligation-dependent probe amplification (MLPA) to confirm a clinical diagnosis of Gorlin syndrome, identifies a mutation in 60-90 % of cases. We undertook RNA analysis on lymphocytes from ten individuals diagnosed with Gorlin syndrome, but without known PTCH1 mutations by exonic sequencing or MLPA. Two altered PTCH1 transcripts were identified. Genomic DNA sequence analysis identified an intron 7 mutation c.1068-10T>A, which created a strong cryptic splice acceptor site, leading to an intronic insertion of eight bases; this is predicted to create a frameshift p.(His358Alafs*12). Secondly, a deep intronic mutation c.2561-2057A>G caused an inframe insertion of 78 intronic bases in the cDNA transcript, leading to a premature stop codon p.(Gly854fs*3). The mutations are predicted to cause loss of function of PTCH1, consistent with its tumour suppressor function. The findings indicate the importance of RNA analysis to detect intronic mutations in PTCH1 not identified by routine screening techniques.
Endo, Megumi; Hirose, Mamiko; Honda, Masanao; Koga, Hiroyuki; Morino, Yoshiaki; Kiyomoto, Masato; Wada, Hiroshi
2018-06-15
The marine environment around Japan experienced significant changes during the Cenozoic Era. In this study, we report findings suggesting that this dynamic history left behind traces in the genome of the Japanese sand dollar species Peronella japonica and P. rubra. Although mitochondrial Cytochrome C Oxidase I sequences did not indicate fragmentation of the current local populations of P. japonica around Japan, two different types of intron sequence were found in the Alx1 locus. We inferred that past fragmentation of the populations account for the presence of two types of nuclear sequences as alleles in the Alx1 intron of P. japonica. It is likely that the split populations have intermixed in recent times; hence, we did not detect polymorphisms in the sequences reflecting the current localization of the species. In addition, we found two allelic sequences of theAlx1 intron in the sister species P. rubra. The divergence times of the two types of Alx1 intron sequences were estimated at approximately 14.9 and 4.0 million years ago for P. japonica and P. rubra, respectively. Our study indicates that information from the intron sequences of nuclear genes can enhance our understanding of past genetic events in organisms. Copyright © 2018 Elsevier B.V. All rights reserved.
Geiss, K T; Abbas, G M; Makaroff, C A
1994-04-01
The mitochondrial gene coding for subunit 4 of the NADH dehydrogenase complex I (nad4) has been isolated and characterized from lettuce, Lactuca sativa. Analysis of nad4 genes in a number of plants by Southern hybridization had previously suggested that the intron content varied between species. Characterization of the lettuce gene confirms this observation. Lettuce nad4 contains two exons and one group IIA intron, whereas previously sequenced nad4 genes from turnip and wheat contain three group IIA introns. Northern analysis identified a transcript of 1600 nucleotides, which represents the mature nad4 mRNA and a primary transcript of 3200 nucleotides. Sequence analysis of lettuce and turnip nad4 cDNAs was used to confirm the intron/exon border sequences and to examine RNA editing patterns. Editing is observed at the 5' and 3' ends of the lettuce transcript, but is absent from sequences that correspond to exons two, three and the 5' end of exon four in turnip and wheat. In contrast, turnip transcripts are highly edited in this region, suggesting that homologous recombination of an edited and spliced cDNA intermediate was involved in the loss of introns two and three from an ancestral lettuce nad4 gene.
Bhasi, Ashwini; Philip, Philge; Manikandan, Vinu; Senapathy, Periannan
2009-01-01
We have developed ExDom, a unique database for the comparative analysis of the exon–intron structures of 96 680 protein domains from seven eukaryotic organisms (Homo sapiens, Mus musculus, Bos taurus, Rattus norvegicus, Danio rerio, Gallus gallus and Arabidopsis thaliana). ExDom provides integrated access to exon-domain data through a sophisticated web interface which has the following analytical capabilities: (i) intergenomic and intragenomic comparative analysis of exon–intron structure of domains; (ii) color-coded graphical display of the domain architecture of proteins correlated with their corresponding exon-intron structures; (iii) graphical analysis of multiple sequence alignments of amino acid and coding nucleotide sequences of homologous protein domains from seven organisms; (iv) comparative graphical display of exon distributions within the tertiary structures of protein domains; and (v) visualization of exon–intron structures of alternative transcripts of a gene correlated to variations in the domain architecture of corresponding protein isoforms. These novel analytical features are highly suited for detailed investigations on the exon–intron structure of domains and make ExDom a powerful tool for exploring several key questions concerning the function, origin and evolution of genes and proteins. ExDom database is freely accessible at: http://66.170.16.154/ExDom/. PMID:18984624
Prychitko, T M; Moore, W S
1997-10-01
Estimating phylogenies from DNA sequence data has become the major methodology of molecular phylogenetics. To date, molecular phylogenetics of the vertebrates has been very dependent on mtDNA, but studies involving mtDNA are limited because the several genes comprising the mt-genome are inherited as a single linkage group. The only apparent solution to this problem is to sequence additional genes, each representing a distinct linkage group, so that the resultant gene trees provide independent estimates of the species tree. There exists the need to find novel gene sequences which contain enough phylogenetic information to resolve relationships between closely related species. A possible source is the nuclear-encoded introns, because they evolve more rapidly than exons. We designed primers to amplify and sequence the 7 intron from the beta-fibrinogen gene for a recently evolved group, the woodpeckers. We sequenced the entire intron for 10 specimens representing five species. Nucleotide substitutions are randomly distributed along the length of the intron, suggesting selective neutrality. A preliminary analysis indicates that the phylogenetic signal in the intron is as strong as that in the mitochondrial encoded cytochrome b (cyt b) gene. The topology of the beta-fibrinogen tree is identical to that of the cyt b tree. This analysis demonstrates the ability of the 7 intron of beta-fibrinogen to provide well resolved, independent gene trees for recently evolved groups and establishes it as a source of sequences to be used in other phylogenetic studies. Copyright 1997 Academic Press
Ichikawa, Shoji; Tuchman, Shamir; Padgett, Leah R.; Gray, Amie K.; Baluarte, H. Jorge; Econs, Michael J.
2013-01-01
Hereditary hypophosphatemic rickets with hypercalciuria (HHRH) is a rare metabolic disorder, characterized by hypophosphatemia, variable degrees of rickets/osteomalacia, and hypercalciuria secondary to increased serum 1,25-dihydroxyvitamin D [1,25(OH)2D] levels. HHRH is caused by mutations in the SLC34A3 gene, which encodes sodium-phosphate co-transporter type IIc. A 6 ½-year-old female presented with a history of nephrolithiasis. Her metabolic evaluation revealed increased 24- hour urine calcium excretion with high serum calcium, low intact parathyroid hormone (PTH) levels, and elevated 1,25(OH)2D level. In addition, the patient had low to low-normal serum phosphorus with high urine phosphorus. The patient had normal stature; without rachitic or boney deformities or a history of fractures. Genetic analysis of SLC34A3 revealed the patient to be a compound heterozygote for a novel single base pair deletion in exon 12 (c.1304delG) and 30-base pair deletion in intron 6 (g.1440–1469del). The single-base pair mutation causes a frameshift, which results in premature stop codon. The intronic deletion is likely caused by misalignment of the 4-basepair homologous repeats and results in the truncation of an already small intron to 63 bp, which would impair proper RNA splicing of the intron. This is the fourth unique intronic deletion identified in patients with HHRH, suggesting the frequent occurrence of sequence misalignments in SLC34A3 and the importance of screening introns in patients with HHRH. PMID:24176905
Ichikawa, Shoji; Tuchman, Shamir; Padgett, Leah R; Gray, Amie K; Baluarte, H Jorge; Econs, Michael J
2014-02-01
Hereditary hypophosphatemic rickets with hypercalciuria (HHRH) is a rare metabolic disorder, characterized by hypophosphatemia, variable degrees of rickets/osteomalacia, and hypercalciuria secondary to increased serum 1,25-dihydroxyvitamin D [1,25(OH)2D] levels. HHRH is caused by mutations in the SLC34A3 gene, which encodes sodium-phosphate co-transporter type IIc. A 6-1/2-year-old female presented with a history of nephrolithiasis. Her metabolic evaluation revealed increased 24-hour urine calcium excretion with high serum calcium, low intact parathyroid hormone (PTH), and elevated 1,25(OH)2D. In addition, the patient had low to low-normal serum phosphorus with high urine phosphorus. The patient had normal stature; without rachitic or boney deformities or a history of fractures. Genetic analysis of SLC34A3 revealed the patient to be a compound heterozygote for a novel single base pair deletion in exon 12 (c.1304delG) and 30-base pair deletion in intron 6 (g.1440-1469del). The single-base pair mutation causes a frameshift, which results in premature stop codon. The intronic deletion is likely caused by misalignment of the 4-basepair homologous repeats and results in the truncation of an already small intron to 63bp, which would impair proper RNA splicing of the intron. This is the fourth unique intronic deletion identified in patients with HHRH, suggesting the frequent occurrence of sequence misalignments in SLC34A3 and the importance of screening introns in patients with HHRH. © 2013.
Jangi, Mohini; Fleet, Christina; Cullen, Patrick; Gupta, Shipra V.; Mekhoubad, Shila; Chiao, Eric; Allaire, Norm; Bennett, C. Frank; Rigo, Frank; Krainer, Adrian R.; Hurt, Jessica A.; Carulli, John P.; Staropoli, John F.
2017-01-01
Spinal muscular atrophy (SMA), an autosomal recessive neuromuscular disease, is the leading monogenic cause of infant mortality. Homozygous loss of the gene survival of motor neuron 1 (SMN1) causes the selective degeneration of lower motor neurons and subsequent atrophy of proximal skeletal muscles. The SMN1 protein product, survival of motor neuron (SMN), is ubiquitously expressed and is a key factor in the assembly of the core splicing machinery. The molecular mechanisms by which disruption of the broad functions of SMN leads to neurodegeneration remain unclear. We used an antisense oligonucleotide (ASO)-based inducible mouse model of SMA to investigate the SMN-specific transcriptome changes associated with neurodegeneration. We found evidence of widespread intron retention, particularly of minor U12 introns, in the spinal cord of mice 30 d after SMA induction, which was then rescued by a therapeutic ASO. Intron retention was concomitant with a strong induction of the p53 pathway and DNA damage response, manifesting as γ-H2A.X positivity in neurons of the spinal cord and brain. Widespread intron retention and markers of the DNA damage response were also observed with SMN depletion in human SH-SY5Y neuroblastoma cells and human induced pluripotent stem cell-derived motor neurons. We also found that retained introns, high in GC content, served as substrates for the formation of transcriptional R-loops. We propose that defects in intron removal in SMA promote DNA damage in part through the formation of RNA:DNA hybrid structures, leading to motor neuron death. PMID:28270613
Xu, Tao; Kim, Bo Mi; Kwak, Kyung Jin; Jung, Hyun Ju; Kang, Hunseung
2016-05-01
The minor U12 introns are removed from precursor mRNAs by the U12 intron-specific minor spliceosome. Among the seven ribonucleoproteins unique to the minor spliceosome, denoted as U11/U12-20K, U11/U12-25K, U11/U12-31K, U11/U12-65K, U11-35K, U11-48K, and U11-59K, the roles of only U11/U12-31K and U11/U12-65K have been demonstrated in U12 intron splicing and plant development. Here, the functional role of the Arabidopsis homolog of human U11-48K in U12 intron splicing and the development of Arabidopsis thaliana was examined using transgenic knockdown plants. The u11-48k mutants exhibited several defects in growth and development, such as severely arrested primary inflorescence stems, formation of serrated leaves, production of many rosette leaves after bolting, and delayed senescence. The splicing of most U12 introns analyzed was impaired in the u11-48k mutants. Comparative analysis of the splicing defects and phenotypes among the u11/u12-31k, u11-48k, and u11/12-65k mutants showed that the severity of abnormal development was closely correlated with the degree of impairment in U12 intron splicing. Taken together, these results provide compelling evidence that the Arabidopsis homolog of human U11-48K protein, as well as U11/U12-31K and U11/U12-65K proteins, is necessary for correct splicing of U12 introns and normal plant growth and development. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Jurado, Juan; Fuentes-Almagro, Carlos A; Prieto-Alamo, María J; Pueyo, Carmen
2007-09-21
Alternative splicing is a widespread mechanism of gene expression regulation. Previous analyses based on conventional RT-PCR reported the presence of an unspliced c-fos transcript in several mammalian systems. Compared to the well-defined knowledge on the alternative splicing of fosB, the physiological relevance of the unspliced c-fos transcript in regulating c-fos expression remains largely unknown. This work aimed to investigate the functional significance of the alternative splicing c-fos pre-mRNA. A set of primers was designed to demonstrate that, whereas introns 1 and 2 are regularly spliced from primary c-fos transcript, intron 3 remains unspliced in part of total transcript molecules. Here, the two species are referred to as c-fos-2 (+ intron 3) and spliced c-fos (- intron 3) transcripts. Then, we used a quantitatively rigorous approach based on real-time PCR to provide, for the first time, the actual steady-state copy numbers of the two c-fos transcripts. We tested how the mouse-organ context and mouse-gestational age, the synthesis and turnover rates of the investigated transcripts, and the serum stimulation of quiescent cells modulate their absolute-expression profiles. Intron 3 generates an in-frame premature termination codon that predicts the synthesis of a truncated c-Fos protein. This prediction was evaluated by immunoaffinity chromatography purification of c-Fos proteins. We demonstrate that: (i) The c-fos-2 transcript is ubiquitously synthesized either in vivo or in vitro, in amounts that are higher or similar to those of mRNAs coding for other Fos family members, like FosB, DeltaFosB, Fra-1 or Fra-2. (ii) Intron 3 confers to c-fos-2 an outstanding destabilizing effect of about 6-fold. (iii) Major determinant of c-fos-2 steady-state levels in cultured cells is its remarkably high rate of synthesis. (iv) Rapid changes in the synthesis and/or degradation rates of both c-fos transcripts in serum-stimulated cells give rise to rapid and transient changes in their relative proportions. Taken as a whole, these findings suggest a co-ordinated fine-tune of the two c-fos transcript species, supporting the notion that the alternative processing of the precursor mRNA might be physiologically relevant. Moreover, we detected a c-Fos immunoreactive species corresponding in mobility to the predicted truncated variant.
DIP1 modulates stem cell homeostasis in Drosophila through regulation of sisR-1.
Wong, Jing Ting; Akhbar, Farzanah; Ng, Amanda Yunn Ee; Tay, Mandy Li-Ian; Loi, Gladys Jing En; Pek, Jun Wei
2017-10-02
Stable intronic sequence RNAs (sisRNAs) are by-products of splicing and regulate gene expression. How sisRNAs are regulated is unclear. Here we report that a double-stranded RNA binding protein, Disco-interacting protein 1 (DIP1) regulates sisRNAs in Drosophila. DIP1 negatively regulates the abundance of sisR-1 and INE-1 sisRNAs. Fine-tuning of sisR-1 by DIP1 is important to maintain female germline stem cell homeostasis by modulating germline stem cell differentiation and niche adhesion. Drosophila DIP1 localizes to a nuclear body (satellite body) and associates with the fourth chromosome, which contains a very high density of INE-1 transposable element sequences that are processed into sisRNAs. DIP1 presumably acts outside the satellite bodies to regulate sisR-1, which is not on the fourth chromosome. Thus, our study identifies DIP1 as a sisRNA regulatory protein that controls germline stem cell self-renewal in Drosophila.Stable intronic sequence RNAs (sisRNAs) are by-products of splicing from introns with roles in embryonic development in Drosophila. Here, the authors show that the RNA binding protein DIP1 regulates sisRNAs in Drosophila, which is necessary for germline stem cell homeostasis.
Chen, Fei; Zhou, Yu; Qi, Yingchuan B.; Khivansara, Vishal; Li, Hairi; Chun, Sang Young; Kim, John K.; Fu, Xiang-Dong; Jin, Yishi
2015-01-01
Alternative polyadenylation (APA) is widespread in neuronal development and activity-mediated neural plasticity. However, the underlying molecular mechanisms are largely unknown. We used systematic genetic studies and genome-wide surveys of the transcriptional landscape to identify a context-dependent regulatory pathway controlling APA in the Caenorhabditis elegans nervous system. Loss of function in ssup-72, a Ser5 phosphatase for the RNA polymerase II (Pol II) C-terminal domain (CTD), dampens transcription termination at a strong intronic polyadenylation site (PAS) in unc-44/ankyrin yet promotes termination at the weak intronic PAS of the MAP kinase dlk-1. A nuclear protein, SYDN-1, which regulates neuronal development, antagonizes the function of SSUP-72 and several nuclear polyadenylation factors. This regulatory pathway allows the production of a neuron-specific isoform of unc-44 and an inhibitory isoform of dlk-1. Dysregulation of the unc-44 and dlk-1 mRNA isoforms in sydn-1 mutants impairs neuronal development. Deleting the intronic PAS of unc-44 results in increased pre-mRNA processing of neuronal ankyrin and suppresses sydn-1 mutants. These results reveal a mechanism by which regulation of CTD phosphorylation controls coding region APA in the nervous system. PMID:26588990
High variability of mitochondrial gene order among fungi.
Aguileta, Gabriela; de Vienne, Damien M; Ross, Oliver N; Hood, Michael E; Giraud, Tatiana; Petit, Elsa; Gabaldón, Toni
2014-02-01
From their origin as an early alpha proteobacterial endosymbiont to their current state as cellular organelles, large-scale genomic reorganization has taken place in the mitochondria of all main eukaryotic lineages. So far, most studies have focused on plant and animal mitochondrial (mt) genomes (mtDNA), but fungi provide new opportunities to study highly differentiated mtDNAs. Here, we analyzed 38 complete fungal mt genomes to investigate the evolution of mtDNA gene order among fungi. In particular, we looked for evidence of nonhomologous intrachromosomal recombination and investigated the dynamics of gene rearrangements. We investigated the effect that introns, intronic open reading frames (ORFs), and repeats may have on gene order. Additionally, we asked whether the distribution of transfer RNAs (tRNAs) evolves independently to that of mt protein-coding genes. We found that fungal mt genomes display remarkable variation between and within the major fungal phyla in terms of gene order, genome size, composition of intergenic regions, and presence of repeats, introns, and associated ORFs. Our results support previous evidence for the presence of mt recombination in all fungal phyla, a process conspicuously lacking in most Metazoa. Overall, the patterns of rearrangements may be explained by the combined influences of recombination (i.e., most likely nonhomologous and intrachromosomal), accumulated repeats, especially at intergenic regions, and to a lesser extent, mobile element dynamics.
Qin, Yidan; Yao, Jun; Wu, Douglas C.; Nottingham, Ryan M.; Mohr, Sabine; Hunicke-Smith, Scott; Lambowitz, Alan M.
2016-01-01
Next-generation RNA-sequencing (RNA-seq) has revolutionized transcriptome profiling, gene expression analysis, and RNA-based diagnostics. Here, we developed a new RNA-seq method that exploits thermostable group II intron reverse transcriptases (TGIRTs) and used it to profile human plasma RNAs. TGIRTs have higher thermostability, processivity, and fidelity than conventional reverse transcriptases, plus a novel template-switching activity that can efficiently attach RNA-seq adapters to target RNA sequences without RNA ligation. The new TGIRT-seq method enabled construction of RNA-seq libraries from <1 ng of plasma RNA in <5 h. TGIRT-seq of RNA in 1-mL plasma samples from a healthy individual revealed RNA fragments mapping to a diverse population of protein-coding gene and long ncRNAs, which are enriched in intron and antisense sequences, as well as nearly all known classes of small ncRNAs, some of which have never before been seen in plasma. Surprisingly, many of the small ncRNA species were present as full-length transcripts, suggesting that they are protected from plasma RNases in ribonucleoprotein (RNP) complexes and/or exosomes. This TGIRT-seq method is readily adaptable for profiling of whole-cell, exosomal, and miRNAs, and for related procedures, such as HITS-CLIP and ribosome profiling. PMID:26554030
Liu, H X; Goodall, G J; Kole, R; Filipowicz, W
1995-01-16
We have performed a systematic study of the effect of artificial hairpins on pre-mRNA splicing in protoplasts of a dicot plant, Nicotiana plumbaginifolia. Hairpins with a potential to form 18 or 24 bp stems strongly inhibit splicing when they sequester the 5' splice site or are placed in the middle of short introns. However, similar 24 bp hairpins sequestering the 3' splice site do not prevent this site from being used as an acceptor. Utilization of the stem-located 3' site requires that the base of the stem is separated from the upstream 5' splice site by a minimum of approximately 45 nucleotides and that another 'helper' 3' splice site is present downstream of the stem. The results indicate that the spliceosome or factors associated with it may have a potential to unfold secondary structure present in the downstream portion of the intron, prior to or at the step of the 3' splice site selection. The finding that the helper 3' site is required for utilization of the stem-located acceptor confirms and extends previous observations, obtained with HeLa cell in vitro splicing systems, indicating that the 3' splice site may be recognized at least twice during spliceosome assembly.
[Analysis of chloroplast rpS16 intron sequences in Lemnaceae].
Martirosian, E V; Ryzhova, N N; Kochieva, E Z; Skriabin, K G
2009-01-01
Chloroplast rpS16 gene intron sequences were determined and characterized for twenty-five Lemnaceae accessions representing nine duckweed species. For each Lemnaceae species nucleotide substitutions and for Lemna minor, Lemna aequinoctialis, Wolffia arrhiza different indels were detected. Most of indels were found for Wolffia arrhiza and Lemna aequinoctialis. The analyses of intraspecific polymorphism resulted in identification of several gaplotypes in L. gibba and L. trisulca. Lemnaceae phylogenetic relationship based on rpS16 intron variability data has revealed significant differences between L. aequinoctialis and other Lemna species. Genetic distance values corroborated competence of Landoltia punctata separations from Spirodela into an independent generic taxon. The acceptability of rpS16 intron sequences for phylogenetic studies in Lemnaceae was shown.
Jayashree, B; Jagadeesh, V T; Hoisington, D
2008-05-01
The availability of complete, annotated genomic sequence information in model organisms is a rich resource that can be extended to understudied orphan crops through comparative genomic approaches. We report here a software tool (cisprimertool) for the identification of conserved intron scanning regions using expressed sequence tag alignments to a completely sequenced model crop genome. The method used is based on earlier studies reporting the assessment of conserved intron scanning primers (called CISP) within relatively conserved exons located near exon-intron boundaries from onion, banana, sorghum and pearl millet alignments with rice. The tool is freely available to academic users at http://www.icrisat.org/gt-bt/CISPTool.htm. © 2007 ICRISAT.
Metazoan tRNA introns generate stable circular RNAs in vivo
Lu, Zhipeng; Filonov, Grigory S.; Noto, John J.; Schmidt, Casey A.; Hatkevich, Talia L.; Wen, Ying; Jaffrey, Samie R.; Matera, A. Gregory
2015-01-01
We report the discovery of a class of abundant circular noncoding RNAs that are produced during metazoan tRNA splicing. These transcripts, termed tRNA intronic circular (tric)RNAs, are conserved features of animal transcriptomes. Biogenesis of tricRNAs requires anciently conserved tRNA sequence motifs and processing enzymes, and their expression is regulated in an age-dependent and tissue-specific manner. Furthermore, we exploited this biogenesis pathway to develop an in vivo expression system for generating “designer” circular RNAs in human cells. Reporter constructs expressing RNA aptamers such as Spinach and Broccoli can be used to follow the transcription and subcellular localization of tricRNAs in living cells. Owing to the superior stability of circular vs. linear RNA isoforms, this expression system has a wide range of potential applications, from basic research to pharmaceutical science. PMID:26194134
tRNA nuclear export in saccharomyces cerevisiae: in situ hybridization analysis.
Sarkar, S; Hopper, A K
1998-11-01
To understand the factors specifically affecting tRNA nuclear export, we adapted in situ hybridization procedures to locate endogenous levels of individual tRNA families in wild-type and mutant yeast cells. Our studies of tRNAs encoded by genes lacking introns show that nucleoporin Nup116p affects both poly(A) RNA and tRNA export, whereas Nup159p affects only poly(A) RNA export. Los1p is similar to exportin-t, which facilitates vertebrate tRNA export. A los1 deletion mutation affects tRNA but not poly(A) RNA export. The data support the notion that Los1p and exportin-t are functional homologues. Because LOS1 is nonessential, tRNA export in vertebrate and yeast cells likely involves factors in addition to exportin-t. Mutation of RNA1, which encodes RanGAP, causes nuclear accumulation of tRNAs and poly(A) RNA. Many yeast mutants, including those with the rna1-1 mutation, affect both pre-tRNA splicing and RNA export. Our studies of the location of intron-containing pre-tRNAs in the rna1-1 mutant rule out the possibility that this results from tRNA export occurring before splicing. Our results also argue against inappropriate subnuclear compartmentalization causing defects in pre-tRNA splicing. Rather, the data support "feedback" of nucleus/cytosol exchange to the pre-tRNA splicing machinery.
tRNA Nuclear Export in Saccharomyces cerevisiae: In Situ Hybridization Analysis
Sarkar, Srimonti; Hopper, Anita K.
1998-01-01
To understand the factors specifically affecting tRNA nuclear export, we adapted in situ hybridization procedures to locate endogenous levels of individual tRNA families in wild-type and mutant yeast cells. Our studies of tRNAs encoded by genes lacking introns show that nucleoporin Nup116p affects both poly(A) RNA and tRNA export, whereas Nup159p affects only poly(A) RNA export. Los1p is similar to exportin-t, which facilitates vertebrate tRNA export. A los1 deletion mutation affects tRNA but not poly(A) RNA export. The data support the notion that Los1p and exportin-t are functional homologues. Because LOS1 is nonessential, tRNA export in vertebrate and yeast cells likely involves factors in addition to exportin-t. Mutation of RNA1, which encodes RanGAP, causes nuclear accumulation of tRNAs and poly(A) RNA. Many yeast mutants, including those with the rna1-1 mutation, affect both pre-tRNA splicing and RNA export. Our studies of the location of intron-containing pre-tRNAs in the rna1-1 mutant rule out the possibility that this results from tRNA export occurring before splicing. Our results also argue against inappropriate subnuclear compartmentalization causing defects in pre-tRNA splicing. Rather, the data support “feedback” of nucleus/cytosol exchange to the pre-tRNA splicing machinery. PMID:9802895
Fisher, S E; van Bakel, I; Lloyd, S E; Pearce, S H; Thakker, R V; Craig, I W
1995-10-10
Dent disease, an X-linked familial renal tubular disorder, is a form of Fanconi syndrome associated with proteinuria, hypercalciuria, nephrocalcinosis, kidney stones, and eventual renal failure. We have previously used positional cloning to identify the 3' part of a novel kidney-specific gene (initially termed hClC-K2, but now referred to as CLCN5), which is deleted in patients from one pedigree segregating Dent disease. Mutations that disrupt this gene have been identified in other patients with this disorder. Here we describe the isolation and characterization of the complete open reading frame of the human CLCN5 gene, which is predicted to encode a protein of 746 amino acids, with significant homology to all known members of the ClC family of voltage-gated chloride channels. CLCN5 belongs to a distinct branch of this family, which also includes the recently identified genes CLCN3 and CLCN4. We have shown that the coding region of CLCN5 is organized into 12 exons, spanning 25-30 kb of genomic DNA, and have determined the sequence of each exon-intron boundary. The elucidation of the coding sequence and exon-intron organization of CLCN5 will both expedite the evaluation of structure/function relationships of these ion channels and facilitate the screening of other patients with renal tubular dysfunction for mutations at this locus.
Genetic association of ubiquilin with Alzheimer's disease and related quantitative measures.
Kamboh, M I; Minster, R L; Feingold, E; DeKosky, S T
2006-03-01
The gene coding for ubiquilin 1 (UBQLN1) is located near a linkage peak on chromosome 9q22.2 and it also impacts the function of presenilin proteins involved in early-onset Alzheimer's disease (AD). Recently, genetic variation in UBQLN1 has been shown to affect the risk of AD in two independent family-based samples. The purpose of this study was to confirm the reported association in a large case-control sample and to also examine the association of UBQLN1 SNPs with quantitative measures of AD progression, namely age-at-onset (AAO), disease duration and Mini-Mental State Examination (MMSE) score. We examined the associations of three SNPs in the UBQLN1 gene (intron 6/A>C, intron 8/T>C and intron 9/A>G) in up to 978 LOAD cases and 808 controls. All SNPs were in significant linkage disequilibrium (P<0.0001). While modest significant associations were observed in the single-site regression analysis, 3-site haplotype analysis revealed significant associations (P<0.0001 for overall haplotype analysis). One common haplotype (H4) defined by intron 6/A-intron 8/C-intron 9/G alleles was associated with AD risk and one less common haplotype (H5) defined by intron 6/C-intron 8/C-intron 9/A alleles was associated with protection. The adjusted odds ratios with potentially one and two copies of risk haplotype H4 were 1.5 (95% CI: 0.99-2.26; P=0.054) and 3.66 (95% CI: 1.43-9.39; P=0.007), respectively, and odds ratio for haplotype H5 carriers was 0.31 (95% CI: 0.10-0.95; P=0.0398). In addition to disease risk, the homozygosity of the risk haplotype was also associated with older AAO, longer disease duration and lower MMSE score. In summary, our data from a large case-control cohort indicate that genetic variation in the UBQLN1 gene has a modest effect on risk, AAO and disease duration of AD. Our haplotype data suggest the presence of additional putative functional variants either in the UBQLN1 gene or nearby genes and provide strong justification for additional work in this region on chromosome 9.
2013-01-01
Background Intronic and intergenic long noncoding RNAs (lncRNAs) are emerging gene expression regulators. The molecular pathogenesis of renal cell carcinoma (RCC) is still poorly understood, and in particular, limited studies are available for intronic lncRNAs expressed in RCC. Methods Microarray experiments were performed with custom-designed arrays enriched with probes for lncRNAs mapping to intronic genomic regions. Samples from 18 primary RCC tumors and 11 nontumor adjacent matched tissues were analyzed. Meta-analyses were performed with microarray expression data from three additional human tissues (normal liver, prostate tumor and kidney nontumor samples), and with large-scale public data for epigenetic regulatory marks and for evolutionarily conserved sequences. Results A signature of 29 intronic lncRNAs differentially expressed between RCC and nontumor samples was obtained (false discovery rate (FDR) <5%). A signature of 26 intronic lncRNAs significantly correlated with the RCC five-year patient survival outcome was identified (FDR <5%, p-value ≤0.01). We identified 4303 intronic antisense lncRNAs expressed in RCC, of which 22% were significantly (p <0.05) cis correlated with the expression of the mRNA in the same locus across RCC and three other human tissues. Gene Ontology (GO) analysis of those loci pointed to 'regulation of biological processes’ as the main enriched category. A module map analysis of the protein-coding genes significantly (p <0.05) trans correlated with the 20% most abundant lncRNAs, identified 51 enriched GO terms (p <0.05). We determined that 60% of the expressed lncRNAs are evolutionarily conserved. At the genomic loci containing the intronic RCC-expressed lncRNAs, a strong association (p <0.001) was found between their transcription start sites and genomic marks such as CpG islands, RNA Pol II binding and histones methylation and acetylation. Conclusion Intronic antisense lncRNAs are widely expressed in RCC tumors. Some of them are significantly altered in RCC in comparison with nontumor samples. The majority of these lncRNAs is evolutionarily conserved and possibly modulated by epigenetic modifications. Our data suggest that these RCC lncRNAs may contribute to the complex network of regulatory RNAs playing a role in renal cell malignant transformation. PMID:24238219
Filichkin, Sergei A.; Hamilton, Michael; Dharmawardhana, Palitha D.; Singh, Sunil K.; Sullivan, Christopher; Ben-Hur, Asa; Reddy, Anireddy S. N.; Jaiswal, Pankaj
2018-01-01
Abiotic stresses affect plant physiology, development, growth, and alter pre-mRNA splicing. Western poplar is a model woody tree and a potential bioenergy feedstock. To investigate the extent of stress-regulated alternative splicing (AS), we conducted an in-depth survey of leaf, root, and stem xylem transcriptomes under drought, salt, or temperature stress. Analysis of approximately one billion of genome-aligned RNA-Seq reads from tissue- or stress-specific libraries revealed over fifteen millions of novel splice junctions. Transcript models supported by both RNA-Seq and single molecule isoform sequencing (Iso-Seq) data revealed a broad array of novel stress- and/or tissue-specific isoforms. Analysis of Iso-Seq data also resulted in the discovery of 15,087 novel transcribed regions of which 164 show AS. Our findings demonstrate that abiotic stresses profoundly perturb transcript isoform profiles and trigger widespread intron retention (IR) events. Stress treatments often increased or decreased retention of specific introns – a phenomenon described here as differential intron retention (DIR). Many differentially retained introns were regulated in a stress- and/or tissue-specific manner. A subset of transcripts harboring super stress-responsive DIR events showed persisting fluctuations in the degree of IR across all treatments and tissue types. To investigate coordinated dynamics of intron-containing transcripts in the study we quantified absolute copy number of isoforms of two conserved transcription factors (TFs) using Droplet Digital PCR. This case study suggests that stress treatments can be associated with coordinated switches in relative ratios between fully spliced and intron-retaining isoforms and may play a role in adjusting transcriptome to abiotic stresses. PMID:29483921
Screening of Variations in CD22 Gene in Children with B-Precursor Acute Lymphoblastic Leukemia.
Aslar Oner, Deniz; Akin, Dilara Fatma; Sipahi, Kadir; Mumcuoglu, Mine; Ezer, Ustun; Kürekci, A Emin; Akar, Nejat
2016-09-01
CD22 is expressed on the surface of B-cell lineage cells from the early progenitor stage of pro-B cell until terminal differentiation to mature B cells. It plays a role in signal transduction and as a regulator of B-cell receptor signaling in B-cell development. We aimed to screen exons 9-14 of the CD22 gene, which is a mutational hot spot region in B-precursor acute lymphoblastic leukemia (pre-B ALL) patients, to find possible genetic variants that could play role in the pathogenesis of pre-B ALL in Turkish children. This study included 109 Turkish children with pre-B ALL who were diagnosed at Losante Hospital for Children with Leukemia. Genomic DNA was extracted from both peripheral blood and bone marrow leukocytes. Gene amplification was performed with PCR, and all samples were screened for the variants by single strand conformation polymorphism. Samples showing band shifts were sequenced on an automated sequencer. In our patient group a total of 9 variants were identified in the CD22 gene by sequencing: a novel variant in intron 10 (T2199G); a missense variant in exon 12; 5 intronic variants between exon 12 and intron 13; a novel intronic variant (C2424T); and a synonymous in exon 13. Thirteen of 109 children (11.9%) carried the T2199G novel intronic variant located in intron 10, and 17 of 109 children (15.6%) carried the C2424T novel intronic variant. Novel variants in the CD22 gene in children with pre-B ALL in Turkey that are not present, in the Human Gene Mutation Database or NCBI SNP database, were found.
Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P
2017-03-01
Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5 ' proximal- i ntron- m inus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N 1 -methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N 1 -methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2003-01-01
Mitochondrial DNA (mtDNA) has undergone radical changes during the evolution of green plants, yet little is known about the dynamics of mtDNA evolution in this phylum. Land plant mtDNAs differ from the few green algal mtDNAs that have been analyzed to date by their expanded size, long spacers, and diversity of introns. We have determined the mtDNA sequence of Chara vulgaris (Charophyceae), a green alga belonging to the charophycean order (Charales) that is thought to be the most closely related alga to land plants. This 67,737-bp mtDNA sequence, displaying 68 conserved genes and 27 introns, was compared with those of three angiosperms, the bryophyte Marchantia polymorpha, the charophycean alga Chaetosphaeridium globosum (Coleochaetales), and the green alga Mesostigma viride. Despite important differences in size and intron composition, Chara mtDNA strikingly resembles Marchantia mtDNA; for instance, all except 9 of 68 conserved genes lie within blocks of colinear sequences. Overall, our genome comparisons and phylogenetic analyses provide unequivocal support for a sister-group relationship between the Charales and the land plants. Only four introns in land plant mtDNAs appear to have been inherited vertically from a charalean algar ancestor. We infer that the common ancestor of green algae and land plants harbored a tightly packed, gene-rich, and relatively intron-poor mitochondrial genome. The group II introns in this ancestral genome appear to have spread to new mtDNA sites during the evolution of bryophytes and charalean green algae, accounting for part of the intron diversity found in Chara and land plant mitochondria. PMID:12897260
Turmel, Monique; Otis, Christian; Lemieux, Claude
2002-01-01
The land plants and their immediate green algal ancestors, the charophytes, form the Streptophyta. There is evidence that both the chloroplast DNA (cpDNA) and mitochondrial DNA (mtDNA) underwent substantial changes in their architecture (intron insertions, gene losses, scrambling in gene order, and genome expansion in the case of mtDNA) during the evolution of streptophytes; however, because no charophyte organelle DNAs have been sequenced completely thus far, the suite of events that shaped streptophyte organelle genomes remains largely unknown. Here, we have determined the complete cpDNA (131,183 bp) and mtDNA (56,574 bp) sequences of the charophyte Chaetosphaeridium globosum (Coleochaetales). At the levels of gene content (124 genes), intron composition (18 introns), and gene order, Chaetosphaeridium cpDNA is remarkably similar to land-plant cpDNAs, implying that most of the features characteristic of land-plant lineages were gained during the evolution of charophytes. Although the gene content of Chaetosphaeridium mtDNA (67 genes) closely resembles that of the bryophyte Marchantia polymorpha (69 genes), this charophyte mtDNA differs substantially from its land-plant relatives at the levels of size, intron composition (11 introns), and gene order. Our finding that it shares only one intron with its land-plant counterparts supports the idea that the vast majority of mitochondrial introns in land plants appeared after the emergence of these organisms. Our results also suggest that the events accounting for the spacious intergenic spacers found in land-plant mtDNAs took place late during the evolution of charophytes or coincided with the transition from charophytes to land plants. PMID:12161560
Kwak, Kyung Jin; Jung, Hyun Ju; Lee, Kwang Ho; Kim, Young Soon; Kim, Won Yong; Ahn, Sung Ju; Kang, Hunseung
2012-01-01
U12 intron-specific spliceosomes contain U11 and U12 small nuclear ribonucleoproteins and mediate the removal of U12 introns from precursor-mRNAs. Among the several proteins unique to the U12-type spliceosomes, an Arabidopsis thaliana AtU11/U12-31K protein has been shown to be indispensible for proper U12 intron splicing and for normal growth and development of Arabidopsis plants. Here, we assessed the functional roles of the rice (Oryza sativa) OsU11/U12-31K protein in U12 intron splicing and development of plants. The U11/U12-31K transcripts were abundantly expressed in the shoot apical meristems (SAMs) of Arabidopsis and rice. Ectopic expression of OsU11/U12-31K in AtU11/U12-31K-defecient Arabidopsis mutant complemented the incorrect U12 intron splicing and abnormal development phenotypes of the Arabidopsis mutant plants. Impaired cell division activity in the SAMs and inflorescence stems observed in the AtU11/U12-31K-deficient mutant was completely recovered to normal by the expression of OsU11/U12-31K. Similar to Arabidopsis AtU11/U12-31K, rice OsU11/U12-31K was determined to harbor RNA chaperone activity. Collectively, the present findings provide evidence for the emerging idea that the U11/U12-31K protein is an indispensible RNA chaperone that functions in U12 intron splicing and is necessary for normal development of monocotyledonous plants as well as dicotyledonous plants.
Rebscher, Nicole; Deichmann, Christina; Sudhop, Stefanie; Fritzenwanker, Jens Holger; Green, Stephen; Hassel, Monika
2009-10-01
We have analyzed the evolution of fibroblast growth factor receptor (FGFR) tyrosine kinase genes throughout a wide range of animal phyla. No evidence for an FGFR gene was found in Porifera, but we tentatively identified an FGFR gene in the placozoan Trichoplax adhaerens. The gene encodes a protein with three immunoglobulin-like domains, a single-pass transmembrane, and a split tyrosine kinase domain. By superimposing intron positions of 20 FGFR genes from Placozoa, Cnidaria, Protostomia, and Deuterostomia over the respective protein domain structure, we identified ten ancestral introns and three conserved intron groups. Our analysis shows (1) that the position of ancestral introns correlates to the modular structure of FGFRs, (2) that the acidic domain very likely evolved in the last common ancestor of triploblasts, (3) that splicing of IgIII was enabled by a triploblast-specific insertion, and (4) that IgI is subject to substantial loss or duplication particularly in quickly evolving genomes. Moreover, intron positions in the catalytic domain of FGFRs map to the borders of protein subdomains highly conserved in other serine/threonine kinases. Nevertheless, these introns were introduced in metazoan receptor tyrosine kinases exclusively. Our data support the view that protein evolution dating back to the Cambrian explosion took place in such a short time window that only subtle changes in the domain structure are detectable in extant representatives of animal phyla. We propose that the first multidomain FGFR originated in the last common ancestor of Placozoa, Cnidaria, and Bilateria. Additional domains were introduced mainly in the ancestor of triploblasts and in the Ecdysozoa.
Leliaert, Frederik; Marcelino, Vanessa R
2018-01-01
Abstract Chloroplast genomes have undergone tremendous alterations through the evolutionary history of the green algae (Chloroplastida). This study focuses on the evolution of chloroplast genomes in the siphonous green algae (order Bryopsidales). We present five new chloroplast genomes, which along with existing sequences, yield a data set representing all but one families of the order. Using comparative phylogenetic methods, we investigated the evolutionary dynamics of genomic features in the order. Our results show extensive variation in chloroplast genome architecture and intron content. Variation in genome size is accounted for by the amount of intergenic space and freestanding open reading frames that do not show significant homology to standard plastid genes. We show the diversity of these nonstandard genes based on their conserved protein domains, which are often associated with mobile functions (reverse transcriptase/intron maturase, integrases, phage- or plasmid-DNA primases, transposases, integrases, ligases). Investigation of the introns showed proliferation of group II introns in the early evolution of the order and their subsequent loss in the core Halimedineae, possibly through RT-mediated intron loss. PMID:29635329
Naro, Chiara; Jolly, Ariane; Di Persio, Sara; Bielli, Pamela; Setterblad, Niclas; Alberdi, Antonio J; Vicini, Elena; Geremia, Raffaele; De la Grange, Pierre; Sette, Claudio
2017-04-10
Global transcriptome reprogramming during spermatogenesis ensures timely expression of factors in each phase of male germ cell differentiation. Spermatocytes and spermatids require particularly extensive reprogramming of gene expression to switch from mitosis to meiosis and to support gamete morphogenesis. Here, we uncovered an extensive alternative splicing program during this transmeiotic differentiation. Notably, intron retention was largely the most enriched pattern, with spermatocytes showing generally higher levels of retention compared with spermatids. Retained introns are characterized by weak splice sites and are enriched in genes with strong relevance for gamete function. Meiotic intron-retaining transcripts (IRTs) were exclusively localized in the nucleus. However, differently from other developmentally regulated IRTs, they are stable RNAs, showing longer half-life than properly spliced transcripts. Strikingly, fate-mapping experiments revealed that IRTs are recruited onto polyribosomes days after synthesis. These studies reveal an unexpected function for regulated intron retention in modulation of the timely expression of select transcripts during spermatogenesis. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
An intronic microRNA silences genes that are functionally antagonistic to its host gene.
Barik, Sailen
2008-09-01
MicroRNAs (miRNAs) are short noncoding RNAs that down-regulate gene expression by silencing specific target mRNAs. While many miRNAs are transcribed from their own genes, nearly half map within introns of 'host' genes, the significance of which remains unclear. We report that transcriptional activation of apoptosis-associated tyrosine kinase (AATK), essential for neuronal differentiation, also generates miR-338 from an AATK gene intron that silences a family of mRNAs whose protein products are negative regulators of neuronal differentiation. We conclude that an intronic miRNA, transcribed together with the host gene mRNA, may serve the interest of its host gene by silencing a cohort of genes that are functionally antagonistic to the host gene itself.
Bottomless barrel-sponge species in the Indo-Pacific?
Setiawan, Edwin; Voogd, Nicole J De; Wörheide, Gert; Erpenbeck, Dirk
2016-07-06
The use of nuclear markers, in addition to traditional mitochondrial markers, helps to clarify hidden patterns of genetic structure in natural populations (Palumbi & Baker, 1994). This is particularly evident among demosponges that possess slow mitochondrial evolutionary rates compared to Bilateria, where nuclear intron markers can aid in the understanding of shallow level phylogenetic relationships (Shearer et al., 2002). Ideally, these nuclear markers (i) are evolutionary well-conserved across different lineages, (ii) produce amplicons holding a number of sites with sufficient variability to answer the relevant phylogenetic question, (iii) derive from single copy genes (see review in Zhang & Hewitt, 2003). A popular method to amplify intron markers uses EPIC (Exon-Primed, Intron-Crossing) primers that anneal to the more conserved flanking exon regions and subsequently bridge the intron during amplification (Palumbi & Baker, 1994).
Costa, José Hélio; de Melo, Dirce Fernandes; Gouveia, Zélia; Cardoso, Hélia Guerra; Peixe, Augusto; Arnholdt-Schmitt, Birgit
2009-12-01
'Genomic design' refers to the structural organization of gene sequences. Recently, the role of intron sequences for gene regulation is being better understood. Further, introns possess high rates of polymorphism that are considered as the major source for speciation. In molecular breeding, the length of gene-specific introns is recognized as a tool to discriminate genotypes with diverse traits of agronomic interest. 'Economy selection' and 'time-economy selection' have been proposed as models for explaining why highly expressed genes typically contain small introns. However, in contrast to these theories, plant-specific selection reveals that highly expressed genes contain introns that are large. In the presented research, 'wet'Aox gene identification from grapevine is advanced by a bioinformatics approach to study the species-specific organization of Aox gene structures in relation to available expressed sequence tag (EST) data. Two Aox1 and one Aox2 gene sequences have been identified in Vitis vinifera using grapevine cultivars from Portugal and Germany. Searching the complete genome sequence data of two grapevine cultivars confirmed that V. vinifera alternative oxidase (Aox) is encoded by a small multigene family composed of Aox1a, Aox1b and Aox2. An analysis of EST distribution revealed high expression of the VvAox2 gene. A relationship between the atypical long primary transcript of VvAox2 (in comparison to other plant Aox genes) and its expression level is suggested. V. vinifera Aox genes contain four exons interrupted by three introns except for Aox1a which contains an additional intron in the 3'-UTR. The lengths of primary Aox transcripts were estimated for each gene in two V. vinifera varieties: PN40024 and Pinot Noir. In both varieties, Aox1a and Aox1b contained small introns that corresponded to primary transcript lengths ranging from 1501 to 1810 bp. The Aox2 of PN40024 (12 329 bp) was longer than that from Pinot Noir (7279 bp) because of selection against a transposable-element insertion that is 5028 bp in size. An EST database basic local alignment search tool (BLAST) search of GenBank revealed the following ESTs percentages for each gene: Aox1a (26.2%), Aox1b (11.9%) and Aox2 (61.9%). Aox1a was expressed in fruits and roots, Aox1b expression was confined to flowers and Aox2 was ubiquitously expressed. These data for V. vinifera show that atypically long Aox intron lengths are related to high levels of gene expression. Furthermore, it is shown for the first time that two grapevine cultivars can be distinguished by Aox intron length polymorphism.
An RNAi-enhanced Logic Circuit for Cancer Specific Detection and Destruction
2010-07-01
Bcl-2 family: mBax (Mus musculus), hBax ( Homo sapiens ), and its mutant hBax-S184A [4]. A plasmid containing the tested gene was transfected into HEK...the far-red fluorescent protein mKate to express the Gata3 mStaple. Intron- feature sequences – donor site, branch point, poly- pyrimidine tract, and...intron-exon junction. Among the donor and acceptor sequences found in literature our intron features were chosen according SplicePort [5], an
Huang, Yin-Fu; Wang, Chia-Ming; Liou, Sing-Wu
2013-01-01
A hybrid self-adaptive harmony search and back-propagation mining system was proposed to discover weighted patterns in human intron sequences. By testing the weights under a lazy nearest neighbor classifier, the numerical results revealed the significance of these weighted patterns. Comparing these weighted patterns with the popular intron consensus model, it is clear that the discovered weighted patterns make originally the ambiguous 5SS and 3SS header patterns more specific and concrete.
Wang, Chia-Ming; Liou, Sing-Wu
2013-01-01
A hybrid self-adaptive harmony search and back-propagation mining system was proposed to discover weighted patterns in human intron sequences. By testing the weights under a lazy nearest neighbor classifier, the numerical results revealed the significance of these weighted patterns. Comparing these weighted patterns with the popular intron consensus model, it is clear that the discovered weighted patterns make originally the ambiguous 5SS and 3SS header patterns more specific and concrete. PMID:23737711
Sun, Xiaoyong; Wang, Lin; Ding, Jiechao; Wang, Yanru; Wang, Jiansheng; Zhang, Xiaoyang; Che, Yulei; Liu, Ziwei; Zhang, Xinran; Ye, Jiazhen; Wang, Jie; Sablok, Gaurav; Deng, Zhiping; Zhao, Hongwei
2016-10-01
A new regulatory class of small endogenous RNAs called circular RNAs (circRNAs) has been described as miRNA sponges in animals. Using 16 Arabidopsis thaliana RNA-Seq data sets, we identified 803 circRNAs in RNase R-/non-RNase R-treated samples. The results revealed the following features: Canonical and noncanonical splicing can generate circRNAs; chloroplasts are a hotspot for circRNA generation; furthermore, limited complementary sequences exist not only in introns, but also in the sequences flanking splice sites. The latter finding suggests that multiple combinations between complementary sequences may facilitate the formation of the circular structure. Our results contribute to a better understanding of this novel class of plant circRNAs. © 2016 Federation of European Biochemical Societies.
Chu, Yan; Fang, Dong; Hou, Qiao-fang; Wang, Li-ya; Guo, Xi-rang; Wang, Ying-tai; Liao, Shi-xiu
2013-04-01
To identify potential mutations of retinoschisis 1 (RS1) gene responsible for X-linked retinoschisis (XLRS) in two Chinese families. The 6 exons and flanking intronic regions were analyzed with PCR and direct sequencing. Two RS1 mutations were identified in the two families, which included 1 frameshift mutation (c.573delG, p.Pro192fs) and 1 missense mutation (c.626G>A, p.Arg209His). Two RS1 mutations have been identified, among which Pro192fs mutation is discovered for the first time in Chinese population. Above results may enrich our understanding of the clinical manifestations of XLRS and facilitated early diagnosis and genetic counseling for the disease.
Joardar, Vinita; Abrams, Natalie F; Hostetler, Jessica; Paukstelis, Paul J; Pakala, Suchitra; Pakala, Suman B; Zafar, Nikhat; Abolude, Olukemi O; Payne, Gary; Andrianopoulos, Alex; Denning, David W; Nierman, William C
2012-12-12
The genera Aspergillus and Penicillium include some of the most beneficial as well as the most harmful fungal species such as the penicillin-producer Penicillium chrysogenum and the human pathogen Aspergillus fumigatus, respectively. Their mitochondrial genomic sequences may hold vital clues into the mechanisms of their evolution, population genetics, and biology, yet only a handful of these genomes have been fully sequenced and annotated. Here we report the complete sequence and annotation of the mitochondrial genomes of six Aspergillus and three Penicillium species: A. fumigatus, A. clavatus, A. oryzae, A. flavus, Neosartorya fischeri (A. fischerianus), A. terreus, P. chrysogenum, P. marneffei, and Talaromyces stipitatus (P. stipitatum). The accompanying comparative analysis of these and related publicly available mitochondrial genomes reveals wide variation in size (25-36 Kb) among these closely related fungi. The sources of genome expansion include group I introns and accessory genes encoding putative homing endonucleases, DNA and RNA polymerases (presumed to be of plasmid origin) and hypothetical proteins. The two smallest sequenced genomes (A. terreus and P. chrysogenum) do not contain introns in protein-coding genes, whereas the largest genome (T. stipitatus), contains a total of eleven introns. All of the sequenced genomes have a group I intron in the large ribosomal subunit RNA gene, suggesting that this intron is fixed in these species. Subsequent analysis of several A. fumigatus strains showed low intraspecies variation. This study also includes a phylogenetic analysis based on 14 concatenated core mitochondrial proteins. The phylogenetic tree has a different topology from published multilocus trees, highlighting the challenges still facing the Aspergillus systematics. The study expands the genomic resources available to fungal biologists by providing mitochondrial genomes with consistent annotations for future genetic, evolutionary and population studies. Despite the conservation of the core genes, the mitochondrial genomes of Aspergillus and Penicillium species examined here exhibit significant amount of interspecies variation. Most of this variation can be attributed to accessory genes and mobile introns, presumably acquired by horizontal gene transfer of mitochondrial plasmids and intron homing.
Liu, Yanbin; Yap, Sihui Amy; Koh, Chong Mei John; Ji, Lianghui
2016-11-25
Red yeast species in the Rhodotorula/Rhodosporidium genus are outstanding producers of triacylglyceride and cell biomass. Metabolic engineering is expected to further enhance the productivity and versatility of these hosts for the production of biobased chemicals and fuels. Promoters with strong activity during oil-accumulation stage are critical tools for metabolic engineering of these oleaginous yeasts. The upstream DNA sequences of 6 genes involved in lipid biosynthesis or accumulation in Rhodotorula toruloides were studied by luciferase reporter assay. The promoter of perilipin/lipid droplet protein 1 gene (LDP1) displayed much stronger activity (4-11 folds) than that of glyceraldehyde-3-phosphate dehydrogenase gene (GPD1), one of the strongest promoters known in yeasts. Depending on the stage of cultivation, promoter of acetyl-CoA carboxylase gene (ACC1) and fatty acid synthase β subunit gene (FAS1) exhibited intermediate strength, displaying 50-160 and 20-90% levels of GPD1 promoter, respectively. Interestingly, introns significantly modulated promoter strength at high frequency. The incorporation of intron 1 and 2 of LDP1 (LDP1in promoter) enhanced its promoter activity by 1.6-3.0 folds. Similarly, the strength of ACC1 promoter was enhanced by 1.5-3.2 folds if containing intron 1. The intron 1 sequences of ACL1 and FAS1 also played significant regulatory roles. When driven by the intronic promoters of ACC1 and LDP1 (ACC1in and LDP1in promoter, respectively), the reporter gene expression were up-regulated by nitrogen starvation, independent of de novo oil biosynthesis and accumulation. As a proof of principle, overexpression of the endogenous acyl-CoA-dependent diacylglycerol acyltransferase 1 gene (DGA1) by LDP1in promoter was significantly more efficient than GPD1 promoter in enhancing lipid accumulation. Intronic sequences play an important role in regulating gene expression in R. toruloides. Three intronic promoters, LDP1in, ACC1in and FAS1in, are excellent promoters for metabolic engineering in the oleaginous and carotenogenic yeast, R. toruloides.
Proudhon, D; Wei, J; Briat, J; Theil, E C
1996-03-01
Ferritin, a protein widespread in nature, concentrates iron approximately 10(11)-10(12)-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. In plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may exist to maintain a particular intron/exon pattern within ferritin genes. In the case of plants, where ferritin gene intron placement is unrelated to triplet codons or protein structure, and where ferritin is targeted to the plastid, the selection pressure on gene organization may relate to RNA function and plastid/nuclear signaling.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kalchman, M.; Lin, B.; Nasir, J.
1994-09-01
The mouse homologue of the Huntington disease gene (Hdh) has recently been cloned and mapped to a region of synteny with the human, on mouse chromosome 5. The two genes share a high degree of both coding (90% amino acid) and nucleotide (86.2%) identity. We have subsequently performed a detailed comparison of the genomic organization of the 5{prime} region of the two genes encompassing the promoter region and first five exons of both the human and mouse genes. The comparative sequence analysis of the promoter region between HD and Hdh reveals two highly conserved regions. One region (-56 to -118)more » (+1 is the ATG start codon), shared 84% nucleotide identity and another region (-130 to -206) had 81% nucleotide identity. Nine putative Sp1 sites appear in the human promoter region contrasted with only 3 in a similar region in the mouse. Furthermore, 17 and 20 base pair direct repeats present in the HD 5{prime} region are absent in the similar Hdh region. Although both the mouse and human intron/exon boundaries conform to the GT/AG rule, the intron sizes between HD and Hdh are markedly different. The first four introns in Hdh are 15, 7, 5 and 0.5 kb compared to sizes of 10, 15, 7 and 0.5 kb, respectively. Comparison between the mouse and human intronic sequences immediately adjacent to the first five exons (excluding exon 1) reveals only about 46 to 50% identity within the first 60 bp of intronic sequence. Furthermore, we have identified novel polymorphic di-, tri- and tetra-nucleotide repeats in Hdh introns of various mouse strains that are not present in the human. For example, polymorphic CT repeats are present in introns 2 and 4 of Hdh and a novel mouse 56 AAG trinucleotide repeat (interrupted by an AAGG) is also located within intron 2. This information concerning the promoter and genomic organization of both HD and Hdh is critical for designing appropriate gene targetting vectors for studying the normal function of the HD and Hdh genes in model systems.« less
Metazoan tRNA introns generate stable circular RNAs in vivo.
Lu, Zhipeng; Filonov, Grigory S; Noto, John J; Schmidt, Casey A; Hatkevich, Talia L; Wen, Ying; Jaffrey, Samie R; Matera, A Gregory
2015-09-01
We report the discovery of a class of abundant circular noncoding RNAs that are produced during metazoan tRNA splicing. These transcripts, termed tRNA intronic circular (tric)RNAs, are conserved features of animal transcriptomes. Biogenesis of tricRNAs requires anciently conserved tRNA sequence motifs and processing enzymes, and their expression is regulated in an age-dependent and tissue-specific manner. Furthermore, we exploited this biogenesis pathway to develop an in vivo expression system for generating "designer" circular RNAs in human cells. Reporter constructs expressing RNA aptamers such as Spinach and Broccoli can be used to follow the transcription and subcellular localization of tricRNAs in living cells. Owing to the superior stability of circular vs. linear RNA isoforms, this expression system has a wide range of potential applications, from basic research to pharmaceutical science. © 2015 Lu et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Ruíz, Teresa; De la Rosa, José M; Domínguez, Angel; Rodríguez, Luis
2003-05-01
In several organisms, including Saccharomyces cerevisiae and other yeast species, the product encoded by the SEC61 gene is considered to be the core element of the translocation apparatus within the endoplasmic reticulum membrane through which translocation of secretory and membrane proteins occurs. In this study, we have cloned and characterized the homolog of the SEC61 gene from the yeast Pichia anomala. The cloned gene includes an ORF, interrupted after the first ten nucleotides by an intron of 131 bp, encoding a 479-amino acid putative polypeptide exhibiting homology to the products encoded by different eukaryotic SEC61 genes, particularly to those from other yeast species. We show that the P. anomala SEC61 gene is correctly processed (intron splicing) when expressed in S. cerevisiae and that it is able to complement the thermosensitive phenotype associated with a mutation in the S. cerevisiae SEC61 gene.
Evolutionary History of Ascomyceteous Yeasts
DOE Office of Scientific and Technical Information (OSTI.GOV)
Haridas, Sajeet; Riley, Robert; Salamov, Asaf
2014-06-06
Yeasts are important for many industrial and biotechnological processes and show remarkable diversity despite morphological similarities. We have sequenced the genomes of 16 ascomycete yeasts of taxonomic and industrial importance including members of Saccharomycotina and Taphrinomycotina. A comparison of these with several other previously published yeast genomes have added increased confidence to the phylogenetic positions of previously poorly placed species including Saitoella complicata, Babjeviella inositovora and Metschnikowia bicuspidata. Phylogenetic analysis also showed that yeasts with alternative nuclear codon usage where CUG encodes serine instead of leucine are monophyletic within the Saccharomycotina. Most of the yeasts have compact genomes with amore » large fraction of single exon genes with Lipomyces starkeyi and the previously published Pneumocystis jirovecii being notable exceptions. Intron analysis suggests that early diverging species have more introns. We also observed a large number of unclassified lineage specific non-simple repeats in these genomes.« less
Intriguing Balancing Selection on the Intron 5 Region of LMBR1 in Human Population
He, Fang; Wu, Dong-Dong; Kong, Qing-Peng; Zhang, Ya-Ping
2008-01-01
Background The intron 5 of gene LMBR1 is the cis-acting regulatory module for the sonic hedgehog (SHH) gene. Mutation in this non-coding region is associated with preaxial polydactyly, and may play crucial roles in the evolution of limb and skeletal system. Methodology/Principal Findings We sequenced a region of the LMBR1 gene intron 5 in East Asian human population, and found a significant deviation of Tajima's D statistics from neutrality taking human population growth into account. Data from HapMap also demonstrated extended linkage disequilibrium in the region in East Asian and European population, and significantly low degree of genetic differentiation among human populations. Conclusion/Significance We proposed that the intron 5 of LMBR1 was presumably subject to balancing selection during the evolution of modern human. PMID:18698406
Hinske, Ludwig Christian; Galante, Pedro A. F.; Limbeck, Elisabeth; Möhnle, Patrick; Parmigiani, Raphael B.; Ohno-Machado, Lucila; Camargo, Anamaria A.; Kreth, Simone
2015-01-01
About half of the known miRNA genes are located within protein-coding host genes, and are thus subject to co-transcription. Accumulating data indicate that this coupling may be an intrinsic mechanism to directly regulate the host gene’s expression, constituting a negative feedback loop. Inevitably, the cell requires a yet largely unknown repertoire of methods to regulate this control mechanism. We propose APA as one possible mechanism by which negative feedback of intronic miRNA on their host genes might be regulated. Using in-silico analyses, we found that host genes that contain seed matching sites for their intronic miRNAs yield longer 32UTRs with more polyadenylation sites. Additionally, the distribution of polyadenylation signals differed significantly between these host genes and host genes of miRNAs that do not contain potential miRNA binding sites. We then transferred these in-silico results to a biological example and investigated the relationship between ZFR and its intronic miRNA miR-579 in a U87 cell line model. We found that ZFR is targeted by its intronic miRNA miR-579 and that alternative polyadenylation allows differential targeting. We additionally used bioinformatics analyses and RNA-Seq to evaluate a potential cross-talk between intronic miRNAs and alternative polyadenylation. CPSF2, a gene previously associated with alternative polyadenylation signal recognition, might be linked to intronic miRNA negative feedback by altering polyadenylation signal utilization. PMID:25799583
Hua, Yimin; Vickers, Timothy A.; Okunola, Hazeem L.; Bennett, C. Frank; Krainer, Adrian R.
2008-01-01
survival of motor neuron 2, centromeric (SMN2) is a gene that modifies the severity of spinal muscular atrophy (SMA), a motor-neuron disease that is the leading genetic cause of infant mortality. Increasing inclusion of SMN2 exon 7, which is predominantly skipped, holds promise to treat or possibly cure SMA; one practical strategy is the disruption of splicing silencers that impair exon 7 recognition. By using an antisense oligonucleotide (ASO)-tiling method, we systematically screened the proximal intronic regions flanking exon 7 and identified two intronic splicing silencers (ISSs): one in intron 6 and a recently described one in intron 7. We analyzed the intron 7 ISS by mutagenesis, coupled with splicing assays, RNA-affinity chromatography, and protein overexpression, and found two tandem hnRNP A1/A2 motifs within the ISS that are responsible for its inhibitory character. Mutations in these two motifs, or ASOs that block them, promote very efficient exon 7 inclusion. We screened 31 ASOs in this region and selected two optimal ones to test in human SMN2 transgenic mice. Both ASOs strongly increased hSMN2 exon 7 inclusion in the liver and kidney of the transgenic animals. Our results show that the high-resolution ASO-tiling approach can identify cis-elements that modulate splicing positively or negatively. Most importantly, our results highlight the therapeutic potential of some of these ASOs in the context of SMA. PMID:18371932
Sobczak, K; Kozłowski, P; Napierała, M; Czarny, J; Woźniak, M; Kapuścińska, M; Lośko, M; Koziczak, M; Jasińska, A; Powierska, J; Braczkowski, R; Breborowicz, J; Godlewski, D; Mackiewicz, A; Krzyzosiak, W
1997-10-09
Three different novel BRCA1 mutations, five independent cases of the same 12 bp insertion-duplication in intron-20 and two novel rare BRCA1 sequence variants were identified among 122 Polish women with positive, in most cases moderate family history of breast and/or ovarian cancer, 80 controls and 34 unselected breast cancer tissue specimens. All mutations and variants were germline. The 4153 delA frameshift mutation, the Tyr105Cys missense mutation and two cases of the alteration in intron-20 were found in the group of healthy women with positive family history. Two other cases of the intronic insertion were found in unselected controls. Their carriers had no family history of breast or ovarian cancer but other cancers occurred in their families. The 1782 Trp/STOP nonsense mutation and one case of the insertion in intron-20 were first found in tissue specimens of breast cancer patient and breast/ovarian cancer patient, respectively. Their carriers also had no family history of breast or ovarian cancer. The distribution of the insertion in intron-20 in analysed groups and results of RT-PCR experiments suggest a less prominent role for this variant considered earlier a splicing mutation. This study shows also, that more population-oriented research is needed, involving women with less profound or even no family history of breast and ovarian cancer, to better understand the role and significance of different BRCA1 variants and mutations.
Geng, Li-xia; Zheng, Rui; Ren, Jie; Niu, Zhi-tao; Sun, Yu-long; Xue, Qing-yun; Liu, Wei; Ding, Xiao-yu
2015-08-01
In this study, 17 kinds of Dendrobium species of Fengdous including 39 individuals were collected from 4 provinces. Mitochondrial gene sequences co I, nad 5, nad 1-intron 2 and chloroplast gene sequences rbcL, matK amd psbA-trnH were amplified from these materials, as well as nrDNA ITS. Furthermore, suitable sequences for identification of Dendrobium species of Fengdous were screened by K-2-P and P-distance. The results showed that during the mentioned 7 sequences, nrDNA ITS, nad 1-intron 2 and psbA-trnH which had a high degree of variability could be used to identify Dendrobium species of Fengdous. However, single fragment could not be used to distinguish D. moniliforme and D. huoshanense. Moreover, compared to other combined fragments, new type combined fragments nrDNA ITS+nad 1-intron 2 was more effective in identifying the original plants of Dendrobium species and could be used to identify D. huoshanense and D. moniliforme. Besides, according to the UPGMA tree constructed with nrDNA ITS+nad 1-intron 2, 3 inspected Dendrobium plants were identified as D. huoshanense, D. moniliforme and D. officinale, respectively. This study identified Dendrobium species of Fengdous by combined fragments nrDNA ITS+nad 1-intron 2 for the first time, which provided a more effective basis for identification of Dendrobium species. And this study will be helpful for regulating the market of Fengdous.
Loreni, F; Ruberti, I; Bozzoni, I; Pierandrei-Amaldi, P; Amaldi, F
1985-01-01
Ribosomal protein L1 is encoded by two genes in Xenopus laevis. The comparison of two cDNA sequences shows that the two L1 gene copies (L1a and L1b) have diverged in many silent sites and very few substitution sites; moreover a small duplication occurred at the very end of the coding region of the L1b gene which thus codes for a product five amino acids longer than that coded by L1a. Quantitatively the divergence between the two L1 genes confirms that a whole genome duplication took place in Xenopus laevis approximately 30 million years ago. A genomic fragment containing one of the two L1 gene copies (L1a), with its nine introns and flanking regions, has been completely sequenced. The 5' end of this gene has been mapped within a 20-pyridimine stretch as already found for other vertebrate ribosomal protein genes. Four of the nine introns have a 60-nucleotide sequence with 80% homology; within this region some boxes, one of which is 16 nucleotides long, are 100% homologous among the four introns. This feature of L1a gene introns is interesting since we have previously shown that the activity of this gene is regulated at a post-transcriptional level and it involves the block of the normal splicing of some intron sequences. Images Fig. 3. Fig. 5. PMID:3841512
Lynch syndrome: barriers to and facilitators of screening and disease management.
Watkins, Kathy E; Way, Christine Y; Fiander, Jacqueline J; Meadus, Robert J; Esplen, Mary Jane; Green, Jane S; Ludlow, Valerie C; Etchegary, Holly A; Parfrey, Patrick S
2011-09-07
Lynch syndrome is a hereditary cancer with confirmed carriers at high risk for colorectal (CRC) and extracolonic cancers. The purpose of the current study was to develop a greater understanding of the factors influencing decisions about disease management post-genetic testing. The study used a grounded theory approach to data collection and analysis as part of a multiphase project examining the psychosocial and behavioral impact of predictive DNA testing for Lynch syndrome. Individual and small group interviews were conducted with individuals from 10 families with the MSH2 intron 5 splice site mutation or exon 8 deletion. The data from confirmed carriers (n = 23) were subjected to re-analysis to identify key barriers to and/or facilitators of screening and disease management. Thematic analysis identified personal, health care provider and health care system factors as dominant barriers to and/or facilitators of managing Lynch syndrome. Person-centered factors reflect risk perceptions and decision-making, and enduring screening/disease management. The perceived knowledge and clinical management skills of health care providers also influenced participation in recommended protocols. The health care system barriers/facilitators are defined in terms of continuity of care and coordination of services among providers. Individuals with Lynch syndrome often encounter multiple barriers to and facilitators of disease management that go beyond the individual to the provider and health care system levels. The current organization and implementation of health care services are inadequate. A coordinated system of local services capable of providing integrated, efficient health care and follow-up, populated by providers with knowledge of hereditary cancer, is necessary to maintain optimal health.
Campos, W N; Massaro, J D; Martinelli, A L C; Halliwell, J A; Marsh, S G E; Mendes-Junior, C T; Donadi, E A
2017-10-01
The HFE molecule controls iron uptake from gut, and defects in the molecule have been associated with iron overload, particularly in hereditary hemochromatosis. The HFE gene including both coding and boundary intronic regions were sequenced in 304 Brazilian individuals, encompassing healthy individuals and patients exhibiting hereditary or acquired iron overload. Six sites of variation were detected: (1) H63D C>G in exon 2, (2) IVS2 (+4) T>C in intron 2, (3) a C>G transversion in intron 3, (4) C282Y G>A in exon 4, (5) IVS4 (-44) T>C in intron 4, and (6) a new guanine deletion (G>del) in intron 5, which were used for haplotype inference. Nine HFE alleles were detected and six of these were officially named on the basis of the HLA Nomenclature, defined by the World Health Organization (WHO) Nomenclature Committee for Factors of the HLA System, and published via the IPD-IMGT/HLA website. Four alleles, HFE*001, *002, *003, and *004 exhibited variation within their exon sequences. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Suh, E R; Waring, R B
1990-01-01
It has been proposed that recognition of the 3' splice site in many group I introns involves base pairing between the start of the 3' exon and a region of the intron known as the internal guide sequence (R. W. Davies, R. B. Waring, J. Ray, T. A. Brown, and C. Scazzocchio, Nature [London] 300:719-724, 1982). We have examined this hypothesis, using the self-splicing rRNA intron from Tetrahymena thermophila. Mutations in the 3' exon that weaken this proposed pairing increased use of a downstream cryptic 3' splice site. Compensatory mutations in the guide sequence that restore this pairing resulted in even stronger selection of the normal 3' splice site. These changes in 3' splice site usage were more pronounced in the background of a mutation (414A) which resulted in an adenine instead of a guanine being the last base of the intron. These results show that the proposed pairing (P10) plays an important role in ensuring that cryptic 3' splice sites are selected against. Surprisingly, the 414A mutation alone did not result in activation of the cryptic 3' splice site. Images PMID:2342465
Schultz, Kris Ann; Harris, Anne; Messinger, Yoav; Sencer, Susan; Baldinger, Shari; Dehner, Louis P.; Hill, D. Ashley
2015-01-01
Germline DICER1 mutations have been described in individuals with pleuropulmonary blastoma (PPB), ovarian Sertoli-Leydig cell tumor (SLCT), sarcomas, multinodular goiter, thyroid carcinoma, cystic nephroma and other neoplastic conditions. Early results from the International Ovarian and Testicular Stromal Tumor Registry show germline DICER1 mutations in 48% of girls and women with SLCT. In this report, a young woman presented with ovarian undifferentiated sarcoma. Four years later, she presented with SLCT. She was successfully treated for both malignancies. Sequence results showed a germline intronic mutation in DICER1. This mutation results in an exact duplication of the six bases at the splice site at the intron 23 and exon 24 junction. Predicted improper splicing leads to inclusion of 10 bases of intronic sequence, frameshift and premature truncation of the protein disrupting the RNase IIIb domain. A second individual with SLCT was found to have an identical germline mutation. In each of the ovarian tumors, an additional somatic mutation in the RNase IIIb domain of DICER1 was found. In rare patients, germline intronic mutations in DICER1 that are predicted to cause incorrect splicing can also contribute to the pathogenesis of SLCT. PMID:26289771
The intron 1 of HPV 16 has a suboptimal branch point at a guanosine.
De la Rosa-Rios, Marco Antonio; Martínez-Salazar, Martha; Martínez-Garcia, Martha; González-Bonilla, César; Villegas-Sepúlveda, Nicolás
2006-06-01
The branch point sequence (BPS) of intron 1 of the HPV-16 was determined via RT-PCR in a cell free system, using lariat intermediates obtained by in vitro splicing reactions. We used synthetic E6/E7 transcripts and HeLa nuclear protein extracts to obtain the splicing intermediates. Then, a divergent oligonucleotide primer set, pairing on the lariat RNA that encompassed the 2'-5' phosphodiester bond formed between the 5' end of the intron and the BPS, was used for cDNA synthesis and PCR amplification. Subsequent RT-PCR assays revealed four splicing intermediates, made up of a major intermediary corresponding to the BPS and four cryptic branched sequences. Only intermediates bound at the 5' end of the intron are probably the authentic branch point sequence, and all of them branch at guanosine 328 instead of the typical adenosine. Unusually, the BPS of intron 1 of HPV-16 is a suboptimal sequence (AGUGAGU) that differs from the eukaryotic consensus BPS, which correlates with the splicing profile observed for early transcripts of HPV-16 in tumors and tumor derived cell lines. The implications of this unusual branch point sequence for splicing of the HPV-16 pre-mRNA are discussed.
Doran, Mark; du Plessis, Daniel G; Ghadiali, Eric J; Mann, David M A; Pickering-Brown, Stuart; Larner, Andrew J
2007-10-01
Frontotemporal dementia with parkinsonism linked to chromosome 17 (FTDP-17) owing to the tau intron 10 + 16 mutation usually occurs with a prototypical frontotemporal dementia phenotype with prominent disinhibition and affective disturbances. To report a new FTDP-17 pedigree with the tau intron 10 + 16 mutation demonstrating a clinical phenotype suggestive of Alzheimer disease. Case reports. Regional neuroscience centers in northwest England. Patients We examined 4 members of a kindred in which 8 individuals were affected in 3 generations. All 4 patients reported memory difficulty. Marked anomia was also present, but behavioral disturbances were conspicuously absent in the early stages of disease. All patients had an initial clinical diagnosis of Alzheimer disease. No mutations were found in the presenilin or amyloid precursor protein genes. Pathologic examination of the proband showed features typical of FTDP-17, and tau gene analysis showed the intron 10 + 16 mutation. This pedigree illustrates the phenotypic variability of tau intron 10 + 16 mutations. In pedigrees with a clinical diagnosis of Alzheimer disease but without presenilin or amyloid precursor protein gene mutations, tau gene mutations may be found.
Bacterial Group II Introns: Identification and Mobility Assay.
Toro, Nicolás; Molina-Sánchez, María Dolores; Nisa-Martínez, Rafael; Martínez-Abarca, Francisco; García-Rodríguez, Fernando Manuel
2016-01-01
Group II introns are large catalytic RNAs and mobile retroelements that encode a reverse transcriptase. Here, we provide methods for their identification in bacterial genomes and further analysis of their splicing and mobility capacities.
Oguri, Emiko; Yamaguchi, Tomio; Tsubota, Hiromi; Deguchi, Hironori; Murakami, Noriaki
2013-01-01
Leucobryum boninense is endemic to the Bonin Islands, Japan, and its related species are widely distributed in Asia and the Pacific. We aimed to clarify the phylogenetic relationships among Leucobryum species and infer the origin of L. boninense. We also describe the utility of the chloroplast trnK intron including matK for resolving the phylogenetic relationships among Leucobryum species, as phylogenetic analyses using trnK intron and/or matK have not been performed well in bryophytes to date. Fifty samples containing 15 species of Leucobryum from Asia and the Pacific were examined for six chloroplast DNA regions including rbcL, rps4, partial 5′ trnK intron, matK, partial 3′ trnK intron, and trnL-F intergenic spacer plus one nuclear DNA region including ITS. A molecular phylogenetic tree showed that L. boninense made a clade with L. scabrum from Japan, Taiwan and, Hong Kong; L. javense which is widely distributed in East and Southeast Asia, and L. pachyphyllum and L. seemannii restricted to the Hawaii Islands, as well as with L. scaberulum from the Ryukyus, Japan, Taiwan, and southeastern China. Leucobryum boninense from various islands of the Bonin Islands made a monophylic group that was closely related to L. scabrum and L. javense from Japan. Therefore, L. boninense may have evolved from L. scabrum from Japan, Taiwan, or Hong Kong, or L. javense from Japan. We also described the utility of trnK intron including matK. A percentage of the parsimony-informative characters in trnK intron sequence data (5.8%) was significantly higher than that from other chloroplast regions, rbcL (2.4%) and rps4 (3.2%) sequence data. Nucleotide sequence data of the trnK intron including matK are more informative than other chloroplast DNA regions for identifying the phylogenetic relationships among Leucobryum species. PMID:23610621
Oguri, Emiko; Yamaguchi, Tomio; Tsubota, Hiromi; Deguchi, Hironori; Murakami, Noriaki
2013-04-01
Leucobryum boninense is endemic to the Bonin Islands, Japan, and its related species are widely distributed in Asia and the Pacific. We aimed to clarify the phylogenetic relationships among Leucobryum species and infer the origin of L. boninense. We also describe the utility of the chloroplast trnK intron including matK for resolving the phylogenetic relationships among Leucobryum species, as phylogenetic analyses using trnK intron and/or matK have not been performed well in bryophytes to date. Fifty samples containing 15 species of Leucobryum from Asia and the Pacific were examined for six chloroplast DNA regions including rbcL, rps4, partial 5' trnK intron, matK, partial 3' trnK intron, and trnL-F intergenic spacer plus one nuclear DNA region including ITS. A molecular phylogenetic tree showed that L. boninense made a clade with L. scabrum from Japan, Taiwan and, Hong Kong; L. javense which is widely distributed in East and Southeast Asia, and L. pachyphyllum and L. seemannii restricted to the Hawaii Islands, as well as with L. scaberulum from the Ryukyus, Japan, Taiwan, and southeastern China. Leucobryum boninense from various islands of the Bonin Islands made a monophylic group that was closely related to L. scabrum and L. javense from Japan. Therefore, L. boninense may have evolved from L. scabrum from Japan, Taiwan, or Hong Kong, or L. javense from Japan. We also described the utility of trnK intron including matK. A percentage of the parsimony-informative characters in trnK intron sequence data (5.8%) was significantly higher than that from other chloroplast regions, rbcL (2.4%) and rps4 (3.2%) sequence data. Nucleotide sequence data of the trnK intron including matK are more informative than other chloroplast DNA regions for identifying the phylogenetic relationships among Leucobryum species.
Algama, Manjula; Tasker, Edward; Williams, Caitlin; Parslow, Adam C; Bryson-Richardson, Robert J; Keith, Jonathan M
2017-03-27
Computational identification of non-coding RNAs (ncRNAs) is a challenging problem. We describe a genome-wide analysis using Bayesian segmentation to identify intronic elements highly conserved between three evolutionarily distant vertebrate species: human, mouse and zebrafish. We investigate the extent to which these elements include ncRNAs (or conserved domains of ncRNAs) and regulatory sequences. We identified 655 deeply conserved intronic sequences in a genome-wide analysis. We also performed a pathway-focussed analysis on genes involved in muscle development, detecting 27 intronic elements, of which 22 were not detected in the genome-wide analysis. At least 87% of the genome-wide and 70% of the pathway-focussed elements have existing annotations indicative of conserved RNA secondary structure. The expression of 26 of the pathway-focused elements was examined using RT-PCR, providing confirmation that they include expressed ncRNAs. Consistent with previous studies, these elements are significantly over-represented in the introns of transcription factors. This study demonstrates a novel, highly effective, Bayesian approach to identifying conserved non-coding sequences. Our results complement previous findings that these sequences are enriched in transcription factors. However, in contrast to previous studies which suggest the majority of conserved sequences are regulatory factor binding sites, the majority of conserved sequences identified using our approach contain evidence of conserved RNA secondary structures, and our laboratory results suggest most are expressed. Functional roles at DNA and RNA levels are not mutually exclusive, and many of our elements possess evidence of both. Moreover, ncRNAs play roles in transcriptional and post-transcriptional regulation, and this may contribute to the over-representation of these elements in introns of transcription factors. We attribute the higher sensitivity of the pathway-focussed analysis compared to the genome-wide analysis to improved alignment quality, suggesting that enhanced genomic alignments may reveal many more conserved intronic sequences.
Jansen, Robert K.; Wojciechowski, Martin F.; Sanniyasi, Elumalai; Lee, Seung-Bum; Daniell, Henry
2008-01-01
Chickpea (Cicer arietinum, Leguminosae), an important grain legume, is widely used for food and fodder throughout the world. We sequenced the complete plastid genome of chickpea, which is 125,319 bp in size, and contains only one copy of the inverted repeat (IR). The genome encodes 108 genes, including 4 rRNAs, 29 tRNAs, and 75 proteins. The genes rps16, infA, and ycf4 are absent in the chickpea plastid genome, and ndhB has an internal stop codon in the 5′exon, similar to other legumes. Two genes have lost their introns, one in the 3′exon of the transpliced gene rps12, and the one between exons 1 and 2 of clpP; this represents the first documented case of the loss of introns from both of these genes in the same plastid genome. An extensive phylogenetic survey of these intron losses was performed on 302 taxa across legumes and the related family Polygalaceae. The clpP intron has been lost exclusively in taxa from the temperate “IR-lacking clade” (IRLC), whereas the rps12 intron has been lost in most members of the IRLC (with the exception of Wisteria, Callerya, Afgekia, and certain species of Millettia, which represent the earliest diverging lineages of this clade), and in the tribe Desmodieae, which is closely related to the tribes Phaseoleae and Psoraleeae. Data provided here suggest that the loss of the rps12 intron occurred after the loss of the IR. The two new genomic changes identified in the present study provide additional support of the monophyly of the IR-loss clade, and resolution of the pattern of the earliest-branching lineages in this clade. The availability of the complete chickpea plastid genome sequence also provides valuable information on intergenic spacer regions among legumes and endogenous regulatory sequences for plastid genetic engineering. PMID:18638561
Reduced DNA methylation of FKBP5 in Cushing's syndrome.
Resmini, Eugenia; Santos, Alicia; Aulinas, Anna; Webb, Susan M; Vives-Gilabert, Yolanda; Cox, Olivia; Wand, Gary; Lee, Richard S
2016-12-01
FKBP5 encodes a co-chaperone of HSP90 protein that regulates intracellular glucocorticoid receptor sensitivity. When it is bound to the glucocorticoid receptor complex, cortisol binds with lower affinity to glucocorticoid receptor. Cushing's syndrome is associated with memory deficits, smaller hippocampal volumes, and wide range of cognitive impairments. We aimed at evaluating blood DNA methylation of FKBP5 and its relationship with memory and hippocampal volumes in Cushing's syndrome patients. Polymorphism rs1360780 in FKBP5 has also been assessed to determine whether genetic variations can also govern CpG methylation. Thirty-two Cushing's syndrome patients and 32 matched controls underwent memory tests, 3-Tesla MRI of the brain, and DNA extraction from total leukocytes. DNA samples were bisulfite treated, PCR amplified, and pyrosequenced to assess a total of 41CpG-dinucleotides in the introns 1, 2, 5, and 7 of FKBP5. Significantly lower intronic FKBP5 DNA methylation in CS patients compared to controls was observed in ten CpG-dinucleotides. DNA methylation at these CpGs correlated with left and right HV (Intron-2-Region-2-CpG-3: LHV, r = 0.73, p = 0.02; RHV, r = 0.58, p = 0.03). Cured and active CS patients showed both lower methylation of intron 2 (92.37, 91.8, and 93.34 %, respectively, p = 0.03 for both) and of intron 7 (77.08, 73.74, and 79.71 %, respectively, p = 0.02 and p < 0.01) than controls. Twenty-two subjects had the CC genotype, 34 had the TC genotype, and eight had the TT genotype. Lower average DNA methylation in intron 7 was observed in the TT subjects compared to CC (72.5vs. 79.5 %, p = 0.02) and to TC (72.5 vs. 79.0 %, p = 0.03). Our data demonstrate, for the first time, a reduction of intronic DNA methylation of FKBP5 in CS patients.
Ma, Long; Tan, Zhiping; Teng, Yanling; Hoersch, Sebastian; Horvitz, H. Robert
2011-01-01
The in vivo analysis of the roles of splicing factors in regulating alternative splicing in animals remains a challenge. Using a microarray-based screen, we identified a Caenorhabditis elegans gene, tos-1, that exhibited three of the four major types of alternative splicing: intron retention, exon skipping, and, in the presence of U2AF large subunit mutations, the use of alternative 3′ splice sites. Mutations in the splicing factors U2AF large subunit and SF1/BBP altered the splicing of tos-1. 3′ splice sites of the retained intron or before the skipped exon regulate the splicing pattern of tos-1. Our study provides in vivo evidence that intron retention and exon skipping can be regulated largely by the identities of 3′ splice sites. PMID:22033331
Permanent Neonatal Diabetes Caused by Creation of an Ectopic Splice Site within the INS Gene
Gastaldo, Elena; Harries, Lorna W.; Rubio-Cabezas, Oscar; Castaño, Luis
2012-01-01
Background The aim of this study was to characterize the genetic etiology in a patient who presented with permanent neonatal diabetes at 2 months of age. Methodology/Principal Findings Regulatory elements and coding exons 2 and 3 of the INS gene were amplified and sequenced from genomic and complementary DNA samples. A novel heterozygous INS mutation within the terminal intron of the gene was identified in the proband and her affected father. This mutation introduces an ectopic splice site leading to the insertion of 29 nucleotides from the intronic sequence into the mature mRNA, which results in a longer and abnormal transcript. Conclusions/Significance This study highlights the importance of routinely sequencing the exon-intron boundaries and the need to carry out additional studies to confirm the pathogenicity of any identified intronic genetic variants. PMID:22235272
Splicing stimulates siRNA formation at Drosophila DNA double-strand breaks
Merk, Karin; Breinig, Marco; Böttcher, Romy; Krebs, Stefan; Blum, Helmut; Boutros, Michael
2017-01-01
DNA double-strand breaks trigger the production of locus-derived siRNAs in fruit flies, human cells and plants. At least in flies, their biogenesis depends on active transcription running towards the break. Since siRNAs derive from a double-stranded RNA precursor, a major question is how broken DNA ends can generate matching sense and antisense transcripts. We performed a genome-wide RNAi-screen in cultured Drosophila cells, which revealed that in addition to DNA repair factors, many spliceosome components are required for efficient siRNA generation. We validated this observation through site-specific DNA cleavage with CRISPR-cas9 followed by deep sequencing of small RNAs. DNA breaks in intron-less genes or upstream of a gene’s first intron did not efficiently trigger siRNA production. When DNA double-strand breaks were induced downstream of an intron, however, this led to robust siRNA generation. Furthermore, a downstream break slowed down splicing of the upstream intron and a detailed analysis of siRNA coverage at the targeted locus revealed that unspliced pre-mRNA contributes the sense strand to the siRNA precursor. Since splicing factors are stimulating the response but unspliced transcripts are entering the siRNA biogenesis, the spliceosome is apparently stalled in a pre-catalytic state and serves as a signaling hub. We conclude that convergent transcription at DNA breaks is stimulated by a splicing dependent control process. The resulting double-stranded RNA is converted into siRNAs that instruct the degradation of cognate mRNAs. In addition to a potential role in DNA repair, the break-induced transcription may thus be a means to cull improper RNAs from the transcriptome of Drosophila melanogaster. Since the splicing factors identified in our screen also stimulated siRNA production from high copy transgenes, it is possible that this surveillance mechanism serves in genome defense beyond DNA double-strand breaks. PMID:28628606
Feltus, F A; Singh, H P; Lohithaswa, H C; Schulze, S R; Silva, T D; Paterson, A H
2006-04-01
Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species.
Feltus, F.A.; Singh, H.P.; Lohithaswa, H.C.; Schulze, S.R.; Silva, T.D.; Paterson, A.H.
2006-01-01
Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species. PMID:16607031
Diverse Forms of RPS9 Splicing Are Part of an Evolving Autoregulatory Circuit
Plocik, Alex M.; Guthrie, Christine
2012-01-01
Ribosomal proteins are essential to life. While the functions of ribosomal protein-encoding genes (RPGs) are highly conserved, the evolution of their regulatory mechanisms is remarkably dynamic. In Saccharomyces cerevisiae, RPGs are unusual in that they are commonly present as two highly similar gene copies and in that they are over-represented among intron-containing genes. To investigate the role of introns in the regulation of RPG expression, we constructed 16 S. cerevisiae strains with precise deletions of RPG introns. We found that several yeast introns function to repress rather than to increase steady-state mRNA levels. Among these, the RPS9A and RPS9B introns were required for cross-regulation of the two paralogous gene copies, which is consistent with the duplication of an autoregulatory circuit. To test for similar intron function in animals, we performed an experimental test and comparative analyses for autoregulation among distantly related animal RPS9 orthologs. Overexpression of an exogenous RpS9 copy in Drosophila melanogaster S2 cells induced alternative splicing and degradation of the endogenous copy by nonsense-mediated decay (NMD). Also, analysis of expressed sequence tag data from distantly related animals, including Homo sapiens and Ciona intestinalis, revealed diverse alternatively-spliced RPS9 isoforms predicted to elicit NMD. We propose that multiple forms of splicing regulation among RPS9 orthologs from various eukaryotes operate analogously to translational repression of the alpha operon by S4, the distant prokaryotic ortholog. Thus, RPS9 orthologs appear to have independently evolved variations on a fundamental autoregulatory circuit. PMID:22479208
Umeda, Tomohiro; Yamashita, Takenari; Kimura, Tetsuya; Ohnishi, Kiyouhisa; Takuma, Hiroshi; Ozeki, Tomoko; Takashima, Akihiko; Tomiyama, Takami; Mori, Hiroshi
2013-07-01
Frontotemporal dementia and parkinsonism linked to chromosome 17 (FTDP-17) is a neurodegenerative disorder caused by mutations in the tau gene. Many mutations identified in FTDP-17 have been shown to affect tau exon 10 splicing in vitro, which presumably causes pathologic imbalances in exon 10(-) [3-repeat (3R)] and exon 10(+) [4-repeat (4R)] tau expression and leads to intracellular inclusions of hyperphosphorylated tau in patient brains. However, no reports have investigated this theory using model mice with a tau intronic mutation. Herein, we generated new transgenic mice harboring the tau intron 10 +16C → T mutation. We prepared a transgene construct containing intronic sequences required for exon 10 splicing in the longest tau isoform cDNA. Although mice bearing the construct without the intronic mutation showed normal developmental changes of the tau isoform from 3R tau to equal amounts of 3R and 4R tau, mice with the mutation showed much higher levels of 4R tau at the adult stage. 4R tau was selectively recovered in insoluble brain fractions in their old age. Furthermore, these mice displayed abnormal tau phosphorylation, synapse loss and dysfunction, memory impairment, glial activation, tangle formation, and neuronal loss in an age-dependent manner. These findings provide the first evidence in a mouse model that a tau intronic mutation-induced imbalance of 3R and 4R tau could be a cause of tauopathy. Copyright © 2013 American Society for Investigative Pathology. Published by Elsevier Inc. All rights reserved.
Gallagher, Carla J; Keene, Keith L; Mychaleckyj, Josyf C; Langefeld, Carl D; Hirschhorn, Joel N; Henderson, Brian E; Gordon, Candace J; Freedman, Barry I; Rich, Stephen S; Bowden, Donald W; Sale, Michèle M
2007-03-01
The estrogen receptor-alpha gene (ESR1) was selected as a positional candidate under a type 2 diabetes linkage peak at 6q24-27. A total of 42 ESR1 single nucleotide polymorphisms (SNPs) were genotyped in 380 African-American type 2 diabetic case subjects with end-stage renal disease (ESRD) and 276 African-American control subjects. A total of 22 ancestry informative markers were also genotyped, and the program Admixmap was used to adjust allelic and haplotypic association tests for individual estimates of admixture. The most significant association with type 2 diabetes-ESRD was with rs1033182 in intron 2 (P = 0.013, admixture-adjusted P(a) = 0.021). Genotyping 17 SNPs across a region of ESR1 intron 1-intron 2 in an expanded population of 851 case and 635 control subjects supported association with rs1033182 (P = 0.004, P(a) = 0.027) and with an independent six-SNP haplotype of high linkage disequilibrium spanning 6.4 kb (P < 0.0001, P(a) < 0.0001). The same 17 ESR1 SNPs were genotyped in 300 European-American type 2 diabetes-ESRD case subjects and 310 European-American control subjects. Two intron 2 SNPs, rs2431260 (P = 0.015) and rs1709183 (P = 0.019), and a four-SNP haplotype containing these SNPs (P = 0.033) were associated with type 2 diabetes and/or ESRD. Results suggest that intron 1 and intron 2 of the ESR1 gene may contain functionally important regions related to type 2 diabetes or ESRD risk.
Leak, T. S.; Perlegas, P.S.; Smith, S.G.; Keene, K.L.; Hicks, P.J.; Langefeld, C.D.; Mychaleckyj, J.C.; Rich, S.S.; Kirk, J.K.; Freedman, B.I.; Bowden, D.W.; Sale, M.M.
2009-01-01
Variants in the engulfment and cell motility 1 (ELMO1) gene are associated with nephropathy due to type 2 diabetes mellitus (T2DM) in a Japanese cohort. We comprehensively evaluated this gene in African American (AA) T2DM patients with end-stage renal disease (ESRD). Three hundred nine HapMap tagging SNPs and 9 reportedly associated SNPs were genotyped in 577 AA T2DM-ESRD patients and 596 AA non-diabetic controls, plus 43 non-diabetic European American controls and 45 Yoruba Nigerian samples for admixture adjustment. Replication analyses were conducted in 558 AAs with T2DM-ESRD and 564 controls without diabetes. Extension analyses included 328 AA with T2DM lacking nephropathy and 326 with non-diabetic ESRD. The original and replication analyses confirmed association with four SNPs in intron 13 (permutation p-values for combined analyses = 0.001-0.003), one in intron 1 (P=0.004) and one in intron 5 (P=0.002) with T2DM-associated ESRD. In a subsequent combined analysis of all 1,135 T2DM-ESRD cases and 1,160 controls, an additional 7 intron 13 SNPs produced evidence of association (P = 3.5×10-5 – P=0.05). No associations were seen with these SNPs in those with T2DM lacking nephropathy or with ESRD due to non-diabetic causes. Variants in intron 13 of the ELMO1 gene appear to confer risk for diabetic nephropathy in AA. PMID:19183347
Petersen, Jessica L; Valberg, Stephanie J; Mickelson, James R; McCue, Molly E
2014-01-01
Summary Two variants in the equine myostatin gene (MSTN), including a T/C SNP substitution in the first intron and a 227-bp SINE insertion in the promoter, are associated with muscle fiber type proportions in the Quarter Horse (QH) and with the prediction of race distance propensity in the Thoroughbred (TB). Genotypes from these loci, along with 18 additional variants surrounding MSTN, were examined in 301 horses of 14 breeds to evaluate haplotype relationships and diversity. The C allele of intron 1 was found in 12 of 14 breeds at a frequency of 0.27; the SINE was observed in five breeds, but common in only the TB and QH (0.73 and 0.48 respectively). Haplotype data suggest the SINE insertion is contemporary to and arose upon a haplotype containing the intron 1 C allele. Gluteal muscle biopsies of TBs showed a significant association of the intron 1 C allele and SINE with a higher proportion of Type 2B and lower proportion of Type 1 fibers. However, in the Belgian horse, in which the SINE is not present, the intron 1 SNP was not associated with fiber type proportions, and evaluation of fiber type proportions across the Belgian, TB and QH breeds shows the significant effect of breed on fiber type proportions is negated when evaluating horses without the SINE variant. These data suggest the SINE, rather than the intron 1 SNP, is driving the observed muscle fiber type characteristics and is the variant targeted by selection for short-distance racing. PMID:25160752
[Identifying and sequence analysis of HLA-B*2736].
Li, Zhen; Zou, Hong-Yan; Shao, Chao-Peng; Tang, Si; Wang, Da-Ming; Cheng, Liang-Hong
2007-11-01
An unknown HLA-B allele which was similar to HLA-B*270401 was detected by FLOW-SSOPCR-SSP and heterozygous sequence-based typing (SBT) in Chinese Han individual. Its anomalous patterns suggested the possible presence of new allele. Amplifying exon 2-5(include intron 2-4) of the HLA-B*27 allele separately by using allele-specific primers and sequencing in both directions. Identifying the difference between the novel B*27 allele and B*270401. The sequence of novel B*27 from exon 2 to partial exon 5 is 1 815 bp. There are 10 nt changes from B*270401 in exon 3-4, at nt634where A-->C(codon130 AGC-->CGC, 130 S-->R); nt670 where A-->T (codon142 ACC-->TCC, 142 T-->S); nt683 where G-->T (codon146 TGG-->TTG, 146 W-->L); nt698 where A-->T (codon151 GAG-->GTG, 151 E-->V); nt774 where G-->C (codon176 GAG-->GAC, 176 E-->D); nt776 where C-->A (codon177 ACG-->AAG, 177 T-->K); nt781 where C-->G (codon179 CAG-->GAG, 179Q-->E); nt789 where G-->T (codon181 GCG-->GCT) resulting no coding change; nt1438 where C-->T (codon206 GGC-->GGT) resulting no coding change; nt1449 where G-->C (codon210 GGG-->GCG, 210G-->A). In IMGT/HLA database, only three alleles (B*270502/2706/2732) have sequences of introns. The same sequence in intron 2 showed homology between the novel HLA-B*27 allele and B*2706, but their homology could not be supported in intron 3-4. Comparing the sequence of the novel B*27 allele in intron 3 and 4 with B*27 group, it showed there are three mutations at nt106 C-->G, nt179 G-->A, nt536 G-->A and one deletion at nt168 in intron 3 and one mutations at nt82 T-->C in intron 4, but the sequence of the novel B*27 allele in intron 3 and 4 was all the same to B*070201. The sequence was submitted to Gen-Bank and the accession number was DQ915176. The allele has been confirmed as an extension of B*2736 by the WHO Nomenclature committee in November 2006.
Carter, James R; Keith, James H; Fraser, Tresa S; Dawson, James L; Kucharski, Cheryl A; Horne, Kate M; Higgs, Stephen; Fraser, Malcolm J
2014-06-13
Approximately 100 million confirmed infections and 20,000 deaths are caused by Dengue virus (DENV) outbreaks annually. Global warming and rapid dispersal have resulted in DENV epidemics in formally non-endemic regions. Currently no consistently effective preventive measures for DENV exist, prompting development of transgenic and paratransgenic vector control approaches. Production of transgenic mosquitoes refractory for virus infection and/or transmission is contingent upon defining antiviral genes that have low probability for allowing escape mutations, and are equally effective against multiple serotypes. Previously we demonstrated the effectiveness of an anti-viral group I intron targeting U143 of the DENV genome in mediating trans-splicing and expression of a marker gene with the capsid coding domain. In this report we examine the effectiveness of coupling expression of ΔN Bax to trans-splicing U143 intron activity as a means of suppressing DENV infection of mosquito cells. Targeting the conserved DENV circularization sequence (CS) by U143 intron trans-splicing activity appends a 3' exon RNA encoding ΔN Bax to the capsid coding region of the genomic RNA, resulting in a chimeric protein that induces premature cell death upon infection. TCID50-IFA analyses demonstrate an enhancement of DENV suppression for all DENV serotypes tested over the identical group I intron coupled with the non-apoptotic inducing firefly luciferase as the 3' exon. These cumulative results confirm the increased effectiveness of this αDENV-U143-ΔN Bax group I intron as a sequence specific antiviral that should be useful for suppression of DENV in transgenic mosquitoes. Annexin V staining, caspase 3 assays, and DNA ladder observations confirm DCA-ΔN Bax fusion protein expression induces apoptotic cell death. This report confirms the relative effectiveness of an anti-DENV group I intron coupled to an apoptosis-inducing ΔN Bax 3' exon that trans-splices conserved sequences of the 5' CS region of all DENV serotypes and induces apoptotic cell death upon infection. Our results confirm coupling the targeted ribozyme capabilities of the group I intron with the generation of an apoptosis-inducing transcript increases the effectiveness of infection suppression, improving the prospects of this unique approach as a means of inducing transgenic refractoriness in mosquitoes for all serotypes of this important disease.
Role of interleukin-15 receptor alpha polymorphisms in normal weight obese syndrome.
Di Renzo, L; Gloria-Bottini, F; Saccucci, P; Bigioni, M; Abenavoli, L; Gasbarrini, G; De Lorenzo, A
2009-01-01
Previous published studies have identified a class of women, Normal Weight Obese women (NWO) with normal BMI and high fat content. An important role of Interleukin-15 (IL-15) has been documented in facilitating muscle proliferation and promoting fat depletion. Indeed the presence of three types of IL-15 receptor subunits in fat tissue suggests a direct effect on adipose tissue. We studied three single nucleotide polymorphisms (SNP) of IL-15R-alpha receptor gene and investigated their relationship with NWO phenotype. We considered two classes of women according to their BMI and percent fat mass (percent FAT), class 1: including 72 overweight-obese women (high BMI-high fat mass) and class 2: including 36 NWO (normal BMI, high fat mass). Three sites of Interleukin-15 receptor subunit á gene were examined, located respectively in exon4, exon5 intron-exon border and exon7. Genotyping of the identified polymorphisms was performed by restriction fragment length polymorphism. Haplotype frequency estimation was performed by using the Mendel-University of Chicago program. Odds ratio analyses were calculated by EPISTAT program. Highly significant differences were observed for exon 7- exon5 intron-exon border and exon 4-exon 7 haplotype distribution between class 1 and class 2 women. These results strongly support the hypothesis that genetic variability of the IL-15 receptor has an important role in body fat composition. Our data underscore previous findings that suggest a potential role of IL-15 cytokine in NWO syndrome.
Coetzer, Willem G.; Downs, Colleen T.; Perrin, Mike R.; Willows-Munro, Sandi
2015-01-01
The taxonomic position of the Cape Parrot (Poicephalus robustus robustus) has been the focus of much debate. A number of authors suggest that the Cape Parrot should be viewed as a distinct species separate from the other two P. robustus subspecies (P. r. fuscicollis and P. r. suahelicus). These recommendations were based on morphological, ecological, and behavioural assessments. In this study we investigated the validity of these recommendations using multilocus DNA analyses. We genotyped 138 specimens from five Poicephalus species (P. cryptoxanthus, P. gulielmi, P. meyeri, P. robustus, and P. rueppellii) using 11 microsatellite loci. Additionally, two mitochondrial (cytochrome oxidase I gene and 16S ribosomal RNA) and one nuclear intron (intron 7 of the β-fibrinogen gene) markers were amplified and sequenced. Bayesian clustering analysis and pairwise FST analysis of microsatellite data identified P. r. robustus as genetically distinct from the other P. robustus subspecies. Phylogenetic and molecular clock analyses on sequence data also supported the microsatellite analyses, placing P. r. robustus in a distinct clade separate from the other P. robustus subspecies. Molecular clock analysis places the most recent common ancestor between P. r. robustus and P. r. fuscicollis / P. r. suahelicus at 2.13 to 2.67 million years ago. Our results all support previous recommendations to elevate the Cape Parrot to species level. This will facilitate better planning and implementation of international and local conservation management strategies for the Cape Parrot. PMID:26267261
Bian, Yang; Masuda, Akio; Matsuura, Tohru; Ito, Mikako; Okushin, Kazuya; Engel, Andrew G.; Ohno, Kinji
2009-01-01
We recently reported that the intronic splice-site mutation IVS3-8G>A of CHRNA1 that encodes the muscle nicotinic acetylcholine receptor α subunit disrupts binding of a splicing repressor, hnRNP H. This, in turn, results in exclusive inclusion of the downstream exon P3A. The P3A(+) transcript encodes a non-functional α subunit that comprises 50% of the transcripts in normal human skeletal muscle, but its functional significance remains undetermined. In an effort to search for a potential therapy, we screened off-label effects of 960 bioactive chemical compounds and found that tannic acid ameliorates the aberrant splicing due to IVS3-8G>A but without altering the expression of hnRNP H. Therefore, we searched for another splicing trans-factor. We found that the polypyrimidine tract binding protein (PTB) binds close to the 3′ end of CHRNA1 intron 3, that PTB induces skipping of exon P3A and that tannic acid increases the expression of PTB in a dose-dependent manner. Deletion assays of the PTB promoter region revealed that the tannic acid-responsive element is between positions −232 and −74 from the translation initiation site. These observations open the door to the discovery of novel therapies based on PTB overexpression and to detecting possible untoward effects of the overexpression. PMID:19147685
Chen, Huizhong; Li, Xin-Liang; Xu, Haiyan; Ljungdahl, Lars G; Cerniglia, Carl E
2006-01-01
Cyclophilins are an evolutionarily conserved family of peptidyl-prolyl cis-trans isomerases (PPIases). A cyclophilin B (cypB) gene from the anaerobic fungus Orpinomyces sp. strain PC-2 was cloned and overexpressed in Escherichia coli. It was expressed as an amino-terminal 6 x His-tagged recombinant protein to facilitate purification. Highly purified protein (26.5 kDa) was isolated by two chromatographic steps involving affinity and gel filtration for biochemical studies of the enzyme. The recombinant CypB displayed PPIase activity with a k(cat)/K(m) of 8.9 x 10(6) M(-1) s(-1) at 10 degrees C and pH 7.8. It was inhibited by cyclosporin A (CsA) with an IC(50) of 23.5 nM, similar to those of the native protein and other cyclophilin B enzymes from animals. Genomic DNA analysis of cypB revealed that it was present as a single copy in Orpinomyces PC-2 and contained two introns, indicating it has a eukaryotic origin. It is one of the most heavily interrupted genes with intron sequences found in anaerobic fungi. The three-dimensional model of Orpinomyces PC-2 CypB was predicted with a homology modeling approach using the Swiss-Model Protein Modeling Server and three dimensional structure of human CypB as a template. The overall architecture of the CypB molecule is very similar to that of human CypB.
Loss of p53 protein during radiation transformation of primary human mammary epithelial cells
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wazer, D.E.; Chu, Qiuming; Liu, Xiao Long
1994-04-01
The causative factors leading to breast cancer are largely unknown. Increased incidence of breast cancer following diagnostic or therapeutic radiation suggests that radiation may contribute to mammary oncogenesis. This report describes the in vitro neoplastic transformation of a normal human mammary epithelial cell strain, 76N, by fractionated [gamma]-irradiation at a clinically used dose (30 Gy). The transformed cells (76R-30) were immortal, had reduced growth factor requirements, and produced tumors in nude mice. Remarkably, the 76R-30 cells completely lacked the p53 tumor suppressor protein. Loss of p53 was due to deletion of the gene on one allele and a 26-bp deletionmore » within the third intron on the second allele which resulted in abnormal splicing out of either the third or fourth exon from the mRNA. PCR with a mutation-specific primer showed that intron 3 mutation was present in irradiated cells before selection for immortal phenotype. 76R-30 cells did not exhibit G[sub 1] arrest in response to radiation, indicating a loss of p53-mediated function. Expression of the wild-type p53 gene in 76R-30 cells led to their growth inhibition. Thus, loss of p53 protein appears to have contributed to neoplastic transformation of these cells. This unique model should facilitate analyses of molecular mechanisms of radiation-induced breast cancer and allow identification of p53-regulated cellular genes in breast cells. 44 refs., 8 figs., 1 tab.« less
Collins, Richard A; Stajich, Jason E; Field, Deborah J; Olive, Joan E; DeAbreu, Diane M
2015-05-01
When we expressed a small (0.9 kb) nonprotein-coding transcript derived from the mitochondrial VS plasmid in the nucleus of Neurospora we found that it was efficiently spliced at one or more of eight 5' splice sites and ten 3' splice sites, which are present apparently by chance in the sequence. Further experimental and bioinformatic analyses of other mitochondrial plasmids, random sequences, and natural nuclear genes in Neurospora and other fungi indicate that fungal spliceosomes recognize a wide range of 5' splice site and branchpoint sequences and predict introns to be present at high frequency in random sequence. In contrast, analysis of intronless fungal nuclear genes indicates that branchpoint, 5' splice site and 3' splice site consensus sequences are underrepresented compared with random sequences. This underrepresentation of splicing signals is sufficient to deplete the nuclear genome of splice sites at locations that do not comprise biologically relevant introns. Thus, the splicing machinery can recognize a wide range of splicing signal sequences, but splicing still occurs with great accuracy, not because the splicing machinery distinguishes correct from incorrect introns, but because incorrect introns are substantially depleted from the genome. © 2015 Collins et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Another heritage from the RNA world: self-excision of intron sequence from nuclear pre-tRNAs.
Weber, U; Beier, H; Gross, H J
1996-06-15
The intervening sequences of nuclear tRNA precursors are known to be excised by tRNA splicing endonuclease. We show here that a T7 transcript corresponding to a pre-tRNA(Tyr) from Arabidopsis thaliana has a highly specific activity for autolytic intron excision. Self-cleavage occurs precisely at the authentic 3'-splice site and at the phosphodiester bond one nucleotide downstream of the authentic 5'-splice site. The reaction results in fragments with 2',3'-cyclic phosphate and 5'-OH termini. It is resistant to proteinase K and/or SDS treatment and is not inhibited by added tRNA. The self-cleavage depends on Mg2+ and is stimulated by spermine and Triton X-100. A set of sequence variants at the cleavage sites has been analysed for autolytic intron excision and, in parallel, for enzymatic in vitro splicing in wheat germ S23 extract. Single-stranded loops are a prerequisite for both reactions. Self-cleavage not only occurs at pyrimidine-A but also at U-U bonds. Since intron self-excision is only about five times slower than the enzymatic intron excision in a wheat germ S23 extract, we propose that the splicing endonuclease may function by improving the preciseness and efficiency of an inherent pre-tRNA self-cleavage activity.
Sellem, C. H.; d'Aubenton-Carafa, Y.; Rossignol, M.; Belcour, L.
1996-01-01
The mitochondrial genome of 23 wild-type strains belonging to three different species of the filamentous fungus Podospora was examined. Among the 15 optional sequences identified are two intronic reading frames, nad1-i4-orf1 and cox1-i7-orf2. We show that the presence of these sequences was strictly correlated with tightly clustered nucleotide substitutions in the adjacent exon. This correlation applies to the presence or absence of closely related open reading frames (ORFs), found at the same genetic locations, in all the Pyrenomycete genera examined. The recent gain of these optional ORFs in the evolution of the genus Podospora probably account for such sequence differences. In the homoplasmic progeny from heteroplasmons constructed between Podospora strains differing by the presence of these optional ORFs, nad1-i4-orf1 and cox1-i7-orf2 appeared highly invasive. Sequence comparisons in the nad1-i4 intron of various strains of the Pyrenomycete family led us to propose a scenario of its evolution that includes several events of loss and gain of intronic ORFs. These results strongly reinforce the idea that group I intronic ORFs are mobile elements and that their transfer, and comcomitant modification of the adjacent exon, could participate in the modular evolution of mitochondrial genomes. PMID:8725226
Bonen, Linda; Boer, Poppo H.; Gray, Michael W.
1984-01-01
We have determined the sequence of the wheat mitochondrial gene for cytochrome oxidase subunit II (COII) and find that its derived protein sequence differs from that of maize at only three amino acid positions. Unexpectedly, all three replacements are non-conservative ones. The wheat COII gene has a highly-conserved intron at the same position as in maize, but the wheat intron is 1.5 times longer because of an insert relative to its maize counterpart. Hybridization analysis of mitochondrial DNA from rye, pea, broad bean and cucumber indicates strong sequence conservation of COII coding sequences among all these higher plants. However, only rye and maize mitochondrial DNA show homology with wheat COII intron sequences and rye alone with intron-insert sequences. We find that a sequence identical to the region of the 5' exon corresponding to the transmembrane domain of the COII protein is present at a second genomic location in wheat mitochondria. These variations in COII gene structure and size, as well as the presence of repeated COII sequences, illustrate at the DNA sequence level, factors which contribute to higher plant mitochondrial DNA diversity and complexity. ImagesFig. 3.Fig. 4.Fig. 5. PMID:16453565
Sellem, C H; d'Aubenton-Carafa, Y; Rossignol, M; Belcour, L
1996-06-01
The mitochondrial genome of 23 wild-type strains belonging to three different species of the filamentous fungus Podospora was examined. Among the 15 optional sequences identified are two intronic reading frames, nad1-i4-orf1 and cox1-i7-orf2. We show that the presence of these sequences was strictly correlated with tightly clustered nucleotide substitutions in the adjacent exon. This correlation applies to the presence or absence of closely related open reading frames (ORFs), found at the same genetic locations, in all the Pyrenomycete genera examined. The recent gain of these optional ORFs in the evolution of the genus Podospora probably account for such sequence differences. In the homoplasmic progeny from heteroplasmons constructed between Podospora strains differing by the presence of these optional ORFs, nad1-i4-orf1 and cox1-i7-orf2 appeared highly invasive. Sequence comparisons in the nad1-i4 intron of various strains of the Pyrenomycete family led us to propose a scenario of its evolution that includes several events of loss and gain of intronic ORFs. These results strongly reinforce the idea that group 1 intronic ORFs are mobile elements and that their transfer, and concomitant modification of the adjacent exon, could participate in the modular evolution of mitochondrial genomes.
NASA Technical Reports Server (NTRS)
Kretsinger, R. H.; Nakayama, S.
1993-01-01
In the previous three reports in this series we demonstrated that the EF-hand family of proteins evolved by a complex pattern of gene duplication, transposition, and splicing. The dendrograms based on exon sequences are nearly identical to those based on protein sequences for troponin C, the essential light chain myosin, the regulatory light chain, and calpain. This validates both the computational methods and the dendrograms for these subfamilies. The proposal of congruence for calmodulin, troponin C, essential light chain, and regulatory light chain was confirmed. There are, however, significant differences in the calmodulin dendrograms computed from DNA and from protein sequences. In this study we find that introns are distributed throughout the EF-hand domain and the interdomain regions. Further, dendrograms based on intron type and distribution bear little resemblance to those based on protein or on DNA sequences. We conclude that introns are inserted, and probably deleted, with relatively high frequency. Further, in the EF-hand family exons do not correspond to structural domains and exon shuffling played little if any role in the evolution of this widely distributed homolog family. Calmodulin has had a turbulent evolution. Its dendrograms based on protein sequence, exon sequence, 3'-tail sequence, intron sequences, and intron positions all show significant differences.
Schwaiger, F W; Weyers, E; Epplen, C; Brün, J; Ruff, G; Crawford, A; Epplen, J T
1993-09-01
Twenty-one different caprine and 13 ovine MHC-DRB exon 2 sequences were determined including part of the adjacent introns containing simple repetitive (gt)n(ga)m elements. The positions for highly polymorphic DRB amino acids vary slightly among ungulates and other mammals. From man and mouse to ungulates the basic (gt)n(ga)m structure is fixed in evolution for 7 x 10(7) years whereas ample variations exist in the tandem (gt)n and (ga)m dinucleotides and especially their "degenerated" derivatives. Phylogenetic trees for the alpha-helices and beta-pleated sheets of the ungulate DRB sequences suggest different evolutionary histories. In hoofed animals as well as in humans DRB beta-sheet encoding sequences and adjacent intronic repeats can be assembled into virtually identical groups suggesting coevolution of noncoding as well as coding DNA. In contrast alpha-helices and C-terminal parts of the first DRB domain evolve distinctly. In the absence of a defined mechanism causing specific, site-directed mutations, double-recombination or gene-conversion-like events would readily explain this fact. The role of the intronic simple (gt)n(ga)m repeat is discussed with respect to these genetic exchange mechanisms during evolution.
Ganaie, Safder S; Chen, Aaron Yun; Huang, Chun; Xu, Peng; Kleiboeker, Steve; Du, Aifang; Qiu, Jianming
2018-04-15
Human parvovirus B19 (B19V) expresses a single precursor mRNA (pre-mRNA), which undergoes alternative splicing and alternative polyadenylation to generate 12 viral mRNA transcripts that encode two structural proteins (VP1 and VP2) and three nonstructural proteins (NS1, 7.5-kDa protein, and 11-kDa protein). Splicing at the second 5' donor site (D2 site) of the B19V pre-mRNA is essential for the expression of VP2 and the 11-kDa protein. We previously identified that cis -acting intronic splicing enhancer 2 (ISE2) that lies immediately after the D2 site facilitates the recognition of the D2 donor for its efficient splicing. In this study, we report that ISE2 is critical for the expression of the 11-kDa viral nonstructural protein. We found that ISE2 harbors a consensus RNA binding motif protein 38 (RBM38) binding sequence, 5'-UGUGUG-3'. RBM38 is expressed during the middle stage of erythropoiesis. We first confirmed that RBM38 binds specifically with the ISE2 element in vitro The knockdown of RBM38 significantly decreases the level of spliced mRNA at D2 that encodes the 11-kDa protein but not that of the D2-spliced mRNA that encodes VP2. Importantly, we found that the 11-kDa protein enhances viral DNA replication and virion release. Accordingly, the knockdown of RBM38 decreases virus replication via downregulating 11-kDa protein expression. Taken together, these results suggest that the 11-kDa protein facilitates B19V DNA replication and that RBM38 is an essential host factor for B19V pre-mRNA splicing and for the expression of the 11-kDa protein. IMPORTANCE B19V is a human pathogen that can cause fifth disease, arthropathy, anemia in immunocompromised patients and sickle cell disease patients, myocarditis, and hydrops fetalis in pregnant women. Human erythroid progenitor cells (EPCs) are most susceptible to B19V infection and fully support viral DNA replication. The exclusive tropism of B19V for erythroid-lineage cells is dependent not only on the expression of viral receptors and coreceptors on the cell surface but also on the intracellular host factors that support B19V replication. Our present study shows that B19V uses a host factor, RNA binding motif protein 38 (RBM38), for the processing of its pre-mRNA during virus replication. Specifically, RBM38 interacts with the intronic splicing enhancer 2 (ISE2) element of B19V pre-mRNA and promotes 11-kDa protein expression, thereby regulating the 11-kDa protein-mediated augmentation of B19V replication. The identification of this novel host-pathogen interaction will provide mechanistic insights into B19V replication and aid in finding new targets for anti-B19V therapeutics. Copyright © 2018 American Society for Microbiology.
Yousry, Sherif M; Sedky, Yasser; Sobieh, Alaa
2016-10-01
Aim Rheumatic heart disease is an inflammatory disease of cardiac tissue. The underlying pathogenic mechanisms highlight a complex interplay of immunological, genetic, and environmental factors. The aim of the present study was to investigate whether IL-4 (intron 3) and IL-10 (-1082) gene polymorphisms could be associated with susceptibility and/or severity of rheumatic heart disease among patients from the Egyptian population. Materials and methods A cohort of 140 Egyptian children with rheumatic heart disease and 100 healthy controls were enrolled in this case-control study. Genotyping for IL-4 (intron 3) and IL-10 (-1082) gene polymorphisms was carried out for all patients using a polymerase chain reaction-based analysis. No significant difference in the distribution of genotypes and allelic frequencies between rheumatic heart disease cases and controls for IL-4 (intron 3) (p=0.17; OR 1.07, 95% CI 0.82-3.74) and IL-10 (-1082) (p=0.49; OR 1.03, 95% CI 0.65-2.71) gene polymorphisms was observed. Further categorisation of patients into mitral valve disease and combined valve disease subgroups showed that cases with mitral valve disease have significantly higher frequency of the RP2 allele of IL-4 (intron 3) (p=0.03; OR 2.98, 95% CI 1.93-6.15) and the G allele of IL-10 (-1082) (p=0.04; OR 2.14, 95% CI 1.62-4.95) when compared with controls. Discussion Our study shows that IL-4 (intron 3) and IL-10 (-1082) gene polymorphisms are not significantly associated with susceptibility to rheumatic heart disease, but they might play a role in the pathogenesis of patients with mitral valve disease.
Identification and analysis of multigene families by comparison of exon fingerprints.
Brown, N P; Whittaker, A J; Newell, W R; Rawlings, C J; Beck, S
1995-06-02
Gene families are often recognised by sequence homology using similarity searching to find relationships, however, genomic sequence data provides gene architectural information not used by conventional search methods. In particular, intron positions and phases are expected to be relatively conserved features, because mis-splicing and reading frame shifts should be selected against. A fast search technique capable of detecting possible weak sequence homologies apparent at the intron/exon level of gene organization is presented for comparing spliceosomal genes and gene fragments. FINEX compares strings of exons delimited by intron/exon boundary positions and intron phases (exon fingerprint) using a global dynamic programming algorithm with a combined intron phase identity and exon size dissimilarity score. Exon fingerprints are typically two orders of magnitude smaller than their nucleic acid sequence counterparts giving rise to fast search times: a ranked search against a library of 6755 fingerprints for a typical three exon fingerprint completes in under 30 seconds on an ordinary workstation, while a worst case largest fingerprint of 52 exons completes in just over one minute. The short "sequence" length of exon fingerprints in comparisons is compensated for by the large exon alphabet compounded of intron phase types and a wide range of exon sizes, the latter contributing the most information to alignments. FINEX performs better in some searches than conventional methods, finding matches with similar exon organization, but low sequence homology. A search using a human serum albumin finds all members of the multigene family in the FINEX database at the top of the search ranking, despite very low amino acid percentage identities between family members. The method should complement conventional sequence searching and alignment techniques, offering a means of identifying otherwise hard to detect homologies where genomic data are available.
Single-molecule RNA observation in vivo reveals dynamics of co-transcriptional splicing
NASA Astrophysics Data System (ADS)
Ferguson, M. L.; Coulon, A.; de Turris, V.; Palangat, M.; Chow, C. C.; Singer, R. H.; Larson, D. R.
2013-03-01
The synthesis of pre-mRNA and the splicing of that pre-mRNA to form completed transcripts requires coordination between two large multi-subunit complexes (the transcription elongation complex and the spliceosome). How this coordination occurs in vivo is unknown. Here we report the first experimental observation of transcription and splicing occurring at the same gene in living cells. By utilizing the PP7/MS2 fluorescent RNA reporter system, we can directly observe two distinct regions of the nascent RNA, allowing us to measure the rise and fall time of the intron and exon of a reporter gene stably integrated into a human cell line. The reporter gene consists of a beta globin gene where we have inserted a 24 RNA hairpin cassette into the intron/exon. Upon synthesis, the RNA hairpins are tightly bound by fluorescently-labeled PP7/MS2 bacteriophage coat proteins. After gene induction, a single locus of active transcription in the nucleus shows fluorescence intensity changes characteristic of the synthesis and excision of the intron/exon. Using fluctuation analysis, we determine the elongation rate to be 1.5 kb/min. From the temporal cross correlation function, we determine that splicing of this gene must be co-transcriptional with a splicing time of ~100 seconds before termination and a ~200 second pause at termination. We propose that dual-color RNA imaging may be extended to investigate other mechanisms of transcription, gene regulation, and RNA processing.
DeVry, C G; Tsai, W; Clarke, S
1996-11-15
The protein L-isoaspartyl/D-aspartyl O-methyltransferase (EC 2.1.1.77) catalyzes the first step in the repair of proteins damaged in the aging process by isomerization or racemization reactions at aspartyl and asparaginyl residues. A single gene has been localized to human chromosome 6 and multiple transcripts arising through alternative splicing have been identified. Restriction enzyme mapping, subcloning, and DNA sequence analysis of three overlapping clones from a human genomic library in bacteriophage P1 indicate that the gene spans approximately 60 kb and is composed of 8 exons interrupted by 7 introns. Analysis of intron/exon splice junctions reveals that all of the donor and acceptor splice sites are in agreement with the mammalian consensus splicing sequence. Determination of transcription initiation sites by primer extension analysis of poly(A)+ mRNA from human brain identifies multiple start sites, with a major site 159 nucleotides upstream from the ATG start codon. Sequence analysis of the 5'-untranslated region demonstrates several potential cis-acting DNA elements including SP1, ETF, AP1, AP2, ARE, XRE, CREB, MED-1, and half-palindromic ERE motifs. The promoter of this methyltransferase gene lacks an identifiable TATA box but is characterized by a CpG island which begins approximately 723 nucleotides upstream of the major transcriptional start site and extends through exon 1 and into the first intron. These features are characteristic of housekeeping genes and are consistent with the wide tissue distribution observed for this methyltransferase activity.
Juge, F; Audibert, A; Benoit, B; Simonelig, M
2000-01-01
The Suppressor of forked protein is the Drosophila homolog of the 77K subunit of human cleavage stimulation factor, a complex required for the first step of the mRNA 3'-end-processing reaction. We have shown previously that wild-type su(f) function is required for the accumulation of a truncated su(f) transcript polyadenylated in intron 4 of the gene. This led us to propose a model in which the Su(f) protein would negatively regulate its own accumulation by stimulating 3'-end formation of this truncated su(f) RNA. In this article, we demonstrate this model and show that su(f) autoregulation is tissue specific. The Su(f) protein accumulates at a high level in dividing tissues, but not in nondividing tissues. We show that this distribution of the Su(f) protein results from stimulation by Su(f) of the tissue-specific utilization of the su(f) intronic poly(A) site, leading to the accumulation of the truncated su(f) transcript in nondividing tissues. Utilization of this intronic poly(A) site is affected in a su(f) mutant and restored in the mutant with a transgene encoding wild-type Su(f) protein. These data provide an in vivo example of cell-type-specific regulation of a protein level by poly(A) site choice, and confirm the role of Su(f) in regulation of poly(A) site utilization. PMID:11105753
Factors influencing alternative splice site utilization in vivo.
Fu, X Y; Manley, J L
1987-01-01
To study factors that influence the choice of alternative pre-mRNA splicing pathways, we introduced plasmids expressing either wild-type or mutated simian virus 40 (SV40) early regions into tissue culture cells and then measured the quantities of small-t and large-T RNAs produced. One important element controlling splice site selection was found to be the size of the intron removed in the production of small-t mRNA; expansion of this intron (from 66 to 77 or more nucleotides) resulted in a substantial increase in the amount of small-t mRNA produced relative to large-T mRNA. This suggests that in the normal course of SV40 early pre-mRNA processing, large-T splicing is at a competitive advantage relative to small-t splicing because of the small size of the latter intron. Several additional features of the pre-mRNA that can influence splice site selection were also identified by analyzing the effects of mutations containing splice site duplications. These include the strengths of competing 5' splice sites and the relative positions of splice sites in the pre-mRNA. Finally, we showed that the ratio of small-t to large-T mRNA was 10 to 15-fold greater in human 293 cells than in HeLa cells or other mammalian cell types. These results suggest the existence of cell-specific trans-acting factors that can dramatically alter the pattern of splice site selection in a pre-mRNA. Images PMID:3029566
Gifford, Matthew E; Larson, Allan
2008-10-01
A previous phylogeographic study of mitochondrial haplotypes for the Hispaniolan lizard Ameiva chrysolaema revealed deep genetic structure associated with seawater inundation during the late Pliocene/early Pleistocene and evidence of subsequent population expansion into formerly inundated areas. We revisit hypotheses generated by our previous study using increased geographic sampling of populations and analysis of three nuclear markers (alpha-enolase intron 8, alpha-cardiac-actin intron 4, and beta-actin intron 3) in addition to mitochondrial haplotypes (ND2). Large genetic discontinuities correspond spatially and temporally with historical barriers to gene flow (sea inundations). NCPA cross-validation analysis and Bayesian multilocus analyses of divergence times (IMa and MCMCcoal) reveal two separate episodes of fragmentation associated with Pliocene and Pleistocene sea inundations, separating the species into historically separate Northern, East-Central, West-Central, and Southern population lineages. Multilocus Bayesian analysis using IMa indicates asymmetrical migration from the East-Central to the West-Central populations following secondary contact, consistent with expectations from the more pervasive sea inundation in the western region. The West-Central lineage has a genetic signature of population growth consistent with the expectation of geographic expansion into formerly inundated areas. Within each lineage, significant spatial genetic structure indicates isolation by distance at comparable temporal scales. This study adds to the growing body of evidence that vicariant speciation may be the prevailing source of lineage accumulation on oceanic islands. Thus, prior theories of island biogeography generally underestimate the role and temporal scale of intra-island vicariant processes.
Vasconcelos, O; Sivakumar, K; Dalakas, M C; Quezado, M; Nagle, J; Leon-Monzon, M; Dubnick, M; Gajdusek, D C; Goldfarb, L G
1995-01-01
Mutations in the human phosphofructokinase muscle subunit gene (PFKM) are known to cause myopathy classified as glycogenosis type VII (Tarui disease). Previously described molecular defects include base substitutions altering encoded amino acids or resulting in abnormal splicing. We report a mutation resulting in phosphofructokinase deficiency in three patients from an Ashkenazi Jewish family. Using a reverse transcription PCR assay, PFKM subunit transcripts differing by length were detected in skeletal muscle tissue of all three affected subjects. In the longer transcript, an insertion of 252 nucleotides totally homologous to the structure of the 10th intron of the PFKM gene was found separating exon 10 from exon 11. In addition, two single base transitions were identified by direct sequencing: [exon 6; codon 95; CGA (Arg) to TGA (stop)] and [exon 7; codon 172; ACC (Thr) to ACT (Thr)] in either transcript. Single-stranded conformational polymorphism and restriction enzyme analyses confirmed the presence of these point substitutions in genomic DNA and strongly suggested homozygosity for the pathogenic allele. The nonsense mutation at codon 95 appeared solely responsible for the phenotype in these patients, further expanding genetic heterogeneity of Tarui disease. Transcripts with and without intron 10 arising from identical mutant alleles probably resulted from differential pre-mRNA processing and may represent a novel message from the PFKM gene. Images Fig. 2 Fig. 4 Fig. 5 PMID:7479776
CELF1 preferentially binds to exon-intron boundary and regulates alternative splicing in HeLa cells.
Xia, Heng; Chen, Dong; Wu, Qijia; Wu, Gang; Zhou, Yanhong; Zhang, Yi; Zhang, Libin
2017-09-01
The current RIP-seq approach has been developed for the identification of genome-wide interaction between RNA binding protein (RBP) and the bound RNA transcripts, but still rarely for identifying its binding sites. In this study, we performed RIP-seq experiments in HeLa cells using a monoclonal antibody against CELF1. Mapping of the RIP-seq reads showed a biased distribution at the 3'UTR and intronic regions. A total of 15,285 and 1384 CELF1-specific sense and antisense peaks were identified using the ABLIRC software tool. Our bioinformatics analyses revealed that 5' and 3' splice site motifs and GU-rich motifs were highly enriched in the CELF1-bound peaks. Furthermore, transcriptome analyses revealed that alternative splicing was globally regulated by CELF1 in HeLa cells. For example, the inclusion of exon 16 of LMO7 gene, a marker gene of breast cancer, is positively regulated by CELF1. Taken together, we have shown that RIP-seq data can be used to decipher RBP binding sites and reveal an unexpected landscape of the genome-wide CELF1-RNA interactions in HeLa cells. In addition, we found that CELF1 globally regulates the alternative splicing by binding the exon-intron boundary in HeLa cells, which will deepen our understanding of the regulatory roles of CELF1 in the pre-mRNA splicing process. Copyright © 2017 Elsevier B.V. All rights reserved.
Zhou, Yong; Hu, Lifang; Wu, Hao; Jiang, Lunwei
2017-01-01
Superoxide dismutase (SOD) proteins are widely present in the plant kingdom and play important roles in different biological processes. However, little is known about the SOD genes in cucumber. In this study, night SOD genes were identified from cucumber (Cucumis sativus) using bioinformatics-based methods, including 5 Cu/ZnSODs, 3 FeSODs, and 1 MnSOD. Gene structure and motif analysis indicated that most of the SOD genes have relatively conserved exon/intron arrangement and motif composition. Phylogenetic analyses with SODs from cucumber and several other species revealed that these SOD proteins can be traced back to two ancestral SODs before the divergence of monocot and dicot plants. Many cis-elements related to stress responses and plant hormones were found in the promoter sequence of each CsSOD gene. Gene expression analysis revealed that most of the CsSOD genes are expressed in almost all the tested tissues. qRT-PCR analysis of 8 selected CsSOD genes showed that these genes could respond to heat, cold, osmotic, and salt stresses. Our results provide a basis for further functional research on SOD gene family in cucumber and facilitate their potential applications in the genetic improvement of cucumber. PMID:28808654
Chromosomal localization and cDNA cloning of the human DBP and TEF genes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Khatib, Z.A.; Inaba, T.; Valentine, M.
1994-09-15
The authors have isolated cDNA and genomic clones and determined the human chromosome positions of two genes encoding transcription factors expressed in the liver and the pituitary gland: albumin D-site-binding protein (DBP) and thyrotroph embryonic factor (TEF). Both proteins have been identified as members of the PAR (proline and acidic amino acid-rich) subfamily of bZIP transcription factors in the rat, but human homologues have not been characterized. Using a fluorescence in situ hybridization technique, the DBP locus was assigned to chromosome 19q13, and TEF to chromosome 22q13. Each assignment was confirmed by means of human chromosome segregation in somatic cellmore » hybrids. Coding sequences of DBP and TEF, extending beyond the bZIP domain to the PAR region, were highly conserved in both human-human and interspecies comparisons. Conservation of the exon-intron boundaries of each bZIP domain-encoding exon suggested derivation from a common ancestral gene. DBP and TEF mRNAs were expressed in all tissues and cell lines examined, including brain, lung, liver, spleen, and kidney. Knowledge of the human chromosome locations of these PAR proteins will facilitate studies to assess their involvement in carcinogenesis and other fundamental biological processes. 37 refs., 5 figs., 1 tab.« less
High-resolution phylogenic microbial community profiling
USDA-ARS?s Scientific Manuscript database
PIECE (Plant Intron Exon Comparison and Evolution) is a web-accessible database that houses intron and exon information of plant genes. PIECE serves as a resource for biologists interested in comparing intron–exon organization and provides valuable insights into the evolution of gene structure in pl...
The complete plastid genome sequence of Eustrephus latifolius (Asparagaceae: Lomandroideae).
Kim, Hyoung Tae; Kim, Jung Sung; Kim, Joo-Hwan
2016-01-01
The complete chloroplast (cp) genome sequence of Eustrephus latifolius was firstly determined in subfamily Lomandriodeae of family Asparagaceae. It was 159,736 bp and contained a large single copy region (82,403 bp) and a small single copy region (13,607 bp) which were separated by two inverted repeat regions (31,863 bp). In total, 132 genes were identified and they were consisted of 83 coding genes, 8 rRNA genes, 38 tRNA genes, 3 pseudogenes. rpl23 and clpP were pseudogenes due to sequence deletions. Among 23 genes containing introns, rps12 and ycf3 contained two introns and the rest had just one intron. The intact ycf68 was identified within an intron of trnI-GAU. The amino acid sequence was almost identical with Phoenix dactylifera in Aracales. Ycf1 of E. latifolius was completely located in IR. It was similar to cp genome structure of Lemna minor, Spirodela polyrhiza, Wolffiella lingulata, Wolffia australiana in Alismatales.
Bunawan, Hamidun; Yen, Choong Chee; Yaakop, Salmah; Noor, Normah Mohd
2017-01-26
The chloroplastic trnL intron and the nuclear internal transcribed spacer (ITS) region were sequenced for 11 Nepenthes species recorded in Peninsular Malaysia to examine their phylogenetic relationship and to evaluate the usage of trnL intron and ITS sequences for phylogenetic reconstruction of this genus. Phylogeny reconstruction was carried out using neighbor-joining, maximum parsimony and Bayesian analyses. All the trees revealed two major clusters, a lowland group consisting of N. ampullaria, N. mirabilis, N. gracilis and N. rafflesiana, and another containing both intermediately distributed species (N. albomarginata and N. benstonei) and four highland species (N. sanguinea, N. macfarlanei, N. ramispina and N. alba). The trnL intron and ITS sequences proved to provide phylogenetic informative characters for deriving a phylogeny of Nepenthes species in Peninsular Malaysia. To our knowledge, this is the first molecular phylogenetic study of Nepenthes species occurring along an altitudinal gradient in Peninsular Malaysia.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Doerk, T.; Wulbrand, U.; Tuemmler, B.
1993-03-01
Single cases of the four novel splice site mutations 1525[minus]1 G [r arrow] A (intron 9), 3601[minus]2 A [r arrow] G (intron 18), 3850[minus]3 T [r arrow] G (intron 19), and 4374+1 G [r arrow] T (intron 23) were detected in the CFTR gene of cystic fibrosis patients of Indo-Iranian, Turkish, Polish, and Germany descent. The nucleotide substitutions at the +1, [minus]1, and [minus]2 positions all destroy splice sites and lead to severe disease alleles associated with features typical of gastrointestinal and pulmonary cystic fibrosis disease. The 3850[minus]3 T-to-G change was discovered in a very mildly affected 33-year-old [Delta]F508 compoundmore » heterozygote, suggesting that the T-to-G transversion at the less conserved [minus]3 position of the acceptor splice site may retain some wildtype function. 13 refs., 1 fig., 2 tabs.« less
Neuhaus, H; Link, G
1987-01-01
The trnK gene endocing the tRNALys(UUU) has been located on mustard (Sinapis alba) chloroplast DNA, 263 bp upstream of the psbA gene on the same strand. The nucleotide sequence of the trnK gene and its flanking regions as well as the putative transcription start and termination sites are shown. The 5' end of the transcript lies 121 bp upstream of the 5' tRNA coding region and is preceded by procaryotic-type "-10" and "-35" sequence elements, while the 3' end maps 2.77 kb downstream to a DNA region with possible stemloop secondary structure. The anticodon loop of the tRNALys is interrupted by a 2,574 bp intron containing a long open reading frame, which codes for 524 amino acids. Based on conserved stem and loop structures, this intron has characteristic features of a class II intron. A region near the carboxyl terminus of the derived polypeptide appears structurally related to maturases.
Wang, Yajun; Hamilton, Michael; Ben-Hur, Asa; Reddy, Anireddy S.N.
2015-01-01
Plant SR45 and its metazoan ortholog RNPS1 are serine/arginine-rich (SR)-like RNA binding proteins that function in splicing/postsplicing events and regulate diverse processes in eukaryotes. Interactions of SR45 with both RNAs and proteins are crucial for regulating RNA processing. However, in vivo RNA targets of SR45 are currently unclear. Using RNA immunoprecipitation followed by high-throughput sequencing, we identified over 4000 Arabidopsis thaliana RNAs that directly or indirectly associate with SR45, designated as SR45-associated RNAs (SARs). Comprehensive analyses of these SARs revealed several roles for SR45. First, SR45 associates with and regulates the expression of 30% of abscisic acid (ABA) signaling genes at the postsplicing level. Second, although most SARs are derived from intron-containing genes, surprisingly, 340 SARs are derived from intronless genes. Expression analysis of the SARs suggests that SR45 differentially regulates intronless and intron-containing SARs. Finally, we identified four overrepresented RNA motifs in SARs that likely mediate SR45’s recognition of its targets. Therefore, SR45 plays an unexpected role in mRNA processing of intronless genes, and numerous ABA signaling genes are targeted for regulation at the posttranscriptional level. The diverse molecular functions of SR45 uncovered in this study are likely applicable to other species in view of its conservation across eukaryotes. PMID:26603559
2010-01-01
Comparison of 18S rDNA gene sequences is a very promising method for identification and classification of living organisms. Molecular identification and discrimination of different Dunaliella species were carried out based on the size of 18S rDNA gene and, number and position of introns in the gene. Three types of 18S rDNA structure have already been reported: the gene with a size of ~1770 bp lacking any intron, with a size of ~2170 bp consisting one intron near 5' terminus, and with a size of ~2570 bp harbouring two introns near 5' and 3' termini. Hereby, we report a new 18S rDNA gene arrangement in terms of intron localization and nucleotide sequence in a Dunaliella isolated from Iranian salt lakes (ABRIINW-M1/2). PCR amplification with genus-specific primers resulted in production of a ~2170 bp DNA band, which is similar to that of D. salina 18S rDNA gene containing only one intron near 5' terminus. Whilst, sequence composition of the gene revealed the lack of any intron near 5' terminus in our isolate. Furthermore, another alteration was observed due to the presence of a 440 bp DNA fragment near 3' terminus. Accordingly, 18S rDNA gene of the isolate is clearly different from those of D. salina and any other Dunaliella species reported so far. Moreover, analysis of ITS region sequence showed the diversity of this region compared to the previously reported species. 18S rDNA and ITS sequences of our isolate were submitted with accesion numbers of EU678868 and EU927373 in NCBI database, respectively. The optimum growth rate of this isolate occured at the salinity level of 1 M NaCl. The maximum carotenoid content under stress condition of intense light (400 μmol photon m-2 s-1), high salinity (4 M NaCl) and deficiency of nitrate and phosphate nutritions reached to 240 ng/cell after 15 days. PMID:20377865
Rühl, Christina; Stauffer, Eva; Kahles, André; Wagner, Gabriele; Drechsel, Gabriele; Rätsch, Gunnar; Wachter, Andreas
2012-01-01
Alternative splicing (AS) generates transcript variants by variable exon/intron definition and massively expands transcriptome diversity. Changes in AS patterns have been found to be linked to manifold biological processes, yet fundamental aspects, such as the regulation of AS and its functional implications, largely remain to be addressed. In this work, widespread AS regulation by Arabidopsis thaliana Polypyrimidine tract binding protein homologs (PTBs) was revealed. In total, 452 AS events derived from 307 distinct genes were found to be responsive to the levels of the splicing factors PTB1 and PTB2, which predominantly triggered splicing of regulated introns, inclusion of cassette exons, and usage of upstream 5′ splice sites. By contrast, no major AS regulatory function of the distantly related PTB3 was found. Dependent on their position within the mRNA, PTB-regulated events can both modify the untranslated regions and give rise to alternative protein products. We find that PTB-mediated AS events are connected to diverse biological processes, and the functional implications of selected instances were further elucidated. Specifically, PTB misexpression changes AS of PHYTOCHROME INTERACTING FACTOR6, coinciding with altered rates of abscisic acid–dependent seed germination. Furthermore, AS patterns as well as the expression of key flowering regulators were massively changed in a PTB1/2 level-dependent manner. PMID:23192226
Li, Qingshun Q; Liu, Zhaoyang; Lu, Wenjia; Liu, Man
2017-05-17
Pre-mRNA alternative splicing and alternative polyadenylation have been implicated to play important roles during eukaryotic gene expression. However, much remains unknown regarding the regulatory mechanisms and the interactions of these two processes in plants. Here we focus on an Arabidopsis gene OXT6 (Oxidative Tolerant-6) that has been demonstrated to encode two proteins through alternative splicing and alternative polyadenylation. Specifically, alternative polyadenylation at Intron-2 of OXT6 produces a transcript coding for AtCPSF30, an Arabidopsis ortholog of 30 kDa subunit of the Cleavage and Polyadenylation Specificity Factor. On the other hand, alternative splicing of Intron-2 generates a longer transcript encoding a protein named AtC30Y, a polypeptide including most part of AtCPSF30 and a YT521B domain. To investigate the expression outcome of OXT6 in plants, a set of mutations were constructed to alter the splicing and polyadenylation patterns of OXT6. Analysis of transgenic plants bearing these mutations by quantitative RT-PCR revealed a competition relationship between these two processes. Moreover, when both splice sites and poly(A) signals were mutated, polyadenylation became the preferred mode of OXT6 processing. These results demonstrate the interplay between alternative splicing and alternative polyadenylation, and it is their concerted actions that define a gene's expression outcome.
Darnell, James E.
2013-01-01
Several strong conclusions emerge concerning pre-mRNA processing from both old and newer experiments. The RNAPII complex is involved with pre-mRNA processing through binding of processing proteins to the CTD (carboxyl terminal domain) of the largest RNAPII subunit. These interactions are necessary for efficient processing, but whether factor binding to the CTD and delivery to splicing sites is obligatory or facilitatory is unsettled. Capping, addition of an m7Gppp residue (cap) to the initial transcribed residue of a pre-mRNA, occurs within seconds. Splicing of pre-mRNA by spliceosomes at particular sites is most likely committed during transcription by the binding of initiating processing factors and ∼50% of the time is completed in mammalian cells before completion of the primary transcript. This fact has led to an outpouring in the literature about “cotranscriptional splicing.” However splicing requires several minutes for completion and can take longer. The RNAPII complex moves through very long introns and also through regions dense with alternating exons and introns at an average rate of ∼3 kb per min and is, therefore, not likely detained at each splice site for more than a few seconds, if at all. Cleavage of the primary transcript at the 3′ end and polyadenylation occurs within 30 sec or less at recognized polyA sites, and the majority of newly polyadenylated pre-mRNA molecules are much larger than the average mRNA. Finally, it seems quite likely that the nascent RNA most often remains associated with the chromosomal locus being transcribed until processing is complete, possibly acquiring factors related to the transport of the new mRNA to the cytoplasm. PMID:23440351
Hopper, Anita K.
2013-01-01
Transfer RNAs (tRNAs) are essential for protein synthesis. In eukaryotes, tRNA biosynthesis employs a specialized RNA polymerase that generates initial transcripts that must be subsequently altered via a multitude of post-transcriptional steps before the tRNAs beome mature molecules that function in protein synthesis. Genetic, genomic, biochemical, and cell biological approaches possible in the powerful Saccharomyces cerevisiae system have led to exciting advances in our understandings of tRNA post-transcriptional processing as well as to novel insights into tRNA turnover and tRNA subcellular dynamics. tRNA processing steps include removal of transcribed leader and trailer sequences, addition of CCA to the 3′ mature sequence and, for tRNAHis, addition of a 5′ G. About 20% of yeast tRNAs are encoded by intron-containing genes. The three-step splicing process to remove the introns surprisingly occurs in the cytoplasm in yeast and each of the splicing enzymes appears to moonlight in functions in addition to tRNA splicing. There are 25 different nucleoside modifications that are added post-transcriptionally, creating tRNAs in which ∼15% of the residues are nucleosides other than A, G, U, or C. These modified nucleosides serve numerous important functions including tRNA discrimination, translation fidelity, and tRNA quality control. Mature tRNAs are very stable, but nevertheless yeast cells possess multiple pathways to degrade inappropriately processed or folded tRNAs. Mature tRNAs are also dynamic in cells, moving from the cytoplasm to the nucleus and back again to the cytoplasm; the mechanism and function of this retrograde process is poorly understood. Here, the state of knowledge for tRNA post-transcriptional processing, turnover, and subcellular dynamics is addressed, highlighting the questions that remain. PMID:23633143
Quantitative Profiling of Peptides from RNAs classified as non-coding
Prabakaran, Sudhakaran; Hemberg, Martin; Chauhan, Ruchi; Winter, Dominic; Tweedie-Cullen, Ry Y.; Dittrich, Christian; Hong, Elizabeth; Gunawardena, Jeremy; Steen, Hanno; Kreiman, Gabriel; Steen, Judith A.
2014-01-01
Only a small fraction of the mammalian genome codes for messenger RNAs destined to be translated into proteins, and it is generally assumed that a large portion of transcribed sequences - including introns and several classes of non-coding RNAs (ncRNAs) do not give rise to peptide products. A systematic examination of translation and physiological regulation of ncRNAs has not been conducted. Here, we use computational methods to identify the products of non-canonical translation in mouse neurons by analyzing unannotated transcripts in combination with proteomic data. This study supports the existence of non-canonical translation products from both intragenic and extragenic genomic regions, including peptides derived from anti-sense transcripts and introns. Moreover, the studied novel translation products exhibit temporal regulation similar to that of proteins known to be involved in neuronal activity processes. These observations highlight a potentially large and complex set of biologically regulated translational events from transcripts formerly thought to lack coding potential. PMID:25403355
Splicing predictions reliably classify different types of alternative splicing
Busch, Anke; Hertel, Klemens J.
2015-01-01
Alternative splicing is a key player in the creation of complex mammalian transcriptomes and its misregulation is associated with many human diseases. Multiple mRNA isoforms are generated from most human genes, a process mediated by the interplay of various RNA signature elements and trans-acting factors that guide spliceosomal assembly and intron removal. Here, we introduce a splicing predictor that evaluates hundreds of RNA features simultaneously to successfully differentiate between exons that are constitutively spliced, exons that undergo alternative 5′ or 3′ splice-site selection, and alternative cassette-type exons. Surprisingly, the splicing predictor did not feature strong discriminatory contributions from binding sites for known splicing regulators. Rather, the ability of an exon to be involved in one or multiple types of alternative splicing is dictated by its immediate sequence context, mainly driven by the identity of the exon's splice sites, the conservation around them, and its exon/intron architecture. Thus, the splicing behavior of human exons can be reliably predicted based on basic RNA sequence elements. PMID:25805853
Modelling reveals kinetic advantages of co-transcriptional splicing.
Aitken, Stuart; Alexander, Ross D; Beggs, Jean D
2011-10-01
Messenger RNA splicing is an essential and complex process for the removal of intron sequences. Whereas the composition of the splicing machinery is mostly known, the kinetics of splicing, the catalytic activity of splicing factors and the interdependency of transcription, splicing and mRNA 3' end formation are less well understood. We propose a stochastic model of splicing kinetics that explains data obtained from high-resolution kinetic analyses of transcription, splicing and 3' end formation during induction of an intron-containing reporter gene in budding yeast. Modelling reveals co-transcriptional splicing to be the most probable and most efficient splicing pathway for the reporter transcripts, due in part to a positive feedback mechanism for co-transcriptional second step splicing. Model comparison is used to assess the alternative representations of reactions. Modelling also indicates the functional coupling of transcription and splicing, because both the rate of initiation of transcription and the probability that step one of splicing occurs co-transcriptionally are reduced, when the second step of splicing is abolished in a mutant reporter.
Left-handed Z-DNA: structure and function
NASA Technical Reports Server (NTRS)
Herbert, A.; Rich, A.
1999-01-01
Z-DNA is a high energy conformer of B-DNA that forms in vivo during transcription as a result of torsional strain generated by a moving polymerase. An understanding of the biological role of Z-DNA has advanced with the discovery that the RNA editing enzyme double-stranded RNA adenosine deaminase type I (ADAR1) has motifs specific for the Z-DNA conformation. Editing by ADAR1 requires a double-stranded RNA substrate. In the cases known, the substrate is formed by folding an intron back onto the exon that is targeted for modification. The use of introns to direct processing of exons requires that editing occurs before splicing. Recognition of Z-DNA by ADAR1 may allow editing of nascent transcripts to be initiated immediately after transcription, ensuring that editing and splicing are performed in the correct sequence. Structural characterization of the Z-DNA binding domain indicates that it belongs to the winged helix-turn-helix class of proteins and is similar to the globular domain of histone-H5.
BIALLELIC POLYMORPHISM IN THE INTRON REGION OF B-TUBULIN GENE OF CRYPTOSPORIDIUM PARASITES
Nucleotide sequencing of polymerase chain reaction-amplified intron region of the Cryptosporidium parvum B-tubulin gene in 26 human and 15 animal isolates revealed distinct genetic polymorphism between the human and bovine genotypes. The separation of 2 genotypes of C. parvum is...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sellem, C.H.; Rossignol, M.; Belcour, L.
1996-06-01
The mitochondrial genome of 23 wild-type strains belonging to three different species of the filamentous fungus Podospora was examined. Among the 15 optical sequences identified are two intronic reading frames, nad1-i4-orf1 and cox1-i7-orf2. We show that the presence of these sequences was strictly correlated with tightly clustered nucleotide substitutions in the adjacent exon. This correlation applies to the presence or absence of closely related open reading frames (ORFs), found at the same genetic locations, in all the Pyrenomycete genera examined. The recent gain of these optional ORFs in the evolution of the genus Podospora probably account for such sequence differences.more » In the homoplasmic progeny from heteroplasmons constructed between Podospora strains differing by the presence of these optional ORFs, nad1-i4-orf1 and cox1-i7-orf2 appeared highly invasive. Sequence comparisons in the nad1-i4 intron of various strains of the Pyrenomycete family led us to propose a scenario of its evolution that includes several events of loss and gain of intronic ORFs. These results strongly reinforce the idea that group I intronic ORFs are mobile elements and that their transfer, and comcomitant modification of the adjacent exon, could participate in the modular evolution of mitochondrial genomes. 46 refs., 5 figs., 2 tabs.« less
Factor IX gene haplotypes in Amerindians.
Franco, R F; Araújo, A G; Zago, M A; Guerreiro, J F; Figueiredo, M S
1997-02-01
We have determined the haplotypes of the factor IX gene for 95 Indians from 5 Brazilian Amazon tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Eight polymorphisms linked to the factor IX gene were investigated: MseI (at 5', nt -698), BamHI (at 5', nt -561), DdeI (intron 1), BamHI (intron 2), XmnI (intron 3), TaqI (intron 4), MspI (intron 4), and HhaI (at 3', approximately 8 kb). The results of the haplotype distribution and the allele frequencies for each of the factor IX gene polymorphisms in Amerindians were similar to the results reported for Asian populations but differed from results for other ethnic groups. Only five haplotypes were identified within the entire Amerindian study population, and the haplotype distribution was significantly different among the five tribes, with one (Arára) to four (Wayampí) haplotypes being found per tribe. These findings indicate a significant heterogeneity among the Indian tribes and contrast with the homogeneous distribution of the beta-globin gene cluster haplotypes but agree with our recent findings on the distribution of alpha-globin gene cluster haplotypes and the allele frequencies for six VNTRs in the same Amerindian tribes. Our data represent the first study of factor IX-associated polymorphisms in Amerindian populations and emphasizes the applicability of these genetic markers for population and human evolution studies.
Ge, H; Noble, J; Colgan, J; Manley, J L
1990-01-01
We have studied splicing of the polyoma virus early region pre-mRNA in vitro. This RNA is alternatively spliced in vivo to produce mRNA encoding the large, middle-sized (MTAg), and small (StAg) tumor antigens. Our primary interest was to learn how the 48-nucleotide StAg intron is excised, because the length of this intron is significantly less than the apparent minimum established for mammalian introns. Although the products of all three splices are detected in vitro, characterization of the pathway and sequence requirements of StAg splicing suggests that splicing factors interact with the precursor RNA in an unexpected way to catalyze removal of this intron. Specifically, StAg splicing uses either of two lariat branch points, one of which is located only 4 nucleotides from the 3' splice site. Furthermore, the StAg splice absolutely requires that the alternative MTAg 3' splice site, located 14 nucleotides downstream of the StAg 3' splice site, be intact. Insertion mutations that increase or decrease the quality of the MTAg pyrimidine stretch enhance or repress StAg as well as MTAg splicing, and a single-base change in the MTAg AG splice acceptor totally blocks both splices. These results demonstrate the ability of two 3' splice sites to cooperate with each other to bring about removal of a single intron. Images PMID:2159146
A deep intronic mutation in the SLC12A3 gene leads to Gitelman syndrome.
Nozu, Kandai; Iijima, Kazumoto; Nozu, Yoshimi; Ikegami, Ei; Imai, Takehide; Fu, Xue Jun; Kaito, Hiroshi; Nakanishi, Koichi; Yoshikawa, Norishige; Matsuo, Masafumi
2009-11-01
Many mutations have been detected in the SLC12A3 gene of Gitelman syndrome (GS, OMIM 263800) patients. In previous studies, only one mutant allele was detected in approximately 20 to 41% of patients with GS; however, the exact reason for the nonidentification has not been established. In this study, we used RT-PCR using mRNA to investigate for the first time transcript abnormalities caused by deep intronic mutation. Direct sequencing analysis of leukocyte DNA identified one base insertion in exon 6 (c.818_819insG), but no mutation was detected in another allele. We analyzed RNA extracted from leukocytes and urine sediments and detected unknown sequence containing 238bp between exons 13 and 14. The genomic DNA analysis of intron 13 revealed a single-base substitution (c.1670-191C>T) that creates a new donor splice site within the intron resulting in the inclusion of a novel cryptic exon in mRNA. This is the first report of creation of a splice site by a deep intronic single-nucleotide change in GS and the first report to detect the onset mechanism in a patient with GS and missing mutation in one allele. This molecular onset mechanism may partly explain the poor success rate of mutation detection in both alleles of patients with GS.
Dool, Serena E; Puechmaille, Sebastien J; Foley, Nicole M; Allegrini, Benjamin; Bastian, Anna; Mutumi, Gregory L; Maluleke, Tinyiko G; Odendaal, Lizelle J; Teeling, Emma C; Jacobs, David S
2016-04-01
Despite many studies illustrating the perils of utilising mitochondrial DNA in phylogenetic studies, it remains one of the most widely used genetic markers for this purpose. Over the last decade, nuclear introns have been proposed as alternative markers for phylogenetic reconstruction. However, the resolution capabilities of mtDNA and nuclear introns have rarely been quantified and compared. In the current study we generated a novel ∼5kb dataset comprising six nuclear introns and a mtDNA fragment. We assessed the relative resolution capabilities of the six intronic fragments with respect to each other, when used in various combinations together, and when compared to the traditionally used mtDNA. We focused on a major clade in the horseshoe bat family (Afro-Palaearctic clade; Rhinolophidae) as our case study. This old, widely distributed and speciose group contains a high level of conserved morphology. This morphological stasis renders the reconstruction of the phylogeny of this group with traditional morphological characters complex. We sampled multiple individuals per species to represent their geographic distributions as best as possible (122 individuals, 24 species, 68 localities). We reconstructed the species phylogeny using several complementary methods (partitioned Maximum Likelihood and Bayesian and Bayesian multispecies-coalescent) and made inferences based on consensus across these methods. We computed pairwise comparisons based on Robinson-Foulds tree distance metric between all Bayesian topologies generated (27,000) for every gene(s) and visualised the tree space using multidimensional scaling (MDS) plots. Using our supported species phylogeny we estimated the ancestral state of key traits of interest within this group, e.g. echolocation peak frequency which has been implicated in speciation. Our results revealed many potential cryptic species within this group, even in taxa where this was not suspected a priori and also found evidence for mtDNA introgression. We demonstrated that by using just two introns one can recover a better supported species tree than when using the mtDNA alone, despite the shorter overall length of the combined introns. Additionally, when combining any single intron with mtDNA, we showed that the result is highly similar to the mtDNA gene tree and far from the true species tree and therefore this approach should be avoided. We caution against the indiscriminate use of mtDNA in phylogenetic studies and advocate for pilot studies to select nuclear introns. The selection of marker type and number is a crucial step that is best based on critical examination of preliminary or previously published data. Based on our findings and previous publications, we recommend the following markers to recover phylogenetic relationships between recently diverged taxa (<20 My) in bats and other mammals: ACOX2, COPS7A, BGN, ROGDI and STAT5A. Copyright © 2016 Elsevier Inc. All rights reserved.
The histone variant H2A.Z promotes efficient cotranscriptional splicing in S. cerevisiae
Neves, Lauren T.; Douglass, Stephen; Spreafico, Roberto; Venkataramanan, Srivats; Kress, Tracy L.; Johnson, Tracy L.
2017-01-01
In eukaryotes, a dynamic ribonucleic protein machine known as the spliceosome catalyzes the removal of introns from premessenger RNA (pre-mRNA). Recent studies show the processes of RNA synthesis and RNA processing to be spatio–temporally coordinated, indicating that RNA splicing takes place in the context of chromatin. H2A.Z is a highly conserved histone variant of the canonical histone H2A. In Saccharomyces cerevisiae, H2A.Z is deposited into chromatin by the SWR-C complex, is found near the 5′ ends of protein-coding genes, and has been implicated in transcription regulation. Here we show that splicing of intron-containing genes in cells lacking H2A.Z is impaired, particularly under suboptimal splicing conditions. Cells lacking H2A.Z are especially dependent on a functional U2 snRNP (small nuclear RNA [snRNA] plus associated proteins), as H2A.Z shows extensive genetic interactions with U2 snRNP-associated proteins, and RNA sequencing (RNA-seq) reveals that introns with nonconsensus branch points are particularly sensitive to H2A.Z loss. Consistently, H2A.Z promotes efficient spliceosomal rearrangements involving the U2 snRNP, as H2A.Z loss results in persistent U2 snRNP association and decreased recruitment of downstream snRNPs to nascent RNA. H2A.Z impairs transcription elongation, suggesting that spliceosome rearrangements are tied to H2A.Z's role in elongation. Depletion of disassembly factor Prp43 suppresses H2A.Z-mediated splice defects, indicating that, in the absence of H2A.Z, stalled spliceosomes are disassembled, and unspliced RNAs are released. Together, these data demonstrate that H2A.Z is required for efficient pre-mRNA splicing and indicate a role for H2A.Z in coordinating the kinetics of transcription elongation and splicing. PMID:28446598
Ma, Zhaowu; Zhou, Yang; Abbood, Nibras Najm; Liu, Jianfeng; Su, Li; Jia, Haibo; Guo, An-Yuan
2012-01-01
Background HES/HEY genes encode a family of basic helix-loop-helix (bHLH) transcription factors with both bHLH and Orange domain. HES/HEY proteins are direct targets of the Notch signaling pathway and play an essential role in developmental decisions, such as the developments of nervous system, somitogenesis, blood vessel and heart. Despite their important functions, the origin and evolution of this HES/HEY gene family has yet to be elucidated. Methods and Findings In this study, we identified genes of the HES/HEY family in representative species and performed evolutionary analysis to elucidate their origin and evolutionary process. Our results showed that the HES/HEY genes only existed in metazoans and may originate from the common ancestor of metazoans. We identified HES/HEY genes in more than 10 species representing the main lineages. Combining the bHLH and Orange domain sequences, we constructed the phylogenetic trees by different methods (Bayesian, ML, NJ and ME) and classified the HES/HEY gene family into four groups. Our results indicated that this gene family had undergone three expansions, which were along with the origins of Eumetazoa, vertebrate, and teleost. Gene structure analysis revealed that the HES/HEY genes were involved in exon and/or intron loss in different species lineages. Genes of this family were duplicated in bony fishes and doubled than other vertebrates. Furthermore, we studied the teleost-specific duplications in zebrafish and investigated the expression pattern of duplicated genes in different tissues by RT-PCR. Finally, we proposed a model to show the evolution of this gene family with processes of expansion, exon/intron loss, and motif loss. Conclusions Our study revealed the evolution of HES/HEY gene family, the expression and function divergence of duplicated genes, which also provide clues for the research of Notch function in development. This study shows a model of gene family analysis with gene structure evolution and duplication. PMID:22808219
Zhou, Mi; Yan, Jun; Ma, Zhaowu; Zhou, Yang; Abbood, Nibras Najm; Liu, Jianfeng; Su, Li; Jia, Haibo; Guo, An-Yuan
2012-01-01
HES/HEY genes encode a family of basic helix-loop-helix (bHLH) transcription factors with both bHLH and Orange domain. HES/HEY proteins are direct targets of the Notch signaling pathway and play an essential role in developmental decisions, such as the developments of nervous system, somitogenesis, blood vessel and heart. Despite their important functions, the origin and evolution of this HES/HEY gene family has yet to be elucidated. In this study, we identified genes of the HES/HEY family in representative species and performed evolutionary analysis to elucidate their origin and evolutionary process. Our results showed that the HES/HEY genes only existed in metazoans and may originate from the common ancestor of metazoans. We identified HES/HEY genes in more than 10 species representing the main lineages. Combining the bHLH and Orange domain sequences, we constructed the phylogenetic trees by different methods (Bayesian, ML, NJ and ME) and classified the HES/HEY gene family into four groups. Our results indicated that this gene family had undergone three expansions, which were along with the origins of Eumetazoa, vertebrate, and teleost. Gene structure analysis revealed that the HES/HEY genes were involved in exon and/or intron loss in different species lineages. Genes of this family were duplicated in bony fishes and doubled than other vertebrates. Furthermore, we studied the teleost-specific duplications in zebrafish and investigated the expression pattern of duplicated genes in different tissues by RT-PCR. Finally, we proposed a model to show the evolution of this gene family with processes of expansion, exon/intron loss, and motif loss. Our study revealed the evolution of HES/HEY gene family, the expression and function divergence of duplicated genes, which also provide clues for the research of Notch function in development. This study shows a model of gene family analysis with gene structure evolution and duplication.
Lozier, Jay N; Kloos, Mark T; Merricks, Elizabeth P; Lemoine, Nathaly; Whitford, Margaret H; Raymer, Robin A; Bellinger, Dwight A; Nichols, Timothy C
2016-01-01
Animals with hemophilia are models for gene therapy, factor replacement, and inhibitor development in humans. We have actively sought dogs with severe hemophilia A that have novel factor VIII mutations unlike the previously described factor VIII intron 22 inversion. A male Old English Sheepdog with recurrent soft-tissue hemorrhage and hemarthrosis was diagnosed with severe hemophilia A (factor VIII activity less than 1% of normal). We purified genomic DNA from this dog and ruled out the common intron 22 inversion; we then sequenced all 26 exons. Comparing the results with the normal canine factor VIII sequence revealed a C→T transition in exon 12 of the factor VIII gene that created a premature stop codon at amino acid 577 in the A2 domain of the protein. In addition, 2 previously described polymorphisms that do not cause hemophilia were present at amino acids 909 and 1184. The hemophilia mutation creates a new TaqI site that facilitates rapid genotyping of affected offspring by PCR and restriction endonuclease analyses. This mutation is analogous to the previously described human factor VIII mutation at Arg583, which likewise is a CpG dinucleotide transition causing a premature stop codon in exon 12. Thus far, despite extensive treatment with factor VIII, this dog has not developed neutralizing antibodies (‘inhibitors’) to the protein. This novel mutation in a dog gives rise to severe hemophilia A analogous to a mutation seen in humans. This model will be useful for studies of the treatment of hemophilia. PMID:27780008
Functional and evolution characterization of SWEET sugar transporters in Ananas comosus.
Guo, Chengying; Li, Huayang; Xia, Xinyao; Liu, Xiuyuan; Yang, Long
2018-02-05
Sugars will eventually be exported transporters (SWEETs) are a group of recently identified sugar transporters in plants that play important roles in diverse physiological processes. However, currently, limited information about this gene family is available in pineapple (Ananas comosus). The availability of the recently released pineapple genome sequence provides the opportunity to identify SWEET genes in a Bromeliaceae family member at the genome level. In this study, 39 pineapple SWEET genes were identified in two pineapple cultivars (18 AnfSWEET and 21 AnmSWEET) and further phylogenetically classified into five clades. A phylogenetic analysis revealed distinct evolutionary paths for the SWEET genes of the two pineapple cultivars. The MD2 cultivar might have experienced a different expansion than the F153 cultivar because two additional duplications exist, which separately gave rise to clades III and IV. A gene exon/intron structure analysis showed that the pineapple SWEET genes contained highly conserved exon/intron numbers. An analysis of public RNA-seq data and expression profiling showed that SWEET genes may be involved in fruit development and ripening processes. AnmSWEET5 and AnmSWEET11 were highly expressed in the early stages of pineapple fruit development and then decreased. The study increases the understanding of the roles of SWEET genes in pineapple. Copyright © 2018 Elsevier Inc. All rights reserved.
Piece2.0: an update for the pant gene structure comparison and evolution database
USDA-ARS?s Scientific Manuscript database
PIECE (Plant Intron Exon Comparison and Evolution) is a web-accessible database that houses intron and exon information of plant genes. PIECE serves as a resource for biologists interested in comparing intron–exon organization and provides valuable insights into the evolution of gene structure in pl...
CryoEM structure of the spliceosome immediately after branching
Galej, Wojciech P.; Wilkinson, Max E.; Fica, Sebastian M.; Oubridge, Chris; Newman, Andrew J.; Nagai, Kiyoshi
2016-01-01
Pre-mRNA splicing proceeds by two consecutive trans-esterification reactions via a lariat-intron intermediate. We present the 3.8Å cryoEM structure of the spliceosome immediately after lariat formation. The 5’-splice site is cleaved but remains close to the catalytic Mg2+ site in the U2/U6 snRNA triplex, and the 5’-phosphate of the intron nucleotide G(+1) is linked to the branch adenosine 2’OH. The 5’-exon is held between the Prp8 N-terminal and Linker domains, and base-pairs with U5 snRNA loop 1. Non-Watson-Crick interactions between the branch helix and 5’-splice site dock the branch adenosine into the active site, while intron nucleotides +3 to +6 base-pair with the U6 snRNA ACAGAGA sequence. Isy1 and the step one factors Yju2 and Cwc25 stabilise docking of the branch helix. The intron downstream of the branch site emerges between the Prp8 RT and Linker domains and extends towards Prp16 helicase, suggesting a plausible mechanism of remodelling before exon ligation. PMID:27459055
Associations of polymorphisms in the Pit-1 gene with growth and carcass traits in Angus beef cattle.
Zhao, Q; Davis, M E; Hines, H C
2004-08-01
The Pit-1 gene was studied as a candidate for genetic markers of growth and carcass traits. Angus beef cattle that were divergently selected for high- or low-blood serum IGF-I concentration were used in this study. The single-strand conformation polymorphism method was used to identify polymorphism in the Pit-1 gene including regions from intron 2 to exon 6. Two polymorphisms, Pit1I3H (HinfI) and Pit1I3NL (NlaIII), were detected in intron 3 of the Pit-1 gene. One polymorphism, Pit1I4N (BstNI), was found in intron 4, and a single nucleotide polymorphism, Pit1I5, was found in intron 5. The previously reported polymorphism in exon 6, Pit1E6H (HinfI), was also studied in 416 Angus beef cattle. Associations of the polymorphisms with growth traits, carcass traits, and IGF-I concentration were analyzed using a general linear model procedure. No significant associations were observed between these polymorphisms and growth and carcass traits.
Barbazuk, W. Brad
2017-01-01
RNA splicing of U12-type introns functions in human cell differentiation, but it is not known whether this class of introns has a similar role in plants. The maize ROUGH ENDOSPERM3 (RGH3) protein is orthologous to the human splicing factor, ZRSR2. ZRSR2 mutations are associated with myelodysplastic syndrome (MDS) and cause U12 splicing defects. Maize rgh3 mutants have aberrant endosperm cell differentiation and proliferation. We found that most U12-type introns are retained or misspliced in rgh3. Genes affected in rgh3 and ZRSR2 mutants identify cell cycle and protein glycosylation as common pathways disrupted. Transcripts with retained U12-type introns can be found in polysomes, suggesting that splicing efficiency can alter protein isoforms. The rgh3 mutant protein disrupts colocalization with a known ZRSR2-interacting protein, U2AF2. These results indicate conserved function for RGH3/ZRSR2 in U12 splicing and a deeply conserved role for the minor spliceosome to promote cell differentiation from stem cells to terminal fates. PMID:28242684
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hong, R. L., Hamaguchi, L., Busch, M. A., and Weigel, D.
2003-06-01
OAK-B135 In Arabidopsis thaliana, cis-regulatory sequences of the floral homeotic gene AGAMOUS (AG) are located in the second intron. This 3 kb intron contains binding sites for two direct activators of AG, LEAFY (LFY) and WUSCHEL (WUS), along with other putative regulatory elements. We have used phylogenetic footprinting and the related technique of phylogenetic shadowing to identify putative cis-regulatory elements in this intron. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested six of these motifs and found that they are all functionally importantmore » for activity of AG regulatory sequences in A. thaliana. Although there is little obvious sequence similarity outside the Brassicaceae, the intron from cucumber AG has at least partial activity in A. thaliana. Our studies underscore the value of the comparative approach as a tool that complements gene-by-gene promoter dissection, but also highlight that sequence-based studies alone are insufficient for a complete identification of cis-regulatory sites.« less
Malagrino, Pamella A; Sponton, Carlos H G; Esposti, Rodrigo D; Franco-Penteado, Carla F; Fernandes, Romulo A; Bezerra, Marcos André C; Albuquerque, Dulcinéia M; Rodovalho, Cynara M; Bacci, Maurício; Zanesco, Angelina
2013-02-01
To evaluate the influence of the interaction between endothelial nitric oxide synthase gene (NOS3) polymorphisms at positions -786T>C, Glu298Asp and intron 4b/a, and cardiorespiratory fitness on plasma nitrite/nitrate levels, blood pressure, lipid profile, and prevalence of cardiometabolic disorders. Ninety-two volunteers were genotyped for NOS3 polymorphisms at positions (-786T>C and Glu298Asp) and (intron 4b/a) and divided according to the genotype: non-polymorphic (NP) and polymorphic (P). After that, they were subdivided according to the cardiorespiratory fitness associated with genotype: high (HNP and HP) and low (LNP and LP). The subjects with polymorphism for the interactions at positions Glu298Asp + intron 4b/a, and Glu298Asp+-786T>C showed the highest values in total cholesterol, as well as dyslipidemia. Our findings show that NOS3 gene polymorphisms at positions -786T>C, Glu298Asp, and intron 4b/a exert negative effects on the lipid profile compared with those who do not carry polymorphisms.
Sensing Self and Foreign Circular RNAs by Intron Identity.
Chen, Y Grace; Kim, Myoungjoo V; Chen, Xingqi; Batista, Pedro J; Aoyama, Saeko; Wilusz, Jeremy E; Iwasaki, Akiko; Chang, Howard Y
2017-07-20
Circular RNAs (circRNAs) are single-stranded RNAs that are joined head to tail with largely unknown functions. Here we show that transfection of purified in vitro generated circRNA into mammalian cells led to potent induction of innate immunity genes and confers protection against viral infection. The nucleic acid sensor RIG-I is necessary to sense foreign circRNA, and RIG-I and foreign circRNA co-aggregate in cytoplasmic foci. CircRNA activation of innate immunity is independent of a 5' triphosphate, double-stranded RNA structure, or the primary sequence of the foreign circRNA. Instead, self-nonself discrimination depends on the intron that programs the circRNA. Use of a human intron to express a foreign circRNA sequence abrogates immune activation, and mature human circRNA is associated with diverse RNA binding proteins reflecting its endogenous splicing and biogenesis. These results reveal innate immune sensing of circRNA and highlight introns-the predominant output of mammalian transcription-as arbiters of self-nonself identity. Copyright © 2017 Elsevier Inc. All rights reserved.
Quantitation of normal CFTR mRNA in CF patients with splice-site mutations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, Z.; Olsen, J.C.; Silverman, L.M.
Previously we identified two mutations in introns of the CFTR gene associated with partially active splice sites and unusual clinical phenotypes. One mutation in intron 19 (3849+10 kb C to T) is common in CF patients with normal sweat chloride values; an 84 bp sequence from intron 19, which contains a stop codon, is inserted between exon 19 and exon 20 in most nasal CFTR transcripts. The other mutation in intron 14B (2789+5 G to A) is associated with elevated sweat chloride levels, but mild pulmonary disease; exon 14B (38 bp) is spliced out of most nasal CFTR transcipts. Themore » remaining CFTR cDNA sequences, other than the 84 bp insertion of exon 14B deletion, are identical to the published sequence. To correlate genotype and phenotype, we used quantitative RT-PCR to determine the levels of normally-spliced CFTR mRNA in nasal epithelia from these patients. CFTR cDNA was amplified (25 cycles) by using primers specific for normally-spliced species, {gamma}-actin cDNA was amplified as a standard.« less
One novel SNP of growth hormone gene and its associations with growth and carcass traits in ducks.
Wu, Y; Pan, A L; Pi, J S; Pu, Y J; Du, J P; Liang, Z H; Shen, J
2012-08-01
In this study, the growth hormone (GH) gene was studied as a candidate gene for growth and carcass traits of three duck populations (Cherry Valley duck, Muscovy duck and Jingjiang duck). Three pairs of primers were designed to detect single nucleotide polymorphisms of introns 2, 3 and 4 of the GH gene by polymerase chain reaction-restriction fragment length polymorphism and sequencing methods. Only the products amplified from intron 2 displayed polymorphism. The results showed one novel polymorphism: a variation in intron 2 of GH gene (C172T, JN408701 and JN408702). It was associated with some growth and carcass traits in three duck populations including birth weight, 8-week weight, carcass weight, breast muscle weight, leg muscle weight, eviscerated weight, lean meat rate, dressing percentage, etc. And the TT and CT genotypes were associated with superior growth and carcass traits in carcass weight, dressing percentage and percentage of eviscerated weight. Therefore, the variation in intron 2 of GH may be a molecular marker for superior growth and carcass traits in above duck populations.
Comuzzie, Anthony G.; Cole, Shelley A.; Laston, Sandra L.; Voruganti, V. Saroja; Haack, Karin; Gibbs, Richard A.; Butte, Nancy F.
2012-01-01
Genetic variants responsible for susceptibility to obesity and its comorbidities among Hispanic children have not been identified. The VIVA LA FAMILIA Study was designed to genetically map childhood obesity and associated biological processes in the Hispanic population. A genome-wide association study (GWAS) entailed genotyping 1.1 million single nucleotide polymorphisms (SNPs) using the Illumina Infinium technology in 815 children. Measured genotype analysis was performed between genetic markers and obesity-related traits i.e., anthropometry, body composition, growth, metabolites, hormones, inflammation, diet, energy expenditure, substrate utilization and physical activity. Identified genome-wide significant loci: 1) corroborated genes implicated in other studies (MTNR1B, ZNF259/APOA5, XPA/FOXE1 (TTF-2), DARC, CCR3, ABO); 2) localized novel genes in plausible biological pathways (PCSK2, ARHGAP11A, CHRNA3); and 3) revealed novel genes with unknown function in obesity pathogenesis (MATK, COL4A1). Salient findings include a nonsynonymous SNP (rs1056513) in INADL (p = 1.2E-07) for weight; an intronic variant in MTNR1B associated with fasting glucose (p = 3.7E-08); variants in the APOA5-ZNF259 region associated with triglycerides (p = 2.5-4.8E-08); an intronic variant in PCSK2 associated with total antioxidants (p = 7.6E-08); a block of 23 SNPs in XPA/FOXE1 (TTF-2) associated with serum TSH (p = 5.5E-08 to 1.0E-09); a nonsynonymous SNP (p = 1.3E-21), an intronic SNP (p = 3.6E-13) in DARC identified for MCP-1; an intronic variant in ARHGAP11A associated with sleep duration (p = 5.0E-08); and, after adjusting for body weight, variants in MATK for total energy expenditure (p = 2.7E-08) and in CHRNA3 for sleeping energy expenditure (p = 6.0E-08). Unprecedented phenotyping and high-density SNP genotyping enabled localization of novel genetic loci associated with the pathophysiology of childhood obesity. PMID:23251661
Miyata, Y; Sugita, C; Maruyama, K; Sugita, M
2008-03-01
RNA editing of cytidine (C) to uridine (U) transitions occurs in plastids and mitochondria of most land plants. In this study, we amplified and sequenced the group I intron-containing tRNA Leu gene, trnL-CAA, from Takakia lepidozioides, a moss. DNA sequence analysis revealed that the T. lepidozioides tRNA Leu gene consisted of a 35-bp 5' exon, a 469-bp group I intron and a 50-bp 3' exon. The intron was inserted between the first and second position of the tRNA Leu anticodon. In general, plastid tRNA Leu genes with a group I intron code for a TAA anticodon in most land plants. This strongly suggests that the first nucleotide of the CAA anticodon could be edited in T. lepidozioides plastids. To investigate this possibility, we analysed cDNAs derived from the trnL-CAA transcripts. We demonstrated that the first nucleotide C of the anticodon was edited to create a canonical UAA anticodon in T. lepidozioides plastids. cDNA sequencing analyses of the spliced or unspliced tRNA Leu transcripts revealed that, while the spliced tRNA was completely edited, editing in the unspliced tRNAs were only partial. This is the first experimental evidence that the anticodon editing of tRNA occurs before RNA splicing in plastids. This suggests that this editing is a prerequisite to splicing of pre-tRNA Leu.
Tumor necrosis factor-alpha and interleukin-4 gene polymorphisms in Chinese patients with gout.
Chen, M-L; Tsai, F-J; Tsai, C-H; Huang, C-M
2007-01-01
The purpose of this study was to examine whether polymorphisms of interleukin-4 (IL-4) (promoter-590 and intron 3) and tumor necrosis factor-alpha (TNF-alpha) promoter-308 genes are markers of susceptibility to or clinical manifestations of gout in Taiwanese patients. The study included 196 Taiwanese patients with gout and 103 unrelated healthy control subjects living in central Taiwan. Polymorphisms of the IL-4 (promoter-590 and intron 3) and TNF-alpha (promoter-308) genes were typed from genomic DNA. Allelic frequencies and carriage rates were then compared between gout patients and control subjects. The correlation between allelic frequencies, carriage rates and clinical manifestations of gout were evaluated. No significant differences were observed in the allelic frequencies and carriage rates of the IL-4 (promoter-590 and intron 3) and TNF-alpha gene polymorphisms between patients with gout and healthy control subjects. Furthermore, the IL-4 (promoter-590 and intron 3) and TNF-alpha genotypes were not found to be associated with the clinical and laboratory profiles in gout patients. However, there was a significant difference in the TNF-alphapolymorphism genotype between patients with and without hypertriglyceridemia (P=0.001, xi2=11.47, OR=10.3, 95%CI=3.57-29.7). The results of our study suggest that polymorphisms of the IL-4 (promoter-590 and intron 3) and TNF-alpha promoter-308 genes are not related to gout in Chinese patients in Taiwan.
Evaluation of non-coding variation in GLUT1 deficiency.
Liu, Yu-Chi; Lee, Jia Wei Audrey; Bellows, Susannah T; Damiano, John A; Mullen, Saul A; Berkovic, Samuel F; Bahlo, Melanie; Scheffer, Ingrid E; Hildebrand, Michael S
2016-12-01
Loss-of-function mutations in SLC2A1, encoding glucose transporter-1 (GLUT-1), lead to dysfunction of glucose transport across the blood-brain barrier. Ten percent of cases with hypoglycorrhachia (fasting cerebrospinal fluid [CSF] glucose <2.2mmol/L) do not have mutations. We hypothesized that GLUT1 deficiency could be due to non-coding SLC2A1 variants. We performed whole exome sequencing of one proband with a GLUT1 phenotype and hypoglycorrhachia negative for SLC2A1 sequencing and copy number variants. We studied a further 55 patients with different epilepsies and low CSF glucose who did not have exonic mutations or copy number variants. We sequenced non-coding promoter and intronic regions. We performed mRNA studies for the recurrent intronic variant. The proband had a de novo splice site mutation five base pairs from the intron-exon boundary. Three of 55 patients had deep intronic SLC2A1 variants, including a recurrent variant in two. The recurrent variant produced less SLC2A1 mRNA transcript. Fasting CSF glucose levels show an age-dependent correlation, which makes the definition of hypoglycorrhachia challenging. Low CSF glucose levels may be associated with pathogenic SLC2A1 mutations including deep intronic SLC2A1 variants. Extending genetic screening to non-coding regions will enable diagnosis of more patients with GLUT1 deficiency, allowing implementation of the ketogenic diet to improve outcomes. © 2016 Mac Keith Press.
Expression of a polyubiquitin promoter isolated from Gladiolus.
Joung, Young Hee; Kamo, Kathryn
2006-10-01
A polyubiquitin promoter (GUBQ1) including its 5'UTR and intron was isolated from the floral monocot Gladiolus because high levels of expression could not be obtained using publicly available promoters isolated from either cereals or dicots. Sequencing of the promoter revealed highly conserved 5' and 3' intron splicing sites for the 1.234 kb intron. The coding sequence of the first two ubiquitin genes showed the highest homology (87 and 86%, respectively) to the ubiquitin genes of Nicotiana tabacum and Oryza sativa RUBQ2. Transient expression following gene gun bombardment showed that relative levels of GUS activity with the GUBQ1 promoter were comparable to the CaMV 35S promoter in gladiolus, tobacco, rose, rice, and the floral monocot freesia. The highest levels of GUS expression with GUBQ1 were attained with Gladiolus. The full-length GUBQ1 promoter including 5'UTR and intron were necessary for maximum GUS expression in Gladiolus. The relative GUS activity for the promoter only was 9%, and the activity for the promoter with 5'UTR and 399 bp of the full-length 1.234 kb intron was 41%. Arabidopsis plants transformed with uidA under GUBQ1 showed moderate GUS expression throughout young leaves and in the vasculature of older leaves. The highest levels of transient GUS expression in Gladiolus have been achieved using the GUBQ1 promoter. This promoter should be useful for genetic engineering of disease resistance in Gladiolus, rose, and freesia, where high levels of gene expression are important.
Coordinated tissue-specific regulation of adjacent alternative 3′ splice sites in C. elegans
Ragle, James Matthew; Katzman, Sol; Akers, Taylor F.; Barberan-Soler, Sergio; Zahler, Alan M.
2015-01-01
Adjacent alternative 3′ splice sites, those separated by ≤18 nucleotides, provide a unique problem in the study of alternative splicing regulation; there is overlap of the cis-elements that define the adjacent sites. Identification of the intron's 3′ end depends upon sequence elements that define the branchpoint, polypyrimidine tract, and terminal AG dinucleotide. Starting with RNA-seq data from germline-enriched and somatic cell-enriched Caenorhabditis elegans samples, we identify hundreds of introns with adjacent alternative 3′ splice sites. We identify 203 events that undergo tissue-specific alternative splicing. For these, the regulation is monodirectional, with somatic cells preferring to splice at the distal 3′ splice site (furthest from the 5′ end of the intron) and germline cells showing a distinct shift toward usage of the adjacent proximal 3′ splice site (closer to the 5′ end of the intron). Splicing patterns in somatic cells follow C. elegans consensus rules of 3′ splice site definition; a short stretch of pyrimidines preceding an AG dinucleotide. Splicing in germline cells occurs at proximal 3′ splice sites that lack a preceding polypyrimidine tract, and in three instances the germline-specific site lacks the AG dinucleotide. We provide evidence that use of germline-specific proximal 3′ splice sites is conserved across Caenorhabditis species. We propose that there are differences between germline and somatic cells in the way that the basal splicing machinery functions to determine the intron terminus. PMID:25922281
Kumari, Priyanka; Singh, Subodh Kumar; Raman, Rajiva
2018-06-05
Genome-wide linkage analysis and whole genome sequencing in a Van der Woude syndrome (VWS) family revealed that the SNP, rs539075, within intron 2 of the cadherin 2 gene (CDH2) co-segregated with the disease phenotype. A study with nonsyndromic cleft lip with or without cleft palate (NSCL ± P) cases (N = 292) and controls (N = 287) established association of this SNP with NSCL ± P as a risk factor. RT-PCR based expression analysis of the SNP-harbouring region of intron 2 of CDH2 in the clefted lip and/or palate tissues of 16 patients revealed that the mutant allele expressed in all those individuals having it (hetero-/homozygous), whereas the wild type allele expressed in <50% of the samples in which it was present. The intronic transcript was also present in the prospective lip and palate region of 13.5 dpc mouse embryo, detected by RNA in situ hybridization and RT-PCR. These results including the in silico, characterization of the ~200 nt-intronic transcript showed that conformationally it fits best with noncoding small RNA, possibly a precursor of miRNA. Its function in the orofacial organogenesis remains to be elucidated which will enable us to define the role of this mutant ncRNA in the clefting of lip and palate. Copyright © 2018 Elsevier B.V. All rights reserved.
A kinetic and thermodynamic framework for the Azoarcus group I ribozyme reaction
Gleitsman, Kristin R.
2014-01-01
Determination of quantitative thermodynamic and kinetic frameworks for ribozymes derived from the Azoarcus group I intron and comparisons to their well-studied analogs from the Tetrahymena group I intron reveal similarities and differences between these RNAs. The guanosine (G) substrate binds to the Azoarcus and Tetrahymena ribozymes with similar equilibrium binding constants and similar very slow association rate constants. These and additional literature observations support a model in which the free ribozyme is not conformationally competent to bind G and in which the probability of assuming the binding-competent state is determined by tertiary interactions of peripheral elements. As proposed previously, the slow binding of guanosine may play a role in the specificity of group I intron self-splicing, and slow binding may be used analogously in other biological processes. The internal equilibrium between ribozyme-bound substrates and products is similar for these ribozymes, but the Azoarcus ribozyme does not display the coupling in the binding of substrates that is observed with the Tetrahymena ribozyme, suggesting that local preorganization of the active site and rearrangements within the active site upon substrate binding are different for these ribozymes. Our results also confirm the much greater tertiary binding energy of the 5′-splice site analog with the Azoarcus ribozyme, binding energy that presumably compensates for the fewer base-pairing interactions to allow the 5′-exon intermediate in self splicing to remain bound subsequent to 5′-exon cleavage and prior to exon ligation. Most generally, these frameworks provide a foundation for design and interpretation of experiments investigating fundamental properties of these and other structured RNAs. PMID:25246656
DOE Office of Scientific and Technical Information (OSTI.GOV)
Miller, S.I.; Wirth, D.F.
1988-06-01
The 5' ends of Leishmania mRNAs contain an identical 35-nucleotide sequence termed the spliced leader (SL) or 5' mini-exon. The SL sequence is at the 5' end of an 85-nucleotide primary transcript that contains a consensus eucaryotic 5' intron-exon splice junction immediately 3' to the SL. The SL is added to protein-coding genes immediately 3' to a consensus eucaryotic 3' intron-exon splice junction. The authors' previous work demonstrated possible intermediates in discontinuous mRNA processing that contain the 50 nucleotides of the SL primary transcript 3' to the SL, the SL intron sequence (SLIS). These RNAs have a 5' terminus atmore » the splice junction of the SL and the SLIS. The authors examined a Leishmania nuclear extract for these RNAs in ribonucleoprotein (RNP) particles. Density centrifugation analysis showed that the SL RNA is predominately in RNP complexes at 60S, while the SLIS-containing RNAs are in complexes at 40S. They also demonstrated that the SLIS can be released from polyadenylated RNA by incubation with a HeLa cell extract containing debranching enzymatic activity. These data suggested that Leishmania enriettii mRNAs are assembled by bimolecular or trans splicing as has been recently demonstrated for Trypanosoma brucei. Furthermore, they determined the partial sequence of the Leishmania U2 equivalent RNA and demonstrated that it cosediments with the SL RNA at 60S in a nuclear extract. These RNP particles may be analogous to so-called spliceosomes that have been demonstrated in other systems.« less
López-Urrutia, Eduardo; Valdés, Jesús; Bonilla-Moreno, Raúl; Martínez-Salazar, Martha; Martínez-Garcia, Martha; Berumen, Jaime; Villegas-Sepúlveda, Nicolás
2012-06-01
The HPV-16 E6/E7 genes, which contain intron 1, are processed by alternative splicing and its transcripts are detected with a heterogeneous profile in tumours cells. Frequently, the HPV-16 positive carcinoma cells bear viral variants that contain single nucleotide polymorphisms into its DNA sequence. We were interested in analysing the contribution of this polymorphism to the heterogeneity in the pattern of the E6/E7 spliced transcripts. Using the E6/E7 sequences from three closely related HPV-16 variants, we have shown that a few nucleotide changes are sufficient to produce heterogeneity in the splicing profile. Furthermore, using mutants that contained a single SNP, we also showed that one nucleotide change was sufficient to reproduce the heterogeneous splicing profile. Additionally, a difference of two or three SNPs among these viral sequences was sufficient to recruit differentially several splicing factors to the polymorphic E6/E7 transcripts. Moreover, only one SNP was sufficient to alter the binding site of at least one splicing factor, changing the ability of splicing factors to bind the transcript. Finally, the factors that were differentially bound to the short form of intron 1 of one of these E6/E7 variants were identified as TIA1 and/or TIAR and U1-70k, while U2AF65, U5-52k and PTB were preferentially bound to the transcript of the other variants. Copyright © 2012 Elsevier B.V. All rights reserved.
Nuclear sequestration of COL1A1 mRNA transcript associated with type I osteogenesis imperfecta (OI)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Primorac, D.; Stover, M.L.; McKinstry, M.B.
Previously we identified an OI type I patient with a splice donor mutation that resulted in intron 26 retention instead of exon skipping and sequestration of normal levels of the mutant transcript in the nuclear compartment. Intron retention was consistent with the exon definition hypothesis for splice site selection since the size of the exon-intron-exon unit was less than 300 bp. Furthermore, the retained intron contained in-frame stop codons which is thought to cause the mutant RNA to remain within the nucleus rather than appearing in the cytoplasm. To test these hypotheses, genomic fragments containing the normal sequence or themore » donor mutation were cloned into a collagen minigene and expressed in stably tansfected NIH 3T3 cells. None of the modifications to the normal intron altered the level of RNA that accumulated in the cytoplasm, as expected. However none of the modifications to the mutant intron allowed accumulation of normal levels of mRNA in the cytoplasm. Moreover, in contrast to our findings in the patient`s cells only low levels of mutant transcript were found in the nucleus; a fraction of the transcript did appear in the cytoplasm which had spliced the mutant donor site correctly. Nuclear run-on experiments demonstrated equal levels of transcription from each transgene. Expression of another donor mutation known to cause in-frame exon skipping in OI type IV was accurately reproduced in the minigene in transfected 3T3 cells. Our experience suggests that either mechanism can lead to formation of a null allele possibly related to the type of splicing events surrounding the potential stop codons. Understanding the rules governing inactivation of a collagen RNA transcript may be important in designing a strategy to inactivate a dominate negative mutation associated with the more severe forms of OI.« less
Xia, Zunjing; Lin, Jie; Lu, Lingping; Kim, Chol; Yu, Ping; Qi, Ming
2018-06-01
: Hemophilia A is a bleeding disorder caused by coagulation factor VIII protein deficiency or dysfunction, which is classified into severe, moderate, and mild according to factor clotting activity. An overwhelming majority of missense and nonsense mutations occur in exons of F8 gene, whereas mutations in introns can also be pathogenic. This study aimed to investigate the effect of an intronic mutation, c.6430-3C>G (IVS22-3C>G), on pre-mRNA splicing of the F8 gene. We applied DNA and cDNA sequencing in a Chinese boy with hemophilia A to search if any pathogenic mutation in the F8 gene. Functional analysis was performed to investigate the effect of an intronic mutation at the transcriptional level. Human Splicing Finder and PyMol were also used to predict its effect. We found the mutation c.6430-3C>G (IVS22-3C>G) in the F8 gene in the affected boy, with his mother being a carrier. cDNA from the mother and pSPL3 splicing assay showed that the mutation IVS22-3C>G results in a two-nucleotide AG inclusion at the 3' end of intron 22 and leads to a truncated coagulation factor VIII protein, with partial loss of the C1 domain and complete loss of the C2 domain. The in-silico tool predicted that the mutation induces altered pre-mRNA splicing by using a cryptic acceptor site in intron 22. The IVS22-3C>G mutation was confirmed to affect pre-mRNA splicing and produce a truncated protein, which reduces the stability of binding between the F8 protein and von Willebrand factor carrier protein due to the loss of an interaction domain.
Sponton, Carlos H; Esposti, Rodrigo; Rodovalho, Cynara M; Ferreira, Maycon J; Jarrete, Aline P; Anaruma, Chadi P; Bacci, Mauricio; Zanesco, Angelina
2014-06-15
The number of studies that have evaluated exercise training (ET) and nitric oxide synthase (NOS)3 gene polymorphisms is scarce. The present study was designed to evaluate the relationship between exercise training and NOS3 polymorphisms at -786T>C, 894G>T, and intron 4b/a on blood pressure (BP) using 24-h ambulatory BP monitoring (ABPM), nitrate/nitrite levels (NOx), and redox state. Eighty-six volunteers (51 ± 0.6 yr old) were genotyped into nonpolymorphic and polymorphic groups for each of the three positions of NOS3 polymorphisms. Auscultatory BP, ABPM, SOD activity, catalase activity, NOx levels, and malondialdehyde levels were measured. DNA was extracted from leukocytes, and PCR followed by sequencing was applied for genotype analysis. Aerobic ET consisted of 24 sessions for 3 days/wk for 40 min at moderate intensity. This study was performed in a double-blind and crossover format. ET was effective in lowering office BP (systolic BP: 3.2% and diastolic BP: 3%) as well as ABPM (systolic BP: 2% and diastolic BP: 1.3%). Increased SOD and catalase activity (42.6% and 15.1%, respectively) were also observed. The NOS3 polymorphism for intron 4 mitigated the beneficial effect of ET for systolic BP (nonpolymorphic group: -3.0% and polymorphic group: -0.6%) and diastolic BP (nonpolymorphic group: -3.2% and polymorphic group: -0.5%), but it was not associated with NOx level and redox state. Paradoxical responses were found for positions T786-C and G894T for the NOS3 gene. Consistently, the presence of the polymorphism for intron 4 blunted the beneficial effects of ET in middle-aged adults. Possibly, this effect might be as consequence of intron 4 acting as a short intronic repeat RNA controlling endothelial NOS activity epigenetically. Copyright © 2014 the American Physiological Society.
Characterization and Expression of the Lucina pectinata Oxygen and Sulfide Binding Hemoglobin Genes
López-Garriga, Juan; Cadilla, Carmen L.
2016-01-01
The clam Lucina pectinata lives in sulfide-rich muds and houses intracellular symbiotic bacteria that need to be supplied with hydrogen sulfide and oxygen. This clam possesses three hemoglobins: hemoglobin I (HbI), a sulfide-reactive protein, and hemoglobin II (HbII) and III (HbIII), which are oxygen-reactive. We characterized the complete gene sequence and promoter regions for the oxygen reactive hemoglobins and the partial structure and promoters of the HbI gene from Lucina pectinata. We show that HbI has two mRNA variants, where the 5’end had either a sequence of 96 bp (long variant) or 37 bp (short variant). The gene structure of the oxygen reactive Hbs is defined by having 4-exons/3-introns with conservation of intron location at B12.2 and G7.0 and the presence of pre-coding introns, while the partial gene structure of HbI has the same intron conservation but appears to have a 5-exon/ 4-intron structure. A search for putative transcription factor binding sites (TFBSs) was done with the promoters for HbII, HbIII, HbI short and HbI long. The HbII, HbIII and HbI long promoters showed similar predicted TFBSs. We also characterized MITE-like elements in the HbI and HbII gene promoters and intronic regions that are similar to sequences found in other mollusk genomes. The gene expression levels of the clam Hbs, from sulfide-rich and sulfide-poor environments showed a significant decrease of expression in the symbiont-containing tissue for those clams in a sulfide-poor environment, suggesting that the sulfide concentration may be involved in the regulation of these proteins. Gene expression evaluation of the two HbI mRNA variants indicated that the longer variant is expressed at higher levels than the shorter variant in both environments. PMID:26824233
[Study of gene mutation in 62 hemophilia A children].
Hu, Q; Liu, A G; Zhang, L Q; Zhang, A; Wang, Y Q; Wang, S M; Lu, Y J; Wang, X
2017-11-02
Objective: To analyze the mutation type of FⅧ gene in children with hemophilia A and to explore the relationship among hemophilia gene mutation spectrum, gene mutation and clinical phenotype. Method: Sixty-two children with hemophilia A from Department of Pediatric Hematology, Tongji Hospital of Tongji Medical College, Huazhong University of Science and Technology between January 2015 and March 2017 were enrolled. All patients were male, aged from 4 months to 7 years and F Ⅷ activity ranged 0.2%-11.0%. Fifty cases had severe, 10 cases had moderate and 2 cases had mild hemophilia A. DNA was isolated from peripheral blood in hemophilia A children and the target gene fragment was amplified by PCR, in combination with the second generation sequencing, 22 and 1 introns were detected. Negative cases were detected by the second generation sequencing and results were compared with those of the international FⅧ gene mutation database. Result: There were 20 cases (32%) of intron 22 inversion, 2 cases (3%) of intron 1 inversion, 18 cases (29%) of missense mutation, 5 cases (8%) of nonsense mutation, 7 cases (11%) of deletion mutation, 1 case(2%)of splice site mutation, 2 cases (3%) of large fragment deletion and 1 case of insertion mutation (2%). No mutation was detected in 2 cases (3%), and 4 cases (7%) failed to amplify. The correlation between phenotype and genotype showed that the most common gene mutation in severe hemophilia A was intron 22 inversion (20 cases), accounting for 40% of severe patients, followed by 11 cases of missense mutation (22%). The most common mutation in moderate hemophilia A was missense mutation (6 cases), accounting for 60% of moderate patients. Conclusion: The most frequent mutation type in hemophilia A was intron 22 inversion, followed by missense mutation, again for missing mutation. The relationship between phenotype and genotype: the most frequent gene mutation in severe hemophilia A is intron 22 inversion, followed by missense mutation; the most frequent gene mutation in medium hemophilia A is missense mutation.
2010-01-01
Background Molecular characterization of collagen-VI related myopathies currently relies on standard sequencing, which yields a detection rate approximating 75-79% in Ullrich congenital muscular dystrophy (UCMD) and 60-65% in Bethlem myopathy (BM) patients as PCR-based techniques tend to miss gross genomic rearrangements as well as copy number variations (CNVs) in both the coding sequence and intronic regions. Methods We have designed a custom oligonucleotide CGH array in order to investigate the presence of CNVs in the coding and non-coding regions of COL6A1, A2, A3, A5 and A6 genes and a group of genes functionally related to collagen VI. A cohort of 12 patients with UCMD/BM negative at sequencing analysis and 2 subjects carrying a single COL6 mutation whose clinical phenotype was not explicable by inheritance were selected and the occurrence of allelic and genetic heterogeneity explored. Results A deletion within intron 1A of the COL6A2 gene, occurring in compound heterozygosity with a small deletion in exon 28, previously detected by routine sequencing, was identified in a BM patient. RNA studies showed monoallelic transcription of the COL6A2 gene, thus elucidating the functional effect of the intronic deletion. No pathogenic mutations were identified in the remaining analyzed patients, either within COL6A genes, or in genes functionally related to collagen VI. Conclusions Our custom CGH array may represent a useful complementary diagnostic tool, especially in recessive forms of the disease, when only one mutant allele is detected by standard sequencing. The intronic deletion we identified represents the first example of a pure intronic mutation in COL6A genes. PMID:20302629
Turmel, Monique; Otis, Christian; Lemieux, Claude
2005-01-01
Background The Streptophyta comprise all land plants and six monophyletic groups of charophycean green algae. Phylogenetic analyses of four genes from three cellular compartments support the following branching order for these algal lineages: Mesostigmatales, Chlorokybales, Klebsormidiales, Zygnematales, Coleochaetales and Charales, with the last lineage being sister to land plants. Comparative analyses of the Mesostigma viride (Mesostigmatales) and land plant chloroplast genome sequences revealed that this genome experienced many gene losses, intron insertions and gene rearrangements during the evolution of charophyceans. On the other hand, the chloroplast genome of Chaetosphaeridium globosum (Coleochaetales) is highly similar to its land plant counterparts in terms of gene content, intron composition and gene order, indicating that most of the features characteristic of land plant chloroplast DNA (cpDNA) were acquired from charophycean green algae. To gain further insight into when the highly conservative pattern displayed by land plant cpDNAs originated in the Streptophyta, we have determined the cpDNA sequences of the distantly related zygnematalean algae Staurastrum punctulatum and Zygnema circumcarinatum. Results The 157,089 bp Staurastrum and 165,372 bp Zygnema cpDNAs encode 121 and 125 genes, respectively. Although both cpDNAs lack an rRNA-encoding inverted repeat (IR), they are substantially larger than Chaetosphaeridium and land plant cpDNAs. This increased size is explained by the expansion of intergenic spacers and introns. The Staurastrum and Zygnema genomes differ extensively from one another and from their streptophyte counterparts at the level of gene order, with the Staurastrum genome more closely resembling its land plant counterparts than does Zygnema cpDNA. Many intergenic regions in Zygnema cpDNA harbor tandem repeats. The introns in both Staurastrum (8 introns) and Zygnema (13 introns) cpDNAs represent subsets of those found in land plant cpDNAs. They represent 16 distinct insertion sites, only five of which are shared by the two zygnematalean genomes. Three of these insertions sites have not been identified in Chaetosphaeridium cpDNA. Conclusion The chloroplast genome experienced substantial changes in overall structure, gene order, and intron content during the evolution of the Zygnematales. Most of the features considered earlier as typical of land plant cpDNAs probably originated before the emergence of the Zygnematales and Coleochaetales. PMID:16236178
Bryant, Jessica M; Regnault, Clément; Scheidig-Benatar, Christine; Baumgarten, Sebastian; Guizetti, Julien; Scherf, Artur
2017-07-11
Plasmodium falciparum relies on monoallelic expression of 1 of 60 var virulence genes for antigenic variation and host immune evasion. Each var gene contains a conserved intron which has been implicated in previous studies in both activation and repression of transcription via several epigenetic mechanisms, including interaction with the var promoter, production of long noncoding RNAs (lncRNAs), and localization to repressive perinuclear sites. However, functional studies have relied primarily on artificial expression constructs. Using the recently developed P. falciparum clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 system, we directly deleted the var2csa P. falciparum 3D7_1200600 (Pf3D7_1200600) endogenous intron, resulting in an intronless var gene in a natural, marker-free chromosomal context. Deletion of the var2csa intron resulted in an upregulation of transcription of the var2csa gene in ring-stage parasites and subsequent expression of the PfEMP1 protein in late-stage parasites. Intron deletion did not affect the normal temporal regulation and subsequent transcriptional silencing of the var gene in trophozoites but did result in increased rates of var gene switching in some mutant clones. Transcriptional repression of the intronless var2csa gene could be achieved via long-term culture or panning with the CD36 receptor, after which reactivation was possible with chondroitin sulfate A (CSA) panning. These data suggest that the var2csa intron is not required for silencing or activation in ring-stage parasites but point to a subtle role in regulation of switching within the var gene family. IMPORTANCE Plasmodium falciparum is the most virulent species of malaria parasite, causing high rates of morbidity and mortality in those infected. Chronic infection depends on an immune evasion mechanism termed antigenic variation, which in turn relies on monoallelic expression of 1 of ~60 var genes. Understanding antigenic variation and the transcriptional regulation of monoallelic expression is important for developing drugs and/or vaccines. The var gene family encodes the antigenic surface proteins that decorate infected erythrocytes. Until recently, studying the underlying genetic elements that regulate monoallelic expression in P. falciparum was difficult, and most studies relied on artificial systems such as episomal reporter genes. Our study was the first to use CRISPR/Cas9 genome editing for the functional study of an important, conserved genetic element of var genes-the intron-in an endogenous, episome-free manner. Our findings shed light on the role of the var gene intron in transcriptional regulation of monoallelic expression. Copyright © 2017 Bryant et al.
Sen, Arko; Gurdziel, Katherine; Liu, Jenney; Qu, Wen; Nuga, Oluwademi O; Burl, Rayanne B; Hüttemann, Maik; Pique-Regi, Roger; Ruden, Douglas M
2017-01-01
Traumatic brain injury (TBI) can cause persistent pathological alteration of neurons. This may lead to cognitive dysfunction, depression and increased susceptibility to life threatening diseases, such as epilepsy and Alzheimer's disease. To investigate the underlying genetic and molecular basis of TBI, we subjected w 1118 Drosophila melanogaster to mild closed head trauma and found that mitochondrial activity is reduced in the brains of these flies 24 h after inflicting trauma. To determine the transcriptomic changes after mild TBI, we collected fly heads 24 h after inflicting trauma, and performed RNA-seq analyses. Classification of alternative splicing changes showed selective retention (RI) of long introns (>81 bps), with a mean size of ~3,000 nucleotides. Some of the genes containing RI showed a significant reduction in transcript abundance and are involved in mitochondrial metabolism such as Isocitrate dehydrogenase (Idh), which makes α-KG, a co-factor needed for both DNA and histone demethylase enzymes. The long introns are enriched in CA-rich motifs known to bind to Smooth (Sm), a heterogeneous nuclear ribonucleoprotein L (hnRNP-L) class of splicing factor, which has been shown to interact with the H3K36 histone methyltransferase, SET2, and to be involved in intron retention in human cells. H3K36me3 is a histone mark that demarcates exons in genes by interacting with the mRNA splicing machinery. Mutating sm ( sm 4 /Df) resulted in loss of both basal and induced levels of RI in many of the same long-intron containing genes. Reducing the levels of Kdm4A, the H3K36me3 histone demethylase, also resulted in loss of basal levels of RI in many of the same long-intron containing genes. Chromatin immunoprecipitation followed by deep sequencing (ChIP-seq) for H3K36me3 revealed increased levels of this histone modification in retained introns post-trauma at CA-rich motifs. Based on these results, we propose a model in which TBI temporarily decreases mitochondrial activity in the brain 24 h after inflicting trauma, which decreases α-KG levels, and increases H3K36me3 levels and intron retention of long introns by decreasing Kdm4A activity. The consequent reduction in mature mRNA levels in metabolism genes, such as Idh, further reduces α-KG levels in a negative feedback loop. We further propose that decreasing metabolism after TBI in such a manner is a protective mechanism that gives the brain time to repair cellular damage induced by TBI.
Autogenous Regulation of Splicing of the Transcript of a Yeast Ribosomal Protein Gene
NASA Astrophysics Data System (ADS)
Dabeva, Mariana D.; Post-Beittenmiller, Martha A.; Warner, Jonathan R.
1986-08-01
The gene for a yeast ribosomal protein, RPL32, contains a single intron. The product of this gene appears to participate in feedback control of the splicing of the intron from the transcript. This autogenous regulation of splicing provides a striking analogy to the autogenous regulation of translation of ribosomal proteins in Escherichia coli.
Aurora-A as a Modifier of Breast Cancer Risk in BRCA 1/2 Mutation Carriers
2007-06-01
Dieter Schaefer, Institute of Human Genetics, University of Frankfurt, Frankfurt, Germany; Norbert Arnold, University of Schleswig- Holstein , Campus...Intron 2 Opossum Mouse Rat Cow Dog Intron 1 Figure 3 | The FGFR2 locus. a, Map of the whole FGFR2 gene, viewed relative to common SNPs on HapMap
Lee, Kwanuk; Lee, Hwa Jung; Kim, Dong Hyun; Jeon, Young; Pai, Hyun-Sook; Kang, Hunseung
2014-04-16
Although several chloroplast RNA splicing and ribosome maturation (CRM) domain-containing proteins have been characterized for intron splicing and rRNA processing during chloroplast gene expression, the functional role of a majority of CRM domain proteins in plant growth and development as well as chloroplast RNA metabolism remains largely unknown. Here, we characterized the developmental and stress response roles of a nuclear-encoded chloroplast protein harboring a single CRM domain (At4g39040), designated CFM4, in Arabidopsis thaliana. Analysis of CFM4-GFP fusion proteins revealed that CFM4 is localized to chloroplasts. The loss-of-function T-DNA insertion mutants for CFM4 (cfm4) displayed retarded growth and delayed senescence, suggesting that CFM4 plays a role in growth and development of plants under normal growth conditions. In addition, cfm4 mutants showed retarded seed germination and seedling growth under stress conditions. No alteration in the splicing patterns of intron-containing chloroplast genes was observed in the mutant plants, but the processing of 16S and 4.5S rRNAs was abnormal in the mutant plants. Importantly, CFM4 was determined to possess RNA chaperone activity. These results suggest that the chloroplast-targeted CFM4, one of two Arabidopsis genes encoding a single CRM domain-containing protein, harbors RNA chaperone activity and plays a role in the Arabidopsis growth and stress response by affecting rRNA processing in chloroplasts.
2014-01-01
Background Although several chloroplast RNA splicing and ribosome maturation (CRM) domain-containing proteins have been characterized for intron splicing and rRNA processing during chloroplast gene expression, the functional role of a majority of CRM domain proteins in plant growth and development as well as chloroplast RNA metabolism remains largely unknown. Here, we characterized the developmental and stress response roles of a nuclear-encoded chloroplast protein harboring a single CRM domain (At4g39040), designated CFM4, in Arabidopsis thaliana. Results Analysis of CFM4-GFP fusion proteins revealed that CFM4 is localized to chloroplasts. The loss-of-function T-DNA insertion mutants for CFM4 (cfm4) displayed retarded growth and delayed senescence, suggesting that CFM4 plays a role in growth and development of plants under normal growth conditions. In addition, cfm4 mutants showed retarded seed germination and seedling growth under stress conditions. No alteration in the splicing patterns of intron-containing chloroplast genes was observed in the mutant plants, but the processing of 16S and 4.5S rRNAs was abnormal in the mutant plants. Importantly, CFM4 was determined to possess RNA chaperone activity. Conclusions These results suggest that the chloroplast-targeted CFM4, one of two Arabidopsis genes encoding a single CRM domain-containing protein, harbors RNA chaperone activity and plays a role in the Arabidopsis growth and stress response by affecting rRNA processing in chloroplasts. PMID:24739417
Sam, Mohammad Reza; Azadbakhsh, Azadeh Sadat; Farokhi, Farrah; Rezazadeh, Kobra; Sam, Sohrab; Zomorodipour, Alireza; Haddad-Mashadrizeh, Aliakbar; Delirezh, Nowruz; Mokarizadeh, Aram
2016-05-01
Ex-vivo gene therapy of hemophilias requires suitable bioreactors for secretion of hFIX into the circulation and stem cells hold great potentials in this regard. Viral vectors are widely manipulated and used to transfer hFIX gene into stem cells. However, little attention has been paid to the manipulation of hFIX transgene itself. Concurrently, the efficacy of such a therapeutic approach depends on determination of which vectors give maximal transgene expression. With this in mind, TF-1 (primary hematopoietic lineage) and rat-bone marrow mesenchymal stem cells (BMSCs) were transfected with five hFIX-expressing plasmids containing different combinations of two human β-globin (hBG) introns inside the hFIX-cDNA and Kozak element and hFIX expression was evaluated by different methods. In BMSCs and TF-1 cells, the highest hFIX level was obtained from the intron-less and hBG intron-I,II containing plasmids respectively. The highest hFIX activity was obtained from the cells that carrying the hBG intron-I,II containing plasmids. BMSCs were able to produce higher hFIX by 1.4 to 4.7-fold increase with activity by 2.4 to 4.4-fold increase compared to TF-1 cells transfected with the same constructs. BMSCs and TF-1 cells could be effectively bioengineered without the use of viral vectors and hFIX minigene containing hBG introns could represent a particular interest in stem cell-based gene therapy of hemophilias. Copyright © 2016 International Alliance for Biological Standardization. Published by Elsevier Ltd. All rights reserved.
Chen, Pei; Jou, Yuh-Shan; Fann, Cathy S J; Chen, Jaw-Wen; Chung, Chia-Min; Lin, Chin-Yu; Wu, Sheng-Yeu; Kang, Mei-Jyh; Chen, Ying-Chuang; Jong, Yuh-Shiun; Lo, Huey-Ming; Kang, Chih-Sen; Chen, Chien-Chung; Chang, Huan-Cheng; Huang, Nai-Kuei; Wu, Yi-Lin; Pan, Wen-Harn
2009-01-01
Previously, we observed that young-onset hypertension was independently associated with elevated plasma triglyceride(s) (TG) levels to a greater extent than other metabolic risk factors. Thus, focusing on the endophenotype--hypertension combined with elevated TG--we designed a family-based haplotype association study to explore its genetic connection with novel genetic variants of lipoprotein lipase gene (LPL), which encodes a major lipid metabolizing enzyme. Young-onset hypertension probands and their families were recruited, numbering 1,002 individuals from 345 families. Single-nucleotide polymorphism discovery for LPL, linkage disequilibrium (LD) analysis, transmission disequilibrium tests (TDT), bin construction, haplotype TDT association and logistic regression analysis were performed. We found that the CC- haplotype (i) spanning from intron 2 to intron 4 and the ACATT haplotype (ii) spanning from intron 5 to intron 6 were significantly associated with hypertension-related phenotypes: hypertension (ii, P=0.05), elevated TG (i, P=0.01), and hypertension combined with elevated TG (i, P=0.001; ii, P<0.0001), according to TDT. The risk of this hypertension subtype increased with the number of risk haplotypes in the two loci, using logistic regression model after adjusting within-family correlation. The relationships between LPL variants and hypertension-related disorders were also confirmed by an independent association study. Finally, we showed a trend that individuals with homozygous risk haplotypes had decreased LPL expression after a fatty meal, as opposed to those with protective haplotypes. In conclusion, this study strongly suggests that two LPL intronic variants may be associated with development of the hypertension endophenotype with elevated TG. Copyright 2008 Wiley-Liss, Inc.
Saraco, Nora; Nesi-Franca, Suzana; Sainz, Romina; Marino, Roxana; Marques-Pereira, Rosana; La Pastina, Julia; Perez Garrido, Natalia; Sandrini, Romolo; Rivarola, Marco Aurelio; de Lacerda, Luiz; Belgorosky, Alicia
2015-01-01
Splicing CYP19 gene variants causing aromatase deficiency in 46,XX disorder of sexual development (DSD) patients have been reported in a few cases. A misbalance between normal and aberrant splicing variants was proposed to explain spontaneous pubertal breast development but an incomplete sex maturation progress. The aim of this study was to functionally characterize a novel CYP19A1 intronic homozygote mutation (IVS9+5G>A) in a 46,XX DSD girl presenting spontaneous breast development and primary amenorrhea, and to evaluate similar splicing variant expression in normal steroidogenic tissues. Genomic DNA analysis, splicing prediction programs, splicing assays, and in vitro protein expression and enzyme activity analyses were carried out. CYP19A1 mRNA expression in human steroidogenic tissues was also studied. A novel IVS9+5G>A homozygote mutation was found. In silico analysis predicts the disappearance of the splicing donor site in intron 9, confirmed by patient peripheral leukocyte cP450arom and in vitro studies. Protein analysis showed a shorter and inactive protein. The intron 9 transcript variant was also found in human steroidogenic tissues. The mutation IVS9+5G>A generates a splicing variant that includes intron 9 which is also present in normal human steroidogenic tissues, suggesting that a misbalance between normal and aberrant splicing variants might occur in target tissues, explaining the clinical phenotype in the affected patient. © 2015 S. Karger AG, Basel.
Rath, Matthias; Jenssen, Sönke E; Schwefel, Konrad; Spiegler, Stefanie; Kleimeier, Dana; Sperling, Christian; Kaderali, Lars; Felbor, Ute
2017-09-01
Cerebral cavernous malformations (CCM) are vascular lesions of the central nervous system that can cause headaches, seizures and hemorrhagic stroke. Disease-associated mutations have been identified in three genes: CCM1/KRIT1, CCM2 and CCM3/PDCD10. The precise proportion of deep-intronic variants in these genes and their clinical relevance is yet unknown. Here, a long-range PCR (LR-PCR) approach for target enrichment of the entire genomic regions of the three genes was combined with next generation sequencing (NGS) to screen for coding and non-coding variants. NGS detected all six CCM1/KRIT1, two CCM2 and four CCM3/PDCD10 mutations that had previously been identified by Sanger sequencing. Two of the pathogenic variants presented here are novel. Additionally, 20 stringently selected CCM index cases that had remained mutation-negative after conventional sequencing and exclusion of copy number variations were screened for deep-intronic mutations. The combination of bioinformatics filtering and transcript analyses did not reveal any deep-intronic splice mutations in these cases. Our results demonstrate that target enrichment by LR-PCR combined with NGS can be used for a comprehensive analysis of the entire genomic regions of the CCM genes in a research context. However, its clinical utility is limited as deep-intronic splice mutations in CCM1/KRIT1, CCM2 and CCM3/PDCD10 seem to be rather rare. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Tian, Ying; Wang, Genjie; Hu, Qingzhu; Xiao, Xichun; Chen, Shuxia
2018-04-01
The AML1/ETO onco-fusion protein is crucial for the genesis of t(8;21) acute myeloid leukemia (AML) and is well documented as a transcriptional repressor through dominant-negative effect. However, little is known about the transactivation mechanism of AML1/ETO. Through large cohort of patient's expression level data analysis and a series of experimental validation, we report here that AML1/ETO transactivates c-KIT expression through directly binding to and mediating the long-range interaction between the promoter and intronic enhancer regions of c-KIT. Gene expression analyses verify that c-KIT expression is significantly high in t(8;21) AML. Further ChIP-seq analysis and motif scanning identify two regulatory regions located in the promoter and intronic enhancer region of c-KIT, respectively. Both regions are enriched by co-factors of AML1/ETO, such as AML1, CEBPe, c-Jun, and c-Fos. Further luciferase reporter assays show that AML1/ETO trans-activates c-KIT promoter activity through directly recognizing the AML1 motif and the co-existence of co-factors. The induction of c-KIT promoter activity is reinforced with the existence of intronic enhancer region. Furthermore, ChIP-3C-qPCR assays verify that AML1/ETO mediates the formation of DNA-looping between the c-KIT promoter and intronic enhancer region through the long-range interaction. Collectively, our data uncover a novel transcriptional activity mechanism of AML1/ETO and enrich our knowledge of the onco-fusion protein mediated transcription regulation. © 2017 Wiley Periodicals, Inc.
The U6 snRNA m6A Methyltransferase METTL16 Regulates SAM Synthetase Intron Retention.
Pendleton, Kathryn E; Chen, Beibei; Liu, Kuanqing; Hunter, Olga V; Xie, Yang; Tu, Benjamin P; Conrad, Nicholas K
2017-05-18
Maintenance of proper levels of the methyl donor S-adenosylmethionine (SAM) is critical for a wide variety of biological processes. We demonstrate that the N 6 -adenosine methyltransferase METTL16 regulates expression of human MAT2A, which encodes the SAM synthetase expressed in most cells. Upon SAM depletion by methionine starvation, cells induce MAT2A expression by enhanced splicing of a retained intron. Induction requires METTL16 and its methylation substrate, a vertebrate conserved hairpin (hp1) in the MAT2A 3' UTR. Increasing METTL16 occupancy on the MAT2A 3' UTR is sufficient to induce efficient splicing. We propose that, under SAM-limiting conditions, METTL16 occupancy on hp1 increases due to inefficient enzymatic turnover, which promotes MAT2A splicing. We further show that METTL16 is the long-unknown methyltransferase for the U6 spliceosomal small nuclear RNA (snRNA). These observations suggest that the conserved U6 snRNA methyltransferase evolved an additional function in vertebrates to regulate SAM homeostasis. Copyright © 2017 Elsevier Inc. All rights reserved.
Detection of Splice Sites Using Support Vector Machine
NASA Astrophysics Data System (ADS)
Varadwaj, Pritish; Purohit, Neetesh; Arora, Bhumika
Automatic identification and annotation of exon and intron region of gene, from DNA sequences has been an important research area in field of computational biology. Several approaches viz. Hidden Markov Model (HMM), Artificial Intelligence (AI) based machine learning and Digital Signal Processing (DSP) techniques have extensively and independently been used by various researchers to cater this challenging task. In this work, we propose a Support Vector Machine based kernel learning approach for detection of splice sites (the exon-intron boundary) in a gene. Electron-Ion Interaction Potential (EIIP) values of nucleotides have been used for mapping character sequences to corresponding numeric sequences. Radial Basis Function (RBF) SVM kernel is trained using EIIP numeric sequences. Furthermore this was tested on test gene dataset for detection of splice site by window (of 12 residues) shifting. Optimum values of window size, various important parameters of SVM kernel have been optimized for a better accuracy. Receiver Operating Characteristic (ROC) curves have been utilized for displaying the sensitivity rate of the classifier and results showed 94.82% accuracy for splice site detection on test dataset.
Huang, P L; Do, Y Y; Huang, F C; Thay, T S; Chang, T W
1997-04-01
A cDNA encoding the banana 1-aminocyclopropane-1-carboxylate (ACC) oxidase has previously been isolated from a cDNA library that was constructed by extracting poly(A)+ RNA from peels of ripening banana. This cDNA, designated as pMAO2, has 1,199 bp and contains an open reading frame of 318 amino acids. In order to identify ripening-related promoters of the banana ACC oxidase gene, pMAO2 was used as a probe to screen a banana genomic library constructed in the lambda EMBL3 vector. The banana ACC oxidase MAO2 gene has four exons and three introns, with all of the boundaries between these introns and exons sharing a consensus dinucleotide sequence of GT-AG. The expression of MAO2 gene in banana begins after the onset of ripening (stage 2) and continuous into later stages of the ripening process. The accumulation of MAO2 mRNA can be induced by 1 microliter/l exogenous ethylene, and it reached steady state level when 100 microliters/l exogenous ethylene was present.
Akay, Alper; Di Domenico, Tomas; Suen, Kin M; Nabih, Amena; Parada, Guillermo E; Larance, Mark; Medhi, Ragini; Berkyurek, Ahmet C; Zhang, Xinlian; Wedeles, Christopher J; Rudolph, Konrad L M; Engelhardt, Jan; Hemberg, Martin; Ma, Ping; Lamond, Angus I; Claycomb, Julie M; Miska, Eric A
2017-08-07
Small RNAs play a crucial role in genome defense against transposable elements and guide Argonaute proteins to nascent RNA transcripts to induce co-transcriptional gene silencing. However, the molecular basis of this process remains unknown. Here, we identify the conserved RNA helicase Aquarius/EMB-4 as a direct and essential link between small RNA pathways and the transcriptional machinery in Caenorhabditis elegans. Aquarius physically interacts with the germline Argonaute HRDE-1. Aquarius is required to initiate small-RNA-induced heritable gene silencing. HRDE-1 and Aquarius silence overlapping sets of genes and transposable elements. Surprisingly, removal of introns from a target gene abolishes the requirement for Aquarius, but not HRDE-1, for small RNA-dependent gene silencing. We conclude that Aquarius allows small RNA pathways to compete for access to nascent transcripts undergoing co-transcriptional splicing in order to detect and silence transposable elements. Thus, Aquarius and HRDE-1 act as gatekeepers coordinating gene expression and genome defense. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Yin, Changchuan
2015-04-01
To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulation annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
[Hsp70 Genes of the Megaphragma amalphitanum (Hymenoptera: Trichogrammatidae) Parasitic Wasp].
Chuvakova, L N; Sharko, F S; Nedoluzhko, A V; Polilov, A A; Prokhorchuk, E B; Skryabin, K G; Evgen'ev, M B
2017-01-01
Miniaturization is an evolutionary process that is widely represented in both invertebrates and vertebrates. Miniaturization frequently affects not only the size of the organism and its constituent cells, but also changes the genome structure and functioning. The structure of the main heat shock genes (hsp70 and hsp83) was studied in one of the smallest insects, the Megaphragma amalphitanum (Hymenoptera: Trichogrammatidae) parasitic wasp, which is comparable in size with unicellular organisms. An analysis of the sequenced genome has detected six genes that relate to the hsp70 family, some of which are apparently induced upon heat shock. Both induced and constitutively expressed hsp70 genes contain a large number of introns, which is not typical for the genes of this family. Moreover, none of the found genes form clusters, and they are all very heterogeneous (individual copies are only 75-85% identical), which indicates the absence of gene conversion, which provides the identity of genes of this family in Drosophila and other organisms. Two hsp83 genes, one of which contains an intron, have also been found in the M. amalphitanum genome.
Martínez-Montiel, Nancy; Rosas-Murrieta, Nora Hilda; Martínez-Montiel, Mónica; Gaspariano-Cholula, Mayra Patricia; Martínez-Contreras, Rebeca D
2016-01-01
In eukaryotes, genes are frequently interrupted with noncoding sequences named introns. Alternative splicing is a nuclear mechanism by which these introns are removed and flanking coding regions named exons are joined together to generate a message that will be translated in the cytoplasm. This mechanism is catalyzed by a complex machinery known as the spliceosome, which is conformed by more than 300 proteins and ribonucleoproteins that activate and regulate the precision of gene expression when assembled. It has been proposed that several genetic diseases are related to defects in the splicing process, including cancer. For this reason, natural products that show the ability to regulate splicing have attracted enormous attention due to its potential use for cancer treatment. Some microbial metabolites have shown the ability to inhibit gene splicing and the molecular mechanism responsible for this inhibition is being studied for future applications. Here, we summarize the main types of natural products that have been characterized as splicing inhibitors, the recent advances regarding molecular and cellular effects related to these molecules, and the applications reported so far in cancer therapeutics.
Intron size and genome size in plants.
J. Wendel; R. Cronn; I. Alvarez; B. Liu; R. Small; D. Senchina
2002-01-01
It has long been known that genomes vary over a remarkable range of sizes in both plants (Bennett, Cox, and Leitch 1997) and animals (Gregory 2001). It also has become evident that across the broad phylogenetic sweep, genome size may be correlated with intron size (Deutsch and Long 1999; Vinogradov 1999; McLysaght et al. 2000), suggesting that some component of genome...
Here we document introns in two Symbiodinium clades that were most likely gained following divergence of this genus from other peridinin-containing dinoflagellate lineages. Soluble peridinin-chlorophyll a-proteins (sPCP) occur in short and long forms in different species, and all...
USDA-ARS?s Scientific Manuscript database
The carpel- and stamen-specific AtAGIP promoter derived from the Arabidopsis AGAMOUS (AG) second intron/enhancer is ideal for engineering complete sterility, but it is highly host-specific. To ascertain that a chimeric promoter with similar tissue specificity can be created for species other than A...
X-linked hypophosphatemia attributable to pseudoexons of the PHEX gene.
Christie, P T; Harding, B; Nesbit, M A; Whyte, M P; Thakker, R V
2001-08-01
X-linked hypophosphatemia is commonly caused by mutations of the coding region of PHEX (phosphate-regulating gene with homologies to endopeptidases on the X chromosome). However, such PHEX mutations are not detected in approximately one third of X-linked hypophosphatemia patients who may harbor defects in the noncoding or intronic regions. We have therefore investigated 11 unrelated X-linked hypophosphatemia patients in whom coding region mutations had been excluded, for intronic mutations that may lead to mRNA splicing abnormalities, by the use of lymphoblastoid RNA and RT-PCRs. One X-linked hypophosphatemia patient was found to have 3 abnormally large transcripts, resulting from 51-bp, 100-bp, and 170-bp insertions, all of which would lead to missense peptides and premature termination codons. The origin of these transcripts was a mutation (g to t) at position +1268 of intron 7, which resulted in the occurrence of a high quality novel donor splice site (ggaagg to gtaagg). Splicing between this novel donor splice site and 3 preexisting, but normally silent, acceptor splice sites within intron 7 resulted in the occurrences of the 3 pseudoexons. This represents the first report of PHEX pseudoexons and reveals further the diversity of genetic abnormalities causing X-linked hypophosphatemia.
2010-01-01
β-tubulins are structural components of microtubules and the targets of benzimidazole fungicides used to control many diseases of agricultural importance. Intron polymorphisms in the intron-rich genes of these proteins have been used in phylogeographic investigations of phytopathogenic fungi. In this work, we sequenced 2764 nucleotides of the β-tubulin gene (Pp tubB) in samples of Phakopsora pachyrhizi collected from seven soybean fields in Brazil. Pp tubB contained an open reading frame of 1341 nucleotides, including nine exons and eight introns. Exon length varied from 14 to 880 nucleotides, whereas intron length varied from 76 to 102 nucleotides. The presence of only four polymorphic sites limited the usefulness of Pp tubB for phylogeographic studies in P. pachyrhizi. The gene structures of Pp tubB and orthologous β-tubulin genes of Melampsora lini and Uromyces viciae-fabae were highly conserved. The amino acid substitutions in β-tubulin proteins associated with the onset of benzimidazole resistance in model organisms, especially at His 6 , Glu 198 and Phe 200 , were absent from the predicted sequence of the P. pachyrhizi β-tubulin protein. PMID:21637494
Yap, Karen; Lim, Zhao Qin; Khandelia, Piyush; Friedman, Brad; Makeyev, Eugene V.
2012-01-01
Differentiated cells acquire unique structural and functional traits through coordinated expression of lineage-specific genes. An extensive battery of genes encoding components of the synaptic transmission machinery and specialized cytoskeletal proteins is activated during neurogenesis, but the underlying regulation is not well understood. Here we show that genes encoding critical presynaptic proteins are transcribed at a detectable level in both neurons and nonneuronal cells. However, in nonneuronal cells, the splicing of 3′-terminal introns within these genes is repressed by the polypyrimidine tract-binding protein (Ptbp1). This inhibits the export of incompletely spliced mRNAs to the cytoplasm and triggers their nuclear degradation. Clearance of these intron-containing transcripts occurs independently of the nonsense-mediated decay (NMD) pathway but requires components of the nuclear RNA surveillance machinery, including the nuclear pore-associated protein Tpr and the exosome complex. When Ptbp1 expression decreases during neuronal differentiation, the regulated introns are spliced out, thus allowing the accumulation of translation-competent mRNAs in the cytoplasm. We propose that this mechanism counters ectopic and precocious expression of functionally linked neuron-specific genes and ensures their coherent activation in the appropriate developmental context. PMID:22661231
Yap, Karen; Lim, Zhao Qin; Khandelia, Piyush; Friedman, Brad; Makeyev, Eugene V
2012-06-01
Differentiated cells acquire unique structural and functional traits through coordinated expression of lineage-specific genes. An extensive battery of genes encoding components of the synaptic transmission machinery and specialized cytoskeletal proteins is activated during neurogenesis, but the underlying regulation is not well understood. Here we show that genes encoding critical presynaptic proteins are transcribed at a detectable level in both neurons and nonneuronal cells. However, in nonneuronal cells, the splicing of 3'-terminal introns within these genes is repressed by the polypyrimidine tract-binding protein (Ptbp1). This inhibits the export of incompletely spliced mRNAs to the cytoplasm and triggers their nuclear degradation. Clearance of these intron-containing transcripts occurs independently of the nonsense-mediated decay (NMD) pathway but requires components of the nuclear RNA surveillance machinery, including the nuclear pore-associated protein Tpr and the exosome complex. When Ptbp1 expression decreases during neuronal differentiation, the regulated introns are spliced out, thus allowing the accumulation of translation-competent mRNAs in the cytoplasm. We propose that this mechanism counters ectopic and precocious expression of functionally linked neuron-specific genes and ensures their coherent activation in the appropriate developmental context.
Circular RNA biogenesis can proceed through an exon-containing lariat precursor.
Barrett, Steven P; Wang, Peter L; Salzman, Julia
2015-06-09
Pervasive expression of circular RNA is a recently discovered feature of eukaryotic gene expression programs, yet its function remains largely unknown. The presumed biogenesis of these RNAs involves a non-canonical 'backsplicing' event. Recent studies in mammalian cell culture posit that backsplicing is facilitated by inverted repeats flanking the circularized exon(s). Although such sequence elements are common in mammals, they are rare in lower eukaryotes, making current models insufficient to describe circularization. Through systematic splice site mutagenesis and the identification of splicing intermediates, we show that circular RNA in Schizosaccharomyces pombe is generated through an exon-containing lariat precursor. Furthermore, we have performed high-throughput and comprehensive mutagenesis of a circle-forming exon, which enabled us to discover a systematic effect of exon length on RNA circularization. Our results uncover a mechanism for circular RNA biogenesis that may account for circularization in genes that lack noticeable flanking intronic secondary structure.
RatMap--rat genome tools and data.
Petersen, Greta; Johnson, Per; Andersson, Lars; Klinga-Levan, Karin; Gómez-Fabre, Pedro M; Ståhl, Fredrik
2005-01-01
The rat genome database RatMap (http://ratmap.org or http://ratmap.gen.gu.se) has been one of the main resources for rat genome information since 1994. The database is maintained by CMB-Genetics at Goteborg University in Sweden and provides information on rat genes, polymorphic rat DNA-markers and rat quantitative trait loci (QTLs), all curated at RatMap. The database is under the supervision of the Rat Gene and Nomenclature Committee (RGNC); thus much attention is paid to rat gene nomenclature. RatMap presents information on rat idiograms, karyotypes and provides a unified presentation of the rat genome sequence and integrated rat linkage maps. A set of tools is also available to facilitate the identification and characterization of rat QTLs, as well as the estimation of exon/intron number and sizes in individual rat genes. Furthermore, comparative gene maps of rat in regard to mouse and human are provided.
RatMap—rat genome tools and data
Petersen, Greta; Johnson, Per; Andersson, Lars; Klinga-Levan, Karin; Gómez-Fabre, Pedro M.; Ståhl, Fredrik
2005-01-01
The rat genome database RatMap (http://ratmap.org or http://ratmap.gen.gu.se) has been one of the main resources for rat genome information since 1994. The database is maintained by CMB–Genetics at Göteborg University in Sweden and provides information on rat genes, polymorphic rat DNA-markers and rat quantitative trait loci (QTLs), all curated at RatMap. The database is under the supervision of the Rat Gene and Nomenclature Committee (RGNC); thus much attention is paid to rat gene nomenclature. RatMap presents information on rat idiograms, karyotypes and provides a unified presentation of the rat genome sequence and integrated rat linkage maps. A set of tools is also available to facilitate the identification and characterization of rat QTLs, as well as the estimation of exon/intron number and sizes in individual rat genes. Furthermore, comparative gene maps of rat in regard to mouse and human are provided. PMID:15608244
Cancer-Associated Perturbations in Alternative Pre-messenger RNA Splicing.
Shkreta, Lulzim; Bell, Brendan; Revil, Timothée; Venables, Julian P; Prinos, Panagiotis; Elela, Sherif Abou; Chabot, Benoit
2013-01-01
For most of our 25,000 genes, the removal of introns by pre-messenger RNA (pre-mRNA) splicing represents an essential step toward the production of functional messenger RNAs (mRNAs). Alternative splicing of a single pre-mRNA results in the production of different mRNAs. Although complex organisms use alternative splicing to expand protein function and phenotypic diversity, patterns of alternative splicing are often altered in cancer cells. Alternative splicing contributes to tumorigenesis by producing splice isoforms that can stimulate cell proliferation and cell migration or induce resistance to apoptosis and anticancer agents. Cancer-specific changes in splicing profiles can occur through mutations that are affecting splice sites and splicing control elements, and also by alterations in the expression of proteins that control splicing decisions. Recent progress in global approaches that interrogate splicing diversity should help to obtain specific splicing signatures for cancer types. The development of innovative approaches for annotating and reprogramming splicing events will more fully establish the essential contribution of alternative splicing to the biology of cancer and will hopefully provide novel targets and anticancer strategies. Metazoan genes are usually made up of several exons interrupted by introns. The introns are removed from the pre-mRNA by RNA splicing. In conjunction with other maturation steps, such as capping and polyadenylation, the spliced mRNA is then transported to the cytoplasm to be translated into a functional protein. The basic mechanism of splicing requires accurate recognition of each extremity of each intron by the spliceosome. Introns are identified by the binding of U1 snRNP to the 5' splice site and the U2AF65/U2AF35 complex to the 3' splice site. Following these interactions, other proteins and snRNPs are recruited to generate the complete spliceosomal complex needed to excise the intron. While many introns are constitutively removed by the spliceosome, other splice junctions are not used systematically, generating the phenomenon of alternative splicing. Alternative splicing is therefore the process by which a single species of pre-mRNA can be matured to produce different mRNA molecules (Fig. 1). Depending on the number and types of alternative splicing events, a pre-mRNA can generate from two to several thousands different mRNAs leading to the production of a corresponding number of proteins. It is now believed that the expression of at least 70 % of human genes is subjected to alternative splicing, implying an enormous contribution to proteomic diversity, and by extension, to the development and the evolution of complex animals. Defects in splicing have been associated with human diseases (Caceres and Kornblihtt, Trends Genet 18(4):186-93, 2002, Cartegni et al., Nat Rev Genet 3(4):285-98, 2002, Pagani and Baralle, Nat Rev Genet 5(5):389-96, 2004), including cancer (Brinkman, Clin Biochem 37(7):584-94, 2004, Venables, Bioessays 28(4):378-86, 2006, Srebrow and Kornblihtt, J Cell Sci 119(Pt 13):2635-2641, 2006, Revil et al., Bull Cancer 93(9):909-919, 2006, Venables, Transworld Res Network, 2006, Pajares et al., Lancet Oncol 8(4):349-57, 2007, Skotheim and Nees, Int J Biochem Cell Biol 39:1432-1449, 2007). Numerous studies have now confirmed the existence of specific differences in the alternative splicing profiles between normal and cancer tissues. Although there are a few cases where specific mutations are the primary cause for these changes, global alterations in alternative splicing in cancer cells may be primarily derived from changes in the expression of RNA-binding proteins that control splice site selection. Overall, these cancer-specific differences in alternative splicing offer an immense potential to improve the diagnosis and the prognosis of cancer. This review will focus on the functional impact of cancer-associated alternative splicing variants, the molecular determinants that alter the splicing decisions in cancer cells, and future therapeutic strategies.
Adams, Gerard C; Surve-Iyer, Rupa S; Iezzoni, Amy F
2002-01-01
Leucostoma species that are the causal agents of Cytospora canker of stone and pome fruit trees were studied in detail. DNA sequence of the internal transcribed spacer regions and the 5.8S of the nuclear ribosomal DNA operon (ITS rDNA) supplied sufficient characters to assess the phylogenetic relationships among species of Leucostoma, Valsa, Valsella, and related anamorphs in Cytospora. Parsimony analysis of the aligned sequence divided Cytospora isolates from fruit trees into clades that generally agreed with the morphological species concepts, and with some of the phenetic groupings (PG 1-6) identified previously by isozyme analysis and cultural characteristics. Phylogenetic analysis inferred that isolates of L. persoonii formed two well-resolved clades distinct from isolates of L. cinctum. Phylogenetic analysis of the ITS rDNA, isozyme analysis, and cultural characteristics supported the inference that L. persoonii groups PG 2 and PG 3 were populations of a new species apparently more genetically different from L. persoonii PG 1 than from isolates representative of L. massariana, L. niveum, L. translucens, and Valsella melastoma. The new species, L. parapersoonii, was described. A diverse collection of isolates of L. cinctum, L. persoonii, and L. parapersoonii were examined for genetic variation using restriction fragment length polymorphism (RFLP) analysis of the ITS rDNA and the five prime end of the large subunit of the rDNA (LSU rDNA). HinfI and HpaII endonucleases were each useful in dividing the Leucostoma isolates into RFLP profiles corresponding to the isozyme phenetic groups, PG 1-6. RFLP analysis was more effective than isozyme analysis in uncovering variation among isolates of L. persoonii PG 1, but less effective within L. cinctum populations. Isolates representative of seven of the L. persoonii formae speciales proposed by G. Défago in 1935 were found to be genetically diverse isolates of PG 1. Two large insertions, 415 and 309 nucleotides long, in the small subunit (SSU) of the nuclear rDNA of L. cinctum were identified as Group 1 introns; intron 1 at position 943 and intron 2 at position 1199. The two introns were found to be consistently present in isolates of L. cinctum PG 4 and PG 5 and absent from L. cinctum PG 6 isolates, despite the similarity of the ITS sequence and teleomorph morphology. Intron 1 was of subgroup 1C1 whereas intron 2 was of an unknown subgroup. RFLP patterns and presence/absence of introns were useful characters for expediting the identification of cultures of Leucostoma isolated from stone and pome fruit cankers. RFLP patterns from 13 endonucleases provided an effective method for selecting an array of diverse PG 1 isolates useful in screening plant germplasm for disease-resistance.
Mizrahi-Aviv, Ela; Mills, David; Benzioni, Aliza; Bar-Zvi, Dudy
2005-03-01
Chloroplast metabolism is rapidly affected by salt stress. Photosynthesis is one of the first processes known to be affected by salinity. Here, we report that salinity inhibits chloroplast post-transcriptional RNA processing. A differentially expressed 680-bp cDNA, containing the 3' sequence of 16S rRNA, transcribed intergenic spacer, exon 1 and intron of tRNA(Ile), was isolated by differential display reverse transcriptase PCR from salt-grown jojoba (Simmondsia chinesis) shoot cultures. Northern blot analysis indicated that although most rRNA appears to be fully processed, partially processed chloroplast 16S rRNA accumulates in salt-grown cultures. Thus, salinity appears to decrease the processing of the rrn transcript. The possible effect of this decreased processing on physiological processes is, as yet, unknown.
Nadjar-Boger, Elisabeth; Funkenstein, Bruria
2011-02-01
Myostatin (MSTN) is a member of the transforming growth factor-ß superfamily that functions as a negative regulator of skeletal muscle development and growth in mammals. Fish express at least two genes for MSTN: MSTN-1 and MSTN-2. To date, MSTN-2 promoters have been cloned only from salmonids and zebrafish. Here we described the cloning and sequence analysis of MSTN-2 gene and its 5' flanking region in the marine fish Sparus aurata (saMSTN-2). We demonstrate the existence of three alleles of the promoter and three alleles of the first intron. Sequence comparison of the promoter region in the three alleles revealed that although the sequences of the first 1050 bp upstream of the translation start site are almost identical in the three alleles, a substantial sequence divergence is seen further upstream. Careful sequence analysis of the region upstream of the first 1050 bp in the three alleles identified several elements that appear to be repeated in some or all sequences, at different positions. This suggests that the promoter region of saMSTN-2 has been subjected to various chromosomal rearrangements during the course of evolution, reflecting either insertion or deletion events. Screening of several genomic DNA collections indicated differences in allele frequency, with allele 'b' being the most abundant, followed by allele 'c', whereas allele 'a' is relatively rare. Sequence analysis of saMSTN-2 gene also revealed polymorphism in the first intron, identifying three alleles. The length difference in alleles '1R' and '2R' of the first intron is due to the presence of one or two copies of a repeated block of approximately 150 bp, located at the 5' end of the first intron. The third allele, '4R', has an additional insertion of 323 bp located 116 bp upstream of the 3' end of the first intron. Analysis of several DNA collections showed that the '2R' allele is the most common, followed by the '4R' allele, whereas the '1R' allele is relatively rare. Progeny analysis of a full-sib family showed a Mendelian mode of inheritance of the two genetic loci. No clear association was found between the two genetic markers and growth rate. These results show for the first time a substantial degree of polymorphism in both the promoter and first intron of MSTN-2 gene in a perciform fish species which points to chromosomal rearrangements that took place during evolution.
Wickramasinghe, Susiji; Yatawara, Lalani; Nagataki, Mitsuru; Agatsuma, Takeshi
2016-10-01
To determine exon/intron organization of the Toxocara canis (T. canis) AK (TCAK) and to test green and black tea and several other chemicals against the activity of recombinant TCAK in the guanidino-specific region by site-directed mutants. Amplification of genomic DNA fragments containing introns was carried out by PCRs. The open-reading frame (1200 bp) of TCAK (wild type) was cloned into the BamH1/SalI site of pMAL-c2X. The maltose-binding protein-TCAK fusion protein was expressed in Escherichia coli TB1 cells. The purity of the expressed enzyme was verified by SDS-PAGE. Mutations were introduced into the guanidino-specific region and other areas of pMAL/TCAK by PCR. Enzyme activity was measured with an NADH-linked assay at 25 °C for the forward reaction (phosphagen synthesis). Arginine kinase in T. canis has a seven-exon/six-intron gene structure. The lengths of the introns ranged from 542 bp to 2 500 bp. All introns begin with gt and end with ag. Furthermore, we measured the enzyme activity of site-directed mutants of the recombinant TCAK. The K m value of the mutant (Alanine to Serine) decreased indicating a higher affinity for substrate arginine than the wild-type. The K m value of the mutant (Serine to Glycine) increased to 0.19 mM. The K m value (0.19 mM) of the double mutant (Alanine-Serine to Serine-Glycine) was slightly greater than in the wild-type (0.12 mM). In addition, several other chemicals were tested; including plant extract Azadiracta indica (A. indica), an aminoglycoside antibiotic (aminosidine), a citrus flavonoid glycoside (rutin) and a commercially available catechin mixture against TCAK. Green and black tea (1:10 dilution) produced 15% and 25% inhibition of TCAK, respectively. The extract of A. indica produced 5% inhibition of TCAK. Moreover, green and black tea produced a non-competitive type of inhibition and A. indica produced a mixed-type of inhibition on TCAK. Arginine kinase in T. canis has a seven-exon/six-intron gene structure. However, further studies are needed to identify a specific compound within the extract causing the inhibitory effect and also to determine the molecular mechanisms behind inhibition of arginine kinase in T. canis. Copyright © 2016 Hainan Medical University. Production and hosting by Elsevier B.V. All rights reserved.
Li, Wencheng; You, Bei; Hoque, Mainul; Zheng, Dinghai; Luo, Wenting; Ji, Zhe; Park, Ji Yeon; Gunderson, Samuel I.; Kalsotra, Auinash; Manley, James L.; Tian, Bin
2015-01-01
Alternative cleavage and polyadenylation (APA) results in mRNA isoforms containing different 3’ untranslated regions (3’UTRs) and/or coding sequences. How core cleavage/polyadenylation (C/P) factors regulate APA is not well understood. Using siRNA knockdown coupled with deep sequencing, we found that several C/P factors can play significant roles in 3’UTR-APA. Whereas Pcf11 and Fip1 enhance usage of proximal poly(A) sites (pAs), CFI-25/68, PABPN1 and PABPC1 promote usage of distal pAs. Strong cis element biases were found for pAs regulated by CFI-25/68 or Fip1, and the distance between pAs plays an important role in APA regulation. In addition, intronic pAs are substantially regulated by splicing factors, with U1 mostly inhibiting C/P events in introns near the 5’ end of gene and U2 suppressing those in introns with features for efficient splicing. Furthermore, PABPN1 inhibits expression of transcripts with pAs near the transcription start site (TSS), a property possibly related to its role in RNA degradation. Finally, we found that groups of APA events regulated by C/P factors are also modulated in cell differentiation and development with distinct trends. Together, our results support an APA code where an APA event in a given cellular context is regulated by a number of parameters, including relative location to the TSS, splicing context, distance between competing pAs, surrounding cis elements and concentrations of core C/P factors. PMID:25906188
Haïli, Nawel; Planchard, Noelya; Arnal, Nadège; Quadrado, Martine; Vrielynck, Nathalie; Dahan, Jennifer; des Francs-Small, Catherine Colas; Mireau, Hakim
2016-01-01
Mitochondrial translation involves a complex interplay of ancient bacteria-like features and host-derived functionalities. Although the basic components of the mitochondrial translation apparatus have been recognized, very few protein factors aiding in recruiting ribosomes on mitochondria-encoded messenger RNA (mRNAs) have been identified in higher plants. In this study, we describe the identification of the Arabidopsis (Arabidopsis thaliana) MITOCHONDRIAL TRANSLATION FACTOR1 (MTL1) protein, a new member of the Pentatricopeptide Repeat family, and show that it is essential for the translation of the mitochondrial NADH dehydrogenase subunit7 (nad7) mRNA. We demonstrate that mtl1 mutant plants fail to accumulate the Nad7 protein, even though the nad7 mature mRNA is produced and bears the same 5' and 3' extremities as in wild-type plants. We next observed that polysome association of nad7 mature mRNA is specifically disrupted in mtl1 mutants, indicating that the absence of Nad7 results from a lack of translation of nad7 mRNA. These findings illustrate that mitochondrial translation requires the intervention of gene-specific nucleus-encoded PPR trans-factors and that their action does not necessarily involve the 5' processing of their target mRNA, as observed previously. Interestingly, a partial decrease in nad7 intron 2 splicing was also detected in mtl1 mutants, suggesting that MTL1 is also involved in group II intron splicing. However, this second function appears to be less essential for nad7 expression than its role in translation. MTL1 will be instrumental to understand the multifunctionality of PPR proteins and the mechanisms governing mRNA translation and intron splicing in plant mitochondria. © 2016 American Society of Plant Biologists. All Rights Reserved.
Rocchi, V; Janni, M; Bellincampi, D; Giardina, T; D'Ovidio, R
2012-03-01
Pectin is an important component of the plant cell wall and its remodelling occurs during normal plant growth or following stress responses. Pectin is secreted into the cell wall in a highly methyl-esterified form and subsequently de-methyl-esterified by pectin methyl esterase (PME), whose activity is controlled by the pectin methyl esterase inhibitor protein (PMEI). Cereal cell wall contains a low amount of pectin; nonetheless the level and pattern of pectin methyl esterification play a primary role during development or pathogen infection. Since few data are available on the role of PMEI in plant development and defence of cereal species, we isolated and characterised three Pmei genes (Tdpmei2.1, Tdpmei2.2 and Tdpmei3) and their encoded products in wheat. Sequence comparisons showed a low level of intra- and inter-specific sequence conservation of PMEIs. Tdpmei2.1 and Tdpmei2.2 share 94% identity at protein level, but only 20% identity with the product of Tdpmei3. All three Tdpmei genes code for functional inhibitors of plant PMEs and do not inhibit microbial PMEs or a plant invertase. RT-PCR analyses demonstrated, for the first time to our knowledge, that Pmei genes are regulated by intron retention. Processed and unprocessed transcripts of Tdpmei2.1 and Tdpmei2.2 accumulated in several organs, but anthers contained only mature transcripts. Tdpmei3 lacks introns and its transcript accumulated mainly in stem internodes. These findings suggest that products encoded by these Tdpmei genes control organ- or tissue-specific activity of specific PME isoforms in wheat. © 2011 German Botanical Society and The Royal Botanical Society of the Netherlands.
Fernández-Guerra, Paula; Navarrete, Rosa; Weisiger, Kara; Desviat, Lourdes R; Packman, Seymour; Ugarte, Magdalena; Rodríguez-Pombo, Pilar
2010-12-01
Mutations in any of the three different genes--BCKDHA, BCKDHB, and DBT--encoding for the E1α, E1β, and E2 catalytic components of the branched-chain α-ketoacid dehydrogenase complex can cause maple syrup urine disease (MSUD). Disease severity ranges from the classic to the mildest variant types and precise genotypes, mostly based on missense mutations, have been associated to the less severe presentations of the disease. Herein, we examine the consequences at the messenger RNA (mRNA) level of the novel intronic alteration c.288+9C>T found in heterozygous fashion in a BCKDHA variant MSUD patient who also carries the nucleotide change c.745G>A (p.Gly249Ser), previously described as a severe change. Direct analysis of the processed transcripts from the patient showed--in addition to a low but measurable level of normal mRNA product--an aberrantly spliced mRNA containing a 7-bp fragment of intron 2, which could be rescued when the patient's cells were treated with emetine. This aberrant transcript with a premature stop codon would be unstable, supporting the possible activation of nonsense-mediated mRNA decay pathway. Consistent with this finding, minigene splicing assays demonstrated that the point mutation c.288+9C>T is sufficient to create a cryptic splice site and cause the observed 7-bp insertion. Furthermore, our results strongly suggest that the c.288+9C>T allele in the patient generates both normal and aberrant transcripts that could sustain the variant presentation of the disease, highlighting the importance of correct genotyping to establish genotype-phenotype correlations and as basis for the development of therapeutic interventions.
Metal dependence and branched RNA cocrystal structures of the RNA lariat debranching enzyme Dbr1
DOE Office of Scientific and Technical Information (OSTI.GOV)
Clark, Nathaniel E.; Katolik, Adam; Roberts, Kenneth M.
Intron lariats are circular, branched RNAs (bRNAs) produced during pre-mRNA splicing. Their unusual chemical and topological properties arise from branch-point nucleotides harboring vicinal 2',5'- and 3',5'-phosphodiester linkages. The 2',5'-bonds must be hydrolyzed by the RNA debranching enzyme Dbr1 before spliced introns can be degraded or processed into small nucleolar RNA and microRNA derived from intronic RNA. Here, we measure the activity of Dbr1 from Entamoeba histolytica by using a synthetic, dark-quenched bRNA substrate that fluoresces upon hydrolysis. Purified enzyme contains nearly stoichiometric equivalents of Fe and Zn per polypeptide and demonstrates turnover rates of ~3 s -1. Similar rates aremore » observed when apo-Dbr1 is reconstituted with Fe(II)+Zn(II) under aerobic conditions. Under anaerobic conditions, a rate of ~4.0 s -1 is observed when apoenzyme is reconstituted with Fe(II). In contrast, apo-Dbr1 reconstituted with Mn(II) or Fe(II) under aerobic conditions is inactive. Diffraction data from crystals of purified enzyme using X-rays tuned to the Fe absorption edge show Fe partitions primarily to the β-pocket and Zn to the α-pocket. Structures of the catalytic mutant H91A in complex with 7-mer and 16-mer synthetic bRNAs reveal bona fide RNA branchpoints in the Dbr1 active site. A bridging hydroxide is in optimal position for nucleophilic attack of the scissile phosphate. The results clarify uncertainties regarding structure/function relationships in Dbr1 enzymes, and the fluorogenic probe permits high-throughput screening for inhibitors that may hold promise as treatments for retroviral infections and neurodegenerative disease.« less
Spinelli, Roberta; Pirola, Alessandra; Redaelli, Sara; Sharma, Nitesh; Raman, Hima; Valletta, Simona; Magistroni, Vera; Piazza, Rocco; Gambacorti-Passerini, Carlo
2013-11-01
Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC 3 (NM_003786.3:c.1783-1G>A), KLHDC 1 (NM_172193.1:c.568-2A>G), HOOK 1 (NM_015888.4:c.1662-1G>A), SMAD 9 (NM_001127217.2:c.1004-1C>T), and DNAH 9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC 3, HOOK 1. In ABCC 3 and HOOK 1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK 1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4-6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis.
Zeng, Xian-Chun; Liu, Yichen; Shi, Wanxia; Zhang, Lei; Luo, Xuesong; Nie, Yao; Yang, Ye
2014-03-01
It was shown that peptides containing trypsin inhibitor-like cysteine-rich (TIL) domain are able to inhibit proteinase activities, and thus play important roles in various biological processes, such as immune response and anticoagulation. However, only a limited number of the TIL peptides have been identified and characterized so far; and little has been known about the evolutionary relationships of the genes encoding the TIL peptides. BmKAPi is a TIL domain-containing peptide that was identified from Mesobuthus martensii Karsch. Here, we conducted genome-wide searches for new peptides that are homologous to BmKAPi or possess a cysteine pattern similar to that of BmKAPi. As a result, we identified a total of 80 different TIL peptides from 34 species of arthropods. We found that these peptides can be classified into seven evolutionarily distinct groups. Furthermore, we cloned the genomic sequence of BmKAPi; the genomic sequences of the majority of other TIL peptides were also identified from the GenBank database using bioinformatical approaches. Through phylogenetic and comparative genomic analysis, we found 26 cases of intron gain events occurred in the genes of the TIL peptides; however, no instances of intron loss were observed. Moreover, we found that alternative splicing contributes to the diversification of the TIL peptides. It is interesting to see that four genes of the TIL domain-containing peptides overlap in a DNA region located on the chromosome LG B15 of Bombus terretris. These data suggest that the evolution of the TIL peptide genes are dynamic, which was dominated by intron gain. Copyright © 2013 Elsevier Inc. All rights reserved.
An RNA internal loop acts as a hinge to facilitate ribozyme folding and catalysis.
Szewczak, A A; Cech, T R
1997-01-01
RNA molecules commonly consist of helical regions separated by internal loops, and in many cases these internal loops have been found to assume stable structures. We have examined the function and dynamics of an internal loop, J5/5a, that joins the two halves of the P4-P6 domain of the Tetrahymena self-splicing group I intron. P4-P6 RNAs with mutations in the J5/5a region showed nondenaturing gel electrophoretic mobilities and levels of Fe(II)-EDTA cleavage protection intermediate between those of wild-type RNA and a mutant incapable of folding into the native P4-P6 tertiary structure. Mutants with the least structured J5/5a loops behaved the most like wild-type P4-P6, and required smaller amounts of Mg2+ to rescue folding. The activity of reconstituted introns containing mutant P4-P6 RNAs correlated similarly with the nature of the J5/5a mutation. Our results suggest that, in solution, the P4-P6 RNA is in a two-state equilibrium between folded and unfolded states. We conclude that this internal loop mainly acts as a flexible hinge, allowing the coaxially stacked helical regions on either side of it to interact via specific tertiary contacts. To a lesser extent, the specific bases within the loop contribute to folding. Furthermore, it is crucial that the junction remain unstructured in the unfolded state. These conclusions cannot be derived from a simple examination of the P4-P6 crystal structure (Cate JH et al., 1996, Science 273:1678-1685), showing once again that structure determination must be supplemented with mutational and thermodynamic analysis to provide a complete picture of a folded macromolecule. PMID:9257643
Kelly, Scott A.; Bell, Timothy A.; Selitsky, Sara R.; Buus, Ryan J.; Hua, Kunjie; Weinstock, George M.; Garland, Theodore; Pardo-Manuel de Villena, Fernando; Pomp, Daniel
2013-01-01
Replicated artificial selection for high levels of voluntary wheel running in an outbred strain of mice favored an autosomal recessive allele whose primary phenotypic effect is a 50% reduction in hind-limb muscle mass. Within the High Runner (HR) lines of mice, the numerous pleiotropic effects (e.g., larger hearts, reduced total body mass and fat mass, longer hind-limb bones) of this hypothesized adaptive allele include functional characteristics that facilitate high levels of voluntary wheel running (e.g., doubling of mass-specific muscle aerobic capacity, increased fatigue resistance of isolated muscles, longer hind-limb bones). Previously, we created a backcross population suitable for mapping the responsible locus. We phenotypically characterized the population and mapped the Minimsc locus to a 2.6-Mb interval on MMU11, a region containing ∼100 known or predicted genes. Here, we present a novel strategy to identify the genetic variant causing the mini-muscle phenotype. Using high-density genotyping and whole-genome sequencing of key backcross individuals and HR mice with and without the mini-muscle mutation, from both recent and historical generations of the HR lines, we show that a SNP representing a C-to-T transition located in a 709-bp intron between exons 11 and 12 of the Myosin heavy polypeptide 4 (Myh4) skeletal muscle gene (position 67,244,850 on MMU11; assembly, December 2011, GRCm38/mm10; ENSMUSG00000057003) is responsible for the mini-muscle phenotype, Myh4Minimsc. Using next-generation sequencing, our approach can be extended to identify causative mutations arising in mouse inbred lines and thus offers a great avenue to overcome one of the most challenging steps in quantitative genetics. PMID:24056412
Kelly, Scott A; Bell, Timothy A; Selitsky, Sara R; Buus, Ryan J; Hua, Kunjie; Weinstock, George M; Garland, Theodore; Pardo-Manuel de Villena, Fernando; Pomp, Daniel
2013-12-01
Replicated artificial selection for high levels of voluntary wheel running in an outbred strain of mice favored an autosomal recessive allele whose primary phenotypic effect is a 50% reduction in hind-limb muscle mass. Within the High Runner (HR) lines of mice, the numerous pleiotropic effects (e.g., larger hearts, reduced total body mass and fat mass, longer hind-limb bones) of this hypothesized adaptive allele include functional characteristics that facilitate high levels of voluntary wheel running (e.g., doubling of mass-specific muscle aerobic capacity, increased fatigue resistance of isolated muscles, longer hind-limb bones). Previously, we created a backcross population suitable for mapping the responsible locus. We phenotypically characterized the population and mapped the Minimsc locus to a 2.6-Mb interval on MMU11, a region containing ∼100 known or predicted genes. Here, we present a novel strategy to identify the genetic variant causing the mini-muscle phenotype. Using high-density genotyping and whole-genome sequencing of key backcross individuals and HR mice with and without the mini-muscle mutation, from both recent and historical generations of the HR lines, we show that a SNP representing a C-to-T transition located in a 709-bp intron between exons 11 and 12 of the Myosin heavy polypeptide 4 (Myh4) skeletal muscle gene (position 67,244,850 on MMU11; assembly, December 2011, GRCm38/mm10; ENSMUSG00000057003) is responsible for the mini-muscle phenotype, Myh4(Minimsc). Using next-generation sequencing, our approach can be extended to identify causative mutations arising in mouse inbred lines and thus offers a great avenue to overcome one of the most challenging steps in quantitative genetics.
Xu, Jin; Lu, Zhigang; Xu, Mingming; Pan, Ling; Deng, Yi; Xie, Xiaohu; Liu, Huifen; Ding, Shixiong; Hurd, Yasmin L.; Pasternak, Gavril W.; Klein, Robert J.; Cartegni, Luca
2014-01-01
Single nucleotide polymorphisms (SNPs) in the OPRM1 gene have been associated with vulnerability to opioid dependence. The current study identifies an association of an intronic SNP (rs9479757) with the severity of heroin addiction among Han-Chinese male heroin addicts. Individual SNP analysis and haplotype-based analysis with additional SNPs in the OPRM1 locus showed that mild heroin addiction was associated with the AG genotype, whereas severe heroin addiction was associated with the GG genotype. In vitro studies such as electrophoretic mobility shift assay, minigene, siRNA, and antisense morpholino oligonucleotide studies have identified heterogeneous nuclear ribonucleoprotein H (hnRNPH) as the major binding partner for the G-containing SNP site. The G-to-A transition weakens hnRNPH binding and facilitates exon 2 skipping, leading to altered expressions of OPRM1 splice-variant mRNAs and hMOR-1 proteins. Similar changes in splicing and hMOR-1 proteins were observed in human postmortem prefrontal cortex with the AG genotype of this SNP when compared with the GG genotype. Interestingly, the altered splicing led to an increase in hMOR-1 protein levels despite decreased hMOR-1 mRNA levels, which is likely contributed by a concurrent increase in single transmembrane domain variants that have a chaperone-like function on MOR-1 protein stability. Our studies delineate the role of this SNP as a modifier of OPRM1 alternative splicing via hnRNPH interactions, and suggest a functional link between an SNP-containing splicing modifier and the severity of heroin addiction. PMID:25122903
Chang, Yan-Li; Li, Wen-Yan; Miao, Hai; Yang, Shuai-Qi; Li, Ri; Wang, Xiang; Li, Wen-Qiang; Chen, Kun-Ming
2016-02-23
Plasma membrane NADPH oxidases (NOXs) are key producers of reactive oxygen species under both normal and stress conditions in plants and they form functional subfamilies. Studies of these subfamilies indicated that they show considerable evolutionary selection. We performed a comparative genomic analysis that identified 50 ferric reduction oxidases (FRO) and 77 NOX gene homologs from 20 species representing the eight major plant lineages within the supergroup Plantae: glaucophytes, rhodophytes, chlorophytes, bryophytes, lycophytes, gymnosperms, monocots, and eudicots. Phylogenetic and structural analysis classified these FRO and NOX genes into four well-conserved groups represented as NOX, FRO I, FRO II, and FRO III. Further analysis of NOXs of phylogenetic and exon/intron structures showed that single intron loss and gain had occurred, yielding the diversified gene structures during the evolution of NOXs family genes and which were classified into four conserved subfamilies which are represented as Sub.I, Sub.II, Sub.III, and Sub.IV. Additionally, both available global microarray data analysis and quantitative real-time PCR experiments revealed that the NOX genes in Arabidopsis and rice (Oryza sativa) have different expression patterns in different developmental stages, various abiotic stresses and hormone treatments. Finally, coexpression network analysis of NOX genes in Arabidopsis and rice revealed that NOXs have significantly correlated expression profiles with genes which are involved in plants metabolic and resistance progresses. All these results suggest that NOX family underscores the functional diversity and divergence in plants. This finding will facilitate further studies of the NOX family and provide valuable information for functional validation of this family in plants. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Verma, Henu Kumar; Jha, Aditya Nath; Khodiar, Prafulla Kumar; Patra, Pradeep Kumar; Bhaskar, Lakkakula Venkata Kameswara Subrahmanya
2016-06-01
Cytokines are cell signaling molecules which upon release by cells facilitate the recruitment of immune-modulatory cells towards the sites of inflammation. Genetic variations in cytokine genes are shown to regulate their production and affect the risk of infectious as well as autoimmune diseases. Intron-3 of interleukin-4 gene (IL-4) harbors 70-bp variable number of tandem repeats (VNTR) that may alter the expression level of IL-4 gene. To determine the distribution of IL-4 70-bp VNTR polymorphism in seven genetically heterogeneous populations of Chhattisgarh, India and their comparison with the finding of other Indian and world populations. A total of 371 healthy unrelated individuals from 5 caste and 2 tribal populations were included in the present study. The IL-4 70-bp VNTR genotyping was carried out using PCR and electrophoresis. Overall, 3 alleles of IL-4 70-bp VNTR (a2, a3 and a4) were detected. The results demonstrated the variability of the IL-4 70-bp VNTR polymorphism in Chhattisgarh populations. Allele a3 was the most common allele at the 70-bp VNTR locus in all populations followed by a2 allele. This study reports the presence four repeat allele a4 at a low frequency in the majority of the Chhattisgarh populations studied. Further, the frequency of the minor allele (a2) in Chhattisgarh populations showed similarity with the frequencies of European populations but not with the East Asian populations where the a2 allele is a major allele. Our study provides a baseline for future research into the role of the IL-4 locus in diseases linked to inflammation in Indian populations.
Demissie, Zerihun A; Cella, Monica A; Sarker, Lukman S; Thompson, Travis J; Rheault, Mark R; Mahmoud, Soheil S
2012-07-01
Several members of the genus Lavandula produce valuable essential oils (EOs) that are primarily constituted of the low molecular weight isoprenoids, particularly monoterpenes. We isolated over 8,000 ESTs from the glandular trichomes of L. x intermedia flowers (where bulk of the EO is synthesized) to facilitate the discovery of genes that control the biosynthesis of EO constituents. The expression profile of these ESTs in L. x intermedia and its parents L. angustifolia and L. latifolia was established using microarrays. The resulting data highlighted a differentially expressed, previously uncharacterized cDNA with strong homology to known 1,8-cineole synthase (CINS) genes. The ORF, excluding the transit peptide, of this cDNA was expressed in E. coli, purified by Ni-NTA agarose affinity chromatography and functionally characterized in vitro. The ca. 63 kDa bacterially produced recombinant protein, designated L. x intermedia CINS (LiCINS), converted geranyl diphosphate (the linear monoterpene precursor) primarily to 1,8-cineole with K ( m ) and k ( cat ) values of 5.75 μM and 8.8 × 10(-3) s(-1), respectively. The genomic DNA of CINS in the studied Lavandula species had identical exon-intron architecture and coding sequences, except for a single polymorphic nucleotide in the L. angustifolia ortholog which did not alter protein function. Additional nucleotide variations restricted to L. angustifolia introns were also observed, suggesting that LiCINS was most likely inherited from L. latifolia. The LiCINS mRNA levels paralleled the 1,8-cineole content in mature flowers of the three lavender species, and in developmental stages of L. x intermedia inflorescence indicating that the production of 1,8 cineole in Lavandula is most likely controlled through transcriptional regulation of LiCINS.
Zhang, Lili; Sun, Chengfei; Ye, Xing; Zou, Shuming; Lu, Maixin; Liu, Zhigang; Tian, Yuanyuan
2014-02-01
Heat-shock proteins (Hsps), known as stress proteins and extrinsic chaperones, play important roles in the folding, translocation, and refolding/degradation of proteins. In this study, we identified four Hsps in Nile tilapia (Oreochromis niloticus), which display conserved Hsp characteristics in their predicted amino acid sequences. Further analyses on the structures, homology, and phylogenetics revealed that the four Hsps belong to Hsp70 family. One of them does not contain introns and is named Hsp70, while all the other three contain introns and are named Hsc70-1, Hsc70-2, and Hsc70-3. Expressions of the four Hsp proteins were observed in all examined tissues. Six hours after infection of Streptococcus agalactiae in Nile tilapia, the expression of Hsp70 was significantly increased in the liver, head kidney, spleen and gill, while Hsc70s' expression was unchanged in all examined tissues except the head kidney that showed significantly reduced expression of both Hsc70-2 and Hsc70-3. These results suggest that Hsp70 may participate in the defense against S. agalactiae infection. We then isolated the promoter of Hsp70 gene and inserted it into the donor plasmid of Tgf2 transposon system containing green fluorescent protein (GFP) gene. The plasmid was microinjected into zebrafish embryos, where the expression of GFP was induced by heat shock, S. agalactiae immersion challenge, indicating that the isolated Hsp70 promoter has transcriptional activity and is inducible by both heat shock and bacterial challenge. This promoter may facilitate the future construction of disease-resistant transgenic fish. The work also contributes to the further study of immune response of tilapia after bacterial infection.
Xue, Yufei; Chen, Baojun; Wang, Rui; Win, Aung Naing; Li, Jiana; Chai, Yourong
2018-02-01
Rapeseed (Brassica napus) is an important oilseed crop worldwide, and fatty acid (FA) compositions determine the nutritional and economic value of its seed oil. Fatty acid desaturases (FADs) play a pivotal role in regulating FA compositions, but to date, no comprehensive genome-wide analysis of FAD gene family in rapeseed and its parent species has been reported. In this study, using homology searches, 84, 45, and 44 FAD genes were identified in rapeseed, Brassica rapa, and Brassica oleracea genomes, respectively. These FAD genes were unevenly located in 17 chromosomes and 2 scaffolds of rapeseed, 9 chromosomes and 1 scaffold of B. rapa, and all the chromosomes of B. oleracea. Phylogenetic analysis showed that the soluble and membrane-bound FADs in the three Brassica species were divided into four and six subfamilies, respectively. Generally, the soluble FADs contained two conserved histidine boxes, while three highly conserved histidine boxes were harbored in membrane-bound FADs. Exon-intron structure, intron phase, and motif composition and position were highly conserved in each FAD subfamily. Putative subcellular locations of FAD proteins in three Brassica species were consistent with those of corresponding known FADs. In total, 25 of simple sequence repeat (SSR) loci were found in FAD genes of the three Brassica species. Transcripts of selected FAD genes in the three species were examined in various organs/tissues or stress treatments from NCBI expressed sequence tag (EST) database. This study provides a critical molecular basis for quality improvement of rapeseed oil and facilitates our understanding of key roles of FAD genes in plant growth and development and stress response.
DNA Barcoding in a Crop Genebank: Resolving the Capsicum annuum Species Complex.
USDA-ARS?s Scientific Manuscript database
Variability within eight cpDNA introns including trnS-trnfM, trnL-trnT, trnH-psbA, trnF-trnL, trnD-trnT, trnC-rpoB, rps16, and matK, and the nuclear waxy intron was examined in seven species of Capsicum (C. annuum, C. baccatum, C. chinense, C. frutescens, C. pubescens, C. chacoense, and C. rhomboide...
Fractal landscape analysis of DNA walks
NASA Technical Reports Server (NTRS)
Peng, C. K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Sciortino, F.; Simons, M.; Stanley, H. E.
1992-01-01
By mapping nucleotide sequences onto a "DNA walk", we uncovered remarkably long-range power law correlations [Nature 356 (1992) 168] that imply a new scale invariant property of DNA. We found such long-range correlations in intron-containing genes and in non-transcribed regulatory DNA sequences, but not in cDNA sequences or intron-less genes. In this paper, we present more explicit evidences to support our findings.
SMITten by the Speed of Splicing.
Johnson, Tracy L; Ares, Manuel
2016-04-07
Splicing occurs co-transcriptionally, but relative rates of splicing and transcription that might reveal mechanisms of their coordinated control have remained mysterious. Now, Carrillo Oesterreich et al. show that the fastest introns are gone nearly as soon as the 3' splice site is transcribed and that introns have distinct splicing kinetics with respect to polymerase progression along the gene. Copyright © 2016 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Willing, M.; Deschenes, S.
We have identified a G to A substitution in the 5{prime} donor splice site of intron 18 of one COL1A1 allele in two unrelated families with osteogenesis imperfecta (OI) type I. A third OI type I family has a G to A substitution at the identical position in intron 48 of one COL1A1 allele. Both mutations abolish normal splicing and lead to reduced steady-state levels of mRNA from the mutant COL1A1 allele. The intron 18 mutation leads to both exon 18 skipping in the mRNA and to utilization of a single alternative splice site near the 3{prime} end of exonmore » 18. The latter results in deletion of the last 8 nucleotides of exon 18 from the mRNA, a shift in the translational reading-frame, and the creation of a premature termination codon in exon 19. Of the potential alternative 5{prime} splice sites in exon 18 and intron 18, the one utilized has a surrounding nucleotide sequence which most closely resembles that of the natural splice site. Although a G to A mutation was detected at the identical position in intron 48 of one COL1A1 allele in another OI type I family, nine complex alternative splicing patterns were identified by sequence analysis of cDNA clones derived from fibroblast mRNA from this cell strain. All result in partial or complete skipping of exon 48, with in-frame deletions of portions of exons 47 and/or 49. The different patterns of RNA splicing were not explained by their sequence homology with naturally occuring 5{prime} splice sites, but rather by recombination between highly homologous exon sequences, suggesting that we may not have identified the major splicing alternative(s) in this cell strain. Both G to A mutations result in decreased production of type I collagen, the common biochemical correlate of OI type I.« less
Gene-Based Single Nucleotide Polymorphism Markers for Genetic and Association Mapping in Common Bean
2012-01-01
Background In common bean, expressed sequence tags (ESTs) are an underestimated source of gene-based markers such as insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). However, due to the nature of these conserved sequences, detection of markers is difficult and portrays low levels of polymorphism. Therefore, development of intron-spanning EST-SNP markers can be a valuable resource for genetic experiments such as genetic mapping and association studies. Results In this study, a total of 313 new gene-based markers were developed at target genes. Intronic variation was deeply explored in order to capture more polymorphism. Introns were putatively identified after comparing the common bean ESTs with the soybean genome, and the primers were designed over intron-flanking regions. The intronic regions were evaluated for parental polymorphisms using the single strand conformational polymorphism (SSCP) technique and Sequenom MassARRAY system. A total of 53 new marker loci were placed on an integrated molecular map in the DOR364 × G19833 recombinant inbred line (RIL) population. The new linkage map was used to build a consensus map, merging the linkage maps of the BAT93 × JALO EEP558 and DOR364 × BAT477 populations. A total of 1,060 markers were mapped, with a total map length of 2,041 cM across 11 linkage groups. As a second application of the generated resource, a diversity panel with 93 genotypes was evaluated with 173 SNP markers using the MassARRAY-platform and KASPar technology. These results were coupled with previous SSR evaluations and drought tolerance assays carried out on the same individuals. This agglomerative dataset was examined, in order to discover marker-trait associations, using general linear model (GLM) and mixed linear model (MLM). Some significant associations with yield components were identified, and were consistent with previous findings. Conclusions In short, this study illustrates the power of intron-based markers for linkage and association mapping in common bean. The utility of these markers is discussed in relation with the usefulness of microsatellites, the molecular markers by excellence in this crop. PMID:22734675
RNomics in Archaea reveals a further link between splicing of archaeal introns and rRNA processing
Tang, Thean Hock; Rozhdestvensky, Timofey S.; d’Orval, Béatrice Clouet; Bortolin, Marie-Line; Huber, Harald; Charpentier, Bruno; Branlant, Christiane; Bachellerie, Jean-Pierre; Brosius, Jürgen; Hüttenhofer, Alexander
2002-01-01
The bulge–helix–bulge (BHB) motif recognised by the archaeal splicing endonuclease is also found in the long processing stems of archaeal rRNA precursors in which it is cleaved to generate pre-16S and pre-23S rRNAs. We show that in two species, Archaeoglobus fulgidus and Sulfolobus solfataricus, representatives from the two major archaeal kingdoms Euryarchaeota and Crenarchaeota, respectively, the pre-rRNA spacers cleaved at the BHB motifs surrounding pre-16S and pre-23S rRNAs subsequently become ligated. In addition, we present evidence that this is accompanied by circularisation of ribosomal pre-16S and pre-23S rRNAs in both species. These data reveal a further link between intron splicing and pre-rRNA processing in Archaea, which might reflect a common evolutionary origin of the two processes. One spliced RNA species designated 16S-D RNA, resulting from religation at the BHB motif of 16S pre-rRNA, is a highly abundant and stable RNA which folds into a three-stem structure interrupted by two single-stranded regions as assessed by chemical probing. It spans a region of the pre-rRNA 5′ external transcribed spacer exhibiting a highly conserved folding pattern in Archaea. Surprisingly, 16S-D RNA contains structural motifs found in archaeal C/D box small RNAs and binds to the L7Ae protein, a core component of archaeal C/D box RNPs. This supports the notion that it might have an important but still unknown role in pre-rRNA biogenesis or might even target RNA molecules other than rRNA. PMID:11842103
Petersen, Sanne M; Dandanell, Mette; Rasmussen, Lene J; Gerdes, Anne-Marie; Krogh, Lotte N; Bernstein, Inge; Okkels, Henrik; Wikman, Friedrik; Nielsen, Finn C; Hansen, Thomas V O
2013-10-03
Germ-line mutations in the DNA mismatch repair genes MLH1, MSH2, and MSH6 predispose to the development of colorectal cancer (Lynch syndrome or hereditary nonpolyposis colorectal cancer). These mutations include disease-causing frame-shift, nonsense, and splicing mutations as well as large genomic rearrangements. However, a large number of mutations, including missense, silent, and intronic variants, are classified as variants of unknown clinical significance. Intronic MLH1, MSH2, or MSH6 variants were investigated using in silico prediction tools and mini-gene assay to asses the effect on splicing. We describe in silico and in vitro characterization of nine intronic MLH1, MSH2, or MSH6 mutations identified in Danish colorectal cancer patients, of which four mutations are novel. The analysis revealed aberrant splicing of five mutations (MLH1 c.588 + 5G > A, MLH1 c.677 + 3A > T, MLH1 c.1732-2A > T, MSH2 c.1276 + 1G > T, and MSH2 c.1662-2A > C), while four mutations had no effect on splicing compared to wild type (MLH1 c.117-34A > T, MLH1 c.1039-8 T > A, MSH2 c.2459-18delT, and MSH6 c.3439-16C > T). In conclusion, we classify five MLH1/MSH2 mutations as pathogenic, whereas four MLH1/MSH2/MSH6 mutations are classified as neutral. This study supports the notion that in silico prediction tools and mini-gene assays are important for the classification of intronic variants, and thereby crucial for the genetic counseling of patients and their family members.